NumPy
NumPy is an open-source library for the Python programming language that adds support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.
- GitHub: https://github.com/topics/numpy
- Wikipedia: https://en.wikipedia.org/wiki/NumPy
- Repo: https://github.com/numpy/numpy
- Created by: Travis Oliphant
- Related Topics: python, scipy
- Last updated: 2026-06-26 00:18:09 UTC
https://github.com/achuth-0908/hemoguard-anemia-predictor
A web app that incorporates a Gradient Boosting Classifier model to predict anemia from the given data.
css flask html matplotlib numpy pandas python scikit-learn
Last synced: 11 Apr 2026
https://github.com/nemeslaszlo/financial-crisis-effects-on-some-stocks
Focusing on bank stocks to see how they progressed throughout the 2008-09 financial crisis all the way to early 2016.
cufflinks jupyter-notebook matplotlib numpy pandas pandas-datareader plotly seaborn
Last synced: 03 May 2026
https://github.com/gauravsakure02/pubg_win_prediction
Predicts PUBG match wins by analyzing in-game features such as player kills, distance traveled, and weapons acquired, building a model that forecasts the likelihood of winning a match based on player performance and can help players win games.
matplotlib numpy pandas sklearn
Last synced: 29 Apr 2026
https://github.com/ladityagogoi/shadowguard
The ShadowGuard Browser Extension is a powerful tool designed to enhance user experience by identifying and highlighting potential dark patterns on websites. Our extension employs a combination of machine learning algorithms and natural language processing (NLP) models to detect and classify various deceptive design practices
css flask html javascript joblib numpy pandas python scikit-learn
Last synced: 11 Apr 2026
https://github.com/msadeqsirjani/synapse
seizure detector with neural networks
cnn edf matplotlib neural-network numpy pyedflib python seizure-detection sklearn tensorflow
Last synced: 11 Apr 2026
https://github.com/prakashjha1/stock-trend-prediction
Deep learning for predicting stock market prices and trends has become even more popular than before. I have used yahoo finance to collect the data and LSTM to build the stock trend model.
deep-learning keras lstm lstm-neural-networks machine-learning numpy pandas pandas-datareader prediction python rnn tensorflow
Last synced: 11 Apr 2026
https://github.com/pacatro/lse2text
Deep learning program that translates Spanish Sign Language (LSE) to text in real time.
ai cnn computer-vision deep-learning lse matplotlib numpy pandas python pytorch pytorch-lightning scikit-learn torchmetrics translation
Last synced: 11 Apr 2026
https://github.com/kanishk1420/renewabled-data-helper
An AI-powered assistant designed to help analysts, developers, and investors efficiently analyze and extract critical insights from renewable energy project documents. Built using Python, Google Gemini, FAISS, and Streamlit
gemini-api langchain mongodb numpy python streamlit
Last synced: 11 Apr 2026
https://github.com/kartikeya443/handgesture_detection_opencv
A computer-vision model, trained on datasets of over 300 inputs per gesture, that detects and identifies human hand gestures and tracks their coordinates so they can later be used to control devices such as the mouse and windows (control features are in development).
deep-learning mediapipe numpy opencv tensorflow
Last synced: 11 Apr 2026
https://github.com/bhattbhavesh91/tf-numpy-example
Faster NumPy using Tensorflow's tf.experimental.numpy module
Last synced: 01 May 2026
https://github.com/sehgalnaval/olympics-data-analysis
I will be building an Olympics Data Analysis Web Application using Streamlit. For development, I will be using Python and Pandas. For plotting, I will be using Seaborn and Plotly libraries. The project will also be deployed to Heroku.
heroku jupyter-notebook matplotlib numpy pandas pycharm python seaborn
Last synced: 11 Apr 2026
https://github.com/m-faizan-mahmood/detailed-exploratory-data-analysis-eda-marketing-recomendations.
This project focuses on cleaning, preprocessing, and analyzing data using Pandas and NumPy. Key steps include handling missing values, removing outliers, feature engineering, and exploratory data analysis (EDA). Visualizations with Matplotlib and Seaborn highlight trends in customer spending, campaign performance, and product sales.
big-data data-analysis data-processing data-science eda exploratory-data-analysis numpy pandas python
Last synced: 11 Apr 2026
https://github.com/aditya-ranjan1234/1stsem-el-dataset-analysis
Netflix Dataset Analysis
beginner matplotlib numpy pandas python
Last synced: 18 Apr 2026
https://github.com/mdaiyub/covid-19-detection
The emergence of Coronavirus Disease 2019 (COVID-19) in early December 2019 has caused immense damage to health and global well-being. Currently, there are approximately five million confirmed cases and the novel virus is still spreading rapidly all over the world. Chest X-ray images, along with emerging Artificial Intelligence (AI) methodologies, in particular Deep Learning (DL) algorithms, have recently become a worthy choice for early COVID-19 screening. This work proposes a DL-assisted automated method using X-ray images for the early diagnosis of COVID-19 infection. We evaluate the effectiveness of a pre-trained Convolutional Neural Network (CNN) model. The model has been validated on publicly available chest X-ray images, on which the CNN achieves an accuracy of 97.26%. This study will be useful for researchers designing more effective CNN-based models for early COVID-19 detection.
backend early-stopping global keras matplotlib modelcheckpoint numpy os pandas tensorflow
Last synced: 02 Apr 2026
https://github.com/SudipBishwakarma/Bulk-Remove-OneSignal-Users
Bulk remove OneSignal subscribed users blazingly fast with parallel processing.
numpy pandas python python-logger python-multiprocessing python-threading python3
Last synced: 31 Mar 2025
https://github.com/nomadsdev/frequency-insight
SoundFreqAnalyzer: A Python tool to record audio, analyze frequencies, and save results.
audio-analysis audio-recording audio-tool data-visualization fft frequency-analysis keyboard-python numpy pyaudio python scipy signal-processing sound-engineering
Last synced: 21 Feb 2026
https://github.com/fatma-moanes/machine-learning-labs
My implementation for the labs of the Machine Learning course that I studied in my university, Zewail City.
bootstrap data-analysis data-science deep-learning keras knn-classification linear-regression logistic-regression machine-learning machine-learning-algorithms matplotlib ml neural-networks numpy pandas pca preprocessing python seaborn svm-classifier
Last synced: 12 Apr 2026
https://github.com/sarincr/introduction-to-pyqt5
PyQt5 is a comprehensive set of Python bindings for Qt v5. Qt is a set of cross-platform C++ libraries that implement high-level APIs for accessing many aspects of modern desktop and mobile systems.
app appdevelopment data-science kivy kivymd machine-learning numpy opengl pygui pyqt pyqt5 pyqt5-desktop-application pyqt6 pyside2 scipy tensorflow tkinter tkinter-graphic-interface tkinter-gui tkinter-python
Last synced: 13 Apr 2026
https://github.com/guilhermebit/movie-suggestions-using-k-nearest-neighbors
This project implements a movie recommendation system using the K-Nearest Neighbors (KNN) machine learning algorithm.
algorithms data-science euclidean-distance knn machine-learning matplotlib numpy pandas python
Last synced: 12 Apr 2026
https://github.com/srujayreddy/selling-laptops
Predicting whether users will click on a promotional email for laptops based on historical user data and browsing logs.
customer-behavior-analysis feature-engineering logistic-regression machine-learning marketing-analytics numpy pandas predictive-modeling scikit-learn
Last synced: 12 Apr 2026
https://github.com/nirmalvatsyayan/data-analyst-nanodegree
Udacity data analyst nanodegree project submissions and learning
data-analysis numpy pandas python statistics udacity-data-analyst-nanodegree
Last synced: 12 Apr 2026
https://github.com/rkarahul/vehicle-number-plate-detection-and-ocr-reading-using-opencv-and-paddleocr
Completed vehicle number plate detection and OCR reading using OpenCV and PaddleOCR
numpy opencv paddleocr paddlepaddle pandas python
Last synced: 12 Apr 2026
https://github.com/balaji24092001/python
All python related content: notes, problems, progress archive.
matplotlib numpy opencv pandas python3 scipy-stats seaborn sklearn sqllite
Last synced: 12 Apr 2026
https://github.com/singhmnprt01/Custom-Deep-Neural-Network-Package
A custom deep neural network package that gives the liberty to design your deep neural network using only NumPy
customized-deep-neural-network deep-learning-framework maths-behind-neural-network numpy
Last synced: 09 Jul 2025
https://github.com/harshita2234/breast-cancer-prediction
A machine learning project to predict breast cancer using logistic regression. This project includes data preprocessing, feature scaling, model training, and evaluation, based on a guided project from Coursera.
breast-cancer-prediction jupyter-notebook logistic-regression machine-learning matplotlib numpy pandas scikit-learn
Last synced: 12 Apr 2026
https://github.com/paulo-santos-ds/dados-telefonia
This project aims to develop a model that can analyze customer behavior and recommend one of the newer plans of a fictional telephone company.
matplotlib numpy pandas python3 sklearn
Last synced: 12 Apr 2026
https://github.com/aymen016/data-visualization
A collection of data analysis projects using Python, covering various datasets and domains. Includes EDA, preprocessing, data visualization (Matplotlib/Seaborn), and Power BI dashboards.
dashboard eda jupyter-notebook matplotlib-pyplot matplotlib-python numpy pandas pandas-dataframe powerbi python seaborn seaborn-python visualization
Last synced: 15 Jun 2026
https://github.com/403errors/offset-to-hydrostatic-curve
Group project for making hydrostatic curves from the given ship offset table using python libraries
data-visualization matplotlib-pyplot numpy python3 sklearn
Last synced: 10 May 2026
https://github.com/ashishtukaral/sign-language-recognition-using-ai
A real-time Sign Language Recognition system using AI to translate hand signs into text and speech, enhancing communication for individuals with hearing impairments. This web application leverages neural networks and computer vision to provide accurate and accessible translation.
ann classification cnn computer-vision deep-learning flask keras machine-learning matplotlib neural-network numpy oops opencv pandas python3 tensorflow
Last synced: 04 Feb 2026
https://github.com/pandeyankit123/books-recommender-system-using-machine-learning
This is a Streamlit web application that recommends similar books based on a user's interests.
jupyter-notebook machine-learning numpy pandas pickle python streamlit
Last synced: 12 Apr 2026
https://github.com/nemeslaszlo/lending-club-analysis
Lending Club analysis, with a model that can predict whether or not a borrower will pay back their loan.
binary-classification classification data-visualization keras matplotlib numpy pandas seaborn tensorflow
Last synced: 12 Apr 2026
https://github.com/codehass/facial-emotional-detection
Implement a complete AI pipeline in Computer Vision, from face detection to emotion classification using CNNs (TensorFlow/Keras) and Haar Cascades (OpenCV), and integrate this model into a FastAPI API connected to a PostgreSQL database.
cnn-model fastapi keras mathplotlib numpy opencv postgresql pydantic pytest sqlalchemy tensorflow
Last synced: 12 Apr 2026
https://github.com/nikita-data/unit_economics_projects
unit economics & cohort analysis projects
cac churn-rate conversion create-function data-analysis data-visualization eda hypothesis-testing ltv math matplotlib numpy python retention-rate roi scipy seaborn segmentation statistics unit-economics
Last synced: 06 Jan 2026
https://github.com/nikita-data/eda_projects
Exploratory data analysis projects
cac data-analysis data-visualization eda folium-maps hypothesis-testing ltv math matplotlib numpy plotly python regular roi scipy seaborn segmentation statistics unit-economics
Last synced: 06 Jan 2026
https://github.com/moinak-majumdar/fake-currency-detection
Indian fake currency detecting mobile application.
cnn deep-learning flutter getx keras machine-learning mobile-app neural-network numpy random-forest scikit-learn support-vector-machines tensorflow
Last synced: 12 Apr 2026
https://github.com/serratotech/python3-for-finance
Investment fundamentals & data analytics exercises
anaconda conda data-analytics finance ipynb jupyter-notebook numpy pandas python python3 python35
Last synced: 12 Apr 2026
https://github.com/lasithaamarasinghe/movie-recommender-system
This ML model recommends movies that may align with the user's preferences based on TF-IDF matrix.
jupyter-notebook machine-learning movie-recommendation movielens-dataset numpy pandas python regex scikit-learn tf-idf-vectorizer
Last synced: 12 Apr 2026
https://github.com/en10/bostonhousing
Boston Housing Dataset Example
boston-housing-dataset boston-housing-price-prediction matplotlib numpy pandas pandas-dataframe python python3 sklearn
Last synced: 12 Apr 2026
https://github.com/aahnik/gdsc-ml-ds-bootcamp-2023
This repo contains files given by my seniors as well as assignments and final project done by me during the bootcamp.
data-science machine-learning ml numpy pandas python3 scikit-learn
Last synced: 28 Oct 2025
https://github.com/hamdaniqhmqd/kelompok6-sistem-cerdas-bbri
Repository group6-system-smart-bbri is a group assignment project that uses Streamlit, scikit-learn, and related technologies to build a BBRI stock price prediction application based on day, week, and month input.
data-visualization numpy pandas python sklearn streamlit
Last synced: 03 Apr 2025
https://github.com/shreeyas-48/creditcardfrauddetection
Project for detecting credit card frauds using neural networks and logistic regression
autoencoder keras logistic-regression matplotlib neural-networks numpy pandas python scikit-learn
Last synced: 12 Apr 2026
https://github.com/parawanderer/quantumcircuitdesigner
Interactive quantum circuit simulator
diagrams-as-code mathematics numpy quantum-computing tkinter-gui
Last synced: 29 May 2026
https://github.com/mgobeaalcoba/linear_algebra_for_machine_learning
Explore fundamental linear algebra concepts essential for machine learning in this repository, with code examples and explanations. Get a solid foundation for ML!
machine-learning matplotlib numpy pandas python3 scikit-learn scipy seaborn
Last synced: 12 Apr 2026
https://github.com/rayyan9477/machine-learning-driven-backorder-prediction-system
Experience a state-of-the-art Django web application designed to predict product backorders with exceptional accuracy. This platform leverages advanced machine learning techniques, incorporating pre-trained Random Forest Classifier, Decision Tree, and LGBM models.
matplotlib notebook numpy pandas python scikit-learn
Last synced: 12 Apr 2026
https://github.com/csoren66/credit-card-fraud-detection
Credit card fraud detection through logistic regression.
logistic-regression machine-learning numpy pandas python sklearn
Last synced: 03 Mar 2025
https://github.com/hansalemaos/cythonnestednumpy
performs sorting and grouping operations on multidimensional NumPy arrays using Cython and hash-based algorithms.
Last synced: 07 Jan 2026
https://github.com/antonio-f/building_a_recurrent_neural_network_step_by_step
Building a Recurrent Neural Network Step by Step
coursera deep-learning long-short-term-memory lstm numpy recurrent-neural-networks rnn step-by-step tutorial
Last synced: 30 Mar 2025
https://github.com/anish202020/real-time-emotion-face-detection
The primary purpose of this website is to detect emotions in real-time using facial expressions captured via a camera.
fer keras numpy opencv python streamlit tensorflow
Last synced: 12 Apr 2026
https://github.com/yas-sim/python-dpcpp-extension-sample-code
Python extension sample code using Intel oneAPI DPC++. The extension does a simple image processing using DPC++ kernel.
box-filter convolution-filter dpcpp gpu-acceleration gpu-programming heterogeneous heterogeneous-parallel-programming image-processing numpy numpy-capi oneapi opencl python python-extension sample-code sycl
Last synced: 13 Apr 2026
https://github.com/mchenryspagg/analysis-of-car-repair-shop-operations
This project presents a comprehensive data analysis of a car repair shop's operations, utilizing an extended dataset of sales receipts.
dataanalysisusingsql datavisualization matplotlib-pyplot mysql mysql-database mysqlconnector numpy pandas python salesanalysis seaborn sql
Last synced: 07 Jan 2026
https://github.com/mezgoodle/bounder
bound-points cluster-analysis clustering numpy pyqt5 python
Last synced: 12 Jun 2025
https://github.com/danhenriquex/data-science-project
The main goal of this project was to apply the concepts of data visualization and analysis.
data-analysis data-science numpy pandas python
Last synced: 12 Apr 2026
https://github.com/hamdaniqhmqd/sistem-cerdas
I use this repository for my storage, especially in the intelligent systems course in semester 3
matplotlib numpy pandas python streamlit
Last synced: 20 Apr 2026
https://github.com/esvs2202/credit-card-fraud-detection-system
This fraud detection system is powered by a Machine Learning model, which accurately identifies whether an initiated transaction is fraudulent.
azurewebapp flask gunicorn-web-server html-css machine-learning matplotlib-pyplot numpy pandas powerbi python3 randomforestclassifier seaborn
Last synced: 12 Apr 2026
https://github.com/ngirimana/ml_knn
Welcome to the KNN Project! This will be a simple project very similar to the lecture, except you'll be given another data set. Go ahead and just follow the directions below.
artificial-intelligence knn-algorithm knn-classification knn-classifier machine-learning matplotlib numpy pandas python3 searbon
Last synced: 12 Apr 2026
https://github.com/shahinuralambhuiyan/learning-ml
Start learning Machine Learning basics from the CodeBasics channel
anaconda jupyter-notebook kaggle-dataset matplotlib numpy pandas pip python
Last synced: 12 Apr 2026
https://github.com/gracysapra/pandas-numpy-data-visualisation
This repository contains essential Python scripts and notebooks for data analysis and visualization. It includes: pandas: Data manipulation and analysis, including operations on series and dataframes. NumPy: Efficient numerical computations and array processing. Data Visualization: Creating insightful visualizations using Matplotlib and Seaborn.
data-science data-visualization matplotlib numpy numpy-arrays pandas pandas-dataframe pandas-series seaborn
Last synced: 07 May 2026
https://github.com/Vice777/SAFE-HEART
Classify the chances of having a Heart Attack based on your Heart's Condition.
kaggle machine-learning matplotlib mlfromscratch mlp-classifier multilayer-perceptron-network numpy pandas pickle streamlit streamlit-webapp
Last synced: 11 Mar 2025
https://github.com/dantasl/probabilidade-analise
Analysis of student performance in the course Mathematical Problem Solving for Information Technology (Resolução de Problemas Matemáticos Para Tecnologia da Informação, RPMTI) in the 2015.2, 2016.1, and 2016.2 semesters
data-science numpy pandas probability
Last synced: 13 Apr 2026
https://github.com/aryansingla45/image-classification-project
A Deep Learning Model which classifies Cats and Dogs.
artificial-neural-networks convolutional-neural-networks deep-learning keras machine-learning numpy pandas python tensorflow
Last synced: 13 Apr 2026
https://github.com/anandarauf/wifu-detection
Machine Learning Wifu Detection from Dataset Anime and Manga
anime data-science dicoding dicoding-submission machine-learning machinelearning manga numpy object-detection pandas pandas-python python3 sklearn tensorflow waifu waifu-detection
Last synced: 13 Apr 2026
https://github.com/epsoft/text-generation-stopword
text generation stopword
datasets generation nltk nltk-library nltk-python numpy re stopword text wordnet wordnet-library
Last synced: 15 May 2026
https://github.com/epsoft/nlp
NLP
farsi gru nlp numpy pandas persian tensorflow training word-indexer
Last synced: 15 Apr 2026
https://github.com/sebsikora/curve_fitting
A short guide to using Python tools to perform arbitrary curve-fitting of research data via constrained minimisation.
curve-fitting minimisation numpy optimisation python scipy
Last synced: 13 Apr 2026
https://github.com/bobzheng00/medscan-mentor
MediScan Mentor is an innovative application designed to assist medical students in interpreting CT scans.
artificial-intelligence brain-diseases colab-notebook ct-scans deep-learning education machine-learning medical-ai-project medical-imaging mongodb numpy object-detection openai pandas pubmed python pytorch rag streamlit yolov5
Last synced: 17 Jul 2025
https://github.com/aymen016/emotion-recognition-using-face-detection
A deep learning-based project that detects human faces and predicts their emotions in real-time using OpenCV and a trained neural network model.
gpu-acceleration haar-cascade-classifier keras-tensorflow matplotlib numpy opencv python
Last synced: 13 Apr 2026
https://github.com/gregoritsch3/ml_eda_classification_loanapprovalprediction
An EDA and Machine Learning Classification exercise on the Loan Approval dataset demonstrating EDA, feature engineering, StratifiedKFold, and the use of TensorFlow NN, SVC, LinearSVC, XGBoost, Naive Bayes, Bagging, Random Forest, and Decision Tree algorithms. The models are optimized using hyperparameter tuning through GridSearchCV.
eda feature-engineering machine-learning matplotlib numpy pandas scikit-learn scipy seaborn tensorflow
Last synced: 13 Apr 2026
https://github.com/harmanveer-2546/supply-chain
Supply chain analytics is a valuable part of data-driven decision-making in various industries such as manufacturing, retail, healthcare, and logistics. It is the process of collecting, analyzing and interpreting data related to the movement of products and services from suppliers to customers.
customer-segmentation-analysis data data-analysis data-cleaning data-insights ggplot2 numpy pandas performance-evaluation predictive-analytics-for-business python risk-assessment sales-analysis statistical-analysis supply-chain tidyverse trend-analysis
Last synced: 10 Apr 2026
https://github.com/aahad91/idv
Interactive Data Visualization - Master Course
matplotlib numpy pandas plotly python3
Last synced: 08 May 2026
https://github.com/otuemre/simplenet
A simple neural network framework built from scratch using Python and NumPy. Designed as a learning tool, SimpleNET includes essential layers, activation functions, and loss functions, enabling users to build and train basic neural networks from the ground up.
backpropagation deep-learning from-scratch machine-learning neural-network numpy python
Last synced: 13 Apr 2026
https://github.com/otivedani/hedge_hog
Histogram of Oriented Gradients, full numpy
feature-extraction histogram-of-oriented-gradients hog-features numpy
Last synced: 13 Apr 2026
https://github.com/tasninanika/k-nearest_neighbors
K-Nearest Neighbors (KNN) is a simple and powerful machine learning algorithm used for classification and regression.
knn matplotlib numpy pandas python3 seaborn
Last synced: 13 Apr 2026
https://github.com/pvlbzn/kot-or-not
Logistic regression (shallow neural network) which predicts whether input image is a cat or not.
classification-algorithm image-classification logistic-regression machine-learning neural-network numpy
Last synced: 01 Mar 2025
https://github.com/petitgrizzlies/metaheuristiques
Practical assignments on metaheuristics. Unige 2016-2017
cffi cpp golang heuristic-algorithm matplotlib numpy python35
Last synced: 13 Apr 2026
https://github.com/aminkhavari78/geoai-challenge-estimating-soil-parameters-from-hyperspectral-images-
Can you predict soil parameters from hyperspectral earth observation data?
dee keras matplotlib neural-network numpy pandas seaborn
Last synced: 13 Apr 2026
https://github.com/shivendrra/Seeker
This project will help to write new scripts for YouTube videos from scratch using fine-tuned Llama-7B
ai artificial-intelligence finetuning google-custom-search-api google-custom-search-engine llama2 llm numpy pandas python pytorch react-native reactjs transformers youtube-api
Last synced: 30 Dec 2025
https://github.com/tansudasli/analytics-sandbox
From a statistical approach to machine learning
feature-engineering machine-learning matplotlib numpy opencv pandas probability regex scikit-learn seaborn statistics
Last synced: 13 Apr 2026
https://github.com/mtgsoftworks/blink-detecion-player
Software that pauses, plays, and rewinds videos watched on YouTube when the viewer's eyes are closed and opened.
dlib numpy opencv-python python3 scipy selenium-webdriver
Last synced: 13 Apr 2026
https://github.com/muhammad-fiaz/gpt
A simple implementation based on the "Attention is All You Need" paper, using GPT-2 for text generation.
attention-is-all-you-need gpt gpt-2 gpt-3 gpt-implementation gpt-using-pytorch gpt2 numpy open-source paper-implementations python pytorch pytorch-implementation
Last synced: 13 Apr 2026
https://github.com/iv4n-ga6l/Realtime-Car-Traffic-Tracking
Realtime car traffic tracking using yolov8 model
cartracking numpy opencv python ultralytics yolov8 yolov8n
Last synced: 28 Apr 2025
https://github.com/iv4n-ga6l/OpenPose-Prediction
Person's body parts or joint position detection using openCV
detection numpy opencv prediction prediction-model python
Last synced: 28 Apr 2025
https://github.com/ngirimana/decision-trees-and-random-forest-
For this project we will be exploring publicly available data from [LendingClub.com](www.lendingclub.com). Lending Club connects people who need money (borrowers) with people who have money (investors). Hopefully, as an investor you would want to invest in people who showed a profile of having a high probability of paying you back. We will try to create a model that will help predict this. Lending club had a [very interesting year in 2016](https://en.wikipedia.org/wiki/Lending_Club#2016), so let's check out some of their data and keep the context in mind. This data is from before they even went public. We will use lending data from 2007-2010 and be trying to classify and predict whether or not the borrower paid back their loan in full. You can download the data from [here](https://www.lendingclub.com/info/download-data.action) or just use the csv already provided. It's recommended you use the csv provided as it has been cleaned of NA values. Here are what the columns represent:
* credit.policy: 1 if the customer meets the credit underwriting criteria of LendingClub.com, and 0 otherwise.
* purpose: The purpose of the loan (takes values "credit_card", "debt_consolidation", "educational", "major_purchase", "small_business", and "all_other").
* int.rate: The interest rate of the loan, as a proportion (a rate of 11% would be stored as 0.11). Borrowers judged by LendingClub.com to be more risky are assigned higher interest rates.
* installment: The monthly installments owed by the borrower if the loan is funded.
* log.annual.inc: The natural log of the self-reported annual income of the borrower.
* dti: The debt-to-income ratio of the borrower (amount of debt divided by annual income).
* fico: The FICO credit score of the borrower.
* days.with.cr.line: The number of days the borrower has had a credit line.
* revol.bal: The borrower's revolving balance (amount unpaid at the end of the credit card billing cycle).
* revol.util: The borrower's revolving line utilization rate (the amount of the credit line used relative to total credit available).
* inq.last.6mths: The borrower's number of inquiries by creditors in the last 6 months.
* delinq.2yrs: The number of times the borrower had been 30+ days past due on a payment in the past 2 years.
* pub.rec: The borrower's number of derogatory public records (bankruptcy filings, tax liens, or judgments).
artificial-intelligence decision-tree-classifier machine-learning matplotlib numpy pandas python3 random-forest-classifier seaborn
Last synced: 11 Apr 2026
https://github.com/leabrodyheine/water-pump-status-prediction
This project implements machine learning models to predict the status of water pumps in Tanzania using data from DrivenData's competition. The project includes preprocessing steps, model evaluation using cross-validation, and hyperparameter optimization with Optuna.
argparse cross-validation gradient-boosting-classifier logistic-regression machine-learning multilayer-perceptron numpy optuna pandas random-forest-classifier scikit-learn
Last synced: 11 Apr 2026
https://github.com/shibam120302/plant_deseases_prediction
Plant disease forecasting is a management system used to predict the occurrence or change in severity of plant diseases. At the field scale, these systems are used by growers to make economic decisions about disease treatments for control.
Last synced: 13 Apr 2026
https://github.com/juanpablo70/pgad-nba-prediction-project
Programming for Data Analysis from Open Data Sources
data-science jupyter-notebook matplotlib mysql numpy pandas pymysql python scipy seaburn sqlalchemy
Last synced: 13 Apr 2026
https://github.com/anvesham/machinelearning_models_using_matlab
Stepwise Multiple Linear Regression (With Interactions) and Random Forest Regression on predicting the Productivity of the Garment Factory Workers
bagging-ensemble bayesian-optimization decision-trees jupyter-notebook machine-learning matlab multiple-linear-regression-model numpy python random-forest-regression
Last synced: 11 Apr 2026
https://github.com/uea-geral/rna-perceptron-exercise
🤖 Artificial Neural Networks (RNA) course: training a Perceptron neuron.
jupyter-notebook neural-network numpy perceptron python scikit-learn
Last synced: 13 Apr 2026
https://github.com/abhinav330/data-science-projects
The Data Science Projects folder contains various projects related to data analysis, machine learning, and exploratory data analysis (EDA). Each project focuses on a specific dataset and utilizes different techniques and algorithms to extract insights and make predictions.
decision-trees exploratory-data-analysis k-means-clustering k-nearest-neighbours kaggle linear-regression logistic-regression machine-learning machine-learning-algorithms matplotlib numpy pandas plotly python random-forest-classifier support-vector-machines
Last synced: 30 May 2026
https://github.com/dipeshgoyal013/ipl_win_probability
A project that helps you check the win probability of the batting team in the second innings
machine-learning matplotlib numpy pandas python sklearn
Last synced: 05 Feb 2026
https://github.com/aymen016/image-processing-algorithms
Explore Python-based image processing from scratch with core algorithms like interpolation, convolution, and filtering!
matplotlib numpy pillow python
Last synced: 13 Apr 2026
https://github.com/itancio/realestateprice
This is an application of deep learning that trains on a real estate dataset from King County, USA, and predicts prices.
datat-visualization machine-learning numpy pandas python seaborn
Last synced: 09 Apr 2026