An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/patilni3/python_data_analysis

Important Tools and its Functions

excel matplotlib numpy pandas seaborn

Last synced: 28 Apr 2026

https://github.com/datarohit/car-evaluation--classification

This is a Kaggle Dataset where we classify the cars using their various features. Here I used plotly to visualize the Accuracy Scores. Also I used CrossValScore to get More accurate Accuracy Score.

classification-algorithm cross-validation-score machine-learning matplotlib-pyplot numpy pandas sklearn

Last synced: 28 Apr 2026

https://github.com/felixsoderstrom/perceptron-for-ocr

Training an AI model to recognize handwritten digits

ai backpropagation machine-learning neural-network numpy pytorch

Last synced: 23 Apr 2025

https://github.com/mohamed3nan/udacity

Udacity Data Analysis Nanodegree Program

data-analysis data-visualization numpy pandas python

Last synced: 10 Apr 2026

https://github.com/pranav-bot/ml-dl-models-from-scratch

This contains the most commonly used machine learning and deep learning models written from scratch without the use if any libraries except Numpy which is used for calculations

deep-learning descion-tree from-scratch linear-regression logistic-regression machine-learning naive-bayes neural-networks numpy perceptron pytorch qlearning random-forest regression reinforcement-learning support-vector-machines svm

Last synced: 15 Apr 2026

https://github.com/nirmalyabag20/wine-quality-prediction-machine-learning

This project analyzes the chemical properties of wines to identify key factors influencing quality. By leveraging machine learning techniques, i aim to develop predictive models that accurately classify wine quality, providing valuable insights for producers and enthusiasts alike.

k-neighbors-classifier logistic-regression machine-learning matplotlib numpy pandas python random-forest seaborn svc

Last synced: 09 Apr 2026

https://github.com/zainashrafofficial/ml_project_2024

A machine learning semester project of university, implementing classification and regression models to predict house prices & number of rooms using a dataset with over 160,000 entries.

ai ann artificial-neural-networks classification decision-trees jupyter-notebook linear-regression logistic-regression matplotlib ml numpy pandas python random-forest regression-models svm tensorflow

Last synced: 13 Apr 2026

https://github.com/nazli-d/binary-classification-using-cnn

This project utilizes a CNN model to classify cat and dog images through training and testing processes. The model is created using the Keras library on the TensorFlow backend.

binary-classification cat-and-dog-classifier cnn convolutional-neural-networks flatten keras max-pooling numpy opencv python relu-activation sigmoid-function tensorflow

Last synced: 13 Apr 2026

https://github.com/tasninanika/k-means-clustering

An interactive and insightful customer segmentation project using K-Means Clustering.

matplotlib numpy pandas plotly python3 scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/msthamizh/bizcardx-extracting-business-card-data-with-ocr

Developing a Streamlit application that enables users to upload business card images and extract key details using easyOCR. The app allows users to manage the extracted data (view, update, delete) and securely stores it in a MySQL database. It also highlights extracted text directly on the business card image using Pillow.

easyocr mysql numpy pandas pillow python regular-expression streamlit

Last synced: 10 Apr 2026

https://github.com/lakhidaksh-01/carinsightanalyzer

Interactive Car Ratings & Insights

cars matplotlib numpy pandas python seaborn streamlit

Last synced: 10 Apr 2026

https://github.com/manjillama/machine-learning-101

Introduction to machine learning, conda environment, Jupyter Notebook, Pandas, NumPy and Matplotlib.

conda-environment jupyter-notebook machine-learning matplotlib numpy pandas python

Last synced: 13 Apr 2026

https://github.com/prodev717/neurogen

A simple Python library to build and train deep neural networks (SLP and MLP). It includes functions for forward propagation, backpropagation, loss calculation, and model saving/loading.

ai backpropagation deep-learning machine-learning mlp neural-networks numpy python slp

Last synced: 30 Apr 2026

https://github.com/li-plus/tinynn

An educational PyTorch-like neural network framework based on NumPy

autograd deep-learning machine-learning neural-network numpy python

Last synced: 30 Apr 2026

https://github.com/nemeslaszlo/dog_breed_identification

Dog breed identification model to identify and label 120 different dog breeds.

matplotlib numpy pandas tensorflow tensorflow-hub

Last synced: 13 Apr 2026

https://github.com/suraj-sedai/lstm-deep-learning

A step-by-step guide to mastering sequence prediction using TensorFlow and LSTM. This repository covers everything from basic linear predictions to advanced multi-step forecasting, many-to-many LSTM models, and real-world applications like stock price forecasting and sales prediction.

lstm-neural-networks lstm-sentiment-analysis numpy pandas python sklearn tensorflow

Last synced: 11 Apr 2026

https://github.com/hklyousfi/cyberxnoteloom

Hybrid piano synthesizer with ADSR envelopes, resonance modeling, and MIDI rendering​

adsr-envelope digital-audio hybrid-synth midi-processor numpy piano-synthesizer python

Last synced: 30 Apr 2026

https://github.com/mrhashemi/optimizers

Python implemntation of Conjugate Gradient method and Adam, and Broyden–Fletcher–Goldfarb–Shanno (BFGS) optimizers from scratch.

adam-optimizer bfgs conjugate-gradient numpy optimizer scipy

Last synced: 30 Apr 2026

https://github.com/gitfrandu4/40982-p1

Repositorio de la Práctica 1 en Visión por Computador, explorando el uso de OpenCV para crear imágenes artísticas, manipular píxeles y capturar video. Incluye ejercicios como diseño de un tablero de ajedrez, estilo Mondrian, y un efecto “Pop Art” con la cámara web.

computer-vision image-processing mondrian-style numpy opencv pop-art python visual-art webcam-capture

Last synced: 09 Apr 2026

https://github.com/avijit-jana/used-car-price-prediction

A Streamlit-based application utilizing machine learning to predict used car prices based on features like make, model, year, fuel type, and transmission, enhancing customer experience and streamlining the pricing process.

carpriceprediction data-science jupyter-notebooks machine-learning numpy pandas pickle pricing-data-science python3 streamlit streamlit-webapp webapplication

Last synced: 11 Apr 2026

https://github.com/octcarp-courses/sustech_cs303-ai_f23_works

(49.8/50) Course code of CS303 2023 Fall: Artificial Intelligence, SUSTech. Taught by Prof. Ke TANG.

numpy pytorch sustech

Last synced: 08 May 2026

https://github.com/epsoft/ner

NER

ner numpy pandas

Last synced: 04 May 2026

https://github.com/snehilsanyal/ee524

Course webpage for IIT Guwahati EE524 Machine Learning Lab (Jul-Nov 2020) Session

course-webpage machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 01 May 2026

https://github.com/epsoft/tensorflow-basics

Tensorflow Basics

basics numpy tensorflow

Last synced: 01 May 2026

https://github.com/athari22/house_sales_in_king_count_usa

The idea of the project is to do a Data analysis in a Real Estate Investment Trust. The Trust would like to start investing in Residential real estate.

analysis data data-science data-visualization ibm ibm-watson linearregression machine-learning matplotlib numpy pandas sklearn-library

Last synced: 01 May 2026

https://github.com/rahulvictor12/the-movie-database-data-scrapper

A Python web scraper that collects movie data from The Movie Database (TMDB). It uses `requests`, `BeautifulSoup`, and `pandas` to extract titles, ratings, genres, and cast details from multiple pages. The data is structured into DataFrames and saved as a CSV, perfect for analysis or integration into projects.

beautifulsoup colab-notebook dataframes numpy pandas python requests testing webscraping

Last synced: 11 Apr 2026

https://github.com/tasninanika/mammographic-masses-analysis-dt

This project uses a Decision Tree Classifier to predict whether a detected mammographic mass is benign (0) or malignant (1) based on input features.

decision-tree-classifier numpy pandas pyhton3 scikit-learn

Last synced: 11 Apr 2026

https://github.com/khaymanii/titanic_survival_prediction_-model

This Model was built using Python and Logistic Regression algorithm

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 02 May 2026

https://github.com/rakibhhridoy/machinelearning-featureselection

Before training a model or feed a model, first priority is on data,not in model. The more data is preprocessed and engineered the more model will learn. Feature selectio one of the methods processing data before feeding the model. Various feature selection techniques is shown here.

extratreesclassifier feature-selection gridsearchcv lasso-regression logistic-regression machine-learning numpy pandas pca rfe rfecv scikit-learn selectkbest

Last synced: 02 May 2026

https://github.com/oguzhansarigol/expected-goals-xg-data-analysis

We will analyze in our code the expected goal locations of football players based on which areas of the field, at what times, with which feet, using which parts of their bodies, and from which angles and distances they are most likely to score.

data-analysis-python expected-goals numpy pandas-python python

Last synced: 02 May 2026

https://github.com/iv4n-ga6l/realtime-car-traffic-tracking

Realtime car traffic tracking using yolov8 model

cartracking numpy opencv python ultralytics yolov8 yolov8n

Last synced: 02 May 2026

https://github.com/italoseara/cet083

Medidas de Posição (ou separatrizes) - CET083

data-science matplotlib numpy pandas portuguese python statistics university

Last synced: 03 May 2026

https://github.com/harmanveer-2546/dust-detection-on-solar-panel-using

The accumulation of dust, snow, bird drops etc. on the surface of solar panels reduces the efficiency of the solar modules and hence the amount of produced energy. Monitoring and cleaning solar panels is a crucial task, hence developing an optimal procedure to monitor and clean these panels is very important in order to increase modules efficiency.

classification dirty inceptionv3-model keras numpy pandas plotly python regularizer sequential solar-panels tensorflow visualization

Last synced: 13 Apr 2026

https://github.com/aymen016/emotion-recognition-using-face-detection

A deep learning-based project that detects human faces and predicts their emotions in real-time using OpenCV and a trained neural network model.

gpu-acceleration haar-cascade-classifier keras-tensorflow matplotlib numpy opencv python

Last synced: 13 Apr 2026

https://github.com/cs-joy/numpyv1.25.2

@numpy

numpy

Last synced: 03 May 2026

https://github.com/realtristan/micronet

Neural network library from scratch

neural-networks numpy python pytorch

Last synced: 03 May 2026

https://github.com/coqui123/tradegpt

TradeGPT is a full-stack cryptocurrency trading application that combines a modern Fresh (Deno) frontend with a Python (FASTAPI) backend for Coinbase integration and Azure AI Services for intelligent trading analysis. 💹

analytics automation cryptocurrency data deno fastapi fresh numpy python trading-algorithms trading-strategies tradingbot typescript

Last synced: 11 Apr 2026

https://github.com/vidhi1290/hr_employee_prediction

"Welcome to the HR Employee Promotion Prediction project! This repository contains the code and resources for a machine learning project that focuses on predicting employee promotions. By analyzing various employee attributes, this project aims to provide valuable insights for HR decision-making and talent recognition within organizations.

data-exploration data-science data-visualization docker hr-employee-prediction hyperparameter-tuning machine-learning matplot model-building numpy pandas scikit-learn seaborn streamlit streamlit-webapp

Last synced: 13 Apr 2026

https://github.com/screamman-ssh/facemousecontrol

This project is about using your nose tip for the mouse's cursor and open your mouth for the left click.

autopy computer-vision face-detection mediapipe-facemesh mouse numpy opencv python python-3-7 ui

Last synced: 13 Apr 2026

https://github.com/dwarkadhish2005/house-price-predictor

Applied neural network for machine learning and done many analysis of data

keras matplotlib numpy pandas python seaborn sklearn

Last synced: 09 Apr 2026

https://github.com/simipk12/eda-healthcare-project

Healthcare Insurance Cost Analysis This project analyzes the Healthcare Insurance dataset to explore factors that influence medical charges. The goal is to identify key drivers of insurance costs, detect outliers, and provide actionable insights for individuals, insurance companies, and policymakers.

eda matplotlib numpy pandas python seaborn

Last synced: 04 May 2026

https://github.com/shanmukhsrisaivedullapalli/smsspamclassification

SMSSpamClassification is a machine learning project aimed at accurately classifying SMS messages as either spam or ham (non-spam). It employs natural language processing techniques to extract relevant features from the text data and utilizes various classification algorithms to build a robust spam detection model.

jupyter-notebook numpy pandas pickle python3 sklearn spam-classification spam-detection

Last synced: 11 Apr 2026

https://github.com/petitgrizzlies/metaheuristiques

Travaux pratiques de meta heuristique. Unige 2016-2017

cffi cpp golang heuristic-algorithm matplotlib numpy python35

Last synced: 13 Apr 2026

https://github.com/mgobeaalcoba/matplotlib_y_seaborn

Aquí dejaré trabajos de visualización realizados con ambas librerías de Python.

data-analysis data-science data-visualization dataset matplotlib numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/kaikymoura/backcourt-api

Repository for the Basketball Advanced Stats API

fastapi nba-api nba-stats numpy pandas python swagger uvicorn

Last synced: 03 May 2026

https://github.com/faisal-khann/diwali-sales-analysis

The "Diwali Sales Analysis" project aims to analyze the sales data during the Diwali festival period to uncover insights and trends that can help improve marketing strategies and sales performance in the future

csv data-analysis eda jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/mgobeaalcoba/datacademy

Iré subiendo los distintos desafíos del reto de Platzi llamado Datacademy

anaconda deepnote google-colab jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 04 May 2026

https://github.com/muhammad-fiaz/gpt

A simple implementation based on the "Attention is All You Need" paper, using GPT-2 for text generation.

attention-is-all-you-need gpt gpt-2 gpt-3 gpt-implementation gpt-using-pytorch gpt2 numpy open-source paper-implementations python pytorch pytorch-implementation

Last synced: 13 Apr 2026

https://github.com/iv4n-ga6l/OpenPose-Prediction

Person's body parts or joint position detection using openCV

detection numpy opencv prediction prediction-model python

Last synced: 28 Apr 2025

https://github.com/quinta0/pycolors

A python program to help people with color blindness recognize color in an image

colorblindness colors image-processing numpy pandas-dataframe python tkinter-gui

Last synced: 05 May 2026

https://github.com/ngirimana/decision-trees-and-random-forest-

For this project we will be exploring publicly available data from [LendingClub.com](www.lendingclub.com). Lending Club connects people who need money (borrowers) with people who have money (investors). Hopefully, as an investor you would want to invest in people who showed a profile of having a high probability of paying you back. We will try to create a model that will help predict this. Lending club had a [very interesting year in 2016](https://en.wikipedia.org/wiki/Lending_Club#2016), so let's check out some of their data and keep the context in mind. This data is from before they even went public. We will use lending data from 2007-2010 and be trying to classify and predict whether or not the borrower paid back their loan in full. You can download the data from [here](https://www.lendingclub.com/info/download-data.action) or just use the csv already provided. It's recommended you use the csv provided as it has been cleaned of NA values. Here are what the columns represent: * credit.policy: 1 if the customer meets the credit underwriting criteria of LendingClub.com, and 0 otherwise. * purpose: The purpose of the loan (takes values "credit_card", "debt_consolidation", "educational", "major_purchase", "small_business", and "all_other"). * int.rate: The interest rate of the loan, as a proportion (a rate of 11% would be stored as 0.11). Borrowers judged by LendingClub.com to be more risky are assigned higher interest rates. * installment: The monthly installments owed by the borrower if the loan is funded. * log.annual.inc: The natural log of the self-reported annual income of the borrower. * dti: The debt-to-income ratio of the borrower (amount of debt divided by annual income). * fico: The FICO credit score of the borrower. * days.with.cr.line: The number of days the borrower has had a credit line. * revol.bal: The borrower's revolving balance (amount unpaid at the end of the credit card billing cycle). * revol.util: The borrower's revolving line utilization rate (the amount of the credit line used relative to total credit available). * inq.last.6mths: The borrower's number of inquiries by creditors in the last 6 months. * delinq.2yrs: The number of times the borrower had been 30+ days past due on a payment in the past 2 years. * pub.rec: The borrower's number of derogatory public records (bankruptcy filings, tax liens, or judgments).

artificial-intelligence decision-tree-classifier machine-learning matplotlib numpy pandas python3 random-forest-classifier seaborn

Last synced: 11 Apr 2026

https://github.com/mauriciovazquezm/machinelearning_course_spring2023

Machine Learning course tasks focused on the implementation of the ML algorithms using libraries such as Numpy, Pandas, etc.

machine-learning machine-learning-algorithms numpy python r-language

Last synced: 05 May 2026

https://github.com/khaymanii/fake_news_prediction_model

This model was built using python and logistic regression algorithm

matplotlib numpy pandas python sckit-learn

Last synced: 05 May 2026

https://github.com/aarryasutar/credit_eda

This project focuses on cleaning and analyzing a loan application dataset to gain insights into the factors influencing loan defaults. Through systematic data cleaning, visualization, and merging with previous application data, it provides a robust foundation for further predictive modeling.

binning boxplot correlation-matrix data-cleaning data-splitting dataframe feature-engineering heatmap jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn

Last synced: 13 Apr 2026

https://github.com/myounus-codes/saleprice-prediction-dataset-analysis-and-cleaning-advance-regression

In this project I have cleaned the data for the model. Project Google Colab Link: https://colab.research.google.com/drive/1vQY-XEFJSdEkW2PQOSf1j13Yk8L-XXNw?usp=sharing

algorithms data-analysis data-science eda google-colab machine-learning numpy pandas python scikit-learn scikit-learn-python

Last synced: 05 May 2026

https://github.com/lintangwisesa/python_recommendation_system

Basic recommendation system using Python & Sklearn

numpy pandas python recommendation-system sklearn

Last synced: 05 May 2026

https://github.com/upul/indi

Small Machine Learning library written in Python

algorithms machine-learning numpy python scipy

Last synced: 10 Apr 2026

https://github.com/fearless-spider/stooq-dax-seasonal

Parsing seasonality from time series data can often be useful in data analytics. It helps with analyzing seasonality for decision making as well as for more accurate forecasts. Python can be used to separate out these trend and seasonal components.

numpy pandas python statistics

Last synced: 13 Apr 2026

https://github.com/ishutak/disease_prediction

An AI-powered disease prediction system that uses machine learning to predict diseases based on symptoms. The system employs an ensemble of models including Random Forest and Neural Networks to provide accurate predictions with confidence levels.

css3 htlm5 javascript jquery numpy pandas pytorch scikit-learn select2

Last synced: 11 Apr 2026

https://github.com/abhinav330/data-science-projects

The Data Science Projects folder contains various projects related to data analysis, machine learning, and exploratory data analysis (EDA). Each project focuses on a specific dataset and utilizes different techniques and algorithms to extract insights and make predictions.

decision-trees exploratory-data-analysis k-means-clustering k-nearest-neighbours kaggle linear-regression logistic-regression machine-learning machine-learning-algorithms matplotlib numpy pandas plotly python random-forest-classifier support-vector-machines

Last synced: 30 May 2026

https://github.com/mrtarantoga/mdl-density-histogram

Cython-accelerated MDL histogram density estimation. Implements Kontkanen & Myllymaki's dynamic programming algorithm for optimal variable-width bins, parametric complexity with Ramanujan approximations, and automatic bin count selection. Based on "MDL Histogram Density Estimation" (JMLR 2007).

cython-library mdl numpy pypi-package python3

Last synced: 30 Nov 2025

https://github.com/dipeshgoyal013/ipl_win_probability

A project which help you to check win probability of batting team in inning 2nd

machine-learning matplotlib numpy pandas python sklearn

Last synced: 05 Feb 2026

https://github.com/rkarahul/machinelearning

This is for my all machine learning code

analysis computer-vision matplotlib mysql nlp numpy pandas python

Last synced: 11 Apr 2026

https://github.com/aymen016/image-processing-algorithms

Explore Python-based image processing from scratch with core algorithms like interpolation, convolution, and filtering!

matplotlib numpy pillow python

Last synced: 13 Apr 2026

https://github.com/itancio/realestateprice

This is an application of deep learning to train and predict real estate price dataset in King County, USA.

datat-visualization machine-learning numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/mousazourob/stockdataresampler

Python script that resamples an instruments minute by minute candle stick data to any interval

alpha-vantage-api numpy pandas python

Last synced: 07 May 2026

https://github.com/sivatsk26/university-admit-eligibility-predictor

This project is created using Machine Learning and Regression methods- a statistical technique to predict the outcome of event which is to verify the users’ admission eligibility level, considering the universities they have chosen. This is achieved based on the algorithms implemented, when is user feed the application with the required information

html-css-javascript ibm-cloud ibm-watson linear-regression machine-learning matplotlib numpy pandas python python-flask random-forest scikit-learn

Last synced: 13 Apr 2026

https://github.com/darshan12345678910/air-quality-prediction-ml-algorithms

AirPredict AQI prediction using classification algorithm

classification-algorithims gradio-interface numpy pandas

Last synced: 06 May 2026

https://github.com/imswappy/brain-tumor-detection

🧠 Deep learning project for brain tumor classification using MRI images. Built with transfer learning (VGG16 + fine-tuning), TensorFlow/Keras, and deployed via Streamlit. Dataset & model loaded dynamically from KaggleHub. Includes training notebook, evaluation, and interactive web app.

kagglehub keras numpy pandas scikit-learn streamlit tensorflow vgg16-model

Last synced: 13 Apr 2026

https://github.com/gehad-ahmed30/data-analysis

This repository offers a guide to data visualization with Pandas and includes practical examples and two complete projects. Learn to preprocess, analyze, and visualize datasets to solve real-world problems effectively.

analysis matplotlib numpy pandas python seaborn

Last synced: 07 May 2026

https://github.com/ayaanjawaid/brain_stroke_prediction

project aims to predict the likelihood of a stroke based on various health parameters using machine learning models. The dataset is preprocessed, analyzed, and multiple models are trained to achieve the best prediction accuracy.

decision-trees exploratory-data-analysis matplotlib numpy pandas python regression xgboost

Last synced: 07 May 2026

https://github.com/akashash01/akash_linear-regression

This is an linear approach machine learning model used to predict the values of variable(dependent) based on other variables(independent).

anova-test autocorrelation data-science heteroscedasticity linear-regression machine-learning mape multicollinearity numpy pandas python supervised-learning

Last synced: 07 May 2026

https://github.com/yogeshnile/flight-delay-prediction

In this repo i developed predict a flight delay using machine learning.

flight-delay-prediction jupyter-notebook machine-learning numpy pandas python3 seaborn sklearn

Last synced: 13 Feb 2026

https://github.com/rishraks/color-object-detection

This project enables real-time detection of user-defined color objects using a webcam. Users enter a color name, and the program identifies objects matching that color by converting it to its HSV range, applying a mask, and detecting contours. It's a practical tool for object tracking, sorting, and interactive applications.

computer-vision numpy opencv pillow-library

Last synced: 11 Apr 2026

https://github.com/mansurpro/digitrecognizer

DigitRecognizer is a simple, handcrafted neural network implemented with NumPy to classify handwritten digits from the MNIST dataset. This project demonstrates the foundational principles of neural networks, including forward propagation, backpropagation, and gradient descent, all built from scratch without relying on deep learning frameworks.

backpropagation custom-dataset data-visualization deep-learning from-scratch gradient-descent handwritten-digit-recognition machine-learning minimal-dependencies mnist neural-network numpy python

Last synced: 16 May 2026

https://github.com/kunalshelke90/predict-bank-credit-risk-using-south-german-credit-data

This is an end-to-end ML project, which aims at developing a classification model for the problem of classifying a given customer profile into either of the risk category (safe or not safe). The final classifier used for this project is CatBoost classifier. Deployed in AWS.

aws cassandra catboost-classifier classification credit-risk data data-science dataanalysis dockerfile finance financial-analysis flask github-actions logging machine-learning mlflow numpy pandas python

Last synced: 03 Jan 2026