An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/iv4n-ga6l/Crowd-move-simulation

Crowd movement simulation in Python

matplotlib numpy python simulation

Last synced: 28 Apr 2025

https://github.com/raghavendranhp/predictive-analytics-in-automotive-manufacturing-testing-time-optimization

Explore machine learning for automotive testing optimization. Predictive analytics to reduce testing time and environmental impact.

elasticnet extratreesregressor fastica model-pipeline numpy pandas pca randomforestregressor svr-regression-prediction xgboost

Last synced: 09 May 2026

https://github.com/harmanveer-2546/finding-you-next-perfect-house

This exploratory data analysis examines a dataset of features describing houses listed for rent by their owners on rental platforms, and uses descriptive statistics of those features to draw conclusions.

adaboostregressor decisiontreeregressor exploratory-data-analysis house-price-prediction inline linearregression matplotlib numpy onehotencoder pandas plotly python randomforestregressor renting seaborn standardscaler xgbregressor

Last synced: 13 Apr 2026

https://github.com/amiralisajadi/image-classification-neural-network

Custom deep learning model for binary image classification. Entirely built from scratch in Python with NumPy, including all ML functions, activations, and optimizations.

deep-learning image-classification image-classification-model machine-learning neural-networks numpy

Last synced: 08 May 2026

https://github.com/hajaarh/malaria_hematie_cnn

Convolutional neural network algorithm

numpy pandas python resnet-50 sklearn-library tensorflow vgg16

Last synced: 13 Apr 2026

https://github.com/saro0307/exploratory-data-analysis-terrorism

Phase 1 of a data science project (program) performing exploratory data analysis on terrorism using Python on Google Colab, for the Coderscave internship, Sept 2023

colaboratory data-analysis datascience machine-learning numpy pandas python seaborn skit-learn visualization

Last synced: 13 Apr 2026

https://github.com/dummumounika/ecommerce-sales-categorization

This repository contains Python code for text classification and analysis of e-commerce sales data. The script processes textual descriptions of products and categorizes them into predefined categories using a Naive Bayes classifier. It also includes various analysis and visualization methods to explore the dataset.

machine-learning matplotlib-pyplot ntlk numpy pandas python scikit-learn

Last synced: 13 Apr 2026

https://github.com/blinksta1ker/co2-prediction

EDA and a supervised ML regression model to predict CO2 emissions from vehicles

jupyter-notebook machine-learning numpy pandas python regression seaborn sklearn supervised-learning

Last synced: 09 Apr 2026

https://github.com/soumyapro/house-price-prediction

Explores the attributes of the Boston housing dataset, trains a linear regression model on one part of the dataset, and uses the trained model to make predictions on the remaining part.

linear-regression matlplotlib numpy pandas sklearn svm

Last synced: 13 Apr 2026

https://github.com/wasifsohail5/virtual_mouse

Virtual Mouse is an AI-powered, gesture-based control system that replaces traditional peripherals with hand movements. Using MediaPipe for hand tracking and PyAutoGUI for cursor control, it enables seamless clicking, scrolling, and navigation through simple gestures. Experience a futuristic, touchless interaction with your PC! 🚀🖱️

cv2 mediapipe mediapipe-hands numpy opencv pyautogui python

Last synced: 02 Aug 2025

https://github.com/eny2930/ai-ml-projects

A project demonstrating various libraries and tools in using algorithms in machine learning. This repository includes code samples, datasets, and detailed explanations.

iris-dataset keras linear-regression logistic-regression matplotlib numpy pandas pytorch sckiit-learn seaborn sentiment-analysis

Last synced: 13 Apr 2026

https://github.com/ishaan28malik/masters-admission-analyser

Admissions of Indian students for foreign universities.

admission classification machine-learning matplotlib numpy python students

Last synced: 08 May 2026

https://github.com/sorabh-kapoor/face-recognition-attendance-system

The Facial Recognition System is an AI-powered application built with Flask, designed to detect and recognize faces with high accuracy. This system can be integrated into various applications, including security systems, attendance management, and identity verification.

flask flask-application knn ml numpy opencv pandas python scikit-learn

Last synced: 13 Apr 2026

https://github.com/dmarks84/coursework_project_nlp-with-nltk

Project for University of Michigan Applied Data Science Specialization -- Utilized NLTK library to process natural language, and then built several spelling recommenders for a list of misspelled words.

data-modeling databases dataframes eda nlp numpy pandas python reporting statistics text-mining visualization

Last synced: 13 Apr 2026

https://github.com/kumartusha/Data_Scientist-AI-ML-_Training_Anudip_Foundation

Anudip Foundation Data Analytics Training 📊 A repository containing projects, assignments, and coursework from the Anudip Foundation Data Analytics training program. This includes work on: 🧹 Data Cleaning & Preprocessing 📉 Data Visualization & EDA 📊 Statistical Analysis

excel matplotlib numpy pandas powerbi python scipy sql

Last synced: 27 Sep 2025

https://github.com/lmtreser/intro-data-science

Training track in Data Science with Python and Pandas

data-science matplotlib numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/nambukeerthi/copper_project

Data Analysis Skills: work with real-world data. Machine Learning Experience: build and train models. Web App Development: create interactive dashboards.

eda machine-learning numpy pandas python streamlit

Last synced: 13 Apr 2026

https://github.com/pramodyasahan/spaceship-titanic

This repository features a machine learning model designed to predict whether passengers of a space travel company are likely to be transported. The model employs CatBoostClassifier, a machine learning algorithm known for handling categorical data effectively.

machine-learning numpy pandas python scikit-learn

Last synced: 13 Apr 2026

https://github.com/ditikrushna/identify-customer-segments

In this project, Bertelsmann partners AZ Direct and Arvato Financial Solutions have provided two datasets: one with demographic information about the people of Germany, and one with the same information for customers of a mail-order sales company. I examined relationships between demographic features, organized the population into clusters, and looked at how prevalent customers are in each of the resulting segments.

clustering clustering-algorithm numpy pca

Last synced: 08 May 2026

https://github.com/yesenvidnath/facerecognitionapplight

This is a face recognition application which is capable of recognizing a person based on their face and taking attendance from it.

face-recognition facent flask numpy python

Last synced: 13 Apr 2026

https://github.com/L-Gre/TradeFX-AI

A Python-based project exploring algorithmic trading strategies, including backtesting, real-time data integration, and predictive modeling with TensorFlow and Keras. Key topics include technical indicators, risk management, and leveraging AWS and broker APIs for automated trading.

algorithmic-trading aws backtesting broker-api keras machine-learning matplotlib numpy pandas portfolio-optimisation python real-time-data-integration risk-management technical-indicators tensorflow

Last synced: 26 Sep 2025

https://github.com/manishkumarpatel07/heartattack_risk_prediction

"Heart Attack Risk Prediction" uses machine learning to estimate the likelihood of a heart attack based on user-provided data like physical attributes, symptoms, and medical history. This system enables remote screening, identifying high-risk individuals, and easing medical system burdens by providing early, data-driven health risk assessments.

boruta knn-algorithm matplotlib numpy pandas python scikit-learn

Last synced: 09 Apr 2026

https://github.com/shwetapardhi/assignment-05-multiple-linear-regression-2

Prepare a prediction model for profit of 50_startups data. Do transformations for getting better predictions of profit and make a table containing R^2 value for each prepared model. R&D Spend -- Research and development spend in the past few years; Administration -- spend on administration in the past few years; Marketing Spend -- spend on Marketing in t

collinearity-diagnostics cooks-distance correlation-analysis eda heteroscedasticity homoscedasticity leverage-score multi-linear-regression numpy ols-regression p-value pair-plot python r-square-values regress-exog residual-analysis smf statsmodels vif

Last synced: 09 May 2026

https://github.com/shwetapardhi/assignment-04-simple-linear-regression-2

Q2) Salary_hike -> Build a prediction model for Salary_hike Build a simple linear regression model by performing EDA and do necessary transformations and select the best model using R or Python. EDA and Data Visualization. Correlation Analysis. Model Building. Model Testing. Model Predictions.

correlation-analysis data-visualization distplot eda feature-engineering model-building model-predictions model-template numpy ols-regression p-value pandas python r-square-values regression-plot seaborn simple-linear-regression smf statsmodels t-score

Last synced: 08 May 2026

https://github.com/shwetapardhi/assignment-03-q2--hypothesis-testing

Anova ftest statistics A hospital wants to determine whether there is any difference in the average Turn Around Time (TAT) of reports of the laboratories on their preferred list. They collected a random sample and recorded TAT for reports of 4 laboratories. TAT is defined as sample collected to report dispatch. Analyze the data and determine wheth

annova ftest hypothesis-testing numpy pandas python scipy significance-testing stats

Last synced: 11 Apr 2026

https://github.com/abhisek-13/multi-commodity-price-predictor

The Multi-Commodity Price Predictor is a machine learning project that predicts the prices of cars, bikes, and laptops. Using advanced techniques, it provides accurate price predictions based on user inputs. The project features an easy-to-use interface built with Streamlit, enabling users to get price predictions quickly and easily.

machine-learning machine-learning-algorithms matplotlib numpy pandas python seaborn sklearn sklearn-library streamlit

Last synced: 09 Apr 2026

https://github.com/nahiyanhkhan/stock-market-data-analysis_capstone-project

Coursework and solved assignments on SQL and Python. The final capstone project analyzed "Stock Market Data". Achieved a 100% score on every assignment.

data-analysis data-analytics matplotlib mysql mysql-database numpy pandas python sql

Last synced: 09 Apr 2026

https://github.com/iwitaly/aws-lambda-docker-builder

Docker image for building packages for AWS Lambda with Python environment

aws aws-lambda docker docker-image numpy python python3

Last synced: 09 Apr 2026

https://github.com/saraasgari99/videoclassificationusingdeeplearning

ML pipeline for biological video analysis using cutting-edge CNN architectures and industry-standard data processing and ML techniques

computer-vision data-processing keras-tensorflow machine-learning numpy sklearn tensorflow

Last synced: 13 Apr 2026

https://github.com/mr-chang95/twitter_datawrangling

Twitter Data Wrangling for Udacity's Data Analyst Nanodegree Program

data-visualization data-wrangling dogs matplotlib numpy pandas python twitter

Last synced: 09 Apr 2026

https://github.com/bropenguin847/python-learning

Learning Python for Scientific Programming. Intended for beginners with some knowledge of Python.

array-manipulations convolution jupyter matlab matplotlib-pyplot numpy pandas-python pep8 python python3 scientific-programming scipy spyder-ide vscode

Last synced: 13 Apr 2026

https://github.com/udacity-machinelearning-internship/support-vector-machine

Implementing SVMs using pandas and sklearn in Python

numpy pandas sklearn support-vector-machines svm

Last synced: 13 Apr 2026

https://github.com/daniel-brai/alx-t-x-udacity-data-analytics-nanodegree

A 3-month nanodegree going through the intricacies of data analysis with Python and its various libraries, such as pandas, NumPy, matplotlib, seaborn, and others

matplotlib numpy pandas python3

Last synced: 08 May 2026

https://github.com/rajan-bhateja/data-analysis-with-python

Cleaning, analysis and visualization on various datasets using Python and its various libraries via Jupyter Notebook

data-cleaning data-science jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/kevinmbeaulieu/machine-learning-practice

ML models & data processing utilities written from scratch as practice.

data-science machine-learning numpy pandas

Last synced: 08 May 2026

https://github.com/chdl17/credit-eda-casestudy

Credit EDA is a GitHub repository designed to provide educational materials and tools for exploring credit risk data. The repository contains notebooks, datasets, and code samples to help users learn and apply these concepts to credit data.

exploratory-data-analysis numpy pandas

Last synced: 08 May 2026

https://github.com/gitluffywd/advanced_video_stabilization_and_roi_pixel_analysis

This repository offers tools for advanced video stabilization and pixel analysis, enhancing your video processing capabilities. Explore features like optical flow tracking and ROI selection to dive deeper into your video data. 🛠️🎥

data-structures debugging english-language git github html markdown numpy opencv project-management python

Last synced: 20 Jan 2026

https://github.com/estaheri7/dotawinpredictor

This project aims to predict the outcome of Dota 2 matches based on the heroes picked by both the Radiant and Dire teams

data-science dota2 machine-learning numpy pandas python random-forest sklearn support-vector-machines

Last synced: 09 Mar 2026

https://github.com/lefteris-souflas/modern-slavery-analysis

Jupyter notebook using machine learning techniques to explore the complex drivers of modern slavery. Models from a research paper are replicated and evaluated. Actions also include filling missing data, training regression models, and analyzing feature importance.

decision-tree feature-importance grid-search-cv imputation jupyter-notebook lasso-regression linear-regression matplotlib mean-absolute-error numpy pandas preprocessing principal-component-analysis python3 random-forest ridge-regression scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/audeering/audmath

General math functions

math numpy

Last synced: 07 Feb 2026

https://github.com/rissh/titanicsurvivalpredictionusingml

Predicting Titanic passenger survival through machine learning. This project includes data preprocessing, exploratory data analysis, feature engineering, and model training using Python. 🚢

data data-analysis data-science data-visualization dataanalysis jupiter-notebook machine-learning machine-learning-algorithms machinelearning matplotlib numpy pandas prediction prediction-model python python3 seaborn tenserflow tflearn titanic

Last synced: 01 Feb 2026

https://github.com/hasanyahya101/pareto-tutorial-python

This tutorial demonstrates how to create Pareto charts using Python libraries like pandas, NumPy, and Matplotlib. All code for this is given in the Jupyter Notebook.

chart charts code-blocks ipynb jupyter-notebook license markdown matplotlib mit numpy pandas pareto py python

Last synced: 07 Feb 2026

https://github.com/sarowarahmed/advertising-sales-app

📈 Advertising Sales Predictor: A web app powered by a Machine Learning model, built with NumPy, Pandas, Scikit-learn, and Streamlit, to forecast sales based on TV, Newspaper, and Online Advertising. Deployed on Streamlit Cloud for real-time, easy-to-use predictions.

advertising app machine-learning multiple-linear-regression numpy pandas sales scikit-learn streamlit

Last synced: 07 Feb 2026

https://github.com/mrmalik2512/catsvsdog.github.io

A CNN model integrated with a Flask backend. The model is trained on image data of dogs and cats, and the accompanying website predicts whether a given image is a dog or a cat.

deep-learning numpy python scikit-learn tensorflow

Last synced: 09 Apr 2026

https://github.com/niteshchawla/aerofit-casestudy

The market research team at AeroFit wants to identify the characteristics of the target audience for each type of treadmill offered by the company, to provide a better recommendation of the treadmills to the new customers. The team decides to investigate whether there are differences across the product with respect to customer characteristics.

matplotlib-pyplot numpy pandas probability python seaborn statistics

Last synced: 09 Apr 2026

https://github.com/sharkb8t/credit-risk-classification

Demonstrates my abilities to use Jupyter Notebook with scikit-learn to train and evaluate a machine learning model.

jupyter-notebook numpy pandas pathlib python scikit-learn

Last synced: 15 Apr 2026

https://github.com/waghraj1699/codeclauseinternship_customer_segmentation_tool

Implementing customer segmentation using ML, covering exploring advantages, preprocessing, K-means clustering, and visualization.

data-preprocessing kmeans-clustering machine-learning numpy pandas visualization

Last synced: 08 May 2026

https://github.com/chandkund/customer-segmentation

Customer segmentation divides customers into distinct groups based on characteristics and behaviors. This project uses K-Means clustering, an unsupervised machine learning algorithm, to segment customers and provide insights for targeted marketing strategies

kmeans-clustering matplotlib numpy pandas python seaborn

Last synced: 15 Apr 2026

https://github.com/mzayles/analise-descritiva.py

Python Programming for Data Science course | Descriptive analysis and measurement of data from a consolidated report using matplotlib.

matplotlib numpy pandas python

Last synced: 08 May 2026

https://github.com/oaerobert/python-for-finance

Implementations of financial mathematics and statistics using the "Python for Finance" textbook. Includes Monte Carlo simulations, predictive stock prices using ML strategies and real world data visualisations.

matplotlib numpy pandas python

Last synced: 09 Apr 2026

https://github.com/coueghlani/nlp

Natural Language Processing and Data Analysis project

mineria-de-datos nlp nlp-machine-learning nltk numpy procesadores-de-lenguajes sklearn spacy

Last synced: 08 Feb 2026

https://github.com/satvikpraveen/numpymasterpro

A hands-on, production-ready toolkit to master NumPy, from first principles to real-world applications. Includes modular Jupyter notebooks, reusable utility scripts, cheatsheets, and advanced projects like K-Means clustering from scratch.

broadcasting data-analysis data-science data-source data-visualization jupyter-notebook kmeans-clustering linear-algebra machine-learning matrix-algebra numerical-computation numpy numpy-broadcasting numpy-examples numpy-tutorial open-source python scientific-computing standardization vectorization

Last synced: 08 May 2026

https://github.com/rampal-punia/data-science-toolkit

Your Go-To Resource for Essential Data Science Related Commands, Concepts, Quick Overviews and Useful Functions.

artificial-intelligence data-science keras machine-learning matplotlib nlp nlp-machine-learning numpy pandas pythorch sql tensorflow

Last synced: 09 Feb 2026

https://github.com/hansalemaos/all_against_all

Each item of a list against all others

list numpy

Last synced: 02 Apr 2026

https://github.com/shaulamarquez/data-job-market

This is a Data Job Market analysis

analysis matplotlib numpy python seaborn

Last synced: 09 May 2026

https://github.com/mrandolph95/grad-projects

Projects completed during my ASU grad program for Social Data Science.

beautifulsoup nltk numpy pandas regression-analysis sckiit-learn seaborn selenium

Last synced: 09 Apr 2026

https://github.com/carterbox/libimage

Provides large (2k) test images as NumPy arrays.

images numpy python

Last synced: 15 Apr 2026

https://github.com/harmanveer-2546/statistics-for-machine-learning

Statistical tools help you clean and organize your data. You can identify outliers, manage missing values, and ensure your data is in a format that the ML algorithms can understand.

inline matplotlib matplotlib-styles numpy pandas probability python seaborn statistics

Last synced: 18 Apr 2026

https://github.com/sunilvarma-l/zomato-data-analysis-using-python

A Python-based project to analyze and visualize Zomato restaurant data. This project explores various aspects of restaurant data, such as customer reviews, restaurant ratings, locations, and cuisines, with the aim of uncovering insights and trends in the food industry.

googlecollab matplotlib numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/rezapace/machine-learning-sound-detection

Welcome to the Machine Learning Detection Sound project! This project harnesses the power of machine learning to analyze car sounds, enabling the detection of vehicles based on their audio signatures.

colab-notebook googlecolab jupyter-notebook machine-learning numpy pathlib python python3 sound source tensorflow voice voice-recognition

Last synced: 09 Apr 2026

https://github.com/mindkerchief/baselineml

A collection of machine learning tasks performed during my computer science studies, majoring in intelligent systems.

decision-tree dummy gaussian-mixture-models kmeans-clustering linear-regression logistic-regression machine-learning matplotlib numpy pandas random-forest scikit-learn seaborn tensorflow

Last synced: 16 Apr 2026

https://github.com/jjwizardmp/gallery-art-problem

Solves the Gallery Art Problem by implementing BFS and Delaunay triangulation

jupyter jupyter-notebook jupyterlab mathplotlib numpy pandas python python-env python3 scipy shapely

Last synced: 09 Apr 2026

https://github.com/mustafaozvardar/image-classification-webapp

This project features an AI model for diagnosing pneumonia from chest X-ray images. Users can upload a chest X-ray image through the app, and the model will classify it.

artificial-intelligence computer-vision keras numpy python streamlit streamlit-webapp tensorflow

Last synced: 27 Feb 2026

https://github.com/saurabh274/aerofit-statistics-and-probability

The market research team at AeroFit wants to identify the characteristics of the target audience for each type of treadmill offered by the company, to provide a better recommendation of the treadmills to the new customers. The team decides to investigate whether there are differences across the product with respect to customer characteristics.

matplotlib-pyplot numpy pandas probability python seaborn statistics

Last synced: 08 May 2026

https://github.com/0xunkn0wn4m1r/data_engineering_banking_project

🏦 Build a complete data engineering workflow for a banking system, showcasing ETL processes, data transformations, and an interactive financial dashboard.

automation data-analysis data-cleaning data-science feature-engineering fintech-bank flask-api loan-default-prediction machine-learning mlops model-explainability numpy postgresql scikit-learn segmentation shap sql unsupervised-learning

Last synced: 09 Apr 2026

https://github.com/walkerdustin/python-linear-regression

Simple linear regression in Python, based on Andrew Ng's Coursera course

ai linear-regression machine-learning ml numpy python

Last synced: 05 May 2026

https://github.com/kkkaan/tensor-library-403-hw1

A NumPy-like library for tensor (gergen) operations.

deep-learning numpy python tensor

Last synced: 08 May 2026

https://github.com/ashar97/advanced-machine-learning

Advanced Machine Learning MOOC Specialisation Assignments

keras numpy python3 tensorflow

Last synced: 09 Apr 2026

https://github.com/yalai92/alfalfa_imp_exp_analysis

This repository covers data cleaning, analysis, and visualization of global alfalfa and pellet imports, focusing on trends from 2003 to 2023. It also includes a predictive analysis of global alfalfa demand for 2024-2029, using data science techniques to provide insights for stakeholders in the alfalfa industry.

data-analysis data-cleaning data-visualization matplotlib numpy pandas python sckiit-learn tableau

Last synced: 12 Feb 2026

https://github.com/nova-land/kalmanfilter-python

Kalman Filter in PyTorch and NumPy

kalman-filter numpy python pytorch

Last synced: 08 May 2026

https://github.com/diegodscamara/whisperlive

🎙️ WhisperLive: Real-time audio transcription powered by OpenAI's Whisper model. Convert live speech to text with high accuracy, supporting multiple languages and real-time processing. Perfect for accessibility, content creation, and live captioning.

ai blackhole numpy python wave whisper

Last synced: 09 May 2026

https://github.com/shingiraibhengesa/house-price-predictor

A machine learning project that predicts house prices based on user input features such as square footage, number of bedrooms, and more.

machine-learning-models matplotlib numpy python scikit-learn seaborn

Last synced: 09 May 2026

https://github.com/klima7/numpynet

Convolutional Neural Network written from scratch using NumPy, with an API similar to TensorFlow.

cnn convolution network nn numpy scratch tensorflow

Last synced: 24 Sep 2025

https://github.com/separatrixxx/nm_labs_6_7_sem

👓 Laboratory work for semesters 6 and 7 of MAI on numerical methods

numerical-methods numpy python

Last synced: 16 Apr 2026

https://github.com/rakesh3596/tutedude_projects

Consists of 3 major projects: 1. Covid19 Analysis, 2. Rainfall Prediction, 3. Tumor Detection

csv-files matplotlib numpy pandas plotly python sklearn-library

Last synced: 16 Apr 2026

https://github.com/hansalemaos/npsearchsequence

Sequence / string search with NumPy

fast numpy search sequence string

Last synced: 19 May 2026

https://github.com/jahanostg/healthguard_with-smart-ai-diagnosis-hub

This project is a Flask-based platform for patients, donors, and doctors, integrating hospital management in one solution. It uses Machine Learning models for predictive health analysis and Power BI for interactive dashboards. Through this project, I enhanced my skills in full-stack development, applied ML, and healthcare data solutions.

css flask-api google-colaboratory-notebooks html5 js machine-learning mysql numpy pandas pkl-model powerbi python-3

Last synced: 09 Apr 2026

https://github.com/aravinda-1402/covid-detection-model-using-chest-x-ray

The objective of this project is to develop a deep learning model that distinguishes chest X-rays of healthy patients from those of pneumonia (Corona) patients using the Chest X-Ray dataset, and to use this model to power an AI application for faster coronavirus testing.

classification cnn covid flask keras numpy pandas tensorflow

Last synced: 09 Apr 2026