An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language that adds support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/v-mayya/resale-revolution

Empower Hacks Hackathon Submission

html matplotlib numpy pandas python sqlite streamlit

Last synced: 09 Apr 2026

https://github.com/jcardonamde/datasets_ml

This project analyzes cab and limousine travel data in New York City, with the goal of predicting the total duration of trips within the city using machine learning models.

data-science machine-learning machine-learning-algorithms matplotlib numpy pandas pipelines python seaborn sklearn

Last synced: 09 Apr 2026

https://github.com/stefagnone/text_adventure_game

A text-based adventure game project using Python fundamentals

matplotlib numpy pandas python r scikit-learn seaborn sql

Last synced: 09 Apr 2026

https://github.com/Khushi130404/CatNet

CatNet is a simple machine learning project that classifies images as either a cat or not a cat using logistic regression. The dataset consists of labeled images of cats and non-cats, preprocessed and used to train a binary classification model.

h5py matplotlib numpy pillow scipy

Last synced: 19 Sep 2025

https://github.com/n-t-raghava/the_sweet_16

This project detects faces in real-time from a webcam feed or an uploaded image and predicts age (in bins of 5 years) and gender (Male/Female). The model is based on OpenCV’s Deep Neural Network (DNN) module and pre-trained models for face, age, and gender detection.

caffemodel deep-learning flask numpy opencv python

Last synced: 09 Apr 2026

https://github.com/kgruiz/linalg-practice

LinAlg-Practice is a Python library developed to deepen my understanding of linear algebra through hands-on implementation of various matrix operations. It includes comprehensive tests that compare the results with established libraries like NumPy to ensure accuracy and reliability.

algorithms data-science linear-algebra math matrix-operations numpy python sympy

Last synced: 21 Apr 2026

https://github.com/nandit123/python_on_excel

Data analysis using Python libraries on Excel data

csv data-analysis data-science fill fluctuations graph numpy python python-library

Last synced: 16 May 2026

https://github.com/haseebulhassan437/lstm-next-word-predictor

A deep learning-based LSTM model for predicting the next word in a sequence using natural language processing techniques.

keras-tensorflow lstm numpy tensorflow

Last synced: 18 Apr 2026

https://github.com/vetrivel07/flight-price-prediction

Developed a flight price prediction model using Python, analyzing historical data to forecast airfare prices and help travelers make informed booking decisions

data-analysis data-visualization jupyter-notebook numpy pandas python

Last synced: 15 Jun 2025

https://github.com/rajireddy15/employee-attrition-prediction-hr-analytics-

Employee Attrition Prediction (HR Analytics) helps organizations analyze employee data, identify factors driving turnover, and predict attrition using machine learning and visual dashboards, enabling data-driven HR decisions and retention strategies.

data-cleaning data-collection data-manipulation data-preprocessing data-science data-visualization eda feature-engineering imbalanced-data machine-learning mysql-database numpy pandas

Last synced: 04 May 2026

https://github.com/vinicius999/eda-imdb-top1000-films

Exploratory analysis of the Top 1000 films on IMDb up to 2020

eda numpy pandas python

Last synced: 07 May 2026

https://github.com/buildwithlal/introduction-to-data-science-in-python-coursera

Introduction to Data Science in Python, part of the Applied Data Science with Python Specialization from the University of Michigan, offered on Coursera

data-analysis matplotlib numpy pandas

Last synced: 03 May 2026

https://github.com/abdullahashfaqvirk/sms-spam-detection

A machine learning application designed to classify SMS messages as spam or non-spam, offering real-time analysis to identify potentially harmful content.

css3 docker flask html5 javascript matplotlib nltk numpy pandas python scikit-learn seaborn tailwindcss xgboost

Last synced: 02 Apr 2026

https://github.com/dmarks84/coursework_project_ml-model-eval-refine

Project for IBM Data Science course on ML Models & Analysis -- Read in a large dataset of home sales and used polynomial regression analysis to predict future home sale prices

classification communication data-modeling dataframes machine-learning matplotlib numpy pandas programming python regression scikit-learn scipy seaborn supervised-ml visualization

Last synced: 09 Apr 2026

https://github.com/aymen016/cosmic-mystery-challenge-2912

"Explore the depths of space and unravel cosmic mysteries in the year 2912 with our Cosmic Mystery Challenge repository. Dive into data science adventures as you predict the fate of passengers aboard the Spaceship Titanic after a collision with a spacetime anomaly. Join us in reshaping history and saving lives across the universe!"

kaggle matplotlib matplotlib-pyplot numpy pandas pandas-dataframe python scikit-learn scikitlearn-machine-learning seaborn

Last synced: 10 Apr 2026

https://github.com/marcinz20/sortingalgorithms

This is a basic console program that contains a few basic sorting algorithms and showcases their usage

algorithms numpy object-oriented-programming python

Last synced: 15 May 2026

https://github.com/jpgiant/nyc_energy_prediction

A comprehensive code for predicting energy usage in NYC using Machine Learning Algorithms.

data-analysis data-science data-visualization folium jupyter-notebook machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 10 Apr 2026

https://github.com/qanastek/parseur-pdf

https://trello.com/b/SbT2XGyF/g%C3%A9nie-logiciel-scrum

beautifulsoup4 numpy python

Last synced: 15 May 2026

https://github.com/kikoveiga/feup-ia1

Artificial Intelligence (IA) First Project (2023/2024): BSc in Informatics and Computing Engineering @ FEUP

feup feup-ia feup-leic genetic-algorithms hill-climbing numpy pandas python simulated-annealing tabu-search

Last synced: 06 May 2026

https://github.com/mustafadanabasi/python-linearregression-evfiyatlari

A study on predicting house prices with linear regression.

linear-regression numpy pandas python

Last synced: 06 Apr 2026

https://github.com/franciscomartinez45/Social-Network-Analysis

Analyzes how misinformation propagates within communities. With the goal of addressing health disparities and improving health literacy, particularly in minority populations, the project explores both supervised and unsupervised learning approaches to understand patterns in graph-structured data using a custom Graph Attention Network

matplotlib ml networkx numpy pytorch

Last synced: 11 Apr 2025

https://github.com/dmarks84/ind_project_data-science-london-scikit-learn--kaggle

Independent Project - Kaggle Competition -- I worked on the Data Science London data set for the Data Science London + Scikit-learn competition.

classification cross-validation data-modeling data-reporting data-visualization dataframes eda grid-search matplotlib numpy pandas python sklearn statistics supervised-ml

Last synced: 06 Apr 2026

https://github.com/shambac/shamboflow

Fierce tensorflow competitor

cuda cupy machine-learning numpy pypi-package

Last synced: 19 Feb 2026

https://github.com/kenwuqianghao/ml-zoomcamp

Code and homework for ML Zoomcamp

machinelearning numpy pandas python3 tensorflow

Last synced: 06 Apr 2026

https://github.com/mallocode300/colour_palette_generator

Upload an image and immediately obtain a colour palette with the 10 most common colors in HEX codes and RGB

numpy python

Last synced: 29 Jul 2025

https://github.com/morsalinislamshapon/diabetes-prediction-systemv3

This repository contains a machine learning model that predicts diabetes using user health data. It features an interactive web interface built with Streamlit and provides insights into model predictions through SHAP and permutation importance. 🐙🌟

jupyter-notebook machine-learning matplotlib numpy pandas plotly randomforestclassifier sckit-learn seaborn streamlit-webapp

Last synced: 29 Jul 2025

https://github.com/mzayles/vendas_ficticias_dataprep

💻📊✅ Python Programming for Data Science course | Analyzing and cleaning fictitious data.

numpy pandas python

Last synced: 15 Jun 2025

https://github.com/arjunravi26/glucose-monitoring

Glucose Monitoring system to monitor patients and notify whenever glucose goes high.

machine-learning numpy pandas plotly python3 streamlit xgbboostregressor

Last synced: 30 Jul 2025

https://github.com/nathadriele/transaction_fraud_prevention_pipeline

A solution for detecting and preventing fraud in financial transactions, combining Machine Learning, business rules, and advanced statistical analysis. The system offers an interactive dashboard for real-time monitoring, data analysis, and fraud alert management.

data-analysis data-visualization docker fraud-prevention machine-learning matplotlib numpy pandas pipeline pytest python scikit-learn scipy seaborn streamlit tensorflow transaction xgboost

Last synced: 10 Apr 2026

https://github.com/izhaan0/predict-marks-based-on-study-hours

Student Marks Predictor is a machine learning project that predicts a student’s exam scores based on the number of study hours. It uses Linear Regression to learn the relationship between study hours and marks, and provides both command-line and interactive Streamlit web interfaces for prediction and visualization.

data-visualization joblib jupyter-notebook machine-learning machine-learning-algorithms matplotlib-pyplot numpy pandas pandas-dataframe pickle python scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/zeeshan4002911/data-analysis-hub

Quality control, data processing, data cleaning, data plotting

jupyter-notebook jupyterlab matplotlib numpy pandas seaborn

Last synced: 03 May 2026

https://github.com/mgitrov/coce

A deep learning-based project aiming to classify images out of 10 classes.

computer-vision convolutional-neural-networks deep-learning docker fastapi keras matplotlib numpy pillow regularization

Last synced: 06 Apr 2026

https://github.com/ledsouza/MedPhys-BI

Application developed for managing and analyzing data from the Medical Physics in Nuclear Medicine Service at HCPA

aws boto3 cloud-computing cloud-run docker dotenv gcp mongodb numpy os pandas plotly pymongo python s3 streamlit terraform vitrinedev yaml

Last synced: 22 Sep 2025

https://github.com/hanannazri/predictive-customer-churn-modelling-with-price-sensitivity-insights-

As a part of BCG Data Science Project, I developed a predictive churn-risk model for XYZ energy utility by engineering price, consumption and contract features and training XGBoost and Random Forest models to identify customers most likely to churn at current prices.

bcgx data-visualization exploratory-data-analysis feature-engineering jupyter jupyter-notebook model-training-and-evaluation numpy pandas random-forest scikit-learn xgboost

Last synced: 19 Aug 2025

https://github.com/youssefali11997/free-style-coding

Scripts written as practice, to try new technology, or just to play with code!

bash javascript numpy python3 ruby-on-rails scripts vuejs

Last synced: 12 Apr 2026

https://github.com/abeed04/face-recognition-using-computer-vision-cv2

OpenCV (cv2) can be used for face recognition by detecting faces, extracting facial features, and comparing them to a database of known faces.

cmake cv2-library dlib-face-recognition face-recognition flask numpy pycharm-ide python

Last synced: 08 Feb 2026

https://github.com/eesunmoon/genai_cor-recom

[Project] Outfit Coordination Recommender System using KoAlpaca

data-analysis fine-tuning generative-ai huggingface keyword llm numpy pandas python python3 selenium

Last synced: 06 Apr 2026

https://github.com/lfgodoi/rpm-deep-guesser

A deep learning-based RPM estimator based on spectral features extracted from vibration signals of rotating machines.

condition-monitoring deep-learning docker flask machine-learning neural-networks numpy python pytorch scipy signal-processing spectral-analysis

Last synced: 02 Mar 2025

https://github.com/shreyasdankhade/portfolio_optimatization_project

The Portfolio Optimization Project uses optimization techniques to balance risk and return, helping investors make efficient asset allocation decisions.

flask flask-application matplotlib numpy pandas pandas-python porfolio-optimization portfolio python

Last synced: 06 Apr 2026

https://github.com/t-lak/decision-tree

This project implements a basic Decision Tree classifier. It supports visualizing the tree and calculating performance metrics (accuracy, precision, F1-score, and recall).

decision-trees graphviz metrics numpy python3 scikit-learn

Last synced: 07 May 2026

https://github.com/valmir-unicap/deteccao-de-mascaras-faciais

Experiment based on the paper: A Face Mask Detection Algorithm Based on YOLO

kaggle-dataset mobilenetv2 numpy opencv python torch yolo

Last synced: 12 Apr 2026

https://github.com/iv4n-ga6l/simple-image-segmentation

Simple implementation of image segmentation using the Watershed algorithm

imagesegmentation matplotlib numpy opencv python watershed-algorithm

Last synced: 13 May 2026

https://github.com/harsha-yuvaraj/Data-Compression-and-Decompression-Tool

A specialized lossless file compression and decompression tool designed mainly for plain text, including programming files.

data-structures huffman-compression-algorithm lossless-compression-algorithm numpy python tkinter

Last synced: 02 May 2025

https://github.com/elijahondiek/shadowcoach-formtracker

A computer vision-based workout form tracking application that uses AI to monitor exercise form, count repetitions, and provide real-time feedback.

mediapipe-pose numpy opencv-python

Last synced: 18 Apr 2026

https://github.com/darkdk123/handwashing-discovery-analysis

A guided boot camp project analysing the original data used in the discovery of viruses and hand washing by Dr. Ignaz Semmelweis at Vienna General Hospital in the 1840s.

data-analysis data-science data-visualization matplotlib-pyplot numpy pandas plotly-python python seaborn-plots

Last synced: 09 Apr 2026

https://github.com/sakshithbillava/expense-manager

A web-based expense tracking app built with Python and Streamlit, featuring real-time updates, data visualization, user authentication, and MongoDB integration.

authentication data-visualization expense-manager matplotlib mongodb numpy pandas personal-finance python streamlit webapp

Last synced: 09 Apr 2026

https://github.com/nagipragalathan/python_tutorial_for_data-science

This repository is a comprehensive guide for learning data science using Python. It covers various essential libraries and tools commonly used in the field of data science, including Jupyter Notebook, Matplotlib, NumPy, Pandas, Scikit-learn, and PyTorch.

datascience datavisualization deeplearning jupyter jupyter-notebook learning-by-doing learningresources machinelearning matplotlib numpy opensource pandas python python-script python3 pytorch pytorch-implementation scikitlearn tutorial

Last synced: 09 Apr 2026

https://github.com/sridharyadav07/machine-learning-project-bankruptcy-prevention-

The project explores multiple machine learning algorithms and evaluates their performance using various metrics, such as accuracy and confusion matrices. The models tested include Logistic Regression, K-Nearest Neighbors (KNN), Naive Bayes, and Support Vector Machine (SVM). In addition, regularization techniques (L1, L2) are used to avoid overfitting.

data-preprocessing evaluation machine-learning-models matplotlib-pyplot modelbuilding modeldeployment numpy pandas python scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/dmarks84/ind_project_european-soccer-top-points-contributors--kaggle

Independent Project - Kaggle Dataset -- I worked on the European Soccer Dataset, using SQL (SQLite) to read in the data, then wrangling it before running statistical analysis and hypothesis testing on the question of who helps earn the most points for their team.

data-wrangling hypothesis-testing numpy p-values pandas python scipy-stats statistics t-test

Last synced: 09 Apr 2026

https://github.com/mr-chang95/twitter_datawrangling

Twitter Data Wrangling for Udacity's Data Analyst Nanodegree Program

data-visualization data-wrangling dogs matplotlib numpy pandas python twitter

Last synced: 09 Apr 2026

https://github.com/blinksta1ker/co2-prediction

EDA and a supervised ML regression model to predict CO2 emissions from vehicles

jupyter-notebook machine-learning numpy pandas python regression seaborn sklearn supervised-learning

Last synced: 09 Apr 2026

https://github.com/dpgitaccount/project-customer-segmentation-analysis-with-power-bi

This project segments customers based on their purchasing behavior to identify different target groups. It demonstrates skills in data analysis, clustering, and visualization using Power BI.

kmeans-clustering-algorithm linear-regression numpy pandas powerbi python seaborn segmentation

Last synced: 09 Apr 2026

https://github.com/themysterysolver/python_basics

"This repo includes simple Python programs for beginners, covering various concepts, object-oriented programming, and different module usages, to help with understanding and learning Python."

basic-python-syntax collections dsa matplotlib numpy oops-in-python pandas python re scipy

Last synced: 05 Apr 2026

https://github.com/klima7/numpynet

Convolutional Neural Network written from scratch using numpy with API similar to tensorflow.

cnn convolution network nn numpy scratch tensorflow

Last synced: 24 Sep 2025

https://github.com/justinzhang17/comp-5801-queens

Reinforcement learning task of teaching an agent to play the game queens/star-battle

gymnasium numpy pygame python

Last synced: 15 May 2026

https://github.com/0xunkn0wn4m1r/data_engineering_banking_project

🏦 Build a complete data engineering workflow for a banking system, showcasing ETL processes, data transformations, and an interactive financial dashboard.

automation data-analysis data-cleaning data-science feature-engineering fintech-bank flask-api loan-default-prediction machine-learning mlops model-explainability numpy postgresql scikit-learn segmentation shap sql unsupervised-learning

Last synced: 09 Apr 2026

https://github.com/jjwizardmp/gallery-art-problem

Solves the Art Gallery Problem by implementing BFS and Delaunay triangulation

jupyter jupyter-notebook jupyterlab mathplotlib numpy pandas python python-env python3 scipy shapely

Last synced: 09 Apr 2026

https://github.com/rezapace/machine-learning-sound-detection

Welcome to the Machine Learning Detection Sound project! This project harnesses the power of machine learning to analyze car sounds, enabling the detection of vehicles based on their audio signatures.

colab-notebook googlecolab jupyter-notebook machine-learning numpy pathlib python python3 sound source tensorflow voice voice-recognition

Last synced: 09 Apr 2026

https://github.com/sunilvarma-l/zomato-data-analysis-using-python

A Python-based project to analyze and visualize Zomato restaurant data. This project explores various aspects of restaurant data, such as customer reviews, restaurant ratings, locations, and cuisines, with the aim of uncovering insights and trends in the food industry.

googlecollab matplotlib numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/mrandolph95/grad-projects

Projects completed during my ASU grad program for Social Data Science.

beautifulsoup nltk numpy pandas regression-analysis sckiit-learn seaborn selenium

Last synced: 09 Apr 2026

https://github.com/oaerobert/python-for-finance

Implementations of financial mathematics and statistics using the "Python for Finance" textbook. Includes Monte Carlo simulations, stock price prediction using ML strategies, and real-world data visualisations.

matplotlib numpy pandas python

Last synced: 09 Apr 2026

https://github.com/waghraj1699/codeclauseinternship_customer_segmentation_tool

Implements customer segmentation using ML, covering its advantages, preprocessing, K-means clustering, and visualization.

data-preprocessing kmeans-clustering machine-learning numpy pandas visualization

Last synced: 08 May 2026

https://github.com/erikaax08/numerical_methods

This repository hosts the project developed for the Numerical Methods class at the Benemérita Universidad Autónoma de Puebla (BUAP)

django maths matplotlib numerical-methods numpy plotly

Last synced: 19 Aug 2025

https://github.com/gilevatanya/yandex-practicum-projects

Case studies completed in Yandex Practicum courses.

bert bootstrap catboost keras lightgbm matplotlib nltk numpy pandas postgresql python pytorch scikit-learn scipy seaborn sql

Last synced: 06 Jan 2026

https://github.com/ialam085/super_store_sales_analysis_python

The project performs an exploratory data analysis (EDA) of Super Store Sales data from various perspectives, using comprehensive visualizations.

charts matplotlib numpy pandas python seaborn visualization

Last synced: 10 Apr 2026

https://github.com/niteshchawla/aerofit-casestudy

The market research team at AeroFit wants to identify the characteristics of the target audience for each type of treadmill offered by the company, to provide a better recommendation of the treadmills to the new customers. The team decides to investigate whether there are differences across the product with respect to customer characteristics.

matplotlib-pyplot numpy pandas probability python seaborn statistics

Last synced: 09 Apr 2026

https://github.com/lefteris-souflas/modern-slavery-analysis

Jupyter notebook using machine learning techniques to explore the complex drivers of modern slavery. Models from a research paper are replicated and evaluated. Steps also include filling in missing data, training regression models, and analyzing feature importance.

decision-tree feature-importance grid-search-cv imputation jupyter-notebook lasso-regression linear-regression matplotlib mean-absolute-error numpy pandas preprocessing principal-component-analysis python3 random-forest ridge-regression scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/lovesaroha/learning-neural-networks

Various concepts of neural networks applied in python (numpy) to help people get started with AI.

batch-normalization dropout gradient-descent logistic-regression neural-network numpy python regularization

Last synced: 08 May 2026

https://github.com/giacomolat/realestateai-solutions---a-forecasting-model-for-the-housing-market

This project applies regularization techniques (Ridge, Lasso, and Elastic Net) to improve real estate price forecasting. This project focuses on reducing overfitting and increasing the stability of regression models' predictions

cross-validation elasticnet-regression lasso-regression-model machine-learning-algorithms matplotlib matplotlib-pyplot numpy pandas python regularization-methods regularization-to-avoid-overfitting ridge-regression-model seaborn standardization

Last synced: 01 May 2026

https://github.com/mjshubham21/ny_yellow_taxi_python_da_project

A data analysis project on New York Yellow Taxi data (February 2025) using Python and its analytics libraries: NumPy, Matplotlib, Pandas, and Seaborn.

data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 04 May 2026

https://github.com/ksganni/diabetes-prediction-using-machine-learning

🔬 A machine learning project that predicts diabetes likelihood using patient health metrics and multiple classification algorithms.

jupyter-notebook matplotlib numpy pandas python scikitlearn-machine-learning seaborn

Last synced: 28 Apr 2026

https://github.com/cyberoctane29/epa-air-quality-aqi-analysis

This project involved analyzing air quality data from the EPA, focusing on the Air Quality Index (AQI). I used Python data structures like dictionaries and sets to manage and process the data, simulating real-world data analysis to assess pollution levels and their health implications.

data-analysis numpy pandas python statistics

Last synced: 10 Apr 2026

https://github.com/estaheri7/dotawinpredictor

This project aims to predict the outcome of Dota 2 matches based on the heroes picked by both the Radiant and Dire teams

data-science dota2 machine-learning numpy pandas python random-forest sklearn support-vector-machines

Last synced: 09 Mar 2026