An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/timothyjan/intro-machine-learning-classifiers

We will use the scikit-learn library, which is a higher-level machine learning library that will work with NumPy data, and Pandas, a library that makes it easier to manipulate data. We will explore a variety of classification algorithms, and compare their performance on a “real-world” dataset, which will introduce its own set of challenges.

numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/ljadhav25/decision-tree-random-forest-algorithm-data-science-

This repository contains an implementation of decision tree and random forest algorithms from scratch in Python. Decision trees and random forests are popular machine learning algorithms used for classification and regression tasks. The goal of this project is to provide a clear and understandable implementation of these algorithms

data-analysis data-science decision-trees machine-learning-algorithms matplotlib numpy pandas python random-forest-classifier

Last synced: 15 Apr 2026

https://github.com/alexixrugis/perceptronvisualization

Visualization of training and operation of a perceptron written from scratch in numpy

ai machine-learning numpy python

Last synced: 11 Feb 2026

https://github.com/hossein-cod/practicing-deep-learning

with keras data set we load a data base and train our machine for test

deep deep-learning jupyter-notebook keras machine-learning matplotlib numpy py python sklearn tensorflow

Last synced: 25 Feb 2026

https://github.com/ishinzoo/songrecommendation

This project is a machine learning-based system that recommends songs based on the user's detected emotions. The application uses facial expression recognition to determine the user's current emotional state and suggests songs that align with that emotion. This system can be particularly useful for personalized music streaming services, helping use

machine-learning mediapipe numpy opencv os python tenserflow

Last synced: 25 Feb 2026

https://github.com/tomdewildt/google-foobar

Solutions for the Google Foobar problems implemented in Python

foobar google numpy pylint python

Last synced: 22 Feb 2026

https://github.com/manu-karenite/medical-insurance-cost-predictor

Medical Insurance Cost Generator is a Linear Regression based Predictor which is used to estimate and predict the Cost a person has to pay while Buying a Medical Insurance.

kaggle-dataset linear-regression machine-learning matplotlib numpy pandas python3 reactjs scikit-learn

Last synced: 15 Apr 2026

https://github.com/hajaarh/health_state_classifier

Ce projet vise à analyser un dataset contenant divers indicateurs de santé, bien-être et vie quotidienne afin de classifier les individus en trois catégories selon leur état de santé.

numpy pandas python seaborn sklearn

Last synced: 15 Apr 2026

https://github.com/karishmagupta05/airbnb-data-scraping-and-analysis

This project involves web scraping Airbnb listings using Python and BeautifulSoup, followed by data analysis with Pandas. The goal is to extract meaningful insights from the data, clean and preprocess it, and visualize the results using libraries like Matplotlib and Seaborn.

beautifulsoup4 matplotlib numpy pandas python scraping seaborn visualization

Last synced: 15 Apr 2026

https://github.com/hansalemaos/npzigloc

Zig for Numpy

numpy python zig

Last synced: 31 Jan 2026

https://github.com/franciscomartinez45/social-network-analysis

Applied to analyze how misinformation propagates within communities. With the goal of addressing health disparities and improving health literacy particularly in minority populations, the project explores both supervised and unsupervised learning approaches to understand patterns in graph-structured data using a custom Graph Attention Network

matplotlib ml networkx numpy pytorch

Last synced: 09 May 2026

https://github.com/pedrohenriquebr/pegasus

Finance Reporting Automation

excel finance financial numpy pandas python python3 report

Last synced: 15 Apr 2026

https://github.com/bineet-ratna-shakya/data-science-salary-analysis

analyzing a dataset containing salaries of data science professionals from 2020 to 2023.

data-analysis data-science data-visualization jupyter numpy pandas python

Last synced: 01 Feb 2026

https://github.com/kirtipratihar/python_libraries_for_ds

This repository serves as a comprehensive guide to Python programming for Data Science. It covers essential topics like data manipulation, data visualization, machine learning, and statistical analysis using popular libraries such as Pandas, NumPy, Matplotlib, Seaborn, and Scikit-Learn.

artificial-intelligence machine-learning numpy pandas python scikit-learn tensorflow

Last synced: 11 Apr 2026

https://github.com/luizabash/wmap-tt-eda

EDA of the WMAP 5-Year TT Power Spectrum data, the analysis includes summary statistics, data visualization, polynomial regression for model fitting, and hypothesis testing

astrophysics matplotlib numpy pandas scipy

Last synced: 01 Feb 2026

https://github.com/davityak03/english-audio-to-hindi-text-using-pretrained-model

The primary goal of this project is to transcribe English audio and then translate the transcribed text into Hindi.

adam-weight-decay css flask html nlp numpy python tensorflow tokenizer transformers translation

Last synced: 26 Feb 2026

https://github.com/luisall12/fraud-detection-machine-learning-python

Fraud detection system that recognises and signals strange transactions using machine learning

detection flask fruad html numpy pandas python ui

Last synced: 15 Apr 2026

https://github.com/dhruvv1402/x-twitter-bot-python

A Python-powered Twitter bot for content generation, analytics, and automation using Twitter API v2.

huggingface-transformers matplotlib nltk numpy pandas pillow python requests schedule textblob threading tweepy twitter-api wordcloud

Last synced: 09 May 2026

https://github.com/gregoritsch3/project_spring_pendulum

A solution to the chaotic equations of motion of a spring pendulum using Lagrangean mechanics integrated through symbolic and numeric Python libraries (Sympy, Numpy, Scipy, Matplotlib, etc.).

lagrangian-mechanics maptlotlib numpy python scipy spring-pendulum sympy

Last synced: 12 Apr 2026

https://github.com/bursasha/pandas-numpy-matplotlib-cavies-analysis

Complete statistical analysis of cavy lifetime dataset using Python, Pandas, NumPy, Matplotlib, and SciPy to explore, visualize, and infer the impact of bacilli infection on cavy lifetimes 🦫

cavy dataset-analysis distribution-fitting hypothesis-testing jupyter-notebook matplotlib numpy pandas python-analysis scipy statistical-analysis statistical-methods

Last synced: 09 Feb 2026

https://github.com/ryancswallace/npdb

Parallel NumPy-like interface for large n-dimensional arrays on disk.

mapreduce-designpatterns numpy numpy-arrays parallel-computing python

Last synced: 09 Feb 2026

https://github.com/hansalemaos/all_against_all

Each item of a list against all others

list numpy

Last synced: 02 Apr 2026

https://github.com/carterbox/libimage

Provides large (2k) test images as NumPy arrays.

images numpy python

Last synced: 15 Apr 2026

https://github.com/harmanveer-2546/statistics-for-machine-learning

Statistical tools help you clean and organize your data. You can identify outliers, manage missing values, and ensure your data is in a format that the ML algorithms can understand.

inline matplotlib matplotlib-styles numpy pandas probability python seaborn statistics

Last synced: 18 Apr 2026

https://github.com/rishieeee/spam-email-classifier

A simple machine learning project that classifies emails as spam or ham using TF-IDF and a Multinomial Naive Bayes model. The project covers data cleaning, text preprocessing, feature extraction, model training, and evaluation. A great beginner-friendly introduction to NLP and ML workflows.

multinomial-naive-bayes numpy pandas python sckit-learn tf-idf

Last synced: 09 May 2026

https://github.com/baschin1103/neural-network-building-in-python

In this repository is a programm with a pretrained artificial neural network from Building AI. The network predicts the price of a cabin based on it's attributes.

bias building-ai forward-pass functioning identity linear-combinations neural-network numpy python regression relu

Last synced: 16 Apr 2026

https://github.com/pardhuu66/college-id-validator

FastAPI-based offline College ID Validator with Docker support

base64 dnn docker easyocr fastapi mobilenetv2 numpy onnx onnxruntime opencv pillow pydantic python tensorflow uvicorn

Last synced: 11 Apr 2026

https://github.com/foggysmile/ml_labs_itmo

Лабораторные работы по машинному обучению

bert decision-trees jupyter-notebook machine-learning numpy pandas random-forest xgboost

Last synced: 11 Feb 2026

https://github.com/farhad-here/data-visualization-analysis-dva

This is my data analysis project. Users can use this project to clean and preprocessing the date or data visualization. Individuals can impute or ecnode ther dataset.

altair bokeh data-analysis data-analysis-python io matplotlib numpy pandas plotly python sklearn streamlit

Last synced: 11 Apr 2026

https://github.com/yash-rewalia/airbnb_eda_pandas

The goal of the project is to gather information and analyze the detailed information of the different entries in order to provide insights about the host and price of the property in a particular area as per your preference , type of rooms and number of reviews accordingly.

data data-cleaning data-insights data-preprocessing data-visualization matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/adriantomin/heart-disease-prediction

Predicting Heart Disease with ML 🏥🤖 Machine learning model to predict heart disease using clinical data. Includes data analysis, feature engineering, and model training.

machine-learning matplotlib numpy pandas python sklearn

Last synced: 12 Feb 2026

https://github.com/ywatanabe1989/scitex-io

Universal scientific data I/O with plugin registry — save/load 30+ formats with one API. Part of SciTeX.

cli csv data-io hdf5 mcp numpy openscience pandas plugin-registry python research scientific-computing scitex

Last synced: 07 Jun 2026

https://github.com/mnitin-reddy/collaborative-filtering-based-recommendation-system

This project is a Book Recommendation System that uses two main approaches: Popularity-Based and Collaborative Filtering. It recommends top books based on their rating frequency and average ratings, and also provides personalized book suggestions by analyzing user interactions.

collaborative-filtering numpy pandas popularity-based-recommendation python recommendation-system scikit-learn

Last synced: 11 Apr 2026

https://github.com/bharatsharma07/ipl-analysis

This project provides insights into the IPL's historical data, helping to identify trends, player performance, and team strategies.

data-visualization ipl-data-analysis numpy pandas python

Last synced: 16 Apr 2026

https://github.com/shakhthi/mlproject

This project is an end-to-end ML solution that predicts math scores from student attributes. It starts with ingesting and preprocessing data, followed by feature engineering to pinpoint key predictors. A robust model is then trained and fine-tuned before being seamlessly deployed to deliver real-time predictions.

jupyter-notebook matplotlib-pyplot numpy pandas python3 seaborn sklearn

Last synced: 16 Apr 2026

https://github.com/omarsaad21/rfm-clustering-

A full Data science and deployment project focusing on Data analysis and ML ( create a customer segmentation model to recommend the best merchants for each user as targetted offers)

business-solutions data-science eda numpy pandas plotly python sickit-learn streamlit

Last synced: 11 Apr 2026

https://github.com/aliyahscoding/tiny-recommender

Tiny item-item recommender in Python using cosine similarity over a toy user–item matrix. Simple CLI to print top-N recommendations.

collaborative-filtering cosine-similarity numpy pandas python recommender-system

Last synced: 09 May 2026

https://github.com/mattkevan/data-science

Notes from learning various data science topics.

data-science matplotlib numpy python3

Last synced: 09 May 2026

https://github.com/dpgitaccount/project---hospital-readmission-analysis

The goal of this project is to build a predictive model to estimate the likelihood of a hospital readmission based on patient data. By identifying factors that contribute to readmissions, hospitals can optimize care and reduce costs associated with repeated visits.

boxplot confusion-matrix datamodeling exploratory-data-analysis heatmap histplot numpy pandas plotly python random-forest seaborn smote-sampling visualization

Last synced: 16 Apr 2026

https://github.com/maia-bastos/estimativas

Projeto de Probabilidades, Amostragem e Estimações com dados reais (IBGE).

numpy pandas python3 scipy-stats

Last synced: 16 Apr 2026

https://github.com/vgvr0/analisis-de-datos-con-streamlit-numpy-pandas-y-matplotlib

Sistema completo de análisis y visualización de datos cinematográficos que proporciona insights detallados sobre películas, incluyendo análisis financiero, puntuaciones, tendencias temporales y un sistema de recomendación. Desarrollado con Python y Streamlit, ofrece una interfaz interactiva y amigable para explorar datos de películas.

matplotlib numpy pandas plotly plotly-dash recommendation-system streamlit

Last synced: 16 Apr 2026

https://github.com/codersales/machine-learning-classification

machine learning jupyter notebooks | data-science | priority | relevant | significant | green-light | 1 | may-2023-filtered | may-2023-filtered-2 | may-2023-filtered-3 | filtered-4 | frequent

authorized classification current decision-tree ensemble-techniques jupyter machine-learning more-than-100-commits more-than-300-commits numpy pandas python3 ranked repository-5 seaborn sklearn stacking sub-critical supervised-learning workstation

Last synced: 07 Mar 2026

https://github.com/mattia-hulathduwage/wine-quality-analyzer

A machine learning project that analyzes wine quality using clustering, regression, and classification techniques. The model predicts wine quality scores based on chemical properties and determines the most influential features affecting quality.

machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 16 Apr 2026

https://github.com/tonoy30/emotion-mask

A pytorch model for add face mask based on detected emotion.

emotion-detection numpy opencv-python python pytorch

Last synced: 09 May 2026

https://github.com/yousefmostafa125/image-processing

An interactive image processing web app built with Streamlit and OpenCV. The app allows users to upload an image, apply various processing steps, and download the results individually or as a ZIP file.

numpy opencv-python python streamlit

Last synced: 03 May 2026

https://github.com/awaisfraaz/python-libraries

Here’s a repository description for a project where youA repository dedicated to exploring and experimenting with Python’s main libraries, featuring sample code, documentation, and practical examples for learning and reference.

matplotlib numpy pandas python seaborn

Last synced: 15 Feb 2026

https://github.com/mohamed15058/text_classification-digital-egypt-pioneers-initiative-project-

Text_Classification(Digital-Egypt-Pioneers-Initiative-Project )

depi mlops nlp nltk numpy panadas python3 twnsorflow

Last synced: 10 Apr 2026

https://github.com/justin-marian/numpy-mlp

MLP from-scratch implementation only with NumPy, it supports regression and classification tasks.

classification lasso mlp numpy r2-score regression ridge

Last synced: 16 Feb 2026

https://github.com/hussain-7/emotion_detection-master

Human Emotion Analysis using facial expressions in real-time from webcam feed. Based on the dataset from Kaggle's Facial Emotion Recognition Challenge.

keras-tensorflow matplotlib numpy opencv-python tensorflow

Last synced: 08 May 2026

https://github.com/anujdutt9/reinforcement_learning

Reinforcement Learning using Numpy and PyTorch.

numpy python3 pytorch reinforcement-learning

Last synced: 16 Apr 2026

https://github.com/pramodyasahan/health-insurance-cost-prediction

This project focuses on predicting health insurance costs using a polynomial regression model. By employing machine learning techniques in Python, the project aims to accurately estimate insurance costs based on various personal attributes. The model takes into account several features including age, sex, BMI, number of children, smoking status etc

machine-learning matplotlib numpy pandas python3 scikit-learn

Last synced: 16 Apr 2026

https://github.com/hygull/trypandas

A repository containing Python pandas and other data science related sources and other resources used to work with pandas, numpy, matplotlib etc.

matplotlib numpy pandas python3

Last synced: 16 Apr 2026

https://github.com/thiti-dev/attendance-management-system

A quick casual project just only for a demonstration in the discrete mathematics course

numpy opencv pandas pillow tkinter

Last synced: 16 Apr 2026

https://github.com/navindafernando/feature-extraction

Heart Risk Level Predicting Regression Model & Web using Feature Engineering and Data Preprocessing :baby_chick:

categorical-encoding feature-engineering flask handling-outlier html5 joblib label-encoding machine-learning numpy pandas polynomial-features quantile-transformer scaling

Last synced: 10 Apr 2026

https://github.com/fedesgh/parkinson_volatility_spread_on_cedears

Creating a function that returns a graph with the difference between Parkinson's volatility and regular volatility given a certain bounds

numpy pandas pickle seaborn

Last synced: 10 Apr 2026

https://github.com/ashwin331133/liver_disease_detection

This dataset consists of 416 liver patient records and 167 non-liver patient records collected from North East of Andhra Pradesh, India. And The main objective of this project is to use classification algorithms to detect liver patients from healthy individuals.

data-visualization machine-learning numpy pandas python

Last synced: 16 Apr 2026

https://github.com/matx64/birads-classifier

Breast Imaging Report and Data System (BIRADS) with Graphical User Interface.

numpy pillow skimage sklearn tkinter

Last synced: 20 Apr 2026

https://github.com/chandkund/movie-recommender-system

Unlocking the magic of personalized movie recommendations! This project uses the Cosine Similarity Algorithm to suggest movies based on user-selected titles, making movie discovery more enjoyable.

cosine-similarity matplotlib nltk numpy pandas-python seaborn

Last synced: 12 May 2026

https://github.com/anas436/student-performance-analysis

In this project I have constructed a Machine Learning System which will analyis students performance with about their academic records. Note that, this project will work with any students recods which you want to provide.

data-analysis jupyter-notebook matplotlib numpy pandas python3 seaborn

Last synced: 16 Apr 2026

https://github.com/thekartikeyamishra/customer-retention-predictor

The Customer Retention Predictor is a Python-based tool designed to help businesses predict customer churn using historical data. This project is particularly beneficial for small businesses and MSMEs in India, allowing them to identify customers at risk of leaving and take proactive measures to retain them.

joblib machine-learning numpy pandas python scikit-learn tinker

Last synced: 16 Apr 2026

https://github.com/shiqilu77/machinlearning-lol_gaming

a supervised model that predicts the match winning rate in LOL game based on the lineups of both sides. Implemented data processing by multiple python API including sklearn, NumPy, pandas, and by feature encoding, kNN, Synthetic Minority Oversampling Technique, etc

league-of-legends machine-learning numpy supervised-learning

Last synced: 16 Apr 2026

https://github.com/moonmoonsamal/meesho_order_financial_analysis

Generating insights from Meesho sales data (Oct-Nov)

cleaning-dataset eda numpy pandas visualization

Last synced: 16 Apr 2026

https://github.com/ahmedabdalkreem/hotel-reservation

Our task is to classify a Hotel Reservation as either booking canceled (class1) or no canceled(class0) and use more one model to arrive the best model.

bagging decisiontreeclassifier ensemble extra-trees-classifier logistic-regression matplotlib numpy pandas python3 random-forest sklearn-library svc-model

Last synced: 11 Apr 2026

https://github.com/kwasi-dankwa/game-sales

This project investigates why Nintendo’s global market share fluctuated between 2010–2016, identifies top-performing genres and regions, and builds predictive models to forecast sales and regional performance. It culminates in specific, actionable recommendations to grow market share.

business-intelligence gaming juptyernotebook-language matplotlib numpy pandas postgresql python3

Last synced: 09 Apr 2026

https://github.com/omogbolahan94/nobel-prize-winner-analysis

Data exploration, cleaning, visualization and presentation

markdown matplotlib numpy pandas plotly seaborn

Last synced: 09 May 2026

https://github.com/tfiers/unitlib

📐 Physical units for NumPy arrays \ ⏱ Fast • Simple • High voltage

numpy python scientific-computing unitlib units

Last synced: 13 May 2026

https://github.com/marcow2812/zuse-projekt

Python-basierte Software zur Projektion von 3D-Objekten auf einen Würfel

augmented-reality numpy opencv-contrib python

Last synced: 16 Apr 2026

https://github.com/badranalyst/titanic-survival-prediction-full-data-science-project-classification

This project predicts Titanic survivors using classification models. It includes data cleaning, pre-processing, exploratory data analysis (EDA), categorical feature conversion, model building, and evaluation. Python libraries like Pandas, NumPy, Matplotlib, and Seaborn are used to analyze and predict survival outcomes.

classification data-analysis data-science eda exploratory-data-analysis machine-learning matplo matplotlib-pyplot ml model numpy pandas predictive-modeling python seaborn

Last synced: 06 May 2026

https://github.com/dr-rompecabezas/page-view-time-series-visualizer

A data science project built as part of the freeCodeCamp curriculum.

matplotlib numpy pandas python seaborn

Last synced: 16 Apr 2026

https://github.com/jsnevt/knn_neighborhood

ML - classificação por vizinhança

kneighborsclassifier matplotlib numpy pandas seaborn

Last synced: 13 May 2026

https://github.com/jaweria-b/eda-basketball

The Streamlit app analyzes NBA player stats with user-selected filters, offering data download and intercorrelation heatmap.

matplotlib numpy python streamlit

Last synced: 10 Apr 2026

https://github.com/starlitdreams/lunar-landing

This project implements a DQN agent using PyTorch to solve the LunarLander-v2 environment from OpenAI Gym. The agent learns to control the lunar lander using experience replay and a target network, aiming to maximize rewards by landing smoothly. Uses CUDA for computation.

artificial-intelligence cuda deep-learning gymnasium neural-network neural-networks numpy nvidia-gpu python python3 torch

Last synced: 09 May 2026

https://github.com/flexthink/matricize

A convenience library to convert between pure Python objects and their vectorized representations

data machine-learning numpy python

Last synced: 09 May 2026

https://github.com/gautam-cs/data_science

In this project, I was trying to get most of the relevent information of the data using library like numpy, pandas. I have worked on data like aadhar data, Olympics data

aadhar-data csv data-science json numpy olympics-data pandas python35

Last synced: 17 Apr 2026

https://github.com/syedzaheerabbas/risk-analytics-with-python

This project focuses on developing a basic understanding of risk analytics in banking and financial services and understand how data is used to minimize the risk of losing money while lending to customers.

eda hypothesis-testing numpy pandas python risk-analysis seaborn

Last synced: 17 Apr 2026

https://github.com/yanxue06/housing-price-predictor

Python-based California housing price predictor

jupyter numpy pandas python seaborn

Last synced: 06 Mar 2026