An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/bzubs/mlzero

Implementation of widely used ML algorithms in vanilla python

machine-learning machine-learning-algorithms machinelearning numpy python statistics

Last synced: 03 Mar 2026

https://github.com/dwija12903/ai-lab

A collection of practical implementations from my AI Labs course

keras numpy sckit-learn tensorflow

Last synced: 19 Jan 2026

https://github.com/octavioduarte/datascience

A repository with study notes on Data Science concepts, NumPy, pandas, and an introduction to statistics.

data-science jupyter-notebook numpy pandas

Last synced: 08 May 2026

https://github.com/theblueberrycompendium/keras-movie-classification

This is an basic application that uses Keras

ai keras-tensorflow numpy python3 simple-project

Last synced: 13 Apr 2026

https://github.com/thekartikeyamishra/customer-retention-predictor

The Customer Retention Predictor is a Python-based tool designed to help businesses predict customer churn using historical data. This project is particularly beneficial for small businesses and MSMEs in India, allowing them to identify customers at risk of leaving and take proactive measures to retain them.

joblib machine-learning numpy pandas python scikit-learn tinker

Last synced: 16 Apr 2026

https://github.com/dina-hosny/investigate-a-dataset-project

Investigate a Dataset project - Udacity Data Analysis Professional Track. In this project, I used the Python libraries NumPy, Pandas and Matplotlib to analyze a dataset and communicate my findings about it.

csv data-analysis jupyter-notebook matplotlib numpy pandas python

Last synced: 16 Apr 2026

https://github.com/iankitnegi/python_projects

Data Analyst Toolkit: A comprehensive collection of Python scripts and notebooks designed for data analysis tasks. Features data cleaning, visualization, statistical analysis, and machine learning models. Ideal for analysts seeking efficient, reproducible workflows.

matplotlib numpy pandas python seaborn

Last synced: 20 Apr 2026

https://github.com/archish27/pythontutorial

Python Programming Tutorial for new geeks who want to learn python from scratch to deal with various applications

matplotlib numpy pandas pygame python python-2 python-3 scikit-learn soup

Last synced: 01 Apr 2026

https://github.com/moonmoonsamal/meesho_order_financial_analysis

Generating insights from Meesho sales data (Oct-Nov)

cleaning-dataset eda numpy pandas visualization

Last synced: 16 Apr 2026

https://github.com/niteshchawla/nc-walmart-casestudy

The Management team at Walmart Inc. wants to analyze the customer purchase behavior (specifically, purchase amount) against the customer’s gender and the various other factors to help the business make better decisions. They want to understand if the spending habits differ between male and female customers:.

central-limit-theorem confidence-intervals matplotlib-pyplot numpy pandas python scipy-stats seaborn stats

Last synced: 08 May 2026

https://github.com/mr-chang95/udacity-starbucks-challenge

Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.

data data-science data-visualization numpy pandas sklearn

Last synced: 14 Apr 2026

https://github.com/sultanazhari/game-trend

Ice as a online store Video Games want to know a new trend of video game to help them prepare a video games which is more profitable to sell. As a data Analyst we're giving a trend of video game by genre, platform, console and the others clasification.

matplotlib-pyplot numpy pandas python3 scipy-stats video-game

Last synced: 11 Apr 2026

https://github.com/rohansoni45/ipl-win-predictor

A machine learning model that predicts the outcome of IPL matches based on historical data and player performance. Built using Python and libraries like Pandas, NumPy, and scikit-learn. This project demonstrates the use of classification algorithms to predict match outcomes.

classification data-science ipl machine-learning model-evaluation numpy pandas python sklearn sports-analytics

Last synced: 08 May 2026

https://github.com/leftcoastnerdgirl/supervised_learning

This project demonstrates supervised machine learning using scikit-learn.

classification-reports confusion-matrix jupyter-notebook numpy pandas-python pathlib scikit-learn sklearn

Last synced: 16 Apr 2026

https://github.com/rahulchouhan1/spotify-most-popular-songs-data-analysis

🎵 Spotify Songs Analysis using Pandas

matplotlib numpy pandas

Last synced: 16 Apr 2026

https://github.com/waghraj1699/pokemon-predicton

Predict which type of pokemon it is using CNN

data-science deep-learning keras numpy python tensorflow utils

Last synced: 05 Apr 2026

https://github.com/marcow2812/zuse-projekt

Python-basierte Software zur Projektion von 3D-Objekten auf einen Würfel

augmented-reality numpy opencv-contrib python

Last synced: 16 Apr 2026

https://github.com/ricardorobledo/paymentcardfrauddetection2025

Comparative analysis of probabilistic classification models for credit card fraud detection, focusing on model calibration and threshold optimization in highly imbalanced datasets.

imbalanced-learn matplotlib numpy pandas python3 scikit-learn search

Last synced: 14 Apr 2026

https://github.com/himanshugoyal77/movie-recommendation

Cosine similarity based Recommendation Engine

nltk numpy pandas requests-library-python tmdb-api vectorization

Last synced: 08 May 2026

https://github.com/bishopce16/school_district_analysis

The school board requested an analysis on the various performance metrics for the school district.

data-analysis jupyter-notebook numpy pandas python visual-studio-code

Last synced: 16 Apr 2026

https://github.com/basith-ahmed/mtrp-butcher

A Machine Learning model, utilizing a range of technical indicators to accurately forecast forthcoming trend reversals with a high degree of confidence. This model is also complemented by an interactive web interface.

exponential-moving-average ipynb logistic-regression macd machine-learning matplotlib numpy pandas python rsi stochastic-rsi streamlit technical-analysis yfinance

Last synced: 04 Mar 2026

https://github.com/charancholaveti/imdb-sentimentanalysis

IMDB Movie Review Sentiment Analysis-It will give the sentiment of the review whether it is Positive or Negative

deeplearning keras numpy rnn-tensorflow tensorflow

Last synced: 14 Apr 2026

https://github.com/antonio-f/simple-k-nearest-neighbors

Simple K-Nearest Neighbors using partial sort

101 k-nearest-neighbors knn numpy partial-sort python tutorial

Last synced: 08 May 2026

https://github.com/saifalibaig/covid-19-death-rate-analysis-using-python

Analysis of Covid-19 data along with the world happiness report to identify if there is any relationship between death rate and happiness rate of countries all over the world.

data-analysis data-visualization numpy pandas python3 sns visualization

Last synced: 03 May 2026

https://github.com/omcodewizard/nasa-battery-analysis

This repository contains code for analyzing battery data from NASA's battery testing dataset. The analysis involves processing battery impedance, electrolyte resistance, and charge transfer resistance across charge/discharge cycles to track the aging and performance of various batteries.

battery-impedence nasa-battery-dataset numpy pandas plotly python

Last synced: 09 May 2026

https://github.com/sundanc/besthousefinder

A Streamlit web application that helps users find the best house based on their preferences.

numpy pandas prediction python streamlit

Last synced: 17 Apr 2026

https://github.com/kunalkumar2001/data-analyst-python-and-its-library

Python (Basics to Advance) and Its Library for Data Analyst

dataanalyst matplotlib numpy pandas python seaborn

Last synced: 17 Apr 2026

https://github.com/egrizq/bike_analysis

Exploring the bike sharing system to find the patterns behind user behavior by weather, season, hour, and months.

data-science matplot numpy python seaborn

Last synced: 08 May 2026

https://github.com/hansalemaos/npchessimage

Creates a chessboard pattern as an image.

chessboard cv2 image numpy

Last synced: 05 Mar 2026

https://github.com/miozilla/treebasedmodel

treebasedmodel :evergreen_tree::airplane::dancer: : Invistico Airline | Tree-based Modeling, Machine Learning (ML), Predictive, Decision Making, GridSearchCV, sklearn |

accuracy decisiontreeclassifier exemplar experience f1-score flight hyperparameter logistic-regression matplotlib model numpy pandas plot-tree precision pyplot recall seaborn selection train-test-split tuning

Last synced: 03 Jun 2026

https://github.com/sweta2501/ml

Consist of ML projects based on Python along with DataSheets

machine-learning matplotlib-pyplot numpy pandas python seaborn sklearn sklearn-model

Last synced: 14 Apr 2026

https://github.com/syedzaheerabbas/risk-analytics-with-python

This project focuses on developing a basic understanding of risk analytics in banking and financial services and understand how data is used to minimize the risk of losing money while lending to customers.

eda hypothesis-testing numpy pandas python risk-analysis seaborn

Last synced: 17 Apr 2026

https://github.com/shashwat9kumar/us-accidents-data-analysis

Analysis of the US accidents using the US-Accidents dataset (4.2 million entries) from Kaggle

accidents accidents-analysis data-analysis data-analytics data-visualisation data-visualization matplotlib numpy pandas python

Last synced: 17 Apr 2026

https://github.com/thekartikeyamishra/data-preprocessor

A Google Colab module for interactive data preprocessing. Handles missing values, categorical encoding (One-Hot, Label), and numerical scaling (Standard, MinMax). Outputs a cleaned dataset

ipywidgets numpy pandas python scikit-learn

Last synced: 08 May 2026

https://github.com/vaishnavis03/finlatics_ml_program

This repository contains the .ipynb files for 3 datasets, along with a PPT for each. The datasets included are Facebook Marketplace Data, Sales Prediction Data, and Wine Quality data.

correlation data-analysis data-science data-visualization knn linear-regression machine-learning matplotlib numpy pandas random-forest-classifier scikit-learn

Last synced: 17 Apr 2026

https://github.com/ricobuilds/ml-roadmap

Opinionated roadmap to machine learning in 2023

conda huggingface machine-learning matplotlib numpy pandas python pytorch

Last synced: 06 Mar 2026

https://github.com/huzaifa12466/neuralnetscratch

A simple neural network implemented from scratch in Python using NumPy, trained on the Moons dataset

artificial-intelligence artificial-neural-networks backpropagation deep-learning forwardpropagation machine-learning matplotlib neural-network neural-networks numpy

Last synced: 23 Jun 2026

https://github.com/gauravtrivedi967/lane_detection

Lane detection and tracking are the advanced key features of the advanced driver assistance system. Lane detection is the process of detecting white lines on the roads. In this we I have used prebuild models for the development of the code

ipython-notebook numpy python thresholding

Last synced: 06 Mar 2026

https://github.com/kodemartin/np-pd-portfolio

Portfolio of numpy, pandas, and matplotlib solutions

matplotlib numpy pandas

Last synced: 17 Apr 2026

https://github.com/radinshahdaei/ce40215-nc

Theoretical and practical assignments for "Numerical Computation".

jupyter-notebook numpy python sympy

Last synced: 17 Apr 2026

https://github.com/neerajcodes888/a-novel-used-car-price-prediction-model-based-on-lindenoise

Welcome to the LinDenoise Repository! LinDenoise offers a smart solution for cleaning noisy data in regression tasks. Integrated seamlessly within the widely-used scikit-learn framework, it effortlessly enhances data quality while improving predictive accuracy

car-price-prediction deep-learning ipynb-notebook machine-learning numpy pandas python3 visualization

Last synced: 06 Mar 2026

https://github.com/psavarmattas/speechtotext

we shall build a very simple speech recognition system that takes our voice as input and produces the corresponding text by hearing the input.

facebook-api ipython librosa machine-learning numpy python pytorch soundfile transformers

Last synced: 02 Apr 2026

https://github.com/esraamhmd/intelligent-skin-cancer-image-based-detection-models

This project combines traditional machine learning approaches with advanced deep learning techniques to assist healthcare professionals in early diagnosis and improve patient outcomes.

ai deep-learning ham10000 isic kaggle machine-learning matplotlib numpy pandas python3 pytorch seaborn skin-cancer skin-detection sklearn time torchvision tqdm

Last synced: 02 Apr 2026

https://github.com/amirmohammadgholampour/mall-customer-segmentation

Project for segmenting customers in a shopping mall using the Clustering algorithm.

numpy pandas python scikit-learn

Last synced: 02 Apr 2026

https://github.com/satyas567/weatherdataanalysis

Comprehensive Weather Data Analysis with Python: Explore trends, visualize patterns, detect outliers, and predict temperature using humidity and wind speed

jupyter-notebook linear-regression matplotlib numpy pandas python scikit-learn seaborn

Last synced: 02 Apr 2026

https://github.com/bharath-tars/deeptumor2.0

DeepTumor2.0 is a project aimed at developing a flask webapp with robust front end and highly trained model using tensorflow in the backend with several features like with automatic html template reporting. **NOTE: DeepTumor2.0 has been moved from Azure to On-Render to save costs**

css datetime docs flask github html javascript keras numpy onrender os pandas python sheety smtp tensorflow

Last synced: 21 Jan 2026

https://github.com/davityak03/basics-of-python-

In this we have done multiple programs in python so as to develop the basic skill sets in python

array dictionary lists matrix numpy pandas programming python sets

Last synced: 02 Apr 2026

https://github.com/saylie/cat-dog-classifier-

The Cat-Dog Classifier is a deep learning-based image classification project that can differentiate between images of cats and dogs. It uses a Convolutional Neural Network (CNN) model trained on labeled datasets to predict whether a given image belongs to a cat or a dog.

ai javascript keras numpy opencv python react tensorflow

Last synced: 14 Apr 2026

https://github.com/niteshchawla/clustering-ml

Analyzing the vast data of learners can uncover patterns in their professional backgrounds and preferences. Allowing Scaler to make tailored content recommendations and provide specialized mentorship.

cluster-analysis clustering hierarchical-clustering k-means-clustering machine-learning numpy pca-analysis visualisation

Last synced: 17 Apr 2026

https://github.com/prajapatsur/data-analytics-1

Wanted to revise all the basic concepts of the data cleaning, transformation and analysis, so worked on this project. Took a little bit help of AI tool to implement some fresh concepts.

matplotlib numpy pandas sklearn

Last synced: 17 Apr 2026

https://github.com/ledsouza/curso_de_estatistica_parte_4

Projeto de estatística para estudo de correlação e regressão

data-science data-visualization numpy pandas seaborn statsmodels vitrinedev

Last synced: 17 Apr 2026

https://github.com/renan-siqueira/reinforcement-learning-cart-pole

This repository provides implementations of a Q-learning agent to balance a pole on a cart.

artificial-intelligence gym-environment machine-learning numpy openai-gym pygame python qlearning-algorithm reinforcement-learning

Last synced: 17 Apr 2026

https://github.com/prashver/end-to-end-model-deployment-on-aws

Student Performance Analysis with Machine Learning analyzes factors impacting student outcomes using a robust machine learning pipeline. Achieving an impressive R2 score, it predicts student performance effectively. With extensive data preprocessing and deployment on AWS Elastic Beanstalk, it ensures scalability and high availability.

amazon-web-services aws-elastic-beanstalk end-to-end-deployment flask machine-learning-algorithms matplotlib numpy pandas scikit-learn seaborn

Last synced: 02 Apr 2026

https://github.com/felixamaladhas/amazon-reviews-sentiment-analysis

This is a sentiment analysis project that classifies Amazon product reviews as positive or negative using machine learning techniques.

matplotlib numpy pandas python scikit-learn

Last synced: 02 Apr 2026

https://github.com/harmanveer2546/nasa-asteroid-classification

Classifying whether an asteroid is hazardous or not.

eda matplotlib numpy pandas python seaborn visualization xbgoost

Last synced: 02 Apr 2026

https://github.com/0290192029/apartment-price-predictor

Python-проект по прогнозированию стоимости аренды квартир с помощью линейной регрессии. Практическая работа по теме: "Основы машинного обучения" дисциплины "МДК 13.01: Основы применения методов искусственного интеллекта в программировании".

apartment-price-prediction apartments-for-rent api correios-api data-analysis feature-engineering feature-enginering linear-regression linear-regression-models mlops numpy prediction-model r seaborn

Last synced: 08 May 2026

https://github.com/cosmicsid004/air_draw

Digital drawing....

handtracking mediapipe numpy opencv

Last synced: 30 Apr 2026

https://github.com/leandro-bertoluzzi/parchis-probabilities

Small weekend project to play around with numpy and help me taking decissions while playing parchís

matplotlib numpy python

Last synced: 08 May 2026

https://github.com/tharushi11/titanic-data-analysis

This repository contains a simple prediction model and survival predictions based on the Titanic dataset from Kaggle, along with detailed visualizations using Power BI.

dax machine-learning numpy pandas powerbi sklearn titanic-survival-prediction visualization

Last synced: 17 Apr 2026

https://github.com/patex1987/ghcn-temperature-dataset

Analysis of GHCN (Global Historical Climatology Network) temperature dataset using Numpy

climate-data jupyter-notebook numpy python temperature-data

Last synced: 10 May 2026

https://github.com/christiemj/clock-drift-simulation

A Monte Carlo simulation in Python to model vacuum-induced frequency drifts in atomic clocks, with code and visuals (experimental setup, simulated vs. real-world comparison, and drift graph), as described in "Probing Vacuum-Induced Clock Drifts via Quantum Metrology: A Testable Hypothesis" (DOI: 10.5281/zenodo.15163879). Licensed under GPL 3.0.

allan-variance atomic-clocks cesium-ion clock-drift matplotlib monte-carlo-simulation numpy open-science pandas physics precision-metrology python quantum-metrology research-software scipy simulation vacuum-fluctuations vacuum-noise

Last synced: 17 Apr 2026

https://github.com/filnow/tinygimp

project for image processing class

gimp numpy opencv pysimplegui python3

Last synced: 08 May 2026

https://github.com/nikhilsree5/aerofitcasestudy

Customer Profiling and Market Segmentation for AeroFit Treadmills: A Data-Driven Approach

customerprofile eda numpy pandas python visualization

Last synced: 17 Apr 2026

https://github.com/theakashshukla/diabetes-prediction

Diabetes Prediction using futuristic tool

ai jupiter-notebook ml numpy pandas sklearn

Last synced: 17 Apr 2026

https://github.com/khushi130404/binning-binarization

This project demonstrates binning and binarization on the Titanic dataset, comparing results with and without numeric encoding. Visualizations highlight the transformations and their impact on survival analysis.

binarization binarizer binning columntransformer decision-trees matplotlib numpy pandas

Last synced: 17 Apr 2026

https://github.com/tashi-2004/deep-learning-grid-world-q-learning

Deep Learning Grid World Q-Learning . Implement Q-learning in a 5x5 grid where an agent navigates obstacles and rewards. Train the agent with varying learning rates, visualize its progress, and see Q-values as heatmaps. Run the script to start training and view results. Contributions are welcome!

agent-based-modeling artificial-intelligence deep-learning deep-q-learning exploitation exploration machine-learning machine-learning-algorithms matplotlib-pyplot numpy python q-learning q-learning-algorithm reinforcement-learning reinforcement-learning-algorithms state-value-function training

Last synced: 17 Apr 2026

https://github.com/omogbolahan94/movies-budget-revenue-analysis

Data exploration, data cleaning, data analysis and predictive analysis

dataframe linear-regression matplotlib numpy pandas seaborn statistical-analysis

Last synced: 08 May 2026

https://github.com/dkm/stlsplit

Tool to split STL in several objects

numpy python stl

Last synced: 17 Jun 2026

https://github.com/kaushik-puttaswamy/exploratory-data-analysis-using-python

This project applies Exploratory Data Analysis (EDA) using visualizations like box plots, scatter plots, and correlation matrices. It helps identify patterns, select ideal functions via least squares error, and map test data while evaluating deviations.

bokeh expolatory-data-analysis matplotlib numpy pandas python seaborn sqlalchemy

Last synced: 17 Apr 2026

https://github.com/allanreda/automated-k-means-clustering-engine

An interactive K-Means clustering tool built with Flask and Scikit-Learn, supporting Excel file uploads, cluster analysis, and data export, deployed on Google Cloud Run via Docker with CI/CD integration.

cicd css data-visualization deployment docker flask google-cloud-run html javascript k-means-clustering machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 19 Jan 2026

https://github.com/shhiivvaam/regression

Linear Regression Model

linear-regression numpy python

Last synced: 10 May 2026

https://github.com/venkat-a/python-eda-exploratory-data-analysis-

This project focuses on conducting exploratory data analysis (EDA) using Python. Utilizing powerful libraries such as Pandas for data manipulation and Plotly for interactive visualization, the project aims to uncover insights from datasets through comprehensive analysis and visualization techniques.

descriptive-statistics matplotlib numpy pandas seaborn statistical-analysis visualization

Last synced: 07 May 2026