An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/muhdhammad/machine-learning

Crafted for hands-on learning and implementation of ML with scikit-learn

data-science jupyter-notebook machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 07 Apr 2026

https://github.com/joeylemon/python-kmeans

an implementation of the K-means clustering algorithm in Python

k-means-clustering matplotlib ml numpy python

Last synced: 21 May 2026

https://github.com/shweta189/sms-spam-detection

Detecting Spam SMS's messages using machine learning algorithms.Utilizing NLP techniques such as tf-idf vectorization and machine learning algorithms like naive bayes and logistic regression for classifications. Integrated with Streamlit to create an interactive web app where users can input a message and instantly check if it's spam or not.

eda machine-learning-models nltk numpy pandas plotly python seaborn streamlit

Last synced: 07 Apr 2026

https://github.com/xmcp/midify

Experimental WAVE to MIDI converter

midi numpy waveform

Last synced: 18 Apr 2026

https://github.com/manshreet27/mrs

This Movie Recommendation System is a web-based application built using Python and Streamlit, designed to provide movie recommendations based on user preferences. It utilizes TMDb API for fetching real-time movie details and Kaggle's TMDB 5000 Movies dataset for content-based filtering.

numpy pandas python scikit-learn streamlit tmdb-5000-movies-dataset-from-kaggle tmdb-api-for-fetching-real-time-movie-data

Last synced: 07 Apr 2026

https://github.com/khaja-shaik-21/heart-disease-prediction-system

This form allows users to enter key health details like age, blood pressure, cholesterol levels, and exercise results to predict the likelihood of heart disease. The data is submitted to the backend for processing, where a machine learning model provides a prediction. The form is styled for a clean and responsive user experience.

css3 flask-application git html5 logestic-regression numpy pandas python3 scikit-learn

Last synced: 12 Apr 2026

https://github.com/shekharkram/project

A collection of data analytics projects showcasing skills in data cleaning, exploration, visualization, and basic SQL queries. Designed to demonstrate entry-level data analyst competencies using real-world datasets and tools.

datacleaning excel jupyter-notebook mysql numpy pandas postgresql python sql

Last synced: 07 Apr 2026

https://github.com/vivekmodak3/multiple-disease-prediction-system--streamlit-webapp

This repository contains the codebase for "Multiple Disease Prediction Streamlit App". The training notebooks & the datasets are also provided in the respective folders.

machine-learning-algorithms numpy pandas python3 scikitlearn-machine-learning streamlit

Last synced: 07 Apr 2026

https://github.com/chintanboghara/rocket-simulation

A comprehensive web-based orbital mechanics simulator with advanced mission planning, real-time tracking, and educational features.

docker flask html javascript numpy plotly python

Last synced: 11 Apr 2026

https://github.com/vyjayanthipolapragada/human_detection_hog

Detect humans in real-time, image or video and count the detected people using OpenCV and HOG Descriptor

computer-vision easygui hog-descriptor human-detection image-processing imutils numpy opencv os python support-vector-machine

Last synced: 06 May 2026

https://github.com/chernyakid/game-success-analysis

Выявление закономерностей успешности компьютерных игр

jupyter numpy pandas python scipy seaborn

Last synced: 07 Apr 2026

https://github.com/nachtfeuer/covid19

Python script(s) for visualizing corona data

csv json matplotlib numpy pandas python requests tkinter

Last synced: 05 May 2026

https://github.com/praju-1/data_science_projects

It contains the necessary code, datasets, and documentation to understand, replicate, and build upon the project's findings and methodologies.

data-science datasets deep-learning exploratory-data-analysis machine-learning matplotlib numpy pandas predictive-analytics python seaborn sklearn statistics visualization

Last synced: 08 Apr 2026

https://github.com/rpanackal/rl-msc-pro

A novel approach to solve Contextual Reinforcement Learning

dm-control gym numpy ppo pydantic pytorch reinfrocement-learning sac scipy

Last synced: 02 Jul 2026

https://github.com/edegan-furb/conwaygameoflife

This project is a simulation of Conway's Game of Life using Python with the Pygame library.

conways-game-of-life numpy pygame python

Last synced: 29 Apr 2026

https://github.com/lorenzorottigni/ml-yelp-reviews

Machine Learning python bootcamp: natural language processing on Yelp reviews

machine-learning nlp numpy pandas python scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/singhvishal003/email-spam-detector-model

Email Spam Detector model to detect spam messages.

numpy pandas sklearn

Last synced: 08 Apr 2026

https://github.com/naveen88112/final_education

Student Performance Prediction This project examines the student performance data, pre-processes the features, and implements machine learning methods (Random Forest) for the forecasting of final grades. The data is subjected to exploratory data analysis (EDA) and feature engineering prior to model training and assessment.

data-preprocessing exploratory-data-analysis machine-learning model-evaluation numpy pandas python

Last synced: 11 Apr 2026

https://github.com/abhishekbagdiya01/movies-recommendation-system

This repository contains the code for a movie recommendation system built using Jupyter Notebook.

aiml jupyter-notebook numpy pandas python scikit-learn

Last synced: 08 Apr 2026

https://github.com/ricomeetsworld/ml-roadmap

Opinionated roadmap to machine learning in 2023

conda huggingface machine-learning matplotlib numpy pandas python pytorch

Last synced: 06 Apr 2025

https://github.com/marknature/opencv-bootcamp

If you are looking to take your first steps towards learning Computer Vision and AI using OpenCV, this is the best OpenCV course to jumpstart your career.

anaconda computer-vision cv2-library ffmpeg jupyter-notebook matplotlib numpy opencv os urllib zipfile

Last synced: 02 Mar 2025

https://github.com/gauravpatil04/ml

This repo is on "Basics skills required for Machine Learning"

matplotlib numpy pandas python seaborn

Last synced: 11 Jul 2025

https://github.com/shankhadweep/diabetes-prediction-systemv3

This project demonstrates a machine learning solution for predicting diabetes based on user-provided health data. The application uses Streamlit for an interactive web interface and advanced interpretability tools like SHAP and permutation importance to explain model predictions.

jupyter-notebook machine-learning matplotlib numpy pandas plotly randomforestclassifier sckit-learn seaborn streamlit-webapp

Last synced: 01 May 2026

https://github.com/fjohnson87/planning-and-feasability

Data-driven feasibility analysis using pandas, seaborn, and FAO food supply data

matplotlib numpy pandas

Last synced: 04 May 2026

https://github.com/atharvkadammm/suicide-prediction-system

A machine learning project predicting suicide risk based on multiple socio-economic and environmental factors using data mining techniques.

csv data-analysis data-science data-visualization datamining exploratory-data-analysis feature-engineering machine-learnin matplotlib mental-health numpy pandas riskassesment seaborn sklearn suicide-prediction supervised-

Last synced: 01 Jul 2025

https://github.com/atharvkadammm/calmlytic

An end-to-end machine learning project that predicts anxiety severity using classification models (Naive Bayes, Decision Tree, SVM, Logistic Regression, XGBoost), based on lifestyle, health, and behavioral features.

anxiety-prediction classification csv data-analysis data-preprocessing-and-cleaning data-science data-visualization ensemble-learning logistic-regression machine-learning-algorithms matplotlib mental-health numpy pandas python sci-kit-learn seaborn supervised-learning svm xgboost

Last synced: 21 Jun 2025

https://github.com/ayushhhh123/diwali_sales_analysis

This project provides a comprehensive analysis of Diwali sales data using Python. The primary goal is to derive business insights from customer purchasing behavior during the Diwali festival season. This analysis can help improve marketing strategies, customer targeting, and overall sales performance.

matplotlib numpy pandas python seaborn

Last synced: 07 May 2026

https://github.com/shrutiii1109/diwali-sales-analysis-through-python

Data analysis project on Diwali sales using Python (Pandas, NumPy, Matplotlib, Seaborn). The goal is to analyze customer behavior, identify sales trends, and provide insights to improve marketing and business strategies.

data-analysis jupyer-notebook matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/sahilmaurya28/youtube-data-analysis

YouTube Data Analysis using Python — uncovering trends, engagement patterns, and correlations between likes, comments, views, and categories to understand what drives content success.

analysis data-analysis data-visualization matplotlib-pyplot numpy pandas portfolio-project python seaborn youtube

Last synced: 13 Apr 2026

https://github.com/om-kanabar/sciencefair2025

This is my project for Chicago Public School's student science fair 2025.

chicago-public-schools matplotlib-pyplot neural-networks numpy python science-fair tensorflow

Last synced: 03 Nov 2025

https://github.com/trivediayush/python-bootcamp

This repository contains code, notes, and examples from my journey learning Python and its essential libraries.

flask matplotlib numpy pandas python python3 pythonbootcamp

Last synced: 01 Jul 2025

https://github.com/pedronjorge/average-face-python

Generate an average face from a given set of images by detecting facial landmarks, aligning features, and blending the images using Delaunay Triangulation. Libraries used: Dlib, OpenCV, NumPy

dlib image-generation jupyter-notebook numpy opencv python3

Last synced: 21 Jun 2025

https://github.com/abdur-rahim-mygit/ml-movie-recommendation-system

This project is a Machine Learning-based Movie Recommendation System developed during my internship at YBI Foundation. The system uses collaborative filtering techniques to recommend movies based on user preferences and behavior. It was implemented on Google Colab using Python and popular machine learning libraries.

google-colab machinelearning movie-recomendation-system numpy pandas pyhton

Last synced: 02 Feb 2026

https://github.com/shibbir-ahmad24/a-data-driven-approach-to-food-security-and-supermarket-accessibility

A Data-Driven Approach to Food Security and Supermarket Accessibility

data-analysis matplotlib numpy pandas python3 seaborn

Last synced: 05 Apr 2025

https://github.com/nishthasharma-22/binary-black-hole-merger-gravitational-waves-simulation

Repository of all astronomy related projects, including: Gravitational Waves graph from binary black hole merger

astrophysics blackhole-merger gravity-simulator matplotlib numpy python scipy

Last synced: 05 May 2026

https://github.com/amanda-ucc/market-tracker

Automated portfolio creator that generates a portoflio which mimics a bench mark index by minimizing the tracking error.

beta covariance numpy pandas scipy sharpe-ratio tracking-error variance yfinance

Last synced: 11 Apr 2026

https://github.com/nik-kusanagi/jupyter

Treinamento Para Análise de Dados.

big-data jupyter jupyter-notebook matplotlib numpy pandas python python3

Last synced: 30 Apr 2026

https://github.com/kahngjoonkoh/randomshapegenerator

A program that will generate images with random shapes and background colours. Can be customized and generated in bulk.

generative-art numpy opencv python threading tkinter

Last synced: 11 Apr 2026

https://github.com/hawkharsh1/house-price-pridiction-model-using-ann

A deep learning-based regression model built using Artificial Neural Networks (ANN) in PyTorch to predict house prices from structured data. This project demonstrates the application of machine learning and deep learning techniques for solving real-world problems in the housing domain.

artificial-neural-networks deep-neural-networks machine-learning numpy pandas python3 pytorch scikit-learn

Last synced: 08 Apr 2026

https://github.com/ijproject/calculate-absorption-rate

大気を構成する分子ごとの赤外線吸収割合を計算するプログラム。

numpy python

Last synced: 11 May 2026

https://github.com/sumaiyyaf/spotify-data-analysis

This project explores Spotify music data, using Python to clean, analyze, and visualize the dataset. It examines the relationship between audio features and sentiment in song lyrics, uncovering patterns and trends to provide valuable insights into music preferences.

eda numpy pandas python

Last synced: 08 May 2026

https://github.com/dwija12903/mentorness-internship

Developed and applied technical skills in areas such as programming languages, data analysis, and machine learning methodologies.

matplotlib numpy pandas python scikit-learn

Last synced: 08 Apr 2026

https://github.com/priyasingh26/financial_document-data_extraction

This project extracts key information from financial documents like invoices and receipts using text recognition. It processes images, classifies documents, and extracts data, which is then stored in a CSV file. The aim is to automate data collection from scanned documents, reducing manual work and increasing accuracy.

data-extraction numpy ocr pandas pillow preprocessing pytesseract-ocr python sklearn torch transformers

Last synced: 08 Apr 2026

https://github.com/chawthinn/car-price-prediction-regression-ml

This project predicts used car prices using regression models including Linear, Ridge, Random Forest, XGBoost, and LightGBM. It covers preprocessing, EDA, model evaluation, hyperparameter tuning, and model persistence using Scikit-learn and related libraries.

car-price-prediction lgbmregressor linearregression numpy pandas python scikit-learn xgbregressor

Last synced: 08 Apr 2026

https://github.com/themysterysolver/python_basics

"This repo includes simple Python programs for beginners, covering various concepts, object-oriented programming, and different module usages, to help with understanding and learning Python."

basic-python-syntax collections dsa matplotlib numpy oops-in-python pandas python re scipy

Last synced: 05 Apr 2026

https://github.com/babagata/praktikum-4-data-analysis

Data analysis for course "Physics laboratory IV"

matplotlib numpy pandas scipy

Last synced: 28 Apr 2026

https://github.com/michelenana/projet-7

PRODUISEZ UNE ETUDE DE MARCHE AVEC R OU PYTHON

acp boxplots cah kmeans matplotlib numpy pandas python scipy seaborn sklearn

Last synced: 08 Apr 2026

https://github.com/asut00/python-piscine_42ai

Python Bootcamp: A one-week intensive course with 42AI at École 42, covering Python fundamentals, data manipulation, and introductory AI concepts.

matplotlib numpy pandas python

Last synced: 07 May 2026

https://github.com/haydencordeiro/terafeed

Terafeed - Addressing Zero Hunger in Africa (Sustainability Goal SDG 2)

javscript numpy pandas powerbi python scikit-learn tableau vuejs

Last synced: 08 Apr 2026

https://github.com/mrnust/ai-powered-socialmediapostoptimizer_llm

AI-Powered Social Media Post Optimizer enhances your social media content by optimizing text posts with relevant tags and keywords, predicting engagement metrics, and analyzing image quality. Upload your post and image to receive tailored suggestions for improving engagement and visual appeal.

groq langchain large-language-model llama-3-70b matplotlib numpy opencv streamlit textblob

Last synced: 14 Apr 2026

https://github.com/ayushtiwari134/machine_learning_models

A repo where i upload all the models which i train during my journey of learning Machine Learning from scratch

linear-regression logistic-regression machinelearning matplotlib numpy pandas python random-forest scikit-learn

Last synced: 11 Apr 2026

https://github.com/allanreda/telco-customer-churn-predictor-app

A web-based machine learning application that predicts customer churn using a logistic regression model. Built with Scikit-Learn for model training, Gradio for the user interface, and deployed on Google Cloud App Engine. The app allows users to input customer data and receive predictions on churn risk to support business decision-making.

app-engine data-visualization deployment google-cloud gradio hyperparameter-tuning logistic-regression machine-learning numpy pandas scikit-learn

Last synced: 16 Apr 2026

https://github.com/zborovskaanna/grosery_store_sales_analysis

Python data analysis project. Analysis of grocery store sales using visualizations and reporting in Tableau

data-analysis data-visualization matplotlib numpy pandas python seaborn tableau

Last synced: 08 Apr 2026

https://github.com/aritrakar/ml-from-scratch

🤖Common ML implementations from scratch.

machine-learning numpy pandas python pytorch

Last synced: 07 Apr 2026

https://github.com/rakibhhridoy/covid19analysisindashboard-tableau

Covid19 dashboard analysis of world,north america,south east Asia and their characteristics upon pandemic. Some interesting statistics is shown by the data. The increase rate make effect on death and recover rate quite periodic. Simulating those changes make more interactive.

covid-19 dashboard data-processing dataviz numpy pandas python statistics tableau tableau-dashboards

Last synced: 06 May 2026

https://github.com/1adore1/face-analysis

An application for face recognition and prediction of gender, age range and emotional state.

numpy opencv pandas python pytorch

Last synced: 07 Apr 2026

https://github.com/michaeldehaney94/opencv

Python OpenCV computer vision projects

matplotlib numpy opencv-python python

Last synced: 20 Jun 2025

https://github.com/aysh2603/twitter-sentiment-analysis

The Twitter Sentiment Analysis project employs Natural Language Processing (NLP) techniques to classify tweets into positive or negative sentiments. By analyzing the tone of tweets, this project provides insights into public sentiment on various topics.

hyperparameter-tuning nlp-machine-learning numpy pandas python3 scikit-learn

Last synced: 13 Jul 2025

https://github.com/lorenzorottigni/ml-interview

Machine Learning python bootcamp: K Nearest Neighbors job interview simulation dataset

ipynb k-nearest-neighbours machine-learning numpy pandas python scikit-learn seaborn

Last synced: 07 Apr 2026

https://github.com/itzzmeakhi/ab-testing

This project was done as a part of Udacity's Data Analyst Nanodegree program in which AB-Testing is performed for the users under treatment and control groups to predict whether changes should be done for an website or it is ideal to keep the existing one, based on the performance of old and new pages. Hypothesis testing, z-test, and Logistic Regression is performed to predict results.

bootstrapping hypothesis-testing jupyter-notebook matplotlib numpy pandas python statistics z-test

Last synced: 04 May 2026

https://github.com/davityak03/sentence-paraphraser-checker-using-transformers

This Jupyter Notebook implements a tool to check whether two sentences are paraphrases by analyzing their semantic similarity using NLP techniques. It provides a similarity score and a binary decision to indicate if the sentences are paraphrases.

keras nlp nltk numpy python tensorflow tokenizer transformers

Last synced: 02 Jan 2026

https://github.com/mikma03/simulation_modeling

Simulation models using Python. Practical use of Python in real-world examples and additional resourses.

matplotlib numpy pandas portfolio python simulation stocks

Last synced: 08 May 2026

https://github.com/faraazarsath/handson-numpy

This respository is on various Numpy Tasks provided as academic assignment

numpy

Last synced: 16 May 2026

https://github.com/omraj0/covid19-data-analysis

Analysis of COVID-19 infection rates in various countries, correlating them with factors such as GDP per capita and social support.

covid-19 google-colab matplotlib numpy pandas python

Last synced: 19 Apr 2026

https://github.com/dastogirrudro/machine-learning-and-deep-learning

This is my thesis project which i have done in varsity.Here i used machine learning and deep learning i used LSTM as deep learning.This can identify aggresive spam message. Here i used pandas scikit-learn and many more framework i used python as a programming language.I used many algorithm for highering the accuracy of my project.

deep-learning lstm machine-learning numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/pranjalshivhare06/medical-ensurance-charge-predictor

The Insurance Price Predictor is a machine learning project designed to predict insurance costs based on various input features. The project leverages four different algorithms, with XGBoost emerging as the most accurate and efficient model.

fastapi machine-learning numpy pandas xgboost-classifier

Last synced: 19 Apr 2026

https://github.com/anmamun0/data-analysis-home-cleaning-services

This repository contains the analysis and visualization of data from a home cleaning services dataset. The project provides valuable insights into revenue generation, customer trends, and regional performance, helping businesses make data-driven decisions.

matplotlib numpy pandas

Last synced: 05 Mar 2025

https://github.com/vyjayanthipolapragada/image_classifier_model_hotdog

Building an Image classifier model to train and test a dataset and classify the given images into hotdog and not-hotdog.

artificial-intelligence dataset image-classification image-processing machine-learning matplotlib neural-networks numpy pandas python pytorch tensor torchvision transfer-learning

Last synced: 08 Apr 2026

https://github.com/muhkartal/e-forecast

machine learning-powered energy consumption prediction system that analyzes historical data to forecast future energy usage trends, optimizing efficiency and sustainability.

fastapi joblib matplotlib numpy pandas pydantic pytest sckit-learn seaborn tensorflow tqdm uvicorn xgboost yaml

Last synced: 08 Apr 2026

https://github.com/aysh2603/credit-card-fraud-detection

The Credit Card Fraud Detection project aims to identify fraudulent transactions from a dataset of credit card transactions. The project addresses the challenge of class imbalance and employs advanced machine learning techniques to build an effective fraud detection model.

ensemble-learning hyperparameter-tuning numpy pandas python3 scikit-learn streamlit

Last synced: 04 Feb 2026

https://github.com/sanjana-bongale/cancer_survival_data_analysis_and_prediction_using_logistic_regression

This project performs data analysis using Python to predict cancer patient survival outcomes. It involves data cleaning, exploratory analysis, and visualizations to explore factors like cancer type, stage, and treatments. A logistic regression model is built to predict patient survival based on demographic and medical data.

data-analysis data-cleaning data-science data-visualization eda jupyter-notebook kaggle logistic-regression machine-learning matplotlib numpy pandas predictive-modeling python scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/aqueeqazam/a-deep-dive-introduction-to-tensorflow

TensorFlow is a deep learning system that provides an interface for expressing deep learning algorithms and implementing them. It uses dataflow graphs to represent computation, shared state, and the operations that mutate that state.

deep-learning neural-network numpy tensor tensorflow

Last synced: 03 Aug 2025

https://github.com/bunu23/image-classification

This repository contains a notebook implementing a Convolutional Neural Network for multi-class image classification using transfer learning with a pre-trained ResNet-50 model. Covers dataset handling, model architecture customization, training, evaluation, fine-tuning, and external image prediction.

keras matplotlib numpy pil python tensorflow

Last synced: 11 Apr 2026

https://github.com/sadegh-khedry/pneumonia-detection-using-cnn

Classifying chest X-ray images into two categories: Pneumonia or Normal using a Convolutional Neural Network (CNN).

cnn convolutional-neural-networks cv2 deep-learning harris-corners joblib jupyter-notebook keras kmeans-clustering matplotlib numpy orb pandas python seaborn shutil sifit tensorflow

Last synced: 11 Apr 2026

https://github.com/subhashpolisetti/crispdm-semma-kdd-workflows

This repository demonstrates three data mining methodologies applied to various real-world datasets: CRISP-DM (Weather Analysis), KDD (Social Media Ads Analysis), and SEMMA (Spotify Recommendation System). Each project includes data exploration, preprocessing, modeling, and evaluation steps, along with comprehensive documentation, supporting files,

clustering-algorithm crisp-dm kdd latex-template machine-learning numpy pandas random-forest research-paper semma

Last synced: 07 May 2026

https://github.com/mohammedrashithkp/drlforsnakegame

This project utilises a Deep Reinforcement Learning specifically Q learning implemented using PyTorch which learns the game by interacting with the pygame environment .I have followed a tutorial on Freecodecamp YouTube Channel and made the initial model.

deep-reinforcement-learning matplotlib-pyplot numpy pygame python

Last synced: 16 May 2026

https://github.com/mah-22/room-occupancy-prediction-using-environmental-sensor-data

This project uses environmental sensor data to predict room occupancy, providing valuable insights for efficient energy management and space utilization in buildings. By analyzing factors like temperature, humidity, and light levels, the model aims to accurately forecast when rooms will be occupied, optimizing resources and enhancing overall buildi

classification data-science data-visualization exploratory-data-analysis machine-learning numpy pandas python seaborn time-series

Last synced: 07 May 2026

https://github.com/evernext10/coursera_capstone

This repository is created for sending the IBM Data Science Lab and will be used primarily for the Capstone project.

coursera-data-science ibm numpy pandas

Last synced: 10 May 2026

https://github.com/anufrievsy/man_women

Данный скрипт был создан с целью поиска субъективно интересных серий ТВ-шоу "Мужское Женское"

bs4 clear-output fuzzywuzzy ipython ipython-display jupyter jupyter-notebook jupyter-notebooks matplotlib matplotlib-pyplot numpy pandas pandas-dataframe python requests time youtube youtube-search

Last synced: 05 Feb 2026

https://github.com/pradyumnakr/facial-recognition-using-opencv

Uses haarcascades to recognize facial features, patterns and performs feature extraction followed by labelling the data to recognize the faces on a live webcam.

numpy opencv python3

Last synced: 09 May 2026

https://github.com/broodhoney/titanic-ml-from-disaster

This repository contains my analysis and solutions for the Titanic: Machine Learning from Disaster competition on Kaggle. The notebook explores the dataset, performs extensive Exploratory Data Analysis (EDA), applies feature engineering techniques, and builds predictive models to determine survival outcomes based on passenger data

machine-learning numpy pandas python scikit-learn scikitlearn-machine-learning

Last synced: 11 Apr 2026