An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/lorenzorottigni/ml-iris-svm

Machine Learning Python bootcamp: Support Vector Machines on the iris flower dataset

ipynb machine-learning numpy pandas python scikit-learn seaborn support-vector-machines

Last synced: 10 Apr 2026

https://github.com/gaball1/alzheimer-prediction-ai

Alzheimer Prediction is a machine learning-based project designed to predict the likelihood of Alzheimer’s disease using clinical and imaging data. The project features an interactive web application built with Streamlit, enabling users to input key data, view predictions, and explore visualizations.

ai data-mining gradient-boosting jupyter-notebook logistic-regression logistic-regression-algorithm machine-learning matplotlib-python models numpy pandas python random-forest-classifier streamlit xgboost-classifier

Last synced: 11 Apr 2026

https://github.com/maximlevchenko/pandas-numpy-hippocampus-analysis

This project aims to analyze the differences in hippocampus size between two groups: "Unaffect" and "Affected" using pandas, numpy and matplotlib

jypyternotebook matplotlib numpy pandas pyreadr seaborn statsmodels

Last synced: 16 Apr 2026

https://github.com/theanujsinha01/seoul-bike-sharing-demand-prediction

This project predicts the demand for bike-sharing services in Seoul using machine learning algorithms. It utilizes historical data and environmental factors to provide accurate predictions that can aid in resource optimization and planning.

data-cleaning data-science exploratory-data-analysis feature-engineering jupyter-notebook machine-learning matplotlib model numpy pandas seaborn

Last synced: 10 Apr 2026

https://github.com/sarincr/training-on-artificial-intelligence

Entree Academy's 10-day free training on Artificial Intelligence. The course is conducted in a blended-learning format, with a daily one-hour online session and three hours of project-based training.

artificial-intelligence artificial-intelligence-algorithms data-analysis data-science data-visualization decision-trees deep-learning deeplearning logistic-regression machine-learning machine-learning-algorithms machinelearning num numpy pandas regression scikit-learn scipy sklearn

Last synced: 10 Apr 2026

https://github.com/pramodyasahan/car-safe-predictor

This repository contains a machine learning project that applies the K-Nearest Neighbors (KNN) classification algorithm to predict car safety ratings. The project uses a dataset of cars, with features such as buying price, maintenance cost, number of doors, persons, lug boot size, and safety.

classification k-nearest-neighbours machine-learning numpy pandas scikit-learn

Last synced: 10 Apr 2026

https://github.com/amiridk/machinelearning

machine-learning-notes

matplotlib-pyplot numpy pandas pyhton

Last synced: 04 May 2026

https://github.com/alyssonmach/machine-learning-com-python

Machine learning applications using the Python programming language.

ia keras-tensorflow machine-learning matplotlib numpy pandas programming python scikit-learn scipy

Last synced: 10 Apr 2026

https://github.com/the-developer-306/neural-style-transfer

Neural Style Transfer project leveraging TensorFlow and VGG19 to blend artistic styles with content images. Features include style and content initialization, multi-layer feature extraction, total variation loss, and post-processing enhancements like sharpening, upscaling, and brightening.

deep-learning image-processing neural-style-transfer numpy opencv-python post-processing tensorflow transfer-learning vgg19

Last synced: 01 May 2026

https://github.com/lorenzorottigni/ml-ecommerce

Machine Learning Python bootcamp: linear regression on an ecommerce dataset

ipynb linear-regression machine-learning numpy pandas python scikit-learn seaborn

Last synced: 07 Apr 2026

https://github.com/allanotieno254/pneumonia-xray-classification

This project is a Deep Learning-based Pneumonia classification system that allows medical staff to upload chest X-ray images and quickly determine whether a patient shows signs of pneumonia.

keras machine-learning numpy pandas pil pillow python streamlit streamlit-webapp

Last synced: 10 Apr 2026

https://github.com/anuppm9917/super-store-sales-analysis-power-bi-project

Driven to find out which products, regions, categories, and customer segments a company should target or avoid, I searched for and selected a Kaggle dataset that matches a standard superstore's requirements.

data data-analysis data-visualization datacleansing excel exploratory-data-analysis jupyter-notebook numpy pandas plotly powerbi python3

Last synced: 10 Apr 2026

https://github.com/sgb31/csv-retrival-based-rag

Created a simple RAG system that lets users upload any CSV file and ask questions about it. It reads the data, finds the most relevant info, and uses an LLM to answer the query. Super handy for quickly exploring and understanding data through natural language.

generative-ai llm numpy pandas pydantic python3 sentence-transformers sklearn streamlit transformers

Last synced: 19 Jun 2026

https://github.com/dineshdhamodharan24/amazon-reviews-sentiment-analysis

This is a sentiment analysis project that classifies Amazon product reviews as positive or negative using machine learning techniques.

matplotlib numpy pandas python scikit-learn

Last synced: 10 Apr 2026

https://github.com/shivammg/coursera-ml-numpy

Coursera Machine Learning course done using Python, NumPy, and Seaborn

machine-learning numpy python seaborn

Last synced: 09 May 2026

https://github.com/jaweria-b/eda-basketball

The Streamlit app analyzes NBA player stats with user-selected filters, offering data download and an intercorrelation heatmap.

matplotlib numpy python streamlit

Last synced: 10 Apr 2026

https://github.com/niteshchawla/logistics-nn-regression

A case study on India's largest marketplace for intra-city logistics. The dataset contains the features needed to train a regression model that estimates delivery time.

adam-optimizer data-visualization encoding exploratory-data-analysis feature-engineering hidden-layers hyperparameter-tuning keras-tensorflow kerastuner metrics neural-network numpy pandas regression relu scaling sequential-models

Last synced: 10 Apr 2026

https://github.com/ahmedabdalkreem/connected_component_labeling

A technique for detecting small objects in an image, such as shapes and numbers; it can be used in OCR.

computer-vision connected-components matplotlib numpy object-detection python rgb2gray threshold

Last synced: 11 Apr 2026

https://github.com/crvargasm/moore-penrose-matrix

A small implementation of an algorithm that is capable of computing the Moore-Penrose inverse of any matrix.

numerical-methods numpy python

Last synced: 10 Apr 2026

https://github.com/soumyapro/wine-quality-prediction

This project is about the prediction of wine quality using machine learning algorithms

boxplot matplotlib numpy pandas random-forest smote

Last synced: 10 Apr 2026

https://github.com/farhad-here/data-visualization-analysis-dva

This is my data analysis project. Users can use it to clean and preprocess data or to visualize it. They can also impute or encode their dataset.

altair bokeh data-analysis data-analysis-python io matplotlib numpy pandas plotly python sklearn streamlit

Last synced: 11 Apr 2026

https://github.com/utkarsh251106/cricket-shot-analyzer

Real-time cricket shot analyzer using Python, OpenCV, and MediaPipe. Processes videos frame-by-frame, overlays pose and biomechanical metrics, and outputs an annotated video with JSON evaluation. Note that the output video might contain "??", which appears because OpenCV can't display the degree symbol.

artificial-intelligence computer-vision deep-learning machine-learning mediapipe numpy python real-time

Last synced: 05 May 2026

https://github.com/zuhairzia/titanic-survival-project

This is a Titanic Survival Prediction Model developed using Python, Pandas, Scikit-learn, and Jupyter Notebook. The model predicts whether a passenger survived the Titanic disaster based on features such as age, gender, and passenger class.

csv-dataset flask jupyter-notebook matplotlib numpy pandas pandas-library python scikit-learn seaborn streamlit

Last synced: 11 Apr 2026

https://github.com/dmarks84/coursework_project_apache-airflow-kafka-on-toll-booth-data

Project for IBM Data Engineering & Python course on ETL & Big Data -- Read in live toll booth data, wrangled and transformed it, and wrote it into a SQL database

apache-airflow apache-kafka automation dags data-modeling databases eda elt etl mysql numpy pandas pipelines python sql

Last synced: 11 Apr 2026

https://github.com/timothyjan/intro-machine-learning-classifiers

We will use the scikit-learn library, which is a higher-level machine learning library that will work with NumPy data, and Pandas, a library that makes it easier to manipulate data. We will explore a variety of classification algorithms, and compare their performance on a “real-world” dataset, which will introduce its own set of challenges.

numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/dhanish03/credit_card_fraud_detection

Developed and implemented an advanced credit card fraud detection system (CCFDS) using ML algorithms and pattern recognition techniques. Integrated real-time monitoring and adaptive learning capabilities into the system to dynamically adjust fraud detection parameters, ensuring effectiveness in identifying emerging fraud patterns.

kaggle-dataset numpy pandas-dataframe python3 sklearn

Last synced: 16 Apr 2026

https://github.com/riju18/from-data-production-to-client-handover

A common, tedious problem is building a data app that demonstrates data analysis, analytics, and machine learning to a client. This is an attempt to do so on a small scale in the simplest yet most powerful way.

machine-learning matplotlib numpy pandas plotly python seaborn streamlit

Last synced: 30 Apr 2026

https://github.com/gregoritsch3/ml_eda_clustering_aidassessment

An EDA and Machine Learning Clustering exercise on the Country Aid Assessment dataset demonstrating the use of PCA, KMeans and DBSCAN clustering, Elbow Methods, etc. The clustering algorithm successfully demarcates countries that are in most dire need of aid based on their GDPP and Child Mortality rate.

anova dbscan kmeans machine-learning matplotlib numpy pandas pca scikit-learn seaborn statistics

Last synced: 16 Apr 2026

https://github.com/charles-l/rayboi

A raytracer written in Futhark/Python

futhark numpy pathtracing python3 raytracing

Last synced: 19 Apr 2026

https://github.com/talapanenivarshithchowdary/asteroid-detection-ml

This project uses Machine Learning to detect and classify asteroids based on trajectory and size, aiding in Near-Earth Object detection and planetary defense.

classification data-science decision-trees jupyter-notebook knn logistic-regression machine-lea matplotlib numpy pandas pillow prediction python3 random-forest scikit-learn

Last synced: 11 Apr 2026

https://github.com/audy21/datacamp

Learning portfolio documenting my progress, while taking Data Analyst & Data Science certifications from DataCamp.

data-analysis data-science machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/rahatmoktadir03/customer-churn-prediction

A machine learning project for predicting customer churn, enabling businesses to identify at-risk customers and develop retention strategies.

business-analytics customer-churn-prediction data-science google-colab machine-learning numpy pandas python sklearn strreamlit xception-model

Last synced: 11 Apr 2026

https://github.com/winterwind/ecg_signal_classification

Two-part project that involves detecting the R-peaks in an ECG signal to extract the individual ECG beats and making a machine learning model to classify them

csv csv-files data-science decision-trees ecg ecg-classification ecg-signal jupyter jupyter-notebook knearest-neighbors knn machine-learning matplotlib matplotlib-pyplot numpy pandas pyplot python random-forest scipy

Last synced: 11 Apr 2026

https://github.com/kahngjoonkoh/randomshapegenerator

A program that will generate images with random shapes and background colours. Can be customized and generated in bulk.

generative-art numpy opencv python threading tkinter

Last synced: 11 Apr 2026

https://github.com/egorumaev/2023-cirrhosis-outcomes

Predicting treatment outcomes for patients with liver cirrhosis

catboost imblearn iqr lda matplotlib numpy pandas pca phik pipeline sklearn t-sne xgboost

Last synced: 08 May 2026

https://github.com/dastogirrudro/machine-learning-and-deep-learning

This is my university thesis project. It uses machine learning and deep learning (an LSTM) to identify aggressive spam messages. It is written in Python with pandas, scikit-learn, and other frameworks, and I tried several algorithms to improve the accuracy.

deep-learning lstm machine-learning numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/udhaya2823/dataspark-illuminating-insights-for-global-electronics

✨DataSpark✨ is a powerful analytics project transforming raw retail data into actionable insights for Global Electronics. By leveraging Python, SQL, and interactive visualizations, it uncovers trends in customer behavior, sales performance, and product popularity, driving smarter business decisions and boosting growth.

data-science data-visualization database-management datacleaning exploratory-data-analysis matplotlib numpy pandas powerbi python seaborn sql version-control

Last synced: 11 Apr 2026

https://github.com/shreyasmehta05/sortsync

A custom sorting algorithm combining parallel merge and count sort, with detailed performance comparisons against standard sorting techniques.

c matplotlib numpy python3

Last synced: 04 Feb 2026

https://github.com/matiassenia/modelo-prediccion-temperatura

A machine learning project that uses TensorFlow to predict temperatures in degrees Fahrenheit from temperatures in degrees Celsius. It includes a Python script for training the model and making predictions, as well as training data and visualizations.

adam-optimizer machine-learning numpy python python3 tensorflow

Last synced: 11 Apr 2026

https://github.com/djdurga/google_play_store_apps_analysis

This data analysis project focuses on exploring and understanding the Google Play Store Apps dataset.

numpy pandas python

Last synced: 11 Apr 2026

https://github.com/isabelacaldeira/chutelibre

Crashing into code. Here is a physics problem about free fall solved with python!

freefall jupyter-notebook matplotlib numpy physics physics-simulation python3

Last synced: 11 Apr 2026

https://github.com/cfbastarz/jupyternotebooks

A collection of several Jupyter notebooks.

dask matplotlib numpy python xarray xesmf

Last synced: 18 Jan 2026

https://github.com/Gregoritsch3/Exercise_Pandas_1

A Pandas exercise demonstrating the loading, cleaning and reorganization of data in .xlsx or .csv files, descriptive statistics, data visualization, statistical approximation of data with the normal distribution, etc. Libraries include Pandas, NumPy, SciPy, SymPy, Matplotlib,

data-cleaning data-visualization descriptive-statistics matplotlib numpy pandas scipy sympy

Last synced: 01 May 2025

https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle

Independent Project - Kaggle Competition -- I worked on the obesity classification data set as part of a Kaggle Competition of the same name, scoring (for accuracy) above 0.9

classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml

Last synced: 11 Apr 2026

https://github.com/harmanveer-2546/wafer-fault-detection

The goal is to eliminate manual work in identifying faulty wafers. Opening and handling suspected wafers disrupts the entire process. False negatives result in wasted time, manpower, and costs.

clustering data-transformation feature-selection machine-learning matplotlib numpy pandas python random-forest roc-auc-curve roc-auc-score seaborn sklearn svc xgboost

Last synced: 11 Apr 2026

https://github.com/lohiyah/real-estate-price-forecast

A Python-based app predicting real estate prices using machine learning. Built with Pandas, NumPy, Scikit-learn, Matplotlib, and Seaborn for data processing and visualization, and Flask for the web interface.

flask matplotlib numpy pandas python3 scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/akhileshthite/india-population

ML (simple linear regression) model for predicting India's population.

machine-learning numpy pandas python scikit-learn

Last synced: 09 Apr 2026

https://github.com/biasandrade/rede-neural-do-zero-classificacao-binaria

🧠 Artificial neural network built from SCRATCH (NumPy only) to detect fraud in transactions | Forward/Backward Propagation | Sigmoid | Gradient Descent

backpropagation binary-classification deep-learning fraud-detection from-scratch gradient-descent machine-learning neural-network numpy python

Last synced: 18 Nov 2025

https://github.com/chokzb/covid19_vaccination_analysis

An EDA project examining global COVID-19 vaccination progress. The notebook investigates vaccination trends by country, daily vaccination rates, timeline patterns, and dose distribution. The project includes visualisations created with Matplotlib, Seaborn, and Plotly.

covid-19 data-analysis data-visualization jupyter-notebook matplotlib numpy pandas plotly python seaborn vaccination

Last synced: 07 May 2026

https://github.com/stdlib-js/blas-ext-linspace

Return a new ndarray filled with linearly spaced values over a specified interval along one or more ndarray dimensions.

arange arrange javascript linear linspace math mathematics matlab ndarray node node-js nodejs numpy seq sequence statistics stats stdlib

Last synced: 04 May 2026

https://github.com/bhuvan-s-prasad/streamlit-regression

A machine learning project that predicts housing prices in California using regression techniques. This project includes comprehensive exploratory data analysis, feature engineering, linear regression modeling, and an interactive Streamlit web application for making predictions.

california-housing-price-prediction exploratory-data-analysis linear-regression machine-learning matplotlib numpy pandas python scikit-learn seaborn streamlit supervised-learning

Last synced: 11 Apr 2026

https://github.com/ecrows/flippy

Python command-line utility for generating videos from image directories.

data-science image-processing numpy opencv python python3 video video-processing visualization

Last synced: 04 Mar 2026

https://github.com/4211421036/githubiotpy

GitHubIoT is a comprehensive toolkit designed to simplify the visualization of IoT (Internet of Things) data with seamless GitHub integration. The application provides an intuitive graphical interface for real-time data monitoring, analysis, and configuration

cli esp32 esp8266 github-actions github-iot matplotlib numpy pypi-packages python tkinter

Last synced: 16 Apr 2025

https://github.com/alexquar/whatisthatdog

A mobile app built with React Native that sends API requests for dog-breed classification to a Flask API server hosting a custom CNN built with PyTorch and trained on the Stanford Dogs dataset. To see some examples, check out the images from my portfolio linked here.

aws ec2 expo flask gunicorn matplotlib-pyplot numpy pandas pytorch react-native seaborn typescript

Last synced: 11 Apr 2026

https://github.com/pramodyasahan/model-selection

This repository explores and compares different regression models for predicting continuous outcomes. It includes implementations and evaluations of five key regression models. The primary goal is to demonstrate how each model works, evaluate each model's performance using R-squared values, and guide users in selecting the best model.

machine-learning modelselection numpy pandas python regression scikit-learn

Last synced: 08 Mar 2025

https://github.com/vyjayanthipolapragada/marketing_statistical_analysis

Statistical analysis of customer data and their impact on the sales of products based on marketing campaigns

customer-data data-analysis dataframes marketing matplotlib numpy pandas python seaborn statistical-analysis

Last synced: 11 Apr 2026

https://github.com/anibalalpizar/python-machine-learning-example

This code reads and preprocesses a dataset for classification using pandas, numpy, matplotlib and scikit-learn. The dataset is split into three parts for training, validation and testing. The data is then scaled and optionally oversampled for balanced classes.

machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/hitthecodelabs/weatherforecast-lstm

TensorFlow and Python for analyzing and forecasting weather data

cnn keras lstm matplotlib numpy pandas python tensorflow weather weather-forecast

Last synced: 11 Apr 2026

https://github.com/f-ssemwanga/advanced-programming-york

This repository holds the extensive python programming completed during the advanced programming unit of my master's programme

comprehension csv-parser json lambda-functions list matplotlib mongodb numpy oops-in-python pandas-dataframe postgresql python3 tkinter-gui

Last synced: 11 Apr 2026

https://github.com/yanglr/mlnotes-python

Machine learning Notes for Python 3 (NumPy, TensorFlow etc.), with interactive Jupyter notebook.

jupyter jupyter-notebook numpy numpy-notes

Last synced: 21 May 2026

https://github.com/broodhoney/titanic-ml-from-disaster

This repository contains my analysis and solutions for the Titanic: Machine Learning from Disaster competition on Kaggle. The notebook explores the dataset, performs extensive Exploratory Data Analysis (EDA), applies feature engineering techniques, and builds predictive models to determine survival outcomes based on passenger data

machine-learning numpy pandas python scikit-learn scikitlearn-machine-learning

Last synced: 11 Apr 2026

https://github.com/supreme-gg-gg/beam-analysis

Fast, customisable, intuitive web application to simplify beam analysis. Certified by Bob the Builder.

civil-engineering matplotlib numpy python streamlit structural-analysis

Last synced: 11 Apr 2026

https://github.com/linsanity03/football_prediction

ML project to predict goals from football events such as penalties and free kicks.

jupyter-notebook numpy pandas pyspark python3 tensorflow

Last synced: 11 Apr 2026

https://github.com/shubhamahobia/lstm-hamlet

The LSTM Next Word Prediction project trains a model on Hamlet to predict the next word in a sequence. After preprocessing the text, an LSTM network learns patterns in the language. The trained model can then generate text by predicting the next word in a given sequence, with its accuracy evaluated to ensure it effectively models the language.

deep-learning github hamlet lstm lstm-neural-network lstm-neural-networks machine-learning neural-network nltk numpy pandas pickle python

Last synced: 11 Apr 2026

https://github.com/costopoulos/ntua-telecommunications

📻 📞 NTUA ECE Telecommunications Course Code & Report

numpy nyquist python sampling scipy telecommunications

Last synced: 20 Apr 2026

https://github.com/zhanziyuan/pixelpuzzle-web

Encode/decode images using Base64 or shuffle/recover the pixels of images.

anime encryption image image-processing numpy python

Last synced: 11 Apr 2026