An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language that adds support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.
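
As a small illustration of the array support and array-level functions mentioned above, the following sketch builds a 3x4 array and applies an element-wise operation, an axis reduction, and a matrix product. The variable names are arbitrary and chosen only for this example.

```python
import numpy as np

# A 2-D array (matrix) with 3 rows and 4 columns holding the values 0..11.
a = np.arange(12).reshape(3, 4)

# Element-wise arithmetic applies to every entry without an explicit loop.
doubled = a * 2

# Reduction along an axis: axis=0 collapses the rows, giving one sum per column.
col_sums = a.sum(axis=0)

# Matrix product of a (3x4) with its transpose (4x3) gives a 3x3 matrix.
gram = a @ a.T

print(a.shape, col_sums, gram.shape)
```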

https://github.com/Gregoritsch3/Exercise_Pandas_1

A Pandas exercise demonstrating the loading, cleaning and reorganization of data in .xlsx or .csv files, descriptive statistics, data visualization, statistical approximation of data with the normal distribution, etc. Libraries include Pandas, NumPy, SciPy, SymPy, Matplotlib,

data-cleaning data-visualization descriptive-statistics matplotlib numpy pandas scipy sympy

Last synced: 01 May 2025

https://github.com/mramshaw/intro-to-ml

Intro to Machine Learning - Pattern Recognition for Fun and Profit

machine-learning matplotlib ml numpy pandas pip pip3 python scikit-learn scipy seaborn seaborn-plots sklearn statsmodels tensorflow weka

Last synced: 11 Apr 2026

https://github.com/harmanveer-2546/wafer-fault-detection

The goal is to eliminate manual work in identifying faulty wafers. Opening and handling suspected wafers disrupts the entire process. False negatives result in wasted time, manpower, and costs.

clustering data-transformation feature-selection machine-learning matplotlib numpy pandas python random-forest roc-auc-curve roc-auc-score seaborn sklearn svc xgboost

Last synced: 11 Apr 2026

https://github.com/lohiyah/real-estate-price-forecast

A Python-based app predicting real estate prices using machine learning. Built with Pandas, NumPy, Scikit-learn, Matplotlib, and Seaborn for data processing and visualization, and Flask for the web interface.

flask matplotlib numpy pandas python3 scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/pavanreddy565/infosys_python_for_datascience

Data Science Course Tasks Repository: this repository contains tasks and projects completed during the Data Science course from Infosys Springboard. It includes setup instructions for running Jupyter Notebook sessions in a virtual environment.

matplotlib numpy pandas

Last synced: 29 Apr 2026

https://github.com/guru-bharadwaj20/saliency-maps

A saliency-guided autonomous driving system using cmSalGAN heatmaps, a custom Fusion model, and Faster R-CNN detection. It generates refined saliency maps, trains a saliency-aware detector, and simulates vehicle navigation using datasets like NJUD, NLPR, and STEREO.

cmsalgan numpy opencv python pytorch torchvision

Last synced: 18 Nov 2025

https://github.com/bhuvan-s-prasad/streamlit-regression

A machine learning project that predicts housing prices in California using regression techniques. This project includes comprehensive exploratory data analysis, feature engineering, linear regression modeling, and an interactive Streamlit web application for making predictions.

california-housing-price-prediction exploratory-data-analysis linear-regression machine-learning matplotlib numpy pandas python scikit-learn seaborn streamlit supervised-learning

Last synced: 11 Apr 2026

https://github.com/chirindaopensource/non_linear_forecasting_backcasting

Python implementation of Gourieroux-Jasiak's (2025) mixed causal-noncausal VAR models. Features probabilistic forecasting, nonlinear innovation filtering, and state-dependent IRF analysis for financial time series with explosive dynamics. Enables robust risk assessment and structural analysis of speculative behavior.

bootstrap bubble-analysis financial-modeling forecasting jupyter-notebook monte-carlo nonlinear-modeling numpy pandas python quantitative-finance research-implementation risk-management scipy statistical-modeling time-series uncertainty-quantification var-models

Last synced: 11 Apr 2026

https://github.com/ecrows/flippy

Python command-line utility for generating videos from image directories.

data-science image-processing numpy opencv python python3 video video-processing visualization

Last synced: 04 Mar 2026

https://github.com/xiaohaoo/machine-learning-examples

Examples of common machine learning models implemented with different frameworks.

keras matplotlib numpy python pytorch sklearn tensorflow

Last synced: 11 Apr 2026

https://github.com/eljandoubi/trading-with-momentum

A trading strategy based on a momentum indicator.

momentum numpy pandas scipy trading trading-algorithms trading-strategies

Last synced: 05 May 2026

https://github.com/4211421036/githubiotpy

GitHubIoT is a comprehensive toolkit designed to simplify the visualization of IoT (Internet of Things) data with seamless GitHub integration. The application provides an intuitive graphical interface for real-time data monitoring, analysis, and configuration

cli esp32 esp8266 github-actions github-iot matplotlib numpy pypi-packages python tkinter

Last synced: 16 Apr 2025

https://github.com/alexquar/whatisthatdog

Mobile app built with React Native that sends API requests for classifying dog breeds to a Flask API server hosting a custom-built CNN made with PyTorch and trained on the Stanford dog dataset. To take a look at some examples, check out some images from my portfolio linked here.

aws ec2 expo flask gunicorn matplotlib-pyplot numpy pandas pytorch react-native seaborn typescript

Last synced: 11 Apr 2026

https://github.com/pramodyasahan/model-selection

This repository explores and compares different regression models for predicting continuous outcomes. This repository includes implementations and evaluations of five key regression models. The primary goal is to demonstrate how each model works, evaluate their performance using R-squared values, and guide users in selecting the best model.

machine-learning modelselection numpy pandas python regression scikit-learn

Last synced: 08 Mar 2025
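
The model-selection entry above compares regression models by their R-squared values. A minimal NumPy sketch of that metric, using the standard definition 1 - SS_res / SS_tot, might look like this; the function name `r_squared` is hypothetical and not taken from the repository.

```python
import numpy as np

def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    # Residual sum of squares: error left after the model's predictions.
    ss_res = np.sum((y_true - y_pred) ** 2)
    # Total sum of squares: error of always predicting the mean.
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)
    return 1.0 - ss_res / ss_tot

print(r_squared([1, 2, 3, 4], [1, 2, 3, 5]))
```

A value of 1 means the predictions match the targets exactly, and 0 means the model does no better than predicting the mean of the targets.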

https://github.com/dhirajmerit/diabetes-prediction-models

This project implements multiple machine learning algorithms to predict diabetes diagnoses based on medical diagnostic data. The project compares the performance of various models, providing insights into which algorithms are most effective for this task.

ai machine-learning numpy pandas tensorflow

Last synced: 11 Apr 2026

https://github.com/gkannan-codes/habitableexos

With Earth’s habitability under strain, we ask: which known exoplanets could humans live on? Using NASA’s Exoplanet Archive, we score planets 0–1 (1 ≈ Earth) from five Earth-normalized features to rank top candidates.

data html kaggle matplotlib-pyplot numpy pandas plotly python seaborn visualization

Last synced: 11 Apr 2026

https://github.com/abhi227070/wine-quality-prediction

The Wine Quality Prediction project utilizes machine learning to assess wine quality based on various parameters. It offers a user-friendly interface built with Streamlit for easy interaction. Deployed on AWS, it provides scalable access to quality predictions for wine batches.

classification-algorithm machine-learning machine-learning-algorithms machinelearning matplotlib numpy pandas python3 seaborn

Last synced: 11 Apr 2026

https://github.com/tolumie/exploratory-data-analytics-projects

Exploratory Data Analytics – A collection of projects covering data exploration, feature engineering, hypothesis testing, and predictive modeling across diverse datasets, including insurance, real estate, laptops, cars, COVID-19, and the Olympics.

data-analysis data-visualization data-wrangling exploratory-data-analysis-eda feature-engineering hypothesis-testing machine-learning matplotlib numpy pandas predictive-modeling python seaborn statistical-analysis

Last synced: 11 Apr 2026

https://github.com/anibalalpizar/python-machine-learning-example

This code reads and preprocesses a dataset for classification using pandas, numpy, matplotlib and scikit-learn. The dataset is split into three parts for training, validation and testing. The data is then scaled and optionally oversampled for balanced classes.

machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 11 Apr 2026
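
The python-machine-learning-example entry above describes a three-way split, scaling, and optional oversampling. The sketch below shows one way those steps could be done with NumPy alone. It is not the repository's code (which uses pandas and scikit-learn); the synthetic data, the 60/20/20 split, and all variable names are assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic, imbalanced data (assumption: 100 samples, 3 features, ~20% positives).
X = rng.normal(size=(100, 3))
y = (rng.random(100) < 0.2).astype(int)

# Shuffle the indices, then cut them into 60% train, 20% validation, 20% test.
idx = rng.permutation(len(X))
train_idx, val_idx, test_idx = np.split(idx, [60, 80])

# Standardize using statistics from the training set only, to avoid leakage.
mean = X[train_idx].mean(axis=0)
std = X[train_idx].std(axis=0)
X_train = (X[train_idx] - mean) / std
X_val = (X[val_idx] - mean) / std
X_test = (X[test_idx] - mean) / std
y_train = y[train_idx]

# Random oversampling: resample each class with replacement up to the majority count.
classes, counts = np.unique(y_train, return_counts=True)
target = counts.max()
parts = []
for c, n in zip(classes, counts):
    members = np.flatnonzero(y_train == c)
    extra = rng.choice(members, size=target - n, replace=True)
    parts.append(np.concatenate([members, extra]))
balanced = np.concatenate(parts)
X_bal, y_bal = X_train[balanced], y_train[balanced]

print(X_bal.shape, np.unique(y_bal, return_counts=True))
```

Only the training portion is oversampled here, so the validation and test sets keep the original class balance.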

https://github.com/hitthecodelabs/weatherforecast-lstm

TensorFlow and Python for analyzing and forecasting weather data

cnn keras lstm matplotlib numpy pandas python tensorflow weather weather-forecast

Last synced: 11 Apr 2026

https://github.com/rahul-shrivastav/digit-recognition-nn

Web application that recognizes handwritten digits using a deep learning model

deep-learning flask numpy python reactjs tensorflow

Last synced: 11 Apr 2026

https://github.com/f-ssemwanga/advanced-programming-york

This repository holds the extensive Python programming completed during the advanced programming unit of my master's programme.

comprehension csv-parser json lambda-functions list matplotlib mongodb numpy oops-in-python pandas-dataframe postgresql python3 tkinter-gui

Last synced: 11 Apr 2026

https://github.com/naveen88112/final_education

Student Performance Prediction: this project examines student performance data, pre-processes the features, and applies a machine learning method (Random Forest) to forecast final grades. The data undergoes exploratory data analysis (EDA) and feature engineering prior to model training and assessment.

data-preprocessing exploratory-data-analysis machine-learning model-evaluation numpy pandas python

Last synced: 11 Apr 2026

https://github.com/dmarks84/coursework_project_text-mining-spam-analysis

Project for University of Michigan Applied Data Science Specialization -- Performed NLP in order to build features of email messages; trained various classification models to help predict if a message was spam.

classification databases eda nlp numpy pandas python scikit seaborn sentiment-analysis statistics supervised-ml text-mining unsupervised-ml visualization

Last synced: 11 Apr 2026

https://github.com/jbalooshie/movies-etl

Exercise working with movie datasets from Kaggle and Wikipedia. Python is used to extract, clean, and combine the data, which is then loaded into a PostgreSQL database.

data-analysis data-science jupyter-notebook numpy pandas postgresql postgresql-database python sqlalchemy

Last synced: 11 Apr 2026

https://github.com/arnauog/ironhack_data_analytics

Weekly labs to practice everything taught in class during the Ironhack Data Analytics bootcamp in Barcelona (Sept 2024 - March 2025).

machine-learning mongodb mysql numpy pandas python seaborn tableau webscraping

Last synced: 11 Apr 2026

https://github.com/supreme-gg-gg/beam-analysis

Fast, customisable, intuitive web application to simplify beam analysis. Certified by Bob the Builder.

civil-engineering matplotlib numpy python streamlit structural-analysis

Last synced: 11 Apr 2026

https://github.com/ansh-info/industrial-scale-penicillin-simulation

Optimizing industrial-scale penicillin production using machine learning and data analysis.

jupyter-notebook machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/linsanity03/football_prediction

ML project to predict goals from football events such as penalties and free kicks.

jupyter-notebook numpy pandas pyspark python3 tensorflow

Last synced: 11 Apr 2026

https://github.com/anshpg/vectorwave

The Jupyter Notebook (ipynb file) contained within this repository serves as the primary codebase for this project. It demonstrates how to preprocess the input data, train the logistic regression classifier, evaluate its performance, and save the trained model for future use.

collab jupyter-notebook matplotlib-pyplot numpy pandas python sklearn

Last synced: 11 Apr 2026

https://github.com/rakibhhridoy/differentprojects

Some of the learning projects I practiced on while getting started in data science. Not all of them, just a few that were stored in my local repository. They may be useful for beginner data science enthusiasts. Explore and learn!

data-science deep-learning machine-learning mathematics matplotlib numpy pandas python scikit-learn seaborn statistics

Last synced: 11 Apr 2026

https://github.com/amnydv17/landmark-detection

This project aims to leverage the power of deep learning models to automatically detect and pinpoint landmarks such as famous monuments, buildings, natural landmarks, and other recognizable structures within images.

machine-learning matplotlib numpy pandas python3 scikit-learn seaborn tensorflow

Last synced: 11 Apr 2026

https://github.com/pranavsuriya-sr/e-hub_energysystems

Python based Web Application for analysis and visualization of concepts in Energy Systems.

css energy-systems-analysis html javascript matplotlib numpy python schemdraw streamlit

Last synced: 11 Apr 2026

https://github.com/samuele-lolli/data-analytics-techniques

A practical approach to data analytics pipeline.

numpy pandas pytorch scikit-learn

Last synced: 11 Apr 2026

https://github.com/iaroslavbelavskii/statkemapp

Here's the code for my 'StatKemApp' app, that can be used to calculate statistics for the experimental laboratory data (e.g. titration). Feel free to use it and report any bugs!

chemical-engineering chemistry matplotlib-pyplot numpy pandas python scipy

Last synced: 11 Apr 2026

https://github.com/pd2871/streamlit_automl

Auto Exploratory Data Analysis using Streamlit App

matplotlib numpy pandas python streamlit streamlit-webapp

Last synced: 11 Apr 2026

https://github.com/vyjayanthipolapragada/fraud_detection_creditcard

Detecting fraudulent credit card transactions by training a Decision Tree model using Scikit-learn and SnapML.

classification-model data-preprocessing decision-tree-classifier kaggle-dataset machine-learning numpy pandas python scikit-learn snapml time tree-model

Last synced: 11 Apr 2026

https://github.com/imdeepmind/knn_from_scratch

KNN classifier in pure Numpy and Python

knn knn-classification machine-learning numpy python

Last synced: 11 Apr 2026
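
For readers unfamiliar with the technique named in the knn_from_scratch entry above, here is a minimal sketch of a k-nearest-neighbours classifier in NumPy. It is an independent illustration, not the repository's implementation; the function name `knn_predict`, the Euclidean metric, and the majority-vote tie handling are assumptions.

```python
import numpy as np

def knn_predict(X_train, y_train, X_query, k=3):
    """Classify each query point by majority vote among its k nearest training points."""
    X_train = np.asarray(X_train, dtype=float)
    y_train = np.asarray(y_train)
    X_query = np.asarray(X_query, dtype=float)

    # Pairwise squared Euclidean distances via broadcasting, shape (n_query, n_train).
    diff = X_query[:, None, :] - X_train[None, :, :]
    dist2 = np.sum(diff ** 2, axis=2)

    # Indices of the k closest training points for each query row.
    nearest = np.argsort(dist2, axis=1)[:, :k]

    preds = []
    for row in y_train[nearest]:
        # Majority vote; on a tie, np.argmax picks the smallest label value.
        labels, counts = np.unique(row, return_counts=True)
        preds.append(labels[np.argmax(counts)])
    return np.array(preds)

X_train = [[0, 0], [0, 1], [1, 0], [5, 5], [5, 6], [6, 5]]
y_train = [0, 0, 0, 1, 1, 1]
print(knn_predict(X_train, y_train, [[0.2, 0.1], [5.5, 5.5]], k=3))
```

The broadcasting step builds an (n_query, n_train, n_features) array, which is simple but memory-hungry for large datasets.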

https://github.com/prashhhant213/cardioflex-treadmill-analysis-using-descriptive-statistics-probability

Analysis and visualization of CardioFlex treadmill data to provide insights and recommendations for growing their user base.

colab-notebook numpy pandas probability python stats

Last synced: 11 Apr 2026

https://github.com/kzon94/torn-market-analyzer

Streamlit app that parses Torn Add Listing text, matches items with a custom dictionary, fetches market data via the public API, and generates KPIs and price recommendations using a modular Python analytics pipeline.

data-analysis data-engineering fuzzy-matching market-analytics numpy pandas python streamlit torn-city torn-city-api

Last synced: 11 Apr 2026

https://github.com/thorbeorn/epsi-m1-datamanagement-etude-de-cas

This project implements a complete HR data processing pipeline following a company merger.

dashboard duckdb epsi etl kaleido numpy pandas plotly python pyyaml

Last synced: 11 Apr 2026

https://github.com/batthulavinay/which-anime-dominates-the-world

This Kaggle notebook analyzes the global popularity of anime across different countries. Using data visualization and statistical insights, we explore which anime dominate in various regions and uncover trends in global anime consumption.

data-visualization exploratory-data-analysis matplotlib numpy pandas python seaborn

Last synced: 24 Mar 2025

https://github.com/hariprasath-v/av-dataverse-hack---insurance-claim-prediction

Create a machine learning model to predict if the policyholder will file a claim in the next 6 months or not based on the set of car and policy features.

analyticsvidhya classification exploratory-data-analysis f1-score matplotlib numpy pandas python randomforest-classification scikit-learn seaborn shap

Last synced: 11 Apr 2026

https://github.com/nikhil-donthusaram/heartdiseaseprediction

Heart Disease Prediction App is a machine learning web application that predicts the likelihood of heart disease based on user medical inputs. Built using a Decision Tree Classifier and deployed with Streamlit for an interactive, user-friendly interface.

data-analysis descision-tree joblib jupyter-notebook machine-learning matplotlib numpy pandas python3 seaborn sklearn streamlit vscode

Last synced: 11 Apr 2026

https://github.com/jagroop2001/learning-numpy

Learning NumPy by completing the exercises.

numpy pandas python

Last synced: 16 May 2025

https://github.com/junaidsumsaal/advanceparkingsystem

It's a web application that integrates AI-powered parking spot predictions, real-time notifications, and role-based dashboards for Drivers, Attendants, and Providers. This system is designed to streamline parking operations, enhance user experience, and leverage machine learning for future availability predictions.

axios-react chakra-ui-react data-science met metrics ml numpy pand python3 react sklearn tailwindcss typescript

Last synced: 11 Apr 2026

https://github.com/kersh1337228/tradenv

Trading strategies testing environment based on open stock market data

django django-rest-framework numpy pandas python react stock-market typescript

Last synced: 11 Apr 2026

https://github.com/kaguya163/marketing_campaigns

Analysis of marketing effectiveness in sports retail.

ab-testing machine-learning matplotlib numpy pandas python scikit-learn scipy sqlite3

Last synced: 11 Apr 2026

https://github.com/vansh-py04/convolution-net-from-scratch

A minimal NumPy-based implementation of a 3-layer convolutional neural network (CNN) from scratch — including custom forward and backward passes for conv, ReLU, pooling, affine, and softmax layers. Perfect for learning how CNNs actually work under the hood.

backpropagation cnn computer-vision convolutional-neural-networks cs231n cs231n-assignment deep-learning from-scratch from-scratch-in-python fully-connected-network machine-learning maxpool2d nueral-networks numpy softmax stanford-deep-learning

Last synced: 24 Apr 2026
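
The convolution-net-from-scratch entry above mentions custom forward and backward passes for layers such as ReLU and softmax. The sketch below shows what such a pair of passes can look like in NumPy. It is written for illustration only and is not copied from the repository; the function names and the `(output, cache)` return convention are assumptions.

```python
import numpy as np

def relu_forward(x):
    """Forward pass: zero out negative inputs; cache x for the backward pass."""
    out = np.maximum(0, x)
    return out, x

def relu_backward(dout, cache):
    """Backward pass: gradient flows only where the input was positive."""
    return dout * (cache > 0)

def softmax_loss(scores, y):
    """Mean cross-entropy loss and its gradient with respect to the scores."""
    # Subtract the row-wise max for numerical stability before exponentiating.
    shifted = scores - scores.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    probs = exp / exp.sum(axis=1, keepdims=True)
    n = scores.shape[0]
    loss = -np.log(probs[np.arange(n), y]).mean()
    # Gradient: predicted probabilities minus the one-hot targets, averaged over the batch.
    dscores = probs.copy()
    dscores[np.arange(n), y] -= 1
    dscores /= n
    return loss, dscores

x = np.array([[-1.0, 2.0], [3.0, -4.0]])
out, cache = relu_forward(x)
dx = relu_backward(np.ones_like(x), cache)
loss, dscores = softmax_loss(np.zeros((2, 3)), np.array([0, 2]))
print(out, dx, loss)
```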

https://github.com/sudarsann27/basic_machine_learning_algorithms

Basic Machine learning algorithms using scikit-learn and other fundamental libraries

data-science data-visualization ensemble-model kaggle numpy pandas scikit-learn supervised-machine-learning

Last synced: 20 Jan 2026

https://github.com/coderjolly/employee-information-retrieval-system

An information retrieval system constructed using Flask to perform ETL operations while ingesting New York Payroll data and provide a frontend user interface for manipulating filters and queries.

etl flask mvc numpy pandas python sqlite3

Last synced: 11 Apr 2026

https://github.com/adi3042/sensor-fault-detection

🔍⚙️ Ensure Reliable Operations! Detect anomalies and prevent disruptions with our Sensor Fault Detection system. Explore advanced classification and regression techniques to identify and address sensor faults effectively. Your path to robust and accurate sensor data begins here! 🚨🔧 SensorFaultTech

classification css datetime fault-detection flask functools html ipykernel jupternotebook machine-learning numpy pandas python3 readme regression scikit-learn sensor setuptools venv

Last synced: 11 Apr 2026

https://github.com/tanmay-312/pacman-game-python

"Embark on a nostalgic journey with my Python Pac-Man game! 🎮🕹️ Crafted using pygame, tcod, numpy, and enum libraries, it's a modern twist on a classic. Navigate the maze, chomp pellets, and evade ghosts in this retro-inspired adventure! 🚀👻 #PythonGameDev #PacManRevival #ClassicGaming"

enum game game-development gamedev numpy pygame pygame-games python python-3 python3 tcod

Last synced: 22 Apr 2026

https://github.com/lorenzorottigni/ml-spam-filter

Machine Learning python bootcamp: natural language processing with spam filter purposes

machine-learning npl numpy pandas python scikit-learn seaborn spam-filtering

Last synced: 11 Apr 2026

https://github.com/alphan26/breastcancerclassification

This project involves a binary classification problem where we applied PyTorch to detect breast cancer.

matplotlib numpy pandas python pytorch

Last synced: 11 Apr 2026

https://github.com/npodlozhniy/dl-tinkoff

A repository of my homework for the MIPT master's deep learning course from Tinkoff FinTech.

deep-learning numpy pytorch

Last synced: 08 May 2026

https://github.com/muhammad-rebaal/eda-with-python-and-pandas

Explore, Analyze, and Visualize Data with Python and Pandas: Dive into the world of data analysis and visualization using Python and Pandas with this repository. Discover valuable insights, clean and preprocess your data, and create stunning visualizations to make data-driven decisions.

calmap matplotlib numpy pandas python python3 seaborn

Last synced: 07 May 2026

https://github.com/nishantksingh0/twitter-sentiment-analysis

An ML/DL-based Twitter sentiment analysis model built on pretrained Transformer sentence encodings.

kaggle-dataset matplotlib-pyplot nlp numpy pandas sklearn tensorflow transformer

Last synced: 11 Apr 2026

https://github.com/aadrianleo/book-recommendation-system

Book Recommender System using the Book-Crossing dataset. Compares content-based (TF-IDF + cosine similarity) and collaborative filtering (SVD) methods for book recommendations. Includes data cleaning, EDA, and model evaluation (Precision@5, RMSE) in Python.

artificial-intelligence collaborative-filtering content-based-filtering cosine-similarity data-preprocessing jupyter jupyter-notebook machine-learning matrix-factorization numpy pandas predictive-modeling python3 recommendation-system recommender-system scikit-learn sckit-surprise

Last synced: 10 Apr 2026
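
The book-recommendation-system entry above names cosine similarity, Precision@5, and RMSE. As a rough sketch of those three quantities in NumPy, assuming item feature vectors (for example TF-IDF rows) are already available as a matrix: the function names are hypothetical, and Precision@k is taken here as hits in the top k divided by k, which is one common definition.

```python
import numpy as np

def cosine_similarity_matrix(M):
    """Pairwise cosine similarity between the rows of M."""
    M = np.asarray(M, dtype=float)
    norms = np.linalg.norm(M, axis=1, keepdims=True)
    norms[norms == 0] = 1.0  # leave all-zero rows as zeros instead of dividing by 0
    unit = M / norms
    return unit @ unit.T

def precision_at_k(recommended, relevant, k=5):
    """Fraction of the top-k recommended items that are relevant."""
    relevant = set(relevant)
    hits = sum(1 for item in list(recommended)[:k] if item in relevant)
    return hits / k

def rmse(y_true, y_pred):
    """Root mean squared error between true and predicted ratings."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))

sim = cosine_similarity_matrix([[1, 0], [2, 0], [0, 3]])
print(sim)
print(precision_at_k([1, 2, 3, 4, 5], [2, 4, 9], k=5))
print(rmse([1, 2, 3], [1, 2, 5]))
```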

https://github.com/dmarks84/coursework_project_ml-classification

Project for IBM Data Science course on Machine Learning -- Trained ML models for classification, evaluating based on a variety of metrics

classification communication data-modeling dataframes numpy pandas python scikit-learn supervised-ml

Last synced: 11 Apr 2026

https://github.com/collins-kimotho/wrangle-and-analyze-project

Data Wrangling and Analysis Project: Analyzing WeRateDogs Twitter Account Data

explanatory-data-analysis exploratory-data-analysis matplotlib numpy pandas python twitter twitter-api

Last synced: 11 May 2026

https://github.com/shahzaibdev355/house-recommendation-backend

House Recommendation Backend is a Flask-based API that provides house recommendations based on user input. It includes endpoints to check the backend's functionality, fetch state data, and get personalized house recommendations based on criteria like price, property size, number of beds, number of baths, and state.

flask flask-api flask-cors numpy pandas scikit-learn tensorflow

Last synced: 05 Jan 2026

https://github.com/niteshchawla/yulu-hypothesistesting

Yulu has recently suffered considerable dips in its revenues. They have contracted a consulting company to understand the factors on which the demand for these shared electric cycles depends. Specifically, they want to understand the factors affecting the demand for these shared electric cycles in the Indian market.

hypothesis-testing matplot numpy pandas-library scipy-stats seaborn

Last synced: 20 Apr 2026

https://github.com/rezapace/number-generator

This project is a program that generates a sequence of numbers with a given step and appends text after each number. It supports several methods for generating the numbers, including CuPy to make use of the GPU and threading to improve performance.

colab cupy custom generator jupiter kumal link number numpy pyhon webkumal webkumal-projek

Last synced: 11 Apr 2026

https://github.com/vishal-verma-96/Honey_Production_Analysis_using_Python

In this project, I analyzed U.S. honey production from 1998 to 2021 across different U.S. cities, explored yields, and examined production values using Matplotlib and Seaborn visualizations.

matplotlib numpy pandas python-3 python-visualization seaborn

Last synced: 19 Apr 2025