An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/murugavl/forage_tata_genai_powered_data_analytics

This repository showcases my work from the TATA GenAI Powered Data Analytics Virtual Experience on Forage, where I applied data cleaning, analysis, and visualization techniques to derive actionable business insights.

excel genai matplotlib numpy pandas powerbi python

Last synced: 09 Feb 2026

https://github.com/carterbox/libimage

Provides large (2k) test images as NumPy arrays.

images numpy python

Last synced: 15 Apr 2026

https://github.com/baschin1103/neural-network-building-in-python

This repository contains a program with a pretrained artificial neural network from Building AI. The network predicts the price of a cabin based on its attributes.

bias building-ai forward-pass functioning identity linear-combinations neural-network numpy python regression relu

Last synced: 16 Apr 2026

https://github.com/foggysmile/ml_labs_itmo

Machine learning lab assignments

bert decision-trees jupyter-notebook machine-learning numpy pandas random-forest xgboost

Last synced: 11 Feb 2026

https://github.com/puureya2/catalogue-price-scraper

Scrape and store product prices with Selenium and BeautifulSoup4. Visualize the data with Matplotlib. Personal project.

beatifulsoup4 chrome-webdriver matplotlib numpy pandas python python-csv selenium webscraping

Last synced: 16 Apr 2026

https://github.com/cherukuri-thanu/conjoint-analysis-netflix

This repository contains configuration files for performing conjoint analysis on Netflix survey data.

conjoint-analysis logistic-regression matplotlib-pyplot netflix numpy pandas squantify statsmodels

Last synced: 16 Apr 2026

https://github.com/mustafaozvardar/image-classification-webapp

This project features an AI model for diagnosing pneumonia from chest X-ray images. Users can upload a chest X-ray image through the app, and the model will classify it.

artificial-intelligence computer-vision keras numpy python streamlit streamlit-webapp tensorflow

Last synced: 27 Feb 2026

https://github.com/adriantomin/heart-disease-prediction

Predicting Heart Disease with ML 🏥🤖 Machine learning model to predict heart disease using clinical data. Includes data analysis, feature engineering, and model training.

machine-learning matplotlib numpy pandas python sklearn

Last synced: 12 Feb 2026

https://github.com/karthikarajagopal44/numpy-beginner-to-advanced

This repository is designed to help you master NumPy, the fundamental package for scientific computing with Python. Whether you are just getting started or looking to refine your skills with advanced techniques, this repo covers a comprehensive range of topics and functionality, from beginner to advanced.

data-science numpy numpy-library numpy-tutorial python

Last synced: 16 Apr 2026

https://github.com/walkerdustin/python-linear-regression

Simple linear regression in Python, based on Andrew Ng's Coursera course

ai linear-regression machine-learning ml numpy python

Last synced: 05 May 2026

https://github.com/bharatsharma07/ipl-analysis

This project provides insights into the IPL's historical data, helping to identify trends, player performance, and team strategies.

data-visualization ipl-data-analysis numpy pandas python

Last synced: 16 Apr 2026

https://github.com/arserchung/ml_cnn

Basic CNN introduction and a project for recognizing cats and dogs

glob keras keras-models numpy opencv os sklearn tensorflow

Last synced: 13 Feb 2026

https://github.com/harmonicode/signal_equalizer

Signal Equalizer is a Python-based application designed to process and analyze various types of signal data, including audio and ECG signals. It provides functionalities for reading, processing, visualizing, and enhancing signal data using techniques such as Wiener filtering.

audio-processing numpy pyqt5 scipy signal-processing wiener-filter

Last synced: 13 Feb 2026

https://github.com/maia-bastos/estimativas

Probability, sampling, and estimation project using real data (IBGE).

numpy pandas python3 scipy-stats

Last synced: 16 Apr 2026

https://github.com/lexxai/goit_python_ds_hw_09

Module 9. Neural network hyperparameter tuning. Deep learning. TensorFlow. Keras.

adam-optimizer data-science google-colab keras keras-tensorflow numpy python

Last synced: 16 Apr 2026

https://github.com/vgvr0/analisis-de-datos-con-streamlit-numpy-pandas-y-matplotlib

A complete system for analyzing and visualizing film data that provides detailed insights into movies, including financial analysis, ratings, trends over time, and a recommendation system. Built with Python and Streamlit, it offers an interactive, user-friendly interface for exploring movie data.

matplotlib numpy pandas plotly plotly-dash recommendation-system streamlit

Last synced: 16 Apr 2026

https://github.com/codersales/machine-learning-classification

machine learning jupyter notebooks | data-science | priority | relevant | significant | green-light | 1 | may-2023-filtered | may-2023-filtered-2 | may-2023-filtered-3 | filtered-4 | frequent

authorized classification current decision-tree ensemble-techniques jupyter machine-learning more-than-100-commits more-than-300-commits numpy pandas python3 ranked repository-5 seaborn sklearn stacking sub-critical supervised-learning workstation

Last synced: 07 Mar 2026

https://github.com/killervardhan8/gesturedecode

The Sign Language Interpretation project focuses on recognizing and interpreting hand gestures to facilitate communication for individuals who use sign language. This project leverages computer vision and machine learning techniques to accurately identify and translate hand signs into text

csv mediapipe numpy python tensorflow

Last synced: 28 Feb 2026

https://github.com/awaisfraaz/python-libraries

A repository dedicated to exploring and experimenting with Python’s main libraries, featuring sample code, documentation, and practical examples for learning and reference.

matplotlib numpy pandas python seaborn

Last synced: 15 Feb 2026

https://github.com/nagar2nd/ml-regressionmodel---cardekho-price-prediction

This repository features a machine learning model for predicting used car prices using data from CarDekho.com. The project leverages exploratory data analysis and regression techniques to empower sellers and buyers with actionable insights in the Indian used car market.

analytics cleaning-data data linear-regression machine-learning matplotlib numpy pandas python seaborn

Last synced: 16 Apr 2026

https://github.com/hafidaso/predicting-industrial-machine-downtime-level-3

This project aims to develop a predictive model using machine learning techniques to forecast machine failures based on historical operational data.

imbalanced-learning numpy pandas python scikit-learn seaborn xgboost

Last synced: 16 Apr 2026

https://github.com/pramodyasahan/health-insurance-cost-prediction

This project focuses on predicting health insurance costs using a polynomial regression model. By employing machine learning techniques in Python, the project aims to accurately estimate insurance costs based on various personal attributes. The model takes into account several features, including age, sex, BMI, number of children, smoking status, etc.

machine-learning matplotlib numpy pandas python3 scikit-learn

Last synced: 16 Apr 2026

https://github.com/5hraddha/sda-megaline-the-best-plan

Megaline is a telecom operator and it offers its clients two prepaid plans, Surf and Ultimate. The commercial department wants to know which of the plans brings in more revenue in order to adjust the advertising budget.

data-visualization hypothesis-testing matplotlib numpy pandas scipy seaborn statistical-data-analysis

Last synced: 16 Apr 2026

https://github.com/hygull/trypandas

A repository of Python pandas and other data-science sources and resources for working with pandas, NumPy, Matplotlib, etc.

matplotlib numpy pandas python3

Last synced: 16 Apr 2026

https://github.com/urvee1810/air-quality-prediction-using-arima

Analyzed Beijing's air quality data using time series analysis and ARIMA modeling to forecast PM2.5 pollution levels. Identified seasonal patterns and correlations between pollutants, revealing insights into urban air quality trends and potential contributing factors.

arima-model dickey-fuller-test matplotlib numpy pandas python seaborn statistical-analysis time-series-analysis

Last synced: 16 Apr 2026

https://github.com/costrouc/xnd-notebooks

A collection of notebooks that demonstrate xnd, ndtypes, and gumath features

binder gumath ndtypes notebooks numba numpy xnd

Last synced: 16 Apr 2026

https://github.com/magnusrodseth/disaster-tweets

📚 Assignments in the course IT3212 - Data Driven Software at NTNU. Our task is to classify whether a tweet is related to a disaster or not.

adaboost jupyter-notebook logistic-regression numpy pandas python python3 random-forest support-vector-machines xgboost

Last synced: 16 Apr 2026

https://github.com/ashwin331133/liver_disease_detection

This dataset consists of 416 liver patient records and 167 non-liver patient records collected from North East of Andhra Pradesh, India. The main objective of this project is to use classification algorithms to distinguish liver patients from healthy individuals.

data-visualization machine-learning numpy pandas python

Last synced: 16 Apr 2026

https://github.com/bzubs/mlzero

Implementation of widely used ML algorithms in vanilla Python

machine-learning machine-learning-algorithms machinelearning numpy python statistics

Last synced: 03 Mar 2026

https://github.com/khaymanii/calories-burnt-prediction-model

This model was built using Python and the XGBoost regression algorithm

matplotlib numpy pandas python scikit-learn

Last synced: 16 Apr 2026

https://github.com/anas436/student-performance-analysis

In this project I built a machine learning system that analyzes student performance based on academic records. Note that this project works with any student records you provide.

data-analysis jupyter-notebook matplotlib numpy pandas python3 seaborn

Last synced: 16 Apr 2026

https://github.com/abhishekpsonawane07/loan-repayment-prediction-ml

Loan Approval Prediction uses machine learning to analyze factors like income and credit history, automating and improving the loan approval process. The project ensures accurate, efficient, and user-friendly predictions with a Gradio interface.

artificial-neural-networks confusion-matrix decision-trees gradio-interface k-nearest-neighbours loan-dataset loan-prediction-analysis loan-repayment logistic-regression numpy pandas python random-forest sckiit-learn seaborn support-vector-machines

Last synced: 16 Apr 2026

https://github.com/thekartikeyamishra/customer-retention-predictor

The Customer Retention Predictor is a Python-based tool designed to help businesses predict customer churn using historical data. This project is particularly beneficial for small businesses and MSMEs in India, allowing them to identify customers at risk of leaving and take proactive measures to retain them.

joblib machine-learning numpy pandas python scikit-learn tinker

Last synced: 16 Apr 2026

https://github.com/iankitnegi/python_projects

Data Analyst Toolkit: A comprehensive collection of Python scripts and notebooks designed for data analysis tasks. Features data cleaning, visualization, statistical analysis, and machine learning models. Ideal for analysts seeking efficient, reproducible workflows.

matplotlib numpy pandas python seaborn

Last synced: 20 Apr 2026

https://github.com/mugambi645/exploring-ebay-car-sales-data

Exploring ebay car sales dataset

car-sales data-analysis numpy pandas

Last synced: 16 Apr 2026

https://github.com/moonmoonsamal/meesho_order_financial_analysis

Generating insights from Meesho sales data (Oct-Nov)

cleaning-dataset eda numpy pandas visualization

Last synced: 16 Apr 2026

https://github.com/sahiltiwariiii/dssp

Predicting student math scores! This project utilizes advanced machine learning techniques and MLOps tools like DVC and MLflow to predict a student's math score based on various factors such as gender, race/ethnicity, parental level of education, lunch type, test preparation course, writing, etc.

docker dotenv dvc flask machine-learning mlflow mlops mysql mysql-connector-python numpy pandas pymysql python python-dotenv scikit-learn seaborn sklearn-library statistics streamlit

Last synced: 27 Mar 2026

https://github.com/shreeparab1890/indian-cricketer-classifier

This notebook builds a model that identifies an Indian cricketer from a given image. The project covers 8 Indian cricketers and builds a model to classify a given image among these 8 cricketers.

image-classification matplotlib numpy opencv pandas python random-forest-classifier scikit-learn sklearn streamlit

Last synced: 01 Apr 2026

https://github.com/rahulchouhan1/spotify-most-popular-songs-data-analysis

🎵 Spotify Songs Analysis using Pandas

matplotlib numpy pandas

Last synced: 16 Apr 2026

https://github.com/shriansh8619/eda_customer_behavior

This project analyzes Nielsen transaction data using Python to uncover sales trends, customer preferences, and purchasing patterns. It provides insights to optimize inventory, create targeted promotions, and improve store performance. The goal is to help supermarkets enhance strategies and boost profitability based on data-driven insights

matplotlib numpy pandas python seaborn

Last synced: 16 Apr 2026

https://github.com/jay6430/python-for-data-science

This repository includes all the Python concepts necessary for data science.

csv data-visualization datascience-machinelearning html-css ipynb-jupyter-notebook matplotlib numpy pandas python3 xml

Last synced: 04 Mar 2026

https://github.com/miozilla/treebasedmodel

treebasedmodel 🌲✈️💃: Invistico Airline | Tree-based Modeling, Machine Learning (ML), Predictive, Decision Making, GridSearchCV, sklearn

accuracy decisiontreeclassifier exemplar experience f1-score flight hyperparameter logistic-regression matplotlib model numpy pandas plot-tree precision pyplot recall seaborn selection train-test-split tuning

Last synced: 03 Jun 2026

https://github.com/kodemartin/np-pd-portfolio

Portfolio of numpy, pandas, and matplotlib solutions

matplotlib numpy pandas

Last synced: 17 Apr 2026

https://github.com/neerajcodes888/a-novel-used-car-price-prediction-model-based-on-lindenoise

Welcome to the LinDenoise Repository! LinDenoise offers a smart solution for cleaning noisy data in regression tasks. Integrated seamlessly within the widely-used scikit-learn framework, it effortlessly enhances data quality while improving predictive accuracy

car-price-prediction deep-learning ipynb-notebook machine-learning numpy pandas python3 visualization

Last synced: 06 Mar 2026

https://github.com/kheriberto/knn_project

This is a simple project that uses dummy data to practice and demonstrate my knowledge of the KNN algorithm.

data-analysis knn-classifier numpy python scikit-learn seaborn

Last synced: 02 Apr 2026

https://github.com/nathadriele/ifood-data-governance-pipeline

This project demonstrates a complete data governance solution focused on quality, traceability, security, and LGPD compliance. It uses modern technologies such as Streamlit, Airflow, dbt, and Pydantic to implement a functional, interactive ecosystem with a data governance dashboard.

airflow dashboard data-analysis data-catalog data-engineering data-governance data-quality data-visualization dbt ifood lgpd matplotlib numpy observability-data pandas pipeline pyspark redis seaborn streamlit

Last synced: 02 Apr 2026

https://github.com/satyas567/weatherdataanalysis

Comprehensive Weather Data Analysis with Python: Explore trends, visualize patterns, detect outliers, and predict temperature using humidity and wind speed

jupyter-notebook linear-regression matplotlib numpy pandas python scikit-learn seaborn

Last synced: 02 Apr 2026

https://github.com/offthetab/ml_homework

Practical assignments in data analysis, MIREA, 3rd year.

ml numpy pandas python shap sklearn xgboost

Last synced: 02 Apr 2026

https://github.com/pranavv34/customer-segmentation

Predictive modeling and customer segmentation project using neural networks to forecast sales and categorize customers for targeted marketing in the online retail sector.

keras matplotlib numpy pandas python seaborn sklearn tensorflow

Last synced: 02 Apr 2026

https://github.com/niteshchawla/clustering-ml

Analyzing the vast data of learners can uncover patterns in their professional backgrounds and preferences, allowing Scaler to make tailored content recommendations and provide specialized mentorship.

cluster-analysis clustering hierarchical-clustering k-means-clustering machine-learning numpy pca-analysis visualisation

Last synced: 17 Apr 2026

https://github.com/prajapatsur/data-analytics-1

I wanted to revise the basic concepts of data cleaning, transformation, and analysis, so I worked on this project. I used a little help from an AI tool to implement some new concepts.

matplotlib numpy pandas sklearn

Last synced: 17 Apr 2026

https://github.com/prashver/end-to-end-model-deployment-on-aws

Student Performance Analysis with Machine Learning analyzes factors impacting student outcomes using a robust machine learning pipeline. Achieving an impressive R2 score, it predicts student performance effectively. With extensive data preprocessing and deployment on AWS Elastic Beanstalk, it ensures scalability and high availability.

amazon-web-services aws-elastic-beanstalk end-to-end-deployment flask machine-learning-algorithms matplotlib numpy pandas scikit-learn seaborn

Last synced: 02 Apr 2026

https://github.com/cosmicsid004/air_draw

Digital drawing....

handtracking mediapipe numpy opencv

Last synced: 30 Apr 2026

https://github.com/christiemj/clock-drift-simulation

A Monte Carlo simulation in Python to model vacuum-induced frequency drifts in atomic clocks, with code and visuals (experimental setup, simulated vs. real-world comparison, and drift graph), as described in "Probing Vacuum-Induced Clock Drifts via Quantum Metrology: A Testable Hypothesis" (DOI: 10.5281/zenodo.15163879). Licensed under GPL 3.0.

allan-variance atomic-clocks cesium-ion clock-drift matplotlib monte-carlo-simulation numpy open-science pandas physics precision-metrology python quantum-metrology research-software scipy simulation vacuum-fluctuations vacuum-noise

Last synced: 17 Apr 2026

https://github.com/vishal-verma-96/oil_spill_prediction_using_ml

This ML project predicts oil spills using various machine learning algorithms like XGBoost and Random Forest. The project also covers saving and loading the model to make predictions on a sample dataset.

data-science machine-learning-algorithms machine-learning-classification matplotlib model-building-and-evaluation numpy pandas pickle python-3 scikitlearn-machine-learning xgboost

Last synced: 17 Apr 2026

https://github.com/quinnphil/math-for-programmers

Work-along code for the Math for Programmers book by Paul Orland

geometry mathematics matplotlib numpy python

Last synced: 17 Apr 2026

https://github.com/shhiivvaam/regression

Linear Regression Model

linear-regression numpy python

Last synced: 10 May 2026

https://github.com/noluyorabi/bravais-pearson-interactive

Bravais-Pearson Interactive – A web app for visualizing Pearson correlation. Generate datasets, analyze correlations, and explore regression lines, r², and p-values interactively.

data-visualization interactive-analysis matplotlib numpy pearson-correlation scipy seaborn statistics streamlit

Last synced: 17 Apr 2026

https://github.com/datamallo/python101

The resources you need to learn or get better at Python. Covering the basics of Python3, NumPy and Pandas for

numpy pandas python3

Last synced: 18 Apr 2026

https://github.com/yassin522/music-genre-classification

This repository is dedicated to the Music Genre Classification project, an endeavor that leverages machine learning techniques to accurately classify music into various genres

catboost matplotlib numpy pandas python sklearn

Last synced: 04 Apr 2026

https://github.com/jass024/bitcoin-historical-analysis

Performed comprehensive time-series analysis on 14+ years of Bitcoin historical data using Python (Pandas, NumPy, Plotly). Calculated key technical indicators (SMA, EMA, RSI, volatility) to identify market trends, potential anomalies, and areas of regulatory concern. Visualized results using interactive Plotly charts.

numpy pandas python-3

Last synced: 18 Apr 2026

https://github.com/arnoldchrisoduor1/potato_disease

Trained a CNN to detect the presence and the current stage of Blight disease in potatoes, WebApp in FastAPI and React

fastapi keras numpy react serving-tensors tensorflow

Last synced: 04 Apr 2026

https://github.com/gregoritsch3/ml_clustering_eda_customersegmentation

An EDA and Machine Learning Clustering exercise on the Mall Customer Segmentation synthetic dataset demonstrating the use of KMeans Clustering and the Elbow Method. The clustering algorithm successfully segments the customer base into groups distinguishable by their annual income and spending score.

clustering kmeans-clustering machine-learning matplotlib numpy pandas scikit-learn scipy seaborn

Last synced: 04 Apr 2026

https://github.com/abdul-rafay19/california-housing-price-prediction

This project predicts California housing prices using machine learning regression models, including Random Forests and Decision Trees. It covers data preprocessing, exploratory analysis, model training, and hyperparameter tuning to optimize performance.

decision-trees gridsearchcv linear-regression matplotlib numpy pandas python random-forest randomsearch-cv scikit-learn scipy seaborn

Last synced: 04 Apr 2026

https://github.com/yasinefeee/parkspotter_pretest_enviroment

The ParkSpotter project is designed to detect the occupancy status of parking spots in a simulation environment. Using a toy model, a camera system, and a machine learning model, this system identifies whether a parking space is EMPTY or NOT EMPTY in real-time.

ai-systems classifier computer-vision cv2 numpy opencv parking-spot-detection parking-spots python sckit-image sckit-learn simulation-environment svm svm-classifier

Last synced: 18 Apr 2026

https://github.com/jsnevt/regressao_linear

Simple linear regression: what was the speed, given that it took distance x to stop?

linear-regression matplotlib numpy pandas python

Last synced: 18 Apr 2026

https://github.com/hariprasath-v/machinehack-analytics-olympiad-2022

Create a machine learning model to help an insurance company understand which claims are worth rejecting and the claims which should be accepted for reimbursement.

catboost-classifier exploratory-data-analysis logloss machinehack numpy optuna pandas python scikit-learn shap

Last synced: 18 Apr 2026

https://github.com/ricardorobledo/spamemailclassification

Spam email classification using machine learning (Random Forest, SVC, Logistic Regression, etc.) with data balancing techniques (SMOTE, BorderlineSMOTE, ADASYN). Final calibrated Random Forest model achieves ROC-AUC 0.982 and PR-AUC 0.979 on the Spam Email Classification dataset.

imbalanced-data imbalanced-learning numpy pandas python3 sklearn

Last synced: 05 Apr 2026

https://github.com/dyceszn/i.duruohacos102

First-year deep dive into problem-solving with Python—featuring class practices and projects using NumPy, Pandas, Tkinter, and psycopg2.

numpy psycopg2 python tkinter

Last synced: 18 Apr 2026

https://github.com/iv4n-ga6l/titanic-survival-prediction

A predictive model in Python that answers the following question: “Using data relating to passengers (name, age, gender, socio-economic class, etc.), who has the greatest chance of surviving?”

matplotlib numpy pandas python seaborn sklearn

Last synced: 18 Apr 2026

https://github.com/jai0212/ai-virtual-painter

Real-time finger and object painting using advanced computer vision technology with custom hand gestures and customizable colours

artificial-intelligence cmake computer-vision cpp mediapipe numpy opencv python

Last synced: 05 Apr 2026

https://github.com/perpendicooler/elementary-research-for-steamboat-willie-s-store-in-poland

Elementary research for a company opening a store in a city, using Gurobi and PuLP optimization.

christofides-algorithm gurobipy numpy pandas pulp python3 scikit-learn travelling-salesman-problem

Last synced: 05 Apr 2026

https://github.com/lorenzorottigni/ml-movies

Machine Learning Python bootcamp: Recommender Systems on movies dataset

ipynb machine-learning numpy pandas python recommender-system scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/paulo-santos-ds/visao_computacional

The system uses artificial intelligence to analyze visitors' documents and physical characteristics, ensuring that only those over 18 can access the gaming areas

keras numpy pandas plotly python restnet50 sklearn tensorflow

Last synced: 05 Apr 2026

https://github.com/faraazarsath/guvi-task-4

This repository contains Python scripts for assessing and categorizing student performance data from two CSV files. The tasks include categorizing students based on their CodeKata scores.

data-visualization matplotlib numpy pandas

Last synced: 18 Apr 2026

https://github.com/matiassenia/galaxypy

The application loads a grayscale image and uses the points of lowest luminosity (the darkest ones) to draw colored lines that create a galaxy-like structure.

numpy python

Last synced: 18 Apr 2026