An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/soumyapro/wine-quality-prediction

This project is about the prediction of wine quality using machine learning algorithms

boxplot matplotlib numpy pandas random-forest smote

Last synced: 10 Apr 2026

https://github.com/mnitin-reddy/collaborative-filtering-based-recommendation-system

This project is a Book Recommendation System that uses two main approaches: Popularity-Based and Collaborative Filtering. It recommends top books based on their rating frequency and average ratings, and also provides personalized book suggestions by analyzing user interactions.

collaborative-filtering numpy pandas popularity-based-recommendation python recommendation-system scikit-learn

Last synced: 11 Apr 2026

https://github.com/yash-rewalia/airbnb_eda_pandas

The goal of the project is to gather information and analyze the detailed information of the different entries in order to provide insights about the host and price of the property in a particular area as per your preference , type of rooms and number of reviews accordingly.

data data-cleaning data-insights data-preprocessing data-visualization matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/germanocastanho/python-notes

Personal collection of Python Notes, tips, and code snippets 🐍

data-science numpy pandas programming python

Last synced: 01 Sep 2025

https://github.com/hansalemaos/hexarray2decimal

Converts a numpy string array with hex values to int

convert hex int numpy python

Last synced: 05 May 2026

https://github.com/shivam5509/power-bi-project

Expert in creating interactive dashboards and reports using Power BI, utilizing 10+ visual tools like cards, slicers, and charts. Skilled in cleaning and transforming large datasets with Power Query Editor. Proficient in advanced DAX functions (SUMX, FILTER, CALCULATE) to derive insights and drive data-driven decisions.

advanced-excel computer-science data-analysis data-mining data-visualization engineering mysql numpy pandas powerbi pyhton3 sql sql-server

Last synced: 11 Apr 2026

https://github.com/utkarsh251106/cricket-shot-analyzer

Real-time cricket shot analyzer using Python, OpenCV, and MediaPipe. Processes videos frame-by-frame, overlays pose and biomechanical metrics, and outputs an annotated video with JSON evaluation. Also the output video might have "??" in it which is there cause OpenCV can't display degree's symbol.

artificial-intelligence computer-vision deep-learning machine-learning mediapipe numpy python real-time

Last synced: 05 May 2026

https://github.com/armahdavi/qff-evalation_code-data-processing-statistics-plotting

Data pipelines and processing codes, statistical modellings, descriptive statistics, and plot visualizations for QFF evaluation phase of for Mahdavi et al. (2021) (Environmental Pollution) Project Miestone: 2018 - 2021 Full-length article: https://www.sciencedirect.com/science/article/abs/pii/S0269749120370779

data-science data-visualization histogram matplotlib matplotlib-pyplot numpy pandas python

Last synced: 11 Apr 2026

https://github.com/zuhairzia/titanic-survival-project

This is a Titanic Survival Prediction Model developed using Python, Pandas, Scikit-learn, and Jupyter Notebook. The model predicts whether a passenger survived the Titanic disaster based on features such as age, gender, and passenger class.

csv-dataset flask jupyter-notebook matplotlib numpy pandas pandas-library python scikit-learn seaborn streamlit

Last synced: 11 Apr 2026

https://github.com/djdhairya/crop-recommendation

Crop Recommendation System is a powerful tool for enhancing agricultural decision-making. By leveraging data-driven insights, it empowers farmers to maximize yield and ensure sustainable practices.

adaboostclassifier bagging-classifier csv decision-trees gaussian html knn-classification logistic-regression machine-learning machine-learning-algorithms matplotlib model numpy pandas random-forest random-forest-classifier scikit-learn seaborn svc

Last synced: 11 Apr 2026

https://github.com/armahdavi/analytics-data-pipelines-statistics-plotting---dust-extraction-hvac-filters---phase-1

PhD Technical Paper 1 - Phase 1 - Mahdavi & Siegel (2020) (Aerosol Science & Technology; AS&T) - Sharing all the data pipelines, processing codes, descriptive statistics, statistical modellings, and plotting/visualizations - Project Miestone: 2017 - 2020 - Full-length article is available

matplotlib numpy pandas pandas-dataframe pyplot python scipy-stats sklearn

Last synced: 13 Apr 2026

https://github.com/alexixrugis/perceptronvisualization

Visualization of training and operation of a perceptron written from scratch in numpy

ai machine-learning numpy python

Last synced: 11 Feb 2026

https://github.com/mehradi-github/ref-jupyter-2510

using python in machine learning

matplotlib numpy pandas python sklearn statistics

Last synced: 11 Apr 2026

https://github.com/dhanish03/credit_card_fraud_detection

Developed and implemented an advanced CCFDS using ML algorithms and pattern recognition techniques. Integrated real-time monitoring and adaptive learning capabilities into the system to dynamically adjust fraud detection parameters, ensuring effectiveness in identifying emerging fraud patterns.

kaggle-dataset numpy pandas-dataframe python3 sklearn

Last synced: 16 Apr 2026

https://github.com/iamsaniasingh/heart_disease_prediction

This is my very first machine learning project, where I used a supervised learning algorithm—logistic regression—to predict heart disease. The model was trained and tested entirely on a pre-existing dataset, with no user input involved. The goal was to understand how ML models work and how they can be applied in healthcare predictions.

logistic-regression machine-learning machine-learning-algorithms numpy pandas python sklearn

Last synced: 11 Apr 2026

https://github.com/lucasgleria/seamese-network-algorithm

Este projeto implementa um sistema de busca por similaridade de imagens usando redes siamesas e Triplet Loss em PyTorch. Ele gera embeddings de imagens (MNIST com EfficientNet-B0) para encontrar visuais semelhantes. O foco está na análise visual e no aprendizado de representações no espaço vetorial.

google-colab matplotlib numpy pandas python pytorch timm

Last synced: 11 Apr 2026

https://github.com/abrarshahok/electric-vehicle-charging-station-energy-consumption-prediction

With the rapid adoption of electric vehicles, optimizing energy usage at charging stations has become crucial for improving operational efficiency and ensuring customer satisfaction. This tool leverages predictive modeling to forecast energy consumption for charging sessions based on various input features.

matplotlib numpy pandas plotly python3 scikit-learn xgboost

Last synced: 09 Jun 2026

https://github.com/riju18/from-data-production-to-client-handover

The common tedious problem is to build a data app to demonstrate the data analysis & analytics along with Machine Learning to a client. It was an attempt to do it on small scale in the most powerful & simplest way.

machine-learning matplotlib numpy pandas plotly python seaborn streamlit

Last synced: 30 Apr 2026

https://github.com/ksharma67/anomaly-detection-on-temperature-device-failure

A typical anomaly detection task and performing KMeans, PCA, Gaussian distribution, and Isolation Forest.

eda ellipticenvelope feature-engineering gaussian-distribution isolation-forest kmeans-clustering numpy pca python sklearn

Last synced: 11 Apr 2026

https://github.com/ngangawairimu/data-validation-using-python

Agricultural dataset validated using python code for usage. Building a data pipeline that will ingest and clean data with the press of a button.

jupyter-notebook numpy pandas pytest python

Last synced: 13 Apr 2026

https://github.com/charles-l/rayboi

a raytracer written in futhark/python

futhark numpy pathtracing python3 raytracing

Last synced: 19 Apr 2026

https://github.com/vishnu-vamshii/heart-disease-prediction-using-ml

This project presents an end-to-end data analysis and machine learning pipeline for predicting heart disease using a publicly available dataset. The project includes data exploration, visualization, and implementation of various machine learning models to predict the likelihood of heart disease based on a set of clinical attributes.

machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 11 Apr 2026

https://github.com/talapanenivarshithchowdary/asteroid-detection-ml

This project uses Machine Learning to detect and classify asteroids based on trajectory and size, aiding in Near-Earth Object detection and planetary defense.

classification data-science decision-trees jupyter-notebook knn logistic-regression machine-lea matplotlib numpy pandas pillow prediction python3 random-forest scikit-learn

Last synced: 11 Apr 2026

https://github.com/nishi1612/knight-tour-problem

IT485 Logic of Inference Project on Knight's Tour. A hamiltonian path problem to determine path of knight to travel entire chessboard with every cell visited only once.

backtracking-algorithm bootstrap flask html knights-tour localhost numpy pygame python tkinter warnsdorff

Last synced: 11 Apr 2026

https://github.com/arthurdsant/dataanalysis-agricultural_raw_material

This Python project performs analysis and visualization of agricultural raw material price data using a Kaggle dataset. Based on Jupiter Notebook and Python.

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 26 Jan 2026

https://github.com/swarnabhaghosh/house-price-prediction-model

Built an end-to-end regression pipeline to predict house prices using Linear Regression with automated preprocessing (PowerTransform, StandardScaling) via Scikit-learn's Pipeline and ColumnTransformer.

column-transformer linear-regression matplotlib-pyplot numpy pandas pipeline python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/mgitrov/lifespan-x-population-status

A machine learning project aiming to predict animals' lifespan and population status.

bs4 data-science machine-learning matplotlib numpy pandas python regular-expressions requests seaborn sklearn

Last synced: 11 Apr 2026

https://github.com/chintanboghara/rocket-simulation

A comprehensive web-based orbital mechanics simulator with advanced mission planning, real-time tracking, and educational features.

docker flask html javascript numpy plotly python

Last synced: 11 Apr 2026

https://github.com/eduardoprofe666/mn-api

🐍📦 Paquete de Python con implementaciones de métodos numéricos

mn-api numerical-methods numpy pandas python scipy simpy tabulate

Last synced: 04 Jan 2026

https://github.com/kahngjoonkoh/randomshapegenerator

A program that will generate images with random shapes and background colours. Can be customized and generated in bulk.

generative-art numpy opencv python threading tkinter

Last synced: 11 Apr 2026

https://github.com/allanreda/telco-customer-churn-predictor-app

A web-based machine learning application that predicts customer churn using a logistic regression model. Built with Scikit-Learn for model training, Gradio for the user interface, and deployed on Google Cloud App Engine. The app allows users to input customer data and receive predictions on churn risk to support business decision-making.

app-engine data-visualization deployment google-cloud gradio hyperparameter-tuning logistic-regression machine-learning numpy pandas scikit-learn

Last synced: 16 Apr 2026

https://github.com/erikbrinkman/hilbert-bytes

A python library for converting between d-dimensional points and indices on a hilbert curve

hilbert-curve numba numpy python

Last synced: 08 May 2025

https://github.com/udhaya2823/dataspark-illuminating-insights-for-global-electronics

✨DataSpark✨ is a powerful analytics project transforming raw retail data into actionable insights for Global Electronics. By leveraging Python, SQL, and interactive visualizations, it uncovers trends in customer behavior, sales performance, and product popularity, driving smarter business decisions and boosting growth.

data-science data-visualization database-management datacleaning exploratory-data-analysis matplotlib numpy pandas powerbi python seaborn sql version-control

Last synced: 11 Apr 2026

https://github.com/andersoncrs/prediccion_precio_vehiculos_statsmodels

Este proyecto utiliza un modelo de regresión lineal para predecir el precio de vehículos basándose en sus características principales. El análisis incluye la definición del problema, exploración y limpieza de datos, conversión de variables categóricas a numéricas, evaluación de correlaciones y entrenamiento del modelo.

analisis-de-datos analisis-exploratorio-de-datos matplotlib numpy seaborn statsmodels visualizacion-de-datos

Last synced: 26 Apr 2026

https://github.com/shwetapardhi/assignment-03-q1--hypothesis-testing

Q1.A F&B manager wants to determine whether there is any significant difference in the diameter of the cutlet between two units. A randomly selected sample of cutlets was collected from both units and measured? Analyze the data and draw inferences at 5% significance level. Please state the assumptions and tests that you carried out to check validit

hypothesis-testing numpy p-value pandas python scipy significance-testing stats t-test

Last synced: 11 Apr 2026

https://github.com/emhash/numpy-tutorial

Just simple tasks of practice of NumPy library of python

numpy numpy-library numpy-tutorial python

Last synced: 11 Apr 2026

https://github.com/lijesh010/ml_project_data_preprocessing

The main objective of this project is to design and implement a robust data preprocessing system that addresses common challenges such as missing values, outliers, inconsistent formatting, and noise. By performing effective data preprocessing, the project aims to enhance the quality, reliability, and usefulness of the data for machine learning.

data-cleaning data-exploration data-preprocessing machine-learning numpy pandas-python python scikit-learn

Last synced: 11 Apr 2026

https://github.com/arn4/labtools

Package with useful scripts for Physics Laboratory Courses

numpy python3 scipy uncertainties

Last synced: 05 Jan 2026

https://github.com/alexsolov28/ml_course

Курс "Технология машинного обучения"

colab-notebooks jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn

Last synced: 05 Apr 2025

https://github.com/matiassenia/modelo-prediccion-temperatura

Proyecto de machine learning que utiliza TensorFlow para predecir temperaturas en grados Fahrenheit a partir de temperaturas en grados Celsius. Incluye un script de Python para entrenar el modelo y realizar predicciones, así como datos de entrenamiento y visualizaciones

adam-optimizer machine-learning numpy python python3 tensorflow

Last synced: 11 Apr 2026

https://github.com/kkufieta/carnd-vehicle-detection

Vehicle Detection & Tracking - Self Driving Cars

cv2 hog-features numpy self-driving-car sklearn vehicle-detection

Last synced: 11 Apr 2026

https://github.com/isabelacaldeira/chutelibre

Crashing into code. Here is a physics problem about free fall solved with python!

freefall jupyter-notebook matplotlib numpy physics physics-simulation python3

Last synced: 11 Apr 2026

https://github.com/cfbastarz/jupyternotebooks

A collection of several Jypyter notebooks.

dask matplotlib numpy python xarray xesmf

Last synced: 18 Jan 2026

https://github.com/chanmeng666/advanced-neural-network-applications

Practical implementations of perceptron and linear neuron models for classification and regression, with mathematical analysis and visualizations in Jupyter notebooks.

classification data-analysis data-science educational gradient-descent jupyter-notebook linear-neuron machine-learning matplotlib neural-network neural-networks numpy perceptron python regression

Last synced: 03 May 2026

https://github.com/sofiakhutsieva/analytical_report_for_hr_department

Подготовка аналитического отчета (SQL) (Проект в skillbox), 2020 г.

analytics matplotlib numpy pandas psycopg2 seaborn sqlalchemy

Last synced: 06 May 2026

https://github.com/yoavram/numpycharm

Some examples on using PyCharm with Scientific Python

cython numpy pycharm python scientific-computing training-materials

Last synced: 11 Apr 2026

https://github.com/anubagre/predicting-price-of-pre-owned-cars

This project is solution to data science problem based on regression model to predict the price of pre owned cars. Programming language used is Python.

data-science machine-learning numpy pandas price-prediction python seaborn sklearn streamlit

Last synced: 11 Apr 2026

https://github.com/ikbalcaus/HandSketch

Drawing on Canvas with Hand Gestures + AI for Letter Recognition

mediapipe numpy ocr-recognition opencv python pytorch tkinter

Last synced: 31 Mar 2025

https://github.com/harmanveer-2546/wafer-fault-detection

The goal is to eliminate manual work in identifying faulty wafers. Opening and handling suspected wafers disrupts the entire process. False negatives result in wasted time, manpower, and costs.

clustering data-transformation feature-selection machine-learning matplotlib numpy pandas python random-forest roc-auc-curve roc-auc-score seaborn sklearn svc xgboost

Last synced: 11 Apr 2026

https://github.com/akhileshthite/india-population

ML (simple linear regression) model for predicting India's population.

machine-learning numpy pandas python scikit-learn

Last synced: 09 Apr 2026

https://github.com/biasandrade/rede-neural-do-zero-classificacao-binaria

🧠 Rede Neural Artificial construída do ZERO (só NumPy) para detectar fraudes em transações | Forward/Backward Propagation | Sigmoid | Gradiente Descendente

backpropagation binary-classification deep-learning fraud-detection from-scratch gradient-descent machine-learning neural-network numpy python

Last synced: 18 Nov 2025

https://github.com/guru-bharadwaj20/saliency-maps

A saliency-guided autonomous driving system using cmSalGAN heatmaps, a custom Fusion model, and Faster R-CNN detection. It generates refined saliency maps, trains a saliency-aware detector, and simulates vehicle navigation using datasets like NJUD, NLPR, and STEREO.

cmsalgan numpy opencv python pytorch torchvision

Last synced: 18 Nov 2025

https://github.com/bhuvan-s-prasad/streamlit-regression

A machine learning project that predicts housing prices in California using regression techniques. This project includes comprehensive exploratory data analysis, feature engineering, linear regression modeling, and an interactive Streamlit web application for making predictions.

california-housing-price-prediction exploratory-data-analysis linear-regression machine-learning matplotlib numpy pandas python scikit-learn seaborn streamlit supervised-learning

Last synced: 11 Apr 2026

https://github.com/chirindaopensource/non_linear_forecasting_backcasting

Python implementation of Gourieroux-Jasiak's (2025) mixed causal-noncausal VAR models. Features probabilistic forecasting, nonlinear innovation filtering, and state-dependent IRF analysis for financial time series with explosive dynamics. Enables robust risk assessment and structural analysis of speculative behavior.

bootstrap bubble-analysis financial-modeling forecasting jupyter-notebook monte-carlo nonlinear-modeling numpy pandas python quantitative-finance research-implementation risk-management scipy statistical-modeling time-series uncertainty-quantification var-models

Last synced: 11 Apr 2026

https://github.com/xiaohaoo/machine-learning-examples

机器学习使用不同框架的常见模型的例子

keras matplotlib numpy python pytorch sklearn tensorflow

Last synced: 11 Apr 2026

https://github.com/mashalk1/computer-vision-attendance-system

A computer vision based attendance system, using cv2, firebase, numpy, cvzone and face_recognition library in python

computer-vision cv2 cvzone face-recognition numpy

Last synced: 04 Feb 2026

https://github.com/eljandoubi/trading-with-momentum

A trading strategy based on a momentum indicator.

momentum numpy pandas scipy trading trading-algorithms trading-strategies

Last synced: 05 May 2026

https://github.com/mborne/numpy-experiments

[DEV] Some basic examples written to get started with numpy and pytorch

autograd basics get-started least-squares numpy torch

Last synced: 05 May 2026

https://github.com/parbhat-cpp/suicidal-ml

A machine learning/NLP-based system to identify signs of suicidal ideation from user text inputs.

bash cicd classification docker fastapi githubactions jinja2 jupyter-notebook machine-learning natural-language-processing nlp numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/chaudharypraveen98/lungcancerdetection

To distribute the work of doctors and process the large amount of data to produce accurate results on the go

numpy pandas pillow python scipy tenserflow

Last synced: 16 Apr 2026

https://github.com/jo-minseok/global-warming-100year

🌡️ 2100년까지의 지구 기온, 해수면, 북극 빙하, 탄소 예측 ML [완료]

arima-model global-warming machine-learning matplotlib numpy pandas scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/gkannan-codes/habitableexos

With Earth’s habitability under strain, we ask: which known exoplanets could humans live on? Using NASA’s Exoplanet Archive, we score planets 0–1 (1 ≈ Earth) from five Earth-normalized features to rank top candidates.

data html kaggle matplotlib-pyplot numpy pandas plotly python seaborn visualization

Last synced: 11 Apr 2026

https://github.com/ashishsingh789/traffic_accident_analysis

Analyzed traffic accident data to identify patterns related to road conditions, weather, and time of day. Visualized accident hotspots using heatmaps and bar plots.

excel matplotlib numpy pandas pandas-dataframe python seborn

Last synced: 11 Apr 2026

https://github.com/tolumie/exploratory-data-analytics-projects

Exploratory Data Analytics – A collection of projects covering data exploration, feature engineering, hypothesis testing, and predictive modeling across diverse datasets, including insurance, real estate, laptops, cars, COVID-19, and the Olympics.

data-analysis data-visualization data-wrangling exploratory-data-analysis-eda feature-engineering hypothesis-testing machine-learning matplotlib numpy pandas predictive-modeling python seaborn statistical-analysis

Last synced: 11 Apr 2026

https://github.com/anibalalpizar/python-machine-learning-example

This code reads and preprocesses a dataset for classification using pandas, numpy, matplotlib and scikit-learn. The dataset is split into three parts for training, validation and testing. The data is then scaled and optionally oversampled for balanced classes.

machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/hitthecodelabs/weatherforecast-lstm

TensorFlow and Python for analyzing and forecasting weather data

cnn keras lstm matplotlib numpy pandas python tensorflow weather weather-forecast

Last synced: 11 Apr 2026

https://github.com/aciluca/credit-approval-system

A machine learning application to predict credit card approval using Python, TensorFlow, and Docker.

data-science keras machine-learning numpy pandas python sklearn tensorflow

Last synced: 11 Apr 2026

https://github.com/f-ssemwanga/advanced-programming-york

This repository holds the extensive python programming completed during the advanced programming unit of my master's programme

comprehension csv-parser json lambda-functions list matplotlib mongodb numpy oops-in-python pandas-dataframe postgresql python3 tkinter-gui

Last synced: 11 Apr 2026