An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/tnel7/data-science-yandex-practicum

Проекты из Яндекс Практикума "Специалист по Data Science"

data-science machine-learning nlp-machine-learning numpy pandas seaborn sklearn

Last synced: 19 Jan 2026

https://github.com/paulokarabyna/ml-foundations-day1

🔧 Set up a Python environment and practice linear algebra with NumPy while plotting vectors and reflections in this Day 1 ML foundations starter repository.

ai data-science education jupyter-notebook linear-algebra machine-learning matplotlib numpy terminal-mac tutorial visualization

Last synced: 18 May 2026

https://github.com/amiridk/statistics-and-algebra-in-python

simple and important statics operations and Famous statistical problems , also algebra operations

algebra itertools matplotlib numpy pandas python scipy seaborn static

Last synced: 14 Apr 2026

https://github.com/adi3042/diamond_price_prediction

🔮💎 Explore the Diamond Price Oracle! Predicting diamond values based on your preferences and market trends. Your ultimate tool for informed decisions in the diamond world. Join us for the future of pricing! 💎✨🚀 #DiamondPredictor

anaconda datetime diamond flask functools html ipykernel jupyter-notebook matplotlib numpy pandas price-prediction readme regressor sckiit-learn setuptools venv

Last synced: 14 Apr 2026

https://github.com/badranalyst/time-series-analysis-of-global-trends-in-diet-gym-and-finance

This project analyzes global trends in diet, gym, and finance over time using time series data. The analysis is performed using Python libraries like Pandas, Matplotlib, and Seaborn to visualize trends and identify patterns in these sectors across various countries.

data-analysis dataset matplotlib-pyplot numpy pandas python seaborn time-series

Last synced: 14 Apr 2026

https://github.com/psgebeline/b2_plotter

A simple Python package for use by Belle II analysts to make various plots out of pandas dataframes and numpy arrays.

jupyter-notebook matplotlib matplotlib-pyplot numpy pandas pandas-dataframe python uproot visualization-tools

Last synced: 19 Jan 2026

https://github.com/ljadhav25/false-alarm-detection

The data was first pre-processed and analysis libraries like Numpy and Pandas were used to make it ready to be utilized by a machine learning algorithm.Problems like standard scaling, categorical data and missing values were handled with appropriate techniques. Then, we used LogisticRegression model.

flask flask-api logestic-regression numpy pandas pkl pycharm-ide python

Last synced: 19 Jan 2026

https://github.com/salma-mamdoh/the-android-app-market-on-google-play-project

My project aims to practice Data Analysis and Data Visualization on DataCamp

data-analysis data-visualization datacamp jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/akshay-kamath/anomaly_detection

This project addresses the challenge of identifying anomalies in sales data, aiming to optimize sales strategies by uncovering deviations from expected profit patterns.

anomaly anomaly-detection cblof flask isolation-forest knn matplotlib numpy pandas pyod python sklearn

Last synced: 19 Jan 2026

https://github.com/alisonmitchell/titanic

Exploration of a subset of the Titanic passenger manifest to create a predictive classification model to determine which passengers were more likely to survive.

deep-learning keras machine-learning matplotlib numpy pandas python scikit-learn scipy seaborn tensorflow

Last synced: 14 Apr 2026

https://github.com/priyanshulathi/air-quality-index-prediction

Machine learning based air quality index prediction using environmental and pollutant data to classify and forecast pollution levels.

machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 19 Jan 2026

https://github.com/massimilianovisintainer/rock-vs-mine-prediction

Rock vs. Mine classification using Logistic Regression

machine-learning model numpy pandas sklearn

Last synced: 21 Jan 2026

https://github.com/sweta2501/ml

Consist of ML projects based on Python along with DataSheets

machine-learning matplotlib-pyplot numpy pandas python seaborn sklearn sklearn-model

Last synced: 14 Apr 2026

https://github.com/ramchaik/cinebrain

CineBrain uses ML & NLP to analyze movies and recommend similar ones based on user preferences through cosine similarity.

cosine-similarity flask htmx kaggle machine-learning movie-recomendation-system nlp nltk numpy pandas sklearn tailwind tmdb tmdb-api unsupervised-learning vectorization

Last synced: 19 Jan 2026

https://github.com/omcodewizard/nasa-battery-analysis

This repository contains code for analyzing battery data from NASA's battery testing dataset. The analysis involves processing battery impedance, electrolyte resistance, and charge transfer resistance across charge/discharge cycles to track the aging and performance of various batteries.

battery-impedence nasa-battery-dataset numpy pandas plotly python

Last synced: 09 May 2026

https://github.com/soumyapro/parkinson-disease-prediction

This project predicts Parkinson's disease using machine learning models.

logistic-regression numpy pandas scikit-learn svc xgboost

Last synced: 19 Jan 2026

https://github.com/charancholaveti/imdb-sentimentanalysis

IMDB Movie Review Sentiment Analysis-It will give the sentiment of the review whether it is Positive or Negative

deeplearning keras numpy rnn-tensorflow tensorflow

Last synced: 14 Apr 2026

https://github.com/ricardorobledo/paymentcardfrauddetection2025

Comparative analysis of probabilistic classification models for credit card fraud detection, focusing on model calibration and threshold optimization in highly imbalanced datasets.

imbalanced-learn matplotlib numpy pandas python3 scikit-learn search

Last synced: 14 Apr 2026

https://github.com/chaakshay/heartdrive

A Streamlet-based tool that analyzes cardiovascular health data, predicts population risk using ML, and suggests targeted government actions like awareness campaigns, health checkups, and policy changes.

csv mathplotlib numpy pandas pandas-dataframe python scikit-learn seaborn streamlit

Last synced: 05 Apr 2026

https://github.com/dhruvrajsinhzala24/house-price-predictor-ml-model

A machine learning project to predict house prices using linear regression and random forest models. The project leverages housing data to train and evaluate models, comparing their performance using metrics like MSE and R². Visualizes actual vs. predicted prices to assess model accuracy.

linear-regression machine-learning matplotlib ml numpy pandas python random-forest-regression sklearn

Last synced: 14 Apr 2026

https://github.com/sabbir45ali/cam-board

This is Python based project which is a virtual board and we write there using our finger and clear it and color changing.

numpy pandas python3

Last synced: 14 Apr 2026

https://github.com/dwija12903/ai-lab

A collection of practical implementations from my AI Labs course

keras numpy sckit-learn tensorflow

Last synced: 19 Jan 2026

https://github.com/ashutosh65000/invisible-cloak

Have you watched the Harry potter movie? The Invisible Cloak scene pushed me to make this small code.

image-processing numpy opencv-python

Last synced: 19 Apr 2026

https://github.com/jeffson-dsilva/smartphone-price-prediction

Smartphone Price Prediction is a machine learning project that predicts the price category of a smartphone based on its features. This project provides insights into how various specifications, such as RAM, battery capacity, and processor speed, influence the pricing of smartphones.

dataanalysis machine-learning matplotlib modelbuilding numpy pandas price-prediction python seaborn smartphone

Last synced: 14 Apr 2026

https://github.com/zahediparsa/nlp_ta_uok

This repository contains exercises designed for students in the Natural Language Processing (NLP) course at the University of Kurdistan, taught by Dr. Fatemeh Daneshfar. The course took place from February 2024 to July 2024.

cbow jupyter-notebook matplotlib nlp nltk numpy pandas python tf-idf

Last synced: 19 Jan 2026

https://github.com/jonad/quora_w2v

In-domain word embedding using Quora dataset.

gensim-word2vec matplotlib numpy pandas python3

Last synced: 11 Apr 2026

https://github.com/angelarreola/ai_notes

Notas de la materia "Inteligencia Artificial" para su posterior extraccion mediante algun modelo de lenguaje que nos permita dar respuestas personalizadas con base a la informacion presente en este repositorio.

ai matplotlib numpy pandas phaserjs python scikit-learn

Last synced: 21 Jan 2026

https://github.com/chandkund/sms-spam-detection

The goal is to develop a classification model that can accurately differentiate between spam and non-spam messages. This is crucial for applications like email filtering, SMS spam detection, and improving overall user experience by reducing the influx of unwanted or malicious content.

matplotlib nlp-machine-learning numpy pandas seaborn stemming tfidf-vectorizer tokenization

Last synced: 19 Jan 2026

https://github.com/jsimell/sleepanalysis

A Python data analysis project analyzing the sleep quality affecting factors and temporal patterns in the sleeping data of a single subject.

data-analysis matplotlib numpy pandas python scikit-learn seaborn

Last synced: 14 Apr 2026

https://github.com/achronus/data-graphing-tool

A tool for finding the perfect graph that fits your CSV data.

data-visualization matplotlib numpy pandas python3

Last synced: 13 Oct 2025

https://github.com/szymon-budziak/real_estate_house_prices_prediction

Predicting real estate house prices using various machine learning algorithms, including data exploration, preprocessing, model training, and evaluation.

data-analysis data-preprocessing data-science eda jupyter-notebook machine-learning matplotlib numpy optuna pandas predictive-modeling price-prediction python random-forest regression scikit-learn seaborn

Last synced: 21 Jan 2026

https://github.com/shubham-bendre/football-clip-analysis

This project uses YOLO and Roboflow datasets to analyze football matches, detecting players, referees, and the ball while tracking movements in real-time.

deep-learning matplotlib numpy roboflow-dataset yolov5

Last synced: 17 May 2026

https://github.com/abhishekyadav915/multiple_disease_prediction_using_ml

Developed a web-based application for predicting the likelihood of multiple diseases using machine learning models with accuracy up to 95%.

colab-notebook logistic-regression numpy pandas python3 sklearn spyder-python-ide streamlit-web streamlitcloud svm-classifier

Last synced: 05 May 2026

https://github.com/polyee99/kaggle-titanic-data-analytics

Jupiter notebook to predict the outcome of passengers who died or not in the tragical Titanic event.

data eda jupiter-notebook matplotlib numpy pandas python regression-analysis test-train-split visualization

Last synced: 05 Feb 2026

https://github.com/avik-pal/kaggle-titanic

Predicting whether a given set of people survive on the Titanic

machine-learning numpy pandas scikit-learn scikitlearn-machine-learning

Last synced: 14 Apr 2026

https://github.com/hansalemaos/locate_pixelcolor_cython_multi_auto_compile

Detects colors in images 8 x faster than Numpy / Uses Cython - returns a dict

bot c color colors colour colours cpp cython fast find multiprocessing numpy numpy-array opencv rgb rgba search

Last synced: 24 Feb 2026

https://github.com/grand-27-master/fingers-tap

A real-time multi-finger tap detection system

mediapipe numpy opencv python3

Last synced: 14 Apr 2026

https://github.com/ananupam/eda-titanic

An Exploratory Data Analysis project done on Titanic Dataset.

numpy pandas python seaborn

Last synced: 14 Apr 2026

https://github.com/rohanrony19/movie-recommendation-system

This is a python project where using Pandas library we will find correlation and give the best recommendation for movies.

data-analysis deep-learning knn-algorithm numpy pandas python recommendation-system

Last synced: 14 Apr 2026

https://github.com/pedrosfaria2/analisetitulosnetflix

Estudo de popularidade dos filmes da Netflix no IMDB.

analise-de-dados data-analysis jupyter-notebook matplotlib numpy pandas python

Last synced: 14 Apr 2026

https://github.com/5hraddha/ice-video-games-sales-analysis

Analysis of video game sales data of an online store Ice, which sells video games all over the world & identify patterns that determine whether a game succeeds or not.

data-visualization matplotlib numpy pandas requests-module scipy seaborn

Last synced: 14 Apr 2026

https://github.com/gregoritsch3/ml_eda_classification_diabetes

An EDA and Machine Learning Classification exercise on the Diabetes dataset demonstrating the use of SQLAlchemy data import from an SQL database (PostgreSQL), Pre-processing Pipelines, ANOVA, 9 ScikitLearn ML models, Hyperparamter Tuning for the best performing one, and feature importance.

anova machine-learning matplotlib numpy pandas pipelines scikit-learn seaborn sql sqlalchemy statistics

Last synced: 14 Apr 2026

https://github.com/hossein-cod/parking-space-detection

Identifying empty spaces in the parking lot with openCV

cv2 jupyter-notebook numpy opencv pickle python

Last synced: 14 Apr 2026

https://github.com/mizcausevic-dev/embedding-drift-graph

Track how entity embeddings drift across encoder model versions. SQLite store + Strawberry GraphQL API. Cosine drift events computed automatically on every record. Reference impl for RAG/eval pipelines re-encoding their corpus.

ai-governance drift-detection embeddings graphql llm numpy python rag sqlite strawberry-graphql vector-search

Last synced: 01 Jun 2026

https://github.com/ljubogdan/solar-cycle-lstm

This project predicts sunspot activity using an LSTM model for time series data. Built with TensorFlow and Keras, it uses Huber loss for outlier handling and MAE for performance evaluation. The dataset, sourced from Kaggle or SIDC, spans over 270 years of monthly sunspot data.

conv1d huber-loss-regression kaggle keras lstm machine matplotlib numpy pandas scikit-learn seaborn solar sunspots tensorflow time-ser

Last synced: 13 Apr 2026

https://github.com/architj6/gesture-control-system

This Gesture Control System allows users to control various system functions (like mouse movement, volume adjustment, and screen brightness) using hand gestures through a webcam. It leverages MediaPipe for hand tracking and PyAutoGUI for controlling the system. The project enables intuitive, hands-free control for everyday tasks.

brightness-control cursor cursor-control doubleclick drag-and-drop gesture handgesture handgesture-recognition mediapipe numpy opencv os pyautogui pycaw python python3 scroll volume-control

Last synced: 17 Apr 2026

https://github.com/anish37860/python_party

Python summer party by Interview Master & in partnership with DataCamp

jupyter numpy pandas python

Last synced: 06 May 2026

https://github.com/mjul/scipy-lab

Scientific computation with Python

matplotlib numpy python scipy

Last synced: 04 May 2026

https://github.com/alansteinbarth/irysy

🪻 Irysy 🌼 eksploracyjna analiza danych – wizualizacje, ML, transparentny kod i autorskie grafiki. Projekt łączący AI, estetykę i praktyczne umiejętności!

ai classification clean-code data-science exploratory-data-analysis feature-engineering github-pages iris-dataset jupyter-notebook machine-learning matplotlib modern-ui numpy open-source pandas portfolio-project python reproducible-research seaborn visualization

Last synced: 17 Oct 2025

https://github.com/mbarbetti/hello-snakemake

Simple implementation of snakemake to sample and plot data

hello-world matplotlib numpy python snakemake

Last synced: 14 Apr 2026

https://github.com/deepcloudlabs/dcl702-2021-jul-12

DCL-702: Data Analytics using Python

data-analytics numpy pandas python3

Last synced: 05 May 2026

https://github.com/sunilvarma-l/liverdiseaseprediction

"Streamlit app to predict liver disease risk using a machine learning model based on patient input data."

machine-learning matplotlib numpy pandas pickle python scikit-learn seaborn streamlit

Last synced: 13 Apr 2026

https://github.com/neelays/xor-xnor_neural_network

NumPy neural network to approximate XOR/XNOR

numpy

Last synced: 15 May 2026

https://github.com/farhad-here/predict_student_performance

Predict Student Performance, is a data analysis and machine learning project aimed at predicting students' final performance (g3) based on demographic, family, and academic features. The project supports both Regression (predicting exact grades) and classification (Pass/Fail categories).

classification data-analysis data-visualization linear-regression machine-learning numpy pandas postgresql powerbi scikit-learn streamlit

Last synced: 14 Apr 2026

https://github.com/sharif-minhaz/rag-system

Ingest and vectorize content upon publication, store embedding, then retrieve and augment user queries with context to generate high-quality responses.

faiss flask mysql2 nodejs numpy rag react transformers

Last synced: 14 Apr 2026

https://github.com/navindafernando/lane-detection

Lane Detection with Image Feature Extraction Techniques :aerial_tramway:

image-feature-extraction kernel-filters machine-learning neighborhood numpy opencv sobel-filter

Last synced: 13 Apr 2026

https://github.com/saivarun2611/potato_disease_classifier

This project is a Potato Disease Classifier that uses a Convolutional Neural Network (CNN) to classify potato leaves as healthy, suffering from early blight, or late blight. The model helps in early detection of diseases, which can be crucial in maintaining crop health and yield.

cnn keras matplotlib numpy pandas python3 tensorflow

Last synced: 14 Apr 2026

https://github.com/icepanorama/internship-visualizations-and-demonstrations

A collection of some of the programs that I've written over the course of my internship.

artificial-intelligence machine-learning matplotlib numpy pandas python3 pytorch scikit-learn

Last synced: 14 Apr 2026

https://github.com/samir-atra/experiments_and-_improvements

A repository showing the knowledge in some of the machine learning technologies there are.

dcgan-tensorflow numpy tensorflow-tutorials tensorflow2

Last synced: 23 Jan 2026

https://github.com/shuyib/pimadlkeras

Using keras specify-compile-fit- predict workflow on this binary classification problem to investigate if i'll get better predictions.

diabetes hidden-layers keras-neural-networks matplotlib-figures model-validation numpy pima-indians-dataset prediction predictive-modeling python-3-5 seaborn

Last synced: 15 Mar 2026

https://github.com/mlicamele/neural-network

Project focused on exploring the computations behind neural networks by building one from scratch with only numpy and testing it with the MNIST dataset.

gradient-descent matrix-computations neural-networks numpy python

Last synced: 12 Apr 2026

https://github.com/chirindaopensource/search_benford_law_compatibility

End-to-End Python scalable forensic accounting toolkit implementing Benford's Law analysis for FTSE financial data. Delivers automated anomaly detection with Chi-Squared/MAD testing, comprehensive validation pipelines, and risk-based prioritization of investigative resources. Replicates Ausloos et al.'s (2025) methodology with full reproducibility.

academic-research anomaly-detection benfords-law chi-squared-test data-validation econometrics financial-analysis financial-data forensic-accounting fraud-detection ftse goodness-of-fit jupyter-notebook numpy pandas python reproducible-research risk-management scipy statistical-testing

Last synced: 12 Apr 2026

https://github.com/kostadinlambov/bitcoin-and-stock-market-correlation

This study uses a quantitative research design to analyze the relationship between Bitcoin prices and the stock market over the past five years with the S&P 500 Index serving as a proxy for the stock market.

bitcoin data-science jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python scipy-stats seaborn sp500-data-analysis

Last synced: 09 Apr 2026

https://github.com/bilgenurbekar/turkishcyberbullying

Contains fine-tuned BERT models and results in the text classification category using Turkish social media data

bert-fine-tuning huggingface-transformers matplotlib numpy pandas python pytorch scikit-learn transformers

Last synced: 07 Mar 2026

https://github.com/atul-maurya-30/galaxy

Galaxy Classification is a machine learning project focused on classifying galaxies into two subclasses: 'STARFORMING' and 'STARBURST'. This project demonstrates data preprocessing, model training, and evaluation using advanced machine learning techniques and Python libraries.

flask machine-learning matplotlib numpy pandas python regression-classification seaborn sklearn

Last synced: 09 Mar 2026

https://github.com/apbetioli/neural-network

A simple neural network implementation

machine-learning neural-network numpy python3

Last synced: 29 Apr 2026

https://github.com/luliatuccu/weather_analysis

This project highlights a combination of data science techniques and Python programming to explore real-world weather data.

data-preprocessing eda feature-engineering machine-learning matplotlib numpy pandas regex scikit-learn seab seaborn weather weather-patterns

Last synced: 02 Apr 2026

https://github.com/yashaka/learncv

Toolset used in learning Computer Vision at R_D course

cv numpy opencv utitlities

Last synced: 14 Apr 2026

https://github.com/hellymodikalpesh/numpy

This repository is designed as a comprehensive resource for learning the NumPy library. It covers nearly all functions and features of NumPy, providing a thorough exploration of its capabilities for numerical computing and data analysis.

jupyter-notebook numpy

Last synced: 28 Jan 2026

https://github.com/miasanromen/math-modeling

Math modeling with Python. Statistics, math, graphics

math matplotlib numpy probality-theory scipy-stats statistics

Last synced: 14 Apr 2026

https://github.com/adriantomin/bulldozer-price-prediction

Predicting the Sale Price of Bulldozers Using Machine Learning 🚜💰 This project uses machine learning to predict bulldozer sale prices based on historical data from the Kaggle Bluebook for Bulldozers competition. The goal is to minimize the RMSLE between actual and predicted prices.

data-science jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn

Last synced: 23 Jan 2026

https://github.com/hafaio/homeotopy

A python library for computing homeomorphisms between some common continuous spaces

homeomorphism numpy

Last synced: 24 Feb 2026

https://github.com/ggrbill/phd-plot-scripts

My personal plot scripts used to generate graphs for my PhD Thesis

hacktoberfest matplotlib numpy python

Last synced: 18 Apr 2026

https://github.com/rezasharifi82/lesson_extractor

Extract all possible "معارف" courses @ Ferdowsi University of Mashhad ,FUM

beautifulsoup4 ipynb jupyter-notebook numpy pandas python3 regex

Last synced: 06 Feb 2026

https://github.com/asut00/Machine-Learning-Program_42AI

Comprehensive Machine Learning path by 42AI: hands-on modules on regression, gradient descent, and real-world ML applications.

linear-regression machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 27 Oct 2025

https://github.com/harmanveer2546/heart

Predicting the presence of heart disease based on several health-related factors and Performing - i.) Data Cleaning ii.) Data Pre-Processing iii.) EDA iv.) Compare 5 different classification algorithms (Logistic Regression, Decision Tree, Random Forest, KNN and SVC)

data-preprocessing decision-tree eda knn logistic-regression machine-learning numpy pandas random-forest roc-auc-curve svc

Last synced: 03 May 2026

https://github.com/lavkalsi/creditcardfrauddetector

Credit Card Fraud Detector is a React web app that predicts if a credit card transaction is fraudulent using a Python machine learning model. Users can input transaction data, and Flask facilitates communication between the backend and frontend. Backend files are located in the res folder. This app provides simple UI for user interaction.

fraud-detection machine-learning nodejs numpy pandas python reactjs sklearn-library

Last synced: 15 Apr 2026

https://github.com/alejandro945/insurance-risk

This project aims to predict the risk of insurance claims using a dataset from Kaggle. The dataset consists of 26 columns and 205 rows, providing various features related to insurance risk. By analyzing this data, we seek to build predictive models that can help insurers assess the risk of claims.

data-analytics ipython-notebook numpy pandas python

Last synced: 06 Feb 2026

https://github.com/ambrish8/bell-s-palsy

BELL’S PALSY SEVERITY DETECTION USING DEEP LEARNING

cnn-architecture imagedatagenerator keras numpy opencv python pytorch resnet-50

Last synced: 15 Apr 2026

https://github.com/andres29ac/website_heatmaprisk

Mapa hecho con Streamlit y Folium para mostrar la distribución delictiva de cada distrito de Trujillo mediante Mapas de Calor.

folium-maps heatmap-visualization numpy pandas ploty python stadistics streamlit

Last synced: 15 Apr 2026

https://github.com/guy1998/synthetic-data-generator-for-event-cameras

Automated workflow for the generation of synthetic data that can be used as an input for event cameras based on previously taken videos

computer-vision event-cameras numpy opencv synthetic-data

Last synced: 15 Apr 2026

https://github.com/hansalemaos/cv2_stack_images

Stacks 2 images (horizontal/vertical)

concat cv2 images numpy python stack

Last synced: 28 Jan 2026