An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/ngangawairimu/automating_farming_analysis

This project automates farming in Maji Ndogo, analyzing soil fertility, climate, and geography to recommend optimal crop locations. Python-based data analysis uncovers patterns that guide farming practices.

matplotlib-pyplot numpy pandas python seaborn sqlite

Last synced: 10 Apr 2026

https://github.com/nihalrt/lung-cancer-data-analysis-

This is a data analysis report created with Python, pandas, NumPy, Matplotlib, and seaborn. It provides insights into the symptoms of lung cancer from a survey data set.

matplotlib numpy pandas python seaborn

Last synced: 26 Jan 2026

https://github.com/rooneyrulz/cat-dog-classifier

Building a convolutional neural network model that classifies custom cat vs. dog image data well, including visualizations, data preprocessing, future predictions, model evaluation, identifying overfitting, and more.

artificial-intelligence convolutional-neural-networks data-preprocessing data-visualizations deep-learning keras matplotlib numpy pandas predictive-modeling python3 seaborn tensorflow2

Last synced: 10 Apr 2026

https://github.com/sgb31/csv-retrival-based-rag

Created a simple RAG system that lets users upload any CSV file and ask questions about it. It reads the data, finds the most relevant info, and uses an LLM to answer the query. Super handy for quickly exploring and understanding data through natural language.

generative-ai llm numpy pandas pydantic python3 sentence-transformers sklearn streamlit transformers

Last synced: 19 Jun 2026

https://github.com/rlxchap2/crypto-miner

🔨 Crypto Miner is a Python project designed to encrypt and decrypt files, especially images, using the cryptography library.

crypto cryptography csv numpy pillow python

Last synced: 08 May 2026

https://github.com/smirnovlad/data-science-notebooks

A collection of various data analysis approaches

data-science deep-learning kaggle machine-learning numpy pandas pytorch

Last synced: 10 Apr 2026

https://github.com/amruta33/housing_price_prediction

Consider a real estate company that has a dataset containing the prices of properties in the Delhi region. It wishes to use the data to optimise the sale prices of the properties based on important factors such as area, bedrooms, parking, etc.

linear-regression numpy pandas python rfe vif

Last synced: 28 Apr 2026

https://github.com/vijaykumarr1452/startup_success_predictor

This project demonstrates the use of Multiple Linear Regression to predict the profits of startups based on investment in R&D, Administration, and Marketing, using the 50_Startups.csv dataset.

machine-learning multi-linear-regression numpy pandas python regression rsquare-values scikit-learn

Last synced: 10 Apr 2026

https://github.com/jol79/python_exercises

Solving interesting python exercises on different topics

matplotlib-pyplot numpy pandas python3 pythonexercises scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/jkosla/neural_network_from_scratch_numpy

Neural Network From Scratch in Python | Build a simple neural network from scratch using pure Python and NumPy. Learn about forward propagation, backpropagation, and training with gradient descent. Accompanies my Medium article.

ai aritificial-intelligence medium nerual-networks numpy python3 tutorial

Last synced: 10 Apr 2026

https://github.com/munawar-code/car_price_predictor

This project is an ML-based car price prediction system. The model is built in a Jupyter Notebook for training and evaluation, while a simple one-page website was developed in PyCharm to provide an interface for users to input car details and get price predictions.

datapreprocessing datavisualization exploratory-data-analysis feature-engineering flask-application html-css-javascript linear-regression machine-learning-algorithms matplotlib numpy pandas python scikitlearn-machine-learning

Last synced: 13 Apr 2026

https://github.com/ryan-bendelson/2024-summer-research

This is Python code that I worked with during my summer 2024 research project involving quantum physics.

density-matrices kronecker-product linear-algebra miniconda3 numpy numpy-arrays partial-trace python quantum-information

Last synced: 16 Apr 2026

https://github.com/aneeshmurali-n/project-ml-data-preprocessing

The main objective of this project is to design and implement a robust data preprocessing system that addresses common challenges such as missing values, outliers, inconsistent formatting, and noise. By performing effective data preprocessing, the project aims to enhance the quality, reliability, and usefulness of the data for machine learning.

data-analysis data-cleaning data-encoding data-exploration feature-scaling label-encoding matplotlib minmaxscaler numpy one-hot-encoding outlier-detection pandas standardscaler

Last synced: 02 May 2026

https://github.com/sahil210695/gradient-descent

A simplified explanation of gradient descent for linear regression in python using numpy

gradient-descent gradient-descent-algorithm linear-regression matplotlib mini-batch-gradient-descent numpy python stochastic-gradient-descent

Last synced: 03 May 2026
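As a rough illustration of the technique named in the entry above (not code taken from that repository), batch gradient descent for a one-variable linear regression can be sketched in NumPy as follows. The function name `fit_linear_gd`, the learning rate, the epoch count, and the synthetic data are all assumptions made for this example.

```python
import numpy as np


def fit_linear_gd(x, y, lr=0.1, epochs=2000):
    """Fit y ~ w * x + b by batch gradient descent on mean squared error."""
    w, b = 0.0, 0.0
    n = len(x)
    for _ in range(epochs):
        error = (w * x + b) - y                 # residuals for the current parameters
        grad_w = (2.0 / n) * np.dot(error, x)   # d(MSE)/dw
        grad_b = (2.0 / n) * error.sum()        # d(MSE)/db
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


# Synthetic, noise-free data on the line y = 3x + 2 (illustrative only).
x = np.linspace(0.0, 1.0, 50)
y = 3.0 * x + 2.0
w, b = fit_linear_gd(x, y)
```

The stochastic and mini-batch variants listed in the entry's tags would compute the same gradients on a random subset of the samples at each step instead of on the full arrays.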

https://github.com/mogankumar/virtual-mouse

A Python-based Virtual Mouse that uses hand gestures for cursor control, clicking, scrolling, and taking screenshots. Powered by OpenCV, PyAutoGUI, and a custom hand tracking module for a touch-free experience.

autopy numpy opencv pyautogui python3

Last synced: 10 Apr 2026

https://github.com/khaymanii/spam_mail_detection_model

This model was built using Python and the logistic regression algorithm.

matplotlib numpy pandas python sckiit-learn

Last synced: 10 Apr 2026

https://github.com/anubhavkumar31/simple-heart_disease_prediction-using-logisticregression

It's a simple yet good model that predicts whether a person has heart disease or not. This is a binary classification model, i.e., its output is either 0 (does not have heart disease) or 1 (has heart disease).

logistic-regression machine-learning numpy python sklearn sklearn-linear-model sklearn-metrics

Last synced: 10 Apr 2026

https://github.com/babagata/racunalna_fizika

Math and physics solved with python

matplotlib numpy random scipy sympy

Last synced: 10 Apr 2026

https://github.com/badranalyst/titanic-survival-prediction-full-data-science-project-classification

This project predicts Titanic survivors using classification models. It includes data cleaning, pre-processing, exploratory data analysis (EDA), categorical feature conversion, model building, and evaluation. Python libraries like Pandas, NumPy, Matplotlib, and Seaborn are used to analyze and predict survival outcomes.

classification data-analysis data-science eda exploratory-data-analysis machine-learning matplo matplotlib-pyplot ml model numpy pandas predictive-modeling python seaborn

Last synced: 06 May 2026

https://github.com/ahmedabdalkreem/connected_component_labeling

A technique for detecting small objects in an image, such as shapes and numbers. It can be used in OCR.

computer-vision connected-components matplotlib numpy object-detection python rgb2gray threshold

Last synced: 11 Apr 2026
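To make the idea in the entry above concrete, here is a minimal sketch of connected component labeling on a binary image using a breadth-first flood fill. It is not the repository's implementation; the function name `label_components`, the choice of 4-connectivity, and the sample image are assumptions for illustration.

```python
from collections import deque

import numpy as np


def label_components(binary):
    """Label 4-connected foreground regions; returns (labels, count)."""
    binary = np.asarray(binary, dtype=bool)
    labels = np.zeros(binary.shape, dtype=np.int32)
    rows, cols = binary.shape
    count = 0
    for r in range(rows):
        for c in range(cols):
            # Skip background pixels and pixels that already have a label.
            if not binary[r, c] or labels[r, c]:
                continue
            count += 1
            labels[r, c] = count
            queue = deque([(r, c)])
            while queue:
                y, x = queue.popleft()
                # Visit the up, down, left, and right neighbours.
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and binary[ny, nx] and not labels[ny, nx]):
                        labels[ny, nx] = count
                        queue.append((ny, nx))
    return labels, count


# Tiny hand-made image with two separate blobs (illustrative only).
image = np.array([
    [1, 1, 0, 0],
    [0, 1, 0, 1],
    [0, 0, 0, 1],
])
labels, count = label_components(image)
```

In an OCR setting, each labeled region would typically be cropped and passed on as a candidate character; a grayscale image would first need thresholding to produce the binary input.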

https://github.com/azaz9026/data_cleaning

Welcome to the Data Cleaning repository! This collection is dedicated to showcasing techniques and methods for cleaning and preparing datasets for analysis.

data-analysis data-engineering data-structures data-visualization eda feature-engineering machine-learning numpy outliers pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/ahmedabdalkreem/hotel-reservation

Our task is to classify a hotel reservation as either canceled (class 1) or not canceled (class 0), using more than one model to arrive at the best model.

bagging decisiontreeclassifier ensemble extra-trees-classifier logistic-regression matplotlib numpy pandas python3 random-forest sklearn-library svc-model

Last synced: 11 Apr 2026

https://github.com/alejoduarte23/reading_data_from_dewesoft

The following repository retrieves sensor data (acceleration and strains) from both local and cloud databases. It processes the data using classes from another repository called Modal Engine for spectral analysis, modal analysis, and signal processing.

dewesoft matplotlib modal-analysis numpy orm scipy signal-processing sql sqlalchemy

Last synced: 07 Jan 2026

https://github.com/matx64/birads-classifier

Breast Imaging Reporting and Data System (BI-RADS) classifier with a graphical user interface.

numpy pillow skimage sklearn tkinter

Last synced: 20 Apr 2026

https://github.com/fedesgh/parkinson_volatility_spread_on_cedears

Creating a function that returns a graph of the difference between Parkinson's volatility and regular volatility, given certain bounds.

numpy pandas pickle seaborn

Last synced: 10 Apr 2026
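For readers unfamiliar with the estimator mentioned in the entry above, the sketch below computes the Parkinson (high-low range) volatility and the usual close-to-close volatility, then takes their difference. It assumes "regular volatility" means the sample standard deviation of log returns, which the entry does not state; the function names and the price arrays are made up for illustration, and no plotting is included.

```python
import numpy as np


def parkinson_volatility(high, low):
    """Parkinson estimator: sqrt(mean(ln(H/L)^2) / (4 ln 2)), per period."""
    log_hl = np.log(np.asarray(high, dtype=float) / np.asarray(low, dtype=float))
    return float(np.sqrt(np.mean(log_hl ** 2) / (4.0 * np.log(2.0))))


def close_to_close_volatility(close):
    """Sample standard deviation of log returns, per period."""
    log_returns = np.diff(np.log(np.asarray(close, dtype=float)))
    return float(np.std(log_returns, ddof=1))


# Hypothetical daily prices, invented for this example.
high = np.array([10.4, 10.9, 10.7, 11.2, 11.5])
low = np.array([10.0, 10.3, 10.2, 10.6, 11.0])
close = np.array([10.2, 10.8, 10.4, 11.1, 11.3])

spread = parkinson_volatility(high, low) - close_to_close_volatility(close)
```

Both figures are per-period values; annualizing them would require multiplying by the square root of the number of periods per year.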

https://github.com/asghar-rizvi/youtube-statistics-project

This project analyzes a dataset of global YouTube statistics to uncover insights about YouTube channels, their ranks, and other attributes. The dataset used for this analysis was obtained from Kaggle.

data-analysis data-analysis-python data-science data-science-projects matplotlib numpy pandas pycharm-ide python seaborn

Last synced: 13 Jun 2026

https://github.com/tsungtsetu122/datamining-cifar10-classification

Data mining project on CIFAR-10 extracted features, applying preprocessing, classification models, and evaluation techniques to improve classification performance.

matplotlib numpy pandas python scikit-learn

Last synced: 10 Apr 2026

https://github.com/crvargasm/moore-penrose-matrix

A small implementation of an algorithm that is capable of computing the Moore-Penrose inverse of any matrix.

numerical-methods numpy python

Last synced: 10 Apr 2026
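One common way to compute the Moore-Penrose inverse mentioned in the entry above is through the singular value decomposition. The sketch below shows that approach for real matrices; it is not necessarily the algorithm the repository implements, and the function name `pinv_svd` and the `rtol` cutoff are assumptions for this example.

```python
import numpy as np


def pinv_svd(a, rtol=1e-12):
    """Pseudoinverse of a real matrix via SVD: V * diag(1/s) * U^T."""
    u, s, vt = np.linalg.svd(a, full_matrices=False)
    # Treat singular values below the relative cutoff as exactly zero.
    cutoff = rtol * s.max() if s.size else 0.0
    keep = s > cutoff
    s_inv = np.zeros_like(s)
    s_inv[keep] = 1.0 / s[keep]
    return (vt.T * s_inv) @ u.T


# A rank-deficient 3x2 matrix, which has no ordinary inverse.
a = np.array([[1.0, 2.0],
              [2.0, 4.0],
              [3.0, 6.0]])
a_pinv = pinv_svd(a)
```

The result can be checked against the four Penrose conditions, for example `np.allclose(a @ a_pinv @ a, a)`. NumPy also ships `np.linalg.pinv`, which is the usual choice outside of teaching code.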

https://github.com/sc0v0ne/ai-discipline-work

AI Discipline Work - Movie recommendation

jupyter-notebook machine-learning numpy pandas python python3

Last synced: 15 Apr 2025

https://github.com/soumyapro/wine-quality-prediction

This project is about the prediction of wine quality using machine learning algorithms

boxplot matplotlib numpy pandas random-forest smote

Last synced: 10 Apr 2026

https://github.com/paraskevi-kivroglou/rl-pong-agent

A project by Paraskevi Kivroglou as part of exploring deep reinforcement learning applications.

atari atari-games gym-environment gymnasium numpy python3 pytorch q-learning reinforcement-learning reinforcement-learning-agent

Last synced: 11 Apr 2026

https://github.com/yash-rewalia/airbnb_eda_pandas

The goal of the project is to gather and analyze detailed information on the different entries in order to provide insights about the host and price of properties in a particular area, according to your preferences, room type, and number of reviews.

data data-cleaning data-insights data-preprocessing data-visualization matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/germanocastanho/python-notes

Personal collection of Python Notes, tips, and code snippets 🐍

data-science numpy pandas programming python

Last synced: 01 Sep 2025

https://github.com/lilivalgo/ml_time_series

This project uses time series data to predict corn crop yield in Colombia

matplotlib numpy pandas scipy seaborn sklearn

Last synced: 11 Apr 2026

https://github.com/hansalemaos/hexarray2decimal

Converts a numpy string array with hex values to int

convert hex int numpy python

Last synced: 05 May 2026
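The conversion described in the entry above can be illustrated with plain Python's built-in `int(text, 16)` applied element by element. This is a simple baseline rather than the package's own (presumably faster) method, and the sample array is invented for the example.

```python
import numpy as np

# Hypothetical input: hex values stored as strings, with or without a "0x" prefix.
hex_strings = np.array(["ff", "0x1A", "7b"])

# int(text, 16) accepts an optional "0x" prefix and is case-insensitive.
values = np.array([int(h, 16) for h in hex_strings], dtype=np.int64)
```

For this input, `values` holds 255, 26, and 123. The loop runs in Python, so it is slow on very large arrays.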

https://github.com/pardhuu66/college-id-validator

FastAPI-based offline College ID Validator with Docker support

base64 dnn docker easyocr fastapi mobilenetv2 numpy onnx onnxruntime opencv pillow pydantic python tensorflow uvicorn

Last synced: 11 Apr 2026

https://github.com/utkarsh251106/cricket-shot-analyzer

Real-time cricket shot analyzer using Python, OpenCV, and MediaPipe. Processes videos frame-by-frame, overlays pose and biomechanical metrics, and outputs an annotated video with JSON evaluation. Note that the output video may contain "??", which appears because OpenCV can't display the degree symbol.

artificial-intelligence computer-vision deep-learning machine-learning mediapipe numpy python real-time

Last synced: 05 May 2026

https://github.com/armahdavi/qff-evalation_code-data-processing-statistics-plotting

Data pipelines and processing code, statistical modelling, descriptive statistics, and plot visualizations for the QFF evaluation phase of Mahdavi et al. (2021) (Environmental Pollution). Project milestone: 2018 - 2021. Full-length article: https://www.sciencedirect.com/science/article/abs/pii/S0269749120370779

data-science data-visualization histogram matplotlib matplotlib-pyplot numpy pandas python

Last synced: 11 Apr 2026

https://github.com/rkarahul/face-detection-using-opencv-

Build a face detection project using OpenCV and Haar cascades, which are the better choice for real-time detection.

haar-cascade-classifier machinelearning numpy pandas-library python3 tkinter

Last synced: 08 May 2026

https://github.com/zuhairzia/titanic-survival-project

This is a Titanic Survival Prediction Model developed using Python, Pandas, Scikit-learn, and Jupyter Notebook. The model predicts whether a passenger survived the Titanic disaster based on features such as age, gender, and passenger class.

csv-dataset flask jupyter-notebook matplotlib numpy pandas pandas-library python scikit-learn seaborn streamlit

Last synced: 11 Apr 2026

https://github.com/ahmed-maher77/diabetes-prediction-app-using-machine-learning

Diabetes Prediction: Using machine learning to classify individuals as diabetic or non-diabetic based on health data, enabling early intervention and improved healthcare outcomes.

ai css data-science gradientboostinclassifier javascript logisticregression machine-learning matplotlib numpy pandas python randomforestclassifier seaborn streamlit supportvectormachine webdevelopment

Last synced: 11 Apr 2026

https://github.com/dmarks84/coursework_project_apache-airflow-kafka-on-toll-booth-data

Project for IBM Data Engineering & Python course on ETL & Big Data. Reads in live toll booth data, wrangles and transforms it, and writes it into a SQL database.

apache-airflow apache-kafka automation dags data-modeling databases eda elt etl mysql numpy pandas pipelines python sql

Last synced: 11 Apr 2026

https://github.com/mehradi-github/ref-jupyter-2510

using python in machine learning

matplotlib numpy pandas python sklearn statistics

Last synced: 11 Apr 2026

https://github.com/timothyjan/intro-machine-learning-classifiers

We will use the scikit-learn library, which is a higher-level machine learning library that will work with NumPy data, and Pandas, a library that makes it easier to manipulate data. We will explore a variety of classification algorithms, and compare their performance on a “real-world” dataset, which will introduce its own set of challenges.

numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/vidushibhadana/eda-on-nyc-taxi-data

Conducting an Exploratory Data Analysis (EDA) on New York City taxi data and visualizing it through countplots, distribution plots (displot), and histograms using Python and its libraries.

data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/varkenvarken/blempy

small, safe utilities to efficiently transfer Blender property-collection attributes (e.g. vertex coordinates) to/from NumPy arrays and perform vectorized operations with minimal Python overhead.

blender numpy

Last synced: 13 Jan 2026

https://github.com/volf52/deep-neural-net

A simple deep neural net class written to work with NumPy and CuPy

binarized-neural-networks binary-neural-networks bnn cupy deep-learning deep-neural-networks mnist numpy python python3

Last synced: 05 May 2026

https://github.com/dhanish03/credit_card_fraud_detection

Developed and implemented an advanced credit card fraud detection system (CCFDS) using ML algorithms and pattern recognition techniques. Integrated real-time monitoring and adaptive learning capabilities into the system to dynamically adjust fraud detection parameters, ensuring effectiveness in identifying emerging fraud patterns.

kaggle-dataset numpy pandas-dataframe python3 sklearn

Last synced: 16 Apr 2026

https://github.com/iamsaniasingh/heart_disease_prediction

This is my very first machine learning project, where I used a supervised learning algorithm—logistic regression—to predict heart disease. The model was trained and tested entirely on a pre-existing dataset, with no user input involved. The goal was to understand how ML models work and how they can be applied in healthcare predictions.

logistic-regression machine-learning machine-learning-algorithms numpy pandas python sklearn

Last synced: 11 Apr 2026

https://github.com/lucasgleria/seamese-network-algorithm

This project implements an image similarity search system using Siamese networks and Triplet Loss in PyTorch. It generates image embeddings (MNIST with EfficientNet-B0) to find visually similar images. The focus is on visual analysis and on learning representations in vector space.

google-colab matplotlib numpy pandas python pytorch timm

Last synced: 11 Apr 2026

https://github.com/mrktsm/spam-email-recognizer

Long Short-Term Memory (LSTM) network trained to classify emails as spam or non-spam. It processes email content to make accurate predictions and can be integrated into projects for efficient spam detection and email management.

data-preprocessing keras lstm-neural-network model-architecture nltk numpy pandas performance-evaluation scikit-learn spam-classification-model tenserflow training-the-model

Last synced: 09 Apr 2026

https://github.com/anas436/python

This repository covers the Python programming language. It contains solutions to basic Python problems. If you can solve these problems, you will develop basic skills in the language and be ready to take on good projects.

jupyter-notebook jupyterlab numpy pandas pycharm python3 watson-studio

Last synced: 11 Apr 2026

https://github.com/ksharma67/anomaly-detection-on-temperature-device-failure

A typical anomaly detection task performed with KMeans, PCA, Gaussian distribution, and Isolation Forest.

eda ellipticenvelope feature-engineering gaussian-distribution isolation-forest kmeans-clustering numpy pca python sklearn

Last synced: 11 Apr 2026

https://github.com/gregoritsch3/ml_eda_clustering_aidassessment

An EDA and Machine Learning Clustering exercise on the Country Aid Assessment dataset demonstrating the use of PCA, KMeans and DBSCAN clustering, Elbow Methods, etc. The clustering algorithm successfully demarcates countries that are in the most dire need of aid based on their GDPP and Child Mortality rate.

anova dbscan kmeans machine-learning matplotlib numpy pandas pca scikit-learn seaborn statistics

Last synced: 16 Apr 2026

https://github.com/felinjob/ibm-applied-data-science-capstone

This project, part of the IBM Data Science Professional Certificate specialization, predicts the landing success of SpaceX's Falcon 9. Using data from the SpaceX API and web scraping, the project includes data analysis and machine learning to generate insights about the launches.

data-analysis data-science data-visualization ibm jupyter-notebook machine-learning numpy pandas python scikit-learn seaborn sql

Last synced: 11 Apr 2026

https://github.com/ngangawairimu/data-validation-using-python

An agricultural dataset validated with Python code before use. Builds a data pipeline that ingests and cleans data at the press of a button.

jupyter-notebook numpy pandas pytest python

Last synced: 13 Apr 2026

https://github.com/charles-l/rayboi

A raytracer written in Futhark/Python

futhark numpy pathtracing python3 raytracing

Last synced: 19 Apr 2026

https://github.com/vishnu-vamshii/heart-disease-prediction-using-ml

This project presents an end-to-end data analysis and machine learning pipeline for predicting heart disease using a publicly available dataset. The project includes data exploration, visualization, and implementation of various machine learning models to predict the likelihood of heart disease based on a set of clinical attributes.

machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 11 Apr 2026

https://github.com/talapanenivarshithchowdary/asteroid-detection-ml

This project uses Machine Learning to detect and classify asteroids based on trajectory and size, aiding in Near-Earth Object detection and planetary defense.

classification data-science decision-trees jupyter-notebook knn logistic-regression machine-lea matplotlib numpy pandas pillow prediction python3 random-forest scikit-learn

Last synced: 11 Apr 2026

https://github.com/audy21/datacamp

Learning portfolio documenting my progress while taking Data Analyst & Data Science certifications from DataCamp.

data-analysis data-science machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/nishi1612/knight-tour-problem

IT485 Logic of Inference Project on Knight's Tour. A Hamiltonian path problem: determine a path for a knight to travel the entire chessboard, visiting every cell exactly once.

backtracking-algorithm bootstrap flask html knights-tour localhost numpy pygame python tkinter warnsdorff

Last synced: 11 Apr 2026

https://github.com/swarnabhaghosh/house-price-prediction-model

Built an end-to-end regression pipeline to predict house prices using Linear Regression with automated preprocessing (PowerTransform, StandardScaling) via Scikit-learn's Pipeline and ColumnTransformer.

column-transformer linear-regression matplotlib-pyplot numpy pandas pipeline python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/mgitrov/lifespan-x-population-status

A machine learning project aiming to predict animals' lifespan and population status.

bs4 data-science machine-learning matplotlib numpy pandas python regular-expressions requests seaborn sklearn

Last synced: 11 Apr 2026

https://github.com/aksoni07/movie-recommendation

A hybrid movie recommendation system designed to deliver personalized and accurate suggestions by combining user preferences, item attributes, and collaborative patterns, ensuring a seamless and engaging experience.

clustering content-based-filtering data-analysis embeddings jupyter-notebook numpy ollaborative-filtering pandas personalization python recommendation-systems scikit-learn user-item-interactions

Last synced: 11 Apr 2026

https://github.com/rahatmoktadir03/customer-churn-prediction

A machine learning project for predicting customer churn, enabling businesses to identify at-risk customers and develop retention strategies.

business-analytics customer-churn-prediction data-science google-colab machine-learning numpy pandas python sklearn strreamlit xception-model

Last synced: 11 Apr 2026

https://github.com/chintanboghara/rocket-simulation

A comprehensive web-based orbital mechanics simulator with advanced mission planning, real-time tracking, and educational features.

docker flask html javascript numpy plotly python

Last synced: 11 Apr 2026

https://github.com/nachtfeuer/covid19

Python script(s) for visualizing corona data

csv json matplotlib numpy pandas python requests tkinter

Last synced: 05 May 2026

https://github.com/badranalyst/student-tests-data-analysis-application

Python-based analysis of student test scores in math, reading, and writing, examining correlations with parental education, lunch type, and test preparation. Includes data cleaning, visualization, and statistical insights into factors influencing academic performance.

data-analysis data-visualization dataset matplotlib numpy pandas python sklearn

Last synced: 05 May 2026

https://github.com/amanda-ucc/market-tracker

Automated portfolio creator that generates a portfolio which mimics a benchmark index by minimizing the tracking error.

beta covariance numpy pandas scipy sharpe-ratio tracking-error variance yfinance

Last synced: 11 Apr 2026

https://github.com/egorumaev/2023-cirrhosis-outcomes

Predicting treatment outcomes for patients with liver cirrhosis

catboost imblearn iqr lda matplotlib numpy pandas pca phik pipeline sklearn t-sne xgboost

Last synced: 08 May 2026