An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/als8446/tripleten-data-science-projects

Projects Overview Projects made in the Data Scientist course from TripleTen LatAm

data data-analysis hypothesis-tests machine matplotlib numpy pandas python scipy sklearn

Last synced: 10 Apr 2026

https://github.com/roaajadaa/text-classification

Classify documents using custom naïve bayes model

matplotlib nlp nltk numpy pandas python sklearn

Last synced: 10 Apr 2026

https://github.com/lorenzorottigni/ml-iris-svm

Machine Learning python bootcamp: Support Vector Machines on iris flower dataset

ipynb machine-learning numpy pandas python scikit-learn seaborn support-vector-machines

Last synced: 10 Apr 2026

https://github.com/gaball1/alzheimer-prediction-ai

Alzheimer Prediction is a machine learning-based project designed to predict the likelihood of Alzheimer’s disease using clinical and imaging data. The project features an interactive web application built with Streamlit, enabling users to input key data, view predictions, and explore visualizations .

ai data-mining gradient-boosting jupyter-notebook logistic-regression logistic-regression-algorithm machine-learning matplotlib-python models numpy pandas python random-forest-classifier streamlit xgboost-classifier

Last synced: 11 Apr 2026

https://github.com/maximlevchenko/pandas-numpy-hippocampus-analysis

This project aims to analyze the differences in hippocampus size between two groups: "Unaffect" and "Affected" using pandas, numpy and matplotlib

jypyternotebook matplotlib numpy pandas pyreadr seaborn statsmodels

Last synced: 16 Apr 2026

https://github.com/bharath-tars/deeptumor

DeepTumor is a project aimed at developing a deep learning model with over 1 million parameters using TensorFlow. with automatic reporting feature

keras machine-learning numpy pandas pyhton tensorflow webdevelopment

Last synced: 10 Apr 2026

https://github.com/theanujsinha01/seoul-bike-sharing-demand-prediction

This project predicts the demand for bike-sharing services in Seoul using machine learning algorithms. It utilizes historical data and environmental factors to provide accurate predictions that can aid in resource optimization and planning.

data-cleaning data-science exploratory-data-analysis feature-engineering jupyter-notebook machine-learning matplotlib model numpy pandas seaborn

Last synced: 10 Apr 2026

https://github.com/sarincr/training-on-artificial-intelligence

Entree Academy 10 Days free training on Artificial Intelligence. Course will be conducted in a Blended learning way with Daily one hour online training and 3 hour project based training

artificial-intelligence artificial-intelligence-algorithms data-analysis data-science data-visualization decision-trees deep-learning deeplearning logistic-regression machine-learning machine-learning-algorithms machinelearning num numpy pandas regression scikit-learn scipy sklearn

Last synced: 10 Apr 2026

https://github.com/adi3042/credit-card-fault-detection

🔍💳 Secure Your Finances! Detect anomalies and safeguard transactions with our Credit Card Fault Detection system. Dive into cutting-edge classification techniques to identify fraud and protect financial data. Your journey to secure payments starts here! 🚨🔒 FraudDetectionTech

classification credit-card css datetime fault-detection flask functools html ipykernel jupyternotebooks machine-learning numpy pandas python3 readme scikit-learn setuptools venv

Last synced: 03 Apr 2026

https://github.com/pramodyasahan/car-safe-predictor

This repository contains a machine learning project that applies the K-Nearest Neighbors (KNN) classification algorithm to predict car safety ratings. The project uses a dataset of cars, with features such as buying price, maintenance cost, number of doors, persons, lug boot size, and safety.

classification k-nearest-neighbours machine-learning numpy pandas scikit-learn

Last synced: 10 Apr 2026

https://github.com/psavarmattas/covid19-dashboard

The goal of this project is not to build just another dashboard for Covid 19. But, to focus on collaborative plot ideas, and a mobile friendly UI/UX.

appseed bootstrap covid covid-19 css dashboard django docker html javascript json numpy plotly python travis-ci

Last synced: 10 Apr 2026

https://github.com/the-developer-306/neural-style-transfer

Neural Style Transfer project leveraging TensorFlow and VGG19 to blend artistic styles with content images. Features include style and content initialization, multi-layer feature extraction, total variation loss, and post-processing enhancements like sharpening, upscaling, and brightening.

deep-learning image-processing neural-style-transfer numpy opencv-python post-processing tensorflow transfer-learning vgg19

Last synced: 01 May 2026

https://github.com/lorenzorottigni/ml-ecommerce

Machine Learning python bootcamp: linear regression on ecommerce dataset

ipynb linear-regression machine-learning numpy pandas python scikit-learn seaborn

Last synced: 07 Apr 2026

https://github.com/allanotieno254/pneumonia-xray-classification

This project is a Deep Learning-based Pneumonia classification system that allows medical staff to upload chest X-ray images and quickly determine whether a patient shows signs of pneumonia.

keras machine-learning numpy pandas pil pillow python streamlit streamlit-webapp

Last synced: 10 Apr 2026

https://github.com/anuppm9917/super-store-sales-analysis-power-bi-project

My drive to know which products, regions, categories and customer segments a company should target or avoid, I search and selected an appropriate dataset on kaggle which will match a standard superstore requirement.

data data-analysis data-visualization datacleansing excel exploratory-data-analysis jupyter-notebook numpy pandas plotly powerbi python3

Last synced: 10 Apr 2026

https://github.com/ngangawairimu/automating_farming_analysis

This project automates farming in Maji Ndogo, analyzing soil fertility, climate, and geography to recommend optimal crop locations. Python-based data analysis I uncovers patterns guiding farming practices.

matplotlib-pyplot numpy pandas python seaborn sqlite

Last synced: 10 Apr 2026

https://github.com/rokmark0/auto-feature-engineering

🐙 Automated feature engineering toolkit that creates, selects, and transforms features for machine learning on tabular data, boosting model accuracy and speeding workflows.

auto-feature-engineering automl aws-ec2 dask data-science explainable-ml feature-engineering featuretools machine-learning mahinelearning numpy pandas pandas-library python r sagemaker supervised-learning unsupervised-learning

Last synced: 10 Apr 2026

https://github.com/ai-lenok/moex-analyzer

A simple engine for analyzing Moscow Exchange securities

docker matplotlib numpy pandas python3

Last synced: 09 May 2026

https://github.com/rooneyrulz/cat-dog-classifier

Building a model using a convolutional neural network that well-classifying custom cat vs dog image data. including visualizations, data-preprocessing, future predictions, model-evaluation, identifying overfitting, and more.

artificial-intelligence convolutional-neural-networks data-preprocessing data-visualizations deep-learning keras matplotlib numpy pandas predictive-modeling python3 seaborn tensorflow2

Last synced: 10 Apr 2026

https://github.com/sgb31/csv-retrival-based-rag

Created a simple RAG system that lets users upload any CSV file and ask questions about it. It reads the data, finds the most relevant info, and uses an LLM to answer the query. Super handy for quickly exploring and understanding data through natural language.

generative-ai llm numpy pandas pydantic python3 sentence-transformers sklearn streamlit transformers

Last synced: 19 Jun 2026

https://github.com/sarmad426/python

Python mastery. OOP | Numpy | Pandas | Jupyter Notebook & more.

fastapi file-handling-in-python jupyter numpy oop pandas pytest python regex unit-tests

Last synced: 10 Apr 2026

https://github.com/cartabinaria/calcolo-numerico

Una raccolta di risorse per l'insegnamento di Calcolo numerico (02023) del Corso di Laurea in Informatica.

course maths numerical-methods numpy scipy uni

Last synced: 20 Jan 2026

https://github.com/rlxchap2/crypto-miner

🔨Crypto Miner is a Python project designed to encrypt and decrypt files, especially images, using the powerful cryptography library

crypto cryptography csv numpy pillow python

Last synced: 08 May 2026

https://github.com/adityakumarda/kmeans-web-analytics

Built with Python, Pandas, and Scikit-learn, this machine learning project uses K-Means to cluster website users by behavior. It reveals patterns in engagement and bounce, helping drive data-informed decisions.

cluster-analysis elbow-curves elbow-method elbow-plot jupyter-notebook kmeans-clustering machine-learning matplotlib numpy pandas python python3 relationship scikit-learn seaborn sklearn

Last synced: 10 Apr 2026

https://github.com/broodhoney/heart-disease-prediction

This is a machine learning project which has a trained model that classifies whether a patient has a heart-disease or not.

kaggle-dataset matplotlib numpy pandas python scikit-learn scikitlearn-machine-learning uci

Last synced: 10 Apr 2026

https://github.com/ameykasbe/credit-card-fraud-detection-on-imbalanced-dataset

Examined data preprocessing techniques and performance of six different predictive models in Python to credit card fraud detection problem on an imbalanced dataset. Algorithms implemented - Logistic Regression, K Nearest Neighbours, Support Vector Classification, Naïve Bayes Classifier, Decision Tree Classifier, and Random Forest Classifier.

classification machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/vijaykumarr1452/startup_success_predictor

This project demonstrates the use of Multiple Linear Regression to predict the profits of startups based on investment in R&D, Administration, and Marketing of dataset (50_Startups.csv)

machine-learning multi-linear-regression numpy pandas python regression rsquare-values scikit-learn

Last synced: 10 Apr 2026

https://github.com/jol79/python_exercises

Solving interesting python exercises on different topics

matplotlib-pyplot numpy pandas python3 pythonexercises scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/farhad-here/median-performance-comparison

Benchmarking the performance of median calculation using vanilla Python vs NumPy.

data-analysis matplotlib numpy python

Last synced: 18 Apr 2026

https://github.com/jkosla/neural_network_from_scratch_numpy

Neural Network From Scratch in Python | Build a simple neural network from scratch using pure Python and NumPy. Learn about forward propagation, backpropagation, and training with gradient descent. Accompanies my Medium article.

ai aritificial-intelligence medium nerual-networks numpy python3 tutorial

Last synced: 10 Apr 2026

https://github.com/munawar-code/car_price_predictor

This project is a ML-based car price prediction system. The model is built using Jupyter Notebook for training and evaluation, while a simple one-page website was developed using Pycharm to provide interface for users to input car details and get price predictions.

datapreprocessing datavisualization exploratory-data-analysis feature-engineering flask-application html-css-javascript linear-regression machine-learning-algorithms matplotlib numpy pandas python scikitlearn-machine-learning

Last synced: 13 Apr 2026

https://github.com/ryan-bendelson/2024-summer-research

This is Python code that I worked with during my summer 2024 research project involving quantum physics.

density-matrices kronecker-product linear-algebra miniconda3 numpy numpy-arrays partial-trace python quantum-information

Last synced: 16 Apr 2026

https://github.com/aneeshmurali-n/project-ml-data-preprocessing

The main objective of this project is to design and implement a robust data preprocessing system that addresses common challenges such as missing values, outliers, inconsistent formatting, and noise. By performing effective data preprocessing, the project aims to enhance the quality, reliability, and usefulness of the data for machine learning.

data-analysis data-cleaning data-encoding data-exploration feature-scaling label-encoding matplotlib minmaxscaler numpy one-hot-encoding outlier-detection pandas standardscaler

Last synced: 02 May 2026

https://github.com/sahil210695/gradient-descent

A simplified explanation of gradient descent for linear regression in python using numpy

gradient-descent gradient-descent-algorithm linear-regression matplotlib mini-batch-gradient-descent numpy python stochastic-gradient-descent

Last synced: 03 May 2026

https://github.com/mogankumar/virtual-mouse

A Python-based Virtual Mouse that uses hand gestures for cursor control, clicking, scrolling, and taking screenshots. Powered by OpenCV, PyAutoGUI, and a custom hand tracking module for a touch-free experience.

autopy numpy opencv pyautogui python3

Last synced: 10 Apr 2026

https://github.com/khaymanii/spam_mail_detection_model

This model was built using Python and Logistics Regression algorithm

matplotlib numpy pandas python sckiit-learn

Last synced: 10 Apr 2026

https://github.com/anubhavkumar31/simple-heart_disease_prediction-using-logisticregression

Its a simple yet good model which predicts if a person have heart disease or not. This is a binary classification model i.e its output is either 0(dont have heart disease) or 1 (have heart disease).

logistic-regression machine-learning numpy python sklearn sklearn-linear-model sklearn-metrics

Last synced: 10 Apr 2026

https://github.com/semihbugrasezer/rockvsmine

Rock vs Mine Prediction with Python | Machine Learning Project

numpy pandas python

Last synced: 05 May 2026

https://github.com/babagata/racunalna_fizika

Math and physics solved with python

matplotlib numpy random scipy sympy

Last synced: 10 Apr 2026

https://github.com/jaweria-b/eda-basketball

The Streamlit app analyzes NBA player stats with user-selected filters, offering data download and intercorrelation heatmap.

matplotlib numpy python streamlit

Last synced: 10 Apr 2026

https://github.com/niteshchawla/logistics-nn-regression

The case study is about India's Largest Marketplace for Intra-City Logistics. This dataset has the required data to train a regression model that will do the delivery time estimation, based on all those features.

adam-optimizer data-visualization encoding exploratory-data-analysis feature-engineering hidden-layers hyperparameter-tuning keras-tensorflow kerastuner metrics neural-network numpy pandas regression relu scaling sequential-models

Last synced: 10 Apr 2026

https://github.com/elon-fask/nlp_num1

Natural Language Processing with Disaster Tweets

ai machine-learning nlp nlp-machine-learning numpy pandas python text-processing

Last synced: 10 Apr 2026

https://github.com/rama1997/lane-line-detection

Uses computer vision to detects lane lines on the road from images/videos using the POV of a driving vehicle

numpy opencv opencv-python python

Last synced: 10 Apr 2026

https://github.com/ahmedabdalkreem/connected_component_labeling

Technique used to detect small object in the image like shapes and number can used this technique in OCR.

computer-vision connected-components matplotlib numpy object-detection python rgb2gray threshold

Last synced: 11 Apr 2026

https://github.com/azaz9026/data_cleaning

Welcome to the Data Cleaning repository! This collection is dedicated to showcasing techniques and methods for cleaning and preparing datasets for analysis.

data-analysis data-engineering data-structures data-visualization eda feature-engineering machine-learning numpy outliers pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/ahmedabdalkreem/speech-recognition

In this project we deal with audio dataset to make classification between ten audio and make preprocssing in it using Librosa and Scipy then be input of Neural Network Model.

librosa matplotlib neural-network numpy pandas python3 scipy sklearn tensorflow

Last synced: 11 Apr 2026

https://github.com/alejoduarte23/reading_data_from_dewesoft

The following repository retrieves sensor data (acceleration and strains) from both local and cloud databases. It processes the data using classes from another repository called Modal Engine for spectral analysis, modal analysis, and signal processing.

dewesoft matplotlib modal-analysis numpy orm scipy signal-processing sql sqlalchemy

Last synced: 07 Jan 2026

https://github.com/tsungtsetu122/datamining-cifar10-classification

Data mining project on CIFAR-10 extracted features, applying preprocessing, classification models, and evaluation techniques to improve classification performance.

matplotlib numpy pandas python scikit-learn

Last synced: 10 Apr 2026

https://github.com/crvargasm/moore-penrose-matrix

A small implementation of an algorithm that is capable of computing the Moore-Penrose inverse of any matrix.

numerical-methods numpy python

Last synced: 10 Apr 2026

https://github.com/mohamed15058/text_classification-digital-egypt-pioneers-initiative-project-

Text_Classification(Digital-Egypt-Pioneers-Initiative-Project )

depi mlops nlp nltk numpy panadas python3 twnsorflow

Last synced: 10 Apr 2026

https://github.com/sc0v0ne/ai-discipline-work

AI Discipline Work - Movie recommendation

jupyter-notebook machine-learning numpy pandas python python3

Last synced: 15 Apr 2025

https://github.com/omarsaad21/rfm-clustering-

A full Data science and deployment project focusing on Data analysis and ML ( create a customer segmentation model to recommend the best merchants for each user as targetted offers)

business-solutions data-science eda numpy pandas plotly python sickit-learn streamlit

Last synced: 11 Apr 2026

https://github.com/paulbrenker/decision-tsp

Scripts from a university data analysis project to approximate the decision tsp problem using heuristics and machine learning

data-science machine-learning numpy poetry python

Last synced: 26 May 2026

https://github.com/mnitin-reddy/collaborative-filtering-based-recommendation-system

This project is a Book Recommendation System that uses two main approaches: Popularity-Based and Collaborative Filtering. It recommends top books based on their rating frequency and average ratings, and also provides personalized book suggestions by analyzing user interactions.

collaborative-filtering numpy pandas popularity-based-recommendation python recommendation-system scikit-learn

Last synced: 11 Apr 2026

https://github.com/yash-rewalia/airbnb_eda_pandas

The goal of the project is to gather information and analyze the detailed information of the different entries in order to provide insights about the host and price of the property in a particular area as per your preference , type of rooms and number of reviews accordingly.

data data-cleaning data-insights data-preprocessing data-visualization matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/germanocastanho/python-notes

Personal collection of Python Notes, tips, and code snippets 🐍

data-science numpy pandas programming python

Last synced: 01 Sep 2025

https://github.com/lilivalgo/ml_time_series

This project uses time series data to predict corn crop yield in Colombia

matplotlib numpy pandas scipy seaborn sklearn

Last synced: 11 Apr 2026

https://github.com/hansalemaos/hexarray2decimal

Converts a numpy string array with hex values to int

convert hex int numpy python

Last synced: 05 May 2026

https://github.com/pardhuu66/college-id-validator

FastAPI-based offline College ID Validator with Docker support

base64 dnn docker easyocr fastapi mobilenetv2 numpy onnx onnxruntime opencv pillow pydantic python tensorflow uvicorn

Last synced: 11 Apr 2026

https://github.com/utkarsh251106/cricket-shot-analyzer

Real-time cricket shot analyzer using Python, OpenCV, and MediaPipe. Processes videos frame-by-frame, overlays pose and biomechanical metrics, and outputs an annotated video with JSON evaluation. Also the output video might have "??" in it which is there cause OpenCV can't display degree's symbol.

artificial-intelligence computer-vision deep-learning machine-learning mediapipe numpy python real-time

Last synced: 05 May 2026

https://github.com/armahdavi/qff-evalation_code-data-processing-statistics-plotting

Data pipelines and processing codes, statistical modellings, descriptive statistics, and plot visualizations for QFF evaluation phase of for Mahdavi et al. (2021) (Environmental Pollution) Project Miestone: 2018 - 2021 Full-length article: https://www.sciencedirect.com/science/article/abs/pii/S0269749120370779

data-science data-visualization histogram matplotlib matplotlib-pyplot numpy pandas python

Last synced: 11 Apr 2026

https://github.com/rkarahul/face-detection-using-opencv-

•Build a face detection project using OpenCV and haar cascades, which are the better choice for real-time detection.

haar-cascade-classifier machinelearning numpy pandas-library python3 tkinter

Last synced: 08 May 2026

https://github.com/zuhairzia/titanic-survival-project

This is a Titanic Survival Prediction Model developed using Python, Pandas, Scikit-learn, and Jupyter Notebook. The model predicts whether a passenger survived the Titanic disaster based on features such as age, gender, and passenger class.

csv-dataset flask jupyter-notebook matplotlib numpy pandas pandas-library python scikit-learn seaborn streamlit

Last synced: 11 Apr 2026

https://github.com/jigyasag18/fake-news-prediction-app

The Fake News Prediction App Repository offers a machine learning project that focuses on identifying the authenticity of news articles as fake or real. It uses a dataset of 20,000 articles and employs methods such as TF-IDF vectorization and the Lemmatization algorithm, achieving ~95% classification accuracy with random forest classifier model

data datapreprocessing logistic-regression machine-learning machine-learning-algorithms numpy pandas prediction stemming streamlit streamlit-webapp vectorization

Last synced: 11 Apr 2026

https://github.com/armahdavi/analytics-data-pipelines-statistics-plotting---dust-extraction-hvac-filters---phase-1

PhD Technical Paper 1 - Phase 1 - Mahdavi & Siegel (2020) (Aerosol Science & Technology; AS&T) - Sharing all the data pipelines, processing codes, descriptive statistics, statistical modellings, and plotting/visualizations - Project Miestone: 2017 - 2020 - Full-length article is available

matplotlib numpy pandas pandas-dataframe pyplot python scipy-stats sklearn

Last synced: 13 Apr 2026

https://github.com/dmarks84/coursework_project_apache-airflow-kafka-on-toll-booth-data

Project for IBM Data Engineering & Python course on ETL & Big Data -- Read in live toll booth data, wrangles and transformed, and wrote into a SQL database

apache-airflow apache-kafka automation dags data-modeling databases eda elt etl mysql numpy pandas pipelines python sql

Last synced: 11 Apr 2026

https://github.com/alexixrugis/perceptronvisualization

Visualization of training and operation of a perceptron written from scratch in numpy

ai machine-learning numpy python

Last synced: 11 Feb 2026

https://github.com/timothyjan/intro-machine-learning-classifiers

We will use the scikit-learn library, which is a higher-level machine learning library that will work with NumPy data, and Pandas, a library that makes it easier to manipulate data. We will explore a variety of classification algorithms, and compare their performance on a “real-world” dataset, which will introduce its own set of challenges.

numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/vidushibhadana/eda-on-nyc-taxi-data

About Conducting an Exploratory Data Analysis (EDA) on New York City taxi data and visualizing it through countplots, distribution plots (displot), and histograms using Python and it's libraries.

data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/varkenvarken/blempy

small, safe utilities to efficiently transfer Blender property-collection attributes (e.g. vertex coordinates) to/from NumPy arrays and perform vectorized operations with minimal Python overhead.

blender numpy

Last synced: 13 Jan 2026

https://github.com/volf52/deep-neural-net

A simple deep neural net class written to work with Numpy and Cupy

binarized-neural-networks binary-neural-networks bnn cupy deep-learning deep-neural-networks mnist numpy python python3

Last synced: 05 May 2026

https://github.com/ttwag/p2_gaussian_elimination

A function that encodes the Gaussian Elimination algorithm

algorithm linear-algebra numpy python

Last synced: 29 Apr 2026

https://github.com/lucasgleria/seamese-network-algorithm

Este projeto implementa um sistema de busca por similaridade de imagens usando redes siamesas e Triplet Loss em PyTorch. Ele gera embeddings de imagens (MNIST com EfficientNet-B0) para encontrar visuais semelhantes. O foco está na análise visual e no aprendizado de representações no espaço vetorial.

google-colab matplotlib numpy pandas python pytorch timm

Last synced: 11 Apr 2026