An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/ianjure/simple-neural-network

Implementing a simple neural network using only NumPy.

deep-learning neural-network notebook numpy scratch-implementation

Last synced: 06 May 2026

https://github.com/skorupcia/winequality-white

winequality-white.csv visualization

matplotlib numpy pandas seaborn sklearn

Last synced: 06 May 2026

https://github.com/mat-kubiak/manual-ml

DIY ML framework (using only NumPy!) + various demos

artificial-intelligence examples machine-learning ml ml-framework neural-networks numpy

Last synced: 06 May 2026

https://github.com/sankaran-s2001/us-traffic-accidents-analysis-python-eda

Exploratory data analysis of US traffic accidents from 2016-2023, analyzing patterns by time, location, weather, and severity using Python data science libraries.

data-analysis data-science data-visualization eda matplolib numpy pandas python

Last synced: 06 May 2026

https://github.com/erick957/saleprice-prediction-dataset-analysis-and-cleaning-advance-regression

🏠 Predict house prices using advanced regression techniques with this comprehensive analysis and cleaning project, from data loading to model deployment.

data-analysis data-science eda google-colab machine-learning numpy pandas python scikit-learn scikit-learn-python

Last synced: 06 May 2026

https://github.com/andrewsy1004/logistic-regression-spam-classifier

This project implements a spam email classifier using Logistic Regression.

numpy pandas scikit-learn

Last synced: 06 May 2026

https://github.com/kunalpisolkar24/gradientdescent

A Python implementation of gradient descent for linear regression, visualized with cost function history, parameter trajectory, and a 3D cost surface.

cost-function gradient-descent numpy visualization

Last synced: 06 May 2026

https://github.com/deanlogan/data-analysis-course

Code created when completing the Data Analysis with Python Course on freecodecamp.org

course data-analysis numpy pandas python python3

Last synced: 06 May 2026

https://github.com/elmezianech/classifyreviews_nlp

Revolutionize customer feedback analysis with our NLP Insights Analyzer. Utilize cutting-edge text preprocessing to transform raw reviews into a machine-friendly format. Explore sentiment models, such as Logistic Regression and Naive Bayes, employing cross-validation for model robustness.

accuracy-score ai countvectorizer cross-validation joblib logistic-regression machine-learning ml multinomialnb naive-bayes-classifier nltk-library numpy pandas randomforestclassifier svc

Last synced: 06 May 2026

https://github.com/fbi555/fashion-mnist-cnn-tf

🧵 Classify fashion images using a compact CNN model in TensorFlow/Keras, designed for the Fashion-MNIST dataset with easy execution in Google Colab.

adam classification cnn colab-notebook computer-vision convolutional-neural-networks custom-training-loop deep-learning fashion-mnist gans image-classification interactive jupyter-notebook keras lowlevelapi mnist-dataset numpy regularization

Last synced: 06 May 2026

https://github.com/christiansassi/signal-image-and-video-project

Project developed by Pietro Bologna (@bolognapietro) and Christian Sassi for the Signal, Image and Video course.

image-processing mri-segmentation numpy opencv python

Last synced: 06 May 2026

https://github.com/kavayk29/text-summarization-fine-tuning-pegasus-model

This project uses the Pegasus model to generate concise, coherent text summaries. Fine-tuned on large datasets, it excels in abstractive summarization, ideal for automating tasks like summarizing articles and research papers.

datasets huggingface-transformers matplotlib nltk numpy pandas teansform tqdm

Last synced: 06 May 2026

https://github.com/sahilmate/gesturepainter

Virtual Painting Application

gesturepainter mediapi numpy opencv virtualpaint

Last synced: 06 May 2026

https://github.com/piero24/lossfuncition-4sn

This research presents a method for segmenting polyps during colonoscopy screening. The approach merges the hybrid semantic network (HSNet) architecture with Region-wise (RW) loss as the loss function for backpropagation.

dataset deep-learning deep-neural-networks loss loss-functions matplotlib neural-network numpy opencv polyp-detection polyp-segmentation python python3 pytorch segmentation segmentation-models torchvision transformers unipd unipd-dei

Last synced: 06 May 2026

https://github.com/vandanac20/advanced-real-estate-valuation-with-ensemble-regression-models

Advanced Real Estate Valuation with Ensemble Regression Models

machine-learning numpy pandas python sklearn

Last synced: 06 May 2026

https://github.com/redinfinitypro/python-finance

Rating: (8/10) The code performs a linear regression analysis on stock data using the yfinance library, importing necessary libraries, selecting stock symbols, downloading data, fitting a model, making predictions, and visualizing results.

datetime matplotlib numpy pandas random sklearn yfinance

Last synced: 06 May 2026

https://github.com/zotho/grav_euler

Euler gravity compute

gravity numpy python

Last synced: 06 May 2026

https://github.com/harmanveer-2546/student-performance-in-exam

Student performance analysis and prediction using datasets has become an essential component of modern education systems. With the increasing availability of data on students, schools and universities are using advanced analytics and machine learning algorithms to gain insights into student performance and predict future outcomes.

classification-report cluster dbscan dimensionality-reduction k-means-clustering labelling linear-regression logistic-regression numpy pandas pre-processing random-forest-classifier test-train-split unsupervised-machine-learning visualization

Last synced: 06 May 2026

https://github.com/jcbritobr/iris-python

Simple data analysis using the Iris dataset, pandas, NumPy, Matplotlib, and seaborn.

data-science matplotlib numpy pandas python seaborn

Last synced: 06 May 2026

https://github.com/abdulrahmanaymann/data-storage-and-information-retrieval

This is a Python project that performs tokenization, stop word removal, positional indexing, phrase query searching, term frequency-inverse document frequency (TF-IDF) calculation, cosine similarity computation, and document ranking.

cosine-similarity information-retrieval nltk numpy pandas postional-index python ranking stop-word-removal tf-idf tokenization

Last synced: 06 May 2026

https://github.com/kianaabrisham/svm-from-scratch

Linear SVM from scratch with hinge loss + decision boundaries

classification from-scratch fundamentals hinge-loss numpy optimization scikit-learn svm

Last synced: 07 May 2026

https://github.com/chandkund/pytorch

Foundational introduction to PyTorch, focusing on the basics of tensors, their creation, manipulation, and operations, which are essential for understanding and building deep learning models

classification computer-vision cuda-programming deep-learning loss-functions matplotlib numpy optimization pandas pyhton pytroch workflow

Last synced: 07 May 2026

https://github.com/waghraj1699/ipl-win-probability-predictor

Web app that predicts the live win probability of a match

data-preprocessing machine-learning ml-algorithms numpy pandas

Last synced: 07 May 2026

https://github.com/jonad/titanic_survival

Basic decision tree model for Titanic dataset. Predict who will survive.

jupyter-notebook matplotlib numpy pandas python3 titanic-survival-prediction

Last synced: 07 May 2026

https://github.com/dulajkavinda/breast-cancer-detect

🧬 Breast cancer classification using machine learning

added machinelearning matplotlib numpy pandas python

Last synced: 07 May 2026

https://github.com/miss-mhv/retail_strategy_and_analytics

The purpose of this work is to analyze data from a supermarket to better understand the types of customers who buy chips and their buying behavior in the area.

jupyter-notebook numpy pandas python3

Last synced: 07 May 2026

https://github.com/ai-ahmed/bda-and-ds

Dual Course: EMC2 Data Scientist & Big Data Analytics & IBM Professional Data Science Course

analytics big-data data-science jupyter-notebook jupyterlab linear-algebra mathematics matplotlib ml numpy pandas probability seaborn sklearn statistics

Last synced: 07 May 2026

https://github.com/jonad/chicago_crime_analysis

Exploratory data analysis on Chicago Crime dataset.

jupyter-notebook matplotlib numpy pandas python3

Last synced: 07 May 2026

https://github.com/shantanudhanawade/mini-projects

Price prediction with linear regression

matplotlib-python numpy pandas scikit-learn-python

Last synced: 07 May 2026

https://github.com/ganesh774218/eda-book-store

Exploratory data analysis on a book store dataset to uncover sales trends, popular genres, and top publishers.

data-visualization datacleaning datamanipulation eda matplotlib numpy pandas python pythonp pythonproject seaborn

Last synced: 07 May 2026

https://github.com/krishbharadwaj5678/gestureninja

Play Fruit Ninja using just your index finger with real-time gesture tracking. No mouse, no keyboard, just your webcam and smooth slicing.

automation computer-vision cvzone finger-tracking fruit-ninja game-development gamedev gesture-recognition hand-tracking mediapipe mouse mouse-control numpy opencv pyautogui python realtime virtualmouse webcam

Last synced: 07 May 2026

https://github.com/alekiie/streamlit-dashboard

A dashboard that utilizes the power of streamlit charts to create intuitive and easy to understand charts for data visualization.

data-visualization matplotlib numpy pandas python3 streamlit

Last synced: 07 May 2026

https://github.com/shefreenkaur/web-scraping-and-word-frequencies

This project analyzes word frequencies in BC Legislative documents using Stanford CoreNLP and Python. The program extracts text from PDF documents, processes it using natural language processing techniques, and generates a comprehensive word frequency analysis.

analytics chromedriver easyocr nlp numpy pandas pymupdf python selenium stanfordcorenlp webscraping wordfrequency

Last synced: 07 May 2026

https://github.com/imskully/python-cv-fault-detection

Fault detection algorithm that determines whether or not a fault exists within a ring using raw computer vision.

ccl computer-vision matplotlib morphology numpy opencv python

Last synced: 07 May 2026

https://github.com/hackintoshrao/first-neural-network

Simple neural network with a hidden layer, written from scratch using NumPy, for predicting bike-sharing sales

deep-learning neural-networks numpy python

Last synced: 07 May 2026

https://github.com/pocketfullofdata/electric-vehicles-market-size-analysis

This project analyzes the growth, adoption trends, and future projections of the electric vehicle (EV) market. Using data analysis and visualization techniques, it examines key factors like sales trends, and consumer adoption to understand the evolving landscape of the EV industry.

analysis data jupyter-notebook matplotlib numpy python seaborn vscode

Last synced: 07 May 2026

https://github.com/pesala-x/data-science-ai-assessments

In-class practice application for Python-for-Data-Science-AI

data-types functions jupyter-notebook loops machine-learning numpy pandas-python python python3 seaborn

Last synced: 07 May 2026

https://github.com/maxencebonamy/ascii-webcam

You can see yourself in real time through characters in a terminal. This project is an enhancement of another project which consists in converting an image to ASCII format, which allows, in a way, to compress it.

ascii numpy opencv python webcam

Last synced: 07 May 2026

https://github.com/jimmymugendi/bulding-a-decision-tree-to-predict-customer-churn

This repo describes building a decision tree to predict customer churn in a given organisation

accuracy-score decision-tree-classifier matplotlib-pyplot numpy pandas-dataframe scikit-learn

Last synced: 07 May 2026

https://github.com/jnayak1/naive_bayes_cars

Using Naive Bayes and SGD ML algorithms to predict if a new car will decrease in price

cars machine-learning naive-bayes numpy sklearn

Last synced: 07 May 2026

https://github.com/aimin-nur/dataanalyst-cohortanalysis

A data analyst project to find out how customer purchasing patterns develop over a given period of time (cohort analysis).

cohort-analysis numpy pandas pytz

Last synced: 07 May 2026

https://github.com/4strium/predict-french-tv-audience

🎬 A beginner's machine learning project whose aim is to predict the audience for films on TV in France, based on previous data provided by the CNC and Médiamétrie.

machine-learning numpy pandas python tv xgboost

Last synced: 07 May 2026

https://github.com/r-mahesh45/reccomendation-system

This project involves building a recommender system using cosine similarity to measure the similarity between items or users. It leverages numerical data to provide personalized recommendations. The model focuses on extracting meaningful insights and improving decision-making processes.

numpy pandas-dataframe python3 scikitlearn-machine-learning

Last synced: 07 May 2026

https://github.com/reddyprasade/bird-classifications-problem

Birds Classification by using Scikit-learn and Scikit-image

matplotlib numpy pandas python-3 sckiit-learn sckit-image

Last synced: 08 May 2026

https://github.com/djdurga/predictive_analysis_in_diabetes

This project applies Logistic Regression to predict diabetes in patients using the Pima Indians Diabetes Dataset. It covers the full data science lifecycle, from data imputation and exploration to model training, evaluation, and insights.

matplotlib numpy pandas

Last synced: 08 May 2026

https://github.com/tinaland101/carcinoma-regression-project

The goal of this project is to analyze and visualize the results of a clinical study involving the treatment of squamous cell carcinoma (SCC) in mice. As a senior data analyst, you are tasked with preparing a technical report for Pymaceuticals, Inc.

linear-regression mathplotlib numpy pandas pyhton

Last synced: 08 May 2026

https://github.com/mttgdd/openfabmap-python3

Pybind11 and numpy array bindings to openFABMAP

cpp numpy openfabmap pybind11 python robotics vision

Last synced: 08 May 2026

https://github.com/momo790-m/startup-profit-prediction

Machine learning project to predict profits of new startups based on R&D, Admin, Marketing, and State data

assumptions business colab-notebook linear-regression machine-learning matplotlib numpy pandas preprocessing seaborn skit-learn sklearn

Last synced: 08 May 2026

https://github.com/alain-godo/spotify-popularity

A simple analysis and modeling for prediction of the popularity of songs on Spotify

linear-regression machine-learning numpy pandas random-forest sckiit-learn spotify tree-decision

Last synced: 08 May 2026

https://github.com/akansharajput280799/data-exploration-and-visualization-for-streaming-platform

Data Analysis and Visualization for streaming platform to provide insights and recommendations to improve their userbase.

data-visualization excel exploratory-data-analysis matplotlib numpy python seaborn

Last synced: 08 May 2026

https://github.com/amiegirl/ford_gobike-tripdata

This data set includes information about individual rides made in a bike-sharing system covering the greater San Francisco Bay area.

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 08 May 2026

https://github.com/himanshugoyal77/movie-recommendation

Cosine similarity based Recommendation Engine

nltk numpy pandas requests-library-python tmdb-api vectorization

Last synced: 08 May 2026

https://github.com/antonio-f/simple-k-nearest-neighbors

Simple K-Nearest Neighbors using partial sort

101 k-nearest-neighbors knn numpy partial-sort python tutorial

Last synced: 08 May 2026

https://github.com/thekartikeyamishra/data-preprocessor

A Google Colab module for interactive data preprocessing. Handles missing values, categorical encoding (One-Hot, Label), and numerical scaling (Standard, MinMax). Outputs a cleaned dataset

ipywidgets numpy pandas python scikit-learn

Last synced: 08 May 2026

https://github.com/deepcloudlabs/dcl702-2020-sep-16

DCL-702: Data Analytics using Python

data-analytics numpy pandas python python-3

Last synced: 08 May 2026

https://github.com/0290192029/apartment-price-predictor

Python project for predicting apartment rental prices using linear regression. Practical assignment on the topic "Fundamentals of Machine Learning" for the course "MDK 13.01: Fundamentals of Applying Artificial Intelligence Methods in Programming".

apartment-price-prediction apartments-for-rent api correios-api data-analysis feature-engineering feature-enginering linear-regression linear-regression-models mlops numpy prediction-model r seaborn

Last synced: 08 May 2026

https://github.com/leandro-bertoluzzi/parchis-probabilities

Small weekend project to play around with NumPy and help me make decisions while playing parchís

matplotlib numpy python

Last synced: 08 May 2026

https://github.com/filnow/tinygimp

project for image processing class

gimp numpy opencv pysimplegui python3

Last synced: 08 May 2026

https://github.com/sunnyrao07/water-quality-analysis

A machine learning project that predicts water potability based on chemical and physical attributes, using models like Logistic Regression, Random Forest, and XGBoost.

data-cleaning label-encoding logistic-regression matplotlib model-evaluation numpy pandas pyhton random-forest sckiit-learn seaborn smote standard-scaler xgboost

Last synced: 08 May 2026

https://github.com/daniel-brai/alx-t-x-udacity-data-analytics-nanodegree

A 3-month nanodegree going through the intricacies of data analysis with Python and its various libraries, such as pandas, NumPy, Matplotlib, seaborn, and others

matplotlib numpy pandas python3

Last synced: 08 May 2026

https://github.com/chdl17/credit-eda-casestudy

Credit EDA is a GitHub repository designed to provide educational materials and tools for exploring credit risk data. The repository contains notebooks, datasets, and code samples to help users learn and apply these concepts to credit data.

exploratory-data-analysis numpy pandas

Last synced: 08 May 2026

https://github.com/satvikpraveen/numpymasterpro

A hands-on, production-ready toolkit to master NumPy, from first principles to real-world applications. Includes modular Jupyter notebooks, reusable utility scripts, cheatsheets, and advanced projects like K-Means clustering from scratch.

broadcasting data-analysis data-science data-source data-visualization jupyter-notebook kmeans-clustering linear-algebra machine-learning matrix-algebra numerical-computation numpy numpy-broadcasting numpy-examples numpy-tutorial open-source python scientific-computing standardization vectorization

Last synced: 08 May 2026

https://github.com/kkkaan/tensor-library-403-hw1

A NumPy-like library for tensor (gergen) operations.

deep-learning numpy python tensor

Last synced: 08 May 2026

https://github.com/diegodscamara/whisperlive

🎙️ WhisperLive: Real-time audio transcription powered by OpenAI's Whisper model. Convert live speech to text with high accuracy, supporting multiple languages and real-time processing. Perfect for accessibility, content creation, and live captioning.

ai blackhole numpy python wave whisper

Last synced: 09 May 2026

https://github.com/aliakbar-omidi/ten-most-populous-cities-of-iran

This project shows the most populous cities of Iran on a map

basemap matplotlib numpy pandas python

Last synced: 09 May 2026

https://github.com/raghavendranhp/predictive-analytics-in-automotive-manufacturing-testing-time-optimization

Explore machine learning for automotive testing optimization. Predictive analytics to reduce testing time and environmental impact.

elasticnet extratreesregressor fastica model-pipeline numpy pandas pca randomforestregressor svr-regression-prediction xgboost

Last synced: 09 May 2026

https://github.com/jupfu/beck-view-digitalize

Digitalize 16mm films with Python & reactivex & FT232H

cython ft232h libpng multiprocessing numpy opencv python3 reactivex um232h

Last synced: 09 May 2026

https://github.com/kemalcalak/developing-driverless-vehicles

This project includes the software required for an autonomous vehicle

numpy object-detection opencv segmentation yolo

Last synced: 09 May 2026

https://github.com/mahmood-anaam/deterministic-dynamic-programming

This repository explores deterministic dynamic programming solutions for the Shortest Path Problem (SPP) and the Traveling Salesperson Problem (TSP). It includes graphical representations, a detailed Jupyter notebook, and practical implementations for optimizing travel routes and minimizing costs in various city-based scenarios.

dynamic-programming graph-theory jupyter-notebook numpy python shortest-path-problem traveling-salesperson-problem

Last synced: 09 May 2026

https://github.com/rishieeee/spam-email-classifier

A simple machine learning project that classifies emails as spam or ham using TF-IDF and a Multinomial Naive Bayes model. The project covers data cleaning, text preprocessing, feature extraction, model training, and evaluation. A great beginner-friendly introduction to NLP and ML workflows.

multinomial-naive-bayes numpy pandas python sckit-learn tf-idf

Last synced: 09 May 2026

https://github.com/aliyahscoding/tiny-recommender

Tiny item-item recommender in Python using cosine similarity over a toy user-item matrix. Simple CLI to print top-N recommendations.

collaborative-filtering cosine-similarity numpy pandas python recommender-system

Last synced: 09 May 2026

https://github.com/flexthink/matricize

A convenience library to convert between pure Python objects and their vectorized representations

data machine-learning numpy python

Last synced: 09 May 2026

https://github.com/moonshallow5/food_vision_mini

A program that detects whether an image contains pizza, sushi, or steak. Now deployed on Flutter :)

huggingface matplotlib numpy pytorch

Last synced: 09 May 2026

https://github.com/amr-yasser226/pagerank-power-method

Modular Python implementation of the PageRank algorithm via the power method, with discrete-math foundations, end-to-end code, and experimental validation.

discrete-mathematics graph-algorithms markov-chain network-analysis networks numpy pagerank power-method python scipy

Last synced: 09 May 2026

https://github.com/brian-hepler-phd/kak-decomposition-qml

Cartan decomposition (SU(4)) via Lie theory and QML: exact + variational approaches for two-qubit gates.

lie-theory machine-learning numpy pennylane qml quantum-computing quantum-gate-decomposition scipy su4

Last synced: 09 May 2026

https://github.com/shaikats/python-assignment

Simulation and Modeling Lab assignments using Python, NumPy, Matplotlib, and Jupyter Notebook.

jupyter-notebook matplotlib numpy python simulation-modeling

Last synced: 09 May 2026

https://github.com/sivasurya681/financial-expense-tracker

The Financial Expense Tracker is a Python-based app built with Flet, Pandas, NumPy, and Matplotlib.pyplot for seamless expense management. It enables real-time tracking, categorization, and data visualization through bar and pie charts. Users can generate summaries, export/import data, and make informed financial decisions easily.

calculator-application differntial expense-tracker flet matplotlib-pyplot numpy pandas python salary-calculator total

Last synced: 09 May 2026

https://github.com/khaymanii/credit-card-fraud-detection-model

This model was built using Python and the logistic regression machine learning algorithm

matplotlib numpy pandas python sckit-learn

Last synced: 09 May 2026

https://github.com/chuvalniy/tulia

Contains self-implemented machine learning algorithms using only NumPy.

boosting from-scratch knn learning logistic-regression machine ml numpy python random-forest regression sklearn testing xgboost

Last synced: 09 May 2026

https://github.com/michaelzheng67/real_estate_ml

AI experiment developed to run against simulated real estate and investing environment

ai numpy python pytorch

Last synced: 09 May 2026

https://github.com/carol-neto/sprint-4-statistical-data-analysis

In this project I had the opportunity to test my knowledge by analyzing a phone plan and creating graphs to compare the plans and determine which ones generate the most revenue.

matplotlib-pyplot numpy pandas pytho scipy-stats seaborn statistical-analysis

Last synced: 09 May 2026

https://github.com/liannekiarra/university-dissertation

Combining Machine Learning Techniques with Multi-Objective evolutionary Algorithms to Solve Real World Engineering Problems

artificial-intelligence engineering hypervolume matplot multi-objective-optimization neuralnetwork nsga-ii numpy pareto-front pareto-optimality pymoo python3 sklearn

Last synced: 09 May 2026

https://github.com/vadimkantorov/fasttsv

TSV parser for Python in pure vectorized NumPy code

numpy parsing simd tsv vectorized-code

Last synced: 09 May 2026

https://github.com/alexgustafsson/word-frequencies

Data and tools to compile word frequencies, trigrams and more for use with NLP, spelling correction etc.

language nlp nltk numpy python python3 sklearn spelling-correction trigram

Last synced: 10 May 2026