An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/vijaykumarr1452/startup_success_predictor

This project demonstrates the use of Multiple Linear Regression to predict the profits of startups based on investment in R&D, Administration, and Marketing of dataset (50_Startups.csv)

machine-learning multi-linear-regression numpy pandas python regression rsquare-values scikit-learn

Last synced: 10 Apr 2026

https://github.com/jol79/python_exercises

Solving interesting python exercises on different topics

matplotlib-pyplot numpy pandas python3 pythonexercises scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/farhad-here/median-performance-comparison

Benchmarking the performance of median calculation using vanilla Python vs NumPy.

data-analysis matplotlib numpy python

Last synced: 18 Apr 2026

https://github.com/jkosla/neural_network_from_scratch_numpy

Neural Network From Scratch in Python | Build a simple neural network from scratch using pure Python and NumPy. Learn about forward propagation, backpropagation, and training with gradient descent. Accompanies my Medium article.

ai aritificial-intelligence medium nerual-networks numpy python3 tutorial

Last synced: 10 Apr 2026

https://github.com/aneeshmurali-n/project-ml-data-preprocessing

The main objective of this project is to design and implement a robust data preprocessing system that addresses common challenges such as missing values, outliers, inconsistent formatting, and noise. By performing effective data preprocessing, the project aims to enhance the quality, reliability, and usefulness of the data for machine learning.

data-analysis data-cleaning data-encoding data-exploration feature-scaling label-encoding matplotlib minmaxscaler numpy one-hot-encoding outlier-detection pandas standardscaler

Last synced: 02 May 2026

https://github.com/niteshchawla/logistics-nn-regression

The case study is about India's Largest Marketplace for Intra-City Logistics. This dataset has the required data to train a regression model that will do the delivery time estimation, based on all those features.

adam-optimizer data-visualization encoding exploratory-data-analysis feature-engineering hidden-layers hyperparameter-tuning keras-tensorflow kerastuner metrics neural-network numpy pandas regression relu scaling sequential-models

Last synced: 10 Apr 2026

https://github.com/ahmedabdalkreem/hotel-reservation

Our task is to classify a Hotel Reservation as either booking canceled (class1) or no canceled(class0) and use more one model to arrive the best model.

bagging decisiontreeclassifier ensemble extra-trees-classifier logistic-regression matplotlib numpy pandas python3 random-forest sklearn-library svc-model

Last synced: 11 Apr 2026

https://github.com/alejoduarte23/reading_data_from_dewesoft

The following repository retrieves sensor data (acceleration and strains) from both local and cloud databases. It processes the data using classes from another repository called Modal Engine for spectral analysis, modal analysis, and signal processing.

dewesoft matplotlib modal-analysis numpy orm scipy signal-processing sql sqlalchemy

Last synced: 07 Jan 2026

https://github.com/navindafernando/feature-extraction

Heart Risk Level Predicting Regression Model & Web using Feature Engineering and Data Preprocessing :baby_chick:

categorical-encoding feature-engineering flask handling-outlier html5 joblib label-encoding machine-learning numpy pandas polynomial-features quantile-transformer scaling

Last synced: 10 Apr 2026

https://github.com/crvargasm/moore-penrose-matrix

A small implementation of an algorithm that is capable of computing the Moore-Penrose inverse of any matrix.

numerical-methods numpy python

Last synced: 10 Apr 2026

https://github.com/paulbrenker/decision-tsp

Scripts from a university data analysis project to approximate the decision tsp problem using heuristics and machine learning

data-science machine-learning numpy poetry python

Last synced: 26 May 2026

https://github.com/paraskevi-kivroglou/rl-pong-agent

A project by Paraskevi Kivroglou as part of exploring deep reinforcement learning applications.

atari atari-games gym-environment gymnasium numpy python3 pytorch q-learning reinforcement-learning reinforcement-learning-agent

Last synced: 11 Apr 2026

https://github.com/utkarsh251106/cricket-shot-analyzer

Real-time cricket shot analyzer using Python, OpenCV, and MediaPipe. Processes videos frame-by-frame, overlays pose and biomechanical metrics, and outputs an annotated video with JSON evaluation. Also the output video might have "??" in it which is there cause OpenCV can't display degree's symbol.

artificial-intelligence computer-vision deep-learning machine-learning mediapipe numpy python real-time

Last synced: 05 May 2026

https://github.com/kirtipratihar/python_libraries_for_ds

This repository serves as a comprehensive guide to Python programming for Data Science. It covers essential topics like data manipulation, data visualization, machine learning, and statistical analysis using popular libraries such as Pandas, NumPy, Matplotlib, Seaborn, and Scikit-Learn.

artificial-intelligence machine-learning numpy pandas python scikit-learn tensorflow

Last synced: 11 Apr 2026

https://github.com/dhanish03/credit_card_fraud_detection

Developed and implemented an advanced CCFDS using ML algorithms and pattern recognition techniques. Integrated real-time monitoring and adaptive learning capabilities into the system to dynamically adjust fraud detection parameters, ensuring effectiveness in identifying emerging fraud patterns.

kaggle-dataset numpy pandas-dataframe python3 sklearn

Last synced: 16 Apr 2026

https://github.com/iamsaniasingh/heart_disease_prediction

This is my very first machine learning project, where I used a supervised learning algorithm—logistic regression—to predict heart disease. The model was trained and tested entirely on a pre-existing dataset, with no user input involved. The goal was to understand how ML models work and how they can be applied in healthcare predictions.

logistic-regression machine-learning machine-learning-algorithms numpy pandas python sklearn

Last synced: 11 Apr 2026

https://github.com/riju18/from-data-production-to-client-handover

The common tedious problem is to build a data app to demonstrate the data analysis & analytics along with Machine Learning to a client. It was an attempt to do it on small scale in the most powerful & simplest way.

machine-learning matplotlib numpy pandas plotly python seaborn streamlit

Last synced: 30 Apr 2026

https://github.com/talapanenivarshithchowdary/asteroid-detection-ml

This project uses Machine Learning to detect and classify asteroids based on trajectory and size, aiding in Near-Earth Object detection and planetary defense.

classification data-science decision-trees jupyter-notebook knn logistic-regression machine-lea matplotlib numpy pandas pillow prediction python3 random-forest scikit-learn

Last synced: 11 Apr 2026

https://github.com/mgitrov/lifespan-x-population-status

A machine learning project aiming to predict animals' lifespan and population status.

bs4 data-science machine-learning matplotlib numpy pandas python regular-expressions requests seaborn sklearn

Last synced: 11 Apr 2026

https://github.com/winterwind/ecg_signal_classification

Two-part project that involves detecting the R-peaks in an ECG signal to extract the individual ECG beats and making a machine learning model to classify them

csv csv-files data-science decision-trees ecg ecg-classification ecg-signal jupyter jupyter-notebook knearest-neighbors knn machine-learning matplotlib matplotlib-pyplot numpy pandas pyplot python random-forest scipy

Last synced: 11 Apr 2026

https://github.com/badranalyst/student-tests-data-analysis-application

Python-based analysis of student test scores in math, reading, and writing, examining correlations with parental education, lunch type, and test preparation. Includes data cleaning, visualization, and statistical insights into factors influencing academic performance.

data-analysis data-visualization dataset matplotlib numpy pandas python sklearn

Last synced: 05 May 2026

https://github.com/amanda-ucc/market-tracker

Automated portfolio creator that generates a portoflio which mimics a bench mark index by minimizing the tracking error.

beta covariance numpy pandas scipy sharpe-ratio tracking-error variance yfinance

Last synced: 11 Apr 2026

https://github.com/bunu23/image-classification

This repository contains a notebook implementing a Convolutional Neural Network for multi-class image classification using transfer learning with a pre-trained ResNet-50 model. Covers dataset handling, model architecture customization, training, evaluation, fine-tuning, and external image prediction.

keras matplotlib numpy pil python tensorflow

Last synced: 11 Apr 2026

https://github.com/udhaya2823/dataspark-illuminating-insights-for-global-electronics

✨DataSpark✨ is a powerful analytics project transforming raw retail data into actionable insights for Global Electronics. By leveraging Python, SQL, and interactive visualizations, it uncovers trends in customer behavior, sales performance, and product popularity, driving smarter business decisions and boosting growth.

data-science data-visualization database-management datacleaning exploratory-data-analysis matplotlib numpy pandas powerbi python seaborn sql version-control

Last synced: 11 Apr 2026

https://github.com/lijesh010/ml_project_data_preprocessing

The main objective of this project is to design and implement a robust data preprocessing system that addresses common challenges such as missing values, outliers, inconsistent formatting, and noise. By performing effective data preprocessing, the project aims to enhance the quality, reliability, and usefulness of the data for machine learning.

data-cleaning data-exploration data-preprocessing machine-learning numpy pandas-python python scikit-learn

Last synced: 11 Apr 2026

https://github.com/project54321/stocksniper

"An interactive Stock Dashboard that visualizes historical prices, calculates key financial metrics, displays fundamental statements, and aggregates the latest news for any ticker — all in a sleek, easy-to-use Streamlit interface."

alphavantage finance numpy pandas plotly pypi python stock-market stocknews streamlit

Last synced: 11 Apr 2026

https://github.com/cfbastarz/jupyternotebooks

A collection of several Jypyter notebooks.

dask matplotlib numpy python xarray xesmf

Last synced: 18 Jan 2026

https://github.com/mramshaw/intro-to-ml

Intro to Machine Learning - Pattern Recognition for Fun and Profit

machine-learning matplotlib ml numpy pandas pip pip3 python scikit-learn scipy seaborn seaborn-plots sklearn statsmodels tensorflow weka

Last synced: 11 Apr 2026

https://github.com/harmanveer-2546/wafer-fault-detection

The goal is to eliminate manual work in identifying faulty wafers. Opening and handling suspected wafers disrupts the entire process. False negatives result in wasted time, manpower, and costs.

clustering data-transformation feature-selection machine-learning matplotlib numpy pandas python random-forest roc-auc-curve roc-auc-score seaborn sklearn svc xgboost

Last synced: 11 Apr 2026

https://github.com/biasandrade/rede-neural-do-zero-classificacao-binaria

🧠 Rede Neural Artificial construída do ZERO (só NumPy) para detectar fraudes em transações | Forward/Backward Propagation | Sigmoid | Gradiente Descendente

backpropagation binary-classification deep-learning fraud-detection from-scratch gradient-descent machine-learning neural-network numpy python

Last synced: 18 Nov 2025

https://github.com/chokzb/covid19_vaccination_analysis

An EDA project examining global COVID-19 vaccination progress. The notebook investigates vaccination trends by country, daily vaccination rates, timeline patterns, and dose distribution. The project includes visualisations created with Matplotlib, Seaborn, and Plotly.

covid-19 data-analysis data-visualization jupyter-notebook matplotlib numpy pandas plotly python seaborn vaccination

Last synced: 07 May 2026

https://github.com/stdlib-js/blas-ext-linspace

Return a new ndarray filled with linearly spaced values over a specified interval along one or more ndarray dimensions.

arange arrange javascript linear linspace math mathematics matlab ndarray node node-js nodejs numpy seq sequence statistics stats stdlib

Last synced: 04 May 2026

https://github.com/bhuvan-s-prasad/streamlit-regression

A machine learning project that predicts housing prices in California using regression techniques. This project includes comprehensive exploratory data analysis, feature engineering, linear regression modeling, and an interactive Streamlit web application for making predictions.

california-housing-price-prediction exploratory-data-analysis linear-regression machine-learning matplotlib numpy pandas python scikit-learn seaborn streamlit supervised-learning

Last synced: 11 Apr 2026

https://github.com/faizantkhan/numpy

NumPy Practice Repository Dive into the world of numerical computing with our collection of NumPy exercises. Perfect for beginners and seasoned programmers alike, this repository offers a range of challenges to sharpen your array manipulation and data analysis skills.

data-science library machine-learning numpy numpy-arrays numpy-library numpy-python outlier outlier-detection python pythonlibrarires tutorial

Last synced: 21 May 2026

https://github.com/ecrows/flippy

Python command-line utility for generating videos from image directories.

data-science image-processing numpy opencv python python3 video video-processing visualization

Last synced: 04 Mar 2026

https://github.com/eljandoubi/trading-with-momentum

A trading strategy based on a momentum indicator.

momentum numpy pandas scipy trading trading-algorithms trading-strategies

Last synced: 05 May 2026

https://github.com/parbhat-cpp/suicidal-ml

A machine learning/NLP-based system to identify signs of suicidal ideation from user text inputs.

bash cicd classification docker fastapi githubactions jinja2 jupyter-notebook machine-learning natural-language-processing nlp numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/chaudharypraveen98/lungcancerdetection

To distribute the work of doctors and process the large amount of data to produce accurate results on the go

numpy pandas pillow python scipy tenserflow

Last synced: 16 Apr 2026

https://github.com/jo-minseok/global-warming-100year

🌡️ 2100년까지의 지구 기온, 해수면, 북극 빙하, 탄소 예측 ML [완료]

arima-model global-warming machine-learning matplotlib numpy pandas scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/abhi227070/wine-quality-prediction

The Wine Quality Prediction project utilizes machine learning to assess wine quality based on various parameters. It offers a user-friendly interface built with Streamlit for easy interaction. Deployed on AWS, it provides scalable access to quality predictions for wine batches.

classification-algorithm machine-learning machine-learning-algorithms machinelearning matplotlib numpy pandas python3 seaborn

Last synced: 11 Apr 2026

https://github.com/harmanveer-2546/nasa-asteroid-classification

Classifying whether an asteroid is hazardous or not.

eda matplotlib numpy pandas python seaborn visualization xgboost

Last synced: 11 Apr 2026

https://github.com/rahul-shrivastav/digit-recognition-nn

Web application that recognizes handwritten digits using a deep learning model

deep-learning flask numpy python reactjs tensorflow

Last synced: 11 Apr 2026

https://github.com/naveen88112/final_education

Student Performance Prediction This project examines the student performance data, pre-processes the features, and implements machine learning methods (Random Forest) for the forecasting of final grades. The data is subjected to exploratory data analysis (EDA) and feature engineering prior to model training and assessment.

data-preprocessing exploratory-data-analysis machine-learning model-evaluation numpy pandas python

Last synced: 11 Apr 2026

https://github.com/dmarks84/coursework_project_text-mining-spam-analysis

Project for University of Michigan Applied Data Science Specialization -- Performed NLP in order to build features of email messages; trained various classification models to help predict if a message was spam.

classification databases eda nlp numpy pandas python scikit seaborn sentiment-analysis statistics supervised-ml text-mining unsupervised-ml visualization

Last synced: 11 Apr 2026

https://github.com/jbalooshie/movies-etl

Exercise working with movie datasets from Kaggle and Wikipedia. Python is used to extract, clean, and combine the data, and then it is loaded into a postgreSQL database.

data-analysis data-science jupyter-notebook numpy pandas postgresql postgresql-database python sqlalchemy

Last synced: 11 Apr 2026

https://github.com/ansh-info/industrial-scale-penicillin-simulation

Optimizing industrial-scale penicillin production using machine learning and data analysis.

jupyter-notebook machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/linsanity03/football_prediction

ML project to predict goal from football events such as penalty, free kick.

jupyter-notebook numpy pandas pyspark python3 tensorflow

Last synced: 11 Apr 2026

https://github.com/anshpg/vectorwave

The Jupyter Notebook (ipynb file) contained within this repository serves as the primary codebase for this project. It demonstrates how to preprocess the input data, train the logistic regression classifier, evaluate its performance, and save the trained model for future use.

collab jupyter-notebook matplotlib-pyplot numpy pandas python sklearn

Last synced: 11 Apr 2026

https://github.com/amiriiw/text_classification

Welcome to the Text Classification Project! This project is designed to train a model for classifying texts based on their emotional content and then using it to categorize new texts into corresponding emotional categories.

keras numpy pandas pickle scikit-learn tensorflow text-classification

Last synced: 20 Jan 2026

https://github.com/iaroslavbelavskii/statkemapp

Here's the code for my 'StatKemApp' app, that can be used to calculate statistics for the experimental laboratory data (e.g. titration). Feel free to use it and report any bugs!

chemical-engineering chemistry matplotlib-pyplot numpy pandas python scipy

Last synced: 11 Apr 2026

https://github.com/sakhileln/multimodal-chatbot-with-memory

A lightweight, CLI-based conversational agent that processes text and image inputs, remembers past interactions.

chatbot machine-learning nlp numpy opencv spacy tensorflow

Last synced: 11 Apr 2026

https://github.com/nagipragalathan/zk-connect

In the conventional hiring process, candidate applications are met with limited assessment, leaving the crucial task of evaluating suitability to recruiters. ZKConnect disrupts this norm by introducing a transformative approach.

css django html nextjs numpy opencv pandas python

Last synced: 11 Apr 2026

https://github.com/prashhhant213/cardioflex-treadmill-analysis-using-descriptive-statistics-probability

Description Analysis and Visualization on CardioFlex Treadmill data to provide insights and recommendations to improve their userbase.

colab-notebook numpy pandas probability python stats

Last synced: 11 Apr 2026

https://github.com/trim0500/fe-stats-classifier

An experiment to create a machine learning model via PyTorch to classify select Fire Emblem unit base stat distributions.

creational-patterns data-analysis data-science data-visualization design-patterns excel jupyter jupyter-notebook matplotlib-pyplot numpy pandas python python-modules python3 pytorch singleton

Last synced: 11 Apr 2026

https://github.com/zhanziyuan/pixelpuzzle-web

Encode/decode images using Base64 or shuffle/recover the pixels of images.

anime encryption image image-processing numpy python

Last synced: 11 Apr 2026

https://github.com/nirmaldeepponnada/codeclauseinternshipproject1

This project involves Customer Segmentation using K-Means clustering to group customers based on Recency, Frequency, and Monetary (RFM) analysis from the Online Retail dataset. It also performs Sentiment Analysis on Amazon Product Reviews using Natural Language Processing techniques & Logistic Regression to classify reviews as positive or negative.

kmeans logistic-regression numpy pandas python3 regular-expressions scikit-learn tf-idf-vectorizer

Last synced: 11 Apr 2026

https://github.com/nikhil-donthusaram/heartdiseaseprediction

Heart Disease Prediction App is a machine learning web application that predicts the likelihood of heart disease based on user medical inputs. Built using a Decision Tree Classifier and deployed with Streamlit for an interactive, user-friendly interface.

data-analysis descision-tree joblib jupyter-notebook machine-learning matplotlib numpy pandas python3 seaborn sklearn streamlit vscode

Last synced: 11 Apr 2026

https://github.com/kaguya163/marketing_campaigns

Анализ маркетинговой эффективности в спортивном ритейле.

ab-testing machine-learning matplotlib numpy pandas python scikit-learn scipy sqlite3

Last synced: 11 Apr 2026

https://github.com/dyarleniber/hands-on-machine-learning

This repository contains code examples, exercises, and projects related to the concepts covered in the book "Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 3rd Edition" by Aurélien Géron.

ai artificial-intelligence keras machine-learning matpolotlib numpy pandas scikit-learn tensorflow

Last synced: 11 Apr 2026

https://github.com/coderjolly/employee-information-retrieval-system

An information retrieval system constructed using Flask to perform ETL operations while ingesting New York Payroll data and provide a frontend user interface for manipulating filters and queries.

etl flask mvc numpy pandas python sqlite3

Last synced: 11 Apr 2026

https://github.com/alphan26/breastcancerclassification

This project involves a binary classification problem where we applied PyTorch to detect breast cancer.

matplotlib numpy pandas python pytorch

Last synced: 11 Apr 2026

https://github.com/hmourad2023/tweets_sentiments_emotions_analysis

Les réseaux sociaux, et en particulier Twitter, sont des plateformes essentielles où les utilisateurs expriment leurs sentiments et émotions au quotidien. Analyser les tweets peut fournir des informations précieuses sur l’humeur collective et les préoccupations des gens. Ce projet utilise un dataset de tweets disponible sur Kaggle

nltk numpy pandas sklearn

Last synced: 01 Aug 2025

https://github.com/frauvate/cheatsheet

This project provides ready-to-use templates for machine learning base models. This repo, which contains templates of common models in Jupyter notebook format, aims to help users quickly implement these models and easily adapt them to their own projects.

machine-learning machinelearning matplotlib numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/nishantksingh0/twitter-sentiment-analysis

A ML/DL based Twitter sentiment analysis model based on Transformer based pretrained Encoding on sentence

kaggle-dataset matplotlib-pyplot nlp numpy pandas sklearn tensorflow transformer

Last synced: 11 Apr 2026

https://github.com/blacksujit/neural_network_using_numpy_and_math

Inspired from a repo . i have implemented an some ground level maths and logic . In this neural network no tensorflow , pytorch is used it is just prepared by using numpy and some logical math.

algorithmic-composition basics-of-python calculus criticalthinking innovation machine-learning maths neural-network neuralnetworkusingnumpy numpy purebasics

Last synced: 26 Aug 2025

https://github.com/aadrianleo/book-recommendation-system

Book Recommender System using the Book-Crossing dataset. Compares content-based (TF-IDF + cosine similarity) and collaborative filtering (SVD) methods for book recommendations. Includes data cleaning, EDA, and model evaluation (Precision@5, RMSE) in Python.

artificial-intelligence collaborative-filtering content-based-filtering cosine-similarity data-preprocessing jupyter jupyter-notebook machine-learning matrix-factorization numpy pandas predictive-modeling python3 recommendation-system recommender-system scikit-learn sckit-surprise

Last synced: 10 Apr 2026

https://github.com/alaazameldev/text-based-search-engine

Implementation of a search engine using TF-IDF and Word Embedding-based vectorization techniques for efficient document retrieval

chromadb fastapi gensim-word2vec nltk numpy precision-recall python scikit-learn tf-idf-vectorizer

Last synced: 20 Jan 2026