An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.
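As a brief illustration of that description, the sketch below builds a small two-dimensional array and applies a few of NumPy's array-level operations. The variable names are chosen only for this example.

```python
import numpy as np

# A 2-D array (matrix) built from nested Python lists.
a = np.array([[1.0, 2.0], [3.0, 4.0]])

# Element-wise arithmetic applies to every entry at once.
doubled = a * 2

# High-level mathematical functions operate on whole arrays.
col_means = a.mean(axis=0)  # mean of each column
product = a @ a             # matrix multiplication

print(doubled)
print(col_means)
print(product)
```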

https://github.com/nikhilsree5/aerofitcasestudy

Customer Profiling and Market Segmentation for AeroFit Treadmills: A Data-Driven Approach

customerprofile eda numpy pandas python visualization

Last synced: 17 Apr 2026

https://github.com/theakashshukla/diabetes-prediction

Diabetes Prediction using futuristic tool

ai jupiter-notebook ml numpy pandas sklearn

Last synced: 17 Apr 2026

https://github.com/sofiakhutsieva/analytical_report_for_producers_of_educational_programs

Preparation of an analytical report (Skillbox project)

analytics matplotlib numpy pandas seaborn

Last synced: 17 Apr 2026

https://github.com/khushi130404/binning-binarization

This project demonstrates binning and binarization on the Titanic dataset, comparing results with and without numeric encoding. Visualizations highlight the transformations and their impact on survival analysis.

binarization binarizer binning columntransformer decision-trees matplotlib numpy pandas

Last synced: 17 Apr 2026

https://github.com/vishal-verma-96/oil_spill_prediction_using_ml

This ML project predicts oil spills using machine learning algorithms such as XGBoost and Random Forest. The project also covers saving and loading the model to make predictions on a sample dataset.

data-science machine-learning-algorithms machine-learning-classification matplotlib model-building-and-evaluation numpy pandas pickle python-3 scikitlearn-machine-learning xgboost

Last synced: 17 Apr 2026

https://github.com/bilgeswe/datascience

Statistics and data science project in Python using Google Colab tools, CSV and XLSX

box-plot colab-notebook cvs data-science data-scraping data-visualization heatmaps numpy python statistical-analysis statistics xlsx

Last synced: 29 Apr 2026

https://github.com/0nom4d/306radiator

Mathematical EPITECH Project - This project reproduces the behaviour of a radiator placed at a position in a room of a given size.

epitech epitech-project numpy pytest python3

Last synced: 29 Apr 2026

https://github.com/mgitrov/cnn-on-cifar-10

A computer vision project aiming to classify random images uploaded by the user.

artificial-neural-networks computer-vision convolutional-neural-networks deep-learning keras matplotlib numpy pil regularization tkinter

Last synced: 17 Apr 2026

https://github.com/sukanyadutta52/topic_modeling

What are the most pressing concerns regarding ‘Climate Change’ among tweeters according to Topic Modeling?

climate-change gensim matplotlib nltk numpy pandas pyldavis regular-expression spacy

Last synced: 09 Apr 2026

https://github.com/smartpottech/smartpot-datagenerator

SmartPot-DataGenerator simulates environmental and plant data (temperature, humidity, luminosity, pH, TDS, soil moisture) for hydroponic garden management. This data is used for testing, analysis, and visualization in the SmartPot web portal, helping optimize plant care without actual hardware sensors.

data-model matplotlib numpy python simulated

Last synced: 17 Apr 2026

https://github.com/shailikadakia/yoga-pose-accuracy-detector

Real-time yoga pose accuracy detector using MediaPipe and OpenCV. Classifies poses from webcam input with KNN and provides feedback on duration and accuracy, built with Python and scikit-learn.

computer-vision fitness health-tech knn machine-learning mediapipe numpy opencv pandas pose-estimation python real-time-detection yoga

Last synced: 09 Apr 2026

https://github.com/mugambi645/classify-song-genre-from-audio-data

Classify song genres using supervised learning models & PCA for dimensionality reduction

decision-tree-classifier logistic-regression numpy pandas pca-analysis

Last synced: 17 Apr 2026

https://github.com/glbter/data-analysis-py

Data analysis labs, made 50/50 with Oleh Tonkoshkur

numpy pandas python3 scipy-stats

Last synced: 12 Jun 2026

https://github.com/rosieoh/emergency_dataanalysis

Open data analysis: data analysis for policy proposals on the emergency medical system

ipython matplotlib numpy pandas python scikit-learn scipy

Last synced: 04 Apr 2026

https://github.com/neetigyab/pdf_form_parser

Older Test version of designated PDFReader project

excel forms matplotlib numpy ocr opencv pandas pdf pdf-converter pdfplumber pytesseract python

Last synced: 17 Apr 2026

https://github.com/varakanthamsaihruthikreddy975/champo_carpets_case_analysis_python

The Champo Carpets case study analysis in Python. This is still under development and will be a more comprehensive analysis than the previous one in R. The complete R version of the code is at https://github.com/VarakanthamSaiHruthikReddy975/Champo_Carpets_Case_Study_Analysis

case-analysis jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python3 seaborn

Last synced: 17 Apr 2026

https://github.com/anag0es/projetos-ml

This is a place where I keep various projects to understand how machine learning works. Each project is a way to practice what I am learning in my studies.

artificial-intelligence crisp-dm data-science deep-learning learning machine-learning neural-network numpy pandas prediction-model python

Last synced: 17 Apr 2026

https://github.com/gregoritsch3/project_python_jobs_data_analysis

A multi-library Data Analysis Python project that draws useful insights from existing job data. It provides useful exploratory insight about job postings in the Data sector, such as the distribution of remote/non-remote jobs, jobs with/without healthcare, top paying and most demanded data roles, and more.

matplotlib numpy pandas python scipy seaborn

Last synced: 04 Apr 2026

https://github.com/w13b3/ssim-py

Structural Similarity (SSIM) index, where the only core dependency is NumPy

image-comparison numpy opencv-python pillow python3 ssim tested unittests

Last synced: 29 Apr 2026

https://github.com/mwoss/mownit

Various tasks from numerical analysis laboratories solved using Python

jupyter-notebook mathematics matplotlib numerical-analysis numpy python scipy

Last synced: 17 Apr 2026

https://github.com/chandkund/loan-eligibility-prediction

This project is designed to predict the eligibility of loan applicants based on various factors such as income, credit history, and marital status. By analyzing historical loan application data, the model helps to determine whether a loan application should be approved or not.

data-analysis data-science data-visualization machine-learning-algorithms matplotlib numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/kmkurn/text2array

Python library to convert text dataset into arrays.

deep-learning natural-language-processing neural-networks numpy

Last synced: 14 May 2026

https://github.com/sanchariii/health-assist-1

Health Assist web app uses the Streamlit framework to help users identify potential diseases. Simply input your symptoms, and the system will predict whether you have diabetes, heart disease, or Parkinson's disease. The system is designed to be quick and convenient, promoting early detection and timely medical intervention.

numpy pandas pickle streamlit-webapp svm-model

Last synced: 13 Aug 2025

https://github.com/datamallo/python101

The resources you need to learn or get better at Python. Covering the basics of Python3, NumPy and Pandas for

numpy pandas python3

Last synced: 18 Apr 2026

https://github.com/jinhx128/data_analysis

A data analysis project built with Matplotlib, NumPy, and Pandas.

matpolotlib numpy pandas python

Last synced: 18 Apr 2026

https://gitlab.com/DLR-SY/beos

Beos is a legacy Fortran tool used to calculate the static and dynamic buckling behavior of flat/curved fiber composite structures.

fortran numpy python

Last synced: 16 Aug 2025

https://github.com/yuvrajsaraogi/sales-prediction-using-python

Sales prediction involves estimating future product sales based on factors like advertising spend, target audience, and platform. Businesses rely on data scientists to forecast sales and optimize advertising costs. Machine learning in Python can be used for this task.

data data-analysis data-science data-visualization machine-learning matplotlib natural-language-processing numpy pandas prediction python sales-prediction-using-python sql

Last synced: 19 Apr 2026

https://github.com/compcode1/nfl-analysis

The objective of this analysis was to explore potential correlations between player attributes (such as average age and weight across different positions) and key team performance metrics from the 2023 NFL season.

datacleaning datanormalization numpy pandas pearson-correlation-coefficient pvalues

Last synced: 18 Apr 2026

https://github.com/romerorodriguezd/homography-with-opencv

Homography operations applied to insert images inside others regardless of the perspective.

computer-vision homography matrices-transformation numpy opencv opencv-python

Last synced: 29 Apr 2026

https://github.com/yassin522/music-genre-classification

This repository is dedicated to the Music Genre Classification project, an endeavor that leverages machine learning techniques to accurately classify music into various genres

catboost matplotlib numpy pandas python sklearn

Last synced: 04 Apr 2026

https://github.com/shwetapardhi/assignment-04-simple-linear-regression-1

Q1) Delivery_time -> Predict delivery time using sorting time. Build a simple linear regression model by performing EDA and do necessary transformations and select the best model using R or Python. EDA and Data Visualization, Feature Engineering, Correlation Analysis, Model Building, Model Testing and Model Predictions using simple linear regressi

correlation-analysis data-visualization distplot eda feature-engineering model-building model-prediction model-testing numpy ols-regression p-value pandas python regression-plot rsquare-values seaborn simple-linear-regression smf statsmodel t-score

Last synced: 30 Apr 2026

https://github.com/amirali5/numpy

This repo is all about NumPy, a Python library used for working with multidimensional arrays. It also has functions for linear algebra, Fourier transforms, and matrices. NumPy was created in 2005 by Travis Oliphant. It is an open source project and you can use it freely.

numpy

Last synced: 30 Apr 2026

https://github.com/awanraskall/retail-demand-analysis

Data analysis of retail meal orders, fulfillment centers, and product demand using Python

data-analysis data-visualization jupyter-notebook numpy pandas python

Last synced: 18 Apr 2026

https://github.com/arnoldchrisoduor1/potato_disease

Trained a CNN to detect the presence and the current stage of Blight disease in potatoes, WebApp in FastAPI and React

fastapi keras numpy react serving-tensors tensorflow

Last synced: 04 Apr 2026

https://github.com/amruta33/melanoma-detection

Builds a CNN-based model that can accurately detect melanoma. Melanoma is a type of cancer that can be deadly if not detected early. It accounts for 75% of skin cancer deaths. A solution that can evaluate images and alert dermatologists about the presence of melanoma has the potential to reduce a lot of manual effort needed in diagnosis.

keras machine-learning numpy pandas python3 sckiit-learn tensorflow

Last synced: 09 Apr 2026

https://github.com/stella4444/linear-regression

learning about linear regression (currently a work in progress) ~ working with data

linear-regression machine-learning numpy scikit-learn

Last synced: 20 Jan 2026

https://github.com/vansh-py04/convolution-net-from-scratch

A minimal NumPy-based implementation of a 3-layer convolutional neural network (CNN) from scratch — including custom forward and backward passes for conv, ReLU, pooling, affine, and softmax layers. Perfect for learning how CNNs actually work under the hood.

backpropagation cnn computer-vision convolutional-neural-networks cs231n cs231n-assignment deep-learning from-scratch from-scratch-in-python fully-connected-network machine-learning maxpool2d nueral-networks numpy softmax stanford-deep-learning

Last synced: 24 Apr 2026

https://github.com/nauxqouh/python-for-data-science-labs

This repo contains weekly practical code for my Python for Data Science course at university.

data-science jupyter-notebook numpy pandas python pytorch scikit-learn

Last synced: 11 Apr 2026

https://github.com/coderjolly/employee-information-retrieval-system

An information retrieval system constructed using Flask to perform ETL operations while ingesting New York Payroll data and provide a frontend user interface for manipulating filters and queries.

etl flask mvc numpy pandas python sqlite3

Last synced: 11 Apr 2026

https://github.com/abdullahashfaqvirk/SMS-Spam-Detection

A machine learning application designed to classify SMS messages as spam or non-spam, offering real-time analysis to identify potentially harmful content.

css3 docker flask html5 javascript matplotlib nltk numpy pandas python scikit-learn seaborn tailwindcss xgboost

Last synced: 16 Aug 2025

https://github.com/kasraskari/python-for-data-science

Foundational concepts and libraries for data science with Python.

data-science jupyter-notebook matplotlib numpy pandas python scipy

Last synced: 04 Apr 2026

https://github.com/adi3042/sensor-fault-detection

🔍⚙️ Ensure Reliable Operations! Detect anomalies and prevent disruptions with our Sensor Fault Detection system. Explore advanced classification and regression techniques to identify and address sensor faults effectively. Your path to robust and accurate sensor data begins here! 🚨🔧 SensorFaultTech

classification css datetime fault-detection flask functools html ipykernel jupternotebook machine-learning numpy pandas python3 readme regression scikit-learn sensor setuptools venv

Last synced: 11 Apr 2026

https://github.com/pedroteixeiraw/variational_quantum_circuit_binary_classification

This project focuses on developing a Variational Quantum Circuit capable of performing Binary Classification between two classes: red wine and white wine, based on their characteristics using machine learning.

binary-classification cost-function json machine-learning matplotlib numpy pandas qiskit qiskit-machine-learning quantum-machine-learning scikit-learn training-data variational-circuit

Last synced: 04 Apr 2026

https://github.com/alainlebret/python-et-ia-1

Personal resources for the "Python & AI" course in the 2nd year of GPSE at ENSICAEN

artificial-intelligence image-processing machine-learning matplotlib numpy python scikit-image scikit-learn

Last synced: 04 Apr 2026

https://github.com/csengupta1101/housing-price-prediction

Based on the given features, this project deals with the prediction of house prices. Everything from data extraction to deployment is detailed here.

jupyter-notebook matplotlib numpy pandas python python3 scikit-learn vscode

Last synced: 11 Apr 2026

https://github.com/lexxai/goit_python_ds_hw_03

Module 3. Classical machine learning. Overfitting. Linear regression. LaTeX formulas.

latex linear-regression matplotlib numpy pandas python scikit-learn

Last synced: 09 Apr 2026

https://github.com/npodlozhniy/dl-tinkoff

A repository of my homework for the MIPT master's deep learning course from Tinkoff FinTech

deep-learning numpy pytorch

Last synced: 08 May 2026

https://github.com/muhammad-rebaal/eda-with-python-and-pandas

Explore, Analyze, and Visualize Data with Python and Pandas: Dive into the world of data analysis and visualization using Python and Pandas with this repository. Discover valuable insights, clean and preprocess your data, and create stunning visualizations to make data-driven decisions.

calmap matplotlib numpy pandas python python3 seaborn

Last synced: 07 May 2026

https://github.com/moustafamohamed01/car-price-prediction

Car Price Prediction using Linear Regression: a machine learning model that predicts car prices based on features like name, company, year, kilometers driven, and fuel type. Built with Linear Regression and deployed as a Streamlit web app.

data-science jupyter-notebook linear-regression machine-learning numpy pandas python sickit-learn sklearn streamlit

Last synced: 11 Apr 2026

https://github.com/junaidsumsaal/advanceparkingsystem

It's a web application that integrates AI-powered parking spot predictions, real-time notifications, and role-based dashboards for Drivers, Attendants, and Providers. This system is designed to streamline parking operations, enhance user experience, and leverage machine learning for future availability predictions.

axios-react chakra-ui-react data-science met metrics ml numpy pand python3 react sklearn tailwindcss typescript

Last synced: 11 Apr 2026

https://github.com/dane-meister/machine-learning-algos

Repository of several ML algorithms implemented for different environments and robot control vectors.

colab-notebook gymnasium jupyter-notebook machine-learning-algorithms numpy opencv pybullet python pytorch

Last synced: 11 Apr 2026

https://github.com/yashsonaar/machine-learning-tasks

This repository has machine learning tasks which include classification, recommendation system, fraud detection system

classification jupyter-notebook machine-learning numpy pandas prediction python scikit-learn testing

Last synced: 04 Apr 2026

https://github.com/chirindaopensource/high_dimensional_matrix_variate_diffusion_index_models

End-to-end Python implementation of Ma et al.'s (2025) matrix-variate diffusion index models for macroeconomic forecasting. Features α-PCA factor extraction, supervised screening, and ILS estimation for high-dimensional forecasting with preserved structural information.

diffusion-index dimension-reduction econometrics factor-models financial-modeling high-dimensional-statistics macroeconomic-forecasting matrix-factorization monte-carlo-simulation numpy principal-component-analysis python quantitative-finance research-replication scientific-computing statistical-computing statistical-modeling supervised-learning time-series-analysis time-series-forecasting

Last synced: 29 Apr 2026

https://github.com/nickklos10/league-of-legends-match-predictor

Predicting League of Legends Match Outcomes Using Logistic Regression and PyTorch.

jupyter-notebook matplotlib numpy pandas pytorch scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/yasinefeee/parkspotter_pretest_enviroment

The ParkSpotter project is designed to detect the occupancy status of parking spots in a simulation environment. Using a toy model, a camera system, and a machine learning model, this system identifies whether a parking space is EMPTY or NOT EMPTY in real-time.

ai-systems classifier computer-vision cv2 numpy opencv parking-spot-detection parking-spots python sckit-image sckit-learn simulation-environment svm svm-classifier

Last synced: 18 Apr 2026

https://github.com/yahiazakaria445/sentiment-analysis-using-lstm

A sentiment analysis NLP project using an LSTM model

matplotlib nltk numpy pandas scikit-learn seaborn tensorflow

Last synced: 11 Apr 2026

https://github.com/alaazameldev/text-based-search-engine

Implementation of a search engine using TF-IDF and Word Embedding-based vectorization techniques for efficient document retrieval

chromadb fastapi gensim-word2vec nltk numpy precision-recall python scikit-learn tf-idf-vectorizer

Last synced: 20 Jan 2026

https://github.com/ashraf-khabar/bank-marketing-data-analysis

This project is focused on analyzing bank marketing data using PyTorch, pandas, numpy, and scikit-learn. The goal is to build a predictive model that can help identify potential customers who are more likely to subscribe to a bank's term deposit.

data-cleaning data-science data-visualization dataset deep-learning deep-neural-networks feedforward-neural-network learning neural-networks numpy pandas python pytorch sklearn

Last synced: 09 Apr 2026

https://github.com/jaygaha/ai-mastery-roadmap

A collaborative AI learning roadmap. I'm documenting my journey from grasping the fundamentals of Python to mastering Machine Learning. Feel free to fork and contribute — let's build our AI skills together!

artificial-intelligence beginners-guide data-science deep-learning education numpy python python-programming self-taught tutorial

Last synced: 18 Jan 2026

https://github.com/collins-kimotho/wrangle-and-analyze-project

Data Wrangling and Analysis Project: Analyzing WeRateDogs Twitter Account Data

explanatory-data-analysis exploratory-data-analysis matplotlib numpy pandas python twitter twitter-api

Last synced: 11 May 2026

https://github.com/pb319/california_house-price-prediction

This is going to be my first end-to-end ML project implementation, covering all required stages and taking guidance from the book "Hands On Machine Learning".

evaluation-metrics hyperparameter-tuning jupyter-notebook kfold-cross-validation machine-learning matplotlib numpy pandas python scikit-learn seaborn train-test-split

Last synced: 11 Apr 2026

https://github.com/shahzaibdev355/house-recommendation-backend

House Recommendation Backend is a Flask-based API that provides house recommendations based on user input. It includes endpoints to check the backend's functionality, fetch state data, and get personalized house recommendations based on criteria like price, property size, number of beds, number of baths, and state.

flask flask-api flask-cors numpy pandas scikit-learn tensorflow

Last synced: 05 Jan 2026

https://github.com/nikhil-donthusaram/heartdiseaseprediction

Heart Disease Prediction App is a machine learning web application that predicts the likelihood of heart disease based on user medical inputs. Built using a Decision Tree Classifier and deployed with Streamlit for an interactive, user-friendly interface.

data-analysis descision-tree joblib jupyter-notebook machine-learning matplotlib numpy pandas python3 seaborn sklearn streamlit vscode

Last synced: 11 Apr 2026

https://github.com/lorenzorottigni/dl-lending-club

Deep Learning python bootcamp: deep learning on Lending Club dataset

deep-learning ipynb keras machine-learning numpy pandas python scikit-learn seaborn tensorflow

Last synced: 11 Apr 2026

https://github.com/bhuvilol/objdetect_img

Object detection from an image input

numpy opencv python3 pytorch yolov8

Last synced: 04 Feb 2026

https://github.com/batthulavinay/which-anime-dominates-the-world

This Kaggle notebook analyzes the global popularity of anime across different countries. Using data visualization and statistical insights, we explore which anime dominate in various regions and uncover trends in global anime consumption.

data-visualization exploratory-data-analysis matplotlib numpy pandas python seaborn

Last synced: 24 Mar 2025

https://github.com/thorbeorn/epsi-m1-datamanagement-etude-de-cas

This project implements a complete HR data processing pipeline following a company merger

dashboard duckdb epsi etl kaleido numpy pandas plotly python pyyaml

Last synced: 11 Apr 2026

https://github.com/vishal-verma-96/Honey_Production_Analysis_using_Python

In this project, I analyzed U.S. honey production from 1998 to 2021 across different U.S. cities, explored yields, and examined production values using Matplotlib and Seaborn visualizations.

matplotlib numpy pandas python-3 python-visualization seaborn

Last synced: 19 Apr 2025

https://github.com/shudhanshusaurabh001/super_market-data-analysis-using-python

This project focuses on analyzing supermarket sales data using Python. The goal is to extract meaningful insights from the dataset, such as sales trends, customer purchasing behavior, and product performance.

analysis csv data insights matplotlib numpy pandas project python seaborn

Last synced: 06 Apr 2026

https://github.com/nirmaldeepponnada/codeclauseinternshipproject1

This project involves Customer Segmentation using K-Means clustering to group customers based on Recency, Frequency, and Monetary (RFM) analysis from the Online Retail dataset. It also performs Sentiment Analysis on Amazon Product Reviews using Natural Language Processing techniques & Logistic Regression to classify reviews as positive or negative.

kmeans logistic-regression numpy pandas python3 regular-expressions scikit-learn tf-idf-vectorizer

Last synced: 11 Apr 2026

https://github.com/zhanziyuan/pixelpuzzle-web

Encode/decode images using Base64 or shuffle/recover the pixels of images.

anime encryption image image-processing numpy python

Last synced: 11 Apr 2026
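
The pixel shuffle/recover idea in the pixelpuzzle-web entry above can be sketched with a seeded permutation. This is only an illustration under stated assumptions, not the repository's actual method: the function names `shuffle_pixels` and `recover_pixels` and the use of an integer seed as the key are hypothetical.

```python
import numpy as np

def shuffle_pixels(img, seed):
    """Reorder the pixels of an (H, W) or (H, W, C) array with a seeded permutation."""
    h, w = img.shape[:2]
    perm = np.random.default_rng(seed).permutation(h * w)
    flat = img.reshape(h * w, -1)          # one row per pixel
    return flat[perm].reshape(img.shape)

def recover_pixels(shuffled, seed):
    """Undo shuffle_pixels by regenerating the same permutation from the seed."""
    h, w = shuffled.shape[:2]
    perm = np.random.default_rng(seed).permutation(h * w)
    flat = shuffled.reshape(h * w, -1)
    restored = np.empty_like(flat)
    restored[perm] = flat                  # put each pixel back at its source index
    return restored.reshape(shuffled.shape)

# Tiny synthetic "image" (2 x 4 pixels, 3 channels) used only for the demo.
img = np.arange(24, dtype=np.uint8).reshape(2, 4, 3)
scrambled = shuffle_pixels(img, seed=42)
restored = recover_pixels(scrambled, seed=42)
print(np.array_equal(restored, img))
```

Recovery works only when the same seed is supplied, so in this sketch the seed plays the role of a shared key.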

https://github.com/engichang1467/cnn-sign-language

Built a Convolutional Neural Network trained to recognize sign language

kaggle keras matplotlib numpy pandas sklearn tensorflow

Last synced: 14 Apr 2026

https://github.com/bagusperdanay7/fcc-da-mean-variance-standard-deviation-calculator

One of the Data Analysis with Python (freeCodeCamp) tasks: a Mean-Variance-Standard Deviation Calculator.

data-analysis freecodecamp-project numpy python

Last synced: 06 May 2026
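
A calculator like the one named in the entry above might look roughly like the following. This is a sketch under assumptions: the helper name `summarize`, the nine-number input, and the dictionary layout (per column, per row, overall) are illustrative choices and are not taken from the repository.

```python
import numpy as np

def summarize(values):
    """Return mean, variance and standard deviation of nine numbers as a 3x3 matrix.

    Hypothetical helper for illustration; each entry holds
    [per-column values, per-row values, value over all elements].
    """
    if len(values) != 9:
        raise ValueError("expected exactly nine numbers")
    m = np.array(values, dtype=float).reshape(3, 3)
    return {
        "mean": [m.mean(axis=0).tolist(), m.mean(axis=1).tolist(), float(m.mean())],
        "variance": [m.var(axis=0).tolist(), m.var(axis=1).tolist(), float(m.var())],
        "standard deviation": [m.std(axis=0).tolist(), m.std(axis=1).tolist(), float(m.std())],
    }

stats = summarize([0, 1, 2, 3, 4, 5, 6, 7, 8])
print(stats["mean"])
```

Note that `var` and `std` here use NumPy's default population formulas (`ddof=0`).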

https://github.com/prashhhant213/cardioflex-treadmill-analysis-using-descriptive-statistics-probability

Descriptive analysis and visualization of CardioFlex Treadmill data to provide insights and recommendations to improve their userbase.

colab-notebook numpy pandas probability python stats

Last synced: 11 Apr 2026

https://github.com/dharma-acha/resnet18_imageclassification_cnn

In this part of the project, we implement ResNet-18 from scratch using PyTorch and train it on an image dataset to achieve over 75% accuracy. We apply techniques to prevent overfitting and optimize performance, aiming for an accuracy of 80% or higher.

matplotlib numpy python3 pytorch scikit-learn seaborn

Last synced: 11 Apr 2026