An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/lijesh010/employeeanalysis-

This is a simple data analysis project developed as part of my Entri Elevate Data Science and Machine Learning course. It serves as a practical demonstration of my skills in data analysis.

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/rizkyrajitha/ann_101

A really simple neural network for binary classification written from scratch using python

jupyter-notebook numpy pandas

Last synced: 08 May 2026

https://github.com/bhushan148/ipl-stats-analysis

This project explores IPL data to show team and player performances. 📈 Key highlights include analyzing match results, highest scores, and performance at different venues. 🏟️ It uses charts to reveal trends like win margins and average scores by venue. 📉 Discover important insights into what affects match outcomes and team success.

jupyter-notebook markdown matplotlib-pyplot numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/constraintautomaton/client_pgu1

The client part of the PgU1 project it. It command the robot and do the video processing

numpy opencv paramiko pyqt5 python3 raspberry-pi-3

Last synced: 12 Apr 2026

https://github.com/akashkg03/facial-expression-image-classification

This notebook involves to build a facial expression image classifier which categorizes facial expressions into one of seven emotions: anger, disgust, fear, happiness, sadness, surprise, and neutral.

jupiter-notebook numpy pandas python

Last synced: 28 Apr 2026

https://github.com/armahdavi/data_pipeline_analytics_statistics_ml_pm_psd_residential_qff

Sharing all the data pipelines and processing codes, statistical modellings, descriptive statistics, plot visualizations, and machine learning from Mahdavi & Siegel (2021) (Indoor Air) Project Miestone: 2017 - 2020 Full-length article: https://onlinelibrary.wiley.com/doi/abs/10.1111/ina.12782

data-science data-visualization dust hvac indoor-air-quality jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats spyder spyder-python-ide statistics

Last synced: 11 Apr 2026

https://github.com/sonatipek/matris_transactions_py

My "Python Programlama-I" lesson's homework. Some matris transactions with numpy library in Python

matrix numpy numpy-arrays python sonatipek

Last synced: 24 Mar 2025

https://github.com/sothulthorn/stock-prediction-portal

The Stock Prediction Portal is a full-stack web application that demonstrates how to use machine learning models to analyze stock market data and forecast future trends.

axios django django-rest-framework javascript keras matplotlib numpy pandas python react tensorflow yfinance

Last synced: 02 Apr 2026

https://github.com/akashkg03/feynn-labs-internship

This repository contains the projects I did as a Data Science intern with Feynn Labs.

data-science machine-learning matplotlib numpy pandas python scipy seaborn

Last synced: 07 Jan 2026

https://github.com/prarthana-singh/potato-disease-classification-using-deep-learning

🌱 Potato Disease Classification using Deep Learning – A deep learning-based model to detect and classify potato plant diseases using image processing and CNNs. Built with TensorFlow/Keras & Python.

agriculture cnn deep-learning image-classification keras machine-learning neural-network numpy pandas plant-disease-detection python3 streaming tensorflow

Last synced: 12 Apr 2026

https://github.com/yahiazakaria445/image-classification-using-vgg-16

computer vision model for 10 classes classification

matplotlib numpy opencv pandas sklearn tensorflow

Last synced: 12 Apr 2026

https://github.com/otuemre/housepricingml

A machine learning project predicting house prices using regression models. Covers data preprocessing, feature engineering, and model comparison to achieve accurate results. Developed for a Kaggle competition, focusing on effective ML workflows and model interpretability.

eda encoding evaluation-metrics kaggle-competition lightgbm-regressor machine-learning matplotlib-pyplot neural-networks numpy pandas preprocessing python ridge-regression scikit-learn seaborn tensorflow xgboost-regression

Last synced: 13 Apr 2026

https://github.com/ericchw/youth_discord_nlp_chatbot

A python AI chatbot with emotion detection model. Frontend using PHP, API using Flask and database using PostgreSQL. Collaborate with CyberYouth from SJS. @HKMU 2022-2023 FYP

ai bootstrap chatbot chinese-nlp docker emontion-detection flask jquery machine-learning nlp nlp-machine-learning nltk numpy pgsql php postgresql pycord torch transformers

Last synced: 21 Jan 2026

https://github.com/ahmedabdalkreem/animals_classification

This project is dependent to make classification between faces of cats, dogs and wild_animals and use multiple model such as CNN and pretrained model like ResNet and MobileNet and show the result of all model in the image using heatmap to know what is the part that model take decision when see it.

classification cnn heatmap matplotlib mobilenet numpy opencv pandas pretrained-models python restnet

Last synced: 12 Apr 2026

https://github.com/sandipkushwaha20/brain_tumour_detector

This project utilizes deep learning to detect and classify brain tumors from MRI images. The primary objective is to determine whether a tumor is present and, if so, classify its type with high accuracy.

deeplearning keras numpy pandas python tensorflow

Last synced: 12 Apr 2026

https://github.com/mairagalvao/steak_risk_data

A data analysis based on correlations and comparisons among the samples.

database matplotlib numpy pandas python3 sql

Last synced: 12 Apr 2026

https://github.com/chenshuliu/python-guide-to-numpy

This repository is dedicated to explaining the functionalities of Numpy in Python

data-structures numpy python

Last synced: 18 Jan 2026

https://github.com/2kabhishek/ml-primer

Basic Machine Learning & Data Science in Python/ 📈🐍

jupyter-notebook machine-learning matplotlib numpy pandas python

Last synced: 12 Apr 2026

https://github.com/liviuxyz-ctrl/financialportofoliosimulator

FinancialPortfolio is your financial planning ally, helping you manage portfolios, calculate future values, and visualize performance, all with user-friendly ease🚀

matplotlib numpy python simulation

Last synced: 12 Apr 2026

https://github.com/ishtufinquant/portfolio-optimizer

Optimizes stock portfolios using simulation and real-world constraints (SLSQP).

backtesting finance numpy pandas portfolio-optimization python quant scipy yfinance

Last synced: 12 Apr 2026

https://github.com/subhamsarangi/pdnp

Data Analysis using Python

matplotlib numpy pandas python3

Last synced: 12 Apr 2026

https://github.com/ben854719/wasde-revenue-in-2024

The project will use AI to analyze the revenue from the dataset of the World Agricultural Supply and Demands Estimates.

gemini langchain large-language-models matplotlib numpy pandas-python

Last synced: 12 Apr 2026

https://github.com/purcellcjp/credit-risk-classification

This project utilized Python and scikit-learn libraries to train and evalute a Machinge Learning model based on loan risk.

machine-learning numpy pandas-dataframe python scikit-learn

Last synced: 12 Apr 2026

https://github.com/llwusill/firepredictioncnn

CNN kullanarak Orman Yangını Tahmin/Tespit Sistemi

matplotlib numpy opencv-python pandas sckiit-learn seaborn tensorflow

Last synced: 12 Apr 2026

https://github.com/renukadhule/crop_production_python_analysis

This project analyzes crop production data in India from 1997 to 2015, identifying trends, seasonality, and top-performing states and crops. The insights aim to guide improvements in agricultural productivity and policy decisions.

matplotlib numpy pandas python

Last synced: 12 Apr 2026

https://github.com/frolov-andrey2405/prime-numbers-and-polar-coordinate-system

Prime numbers and polar coordinate system

matplotlib numba numpy python

Last synced: 01 May 2026

https://github.com/dahsie/machine_learning_from_scratch

This project aims to implement some machine learning basic techniques(e.g. MinMaxScaler, StandardScaler, TD-IDF, PCA, Logistic Regression, LDA, KNN, Naive Bayes Classifier) using only pyton, numpy and pandas. This will enable me to have hone my data scientist skills

classification clustering data data-processing datascience machienlearning nlp nltk numpy pandas python regression

Last synced: 04 May 2026

https://github.com/asuquoaa/cdc_2017_immunization_data_analysis

This project analyzes data from the CDC's 2017 National Immunization Survey Public Use File (NISPUF17) to answer research questions about immunization, child health, and vaccine-related behaviors.

numpy

Last synced: 12 Apr 2026

https://github.com/elifftosunn/data-labeling

Bu araç veri kazıma aşamasından sonra verilerin istenilen kategoriye göre etiketlenmesi ve kullanılmayacak olan verilerin database'ten silinmesi ile makine öğrenmesi modellerini kullanmak için etiketleme adımınını otomatikleştirmek amaçlı yapılmıştır.

base64 kaleido numpy os pandas plotly postgresql psycopg2 psycopg2-binary time

Last synced: 12 Apr 2026

https://github.com/testetson22/abelian_sandpiles

Simple App that implements, builds and tests an approximation of the Abelian sandpile model

github-actions numpy pytest

Last synced: 12 Apr 2026

https://github.com/nambukeerthi/youtube_dataproject

Build a Streamlit App – A user-friendly interface. Use Google API – Fetch YouTube channel data. Store in SQL Database – Save channel details for easy access. Search & Join Tables – Allow users to query and view data

api numpy pandas python sql streamlit

Last synced: 09 Apr 2026

https://github.com/yosrak5/predictive_maintenance

End-To-End Predictive Analytics to predict Hardware Maintenance Before it occurs using data preprocessing and Machine Learning classification modeling (XGBOOST , Random Forest )

data-factory data-perspective machine-learning-workbench matplotlib numpy pandas python random-forest seaborn xgboost

Last synced: 25 Feb 2026

https://github.com/daksh777/expressando

Expressando is a rock, paper, scissors game made using Python (TensorFlow, Keras and OpenCV)

keras keras-tensorflow machine-learning numpy opencv opencv-python tensorflow

Last synced: 13 Apr 2026

https://github.com/bachboy0/montecarlo

Python-based Monte Carlo simulations for statistical analysis and modeling.

monte-carlo-methods numpy python

Last synced: 21 Apr 2026

https://github.com/ayhem18/tinyad

A simple Python library to better understand automatic Differentiation

automatic-differentiation mathematical-programming numpy python3

Last synced: 29 Apr 2026

https://github.com/companyakis/numpy

Python Numpy

data-science numpy python

Last synced: 04 Sep 2025

https://github.com/jim-by/tweet-sentiment-analysis

Sentiment analysis of tweets using TextBlob for labeling and RandomForest for classification.

nltk nltk-tokenizer numpy pandas python random-forest-classifier sklearn textblob-sentiment-analysis

Last synced: 13 Apr 2026

https://github.com/ellennewerkla/apriori-analise_cesta_de_mercado

Análise de Cesta de Mercado (Market Basket Analysis) utilizando o algoritmo Apriori para identificar regras de associação entre produtos em um conjunto de transações de supermercado.

apriori-algorithm jupyternotebook marketbasketanalysis mathplotlib mba numpy pandas python3

Last synced: 13 Apr 2026

https://github.com/karanyeole/movie-rating-analysis

analyzing movie ratings from various sources to gain insights into audience preferences, trends, and patterns in movie ratings. The project utilizes data analysis and visualization techniques to explore and interpret movie rating data, providing valuable information for movie enthusiasts, filmmakers, and industry professionals.

matplotlib numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/rkarahul/ok.win-big-small-predictor

Predict the next “Big” or “Small” outcome on the OK.Win lottery-style game using OCR + time-series features + ML.

joblib numpy opencv-python paddleocr paddlepaddle paddlepaddle-gpu pandas python sckit-learn

Last synced: 13 Apr 2026

https://github.com/miss-mhv/udacityproject_dataviz

Data analysis of a bicycle rental system. The goal is to understand how trips are made according to age, time of year, day of the week, time of day

jupyter-notebook matplotlib numpy pandas python3 seaborn-plots

Last synced: 13 Apr 2026

https://github.com/kartikeya443/handgesture_vol_control

This OpenCV project enables remote control of sound playback using wireless communication. It lets users adjust the master volume of the central device using finger gestures, enhancing convenience and user experience.

mediapipe numpy opencv pycaw wireless-communication

Last synced: 13 Apr 2026

https://github.com/mahdibehoftadeh/polynomial-regression-co2-emissions

A simple machine learning polynomial regression using a large dataset to learn and predict CO2 emission of a car by its built features like engine size and cylinders

machine-learning matplotlib numpy nural-network pandas polynomial-regression python scikit-learn

Last synced: 22 Feb 2026

https://github.com/goessl/adjugate

A package for calculating submatricies, minors, adjugate- and cofactor matrices.

adjugate cofactor determinant inverse linear-algebra matrix minor numpy python submatrix

Last synced: 07 Jan 2026

https://github.com/debjyotisaha/web-application-projects

Web Applications developed with the help Python libraries and ML algorithms

algorithms cufflinks machine-learning matplotlib numpy pandas prediction python scikit-learn seaborn sklearn streamlit

Last synced: 13 Apr 2026

https://github.com/athul64/tmdb-dataset-analysis

This data set contains information about 10,000 movies extracted from TMDB. The dataset contains movies from 1960 to 2015. Including user ratings and revenue. Original data from Kaggle.

data-visualization dataframe eda numpy pandas python

Last synced: 14 Apr 2026

https://github.com/cezlul/analyse-ventes-immobilier

Solution ML d'analyse immobilière parisienne : classification automatique appartements vs commerces (K-means, 91%) et prédiction prix (régression, R²=0.98) sur 26K transactions. Valorise portefeuille 169M€ avec recommandations stratégiques data-driven.

data-analysis jupyter-notebook machine-learning matplotlib numpy pandas python sklearn

Last synced: 13 Apr 2026

https://github.com/lexiortiz/ibm-data-engineering-fundamentals

Notes, exercises, and projects from the IBM Data Engineering Fundamentals path via Verizon Skill Forward.

data-engineering numpy pandas postegresql python sql

Last synced: 13 Apr 2026

https://github.com/bbhargavpanchal/brain-tumor-segmentation-unet

U-Net based Convolutional Neural Network for brain tumor segmentation using MRI images. Implemented in Python with TensorFlow, OpenCV, and Scikit-learn.

keras-tensorflow numpy opencv pandas python scikit-learn sklearn tensorflow

Last synced: 13 Apr 2026

https://github.com/oleksiilatypov/titanic_ml

Titanic Survival Prediction using ML and Flask Deployment.

data-science docker flask machine-learning numpy pandas skit-learn

Last synced: 13 Apr 2026

https://github.com/danhenriquex/data_science_and_machine_learning

A.I, Data Science & Machine Learning and Deep Learning with Tensorflow

deep-learning deep-neural-networks machine-learning matplotlib numpy pandas tensorflow

Last synced: 13 Apr 2026

https://github.com/athari22/investigating-netflix-movies-and-guest-stars-in-the-office

Apply basic Python skills in Introduction to Python and Intermediate Python by processing and visualizing film and television data.

data-analysis data-science data-visualization loop loops matplotlib matplotlib-pyplot netflix numpy office pandas python

Last synced: 11 Apr 2026

https://github.com/sanjaiveeran44/shopping-intent-analasys

This project predicts whether a visitor to an e-commerce website will make a purchase based on their browsing behavior. It includes data analysis, preprocessing, and machine learning modeling to understand online shopping patterns.

matplotlib matplotlib-pyplot numpy pandas python3 sklearn tensorflow

Last synced: 13 Apr 2026

https://github.com/eesunmoon/aml_project

[Project] Analyzing Fitness Patterns & Building Predictive Models

matplotlib numpy pandas python3 recommender-system scikit-learn seaborn tensorflow

Last synced: 13 Apr 2026

https://github.com/pinedah/sleep-data-analysis-exercise

Análisis de un dataset médico sobre el sueño, explorando duración, calidad y factores relacionados. Incluye limpieza de datos, EDA y visualizaciones con Python (pandas, numpy, matplotlib, seaborn, scipy).

data-analysis data-science escom numpy pandas python school-project scipy

Last synced: 13 Apr 2026

https://github.com/tharushiimasha/python-practice

This repository contains Python programs and exercises from practice sessions. It covers Python basics, file handling, and libraries like NumPy and pandas for data manipulation.

api-integration csv-files file-handling json-files jupyter-notebook numpy oops-in-python pandas python web-scraping

Last synced: 13 Apr 2026

https://github.com/darshanpakhale250-gif/customer-churn-prediction-ml

A machine learning project to predict customer churn using regression and classification models including logistic regression, decision tree, and random forest. Performed EDA, visualizations, and model evaluation. The dataset is taken from Kaggle and implemented in Google Colab.

colab-notebook customer-churn-analysis data-science decision-trees kaggle logistic-regression machine ml numpy pandas python random-forest

Last synced: 13 Apr 2026

https://github.com/kizman-23/supervised_models

Classical prediction of future data using models trained by labeled data

numpy pandas scikit-learn supervised-machine-learning

Last synced: 13 Apr 2026

https://github.com/debjyotisaha/data-analytics-projects-phase-1

A set of projects on Data Analysis using Python and its libraries

matplotlib numpy pandas python seaborn sklearn streamlit

Last synced: 13 Apr 2026

https://github.com/badranalyst/tips-dataset-analysis-dashboard-with-streamlit-and-plotly

Interactive Streamlit dashboard analyzing the Seaborn 'tips' dataset, which records information on restaurant bills, including total bill amounts, tips, customer demographics (e.g., gender, smoking status), and dining details (e.g., day, time). Visualized with Plotly for insights into tipping patterns.

data-analysis data-analytics data-visualization dataset eda exploratory-data-analysis matplotlib matplotlib-pyplot numpy pandas plotly python seaborn streamlit

Last synced: 13 Apr 2026

https://github.com/annygutierrez/bank-ann

Basic Artificial Neural Network

keras matplotlib numpy pandas sklearn

Last synced: 13 Apr 2026

https://github.com/anujdutt9/language-model-using-rnn

Text Generation using basic RNN from scratch using Python and Numpy.

language-model numpy python rnn text-generation

Last synced: 08 May 2026

https://github.com/somyaaaaaa2004/travelling-sales-man-problem

This project is a soft computing-based solution to the Traveling Salesman Problem (TSP) using a Genetic Algorithm (GA), implemented entirely from scratch in Python on Google Colab. The goal of the TSP is to find the shortest possible route that visits each city exactly once and returns to the starting point. It's a classic NP-hard combinatorial

genetic-algorithm jupyter-notebook matplotlib-pyplot numpy pandas python travelling-salesman-problem

Last synced: 06 May 2026

https://github.com/harmanveer-2546/tweets-cleaning-with-python

Twitter is one of the most used data sources for data analysis. The reason is that it’s open and free to collect unless you subscribe to the paid version one. Besides, it’s pretty simple to collect data from it.

filtering function-calling nlptk numpy os pandas punctuation python re removing-links tokenization tweet-cleaning twitter-data-analysis twitter-sentiment-analysis

Last synced: 10 Apr 2026

https://github.com/hootbu/cyberpunk-filter-desing

This project aims to design filters with a cyberpunk aesthetic by performing low-level image manipulation in Python.

computer-vision image-manipulation matplotlib numpy python

Last synced: 08 May 2026

https://github.com/csengupta1101/rock-paper-scissor-game

Rock Paper Scissor game built with Python 3. Jupyter notebook used as IDE. Code File , Read Me attached herewith.

game if-else-statements numpy python python3 random

Last synced: 11 May 2026

https://github.com/abideen-olawuwo/nyc-taxi

Prediction the duration of New York Taxi trip

linear-regression matplotlib numpy pandas python seaborn sklearn

Last synced: 11 Apr 2026

https://github.com/nikhilsree5/netflixcasestudy

Global Content Strategy: Leveraging Data Insights to Optimize Netflix's International Growth

eda numpy pandas python visualization

Last synced: 13 Apr 2026

https://github.com/alex1iv/asr_ru_numbers

Automatic Speech Recognition (ASR) system for Russian digits

audio-processing librosa numpy speech-recognition tensorflow

Last synced: 13 Apr 2026

https://github.com/alepm1990/nba-analytics

Análisis y trabajos en Python con BBDD deportivas

matplotlib-pyplot numpy pandas pyhon python3 seaborn

Last synced: 05 May 2026

https://github.com/paulo-santos-ds/machinelearninglogisticregression

A fraude de cartão de crédito é uma forma comum de fraude que afeta milhões de pessoas em todo o mundo. Essa fraude pode ter consequências graves, tanto para as vítimas quanto para as instituições financeiras que emitem os cartões de crédito.

math matplot numpy pandas python3 sklearn

Last synced: 13 Apr 2026

https://github.com/yuu-eguci/cognitive-services-trial

Try to play with Cognitive Services!! [Cognitive Services] [OpenCV] [Numpy]

cognitive-services dotenv numpy opencv-python pipenv python python3

Last synced: 05 Jul 2025

https://github.com/saifalibaig/covid-19-infection-rate-analysis-using-python

Analysis of Covid-19 Infection rate and the world happiness report to identify if there is any relationship between infection rate and happiness

data-analysis data-visualization jupyter-notebook numpy pandas python3 sns

Last synced: 18 Apr 2026

https://github.com/bhanuprakashnani/scit-tasks

Tasks for the SCIT Work Shop.

numpy python3

Last synced: 25 Apr 2026

https://github.com/kecnry/nparray

High-Level Wrappers for Building and Manipulating Numpy Arrays

numpy numpy-arrays python

Last synced: 08 Jan 2026

https://github.com/itshyphen/mass-mailing-script

A simple mass mailing script that sends personalized email to multiple emails importing from csv

numpy pandas python smtplib

Last synced: 13 Apr 2026

https://github.com/a-parida12/numericalmethods

Teaching Material for Workshop for Engineering Students

numpy ode pde python3

Last synced: 13 Apr 2026

https://github.com/tnleite/loan-approval-prediction

Este repositório apresenta um modelo preditivo de aprovação de empréstimos, focado em minimizar o risco de inadimplência. Utilizando EDA e algoritmos de machine learning (Random Forest, XGBoost), ajustamos o threshold para maximizar o recall de inadimplentes, contribuindo para uma gestão de riscos eficiente.

classification-algorithm data-science exploratory-data-analysis machine-learning-algorithms machine-learning-models matplotlib numpy scikit-learn scipy seaborn xgboost-classifier

Last synced: 13 Apr 2026

https://github.com/raeinlayeghpour/test-bench-for-signal-validation-simulated-3-phase-system

Automated Project - Test Bench for Signal Validation (Simulated 3-Phase System)

numpy pandas pycharm pytest python

Last synced: 13 Apr 2026

https://github.com/chandkund/wine-quality-prediction

This project predicts wine quality based on physicochemical properties using machine learning models. By leveraging Random Forest Classifier, Logistic Regression, and SVM, the goal is to classify wines into quality categories and uncover the key factors that influence wine quality.

logistic-regression matplotlib numpy pandas-python random-forest-classifier svm-classifier

Last synced: 01 May 2026

https://github.com/harmanveer-2546/credit-card-fraud-detection

The Credit Card Fraud Detection Problem includes modeling past credit card transactions with the knowledge of the ones that turned out to be a fraud. This model is then used to identify whether a new transaction is fraudulent or not. Our aim here is to detect 100% of the fraudulent transactions while minimizing the incorrect fraud classifications.

ann catboost-classifier eda lightgbm machine-learning matplotlib neural-network numpy pandas python random-forest seaborn xbg

Last synced: 13 Apr 2026

https://github.com/shibbir24/a-data-driven-approach-to-food-security-and-supermarket-accessibility

A Data-Driven Approach to Food Security and Supermarket Accessibility

data-analysis matplotlib numpy pandas python3 seaborn

Last synced: 13 Apr 2026

https://github.com/tromesh/attendance-predictor

Student Attendance predictor for Scheduling lecture sessions

matplotlib numpy pandas tensorflow

Last synced: 05 May 2026

https://github.com/dbriane208/python-for-data-science

Machine Learning and Data Science repository. Love crafting Machine Learning models.

data-analysis data-science data-visualization machine-learning numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/iv4n-ga6l/Simple-Image-Segmentation

Simple implementation of image segmentation using the Watershed algorithm

imagesegmentation matplotlib numpy opencv python watershed-algorithm

Last synced: 28 Apr 2025