An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/mostafa-ghorab/global-happiness-analysis

An analysis of global happiness rankings based on various factors like GDP, family support, health, and freedom from the World Happiness Report (2015-2017). This project provides data visualizations and statistical insights into how these factors influence happiness scores in different regions.

business-analysis data-analysis data-visualization matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/aymen016/leetcode-problems

This repo is designed to enhance your problem-solving skills through daily coding challenges, complete with clear explanations and well-structured solutions. 🎯

coding decision-making leetcode-python leetcode-solutions numpy problem-solving python

Last synced: 09 May 2026

https://github.com/thekartikeyamishra/customer-retention-predictor

The Customer Retention Predictor is a Python-based tool designed to help businesses predict customer churn using historical data. This project is particularly beneficial for small businesses and MSMEs in India, allowing them to identify customers at risk of leaving and take proactive measures to retain them.

joblib machine-learning numpy pandas python scikit-learn tinker

Last synced: 16 Apr 2026

https://github.com/shiqilu77/machinlearning-lol_gaming

a supervised model that predicts the match winning rate in LOL game based on the lineups of both sides. Implemented data processing by multiple python API including sklearn, NumPy, pandas, and by feature encoding, kNN, Synthetic Minority Oversampling Technique, etc

league-of-legends machine-learning numpy supervised-learning

Last synced: 16 Apr 2026

https://github.com/archish27/pythontutorial

Python Programming Tutorial for new geeks who want to learn python from scratch to deal with various applications

matplotlib numpy pandas pygame python python-2 python-3 scikit-learn soup

Last synced: 01 Apr 2026

https://github.com/shreeparab1890/indian-cricketer-classifier

This notebook is trying to bulia a model which will predict a Indian Cricketer based on the given image. In this project we have handled 8 Indian Cricketers and build a model to classify the given image between this 8 Cricketers.

image-classification matplotlib numpy opencv pandas python random-forest-classifier scikit-learn sklearn streamlit

Last synced: 01 Apr 2026

https://github.com/michaelgermini/python-3d-programming-book

Complete Python & 3D Programming Book: 30 chapters, 120+ examples, real-time ray tracing, ML graphics, OpenGL, NumPy

3d-graphics computer-graphics educational game-development graphics-programming machine-learning numpy opengl python ray-tracing tutorial

Last synced: 16 Apr 2026

https://github.com/parth-jatav/super-store-analysis-project

The Super Store Analysis project leverages Python libraries such as pandas, matplotlib, and numpy to perform a comprehensive analysis of a retail store's data. This project includes data cleaning, visualization, and statistical analysis to identify key trends, optimize inventory, enhance decision-making processes for improved business performance.

data-analysis matplotlib numpy pandas python super-store

Last synced: 12 Apr 2026

https://github.com/kganitis/bam-engine

High-performance Python implementation of the macroeconomic agent-based BAM model using ECS architecture

agent-based-modeling computational-economics macroeconomics numpy python

Last synced: 04 Mar 2026

https://github.com/naveen88112/healthcare

HealthCare Data Analysis and Forecasting This project examines healthcare data by processing missing values with KNN imputation, preprocessing features, and training classification models (Logistic Regression and Random Forest). The output includes performance metrics such as accuracy, confusion matrix, precision, recall, and ROC analysis.

data-visualization feature-engineering machine-learning model-evaluation numpy pandas python scikitlearn-machine-learning

Last synced: 12 Apr 2026

https://github.com/bishopce16/school_district_analysis

The school board requested an analysis on the various performance metrics for the school district.

data-analysis jupyter-notebook numpy pandas python visual-studio-code

Last synced: 16 Apr 2026

https://github.com/shreeparab1890/chat-analyzer

This project is a Data Analysis project to analyze the WhatsApp chats.

data-analysis numpy pandas python

Last synced: 12 Apr 2026

https://github.com/sumitkundu102022/ecobuddy

EcoBuddy: Personalized Carbon Footprint Tracker

bcrypt numpy pandas plotly pymongo python-dotenv requests streamlit

Last synced: 12 Apr 2026

https://github.com/supershivam5/python_projects

💻 Python programming with Numpy, Pandas, Matplotlib.🌟 Love exploring new technologies. Check out my projects!

matplotlib-pyplot numpy pandas scikit-learn seaborn

Last synced: 17 Apr 2026

https://github.com/kunalkumar2001/data-analyst-python-and-its-library

Python (Basics to Advance) and Its Library for Data Analyst

dataanalyst matplotlib numpy pandas python seaborn

Last synced: 17 Apr 2026

https://github.com/chandkund/movie-recommender-system

Unlocking the magic of personalized movie recommendations! This project uses the Cosine Similarity Algorithm to suggest movies based on user-selected titles, making movie discovery more enjoyable.

cosine-similarity matplotlib nltk numpy pandas-python seaborn

Last synced: 12 May 2026

https://github.com/rahulsm20/insurance-data

A data analytics project dealing with risk assessment and it's effects in health insurance.

data-analysis data-analytics machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/arrmansa/leetcode-nice-solutions

Some solutions for leetcode problems that I found fun - either the algorithm or using numpy bruteforce where it shouldn't be

leetcode leetcode-python leetcode-python3 leetcode-solutions numpy python python3

Last synced: 16 Mar 2025

https://github.com/gautam-cs/data_science

In this project, I was trying to get most of the relevent information of the data using library like numpy, pandas. I have worked on data like aadhar data, Olympics data

aadhar-data csv data-science json numpy olympics-data pandas python35

Last synced: 17 Apr 2026

https://github.com/shashwat9kumar/us-accidents-data-analysis

Analysis of the US accidents using the US-Accidents dataset (4.2 million entries) from Kaggle

accidents accidents-analysis data-analysis data-analytics data-visualisation data-visualization matplotlib numpy pandas python

Last synced: 17 Apr 2026

https://github.com/prashhhant213/customer-behavior-analysis-for-walmart-black-friday-sales

This project is a Walmart case study analyzing customer purchase behavior by gender and demographics to inform business decisions, especially around spending habits during events like Black Friday.

matplotlib numpy pandas python scipy seaborn

Last synced: 06 Jan 2026

https://github.com/ricobuilds/ml-roadmap

Opinionated roadmap to machine learning in 2023

conda huggingface machine-learning matplotlib numpy pandas python pytorch

Last synced: 06 Mar 2026

https://github.com/sujalk777/signal_systems_lab

This repository contains the assignments for the Signal Systems Laboratory course offered at IIT Jammu Autumn 24

jupyter-notebook librosa linux matplotlib numpy python raspberry-pi

Last synced: 12 Apr 2026

https://github.com/neerajcodes888/a-novel-used-car-price-prediction-model-based-on-lindenoise

Welcome to the LinDenoise Repository! LinDenoise offers a smart solution for cleaning noisy data in regression tasks. Integrated seamlessly within the widely-used scikit-learn framework, it effortlessly enhances data quality while improving predictive accuracy

car-price-prediction deep-learning ipynb-notebook machine-learning numpy pandas python3 visualization

Last synced: 06 Mar 2026

https://github.com/tnleite/credit-card-customer-clustering

Este repositório apresenta um projeto de segmentação e predição de clientes de cartões de crédito. Utilizando EDA, clusterização (K-Means) e machine learning, o objetivo é prever o grupo de novos clientes, apoiando estratégias de marketing personalizadas.

classification-algorithm clustering-algorithm clustering-analysis data-science exploratory-data-analysis kmeans-clustering logistic-regression machine-learning-algorithms machine-learning-models matplotlib numpy scikit-learn seaborn

Last synced: 07 May 2026

https://github.com/esraamhmd/intelligent-skin-cancer-image-based-detection-models

This project combines traditional machine learning approaches with advanced deep learning techniques to assist healthcare professionals in early diagnosis and improve patient outcomes.

ai deep-learning ham10000 isic kaggle machine-learning matplotlib numpy pandas python3 pytorch seaborn skin-cancer skin-detection sklearn time torchvision tqdm

Last synced: 02 Apr 2026

https://github.com/akshitvats026/heart_disease_prediction

An ML-based Heart Disease Prediction System that predicts the likelihood of heart disease based on user health parameters. Built using Python, Pandas, and Scikit-learn, the system performs data preprocessing, trains a predictive model, and provides real-time predictions with high accuracy.

accuracy-score logistic-regression machine-learning matplotlib-pyplot numpy pandas python scikit-learn

Last synced: 02 Apr 2026

https://github.com/jupfu/beck-view-digitalize

Digitalize 16mm films with Python & reactivex & FT232H

cython ft232h libpng multiprocessing numpy opencv python3 reactivex um232h

Last synced: 09 May 2026

https://github.com/pranavv34/customer-segmentation

Predictive modeling and customer segmentation project using neural networks to forecast sales and categorize customers for targeted marketing in the online retail sector.

keras matplotlib numpy pandas python seaborn sklearn tensorflow

Last synced: 02 Apr 2026

https://github.com/niteshchawla/clustering-ml

Analyzing the vast data of learners can uncover patterns in their professional backgrounds and preferences. Allowing Scaler to make tailored content recommendations and provide specialized mentorship.

cluster-analysis clustering hierarchical-clustering k-means-clustering machine-learning numpy pca-analysis visualisation

Last synced: 17 Apr 2026

https://github.com/prajapatsur/data-analytics-1

Wanted to revise all the basic concepts of the data cleaning, transformation and analysis, so worked on this project. Took a little bit help of AI tool to implement some fresh concepts.

matplotlib numpy pandas sklearn

Last synced: 17 Apr 2026

https://github.com/kernel-loophole/numerical-computing

Numerical Computing use python(Numpy,metplotlib).

numpy python quantum-computing

Last synced: 17 Apr 2026

https://github.com/rohitinu6/image_cartoonify

To Cartoonify an Image using opencv-python

image-processing matplotlib numpy opencv opencv-python python

Last synced: 12 Apr 2026

https://github.com/kiarashyavari/python_interview

Practice intermediate to advance topics of Python here

decorators generator multiprocessing multithreading numpy pandas python python3

Last synced: 17 Apr 2026

https://github.com/orliluq/inmersion-datos-python

Desarrollar modelos de machine learning para predecir la probabilidad de incumplimiento crediticio de los clientes, utilizando diferentes algoritmos de clasificación (Regresión Logística, Árboles de Decisión, Random Forest, Naive Bayes).

colab-notebook numpy pandas python scikit-learn

Last synced: 02 Apr 2026

https://github.com/omogbolahan94/lego-analysis

Analysis of LEGO data from 1949 to 2021.

matplotlib merge numpy pandas

Last synced: 09 May 2026

https://github.com/nikhilgugwad/sentiment-analysis

Sentiment analysis for the Kannada language to classify Kannada sentences into different emotions.

numpy pandas scikit-learn

Last synced: 17 Apr 2026

https://github.com/lmizner/codecademy_nba_trends

Investigate whether there is an association between two variables

chi2-contingency jupyter-notebook matplotlib numpy pandas pearson-correlation pyplot python seaborn

Last synced: 17 Apr 2026

https://github.com/sofiakhutsieva/analytical_report_for_producers_of_educational_programs

Подготовка аналитического отчета (Проект в skillbox )

analytics matplotlib numpy pandas seaborn

Last synced: 17 Apr 2026

https://github.com/vishal-verma-96/oil_spill_prediction_using_ml

This ML project predicts oil spills using various machine learning algorithms like XGBoost and Random Forest. This project also contains saving and load of the model to make predictions on a sample dataset.

data-science machine-learning-algorithms machine-learning-classification matplotlib model-building-and-evaluation numpy pandas pickle python-3 scikitlearn-machine-learning xgboost

Last synced: 17 Apr 2026

https://github.com/vikpires/ds_tips-dataset

Projeto individual do bootcamp de ciência de dados avanti 2024.2, com o objetivo de analisar e observar padrões no conjunto de dados "Tips".

data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn tips

Last synced: 17 Sep 2025

https://github.com/kaushik-puttaswamy/exploratory-data-analysis-using-python

This project applies Exploratory Data Analysis (EDA) using visualizations like box plots, scatter plots, and correlation matrices. It helps identify patterns, select ideal functions via least squares error, and map test data while evaluating deviations.

bokeh expolatory-data-analysis matplotlib numpy pandas python seaborn sqlalchemy

Last synced: 17 Apr 2026

https://github.com/urvee1810/market_basket_analysis

A data mining project analyzing Instacart's 3 million grocery orders to uncover customer shopping patterns and product associations. Using market basket analysis and the Apriori algorithm, the project reveals key insights about shopping behavior, product combinations, and temporal patterns, providing valuable recommendations for retail strategy

apriori-algorithm data-mining data-visualization machine-learning market-basket-analysis matplotlib mlxtend numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/shhiivvaam/regression

Linear Regression Model

linear-regression numpy python

Last synced: 10 May 2026

https://github.com/sayan520/titanic-data-insights

Conducting data analysis on Kaggle's Titanic: Machine Learning from Disaster dataset using essential data wrangling, exploratory data analysis (EDA), and visualization techniques to uncover insights, identify patterns, and explore factors influencing passenger survival.

jupyter-notebook kaggle matplotlib numpy pandas python scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/rosieoh/emergency_dataanalysis

오픈데이터분석-응급의료체계 방안 정책 제안 데이터 분석

ipython matplotlib numpy pandas python scikit-learn scipy

Last synced: 04 Apr 2026

https://github.com/sultanazhari/prediction-model-of-customers-leaving

Bank Beta Company focus on retain existing customers, our task is to create a model that predicts whether or not a customer will leave the bank soon.

accuracy-score confusion-matrix logistic-regression-algorithm matplotlib-pyplot numpy pandas python3 random-forest roc-auc-score seaborn train-test-using-sklearn

Last synced: 20 Apr 2026

https://github.com/noluyorabi/bravais-pearson-interactive

Bravais-Pearson Interactive – A web app for visualizing Pearson correlation. Generate datasets, analyze correlations, and explore regression lines, r², and p-values interactively.

data-visualization interactive-analysis matplotlib numpy pearson-correlation scipy seaborn statistics streamlit

Last synced: 17 Apr 2026

https://github.com/franciscomartinez45/social-network-analysis

Applied to analyze how misinformation propagates within communities. With the goal of addressing health disparities and improving health literacy particularly in minority populations, the project explores both supervised and unsupervised learning approaches to understand patterns in graph-structured data using a custom Graph Attention Network

matplotlib ml networkx numpy pytorch

Last synced: 09 May 2026

https://github.com/dhruvv1402/x-twitter-bot-python

A Python-powered Twitter bot for content generation, analytics, and automation using Twitter API v2.

huggingface-transformers matplotlib nltk numpy pandas pillow python requests schedule textblob threading tweepy twitter-api wordcloud

Last synced: 09 May 2026

https://github.com/datamallo/python101

The ressources you need to learn or get better in Python. Covering the basics of Python3, NumPy and Pandas for

numpy pandas python3

Last synced: 18 Apr 2026

https://github.com/akashash01/navie_bayes

It's an simple probabilistic classifiers based on applying Bayes theorem with strong independence assumptions between the features. Highly effective for text sequence oriented data(independent variables).

data-science machine-learning numpy pandas python sklearn supervised-learning

Last synced: 09 Apr 2026

https://github.com/yuvrajsaraogi/sales-prediction-using-python

Sales prediction involves estimating future product sales based on factors like advertising spend, target audience, and platform. Businesses rely on data scientists to forecast sales and optimize advertising costs. Machine learning in Python can be used for this task.

data data-analysis data-science data-visualization machine-learning matplotlib natural-language-processing numpy pandas prediction python sales-prediction-using-python sql

Last synced: 19 Apr 2026

https://github.com/jvictor011/big-five-personality-test

Usando machine learning para fazer o agrupamento de um sistema de traço de personalidade

matplotlib numpy os pandas python seaborn

Last synced: 07 May 2025

https://github.com/rishieeee/spam-email-classifier

A simple machine learning project that classifies emails as spam or ham using TF-IDF and a Multinomial Naive Bayes model. The project covers data cleaning, text preprocessing, feature extraction, model training, and evaluation. A great beginner-friendly introduction to NLP and ML workflows.

multinomial-naive-bayes numpy pandas python sckit-learn tf-idf

Last synced: 09 May 2026

https://github.com/awanraskall/retail-demand-analysis

Data analysis of retail meal orders, fulfillment centers, and product demand using Python

data-analysis data-visualization jupyter-notebook numpy pandas python

Last synced: 18 Apr 2026

https://github.com/aliyahscoding/tiny-recommender

Tiny item-item recommender in Python using cosine similarity over a toy user–item matrix. Simple CLI to print top-N recommendations.

collaborative-filtering cosine-similarity numpy pandas python recommender-system

Last synced: 09 May 2026

https://github.com/vansh-py04/data-analysis-questions-pandas-numpy-sql

Solution to 450+ Data Science Tech Stack questions essential for Data Analysts and Scientists!

data-analysis data-science deepnote machine-learning numpy pandas python sql

Last synced: 18 Apr 2026

https://github.com/pedroteixeiraw/variational_quantum_circuit_binary_classification

This project focuses on developing a Variational Quantum Circuit capable of performing Binary Classification between two classes: red wine and white wine, based on their characteristics using machine learning.

binary-classification cost-function json machine-learning matplotlib numpy pandas qiskit qiskit-machine-learning quantum-machine-learning scikit-learn training-data variational-circuit

Last synced: 04 Apr 2026

https://github.com/mnitin-reddy/a-b-testing-and-regression-analysis-for-ad-performance-optimization

Analyzed the performance of Facebook and AdWords ads using A/B testing and regression analysis to identify trends, correlations, and cost-effectiveness. Key insights included distribution of clicks and conversions, monthly trends, and cost-per-conversion analysis to optimize ROI.

abtesting data-science hypothesis-testing machine-learning matplotlib numpy pandas scikit-learn scipy seaborn statsmodels

Last synced: 04 Apr 2026

https://github.com/yashsonaar/machine-learning-tasks

This repository has machine learning tasks which include classification, recommendation system, fraud detection system

classification jupyter-notebook machine-learning numpy pandas prediction python scikit-learn testing

Last synced: 04 Apr 2026

https://github.com/yasinefeee/parkspotter_pretest_enviroment

The ParkSpotter project is designed to detect the occupancy status of parking spots in a simulation environment. Using a toy model, a camera system, and a machine learning model, this system identifies whether a parking space is EMPTY or NOT EMPTY in real-time.

ai-systems classifier computer-vision cv2 numpy opencv parking-spot-detection parking-spots python sckit-image sckit-learn simulation-environment svm svm-classifier

Last synced: 18 Apr 2026

https://github.com/jsnevt/regressao_linear

Regressão linear simples - Qual a velocidade se levou x distância para parar ?

linear-regression matplotlib numpy pandas python

Last synced: 18 Apr 2026

https://github.com/armanshadyan/bacteria-detection

Deep learning-powered tool for detecting and analyzing bacteria in microscope images. Designed for microbiology research, it identifies bacteria, counts them, and calculates coverage area with high precision. Supports real-time video processing, single-image analysis, batch processing, and model training.

mediapipe numpy object-detection opencv python pytorch

Last synced: 05 Apr 2026

https://github.com/murugavl/flower-prediction

Flower Prediction is a machine learning project that uses the Iris dataset to classify iris flowers into three species: Setosa, Versicolor, and Virginica. The project includes data analysis, model training with various algorithms, and deployment via a Flask web application for user-friendly predictions.

flask machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/ayushmaan-xd/exploratory-spotify-data-analysis

Exploratory Spotify Data Analysis is a project where I analyzed Spotify’s music dataset to uncover trends in audio features and song popularity. Using Python and data visualization tools

jupyter-notebook matplotlib numpy pandas python seaborn spotify-dataset

Last synced: 18 Apr 2026

https://github.com/chmohit91/Introduction-to-Python

A comprehensive collection of Python programming resources, from basics to advanced concepts. Built for learners, by learners

matplotlib numpy pandas plotly python python-library python3 seaborn

Last synced: 21 Jul 2025