An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/harmanveer-2546/movie-industry

Investigate the film industry to gain sufficient understanding of what attributes to success and in turn utilize this analysis to create actionable recommendations for companies to enter the industry.

business business-analytics data-analysis datatime film-industry graphs matplotlib movie-database numpy pandas python scraping-websites seaborn visualization web-scraping-python

Last synced: 10 Apr 2026

https://github.com/keizerzilla/como-funciona-meshgrid

Apontamento breve explicando como a função meshgrid do Numpy funciona.

meshgrid numpy

Last synced: 28 Apr 2026

https://github.com/lorenzorottigni/ml-iris-svm

Machine Learning python bootcamp: Support Vector Machines on iris flower dataset

ipynb machine-learning numpy pandas python scikit-learn seaborn support-vector-machines

Last synced: 10 Apr 2026

https://github.com/tdeepa20/heart-disease-prediction-using-ml

Machine Learning models to predict heart disease using patient medical data

datascience jupyer-notebook machinelearning numpy python sklearn

Last synced: 28 Apr 2026

https://github.com/hai4320/ml_ai_notebook

All my note about ML, AI and Data Science

ai machine-learning numpy pandas scikit-learn

Last synced: 28 Apr 2026

https://github.com/gregoritsch3/dl_cnn_resnet50_digitrecognition

A Deep Learning Computer Vision project on the MNIST Digit Dataset. The project demonstrates the use of four TensorFlow Neural Network architectures, ranging from a basic Shallow Sigmoid Model to a Deep Convolutional Model constructed using the FunctionalAPI, and even a modified Resnet50 Model. Includes Error Analysis and test runs on real images.

cnn convolutional-neural-networks cv2 keras matplotlib numpy resnet-50 tensorflow

Last synced: 28 Apr 2026

https://github.com/rajivaleaakash/customer-churn-prediction

A machine learning project focused on predicting customer churn using various data analysis and modeling techniques. The repository includes data preprocessing, feature engineering, exploratory data analysis (EDA), model training, evaluation, and visualization to help businesses identify customers at risk of leaving.

churn-prediction classification customer-churn data-analysis data-science gridsearchcv imblearn machine-learning numpy pandas pyhton randomsearchcv scikit-learn

Last synced: 28 Apr 2026

https://github.com/varakanthamsaihruthikreddy975/avocadopricesanalysis

This data analysis finds out the trends and analysis of avocado data by year

matplotlib-pyplot numpy pandas scipy scipy-stats seaborn sklearn-preprocessing

Last synced: 28 Apr 2026

https://github.com/abdeldjalilchafai/us-flight-delay-eda

Structured EDA on 2015 US flight delay data. Clean, reproducible notebook using a 6-step data analysis framework for real-world datasets.

data-analysis data-cleaning eda exploratory-data-analysis flight-delays kaggle matplotlib numpy pandas python seaborn

Last synced: 28 Apr 2026

https://github.com/udipta14/historical-olympic-games-eda-python

Exploratory Data Analysis of a Historical Olympic Games Dataset, including all the games from Athens 1896 to Rio 2016.

data-cleaning data-visualization eda matplotlib numpy pandas python3 seaborn

Last synced: 28 Apr 2026

https://github.com/khaymanii/credit-card-fraud-detection-model

This model was built using python and Logistic Regression Machine Learning algorithm

matplotlib numpy pandas python sckit-learn

Last synced: 09 May 2026

https://github.com/shreeparab1890/indian-elections-2019-analysis-eda

This ipython notebook is the Exploratory data analysis (EDA) of the Indian Lok Sabha Elections 2019.

data data-analysis data-science data-visualization eda exploratory-data-analysis matplotlib numpy pandas plotly python python3 visualization

Last synced: 28 Apr 2026

https://github.com/semanticdata/traffic-studies

Comprehensive traffic analysis dashboard for Crystal, Minnesota, built with Streamlit.

matplotlib numpy pandas plotly python seaborn streamlit

Last synced: 28 Apr 2026

https://github.com/manalisbhavsar/stock-price-prediction

Stock Price Prediction model using Machine Learning and LSTM to forecast future stock prices based on historical data. Achieved a low error rate of 3.2% by leveraging moving averages and deep learning techniques, ensuring accurate predictions.

data-analysis deep-learning lstm machine-learning matplotlib numpy pandas python

Last synced: 28 Apr 2026

https://github.com/aroojzahra908/machine-learning

This repository contains helpful guides and materials for easily understanding Pandas, NumPy, and Matplotlib libraries, which are essential tools for creating visual representations of data.

algorithms machine-learning-algorithms matplotlib numpy pandas

Last synced: 09 May 2026

https://github.com/redinfinitypro/drawimages

Rating: (8/10) The code uses Python's Turtle module to create a canvas image, converting it to grayscale, and then placing dots at each pixel's location based on its intensity.

numpy pil random turtle

Last synced: 28 Apr 2026

https://github.com/abdullah2020/hamoye_stageb

This is my Hamoye Stage B project. The project focuses on Predicting Energy Efficiency of Buildings. It implemented different Machine Learning algorithm technique that are not limited to Linear Regression, LASSO, Ridge etc.

eda lasso-regression linear-regression numpy pandas predictive-modeling regression-models ridge-regression rmse rsquare-values

Last synced: 09 May 2026

https://github.com/hanannazri/predictive-customer-churn-modelling-with-price-sensitivity-insights-

As a part of BCG Data Science Project, I developed a predictive churn-risk model for XYZ energy utility by engineering price, consumption and contract features and training XGBoost and Random Forest models to identify customers most likely to churn at current prices.

bcgx data-visualization exploratory-data-analysis feature-engineering jupyter jupyter-notebook model-training-and-evaluation numpy pandas random-forest scikit-learn xgboost

Last synced: 19 Aug 2025

https://github.com/ksprugevics/galton-board

Python implementation of the Galton board

galton-board numpy pygame pymunk python

Last synced: 29 Apr 2026

https://github.com/fimbres/numpy-start

A crash project with the most important numpy functions and methods, used in data science, AI and machine learning.

data-science numpy python python-script

Last synced: 29 Apr 2026

https://github.com/thekartikeyamishra/interactive-ai-model-trainer

A Google Colab notebook for interactively training simple AI/ML classification models. Features CSV upload, dummy data generation, feature/target selection, model choice, and basic evaluation. Includes a user-friendly UI. Ideal for educational purposes. See screenshots below!

google googlecolab ipywidgets matpl numpy pandas python scikit-learn seaborn

Last synced: 29 Apr 2026

https://github.com/rand-asswad/controle_optimal

Une étude d'un problème de contrôle optimal avec visualisation 3D

controle-optimal numpy optimal-control python sympy

Last synced: 29 Apr 2026

https://github.com/muazify/pyequationsolver

A Python script that solves mathematical equations for the variable 'x' using a hybrid approach, combining symbolic solving (for exact solutions) with numerical methods (for approximations when symbolic solving is insufficient).

equation-solver numpy python3 sympy

Last synced: 29 Apr 2026

https://github.com/sanjurajveer/market_basket_analysis

Finding pattern of bought objects through market basket analysis

apriori-algorithm association-rules numpy pandas python seaborn

Last synced: 09 May 2026

https://github.com/izhaan0/predict-marks-based-on-study-hours

Student Marks Predictor is a machine learning project that predicts a student’s exam scores based on the number of study hours. It uses Linear Regression to learn the relationship between study hours and marks, and provides both command-line and interactive Streamlit web interfaces for prediction and visualization.

data-visualization joblib jupyter-notebook machine-learning machine-learning-algorithms matplotlib-pyplot numpy pandas pandas-dataframe pickle python scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/bilgeswe/datascience

Using Statistics and Data Science Project in Python using Google Colab tools, CVS and XLSX

box-plot colab-notebook cvs data-science data-scraping data-visualization heatmaps numpy python statistical-analysis statistics xlsx

Last synced: 29 Apr 2026

https://github.com/w13b3/ssim-py

Structural Similarity (SSIM) index, where the only core dependency is NumPy

image-comparison numpy opencv-python pillow python3 ssim tested unittests

Last synced: 29 Apr 2026

https://github.com/dulajkavinda/document-similarity-ml

📑 Check similarity between documents using td-idf and cosine similarity.

machinelearning numpy python scikit

Last synced: 29 Apr 2026

https://github.com/romerorodriguezd/homography-with-opencv

Homography operations applied to insert images inside others regardless of the perspective.

computer-vision homography matrices-transformation numpy opencv opencv-python

Last synced: 29 Apr 2026

https://github.com/amithjoseph777/gourmethaven-case-competition

Predictive analytics case competition project from the Master of Science in Business Analytics course at the College of Business, University of Louisville. Our team analyzed customer response trends for Gourmet Haven, developed a predictive model, and ranked in the Top 5 out of 25 teams.

google-colab jupyter-notebook numpy pandas r scikit-learn

Last synced: 10 Apr 2026

https://github.com/giordanodaloisio/demv

Debiaser for Multiple Variables, a model- and data- agnostic method to improve fairness in binary and multi-class classification tasks

bias-mitigation numpy pandas python

Last synced: 29 Apr 2026

https://github.com/thaivincent/fft_image_compression

A simple image compression program using FFT and numPy

fft image-processing numpy

Last synced: 29 Apr 2026

https://github.com/fatimaafzaal/car_price_prediction

Trains Random Forest and Gradient Boosting models to predict car prices based on user inputs for various car attributes, evaluating models and making predictions using the best-performing model.

car-price-prediction ensemble-learning gradient-boosting matplotlib numpy pandas random-forest regression regression-models scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/ovsidee/psm

Fundamentals of Mathematical Simulations using Python

numpy python

Last synced: 29 Apr 2026

https://github.com/amulya20010418/zomato_data_analysis

Analyze Zomato restaurant data all the world and find the insights by using Python libaries and also visualize the dataset by using Power-bi

cleaning-data-in-python dashboard numpy pand powerbi python visualization

Last synced: 29 Apr 2026

https://github.com/kostadinlambov/algorithmic-trading-bot

The project aims to evaluate the predictive performance of different machine learning (ML) algorithms for Bitcoin trading. The proposed trading strategy integrates key technical indicators, including the Relative Strength Index (RSI), Simple and Exponential Moving Averages, and the Moving Average Convergence Divergence (MACD).

lightgbm machine-learning matplotlib mlflow numpy optuna pandas pickle random-forest scikit-learn scipy seaborn statsmodels xgboost

Last synced: 05 Apr 2026

https://github.com/matheusvazdata/retail-sales-forecast-linreg-sklearn

Minimal project for retail sales forecasting using linear regression (scikit-learn).

forecasting linear-regression machine-learning matplotlib numpy pandas scikit-learn

Last synced: 29 Apr 2026

https://github.com/s0fft/learning-lab

Code Notes & Test-Learn // Micro Pet-Projects: Python / Asynchrony / FastAPI / Django-Tastypie / DRF / Parsing / Telegram-Bot / SQL / Docker / DS / ML / etc.

asynchrony data-science django-rest-framework docker fastapi jupyter-lab jupyter-notebook mashine-learning matplotlib notes numpy pandas parsing python3 scikit-learn seaborn sql sqlalchemy tastypie telegram-bot

Last synced: 10 Apr 2026

https://github.com/divansh2025/python_notebook

Welcome to "Basics of Python", a repository designed to help you master the fundamentals of Python. 🚀

datavisualization matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/djdurga/upi-transactions-eda

The dataset provided encompasses a comprehensive record of Unified Payments Interface (UPI) transactions spanning the period from January 2023 to December 2023. Unified Payments Interface (UPI) has emerged as a prominent digital payment system in India, facilitating seamless fund transfers between bank accounts instantly.

matplotlib numpy pandas python

Last synced: 29 Apr 2026

https://github.com/sakshijain009/minor-project-ml-in-python

Exploratory Data analysis on data using python

matplotlib-pyplot ml numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/yimethan/basics-of-data-analysis

2023-2 Basics of Data Analysis

data-analysis numpy pandas python

Last synced: 29 Apr 2026

https://github.com/michaelzheng67/farmsa-programming-for-finance-spring2021

Source files for the programming for finance presentation (complements the material covered)

excel jupyter-notebook matplotlib numpy pandas python sql yahoo-finance-api

Last synced: 29 Apr 2026

https://github.com/raduldev/datasciencepython

Interview Task From CSIR-IGIB, Delhi & maps for journal

biopython colab-notebook counter jupyter-notebook matplotlib numpy pandas

Last synced: 09 May 2026

https://github.com/rebeccamorolong/mtn-customer-churn-prdiction

This project addresses a real-world business problem: predicting customer churn for MTN, a major telecommunications company. Customer churn significantly impacts profitability, and accurate predictions enable targeted retention efforts.

anova-test matplotlib-pyplot numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/atchayaah/vois_aicte_oct2025_atchayaa

Comprehensive data science analysis leveraging NYC Airbnb market data to derive actionable insights on optimal pricing models, key performance indicators (KPIs), and factors influencing host trust and guest experience optimization.

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/monddavila/online-retail-data-analysis

Online Retail Exploratory Data Analysis with Python

data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/harmanveer-2546/bird-species-prediction-using-deep-learning

Using convolutional neural networks to build and train a bird species classifier on bird pics data with corresponding species labels, also build GUI for the same.

3d-graph callback deep-learning eda gui gui-application image-generator imageclassification keras-tensorflow matplotlib maxpooling mobilenetv2 numpy opencv pillow plotly python seaborn transfer-learning visualization

Last synced: 29 Apr 2026

https://github.com/fantaso/data-analysis-and-manipulation-with-pandas

Project consists to analyse a publicly available movie dataset found in https://www.kaggle.com/beyjin/movies-1990-to-2017 and use Python tools like Pandas in order to get some initial insights about the dataset and finally proceeding to clean, transform and save a new version of the dataset in a better structure thinking about storing the data in a database.

matplotlib numpy pandas python3 seaborn

Last synced: 29 Apr 2026

https://github.com/hansalemaos/cv2pencil

Converts images to pencil sketches

filter numpy opencv pencil python

Last synced: 10 Apr 2026

https://github.com/shwetapardhi/assignment-04-simple-linear-regression-1

Q1) Delivery_time -> Predict delivery time using sorting time. Build a simple linear regression model by performing EDA and do necessary transformations and select the best model using R or Python. EDA and Data Visualization, Feature Engineering, Correlation Analysis, Model Building, Model Testing and Model Predictions using simple linear regressi

correlation-analysis data-visualization distplot eda feature-engineering model-building model-prediction model-testing numpy ols-regression p-value pandas python regression-plot rsquare-values seaborn simple-linear-regression smf statsmodel t-score

Last synced: 30 Apr 2026

https://github.com/ola-kaznowska/sigmanoid-function

Sigmanoid function. Written in Python 3.13.1 with matplotlib library and numpy

mathmatics matplotlib numpy python python3

Last synced: 30 Apr 2026

https://github.com/vedikasnehil/my-data-science-projects

This repository is a comprehensive collection of resources and implementations dedicated to the field of Data Science. It serves as a platform for exploring various aspects of data science, ranging from data preprocessing and exploratory data analysis (EDA) to machine learning and deep learning.

data data-science deep-learning machine-learning matplotlib numpy python sql visualization

Last synced: 10 Apr 2026

https://github.com/gill-singh-a/radial-velocity-of-star

A Program that approximates the number of exoplanets and their data (Mass, Radius of Revolution and Time Period of Revolution) present in a star system given the Radial Velocity of Star with noise

astronomical-data-analysis astronomy exoplanets fourier-transform matplotlib numpy physics python python-3 science stars

Last synced: 30 Apr 2026

https://github.com/khushi130404/gradient_realm

Gradient_Realm is a Python project exploring regression techniques and optimization methods like Regularization, Batch, Stochastic, and Mini-batch Gradient Descent. It uses scikit-learn and custom implementations for hands-on learning.

batch-gradient-descent mini-batch-gradient-descent numpy stochastic-gradient-descent

Last synced: 14 Jun 2026

https://github.com/tsar123/coffee-quality-analysis

machine learning using RandomForestRegressor and dataset with coffee data

coffee ml numpy pandas randomforestregressor regressor-models sklearn

Last synced: 30 Apr 2026

https://github.com/rijul007/smartwatch-data-analysis-using-python

Smartwatch Data Analysis to uncover insights into health and activity patterns using Python for data cleaning, exploratory analysis, and interactive visualizations.

data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas plotly python

Last synced: 30 Apr 2026

https://github.com/alirezasaharkhiz9/python-for-dataanalysis

The educational repository for data manipulation and analysis with the libraries numpy, pandas, and matplotlib.

matplotlib numpy pandas

Last synced: 30 Apr 2026

https://github.com/fikri-rouzan/student-stress-levels-classification

Proyek pemodelan machine learning untuk mengklasifikasikan tingkat stres mahasiswa berdasarkan parameter input akademik dan psikologis.

joblib jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn streamlit

Last synced: 08 Jun 2026

https://github.com/redinfinitypro/python-task-manager

Rating: (7/10): The Python Task Manager, is a user-friendly interface using tkinter library for system monitoring and management. It displays detailed system process information using psutil, with interactive features.

matplotlib numpy psutil pygetwindow tkinter

Last synced: 30 Apr 2026

https://github.com/ayushshrivstava/es654_prereqtest

Prerequisite test for the Machine Learning course ES654

jax matplotlib numpy pandas sympy

Last synced: 30 Apr 2026

https://github.com/danchik12/habr_scraping

Parsing Habr articles and saving in csv format

csv fake-useragent numpy pandas parser python3

Last synced: 30 Apr 2026

https://github.com/saro0307/unemployment-analysis

This data science project delves into unemployment trends, using data analysis and machine learning to identify key factors and predict future joblessness rates, aiding policymakers and businesses in informed decision-making.

data-science datavisualization machine-learning matplotlib numpy pandas python seaborn skit-learn

Last synced: 10 Apr 2026

https://github.com/pedasoft-consult/sentiment_analysis

This project involves analyzing customer reviews to classify them as positive or negative using Logistic Regression. The workflow includes text preprocessing, feature extraction, training a model, making predictions, and evaluating its performance.

nltk numpy pandas sklearn

Last synced: 30 Apr 2026

https://github.com/sayginsaman/hand-gesture-desktop-controller

A comprehensive Python application that transforms your webcam into a powerful desktop control interface using computer vision and hand gesture recognition. Control media playback, volume, scrolling, and navigation with simple hand gestures - no mouse or keyboard required!

chmod gesture gesture-control gesture-detection gesture-recognition gestures media mediapipe numpy opencv pip pyautogui python

Last synced: 30 Apr 2026

https://github.com/abhiram-epuru/ai-powered-movie-recommender

A Movie Recommendation System using the MovieLens dataset, leveraging collaborative filtering techniques such as KNN (k-Nearest Neighbors) and Singular Value Decomposition (SVD). The project demonstrates a personalized recommendation model capable of suggesting movies based on user preferences and interaction data.

matplotlib numpy pandas python sckiit-learn seaborn

Last synced: 30 Apr 2026

https://github.com/lmizner/codecademy_biodiversity

Using knowledge of data analysis to interpret data about endangered animals for the National Park Service

barplot jupyter-notebook matplotlib-pyplot numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/maximlevchenko/python-course-convolution-orb-detector-data-processing-2d-game

This repository contains a collection of projects developed during a university Python course. Each project explores different aspects of data processing, analysis, and computer vision.

2dgame bomberman-game computer-vision convolution harris-corner-detector image-processing keypoint-detection numpy pandas python statistical-analysis

Last synced: 30 Apr 2026

https://github.com/manorohith-03/attendance-system-using-face-recognition

The system leverages the Python programming language and several powerful libraries for image processing and face recognition. The ultimate goal is to provide a more efficient, accurate, and user-friendly attendance tracking solution.

face-recognition numpy opencv pandas python-3

Last synced: 30 Apr 2026

https://github.com/vrush9840/emotion-detection-from-tweets-using-dl

Emotion detection from tweets using Deep Learning (LSTM & BERT) to classify emotions such as Positive, Negative, Neutral, and Irrelevant with emoji recommendations.

glove-embeddings lstm matplotlib nltk numpy pandas python seaborn shap tensorflow

Last synced: 10 Apr 2026

https://github.com/vsrc/project-cardionaut

AI predictor for Heart Attack likelihood

numpy pandas pickle python streamlit

Last synced: 01 May 2026

https://github.com/sairupeshl/portfolio-risk-calculator

A portfolio risk engine using Python, integrating the Yahoo Finance API to ingest and preprocess historical multi-asset price data

numpy pandas python yfinance

Last synced: 08 Jun 2026

https://github.com/zsailer/np2d

Common 2-D Numpy operations

array numpy

Last synced: 01 May 2026

https://github.com/hghyhghy/notebook

Repo consist numpy , matplotlib and pymongo aggression

jupyter-notebook matplotlib numpy pymongo python-3

Last synced: 01 May 2026

https://github.com/ahmedsamir45/demograghic-data-analyzer

answer some questions about the data

demographic-data-analyzer numpy pandas

Last synced: 01 May 2026

https://github.com/vadimkantorov/fasttsv

TSV parser for Python in pure vectorized NumPy code

numpy parsing simd tsv vectorized-code

Last synced: 09 May 2026

https://github.com/abdoomohamedd/python-data-analysis-projects

A collection of data analysis projects that I have worked on. Each project involves cleaning data, performing exploratory data analysis (EDA), and creating visualizations to extract insights. The projects utilize various Python libraries, including pandas, numpy, matp

data-analysis data-analysis-python data-cleaning data-visualization matplotlib matplotlib-pyplot numpy pandas python

Last synced: 01 May 2026