An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/silbena/gibbssampler

Gibbs Sampler for a simple mortgage model.

gibbs-sampler gibbs-sampling numpy python

Last synced: 04 May 2026

https://github.com/secary/maths7027

Mathematical Foundations of Data Science

latex mathematics numpy pandas

Last synced: 04 May 2026

https://github.com/nordic96/lanedetection

Lane Detection Module using OpenCv and Python for educational purpose.

computer-vision lane-detection matplotlib-pyplot numpy opencv python3

Last synced: 04 May 2026

https://github.com/wwwmisla/gerador-mapas-calor

Sistema de visão computacional para gerar mapas de calor com base na movimentação em espaços públicos, auxiliando no planejamento urbano e uso eficiente do espaço.

color computer-vision demo google-colab gradio heatmap marchine-learning matplotlib model numpy opencv people-detection python smart-city ufrn visao-computacional vision-computer yolo yolov8

Last synced: 04 May 2026

https://github.com/saifalibaig/crop-yield-prediction

🌾 A machine learning-based crop production prediction system using historical Indian agricultural data with advanced regression models and hyperparameter tuning.

eda feature-encoding feature-selection kaggle-dataset lasso-regression linear-regression matplotlib-pyplot numpy onehot-encoding pandas python3 ridge-regression seaborn simpleimputer xgboost-regression

Last synced: 04 May 2026

https://github.com/rahatmoktadir03/customer-churn-prediction

A machine learning project for predicting customer churn, enabling businesses to identify at-risk customers and develop retention strategies.

business-analytics customer-churn-prediction data-science google-colab machine-learning numpy pandas python sklearn strreamlit xception-model

Last synced: 11 Apr 2026

https://github.com/abhijeet-bhambere/exploratory_data_analysis

A collection of several projects for conducting EDA on datasets from various sources

exploratory-data-analysis matplotlib numpy pandas python

Last synced: 04 May 2026

https://github.com/ronitjariwala/-prodigy_ds_01

Prodigy InfoTech Data Science Internship Task-1

matplotlib numpy pandas python seaborn

Last synced: 04 May 2026

https://github.com/pedasoft-consult/product_purchase_prediction

As a newly hired AI Engineer, your task is to predict customer behavior based on various features such as age, income, and gender. This exercise involves cleaning the data, training a decision tree model, and evaluating the model's performance to understand the key factors influencing customer purchasing decisions.

machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 04 May 2026

https://github.com/shamiul5201/selfie_app_features_application

OpenCV project to build skills in image processing: features include Cartoonify and Pencil Sketch effects in a Jupyter Notebook, blemish removal in a Python script, and chroma keying for videos. Perfect for exploring creative and practical computer vision techniques.

computer-vision numpy opencv python

Last synced: 04 May 2026

https://github.com/tonoy30/emotion-mask

A pytorch model for add face mask based on detected emotion.

emotion-detection numpy opencv-python python pytorch

Last synced: 09 May 2026

https://github.com/0nom4d/308reedpipes

Mathematical EPITECH Project - This project computes a representation of a reedpipe.

coverage epitech epitech-project matplotlib-pyplot numpy pytest python scipy

Last synced: 04 May 2026

https://github.com/swarup-ranjan-mondal/chat-analyzer

A streamlit app to analyze whatsapp personal and group chats.

matplotlib numpy pandas python seaborn streamlit

Last synced: 04 May 2026

https://github.com/aksoni07/movie-recommendation

A hybrid movie recommendation system designed to deliver personalized and accurate suggestions by combining user preferences, item attributes, and collaborative patterns, ensuring a seamless and engaging experience.

clustering content-based-filtering data-analysis embeddings jupyter-notebook numpy ollaborative-filtering pandas personalization python recommendation-systems scikit-learn user-item-interactions

Last synced: 11 Apr 2026

https://github.com/iamsaniasingh/heart_disease_prediction

This is my very first machine learning project, where I used a supervised learning algorithm—logistic regression—to predict heart disease. The model was trained and tested entirely on a pre-existing dataset, with no user input involved. The goal was to understand how ML models work and how they can be applied in healthcare predictions.

logistic-regression machine-learning machine-learning-algorithms numpy pandas python sklearn

Last synced: 11 Apr 2026

https://github.com/rtgrt5645/numpy-lab

🧮 Explore, manipulate, and visualize data with NumPy to enhance your Python skills in scientific computing and data analysis.

array-operations data-analysis data-science jupyter-notebook machine-learning numerical-computing numpy numpy-arrays numpy-library numpy-python python python3 scientific-computing

Last synced: 04 May 2026

https://github.com/mgitrov/lifespan-x-population-status

A machine learning project aiming to predict animals' lifespan and population status.

bs4 data-science machine-learning matplotlib numpy pandas python regular-expressions requests seaborn sklearn

Last synced: 11 Apr 2026

https://github.com/jakubfr4czek/apartment-prices-analysis

This repository contains an analysis of apartments prices in Krakow. It was made for "Rachunek prawdopodobieństwa i statystyka" (probability and statistics) course i took at AGH University of Science.

agh agh-wi analysis csv jupyter jupyter-notebook krakow numpy pandas price-analysis price-prediction probability probability-statistics python statistics statistics-learning

Last synced: 04 May 2026

https://github.com/manuethomas/credit-default-risk-analysis-eda

This repository contains the detailed EDA analysis of Home Credit Group Dataset. The analysis aims to find demographic and financial factors associated with higher or lower default risks, providing actionable insights for risk mitigation and improved lending practices

bivariate-analysis correlation-analysis data-preprocessing exploratory-data-analysis exploratory-data-visualizations matplotlib numpy pandas seaborn univariate-analysis

Last synced: 04 May 2026

https://github.com/swarnabhaghosh/house-price-prediction-model

Built an end-to-end regression pipeline to predict house prices using Linear Regression with automated preprocessing (PowerTransform, StandardScaling) via Scikit-learn's Pipeline and ColumnTransformer.

column-transformer linear-regression matplotlib-pyplot numpy pandas pipeline python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/abhinav330/911-emergency-calls-analysis

This Python Notebook analyzes emergency call data from the '911.csv' dataset. It uses various data visualization techniques to explore and gain insights into the emergency call data, including the types of calls, reasons for calls, and call patterns over time.

data-analysis data-science data-visualization eda exploratory-data-analysis exploratory-data-visualizations numpy pandas python

Last synced: 09 Jun 2026

https://github.com/ishaan28malik/masters-admission-analyser

Admissions of Indian students for foreign universities.

admission classification machine-learning matplotlib numpy python students

Last synced: 08 May 2026

https://github.com/daniel-brai/alx-t-x-udacity-data-analytics-nanodegree

A 3-month nanodegree going through the intricacies of data analysis with Python and its various libraries such Pandas, Numpy, matplotlib, seaborn and others

matplotlib numpy pandas python3

Last synced: 08 May 2026

https://github.com/kevinmbeaulieu/machine-learning-practice

ML models & data processing utilities written from scratch as practice.

data-science machine-learning numpy pandas

Last synced: 08 May 2026

https://github.com/chdl17/credit-eda-casestudy

Credit EDA is a GitHub repository designed to provide educational materials and tools for exploring credit risk data. The repository contains notebooks, datasets, and code samples to help users learn and apply these concepts to credit data.

exploratory-data-analysis numpy pandas

Last synced: 08 May 2026

https://github.com/arthurdsant/dataanalysis-agricultural_raw_material

This Python project performs analysis and visualization of agricultural raw material price data using a Kaggle dataset. Based on Jupiter Notebook and Python.

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 26 Jan 2026

https://github.com/youssefyaser/scrape-the-imdb-site-for-the-top-250-movies

Web scraping the top 250 movies in IMDB site.

data-analysis numpy pandas python

Last synced: 04 May 2026

https://github.com/hanzopgp/randomkagglesolving

Trying to build models to resolve some interesting Kaggle datasets thanks to deep learning.

artificial-intelligence deep-learning jupyter-notebook machine-learning matplotlib numpy personal tensorflow

Last synced: 04 May 2026

https://github.com/abhishekyadav915/e-commerce-sales-analysis

E-Commerce Sales Analysis is a data analysis project that explores sales data from an e-commerce platform to uncover insights and trends. The analysis includes visualizing sales performance, customer behavior, and product trends to help optimize business strategies and improve customer satisfaction.

matplotlib-pyplot numpy pandas-library plotly python3

Last synced: 04 May 2026

https://github.com/dyrits/parite

OPENCLASSROOMS | PERFECTIONNEZ-VOUS EN PYTHON

matplotlib numpy pandas python

Last synced: 04 May 2026

https://github.com/chathumiamarasinghe/nn-training-model

A comprehensive project for training neural networks to solve real-world problems. This repository includes customizable code for building, training, and evaluating neural network architectures using popular deep learning frameworks.

jupyter-notebook matplotlib numpy phyton scikit-learn

Last synced: 04 May 2026

https://github.com/mehradi-github/ref-jupyter-2510

using python in machine learning

matplotlib numpy pandas python sklearn statistics

Last synced: 11 Apr 2026

https://github.com/aqueeqazam/machine-learning-using-scikit

This repository contains all of the algorithms used to train the machine learning models using the Scikit library.

numpy scikit-learn

Last synced: 04 May 2026

https://github.com/amanyadav-07/customer-churn-prediction

Machine Learning project to predict customer churn using Logistic Regression, Random Forest, and XGBoost. Includes data preprocessing, feature engineering, SMOTE balancing, model training, evaluation, and business insights.

accuracy-metrics data-analysis data-visualization logistic-regression machine-learning matplotlib numpy pandas python3 random-forest-classifier seaborn sklearn xgboost-classifier

Last synced: 11 Apr 2026

https://github.com/asuquoaa/applied_data_science_with_python_study

Notes, codes and practicals on the Applied Data Science with Python specialization by UM

matplotlib numpy pandas seaborn

Last synced: 04 May 2026

https://github.com/kaungkhantkyaw1997/face_recognition

A Computer Vision tool designed to identify and recognize characters in images.

face-recognition flask numpy opencv

Last synced: 05 May 2026

https://github.com/mzayles/analise-descritiva.py

Curso Programação em Python para Data Science | Análise descritiva e mensuramento de dados de um report consolidado utilizando matplotlib.

matplotlib numpy pandas python

Last synced: 08 May 2026

https://github.com/siddhantborse/atmosviz

Atmos Viz is a Python-based project designed to analyze, visualize, and predict global temperature trends across various cities and countries using time-series analysis and advanced data science techniques. Leveraging historical climate data, this project integrates machine learning models, geospatial mapping, and interactive visualizations to unco

geopandas geospatial-analysis gis matplotlib numpy pandas plotly python scikit-learn seaborn shapefiles time timeseries-analysis timeseries-data

Last synced: 05 May 2026

https://github.com/lorenzorottigni/ml-missing-values

Machine Learning course of Piero Savastano 5: ColumnTransformer, SimpleImputer, numpy

column-transformer machine-learning missing-values numpy simple-imputer

Last synced: 05 May 2026

https://github.com/lin826/nanogpt-demo

Training and finetuning local GPTs.

gpt nanogpt numpy pytorch tqdm transformers

Last synced: 05 May 2026

https://github.com/satvikpraveen/numpymasterpro

A hands-on, production-ready toolkit to master NumPy — from first principles to real-world applications. Includes modular Jupyter notebooks, reusable utility scripts, cheatsheets, and advanced projects like K-Means clustering from scratch.

broadcasting data-analysis data-science data-source data-visualization jupyter-notebook kmeans-clustering linear-algebra machine-learning matrix-algebra numerical-computation numpy numpy-broadcasting numpy-examples numpy-tutorial open-source python scientific-computing standardization vectorization

Last synced: 08 May 2026

https://github.com/vladd12/machine-learning-practice

Репозиторий для учебной практики и различных курсов, связанных с ИИ

json keras-tensorflow machine-learning numpy pandas pickle tensorflow2 torch xml

Last synced: 05 May 2026

https://github.com/bhaskarkulshrestha/rock-vs-mine-prediction

A machine learning project to predict whether an object detected underwater is a rock or a mine using Logistic Regression.

colab-notebook logistic-regression machine-learning numpy pandas python sckit-learn supervised-learning

Last synced: 05 May 2026

https://github.com/iamwatchdogs/retail-insights-from-superstore-data

An simple Data Analysis Project for finding insight within the Super store data set

conda-environment data-analytics jupyter-notebook matplotlib numpy pandas python

Last synced: 05 May 2026

https://github.com/farial-mahmod/nuclear-energy-generation-prediction-logistic-regression

Nuclear Energy Generation Prediction Logistic Regression project is aimed at predicting the nuclear energy generation based on the production (generation) data from 1991 to 2023 by using Logistic Regression.

energy kaggle logistic-regression machine-learning matplotlib nuclear-energy nuclear-power numpy power python3 sklearn

Last synced: 05 May 2026

https://github.com/kbunwong/stock-price-prediction-using-lstm

This project is about predicting stock prices with more accuracy using LSTM algorithm. For this project we have fetched real-time data from yfinance library.

google-colab jupyter-notebook keras-tensorflow lstm-model machine-learning matplotlib numpy pandas python visualization yfinance

Last synced: 05 May 2026

https://github.com/saurabh274/aerofit-statistics-and-probability

The market research team at AeroFit wants to identify the characteristics of the target audience for each type of treadmill offered by the company, to provide a better recommendation of the treadmills to the new customers. The team decides to investigate whether there are differences across the product with respect to customer characteristics.

matplotlib-pyplot numpy pandas probability python seaborn statistics

Last synced: 08 May 2026

https://github.com/thomasa/python-intro

This is a Jupyter notebook slide show for a 2-hour crash intro to Python, NumPy, Matplotlib, and SciPy

matplotlib numpy python scipy tutorial

Last synced: 05 May 2026

https://github.com/kkkaan/tensor-library-403-hw1

A numpy like library for tensor (gergen) operations.

deep-learning numpy python tensor

Last synced: 08 May 2026

https://github.com/13anush/python-libraries-

A collection of essential Python libraries—NumPy, Pandas, Matplotlib, and Seaborn—perfect for anyone starting out in data analysis.

data-analysis matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/hitthecodelabs/petalanalyticsstreamlit

Web application developed with Streamlit that predicts the Iris flower type based on its physical features

matplotlib model numpy pickle python scikit-learn sklearn streamlit

Last synced: 05 May 2026

https://github.com/nova-land/kalmanfilter-python

Kalman Filter in PyTorch and Numpy

kalman-filter numpy python pytorch

Last synced: 08 May 2026

https://github.com/lohiyah/dataset_summariser

A Python tool for generating concise summaries of datasets using Pandas and NumPy for analysis, and Matplotlib and Seaborn for insightful visualizations. Ideal for quick exploratory data analysis (EDA).

matplotlib numpy pandas seaborn

Last synced: 05 May 2026

https://github.com/yenk/human-genomic-meta-analysis

Meta analysis of genomic studies to better understand if telomeres in the human genome are impacted by age from wild animals data.

linear-regression machine-learning matplotlib numpy scipy

Last synced: 05 May 2026

https://github.com/omogbolahan94/nobel-prize-winner-analysis

Data exploration, cleaning, visualization and presentation

markdown matplotlib numpy pandas plotly seaborn

Last synced: 09 May 2026

https://github.com/nishi1612/knight-tour-problem

IT485 Logic of Inference Project on Knight's Tour. A hamiltonian path problem to determine path of knight to travel entire chessboard with every cell visited only once.

backtracking-algorithm bootstrap flask html knights-tour localhost numpy pygame python tkinter warnsdorff

Last synced: 11 Apr 2026

https://github.com/mohitsai/boston-housing-data-analysis

Data Analysis Project for the City of Boston Government for insights into effect of property rennovations and remodelling on housing availability in the city

data-analysis data-science matplotlib numpy pandas python

Last synced: 05 May 2026

https://github.com/jeslipriya/expense-tracker

This is a smart expense tracking web app built with Flask that helps users manage and visualize their spending. It features Pandas and Numpy analysis to suggest ways to cut down on unnecessary expenses, with clean UI, graph visualizations, and secure login.

css flask-application full-stack html matplotlib numpy pandas

Last synced: 05 May 2026

https://github.com/himanshigautam-rgb/python_libraries

This repository contains operations in Matplotlib for data visualization, NumPy for numerical computations, and Pandas for data manipulation and analysis. These libraries work together to efficiently process, analyze, and visualize data.

matplotlib numpy pandas python3

Last synced: 05 May 2026

https://github.com/shaikats/python-assignment

Simulation and Modeling Lab assignments using Python, NumPy, Matplotlib, and Jupyter Notebook.

jupyter-notebook matplotlib numpy python simulation-modeling

Last synced: 09 May 2026

https://github.com/batrick-swaistan/nalaiya_thiran

Nalaiya Thiran IBM Course Training

matplotlib numpy pandas-dataframe python

Last synced: 05 May 2026

https://github.com/audy21/datacamp

Learning portfolio documenting my progress, while taking Data Analyst & Data Science certifications from DataCamp.

data-analysis data-science machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/voidwalker7/mnist

MNIST Digit Detector Model using only Numpy

from-scratch-in-python mnist numpy python

Last synced: 05 May 2026

https://github.com/pramodyasahan/cifar-10-object-recognition

This repository contains a TensorFlow-based image classification model designed to classify images from the CIFAR-10 dataset. The CIFAR-10 dataset comprises 60,000 32x32 color images in 10 classes, with 6,000 images per class.

convolutional-neural-networks numpy python tensorflow

Last synced: 05 May 2026

https://github.com/rohra-mehak/sciencesync

System for Personalized Google Scholar Alerts Processing and Data Management, and provision of ML based clustering analysis

agglomerative-clustering clustering crossref-api customtkinter google-api google-scholar graph-api machine-learning numpy pandas python3 scientific-article-analysis scikit-learn sqlite3

Last synced: 05 May 2026

https://github.com/icejan/predicton-systems

Various systems that train on data and generate a prediction

lightfm machine-learning numpy python scikit-learn

Last synced: 08 May 2026

https://github.com/muthupillai1204/diwali_sales_analysis

The Diwali sales analysis reviews past data to identify trends, peak buying times, popular products, and customer demographics. It assesses sales volume, revenue growth, and promotional effectiveness, helping businesses optimize marketing and inventory for future seasons.

data datacleaning eda excel jupyter-notebook matlplotlib numpy pandas python seaborn visualization

Last synced: 05 May 2026

https://github.com/nandinimarepalli/ai_ml_internship_projects

Projects completed during my AI/ML and Data Expert internship, including EDA, machine learning models, and dashboard development using Python, pandas, scikit-learn, and visualization libraries.

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 05 May 2026

https://github.com/dulajkavinda/fashion-classification

👗 classify fashion items using machine learning

added machinelearning matplotlib numpy pandas python scikit

Last synced: 05 May 2026

https://github.com/talapanenivarshithchowdary/asteroid-detection-ml

This project uses Machine Learning to detect and classify asteroids based on trajectory and size, aiding in Near-Earth Object detection and planetary defense.

classification data-science decision-trees jupyter-notebook knn logistic-regression machine-lea matplotlib numpy pandas pillow prediction python3 random-forest scikit-learn

Last synced: 11 Apr 2026

https://github.com/eljandoubi/genre_classification

Create an ML pipeline for Genre Classification using MLflow.

hydra machine-learning mlflow numpy pandas pandas-profiling pytest scikit-learn scipy wandb

Last synced: 11 Apr 2026

https://github.com/supernovasatsangi23/modifying-biomarker-gene-identification-for-effective-cancer-categorization

A project that focuses on implementing a hybrid approach that modifies the identification of biomarker genes for better categorization of cancer. The methodology is a fusion of MRMR filter method for feature selection, steady state genetic algorithm and a MLP classifier.

dataset deep-learning deep-neural-networks feature-selection genetic-algorithm machine-learning machine-learning-algorithms mlp-classifier mrmr neural-network numpy pandas-dataframe python python3 scikit-learn scikit-learn-python tkinter-gui tkinter-python

Last synced: 05 May 2026

https://github.com/kalwalt/pybind-test

A simple example with pybind and numpy with C++ code

cplusplus numpy pybind11 python

Last synced: 05 May 2026

https://github.com/hms75/movie_rating_analysis

A movie rating analysis which identifies trends amongst a dataset of 5000 movies.

data-analysis data-visualization matplotlib-pyplot numpy pandas python

Last synced: 05 May 2026

https://github.com/starlitdreams/lunar-landing

This project implements a DQN agent using PyTorch to solve the LunarLander-v2 environment from OpenAI Gym. The agent learns to control the lunar lander using experience replay and a target network, aiming to maximize rewards by landing smoothly. Uses CUDA for computation.

artificial-intelligence cuda deep-learning gymnasium neural-network neural-networks numpy nvidia-gpu python python3 torch

Last synced: 09 May 2026

https://github.com/vikkiezdev/ai-global-index-analysis

This project analyzes the AI readiness of 62 countries using key indicators like government strategy, commercial activity, research, development, and infrastructure. Through data cleaning, EDA, and visualization, it identifies key drivers of AI adoption and competitiveness.

cleaning-data correlation-analysis eda matplotlib numpy pandas python3 seaborn statistical-analysis

Last synced: 06 May 2026

https://github.com/vishnu-vamshii/heart-disease-prediction-using-ml

This project presents an end-to-end data analysis and machine learning pipeline for predicting heart disease using a publicly available dataset. The project includes data exploration, visualization, and implementation of various machine learning models to predict the likelihood of heart disease based on a set of clinical attributes.

machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 11 Apr 2026