An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/prachi005748/website-performance-data-analysis-project

Briefly describe the objective of the project—e.g., analyzing website performance metrics over time, uncovering trends in user engagement, or evaluating channel-wise traffic quality.

data-analyst data-cleaning data-preprocessing data-visualization data-visualization-python exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn storytelling

Last synced: 01 May 2026

https://github.com/samiksha29-patil/e-commerce-sales-insights-dashboard

This project focuses on analyzing e-commerce sales data through data visualization. It highlights customer behavior, popular sales channels, product category trends, and city-wise performance to provide actionable business insights.

analytics customer-insights data-visualization ecommerce matplotlib numpy pandas python sales-analysis seaborn

Last synced: 03 May 2026

https://github.com/princeoncada/quant-pca-risk

Applies Principal Component Analysis (PCA) to daily returns of 20 US equities (2015–2025) to uncover hidden risk factors. Explores variance explained, scree, loadings, factor returns, covariance reconstruction, and Varimax rotation. Results show 3–5 PCs capture ~75% of portfolio risk.

correlation-analysis covariance-matrix dimensionality-reduction factor-models matplotlib numpy pandas pca portfolio-risk principal-component-analysis python quantitative-finance time-series-analysis variance varimax

Last synced: 06 May 2026

https://github.com/4ngelojr/ai-ml-cheatsheets

🗂️ Access essential AI and ML concepts with quick-reference cheatsheets for effective learning and project implementation.

bigdata cheatsheet computer-science convolutional-neural-networks data-science deep-learning dl excel hadoop keras linux neural-networks numpy pig plotly probability sql statistics

Last synced: 01 May 2026

https://github.com/radithsandeepa/robot-path-visualizer

Welcome to Robot Path Visualizer! This tiny project uses Python, Pygame, and NumPy to replay a precomputed trajectory (x, y, θ). It rotates a robot sprite frame-by-frame and draws a trail, making it handy for diff-drive path playback, simulation demos, and quick visual debugging of robotics experiments.

animation numpy pygame python robot-simulation robotics trajectory-visualization

Last synced: 18 Sep 2025

https://github.com/abhishek-patidar066/synthetic-datasets

Synthetic datasets are artificially generated data used for training machine learning models, simulating real-world data while ensuring privacy.

clustering datasets jupyter-notebook libraries matplotlib-pyplot numpy pandas-dataframe python random sklearn

Last synced: 15 Apr 2026

https://github.com/spacebakery/crunchie-munchies-project

Analyze Data with Pyhton

numpy python statistics

Last synced: 27 Jul 2025

https://github.com/jain1shh/solar-flare-prediction

This repository contains code and data for predicting solar flare energy ranges using machine learning, based on NASA's RHESSI mission data. It includes preprocessing of FITS files into a unified CSV dataset and implements models like Gradient Boosting, Random Forest, and Decision Tree classifiers, achieving accuracies up to 87%.

data-visualization machine-learning numpy pandas python scikit-learn solar-flare-prediction

Last synced: 09 Apr 2026

https://github.com/nguyenanhtuan1912/datatable-image-to-text

Repository này là đồ án môn học Computer Vision

computer-vision javascript nodejs numpy opencv python tesseract

Last synced: 09 Apr 2026

https://github.com/mirzaazwad/tymbert

TYMBert is our submission for NCIM 2025, a spam classifier that makes use of knowledge distillation to compress the model while preserving accuracy

bert huggingface-transformers knowledge-distillation machine-learning matplotlib numpy pandas python3 scikit-learn tiny-bert torch

Last synced: 09 Apr 2026

https://github.com/stefagnone/text_adventure_game

A text-based adventure game project using Python fundamentals

matplotlib numpy pandas python r scikit-learn seaborn sql

Last synced: 09 Apr 2026

https://github.com/ajxxxs/spotify-music-analysis

spotify Music (web scraped playlists ) analysis (over 3 states) , trends, features and a music recommendation system.

matplotlib numpy panda scikit-learn seaborn

Last synced: 28 Jul 2025

https://github.com/arjunravi26/glucose-monitoring

Glucose Monitoring system to monitor patients and notify whenever glucose goes high.

machine-learning numpy pandas plotly python3 streamlit xgbboostregressor

Last synced: 30 Jul 2025

https://github.com/werctfourth/python-border-autocrop2

Python script that crops borders from images v2

border crop image-processing libvips numba numpy python python3

Last synced: 17 May 2026

https://github.com/sakshithbillava/expense-manager

A web-based expense tracking app built with Python and Streamlit, featuring real-time updates, data visualization, user authentication, and MongoDB integration.

authentication data-visualization expense-manager matplotlib mongodb numpy pandas personal-finance python streamlit webapp

Last synced: 09 Apr 2026

https://github.com/dmarks84/ind_project_european-soccer-top-points-contributors--kaggle

Independent Project - Kaggle Dataset-- I worked on the European Soccer Dataset, using SQL (SQLite) to read in the data and then data wrangling before running statistical analysis and hypothesis testing on questions of who helps earn the most points for their team.

data-wrangling hypothesis-testing numpy p-values pandas python scipy-stats statistics t-test

Last synced: 09 Apr 2026

https://github.com/blinksta1ker/co2-prediction

EDA and Supervised ML model using Regression to predict Co2 Emissions from Vehicles

jupyter-notebook machine-learning numpy pandas python regression seaborn sklearn supervised-learning

Last synced: 09 Apr 2026

https://github.com/klima7/numpynet

Convolutional Neural Network written from scratch using numpy with API similar to tensorflow.

cnn convolution network nn numpy scratch tensorflow

Last synced: 24 Sep 2025

https://github.com/ashar97/advanced-machine-learning

Advanced Machine Learning MOOC Specialisation Assignments

keras numpy python3 tensorflow

Last synced: 09 Apr 2026

https://github.com/estaheri7/dotawinpredictor

This project aims to predict the outcome of Dota 2 matches based on the heroes picked by both the Radiant and Dire teams

data-science dota2 machine-learning numpy pandas python random-forest sklearn support-vector-machines

Last synced: 09 Mar 2026

https://github.com/abhisek-13/multi-commodity-price-predictor

The Multi-Commodity Price Predictor is a machine learning project that predicts the prices of cars, bikes, and laptops. Using advanced techniques, it provides accurate price predictions based on user inputs. The project features an easy-to-use interface built with Streamlit, enabling users to get price predictions quickly and easily.

machine-learning machine-learning-algorithms matplotlib numpy pandas python seaborn sklearn sklearn-library streamlit

Last synced: 09 Apr 2026

https://github.com/ashishsingh789/bcg_virtual_internship

This repository showcases my BCG X virtual internship project on customer churn analysis for PowerCo, covering business understanding, EDA, feature engineering, and modeling using Python and machine learning.

data-manipulation data-science dataanalysis datavisualization eda machine-learning matplotlib numpy pandas python random-forest scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/nischalkshaj/image-identification

This is a repository for AI image training for beginners.

express mongodb node numpy pillow python3 pytorch reactjs

Last synced: 09 Apr 2026

https://github.com/jim-by/scrape_analysis_books

Scraping book data from the website books.toscrape.com and performing analysis on the collected data.

beautifulsoup json matplotlib numpy pandas python scipy scraping seaborn

Last synced: 09 Apr 2026

https://github.com/danish-jamil-01/image-dehazer

Remove the haze from the image using dark channel prior and guided filter

dark-channel-prior guided-filter guided-filtering ndimage numpy open-cv opencv opencv-python pillow python scipy skimage tkinter

Last synced: 09 Apr 2026

https://github.com/lyubov0406/data_analyst_portfolio

В репозитории собраны пет-проекты, демонстрирующие мои навыки в аналитике данных

data-analysis matplotlib numpy pandas portfolio python scipy seaborn sql tableau visualization

Last synced: 09 Apr 2026

https://github.com/sudip-13/plant-disease-detection

Plant disease detection from leaf uses machine learning to identify diseases for early intervention

cloudinary css docker jwt-token keras matplotlib-pyplot mongodb nodejs numpy reactjs tensorflow tensorflow-serving

Last synced: 09 Apr 2026

https://github.com/RedInfinityPro/CircuitPythonProjects

Rating: (4/10) This Python script simulates circuits using PySpice, importing libraries like NumPy and Matplotlib. It defines voltage divider, diode circuits, subcircuit definitions, DC sweep analysis, and simulates each circuit using the simulator.

matplotlib numpy os pyspice sys

Last synced: 30 Sep 2025

https://github.com/subhas-pramanik-09/mediscan-ai

A smart and scalable ML-powered health prediction system that can help detect the risk of three major diseases: Diabetes + Heart Disease + Parkinsons Disease

jupyter-notebook logistic-regression machine-learning numpy pandas scikit-learn streamlit svm-classifier

Last synced: 09 Apr 2026

https://github.com/samir-atra/cancerdetector

The final project of AI50xIraq course is found in the commit on 21-3-2023

cancer-detection cnn dataset inceptionv3 matplotlib numpy opencv tensorboard tensorflow tensorflow-tutorials transfer-learning

Last synced: 11 Feb 2026

https://github.com/harmanveer-2546/diagnosis-of-pneumonia-by-cnn-classifier

The primary objective s to develop an accurate and efficient classification model capable of identifying pneumonia cases in patients based on chest X-ray images. Pneumonia is a prevalent and potentially life-threatening respiratory infection. Early detection plays a critical role in timely intervention and effective treatment.

binary-accuracy dropout flatten keras loadimg matplotlib maxpooling2d numpy python regularizers roc-curve seaborn sequential-models tensorflow vgg16

Last synced: 09 Apr 2026

https://github.com/v41bh4vr4jput/data-analysis-with-python

This repository is a comprehensive collection of data analysis projects and tutorials using Python's most powerful libraries: NumPy, Pandas, Seaborn, and Matplotlib. It is designed to help you explore, clean, visualize, and analyze data efficiently.

api data data-analysis data-visualization matplotlib numpy pandas python sakila-db seaborn

Last synced: 09 Apr 2026

https://github.com/BiocPy/mopsy

Matrix operations

matrix numpy scipy

Last synced: 03 Oct 2025

https://github.com/lmizner/codecademy_life_expectancy

Calculate quartiles, quantiles, and the inter-quartile range (IQR) for a variable

histogram jupyter-notebook matplotlib-pyplot numpy pandas python quantiles quartiles

Last synced: 09 Apr 2026

https://github.com/raghulrajn/machine-learning-d-r-y

This repository contains quick python scripts that are repeatedly used in EDA on dataset

data-science numpy pandas python

Last synced: 09 Apr 2026

https://github.com/pem-humboldt/wiutils

Exploration utilities for Wildlife Insights projects.

biodiversity camera-trap images matplotlib numpy pandas python seaborn trail-cameras wildlife-insights

Last synced: 09 Aug 2025

https://github.com/srikarveluvali/dataanalysis

The "Dataset - Extraction, Analysis, and Visualization" project is a Python-based data analysis venture that focuses on exploring and interpreting the "Video Game Sales Analysis" dataset.

css data-analysis html javascript matplotlib numpy pandas python seaborn tableau

Last synced: 09 Apr 2026

https://github.com/sk-g/mnist_beginners

Model search in traditional machine learning algorithms (non DL) and DL starter codes on MNIST dataset. This is a good starter code for beginners trying to learn about curse of dimensionality, overfitting and other concepts in general

keras machine-learning machine-learning-algorithms mnist mnist-beginners mnist-classification mnist-dataset numpy overfitting python pytorch pytorch-implmention resnet resnet-50 scikit-learn scikitlearn-machine-learning sklearn tensorflow

Last synced: 09 Apr 2026

https://github.com/prath0m/python-numpy

Python basic Numpy Program

machine-learning numpy python python3

Last synced: 09 Apr 2026

https://github.com/muhkartal/xai_dashboard

an interactive AI dashboard for machine learning model analysis and explainability, supports model training, dataset exploration, feature importance analysis, and SHAP-based explanations for both individual predictions and overall model behavior, compare multiple models, visualize insights, and export results seamlessly

joblib numpy pandas python sckit-learn shap streamlit xgboost

Last synced: 04 May 2026

https://github.com/aathithya-shanmuga-sundaram/cyber-threat-intelligence-dashboard

Interactive Streamlit dashboard for visualizing and analyzing cyber threats, featuring real-time data insights, severity classification, geolocation mapping, and customizable dark-cyber UI.

cyber-threat-intelligence cyber-threat-tool cybersecurity cybersecurity-tools data-visualization dataset numpy pandas plotly python streamlit threat-intelligence

Last synced: 10 Nov 2025

https://github.com/baranwaljii/data-science-projects

This Repository contains projects related to the domain of data science based on python and jupyter notebook

data-science jupyter-notebook machine-learning numpy pandas python

Last synced: 09 Apr 2026

https://github.com/ashraf-khabar/bank-marketing-data-analysis

This project is focused on analyzing bank marketing data using PyTorch, pandas, numpy, and scikit-learn. The goal is to build a predictive model that can help identify potential customers who are more likely to subscribe to a bank's term deposit.

data-cleaning data-science data-visualization dataset deep-learning deep-neural-networks feedforward-neural-network learning neural-networks numpy pandas python pytorch sklearn

Last synced: 09 Apr 2026

https://github.com/chandkund/loan-eligibility-prediction

This project is designed to predict the eligibility of loan applicants based on various factors such as income, credit history, and marital status. By analyzing historical loan application data, the model helps to determine whether a loan application should be approved or not.

data-analysis data-science data-visualization machine-learning-algorithms matplotlib numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/sukanyadutta52/topic_modeling

What are the most pressing concerns regarding ‘Climate Change’ among tweeters according to Topic Modeling?

climate-change gensim matplotlib nltk numpy pandas pyldavis regular-expression spacy

Last synced: 09 Apr 2026

https://github.com/vuanhtuan1012/data-scientist-with-python

Notes on career track "Data Scientist with Python" at DataCamp

importing-data matlab numpy pandas python3 sqlalchemy

Last synced: 09 Apr 2026

https://github.com/ankitsharma-tech/classification-of-arrhythmia-using-ecg-data

A machine learning project to detect and classify arrhythmias from ECG signals using Python, scikit-learn, and TensorFlow. Includes data preprocessing, model training, and evaluation.

arrhythmia biomedical-signal-processing cardiology classification deep-learning ecg ecg-classification healthcare machine-learning mit-bih-dataset numpy python scikit-learn scipy signal-processing tensorflow time-series-analysis

Last synced: 07 Apr 2026

https://github.com/oguzhansarigol/Expected-Goals-xG-Data-Analysis

We will analyze in our code the expected goal locations of football players based on which areas of the field, at what times, with which feet, using which parts of their bodies, and from which angles and distances they are most likely to score.

data-analysis-python expected-goals numpy pandas-python python

Last synced: 20 Aug 2025

https://github.com/lorenzorottigni/ml-houses

Machine Learning python bootcamp: linear regression on houses model

ipynb linear-regression machine-learning numpy pandas python scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/hariprasath-v/hackerearth_transunion-data-science-analytics-hiring-challenge_2022

Machine learning model to classify the credit score based on people bank details and credit related information.

catboost exploratory-data-analysis klib machine-learning matplotlib numpy optuna pandas python seaborn sklearn

Last synced: 10 Apr 2026

https://github.com/mpbeatriz/santander_coders_2024

Repositório dos exercícios e projetos do curso de Engenharia de Dados da Ada em parceria com o Santander (em andamento).

apache-kafka apache-spark api data-engineering data-extraction fastapi flask numpy oop pandas python webhooks

Last synced: 10 Apr 2026

https://github.com/rotantos/ml-projects

Includes various ML projects created for educational purposes.

classification-model jupyter-notebook machine-learning numpy pandas python sklearn-library

Last synced: 10 Apr 2026

https://github.com/hemanthsaikumarbethapudi/last-mile_delivery_optimization.py-

This project implements an AI-powered last-mile delivery optimization system using Python. It leverages graph-based route optimization, demand prediction, and cost analysis to improve delivery efficiency in urban areas.

heapq networkx numpy python

Last synced: 27 Aug 2025

https://github.com/yessasvini23/pytorch-bootcamp-on-ai-by-opencv

This OpenCV course is highly recommended for individuals seeking an introductory experience in Computer Vision and AI, and who wish to establish a strong foundation for their career development.

cnn-classification computer-vision cv2-library deeplearning-ai numpy opencv pytorch rnn-tensorflow

Last synced: 13 Apr 2026

https://github.com/thatguychandan/adoptimization

This project implements an ad optimization system using a hybrid approach combining Thompson Sampling and Upper Confidence Bound (UCB) algorithms. The system learns to select the most effective ads based on user context and historical performance.

numpy pandas plotly python pytorch reinforcement-learning scikit-learn streamlit thompson-sampling upper-confidence-bound

Last synced: 10 Apr 2026

https://github.com/hasnainroopawalla/sudoku-generator

This algorithm can generate and solve a random Sudoku Puzzle Board using Backtracking

backtracking numpy python sudoku-generator

Last synced: 16 May 2026

https://github.com/tralahm/datascience

Getting started with Data science Numerical Analysis and Scientific Computing

hmmlearn ipynb jupyterlab keras numpy pandas python3 scipy tensorflow

Last synced: 10 Apr 2026

https://github.com/saro0307/unemployment-analysis

This data science project delves into unemployment trends, using data analysis and machine learning to identify key factors and predict future joblessness rates, aiding policymakers and businesses in informed decision-making.

data-science datavisualization machine-learning matplotlib numpy pandas python seaborn skit-learn

Last synced: 10 Apr 2026

https://github.com/gaball1/alzheimer-prediction-ai

Alzheimer Prediction is a machine learning-based project designed to predict the likelihood of Alzheimer’s disease using clinical and imaging data. The project features an interactive web application built with Streamlit, enabling users to input key data, view predictions, and explore visualizations .

ai data-mining gradient-boosting jupyter-notebook logistic-regression logistic-regression-algorithm machine-learning matplotlib-python models numpy pandas python random-forest-classifier streamlit xgboost-classifier

Last synced: 11 Apr 2026

https://github.com/bharath-tars/deeptumor

DeepTumor is a project aimed at developing a deep learning model with over 1 million parameters using TensorFlow. with automatic reporting feature

keras machine-learning numpy pandas pyhton tensorflow webdevelopment

Last synced: 10 Apr 2026

https://github.com/amiridk/machinelearning

machine-learning-notes

matplotlib-pyplot numpy pandas pyhton

Last synced: 04 May 2026

https://github.com/amruta33/housing_price_prediction

Consider a real estate company that has a dataset containing the prices of properties in the Delhi region. It wishes to use the data to optimise the sale prices of the properties based on important factors such as area, bedrooms, parking, etc.

linear-regression numpy pandas python rfe vif

Last synced: 28 Apr 2026

https://github.com/shivammg/coursera-ml-numpy

Coursera Machine Learning course done using Python, NumPy, and Seaborn

machine-learning numpy python seaborn

Last synced: 09 May 2026

https://github.com/anshpg/linearluminary

Greetings! I've developed a straightforward linear regression model from scratch to predict house prices in Bangalore. But before delving into coding, let me walk you through the algorithm's conceptualization. I considered various factors such as location, ocean proximity, plot size, finished state, and flat type.

algo linea mathematics matplotlib numpy pandas pyth

Last synced: 13 May 2026

https://github.com/sahil210695/gradient-descent

A simplified explanation of gradient descent for linear regression in python using numpy

gradient-descent gradient-descent-algorithm linear-regression matplotlib mini-batch-gradient-descent numpy python stochastic-gradient-descent

Last synced: 03 May 2026

https://github.com/anubhavkumar31/simple-heart_disease_prediction-using-logisticregression

Its a simple yet good model which predicts if a person have heart disease or not. This is a binary classification model i.e its output is either 0(dont have heart disease) or 1 (have heart disease).

logistic-regression machine-learning numpy python sklearn sklearn-linear-model sklearn-metrics

Last synced: 10 Apr 2026

https://github.com/babagata/racunalna_fizika

Math and physics solved with python

matplotlib numpy random scipy sympy

Last synced: 10 Apr 2026

https://github.com/azaz9026/data_cleaning

Welcome to the Data Cleaning repository! This collection is dedicated to showcasing techniques and methods for cleaning and preparing datasets for analysis.

data-analysis data-engineering data-structures data-visualization eda feature-engineering machine-learning numpy outliers pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/soumyapro/wine-quality-prediction

This project is about the prediction of wine quality using machine learning algorithms

boxplot matplotlib numpy pandas random-forest smote

Last synced: 10 Apr 2026