An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/yadavkaushal/datascience-e-commerce-shopping-details

This project analyzes customer purchase data, including details such as location, company, credit card usage, browser info, job roles, and purchase price. It explores patterns in payment methods, spending behavior, and online transactions. Using Pandas, Matplotlib, and Seaborn, we clean, analyze, and visualize key trends to derive actionable insights.

data datacleaning dataframe datapreprocessing dataset libraries matplotlib numpy pandas plots visulaization

Last synced: 06 May 2026

https://github.com/akku-1206/spotify_data_analysis

Spotify Tracks Analysis. Overview: This project performs an exploratory data analysis (EDA) on Spotify music data to uncover insights and correlations related to song features, popularity, and genre.

matplotlib numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/sducournau/ign_lidar_hd_dataset

🏗️ Comprehensive Python library for processing IGN LiDAR HD data into machine learning-ready datasets for Building Level of Detail (LOD) classification. Features GPU/CPU processing, smart data management, and complete ML pipeline integration.

building-classification data-processing dataset france geospatial gis ign lidar lidar-hd numpy point-cloud scikit-learn

Last synced: 20 Jan 2026

https://github.com/venky-1710/superhero-recruitment

Superhero Recruitment System predicts hero selection using machine learning. Users input hero attributes through a web interface. A Random Forest model analyzes abilities, strengths, weaknesses, success rates, and missions completed. The Flask app displays results, showing if a hero is selected.

css flask html numpy pandas python scikit-learn

Last synced: 07 Apr 2026

https://github.com/christabelsakyi/tsec

This project implements a time series forecasting solution for predicting electricity consumption based on historical data. It provides multiple model options, a Flask API for deployment, and visualization tools.

facebook numpy pandas python sklearn time-series-analysis

Last synced: 07 Oct 2025

https://github.com/christabelsakyi/product_purchase_prediction

As a newly hired AI Engineer, your task is to predict customer behavior based on various features such as age, income, and gender. This exercise involves cleaning the data, training a decision tree model, and evaluating the model's performance to understand the key factors influencing customer purchasing decisions.

machine-learning matplotlib-pyplot numpy python seaborn sklearn

Last synced: 08 May 2026

https://github.com/umer-farooq-cs/california-housing-regression

Linear Regression experiments on the California Housing dataset across five phases, using NumPy and scikit-learn only (no pandas). Includes EDA, polynomial features, SGD with scaling, residuals, 5-fold CV, and an LNCS-style report with figures.

california-housing cross-validation deep-learning-for-perception linear-regression numpy polynomial-features reproducible-research residual-analysis sgd sklearn

Last synced: 09 May 2026

https://github.com/raveenaross/bulldozer-price-prediction

Predicting the sale price of bulldozers using machine learning

matplotlib numpy pandas sklearn time

Last synced: 07 Oct 2025

https://github.com/prarthana-singh/bangalore-house-price-predictor

🏡 Bangalore House Price Prediction – A Machine Learning model to predict house prices in Bangalore using real estate data. Built with Linear Regression, Python, Pandas, NumPy, and Scikit-Learn.

data-analysis eda house-price-prediction linear-regression machine-learning numpy pandas python real-estate regression scikit-learn

Last synced: 19 Apr 2026

https://github.com/zainulmustafa/home-neural-net-gui-using-flask

I am working on a project I call HNN, whose GUI is built with the Flask framework so that multiple users can control components over the same connection.

flask flask-api flask-server html5 numpy python3

Last synced: 05 May 2026

https://github.com/axelsgit/fashion-recommender

This project builds a hybrid and visual recommender system for (luxury) fashion products, simulating a real-world e-commerce scenario (e.g., Zalando, Farfetch, Fashion Nova). It integrates visual filtering, collaborative filtering, content-based filtering, and hybrid models, enriched with user metadata and interaction recency.

deepfashion fashion fastapi numpy pandas python resnet-50 sklearn streamlit tensorflow ui-design

Last synced: 07 Apr 2026

https://github.com/NetBr3ak/Planetary-System-Simulation

Planetary System Simulation enables users to simulate the trajectories of two planets in a gravitational system. Through a simple web interface, users can predict planetary movements using technologies such as Flask, NumPy, and SciPy.

astronomy flask gravitational-dynamics numerical-analysis numpy physics planetary-systems scipy simulation web-application

Last synced: 08 Oct 2025

https://github.com/chernyakid/russian-film-distribution

A study of Russian film distribution

jupyter numpy pandas python seaborn

Last synced: 15 May 2026

https://github.com/fahrettinsolak/ai-map-based-geographic-clustering-project

This project focuses on clustering crime incidents in San Francisco using the K-Means algorithm. The dataset is obtained from Kaggle and contains information about crime types, geographical coordinates, and other relevant features. The goal is to identify crime hotspots through geographic clustering and visualize the clusters on an interactive map.

artificial-intelligence deep-learning elbow-method jupyter-notebook machine-learning numpy openstreetmap pandas phyton plotly scikit-learn standardscaler

Last synced: 05 Apr 2026

https://github.com/alejoduarte23/fast_fdd

Fast implementation of frequency domain decomposition (FDD) in Python with multiple identification techniques

numpy scipy-signal

Last synced: 10 May 2026

https://github.com/ahmrina/unetbraintumorseg

UNetBrainTumorSeg is a 3D Slicer extension named "seg_module" that integrates a 3D UNet trained on the BraTS2020 dataset to segment brain tumor regions from MRI volumes

3d-slicer-extension monai numpy python pytorch slicer unet vtk

Last synced: 07 May 2026

https://github.com/hiftd22/wpattern

📈 Analyze and visualize stock patterns with wPattern for better trading insights. Simplify your investment decisions through data-driven analysis.

cli finance financial-analysis financial-data matplotlib numpy pandas pattern-recognition python scikit-learn stock-scanner technical-analysis yfinance

Last synced: 29 Apr 2026

https://github.com/alex-s2222/mem_chat_bot

Telegram bot for creating GIFs from user-entered text

cv2 google-api moviepy numpy python python-telegram-bot requests

Last synced: 08 Oct 2025

https://github.com/muzahid0408/student_marks_analysis_using_numpy

This project analyzes a student marks dataset to calculate key statistical measures such as mean, median, standard deviation, and more.

datasets jupyter-notebook numpy numpy-arrays pythoh

Last synced: 04 May 2026

https://github.com/davityak03/object-detection-in-images-using-yolo

This project demonstrates object detection using the YOLO (You Only Look Once) model, providing a solution for detecting and labeling multiple objects within images or video streams. This implementation covers image preprocessing, model inference, and visualization of detected objects.

computer-vision matplotlib-pyplot numpy object-detection opencv-python pandas python yolo

Last synced: 18 Jan 2026

https://github.com/harshavardh8n/machine-learning-projects

Machine Learning Projects: A collection of machine learning projects developed as part of the curriculum, utilizing libraries such as NumPy, Pandas, Seaborn, and Matplotlib. These projects demonstrate various algorithms, including Apriori and Decision Trees, showcasing practical applications and data analysis techniques.

apriori-algorithm decision-trees jupyter-notebook machine-learning numpy pandas python

Last synced: 18 Jan 2026

https://github.com/philcs/electromag

Electromagnetism, Université Laval, 2017

matplotlib numpy python

Last synced: 18 May 2026

https://github.com/rachakondaganesh/superstore-sales-data-analysis-project

Analyzed retail sales data to uncover key business insights using Python and Power BI. Explored patterns in profit, sales, and customer segments across regions. Built interactive dashboards to visualize trends, identify top-performing categories, and highlight areas for improvement in shipping and discount strategies.

dashboard exploratory-data-analysis matplotlib-pyplot numpy pandas powerbi seaborn

Last synced: 02 May 2026

https://github.com/hmotez/medclaimml

An AI-powered machine learning application designed to process healthcare reimbursement claims. It analyzes medical documents, client information, insurance policies, and legal regulations to predict accurate reimbursement amounts, ensuring efficiency, compliance, and fraud detection.

healthcare jest-tests mern-stack mongodb nodejs nosql numpy pytorch react scikit-learn tailwindcss

Last synced: 12 Apr 2026

https://github.com/samuelpuerta/modelosiasaberprocolombia

KAGGLE PROJECT - Build a model to predict student performance on the Saber Pro exams

ai ia jupyter-notebook machine-learning numpy pandas prediction-model predictive-modeling python

Last synced: 06 May 2026

https://github.com/paul-bokelman/nn

Basic neural network in Python

machine-learning neural-networks numpy python

Last synced: 09 May 2026

https://github.com/gherrada22/spanish-sign-translator

This project uses Python and the OpenCV and MediaPipe libraries to detect and translate signs in real time.

language matplotlib mediapipe numpy opencv pycharm-ide python python3 spanish-language toml

Last synced: 18 Jan 2026

https://github.com/faisal-khann/ipl-analysis

The IPL Analysis project is a comprehensive data-driven exploration of the Indian Premier League (IPL), analyzing historical match data to uncover patterns in team performance, player statistics, and match outcomes.

data-analysis exploratory-data-analysis jupyter-notebook matplotlib numpy pandas seaborn

Last synced: 08 May 2026

https://github.com/netbr3ak/planetary-system-simulation

Planetary System Simulation enables users to simulate the trajectories of two planets in a gravitational system. Through a simple web interface, users can predict planetary movements using technologies such as Flask, NumPy, and SciPy.

astronomy flask gravitational-dynamics numerical-analysis numpy physics planetary-systems scipy simulation web-application

Last synced: 14 Apr 2026

https://github.com/andreazoccatelli/light_permanova

A lightweight implementation of PERMANOVA based on Euclidean distance from centroid

computervision numpy permanova pytorch statistics tabular-data

Last synced: 08 May 2026

https://github.com/bhavnanahar/coral_project

This project analyzes and predicts the health of coral reefs in the Florida Keys by examining coral cover and species richness trends over time.

colab-notebook matplotlib numpy pandas plotly python random-forest scikit-learn seabo

Last synced: 14 Apr 2026

https://github.com/winterwind/mlp_and_cnn

A short two-part project that applies an MLP and a CNN to ECG signal data

cnn convolutional-neural-network csv-files jupyter jupyter-notebook mlp neur numpy python pytorch

Last synced: 14 Apr 2026

https://github.com/miozilla/treebasedmodel

treebasedmodel 🌲✈️💃: Invistico Airline | Tree-based Modeling, Machine Learning (ML), Predictive, Decision Making, GridSearchCV, sklearn

accuracy decisiontreeclassifier exemplar experience f1-score flight hyperparameter logistic-regression matplotlib model numpy pandas plot-tree precision pyplot recall seaborn selection train-test-split tuning

Last synced: 03 Jun 2026

https://github.com/03-bunny-06/japan-heart-attack-analysis

This project analyzes heart attack data in Japan, focusing on gender-based differences in risk factors such as smoking, diabetes, alcohol consumption, and regional variations. The dataset is cleaned and processed using Pandas and NumPy, extracting insights on affected individuals based on lifestyle and health conditions.

data-cleaning dataframes numpy pandas python risk-analysis

Last synced: 20 Apr 2026

https://github.com/arsh-pixel-cmd/ai-image-classifier

AI Image Classifier is a web app that uses AI and deep learning to classify images in real-time. Built with Streamlit, TensorFlow (MobileNetV2), OpenCV, and NumPy, it allows users to upload single or multiple images and get the top-3 predictions with confidence scores instantly in the browser.

docker numpy opencv python3 streamlit tensorflow2 uv

Last synced: 10 Apr 2026

https://github.com/bagusperdanay7/absa-with-bilstm-undergraduate-thesis

My undergraduate thesis program, "Aspect-Based Sentiment Analysis Towards Market Place Application Review Using Bidirectional Long Short-Term Memory", built with Python, Keras, and TensorFlow

ai aspect-based-sentiment-analysis bilstm deep-learning gensim imbalanced-learning ipython-notebook keras machine-learning matplotlib natural-language-processing nltk numpy pandas python scikit-learn seaborn tensorflow

Last synced: 11 Apr 2026

https://github.com/naveen88112/vehicle-routing-problem

Vehicle Routing Problem (VRP) Solver. This project applies a Genetic Algorithm (GA) to the Vehicle Routing Problem (VRP), optimizing delivery routes for more than one vehicle to reduce travel distance and increase efficiency.

deap genetic-algorithm matplotlib numpy python vehicle-routing-problem

Last synced: 13 Apr 2026

https://github.com/dmarks84/coursework_project_image-text-recognition

Project for University of Michigan Python Programming Specialization -- Read in documents with images and text, and utilized CV libraries/packages to extract specific types of images and text, pairing them together

classification computer-vision image-classification numpy pandas programming python text-classification

Last synced: 14 Apr 2026

https://github.com/ramyacp14/sentimentanalysis

Implements a sentiment analysis model to determine the emotional tone behind text, helping understand attitudes, opinions, and emotions in online mentions.

machine-learning natural-language-processing nltk numpy pandas python scikit-learn

Last synced: 07 Apr 2026

https://github.com/vishalgaud17/stroke

A simple Streamlit web app that predicts stroke risk based on user input features like age, BMI, glucose level, and lifestyle factors, using a pre-trained machine learning model.

machine-learning numpy pandas python scikit-learn streamlit

Last synced: 14 Apr 2026

https://github.com/tuokri/rs2simlib

Python library for Rising Storm 2: Vietnam physics objects simulation and automated data collection.

numba numpy physics rising-storm-2-vietnam simulation unreal-engine-3

Last synced: 23 Feb 2026

https://github.com/audrbar/py_site

Website for displaying Data Science projects, built with Streamlit

matplotlib numpy pandas python3 requests streamlit

Last synced: 11 Apr 2026

https://github.com/ianhaggerty/final-capstone

This represents the final capstone project in my HyperionDev Data Science (fundamentals) course. A dataset of Amazon customer reviews is analysed using natural language processing.

amazon data-analytics data-science data-visualization dataset matplotlib nlp nlp-machine-learning numpy pandas plotly reviews seaborn spacy tabulate textblob wordcloud

Last synced: 19 Jan 2026

https://github.com/rakesh3596/apple-phone-sales-analysis

Apple Phone Sales Analysis using Python tools such as Pandas, Matplotlib, NumPy, and Plotly, based on an Apple CSV file

csv-files csv-reader matplotlib numpy pandas plotly python

Last synced: 14 Apr 2026

https://github.com/jcardonamde/food-sales-predictions

A project that seeks to predict the sales of a food distributor by applying machine learning algorithms and data analysis.

data-science machine-learning machine-learning-algorithms numpy pandas pipelines python seaborn sklearn

Last synced: 05 Apr 2026

https://github.com/thekartikeyamishra/colorextractor

A Python-based project that extracts the dominant colors from an image using the KMeans clustering algorithm. This project is useful for applications like image processing, color palette generation, or analyzing the most prominent colors in images.

kmeans-clustering matplotlib matplotlib-pyplot numpy p pillow python sklearn

Last synced: 05 May 2026

https://github.com/rscr1/summer_practice_mai_2

Solutions to all summer practice assignments

matplotlib numpy pandas python scipy sympy

Last synced: 14 Apr 2026

https://github.com/gopikrsmscs/mycinepick-recommendation-system

MyCinePick is a user-specific movie recommendation system built using collaborative filtering techniques. This project aims to provide personalized movie suggestions tailored to each user's unique viewing preferences and habits.

collaborative-filtering movie-recommendation-app numpy python3 recommendation-system sklearn streamlit

Last synced: 14 Apr 2026

https://github.com/sabdikay/analysis-of-biodiversity

This project analyzes biodiversity data from the National Parks Service, focusing on species in various park locations. Conducted in Jupyter Notebook, it uses pandas, matplotlib, NumPy, seaborn, and chi2_contingency for analysis and visualization.

data-analysis data-analysis-python data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 14 Apr 2026

https://github.com/tnel7/data-science-yandex-practicum

Projects from the Yandex Practicum "Data Science Specialist" course

data-science machine-learning nlp-machine-learning numpy pandas seaborn sklearn

Last synced: 19 Jan 2026

https://github.com/paulokarabyna/ml-foundations-day1

🔧 Set up a Python environment and practice linear algebra with NumPy while plotting vectors and reflections in this Day 1 ML foundations starter repository.

ai data-science education jupyter-notebook linear-algebra machine-learning matplotlib numpy terminal-mac tutorial visualization

Last synced: 18 May 2026

https://github.com/syedzaheerabbas/risk-analytics-with-python

This project focuses on developing a basic understanding of risk analytics in banking and financial services, and on understanding how data is used to minimize the risk of losing money when lending to customers.

eda hypothesis-testing numpy pandas python risk-analysis seaborn

Last synced: 17 Apr 2026

https://github.com/adi3042/diamond_price_prediction

🔮💎 Explore the Diamond Price Oracle! Predicting diamond values based on your preferences and market trends. Your ultimate tool for informed decisions in the diamond world. Join us for the future of pricing! 💎✨🚀 #DiamondPredictor

anaconda datetime diamond flask functools html ipykernel jupyter-notebook matplotlib numpy pandas price-prediction readme regressor sckiit-learn setuptools venv

Last synced: 14 Apr 2026

https://github.com/manikantasanjay/emotion_prediction_kaggle_cnn

Real-time Facial Emotion Recognition System with a CNN model architecture, making use of Transfer Learning, Data Augmentation, and Regularization Techniques.

cnn convolutional-neural-networks dataset deep-learning jupyter-notebook matplotlib numpy pandas python3 pytorch transfer-learning

Last synced: 11 Apr 2026

https://github.com/thoughtscript/python_api_2023

Some fun with Flask, Python, and Machine Learning

angular docker docker-compose flask machine-learning mysql numpy

Last synced: 04 Apr 2026

https://github.com/simon2k/stock-price-prediction-evaluation

This project is intended to present a small evaluation of different types of regression models for predicting stock prices for AAPL.

evaluation machine-learning numpy pandas predicting-stock-prices scikit-learn

Last synced: 07 Apr 2026

https://github.com/hoyirul/flask_test

A Python project using Flask, set up with the MVC concept, that can connect to a database

flask mysql numpy python3 restful-api

Last synced: 07 Apr 2026

https://github.com/shashwat9kumar/us-accidents-data-analysis

Analysis of the US accidents using the US-Accidents dataset (4.2 million entries) from Kaggle

accidents accidents-analysis data-analysis data-analytics data-visualisation data-visualization matplotlib numpy pandas python

Last synced: 17 Apr 2026

https://github.com/pedrolustosa/energyforecasting

This project offers a complete solution for solar energy generation analysis and forecasting, using different machine learning models and a modern, intuitive web interface.

artificial-intelligence axios bootstrap5 chartjs css data-science fastapi fontawesome html5 numpy pandas python r react react-bootstrap rpy2 sklearn spinner toastr typescript

Last synced: 04 Apr 2026

https://github.com/badranalyst/time-series-analysis-of-global-trends-in-diet-gym-and-finance

This project analyzes global trends in diet, gym, and finance over time using time series data. The analysis is performed using Python libraries like Pandas, Matplotlib, and Seaborn to visualize trends and identify patterns in these sectors across various countries.

data-analysis dataset matplotlib-pyplot numpy pandas python seaborn time-series

Last synced: 14 Apr 2026

https://github.com/psgebeline/b2_plotter

A simple Python package for use by Belle II analysts to make various plots out of pandas dataframes and numpy arrays.

jupyter-notebook matplotlib matplotlib-pyplot numpy pandas pandas-dataframe python uproot visualization-tools

Last synced: 19 Jan 2026

https://github.com/venkat-a/python-eda-exploratory-data-analysis-

This project focuses on conducting exploratory data analysis (EDA) using Python. Utilizing powerful libraries such as Pandas for data manipulation and Plotly for interactive visualization, the project aims to uncover insights from datasets through comprehensive analysis and visualization techniques.

descriptive-statistics matplotlib numpy pandas seaborn statistical-analysis visualization

Last synced: 07 May 2026

https://github.com/arnoldchrisoduor1/potato_disease

Trained a CNN to detect the presence and current stage of blight disease in potatoes, with a web app built in FastAPI and React

fastapi keras numpy react serving-tensors tensorflow

Last synced: 04 Apr 2026

https://github.com/kanika300393/loan_prediction

This project implements a Loan Prediction system using Support Vector Machine (SVM). It includes data preprocessing, visualization of features like income and education, and model evaluation. The goal is to predict loan approval based on the dataset. Clone the repo to explore the code and improve the model.

data-science machine-learning numpy pandas python scikit-learn svm-classifier

Last synced: 09 Apr 2026

https://github.com/akshay-kamath/anomaly_detection

This project addresses the challenge of identifying anomalies in sales data, aiming to optimize sales strategies by uncovering deviations from expected profit patterns.

anomaly anomaly-detection cblof flask isolation-forest knn matplotlib numpy pandas pyod python sklearn

Last synced: 19 Jan 2026

https://github.com/adijo/multilayeredperceptron

An implementation of a multi-layered perceptron.

deep-learning machine-learning numpy

Last synced: 01 May 2026

https://github.com/mayankmittal29/duplifinder-quora-clone-catcher

An advanced system for detecting semantically duplicate question pairs using cutting-edge NLP techniques. Combines traditional ML models (XGBoost, SVM, Random Forest) with deep learning architectures (BiLSTM, Siamese Networks, Transformers) and contextual embeddings (BERT, RoBERTa). Features engineered using token similarity, fuzzy matching, and em

bert bilstm cross-validation eda fastext fuzzy-matching glove numpy pandas python3 quora-question-pairs random-forest roberta seaborn stemming svm tf-idf transformers word2vec xgboost

Last synced: 15 Apr 2026

https://github.com/alisonmitchell/titanic

Exploration of a subset of the Titanic passenger manifest to create a predictive classification model to determine which passengers were more likely to survive.

deep-learning keras machine-learning matplotlib numpy pandas python scikit-learn scipy seaborn tensorflow

Last synced: 14 Apr 2026

https://github.com/saylie/cat-dog-classifier-

The Cat-Dog Classifier is a deep learning-based image classification project that can differentiate between images of cats and dogs. It uses a Convolutional Neural Network (CNN) model trained on labeled datasets to predict whether a given image belongs to a cat or a dog.

ai javascript keras numpy opencv python react tensorflow

Last synced: 14 Apr 2026

https://github.com/vaishnavis03/finlatics_ml_program

This repository contains the .ipynb files for 3 datasets, along with a PPT for each. The datasets included are Facebook Marketplace Data, Sales Prediction Data, and Wine Quality data.

correlation data-analysis data-science data-visualization knn linear-regression machine-learning matplotlib numpy pandas random-forest-classifier scikit-learn

Last synced: 17 Apr 2026

https://github.com/massimilianovisintainer/rock-vs-mine-prediction

Rock vs. Mine classification using Logistic Regression

machine-learning model numpy pandas sklearn

Last synced: 21 Jan 2026

https://github.com/bharath-tars/deeptumor2.0

DeepTumor2.0 is a project aimed at developing a Flask web app with a robust front end and a highly trained TensorFlow model in the backend, with several features such as automatic HTML template reporting. **NOTE: DeepTumor2.0 has been moved from Azure to On-Render to save costs**

css datetime docs flask github html javascript keras numpy onrender os pandas python sheety smtp tensorflow

Last synced: 21 Jan 2026

https://github.com/lorenzorottigni/ml-breast-cancer

Machine Learning Python bootcamp: Support Vector Machines using the breast cancer dataset

ipynb machine-learning numpy pandas python scikit-learn seaborn support-vector-machines

Last synced: 14 Apr 2026

https://github.com/gauravtrivedi967/lane_detection

Lane detection and tracking are key features of advanced driver assistance systems. Lane detection is the process of detecting white lines on roads. In this project I have used prebuilt models to develop the code.

ipython-notebook numpy python thresholding

Last synced: 06 Mar 2026

https://github.com/saifalibaig/covid-19-death-rate-analysis-using-python

Analysis of Covid-19 data along with the world happiness report to identify if there is any relationship between death rate and happiness rate of countries all over the world.

data-analysis data-visualization numpy pandas python3 sns visualization

Last synced: 03 May 2026

https://github.com/charancholaveti/imdb-sentimentanalysis

IMDB Movie Review Sentiment Analysis: reports whether the sentiment of a review is positive or negative

deeplearning keras numpy rnn-tensorflow tensorflow

Last synced: 14 Apr 2026