An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/abyshergill/ultralytics_yolo_object_detection_testing_gui

This is a desktop application for real-time object detection using the YOLO (You Only Look Once) model build with ultralytics libarary. It's built with PyQt5 for the graphical user interface and OpenCV for video and image processing. The application allows users to load a YOLO model, adjust detection settings, and perform object detection on media.

numpy opencv-python pytorch qt5-gui threading ultralytics yolo

Last synced: 08 May 2026

https://github.com/prachi005748/website-performance-data-analysis-project

Briefly describe the objective of the project—e.g., analyzing website performance metrics over time, uncovering trends in user engagement, or evaluating channel-wise traffic quality.

data-analyst data-cleaning data-preprocessing data-visualization data-visualization-python exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn storytelling

Last synced: 01 May 2026

https://github.com/samiksha29-patil/e-commerce-sales-insights-dashboard

This project focuses on analyzing e-commerce sales data through data visualization. It highlights customer behavior, popular sales channels, product category trends, and city-wise performance to provide actionable business insights.

analytics customer-insights data-visualization ecommerce matplotlib numpy pandas python sales-analysis seaborn

Last synced: 03 May 2026

https://github.com/princeoncada/quant-pca-risk

Applies Principal Component Analysis (PCA) to daily returns of 20 US equities (2015–2025) to uncover hidden risk factors. Explores variance explained, scree, loadings, factor returns, covariance reconstruction, and Varimax rotation. Results show 3–5 PCs capture ~75% of portfolio risk.

correlation-analysis covariance-matrix dimensionality-reduction factor-models matplotlib numpy pandas pca portfolio-risk principal-component-analysis python quantitative-finance time-series-analysis variance varimax

Last synced: 06 May 2026

https://github.com/vitor-ace/sunspots-data-analysis

This is a Jupyter Notebook which works with Data Analysis logic and libraries implementation with Python.

data-analysis data-visualization debbuging error-handling file-handling matplotlib-pyplot numpy pandas python

Last synced: 06 May 2026

https://github.com/jaketherealone/machinelearningtoolset

Collection of primary tools used in the field of machine learning - Written for numpy projects

machine-learning numpy python toolset

Last synced: 19 Jun 2026

https://github.com/rohanbanerjee1234567-cell/prediction-of-expected-salary-using-machine-learning

Here is my first Project Repository where I have made a Machine Learning Project using Python. The Problem statement was to train a model based on the given Dataset and from there we need to Predict the Expected Salary of an Employee who will have similar profiles.

exploratory-data-analysis linearregression matplotlib-pyplot numpy pandas randomforest randomforestregressor scikit-learn scikitlearn-machine-learning searborn visualization

Last synced: 27 Apr 2026

https://github.com/mzayles/base_vendas_analise-exploratoria

💻📊✅ Curso de Programação em Python para Data Science | Identificando e analisando problemas.

numpy pandas python

Last synced: 24 Mar 2025

https://github.com/abhishek-patidar066/synthetic-datasets

Synthetic datasets are artificially generated data used for training machine learning models, simulating real-world data while ensuring privacy.

clustering datasets jupyter-notebook libraries matplotlib-pyplot numpy pandas-dataframe python random sklearn

Last synced: 15 Apr 2026

https://github.com/crvargasm/metnumun2021i

Universidad Nacional de Colombia - Métodos Numéricos ~ Grupo 1 2021-I

matplotlib numerical-analysis numpy pandas python quantopian

Last synced: 12 Apr 2026

https://github.com/akash-207/ecommerce-monetization-analysis

“Revenue and monetization analysis of Brazilian e-commerce data (Olist dataset). Includes KPIs, trends, and business insights.”

business-analytics matplotlib mysql numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/mrnust/ai-powered-socialmediapostoptimizer_llm

AI-Powered Social Media Post Optimizer enhances your social media content by optimizing text posts with relevant tags and keywords, predicting engagement metrics, and analyzing image quality. Upload your post and image to receive tailored suggestions for improving engagement and visual appeal.

groq langchain large-language-model llama-3-70b matplotlib numpy opencv streamlit textblob

Last synced: 14 Apr 2026

https://github.com/spacebakery/crunchie-munchies-project

Analyze Data with Pyhton

numpy python statistics

Last synced: 27 Jul 2025

https://github.com/stephnna/my_image_classifier

A deep learning-based image classifier built with PyTorch, designed to identify various flower species using architectures like VGG16, ResNet101, and AlexNet. This project leverages transfer learning for improved performance, checkpointing for seamless training, and supports both CPU and GPU devices. Perfect for exploring deep learning models and i

matplotlib numpy python pytorch

Last synced: 11 May 2026

https://github.com/alphacrypto246/grape-quality-prediction

The Grape Quality Prediction project uses machine learning to predict the quality of grapes based on chemical properties like acidity, sugar content, and alcohol levels. It applies regression models to forecast the quality score, helping in wine production and quality assessment.

machine-learning numpy pandas scikit-learn scikitlearn-machine-learning

Last synced: 19 Apr 2026

https://github.com/crispengari/effects-of-covid19-on-trade

This repository visulises the effects of covid19 on trade within these years:

100daysofcode datascience datascience-machinelearning matplotlib matplotlib-pyplot numpy pandas python python3

Last synced: 02 May 2026

https://github.com/sultanazhari/prediction-model-of-customers-leaving

Bank Beta Company focus on retain existing customers, our task is to create a model that predicts whether or not a customer will leave the bank soon.

accuracy-score confusion-matrix logistic-regression-algorithm matplotlib-pyplot numpy pandas python3 random-forest roc-auc-score seaborn train-test-using-sklearn

Last synced: 20 Apr 2026

https://github.com/sutterseba/des-python

A simple DES implementation in Python

cryptography numpy python

Last synced: 15 Jun 2026

https://github.com/mayankmittal29/tensortinker_statistical_methods_in_ai

This repository contains implementations of various machine learning algorithms from scratch, including Multi-Layer Perceptron (MLP), Gaussian Mixture Models (GMM), Principal Component Analysis (PCA), Autoencoders, and Variational Autoencoders.

autoencoder-mnist cupy gmm-clustering image-segmentation matplotlib-pyplot mlp-classifier mlp-regressor mnist-dataset numpy pandas pca python3 pytorch roc-auc seaborn torch variational-autoencoder

Last synced: 09 Apr 2026

https://github.com/nguyenanhtuan1912/datatable-image-to-text

Repository này là đồ án môn học Computer Vision

computer-vision javascript nodejs numpy opencv python tesseract

Last synced: 09 Apr 2026

https://github.com/shreedata/data-analysis-using-python-libraries-

The COVID-19 pandemic has significantly impacted India, necessitating a detailed analysis of the virus’s spread within the country. In this project, we explore an India-specific COVID-19 dataset, leveraging Python libraries such as Pandas, NumPy, Matplotlib, and Seaborn.

covid-19 data data-cleaning data-visualization datana kaggle-dataset matplotlib numpy pandas-python python3 pythonlibrarires scikit seaborn

Last synced: 28 Mar 2025

https://github.com/rahul-404/full_stack_data_science_masters

Welcome to the repository for the course "Full Stack Data Science Masters". This repository is designed to accompany the course and provide resources, exercises, and projects related to the study of data science techniques.

computer-vision data-science database deep-learning exploratory-data-analysis flask machine-learning natural-language-processing numpy pandas python statistics time-series visualization

Last synced: 10 Apr 2026

https://github.com/ramyacp14/salesforecasting

Forecasts future sales for a retail company using time series analysis with Facebook Prophet. The project involves data preprocessing, exploratory data analysis (EDA), and forecasting with holiday effects considered.

data-preprocessing data-visualization exploratory-data-analysis fbprophet machine-learning matplotlib model-evaluation numpy pandas python seaborn time-series-forecasting

Last synced: 06 Apr 2026

https://github.com/v-mayya/resale-revolution

Empower Hacks Hackathon Submission

html matplotlib numpy pandas python sqlite streamlit

Last synced: 09 Apr 2026

https://github.com/jcardonamde/datasets_ml

This project analyzes cab and limousine travel data in New York City. This with the goal of predicting the total duration of trips within the city. Machine learning models were used.

data-science machine-learning machine-learning-algorithms matplotlib numpy pandas pipelines python seaborn sklearn

Last synced: 09 Apr 2026

https://github.com/shafaq-aslam/predicting-heart-disease-risk-with-logistic-regression-techniques

Develop a predictive model using logistic regression techniques to assess heart disease risk based on patient health metrics and data analysis.

data-analysis heart-disease logistic-regression machine-learning machine-learning-models matplotlib numpy pandas python scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/n-t-raghava/the_sweet_16

This project detects faces in real-time from a webcam feed or an uploaded image and predicts age (in bins of 5 years) and gender (Male/Female). The model is based on OpenCV’s Deep Neural Network (DNN) module and pre-trained models for face, age, and gender detection.

caffemodel deep-learning flask numpy opencv python

Last synced: 09 Apr 2026

https://github.com/haseebulhassan437/lstm-next-word-predictor

A deep learning-based LSTM model for predicting the next word in a sequence using natural language processing techniques.

keras-tensorflow lstm numpy tensorflow

Last synced: 18 Apr 2026

https://github.com/alejandrolara11/desafio_latam_introduccion_analisis_de_datos

Repositorio del curso "Introducción al Análisis de Datos" de Desafío Latam. Ejercicios prácticos realizados durante el curso, enfocados en análisis de datos con Python, Pandas, y visualización básica.

data-analysis data-science data-visualization matplotlib numpy pandas python seaborn statsmodels

Last synced: 29 Apr 2026

https://github.com/shibam120302/dog_breed_prediction_app

The Dog Scanner app will identify your dog's breed reliably in just a few seconds! Besides taking a picture, you can also record a video or upload an image from your gallery. Got a mixed breed? No problem, the Dog Scanner app also recognizes mixed breeds!

keras numpy opencv python

Last synced: 06 Apr 2026

https://github.com/ajxxxs/spotify-music-analysis

spotify Music (web scraped playlists ) analysis (over 3 states) , trends, features and a music recommendation system.

matplotlib numpy panda scikit-learn seaborn

Last synced: 28 Jul 2025

https://github.com/manuethomas/traffic-accident-analysis-us

The project provides a comprehensive analysis of traffic accidents in the US from 2016-2023 aiming to identify key factors contributing to accidents. The analysis also focussed on finding features that could be used to develop a predictive model

exploratory-data-analysis feature-engineering feature-selection matpllotlib numpy pandas seaborn

Last synced: 20 Mar 2025

https://github.com/mukhtarmid/data-science

This repository is for the knowledge of data science.

datascience eda numpy pandas

Last synced: 10 May 2026

https://github.com/kikoveiga/feup-ia1

Artificial Intelligence (IA) First Project (2023/2024): BSc in Informatics and Computing Engineering @ FEUP

feup feup-ia feup-leic genetic-algorithms hill-climbing numpy pandas python simulated-annealing tabu-search

Last synced: 06 May 2026

https://github.com/chaitanyac22/cross_platform_product_mapping_algorithm_for_products

This repository contains a product ID mapping solution using TF-IDF vectorizer for weighted text vectors, Facebook AI Similarity Search (FAISS) for coarse filtering with cosine similarity, and Levenshtein distance for refined matching against the Blinkit catalog. Achieved 11.45% match for Zepto and 11.48% for Instamart.

exploratory-data-analysis faiss levenshtein-distance nlp numpy pandas similarity-search tf-idf-vectorizer

Last synced: 20 Mar 2025

https://github.com/ansh2709/customer-segmentation-ml-project

Project segregates the customers on the basis of their spending score and annual income using K-Means Clustering that is a part of unsupervised learning

clustering-algorithm k-means-clustering machine-learning matplotlib-pyplot numpy pandas python unsupervised-machine-learning wcss

Last synced: 01 May 2026

https://github.com/shambac/shamboflow

Fierce tensorflow competitor

cuda cupy machine-learning numpy pypi-package

Last synced: 19 Feb 2026

https://github.com/morsalinislamshapon/diabetes-prediction-systemv3

This repository contains a machine learning model that predicts diabetes using user health data. It features an interactive web interface built with Streamlit and provides insights into model predictions through SHAP and permutation importance. 🐙🌟

jupyter-notebook machine-learning matplotlib numpy pandas plotly randomforestclassifier sckit-learn seaborn streamlit-webapp

Last synced: 29 Jul 2025

https://github.com/abdelrahman-amen/attendance_system

An AI-driven system leveraging real-time face recognition to automate attendance, ensuring accuracy, scalability, and seamless CSV logging of names and timestamps.

cmake cv2 dlib face-recognition numpy python

Last synced: 20 May 2026

https://github.com/nathadriele/transaction_fraud_prevention_pipeline

Uma solução de detecção e prevenção de fraudes em transações financeiras, combinando Machine Learning, regras de negócio e análises estatísticas avançadas. O sistema oferece um dashboard interativo para monitoramento em tempo real, análise de dados e gestão de alertas de fraude.

data-analysis data-visualization docker fraud-prevention machine-learning matplotlib numpy pandas pipeline pytest python scikit-learn scipy seaborn streamlit tensorflow transaction xgboost

Last synced: 10 Apr 2026

https://github.com/zeeshan4002911/data-analysis-hub

Quality control, data processing, data cleaning, data ploting

jupyter-notebook jupyterlab matplotlib numpy pandas seaborn

Last synced: 03 May 2026

https://github.com/sultanazhari/factors-that-affect-vehicle-prices.

Crankshaft List Company want to know what the factors that affect to vehicles prices. As a data Analyst we're giving list of factor that affect with vehicle prices.

matplotlib-pyplot numpy python3 seaborn vehicles

Last synced: 11 May 2026

https://github.com/ledsouza/MedPhys-BI

Aplicativo desenvolvido para gestão e análise de dados do Serviço de Física Médica em Medicina Nuclear do HCPA

aws boto3 cloud-computing cloud-run docker dotenv gcp mongodb numpy os pandas plotly pymongo python s3 streamlit terraform vitrinedev yaml

Last synced: 22 Sep 2025

https://github.com/abhay-kanwasi/ml-learning

Discover a repository brimming with machine learning insights and projects. Dive into comprehensive documentation of ML concepts, algorithms, and techniques. Explore hands-on projects spanning sentiment analysis, image classification, recommendation systems, and more.

ml numpy pandas python recommender-system sklearn streamlit-webapp themoviedb-api

Last synced: 02 Apr 2026

https://github.com/hotequil/fish_classification

Identify the fish specie with Machine Learning.

classification fish keras numpy python tensorflow

Last synced: 27 Mar 2025

https://github.com/swat1563/recommendation-system

This repository features a recommendation system and analytics engine using datasets on users, organizations, contents, contacts, events, and recommendations. It includes data preprocessing, building a recommendation system, and creating visual reports with Power BI.

analytics data-analysis data-visualization engine kaggle numpy pandas powerbi powerbi-dashboards powerbi-desktop powerbi-reports python recommendation-engine recommendation-system recommender-systems scikit-learn scipy

Last synced: 07 Jan 2026

https://github.com/armandomtz05/optikit

Solution of the paraxial wave equations using different coordinates systems

fourier-transform holography numerical-methods numpy optics-code optics-simulation pillow

Last synced: 02 Feb 2026

https://github.com/dhirajmerit/diabetes-prediction-models

This project implements multiple machine learning algorithms to predict diabetes diagnoses based on medical diagnostic data. The project compares the performance of various models, providing insights into which algorithms are most effective for this task.

ai machine-learning numpy pandas tensorflow

Last synced: 11 Apr 2026

https://github.com/danwild/bike-share-prediction

Predict bike sharing time-series with numpy for Deep Learning

artificial-intelligence artificial-neural-networks deep-learning neural-network numpy

Last synced: 16 May 2026

https://github.com/werctfourth/python-border-autocrop2

Python script that crops borders from images v2

border crop image-processing libvips numba numpy python python3

Last synced: 17 May 2026

https://github.com/adnaen/water-quality-analysis

End-to-end Ground Water Quality Prediction app using Streamlit.

classification-model machine-learing matplotlib notebook numpy pandas plotly python sklearn streamlit

Last synced: 30 Dec 2025

https://github.com/vinit-source/csl7382-medical-image-clustering-assignment.py

The IPython notebook contains the questions as well as the related code. Only numpy has been used.

bioimage-analysis kmeans-clustering numpy slic spectral-clustering

Last synced: 22 May 2026

https://github.com/vigneshvaranasi/breast_cancer_detection

This project employs machine learning, focusing on Logistic Regression, to detect breast cancer using tumor-related features. The dataset is preprocessed, and the model achieves 100% accuracy on the test set. The goal is to gain insights into breast cancer factors and provide an effective detection solution.

jupyter machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 09 Apr 2026

https://github.com/darkdk123/handwashing-discovery-analysis

A Guided Project in a Boot camp to Analyse the Original Data used in the Discovery of Viruses & Hand Washing By Dr. Ignaz Semmelweis in Vienna General Hospital in the 1840s.

data-analysis data-science data-visualization matplotlib-pyplot numpy pandas plotly-python python seaborn-plots

Last synced: 09 Apr 2026

https://github.com/muavia1/roman-urdu-poetry-generation-using-lstm

Here’s a short description you can add to your GitHub project: Roman Urdu Poetry Generator A deep learning project using LSTM and TensorFlow to generate Roman Urdu poetry. Trained on a poetry dataset and deployed with a Gradio interface for interactive text generation.

deep-learning gradio gradio-interface lstm model-training numpy pandas poetry-generator python tensorflow

Last synced: 09 Apr 2026

https://github.com/sridharyadav07/machine-learning-project-bankruptcy-prevention-

The project explores multiple machine learning algorithms and evaluates their performance using various metrics, such as accuracy and confusion matrices. The models tested include Logistic Regression, K-Nearest Neighbors (KNN), Naive Bayes, and Support Vector Machine (SVM). In addition, regularization techniques (L1, L2) are used to avoid overfit.

data-preprocessing evaluation machine-learning-models matplotlib-pyplot modelbuilding modeldeployment numpy pandas python scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/gitw1n/japandemographicsinsights

JapanDemographicsInsights is an ongoing project aimed at providing comprehensive insights into the demographic trends of Japan. The project is focused on analyzing and visualizing population data, age distribution, migration patterns, birth and death rates, and other related demographic factors that shape the current and future landscape of Japan.

indevelopment jypyternotebook numpy python3 scientific-visualization

Last synced: 11 May 2026

https://github.com/dmarks84/ind_project_european-soccer-top-points-contributors--kaggle

Independent Project - Kaggle Dataset-- I worked on the European Soccer Dataset, using SQL (SQLite) to read in the data and then data wrangling before running statistical analysis and hypothesis testing on questions of who helps earn the most points for their team.

data-wrangling hypothesis-testing numpy p-values pandas python scipy-stats statistics t-test

Last synced: 09 Apr 2026

https://github.com/blinksta1ker/co2-prediction

EDA and Supervised ML model using Regression to predict Co2 Emissions from Vehicles

jupyter-notebook machine-learning numpy pandas python regression seaborn sklearn supervised-learning

Last synced: 09 Apr 2026

https://github.com/dpgitaccount/project-customer-segmentation-analysis-with-power-bi

This project segments customers based on their purchasing behavior to identify different target groups. It demonstrates skills in data analysis, clustering, and visualization using Power BI.

kmeans-clustering-algorithm linear-regression numpy pandas powerbi python seaborn segmentation

Last synced: 09 Apr 2026

https://github.com/hansalemaos/npsearchsequence

Sequence / string search with NumPy

fast numpy search sequence string

Last synced: 19 May 2026

https://github.com/klima7/numpynet

Convolutional Neural Network written from scratch using numpy with API similar to tensorflow.

cnn convolution network nn numpy scratch tensorflow

Last synced: 24 Sep 2025

https://github.com/0xunkn0wn4m1r/data_engineering_banking_project

🏦 Build a complete data engineering workflow for a banking system, showcasing ETL processes, data transformations, and an interactive financial dashboard.

automation data-analysis data-cleaning data-science feature-engineering fintech-bank flask-api loan-default-prediction machine-learning mlops model-explainability numpy postgresql scikit-learn segmentation shap sql unsupervised-learning

Last synced: 09 Apr 2026

https://github.com/anas436/exploratory-data-analysis-of-cars

Exploratory Data Analysis of Cars in Python

matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/rezapace/machine-learning-sound-detection

Welcome to the Machine Learning Detection Sound project! This project harnesses the power of machine learning to analyze car sounds, enabling the detection of vehicles based on their audio signatures.

colab-notebook googlecolab jupyter-notebook machine-learning numpy pathlib python python3 sound source tensorflow voice voice-recognition

Last synced: 09 Apr 2026

https://github.com/sunilvarma-l/zomato-data-analysis-using-python

its a Python-based project to analyze and visualize Zomato restaurant data. This project explores various aspects of restaurant data, such as customer reviews, restaurant ratings, locations, and cuisines, with the aim of uncovering insights and trends in the food industry.

googlecollab matplotlib numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/mrandolph95/grad-projects

Projects completed during my ASU grad program for Social Data Science.

beautifulsoup nltk numpy pandas regression-analysis sckiit-learn seaborn selenium

Last synced: 09 Apr 2026

https://github.com/oaerobert/python-for-finance

Implementations of financial mathematics and statistics using the "Python for Finance" textbook. Includes Monte Carlo simulations, predictive stock prices using ML strategies and real world data visualisations.

matplotlib numpy pandas python

Last synced: 09 Apr 2026

https://github.com/waghraj1699/codeclauseinternship_customer_segmentation_tool

Implementing customer segmentation using ML, covering exploring advantages, preprocessing, K-means clustering, and visualization.

data-preprocessing kmeans-clustering machine-learning numpy pandas visualization

Last synced: 08 May 2026

https://github.com/vedanty3/bulldozer-price-prediction

A machine learning project aiming to build a machine learning model which could predict the sales price of bulldozer.

andrew-ng-machine-learning ensemble-machine-learning gridsearchcv jupyter-notebook machine-learning matplotlib numpy pandas python randomforestregressor randomizedsearchcv scikit-learn ztm

Last synced: 05 Apr 2026

https://github.com/mrmalik2512/catsvsdog.github.io

A CNN model integrated with flask backend the project is trained on image data of dogs and cats and integrated with a website predicts the given image is dog or a cat

deep-learning numpy python scikit-learn tensorflow

Last synced: 09 Apr 2026

https://github.com/tigureis/data-preparation-from-kickstarter-campaigns

Kickstarter Data Prep: A hands-on guide to basic data cleaning and transformation.

data-cleaning data-construction data-integration data-science data-selection numpy pandas

Last synced: 19 Apr 2026

https://github.com/sandy-07/breast-cancer-detection-using-histopathological-images

Using Histopathological images identifying the type of cancer Benign and Malignant tumors.

flask-application html-css numpy pyhton3 tensa tensorflow

Last synced: 11 May 2026

https://github.com/shreeparab1890/chat-analyzer

This project is a Data Analysis project to analyze the WhatsApp chats.

data-analysis numpy pandas python

Last synced: 12 Apr 2026

https://github.com/bilalm04/email-spam-classifier

A machine learning project that classifies emails as spam or not spam using Logistic Regression, with a deployable Flask API for real-time classification.

api flask jupyter-notebook machine-learning matplotlib nlp numpy pandas python scikit-learn

Last synced: 06 Mar 2026

https://github.com/gitluffywd/advanced_video_stabilization_and_roi_pixel_analysis

This repository offers tools for advanced video stabilization and pixel analysis, enhancing your video processing capabilities. Explore features like optical flow tracking and ROI selection to dive deeper into your video data. 🛠️🎥

data-structures debugging english-language git github html markdown numpy opencv project-management python

Last synced: 20 Jan 2026

https://github.com/zeeshan4002911/data-analysis

Quality control, data processing, data cleaning, data ploting

jupyter-notebook jupyterlab matplotlib numpy pandas seaborn

Last synced: 30 Apr 2025