An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/minhtungonep/android-traffic-analysis

Android malware detection project analyzing network traffic patterns in a telecommunications context. Uses statistical hypothesis testing and data visualization to evaluate traffic features like DNS query times, TCP packets, and volume bytes for distinguishing between benign and malicious Android applications.

android-library bachelor-project bachelor-thesis cybersecurity cyface data-visualization hypothesis-testing malware-detection matplotlib network-traffic numpy pandas python scipy sdk statistical-analysis telecommunications voip-security

Last synced: 09 May 2026

https://github.com/sudharshan-bathini/chat-bot-using-cornell-movie-data-set

Developed an AI chatbot using the Cornell Movie Dialogs dataset; trained an LSTM-based model to generate context-aware, conversational responses for user inputs.

keras lstm-sentiment-analysis neural-networks nlp numpy python tensorflow tokenizer

Last synced: 08 Apr 2026

https://github.com/christabelsakyi/product_purchase_prediction

As a newly hired AI Engineer, your task is to predict customer behavior based on various features such as age, income, and gender. This exercise involves cleaning the data, training a decision tree model, and evaluating the model's performance to understand the key factors influencing customer purchasing decisions.

machine-learning matplotlib-pyplot numpy python seaborn sklearn

Last synced: 08 May 2026

https://github.com/umer-farooq-cs/california-housing-regression

Linear Regression experiments on the California Housing dataset across five phases, using NumPy and scikit-learn only (no pandas). Includes EDA, polynomial features, SGD with scaling, residuals, 5-fold CV, and an LNCS-style report with figures.

california-housing cross-validation deep-learning-for-perception linear-regression numpy polynomial-features reproducible-research residual-analysis sgd sklearn

Last synced: 09 May 2026

https://github.com/zolppy/recommendation-system

This project demonstrates the creation of a content-based image recommendation system. It leverages a pre-trained VGG16 deep learning model to extract meaningful feature vectors from images. These features are then compared using cosine similarity to identify and recommend visually similar images.

computer-vision deep-learning keras machine-learning numpy recommendation-system sklearn tensorflow vgg16

Last synced: 12 Apr 2026

https://github.com/khushneetsingh/datasanity

DataSanity is a AI-powered web application for dataset cleaning, synthetic data generation, vectorization, and data enrichment using natural language prompts.

cerebrus exa faiss-vector-database llm nextjs numpy pandas serperdev sqlite tailwindcss

Last synced: 08 Apr 2026

https://github.com/davityak03/object-detection-in-images-using-yolo

This project demonstrates object detection using the YOLO (You Only Look Once) model, providing a solution for detecting and labeling multiple objects within images or video streams. This implementation covers image preprocessing, model inference, and visualization of detected objects.

computer-vision matplotlib-pyplot numpy object-detection opencv-python pandas python yolo

Last synced: 18 Jan 2026

https://github.com/piyushxbhardwaj/credit-card-fraud-detection

In this project, we focus on developing a machine learning model to detect fraudulent transactions within credit card data. Credit card fraud poses a significant challenge to financial institutions and consumers globally, with traditional rule-based detection systems often struggling to adapt to new fraud tactics.

matplotlib numpy pandas python scikit-image seaborn

Last synced: 16 Feb 2026

https://github.com/mkdirer/disaster-tweet-nlp-adaboost-transformers

A machine learning project that classifies tweets as disaster-related or non-disaster-related using transformers and the AdaBoost technique. Achieved an accuracy of approximately 80% on the validation set

geopy keras matplotlib numpy pandas python sklearn tensorflow

Last synced: 12 Apr 2026

https://github.com/lewis-morris/image_augment

Image augmentation for CNN's. Aiming to increase generalization.

augmentation image numpy python

Last synced: 18 Feb 2026

https://github.com/ahmedabdalkreem/lung-cancer

It help to know the patient have lung cancer or not and show what depenent in this disease that lead to this disease.

matplotlib numpy pandas python sklearn-library

Last synced: 14 Apr 2026

https://github.com/ellennewerkla/eda_students_performance

EDA em Python com Pandas, Matplotlib e Seaborn para identificar padrões de desempenho em estudantes.

matplotlib numpy pandas python3 seaborn

Last synced: 08 May 2026

https://github.com/gherrada22/spanish-sign-translator

This project uses Python and the OpenCV and MediaPipe libraries to detect and translate signs in real time.

language matplotlib mediapipe numpy opencv pycharm-ide python python3 spanish-language toml

Last synced: 18 Jan 2026

https://github.com/faisal-khann/ipl-analysis

The IPL Analysis project is a comprehensive data-driven exploration of the Indian Premier League (IPL), analyzing historical match data to uncover patterns in team performance, player statistics, and match outcomes.

data-analysis exploratory-data-analysis jupyter-notebook matplotlib numpy pandas seaborn

Last synced: 08 May 2026

https://github.com/fuodorov/python

Python Numpy Tutorial

numpy python

Last synced: 09 Oct 2025

https://github.com/bhavnanahar/coral_project

ChatGPT said: This project analyzes and predicts the health of coral reefs in the Florida Keys by examining coral cover and species richness trends over time.

colab-notebook matplotlib numpy pandas plotly python random-forest scikit-learn seabo

Last synced: 14 Apr 2026

https://github.com/amritendunath/appointmate_1.0

A LangGraph-powered multi-agent/supervisor system for appointment scheduling. It enables users to interact in natural language to check doctor availability, book, reschedule, or cancel appointments.

fastapi langchain langgraph nltk numpy pandas psycopg2 python sqlalchemy uvicorn websocket

Last synced: 07 Apr 2026

https://github.com/tuokri/rs2simlib

Python library for Rising Storm 2: Vietnam physics objects simulation and automated data collection.

numba numpy physics rising-storm-2-vietnam simulation unreal-engine-3

Last synced: 23 Feb 2026

https://github.com/ianhaggerty/final-capstone

This represents the final capstone project in my HyperionDev Data Science (fundamentals) course. A dataset of Amazon customer reviews is analysed using natural language processing.

amazon data-analytics data-science data-visualization dataset matplotlib nlp nlp-machine-learning numpy pandas plotly reviews seaborn spacy tabulate textblob wordcloud

Last synced: 19 Jan 2026

https://github.com/rakesh3596/apple-phone-sales-analysis

Apple Phone Sales Analysis using Python tools like Pandas, Matplotlib, Numpy, Plotly, Apple csv file

csv-files csv-reader matplotlib numpy pandas plotly python

Last synced: 14 Apr 2026

https://github.com/farhannirzhor/python_eda_project_on_diwali_sales_analysis

In this project, I was given a sales data. I analyzed their sales data and shared a summary

data-cleaning data-manipulation exploratory-data-analysis matplotlib numpy pandas python seaborn

Last synced: 04 May 2026

https://github.com/gopikrsmscs/mycinepick-recommendation-system

MyCinePick is a user-specific movie recommendation system built using collaborative filtering techniques. This project aims to provide personalized movie suggestions tailored to each user's unique viewing preferences and habits.

collaborative-filtering movie-recommendation-app numpy python3 recommendation-system sklearn streamlit

Last synced: 14 Apr 2026

https://github.com/amiridk/statistics-and-algebra-in-python

simple and important statics operations and Famous statistical problems , also algebra operations

algebra itertools matplotlib numpy pandas python scipy seaborn static

Last synced: 14 Apr 2026

https://github.com/adi3042/diamond_price_prediction

🔮💎 Explore the Diamond Price Oracle! Predicting diamond values based on your preferences and market trends. Your ultimate tool for informed decisions in the diamond world. Join us for the future of pricing! 💎✨🚀 #DiamondPredictor

anaconda datetime diamond flask functools html ipykernel jupyter-notebook matplotlib numpy pandas price-prediction readme regressor sckiit-learn setuptools venv

Last synced: 14 Apr 2026

https://github.com/badranalyst/time-series-analysis-of-global-trends-in-diet-gym-and-finance

This project analyzes global trends in diet, gym, and finance over time using time series data. The analysis is performed using Python libraries like Pandas, Matplotlib, and Seaborn to visualize trends and identify patterns in these sectors across various countries.

data-analysis dataset matplotlib-pyplot numpy pandas python seaborn time-series

Last synced: 14 Apr 2026

https://github.com/bharath-tars/deeptumor2.0

DeepTumor2.0 is a project aimed at developing a flask webapp with robust front end and highly trained model using tensorflow in the backend with several features like with automatic html template reporting. **NOTE: DeepTumor2.0 has been moved from Azure to On-Render to save costs**

css datetime docs flask github html javascript keras numpy onrender os pandas python sheety smtp tensorflow

Last synced: 21 Jan 2026

https://github.com/ramchaik/cinebrain

CineBrain uses ML & NLP to analyze movies and recommend similar ones based on user preferences through cosine similarity.

cosine-similarity flask htmx kaggle machine-learning movie-recomendation-system nlp nltk numpy pandas sklearn tailwind tmdb tmdb-api unsupervised-learning vectorization

Last synced: 19 Jan 2026

https://github.com/charancholaveti/imdb-sentimentanalysis

IMDB Movie Review Sentiment Analysis-It will give the sentiment of the review whether it is Positive or Negative

deeplearning keras numpy rnn-tensorflow tensorflow

Last synced: 14 Apr 2026

https://github.com/lorenzorottigni/ml-kyphosis

Machine Learning python bootcamp: decision trees & random forest on kyphosis dataset

decision-trees ipynb machine-learning numpy pandas python random-forest seaborn

Last synced: 14 Apr 2026

https://github.com/dhruvrajsinhzala24/house-price-predictor-ml-model

A machine learning project to predict house prices using linear regression and random forest models. The project leverages housing data to train and evaluate models, comparing their performance using metrics like MSE and R². Visualizes actual vs. predicted prices to assess model accuracy.

linear-regression machine-learning matplotlib ml numpy pandas python random-forest-regression sklearn

Last synced: 14 Apr 2026

https://github.com/ashutosh65000/invisible-cloak

Have you watched the Harry potter movie? The Invisible Cloak scene pushed me to make this small code.

image-processing numpy opencv-python

Last synced: 19 Apr 2026

https://github.com/jeffson-dsilva/smartphone-price-prediction

Smartphone Price Prediction is a machine learning project that predicts the price category of a smartphone based on its features. This project provides insights into how various specifications, such as RAM, battery capacity, and processor speed, influence the pricing of smartphones.

dataanalysis machine-learning matplotlib modelbuilding numpy pandas price-prediction python seaborn smartphone

Last synced: 14 Apr 2026

https://github.com/chandkund/ev-market-segmentation

This project focuses on segmenting the Electric Vehicle (EV) market in India to identify key customer groups based on their preferences, behaviors, and demographics.

machine-learning matplotlib numpy pandas seaborn sklearn visualization

Last synced: 19 Jan 2026

https://github.com/umesh-01/python

Python is a programming language that lets you work more quickly and integrate your systems more effectively. Python is used for web development, AI, machine learning, operating systems, mobile application development, and video games.

ai application-development artificial-intelligence deep-learning machine-learning matpotlib ml nlp numpy open-source pandas python python-framework python-libraries python3 web-development

Last synced: 07 Apr 2026

https://github.com/jonad/quora_w2v

In-domain word embedding using Quora dataset.

gensim-word2vec matplotlib numpy pandas python3

Last synced: 11 Apr 2026

https://github.com/szymon-budziak/real_estate_house_prices_prediction

Predicting real estate house prices using various machine learning algorithms, including data exploration, preprocessing, model training, and evaluation.

data-analysis data-preprocessing data-science eda jupyter-notebook machine-learning matplotlib numpy optuna pandas predictive-modeling price-prediction python random-forest regression scikit-learn seaborn

Last synced: 21 Jan 2026

https://github.com/wasifsohail5/amusic-ai_powered_musicrecommendationsystem

AMUSIC is an AI-driven music recommendation system that helps users discover personalized songs. Using Python, Streamlit, and Scikit-learn, it offers smart recommendations, advanced search, and interactive music insights. Users can save favorites, create playlists, and export data for a seamless music discovery experience.

joblib k-nearest-neighbours matplotlib minmaxscaler numpy pandas pickle plotly python scikit-learn seaborn streamlit

Last synced: 14 Oct 2025

https://github.com/polyee99/kaggle-titanic-data-analytics

Jupiter notebook to predict the outcome of passengers who died or not in the tragical Titanic event.

data eda jupiter-notebook matplotlib numpy pandas python regression-analysis test-train-split visualization

Last synced: 05 Feb 2026

https://github.com/gregoritsch3/ml_eda_classification_diabetes

An EDA and Machine Learning Classification exercise on the Diabetes dataset demonstrating the use of SQLAlchemy data import from an SQL database (PostgreSQL), Pre-processing Pipelines, ANOVA, 9 ScikitLearn ML models, Hyperparamter Tuning for the best performing one, and feature importance.

anova machine-learning matplotlib numpy pandas pipelines scikit-learn seaborn sql sqlalchemy statistics

Last synced: 14 Apr 2026

https://github.com/ljubogdan/solar-cycle-lstm

This project predicts sunspot activity using an LSTM model for time series data. Built with TensorFlow and Keras, it uses Huber loss for outlier handling and MAE for performance evaluation. The dataset, sourced from Kaggle or SIDC, spans over 270 years of monthly sunspot data.

conv1d huber-loss-regression kaggle keras lstm machine matplotlib numpy pandas scikit-learn seaborn solar sunspots tensorflow time-ser

Last synced: 13 Apr 2026

https://github.com/anish37860/python_party

Python summer party by Interview Master & in partnership with DataCamp

jupyter numpy pandas python

Last synced: 06 May 2026

https://github.com/mjul/scipy-lab

Scientific computation with Python

matplotlib numpy python scipy

Last synced: 04 May 2026

https://github.com/cosmoquester/numpy-neural-net

This is neural network samples implemented with numpy without machine learning libraries.

implementation neural-network numpy

Last synced: 22 Jan 2026

https://github.com/farhad-here/predict_student_performance

Predict Student Performance, is a data analysis and machine learning project aimed at predicting students' final performance (g3) based on demographic, family, and academic features. The project supports both Regression (predicting exact grades) and classification (Pass/Fail categories).

classification data-analysis data-visualization linear-regression machine-learning numpy pandas postgresql powerbi scikit-learn streamlit

Last synced: 14 Apr 2026

https://github.com/sharif-minhaz/rag-system

Ingest and vectorize content upon publication, store embedding, then retrieve and augment user queries with context to generate high-quality responses.

faiss flask mysql2 nodejs numpy rag react transformers

Last synced: 14 Apr 2026

https://github.com/saivarun2611/potato_disease_classifier

This project is a Potato Disease Classifier that uses a Convolutional Neural Network (CNN) to classify potato leaves as healthy, suffering from early blight, or late blight. The model helps in early detection of diseases, which can be crucial in maintaining crop health and yield.

cnn keras matplotlib numpy pandas python3 tensorflow

Last synced: 14 Apr 2026

https://github.com/apbetioli/neural-network

A simple neural network implementation

machine-learning neural-network numpy python3

Last synced: 29 Apr 2026

https://github.com/miasanromen/math-modeling

Math modeling with Python. Statistics, math, graphics

math matplotlib numpy probality-theory scipy-stats statistics

Last synced: 14 Apr 2026

https://github.com/rezasharifi82/lesson_extractor

Extract all possible "معارف" courses @ Ferdowsi University of Mashhad ,FUM

beautifulsoup4 ipynb jupyter-notebook numpy pandas python3 regex

Last synced: 06 Feb 2026

https://github.com/harmanveer2546/heart

Predicting the presence of heart disease based on several health-related factors and Performing - i.) Data Cleaning ii.) Data Pre-Processing iii.) EDA iv.) Compare 5 different classification algorithms (Logistic Regression, Decision Tree, Random Forest, KNN and SVC)

data-preprocessing decision-tree eda knn logistic-regression machine-learning numpy pandas random-forest roc-auc-curve svc

Last synced: 03 May 2026

https://github.com/lavkalsi/creditcardfrauddetector

Credit Card Fraud Detector is a React web app that predicts if a credit card transaction is fraudulent using a Python machine learning model. Users can input transaction data, and Flask facilitates communication between the backend and frontend. Backend files are located in the res folder. This app provides simple UI for user interaction.

fraud-detection machine-learning nodejs numpy pandas python reactjs sklearn-library

Last synced: 15 Apr 2026

https://github.com/guy1998/synthetic-data-generator-for-event-cameras

Automated workflow for the generation of synthetic data that can be used as an input for event cameras based on previously taken videos

computer-vision event-cameras numpy opencv synthetic-data

Last synced: 15 Apr 2026

https://github.com/drsartoriuss/portfolio

Учебные проекты из Яндекс Практикума "Специалист по Data Science+"

catboost computer-vision data-science deeplearning mashinelearning matplotlib nlp-machine-learning numpy pandas scipy seaborn sklearn sql torch tsf

Last synced: 28 Jan 2026

https://github.com/deepankkartikey/ml-projects

A repository of projects done while learning and exploring ML

classification csv machine-learning numpy pandas regression sklearn

Last synced: 15 Apr 2026

https://github.com/ljadhav25/decision-tree-random-forest-algorithm-data-science-

This repository contains an implementation of decision tree and random forest algorithms from scratch in Python. Decision trees and random forests are popular machine learning algorithms used for classification and regression tasks. The goal of this project is to provide a clear and understandable implementation of these algorithms

data-analysis data-science decision-trees machine-learning-algorithms matplotlib numpy pandas python random-forest-classifier

Last synced: 15 Apr 2026

https://github.com/tamk-kol/project_orbital_data_analysis

The goal of this project is to develop an automatic method to detect orbital maneuvers using machine learning.

matplotlib numpy pandas scikit-learn

Last synced: 30 Jan 2026

https://github.com/stavrosarm/mathematics_algorithms

Fundamental algorithms of Linear Algebra , Numerical Analysis and Calculus in Python and Julia.

calculus julia-language linear-algebra numerical-analysis numpy python

Last synced: 30 Jan 2026

https://github.com/thkox/uni_2023-compression-and-object-removal

This project is a team assignment for the "Multimedia Systems" course, offered in the 6th semester of the 2023 academic year at the University of Piraeus, Department of Informatics.

2023 6th-semester colorama compression-algorithm department-of-informatics heapq lossless-compression-algorithm motion-compensation multimedia-systems numpy object-removal opencv-python pickle python scipy-stats university-of-piraeus

Last synced: 07 Feb 2026

https://github.com/farha-dahman/investigate_dataset

Investigate [No-show appointments] Dataset

data-analyst jupyter-notebook matplotlib numpy pandas python

Last synced: 15 Apr 2026

https://github.com/codehub001/age-and-gender-detection

This AI-powered project detects faces and predicts age & gender from images or video streams using OpenCV and deep learning. 🧠💡 🔹 Face Detection 👀 🔹 Gender Classification 🏷️ (Male/Female) 🔹 Age Estimation 📊 (from predefined age groups) 🔹 Live Video & Image Support 🎥🖼️

argparse deep-learning numpy opencv pretrained-model python

Last synced: 15 Apr 2026

https://github.com/samiyaalizaidi/nn-ml-homeworks

Homework solutions for CPE-4903: Neural Networks & Machine Learning at Kennesaw State University.

machine-learning machine-learning-workflow neural-networks numpy scikit-learn

Last synced: 15 Apr 2026

https://github.com/martincastroalvarez/python-video-processing

Movie generator using MoviePy, OpenCV2 and Numpy.

moviepy numpy opencv

Last synced: 15 Apr 2026

https://github.com/arv-anshul/campusx-graphs

Contains all the graphs provided by CampusX to learn Statistics and Algebra with some added feature.

algebra campusx-dsmp data-science matrix numpy statistics streamlit vizualisation

Last synced: 15 Apr 2026

https://github.com/pedrohenriquebr/pegasus

Finance Reporting Automation

excel finance financial numpy pandas python python3 report

Last synced: 15 Apr 2026

https://github.com/rissh/titanicsurvivalpredictionusingml

Predicting Titanic passenger survival through machine learning. This project includes data preprocessing, exploratory data analysis, feature engineering, and model training using Python. 🚢

data data-analysis data-science data-visualization dataanalysis jupiter-notebook machine-learning machine-learning-algorithms machinelearning matplotlib numpy pandas prediction prediction-model python python3 seaborn tenserflow tflearn titanic

Last synced: 01 Feb 2026

https://github.com/hasanyahya101/pareto-tutorial-python

This tutorial demonstrates how to create Pareto charts using Python libraries like pandas, NumPy, and Matplotlib. All code for this is given in the Jupyter Notebook.

chart charts code-blocks ipynb jupyter-notebook license markdown matplotlib mit numpy pandas pareto py python

Last synced: 07 Feb 2026

https://github.com/vladimiracunadev-create/python-data-science-program

Python Data Science Program — 197 clases en 9 partes. Pauta avanzada derivada de Géron, VanderPlas, Huyen, ISLP y Barocas/Hardt/Narayanan. Recurso personal de aprendizaje, enseñanza y mejora continua.

bootcamp data-analysis data-science education jupyter machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 01 Jun 2026

https://github.com/sharkb8t/credit-risk-classification

Demonstrates my abilities to use Jupyter Notebook with scikit-learn to train and evaluate a machine learning model.

jupyter-notebook numpy pandas pathlib python scikit-learn

Last synced: 15 Apr 2026

https://github.com/ryancswallace/npdb

Parallel NumPy-like interface for large n-dimensional arrays on disk.

mapreduce-designpatterns numpy numpy-arrays parallel-computing python

Last synced: 09 Feb 2026

https://github.com/hansalemaos/all_against_all

Each item of a list against all others

list numpy

Last synced: 02 Apr 2026

https://github.com/murugavl/forage_tata_genai_powered_data_analytics

This repository showcases my work from the TATA GenAI Powered Data Analytics Virtual Experience on Forage, where I applied data cleaning, analysis, and visualization techniques to derive actionable business insights.

excel genai matplotlib numpy pandas powerbi python

Last synced: 09 Feb 2026

https://github.com/mariam-zaidi/bike-sharing-prediction-model

Build a multiple linear regression model for the prediction of demand for shared bikes.

linear-regression matplotlib numpy pandas rfe seaborn sklearn

Last synced: 09 Feb 2026

https://github.com/khinthandarkyaw98/python_for_engineers

This particular Python notebook is designed to provide Engineers with an opportunity to practice scientific computations.

engineering numpy python scientific-computing youtube

Last synced: 16 Apr 2026

https://github.com/cherukuri-thanu/conjoint-analysis-netflix

This repository contains configuration files for performing conjoint analysis on Netflix survey data.

conjoint-analysis logistic-regression matplotlib-pyplot netflix numpy pandas squantify statsmodels

Last synced: 16 Apr 2026

https://github.com/ywatanabe1989/scitex-io

Universal scientific data I/O with plugin registry — save/load 30+ formats with one API. Part of SciTeX.

cli csv data-io hdf5 mcp numpy openscience pandas plugin-registry python research scientific-computing scitex

Last synced: 07 Jun 2026

https://github.com/rahulsm20/storedata

A data analysis project aimed at analyzing the sales data of the super store and providing useful insight into customer preferences.

data-analysis matplotlib numpy pandas python streamlit

Last synced: 16 Apr 2026

https://github.com/lexxai/goit_python_ds_hw_09

Модуль 9. Підбір гіперпараметрів НМ. Глибоке навчання. Tensorflow. Keras.

adam-optimizer data-science google-colab keras keras-tensorflow numpy python

Last synced: 16 Apr 2026

https://github.com/andreashovaldt/simple-neural-network

A simple neural network, created with numpy, based on Stochastic Gradient Descent, used for classifying the MNIST dataset

mnist-classification neural-network numpy stochastic-gradient-descent

Last synced: 16 Apr 2026

https://github.com/saurabh274/netflix-eda

Analyze the data and generate insights that could help Netflix in deciding which type of shows/movies to produce and how they can grow the business in different countries.

matplotlib numpy pandas python seaborn

Last synced: 28 Feb 2026