An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/manshreet27/mrs

This Movie Recommendation System is a web-based application built using Python and Streamlit, designed to provide movie recommendations based on user preferences. It utilizes TMDb API for fetching real-time movie details and Kaggle's TMDB 5000 Movies dataset for content-based filtering.

numpy pandas python scikit-learn streamlit tmdb-5000-movies-dataset-from-kaggle tmdb-api-for-fetching-real-time-movie-data

Last synced: 07 Apr 2026

https://github.com/pranjalshivhare06/medical-ensurance-charge-predictor

The Insurance Price Predictor is a machine learning project designed to predict insurance costs based on various input features. The project leverages four different algorithms, with XGBoost emerging as the most accurate and efficient model.

fastapi machine-learning numpy pandas xgboost-classifier

Last synced: 19 Apr 2026

https://github.com/nihanthbhargav/time-series-stock-market

This project combines computer vision and NLP by segmenting pet images with a U-Net model and generating captions using CNN-RNN/LSTM. Using the Oxford-IIIT Pets dataset, it demonstrates a unified pipeline that integrates pixel-level segmentation with automatic caption generation for meaningful image understanding.

matplotlib numpy pandas plotly python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/elvisrock/government_analysis

Esse projeto, tem como objetivo, analisar, explorar algumas técnicas e propor uma sugestão ao governo de forma geral, sem qualquer viés politico.

data-science machine-learning numpy pandas python sklearn

Last synced: 05 Apr 2026

https://github.com/mikma03/simulation_modeling

Simulation models using Python. Practical use of Python in real-world examples and additional resourses.

matplotlib numpy pandas portfolio python simulation stocks

Last synced: 08 May 2026

https://github.com/gauravpatil04/ml

This repo is on "Basics skills required for Machine Learning"

matplotlib numpy pandas python seaborn

Last synced: 11 Jul 2025

https://github.com/akashash01/support_vector_machine

It's an linear model for classification and regression problems. It can solve non-linear problems by creating a hyperplane which separates the data into classes using KERNAL .

data-science datavisualization machine-learning numpy pandas python sklearn svm-classifier

Last synced: 13 Apr 2026

https://github.com/dibsthegreat/titantic-dataset-analytics

DASC4850 Final Project where I did EDA to determine the survivability of Titanic guests depending on Age, Gender, Wealth, etc.

data-science data-visualization matplotlib numpy pandas python random-forest-classifier

Last synced: 13 Apr 2026

https://github.com/davityak03/sentence-paraphraser-checker-using-transformers

This Jupyter Notebook implements a tool to check whether two sentences are paraphrases by analyzing their semantic similarity using NLP techniques. It provides a similarity score and a binary decision to indicate if the sentences are paraphrases.

keras nlp nltk numpy python tensorflow tokenizer transformers

Last synced: 02 Jan 2026

https://github.com/supernovasatsangi23/modifying-biomarker-gene-identification-for-effective-cancer-categorization

A project that focuses on implementing a hybrid approach that modifies the identification of biomarker genes for better categorization of cancer. The methodology is a fusion of MRMR filter method for feature selection, steady state genetic algorithm and a MLP classifier.

dataset deep-learning deep-neural-networks feature-selection genetic-algorithm machine-learning machine-learning-algorithms mlp-classifier mrmr neural-network numpy pandas-dataframe python python3 scikit-learn scikit-learn-python tkinter-gui tkinter-python

Last synced: 05 May 2026

https://github.com/huggon1/ml-algorithm-implementations

Educational implementations for ML, DL, LLM blocks, ViT, and CUDA.

cuda machine-learning numpy pytorch vision-transformer

Last synced: 28 Jun 2026

https://github.com/minhtungonep/android-traffic-analysis

Android malware detection project analyzing network traffic patterns in a telecommunications context. Uses statistical hypothesis testing and data visualization to evaluate traffic features like DNS query times, TCP packets, and volume bytes for distinguishing between benign and malicious Android applications.

android-library bachelor-project bachelor-thesis cybersecurity cyface data-visualization hypothesis-testing malware-detection matplotlib network-traffic numpy pandas python scipy sdk statistical-analysis telecommunications voip-security

Last synced: 09 May 2026

https://github.com/chandkund/image-classification-using-the-mnist-dataset

Image Classification using the MNIST dataset. This project leverages a Convolutional Neural Network (CNN) to recognize and classify handwritten digits with high accuracy. Includes data preprocessing, model architecture, and evaluation. Explore the code and results here!

computer-vision data-science machine-learning matplotlib numpy pandas python

Last synced: 08 Apr 2026

https://github.com/swatisinghit/treadmill-customer-profiling-for-aerofit

Create comprehensive customer profiles for each AeroFit treadmill product through descriptive analytics. Develop two-way contingency tables and analyze conditional and marginal probabilities to discern customer characteristics, facilitating improved product recommendations and informed business decisions.

analytics conditional-probability data-analysis data-science data-visualization eda numpy pandas probability statistics

Last synced: 08 May 2026

https://github.com/michelenana/projet-7

PRODUISEZ UNE ETUDE DE MARCHE AVEC R OU PYTHON

acp boxplots cah kmeans matplotlib numpy pandas python scipy seaborn sklearn

Last synced: 08 Apr 2026

https://github.com/shankhadweep/diabetes-prediction-systemv3

This project demonstrates a machine learning solution for predicting diabetes based on user-provided health data. The application uses Streamlit for an interactive web interface and advanced interpretability tools like SHAP and permutation importance to explain model predictions.

jupyter-notebook machine-learning matplotlib numpy pandas plotly randomforestclassifier sckit-learn seaborn streamlit-webapp

Last synced: 01 May 2026

https://github.com/ejw-data/python-bitwise-operators

Difference between bitwise and logical operators when using pandas

numpy pandas

Last synced: 08 May 2026

https://github.com/ashishgoswami13/handgesture_recognition

Real-time hand gesture recognition using computer vision and deep learning. Collect gesture images, train a model with Google Teachable Machine, and classify gestures live for applications like drone control or human-computer interaction.

computer-vision cvzone handgesture-recognition keras numpy python teachable-machine

Last synced: 18 Apr 2026

https://github.com/chawthinn/car-price-prediction-regression-ml

This project predicts used car prices using regression models including Linear, Ridge, Random Forest, XGBoost, and LightGBM. It covers preprocessing, EDA, model evaluation, hyperparameter tuning, and model persistence using Scikit-learn and related libraries.

car-price-prediction lgbmregressor linearregression numpy pandas python scikit-learn xgbregressor

Last synced: 08 Apr 2026

https://github.com/caefleury/cis-ieee-difusion-model

Repositório teste para códigos do projeto de Modelo de Difusão do branch CIS-IEEE

docker docker-compose numpy python3 tensorflow

Last synced: 08 Apr 2026

https://github.com/18mahi/tweet-sentiment-analysis

Classify tweets into happy, sad, angry, excited, and neutral with this interactive Python model. Combining TF-IDF text features with engineered numeric features like emoji sentiment, polarity, subjectivity, and punctuation counts, it demonstrates intermediate-level NLP, feature engineering, and visualization skills.

data-science data-visualization jupyter-notebook machine-learning matplotlib nlp numpy pandas sklearn string textblob

Last synced: 30 Apr 2026

https://github.com/isratafrin826/linear-regression-model

Linear Regression is a statistical method used to model the relationship between a dependent variable (target) and one or more independent variables (features) by fitting a straight line (regression line) to the data. It predicts the target value based on the linear relationship and is commonly used for forecasting and trend analysis.

googlecolaboratory numpy pandas sklearn

Last synced: 03 May 2026

https://github.com/raveenaross/bulldozer-price-prediction

Predicting the sale price of bulldozers using machine learning

matplotlib numpy pandas sklearn time

Last synced: 07 Oct 2025

https://github.com/zolppy/recommendation-system

This project demonstrates the creation of a content-based image recommendation system. It leverages a pre-trained VGG16 deep learning model to extract meaningful feature vectors from images. These features are then compared using cosine similarity to identify and recommend visually similar images.

computer-vision deep-learning keras machine-learning numpy recommendation-system sklearn tensorflow vgg16

Last synced: 12 Apr 2026

https://github.com/prarthana-singh/bangalore-house-price-predictor

🏡 Bangalore House Price Prediction – A Machine Learning model to predict house prices in Bangalore using real estate data. Built with Linear Regression, Python, Pandas, NumPy, and Scikit-Learn.

data-analysis eda house-price-prediction linear-regression machine-learning numpy pandas python real-estate regression scikit-learn

Last synced: 19 Apr 2026

https://github.com/dwija12903/mentorness-internship

Developed and applied technical skills in areas such as programming languages, data analysis, and machine learning methodologies.

matplotlib numpy pandas python scikit-learn

Last synced: 08 Apr 2026

https://github.com/thavinduushan/object-detection

Application developed for real-time object detection and counting utilizing COCO dataset

numpy opencv python

Last synced: 15 Apr 2026

https://github.com/shibbir-ahmad24/a-data-driven-approach-to-food-security-and-supermarket-accessibility

A Data-Driven Approach to Food Security and Supermarket Accessibility

data-analysis matplotlib numpy pandas python3 seaborn

Last synced: 05 Apr 2025

https://github.com/nskamaleshmani/exoseeker

🌌 Discover and analyze exoplanets with ExoSeeker, a tool designed for efficient world-hunting using Python and data visualization techniques.

exoplanet-transits exoplanets gradient-boosting machine-learning matplotlib nasa nasa-data nasa-spaceapps-challenge numpy pandas perceptron-neural-networks random-forest streamlit

Last synced: 03 Nov 2025

https://github.com/sahilmaurya28/youtube-data-analysis

YouTube Data Analysis using Python — uncovering trends, engagement patterns, and correlations between likes, comments, views, and categories to understand what drives content success.

analysis data-analysis data-visualization matplotlib-pyplot numpy pandas portfolio-project python seaborn youtube

Last synced: 13 Apr 2026

https://github.com/thkox/uni_2023-content-based-image-retrieval

This project is a team assignment for the "Image Analysis" course, offered in the 7th semester of the 2023 academic year at the University of Piraeus, Department of Informatics.

7th-semester cartesian-product content-based-image-retrieval department-of-informatics hyperedge-similarity-calculation hypergraph image-analysis matplotlib networkx numpy python rank-normalization scipy torchvision university-of-piraeus

Last synced: 08 Oct 2025

https://github.com/abhishek-k-git/feature-engineering-using-stock-exchange-data

Creating quantitative trading strategies requires historical and real-time data and involves comparing models against future returns post-training.

lightgbm matplotlib numpy pandas plotly seaborn

Last synced: 12 Apr 2026

https://github.com/williyam-m/sentiment_analysis

A Django-based Sentiment Analysis web app leveraging the power of the BERT transformer model to classify text sentiment accurately.

beautifulsoup bert django numpy pandas python torch transformers

Last synced: 20 Jun 2026

https://github.com/mr-chang95/datascience_airbnb

Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.

airbnb data-analysis data-science data-visualization jupyter-notebook numpy pandas python sklearn

Last synced: 08 Apr 2026

https://github.com/lluispp/hunting_for_exoplanets_with_ai

An AI-powered tool that classifies exoplanets using NASA datasets and provides an interactive web interface for real-time exploration and discovery.

ai challenge exoplanets nasa numpy python spaceappchallenge-2025 spaceapps swai web

Last synced: 18 May 2026

https://github.com/praju-1/data_science_projects

It contains the necessary code, datasets, and documentation to understand, replicate, and build upon the project's findings and methodologies.

data-science datasets deep-learning exploratory-data-analysis machine-learning matplotlib numpy pandas predictive-analytics python seaborn sklearn statistics visualization

Last synced: 08 Apr 2026

https://github.com/khushneetsingh/datasanity

DataSanity is a AI-powered web application for dataset cleaning, synthetic data generation, vectorization, and data enrichment using natural language prompts.

cerebrus exa faiss-vector-database llm nextjs numpy pandas serperdev sqlite tailwindcss

Last synced: 08 Apr 2026

https://github.com/rtmigo/markov_walk_py

🔢 Python module that calculates probabilities for a random walk in 1-dimensional discrete state space

absorbing-markov-chains absorbing-states markov-chain mathematics numpy probability probability-theory random-walk stochastic-matrix stochastic-models

Last synced: 21 May 2026

https://github.com/claby2/mouse-track

Track, display, and graph the position of a mouse cursor.

c cpp matplotlib numpy python sdl2 windows

Last synced: 12 Apr 2026

https://github.com/alex-s2222/mem_chat_bot

телеграмм бот для создания gif из текста введённого пользователем

cv2 google-api moviepy numpy python python-telegram-bot requests

Last synced: 08 Oct 2025

https://github.com/manjotkaurgill/agritech

Enter details of your soil and weather, and find best suitable crop for farming. With our advanced AI system, you can make informed decisions and optimize your agricultural practices.

flask generative-ai insight-generation machine-learning matplotlib mongodb nextjs numpy pandas python scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/chandkund/stock-price-prediction-

Predict Tesla stock prices using historical data. The project preprocesses data, applies normalization and standardization, trains a linear regression model, and evaluates its performance. Visualizations include box plots and time series plots.

data-science linear-regression numpy pandas plotly python sklearn sklearn-model

Last synced: 21 Jan 2026

https://github.com/davityak03/object-detection-in-images-using-yolo

This project demonstrates object detection using the YOLO (You Only Look Once) model, providing a solution for detecting and labeling multiple objects within images or video streams. This implementation covers image preprocessing, model inference, and visualization of detected objects.

computer-vision matplotlib-pyplot numpy object-detection opencv-python pandas python yolo

Last synced: 18 Jan 2026

https://github.com/harshavardh8n/machine-learning-projects

Machine Learning Projects: A collection of machine learning projects developed as part of the curriculum, utilizing libraries such as NumPy, Pandas, Seaborn, and Matplotlib. These projects demonstrate various algorithms, including Apriori and Decision Trees, showcasing practical applications and data analysis techniques.

apriori-algorithm decision-trees jupyter-notebook machine-learning numpy pandas python

Last synced: 18 Jan 2026

https://github.com/mkdirer/disaster-tweet-nlp-adaboost-transformers

A machine learning project that classifies tweets as disaster-related or non-disaster-related using transformers and the AdaBoost technique. Achieved an accuracy of approximately 80% on the validation set

geopy keras matplotlib numpy pandas python sklearn tensorflow

Last synced: 12 Apr 2026

https://github.com/lewis-morris/image_augment

Image augmentation for CNN's. Aiming to increase generalization.

augmentation image numpy python

Last synced: 18 Feb 2026

https://github.com/hmotez/medclaimml

An AI-powered machine learning application designed to process healthcare reimbursement claims. It analyzes medical documents, client information, insurance policies, and legal regulations to predict accurate reimbursement amounts, ensuring efficiency, compliance, and fraud detection.

healthcare jest-tests mern-stack mongodb nodejs nosql numpy pytorch react scikit-learn tailwindcss

Last synced: 12 Apr 2026

https://github.com/ahmedabdalkreem/lung-cancer

It help to know the patient have lung cancer or not and show what depenent in this disease that lead to this disease.

matplotlib numpy pandas python sklearn-library

Last synced: 14 Apr 2026

https://github.com/ellennewerkla/eda_students_performance

EDA em Python com Pandas, Matplotlib e Seaborn para identificar padrões de desempenho em estudantes.

matplotlib numpy pandas python3 seaborn

Last synced: 08 May 2026

https://github.com/shwetapardhi/assignment-1-q9_b--basic-statistics-level-1

Q9) Calculate Skewness, Kurtosis & draw inferences on the following data SP and Weight(WT) Use Q9_b.csv

kurtosis matplotlib-pyplot numpy pandas python seaborn skewness

Last synced: 06 May 2026

https://github.com/redinfinitypro/redditbot

Rating: (7/10) This script collects, preprocesses, trains models, processes images, and handles files, handling data from Reddit, image processing, and file handling.

autocorrect bytesio concurrent nltk numpy openpyxl pandas pil praw random re requests secrets sklearn string tensorflow time urllib

Last synced: 08 Apr 2026

https://github.com/yappy2000d/pps-ndarray

Multi-dimensional array implementation

cpp numpy

Last synced: 18 Jan 2026

https://github.com/faisal-khann/ipl-analysis

The IPL Analysis project is a comprehensive data-driven exploration of the Indian Premier League (IPL), analyzing historical match data to uncover patterns in team performance, player statistics, and match outcomes.

data-analysis exploratory-data-analysis jupyter-notebook matplotlib numpy pandas seaborn

Last synced: 08 May 2026

https://github.com/kashinathbiradar/bangalore-housing-price-prediction

The objective of the project is to create a machine learning model. We are doing a supervised learning and our aim is to do predictive analysis to predict housing price.

css deta doc html machine-learning matplotlib numpy pandas preprocessing seaborn sklearn sql

Last synced: 06 May 2026

https://github.com/yeisson8a/tallerdataanalysis

Taller de Data Analysis utilizando Python y las librerías Pandas, NumPy, Seaborn, Matplotlib para manipulación de datos de un archivo, así como la librería Statsmodels para regresiones lineales.

analisis-de-datos linear-regression matplotlib numpy pandas python3 seaborn statsmodels

Last synced: 14 Apr 2026

https://github.com/ksharma67/eda-on-ipl

In this python notebook, analysis of IPL matches from 2008 to 2020 is done using python packages like pandas, matplotlib and seaborn.

data-analysis data-science eda matplotlib numpy pandas python seaborn

Last synced: 07 May 2026

https://github.com/01one/python-online-with-pyodide

Run and edit python code including matplotlib and numpy on the browser with pyodide and codemirror

matplotlib-python numpy pyodide python-browser python-online python-online-compiler

Last synced: 29 Mar 2025

https://github.com/fuodorov/python

Python Numpy Tutorial

numpy python

Last synced: 09 Oct 2025

https://github.com/winterwind/mlp_and_cnn

A short two-part project involving using an MLP and a CNN for ECG signal data

cnn convolutional-neural-network csv-files jupyter jupyter-notebook mlp neur numpy python pytorch

Last synced: 14 Apr 2026

https://github.com/ianjure/simple-neural-network

Implementing a simple neural network using only Numpy.

deep-learning neural-network notebook numpy scratch-implementation

Last synced: 06 May 2026

https://github.com/gojoel/brockton-data-analysis

Analyzes police logs from the city of Brockton, MA and attempts predictive analysis.

data-science exploratory-data-analysis geopy machine-learning matplotlib numpy pandas seaborn

Last synced: 09 May 2026

https://github.com/s-h-i-v-i-s/bangalore-housing-price-prediction

A Machine Learning Project to predict Bangalore House Prices.

jupyter-notebook machine-learning matplotlib numpy pandas python pythonflask sklearn

Last synced: 08 Apr 2026

https://github.com/naveen88112/vehicle-routing-problem

Vehicle Routing Problem (VRP) Solver This project applies a Genetic Algorithm (GA) to optimize the Vehicle Routing Problem (VRP). This is done for more than one vehicle to ensure delivery routes that save travel distance and increase efficiency.

deap genetic-algorithm matplotlib numpy python vehicle-routing-problem

Last synced: 13 Apr 2026

https://github.com/dmarks84/coursework_project_image-text-recognition

Project for University of Michigan Python Programming Specialization -- Read in documents with images and text, and utilized CV libraries/packages to extract specific types of images and text, pairing them together

classification computer-vision image-classification numpy pandas programming python text-classification

Last synced: 14 Apr 2026

https://github.com/amritendunath/appointmate_1.0

A LangGraph-powered multi-agent/supervisor system for appointment scheduling. It enables users to interact in natural language to check doctor availability, book, reschedule, or cancel appointments.

fastapi langchain langgraph nltk numpy pandas psycopg2 python sqlalchemy uvicorn websocket

Last synced: 07 Apr 2026

https://github.com/sohitbennett/roadsafe

A Deep learning computer vision system for real-time traffic safety monitoring.

computer-vision esrgan keras numpy pandas python scikit-learn tensorflow tesseract-ocr yolov5 yolov8

Last synced: 08 Apr 2026

https://github.com/shoaib1522/introduction-to-data-science

Unlock the world of data science with hands-on tutorials, interactive notebooks, and essential resources! Whether you're a beginner or looking to refine your skills, this repo offers a comprehensive guide to mastering Python, Git, NumPy, Pandas, and more—empowering you to analyze, visualize, and transform data like a pro! 🚀📊💻

confusion-matrix github-config kaggle matplotlib numpy pandas python3 script visualization

Last synced: 14 Apr 2026

https://github.com/ianhaggerty/final-capstone

This represents the final capstone project in my HyperionDev Data Science (fundamentals) course. A dataset of Amazon customer reviews is analysed using natural language processing.

amazon data-analytics data-science data-visualization dataset matplotlib nlp nlp-machine-learning numpy pandas plotly reviews seaborn spacy tabulate textblob wordcloud

Last synced: 19 Jan 2026

https://github.com/rakesh3596/apple-phone-sales-analysis

Apple Phone Sales Analysis using Python tools like Pandas, Matplotlib, Numpy, Plotly, Apple csv file

csv-files csv-reader matplotlib numpy pandas plotly python

Last synced: 14 Apr 2026

https://github.com/cinnaavox/loan-prediction

Machine Learning project predicting loan approvals using Decision Tree Classification. Includes data cleaning, feature engineering, model evaluation and key business insights.

decision-trees matplotlib numpy pandas python scikit-learn seaborn

Last synced: 14 Apr 2026

https://github.com/farhannirzhor/python_eda_project_on_diwali_sales_analysis

In this project, I was given a sales data. I analyzed their sales data and shared a summary

data-cleaning data-manipulation exploratory-data-analysis matplotlib numpy pandas python seaborn

Last synced: 04 May 2026

https://github.com/gopikrsmscs/mycinepick-recommendation-system

MyCinePick is a user-specific movie recommendation system built using collaborative filtering techniques. This project aims to provide personalized movie suggestions tailored to each user's unique viewing preferences and habits.

collaborative-filtering movie-recommendation-app numpy python3 recommendation-system sklearn streamlit

Last synced: 14 Apr 2026

https://github.com/sabdikay/analysis-of-biodiversity

This project analyzes biodiversity data from the National Parks Service, focusing on species in various park locations. Conducted in Jupyter Notebook, it uses pandas, matplotlib, NumPy, seaborn, and chi2_contingency for analysis and visualization.

data-analysis data-analysis-python data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 14 Apr 2026

https://github.com/tnel7/data-science-yandex-practicum

Проекты из Яндекс Практикума "Специалист по Data Science"

data-science machine-learning nlp-machine-learning numpy pandas seaborn sklearn

Last synced: 19 Jan 2026

https://github.com/rahulraimau/credit-card-fraud-detection

This project develops a real-time credit card fraud detection system using the Kaggle Credit Card Fraud Detection Dataset, containing 284,807 transactions with 31 features (Time, Amount, V1-V28, Class).

imbalanced-learn- matplotlib numpy pandas python pytorch scikit-learn- tensorflow

Last synced: 08 Apr 2026

https://github.com/amiridk/statistics-and-algebra-in-python

simple and important statics operations and Famous statistical problems , also algebra operations

algebra itertools matplotlib numpy pandas python scipy seaborn static

Last synced: 14 Apr 2026

https://github.com/adi3042/diamond_price_prediction

🔮💎 Explore the Diamond Price Oracle! Predicting diamond values based on your preferences and market trends. Your ultimate tool for informed decisions in the diamond world. Join us for the future of pricing! 💎✨🚀 #DiamondPredictor

anaconda datetime diamond flask functools html ipykernel jupyter-notebook matplotlib numpy pandas price-prediction readme regressor sckiit-learn setuptools venv

Last synced: 14 Apr 2026