An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/lmizner/codecademy_life_expectancy

Calculate quartiles, quantiles, and the inter-quartile range (IQR) for a variable

histogram jupyter-notebook matplotlib-pyplot numpy pandas python quantiles quartiles

Last synced: 09 Apr 2026

https://github.com/prashhhant213/exploratory-data-analysis-for-multinational-retail-corporation

Analysis via CLT and Visualization on Multinational Retail Corporation's data to provide insights and recommendations to improve their userbase.

math matplotlib numpy pandas python scipy-stats seaborn stats

Last synced: 09 Apr 2026

https://github.com/v41bh4vr4jput/data-analysis-with-python

This repository is a comprehensive collection of data analysis projects and tutorials using Python's most powerful libraries: NumPy, Pandas, Seaborn, and Matplotlib. It is designed to help you explore, clean, visualize, and analyze data efficiently.

api data data-analysis data-visualization matplotlib numpy pandas python sakila-db seaborn

Last synced: 09 Apr 2026

https://github.com/omdoshi13/pricing-of-laptops-using-ml

Data Analysis, training Machine Learning models, and Model Evaluation and Refinement for Pricing of Laptops dataset.

data-analysis data-analysis-project datascience google-colab jupyter-notebook machine-learning matplotlib model-evaluation model-refinement numpy pandas python scikit-learn

Last synced: 09 Apr 2026

https://github.com/prat555/python_practice

These files contain the Python exercises I completed as practice.

deep-learning numpy pyhton3 regex

Last synced: 01 Oct 2025

https://github.com/subhas-pramanik-09/mediscan-ai

A smart and scalable ML-powered health prediction system that can help detect the risk of three major diseases: Diabetes + Heart Disease + Parkinsons Disease

jupyter-notebook logistic-regression machine-learning numpy pandas scikit-learn streamlit svm-classifier

Last synced: 09 Apr 2026

https://github.com/elissorokin/data-analyst-portfolio

Это репозиторий, в котором я демонстрирую свои навыки, делюсь проектами и отслеживаю прогресс в области анализа данных и Data Science.

ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis

Last synced: 09 Apr 2026

https://github.com/nirmaldeepponnada/codeclauseinternshipproject1

This project involves Customer Segmentation using K-Means clustering to group customers based on Recency, Frequency, and Monetary (RFM) analysis from the Online Retail dataset. It also performs Sentiment Analysis on Amazon Product Reviews using Natural Language Processing techniques & Logistic Regression to classify reviews as positive or negative.

kmeans logistic-regression numpy pandas python3 regular-expressions scikit-learn tf-idf-vectorizer

Last synced: 11 Apr 2026

https://github.com/richardbmk/datascience_machinelearning

projects related with data science and machine learning projects.

data-science machine-learning matplotlib numpy pandas scikit-learn scipy seaborn

Last synced: 11 Apr 2026

https://github.com/elmezianech/email-spam-ham-classifier-nb-hard

Naive Bayes Email Classifier: An implementation of a 'hard' Naive Bayes classifier in Python to categorize emails as spam or ham. This code performs extensive data preprocessing, probability calculations, and model training for email classification using the raw Naive Bayes algorithm.

ai emails jupyter-notebook machine-learning ml naive-bayes naive-bayes-classifier numpy pandas python spam-detection spam-filtering

Last synced: 11 Apr 2026

https://github.com/thorbeorn/epsi-m1-datamanagement-etude-de-cas

Ce projet implémente une chaîne complète de traitement de données RH après fusion d’entreprise

dashboard duckdb epsi etl kaleido numpy pandas plotly python pyyaml

Last synced: 11 Apr 2026

https://github.com/pvlbzn/datascience

Learning data science 🤓

data-science numpy pandas

Last synced: 09 May 2026

https://github.com/batthulavinay/which-anime-dominates-the-world

This Kaggle notebook analyzes the global popularity of anime across different countries. Using data visualization and statistical insights, we explore which anime dominate in various regions and uncover trends in global anime consumption.

data-visualization exploratory-data-analysis matplotlib numpy pandas python seaborn

Last synced: 24 Mar 2025

https://github.com/fazal-coder/data-science-business-analytics-tsf

I have completed two tasks in this repository , task 3 and task 4

matplotlib numpy pandas python scipy

Last synced: 09 Apr 2026

https://github.com/aaaastark/textclassification-nlp-project

Dataset preparation, Feature Engineering, Model Building, Summary Statistics, Data Exploration by Visualizations, and Save File CSV

matplotlib nltk numpy pandas python seaborn sklearn textblob wordcloud

Last synced: 11 Apr 2026

https://github.com/hariprasath-v/av-dataverse-hack---insurance-claim-prediction

Create a machine learning model to predict if the policyholder will file a claim in the next 6 months or not based on the set of car and policy features.

analyticsvidhya classification exploratory-data-analysis f1-score matplotlib numpy pandas python randomforest-classification scikit-learn seaborn shap

Last synced: 11 Apr 2026

https://github.com/lorenzorottigni/dl-lending-club

Deep Learning python bootcamp: deep learning on Lending Club dataset

deep-learning ipynb keras machine-learning numpy pandas python scikit-learn seaborn tensorflow

Last synced: 11 Apr 2026

https://github.com/nikhil-donthusaram/heartdiseaseprediction

Heart Disease Prediction App is a machine learning web application that predicts the likelihood of heart disease based on user medical inputs. Built using a Decision Tree Classifier and deployed with Streamlit for an interactive, user-friendly interface.

data-analysis descision-tree joblib jupyter-notebook machine-learning matplotlib numpy pandas python3 seaborn sklearn streamlit vscode

Last synced: 11 Apr 2026

https://github.com/jagroop2001/learning-numpy

Learning NumPy complete the exercise

numpy pandas python

Last synced: 16 May 2025

https://github.com/christabelsakyi/employee_attrition_prediction

This project implements a machine learning pipeline for predicting employee attrition using HR data. The system analyzes various factors that contribute to employee turnover and provides predictions to help HR departments take preventive measures.

machine-learning numpy pandas plotly python sklearn streamlit

Last synced: 09 Apr 2026

https://github.com/pb319/california_house-price-prediction

This is going to be my first end to end ML project implementation covering all required stages taking guidence from book called "Hands On Machine Learning".

evaluation-metrics hyperparameter-tuning jupyter-notebook kfold-cross-validation machine-learning matplotlib numpy pandas python scikit-learn seaborn train-test-split

Last synced: 11 Apr 2026

https://github.com/yelypk/nasa-farm-navigators

NASA Space Apps Challenge Hackathon

fastapi javascript numpy phaser postgresql python railway

Last synced: 11 Apr 2026

https://github.com/junaidsumsaal/advanceparkingsystem

Its a web application that integrates AI-powered parking spot predictions, real-time notifications, and role-based dashboards for Drivers, Attendants, and Providers. This system is designed to streamline parking operations, enhance user experience, and leverage machine learning for future availability predictions.

axios-react chakra-ui-react data-science met metrics ml numpy pand python3 react sklearn tailwindcss typescript

Last synced: 11 Apr 2026

https://github.com/kersh1337228/tradenv

Trading strategies testing environment based on open stock market data

django django-rest-framework numpy pandas python react stock-market typescript

Last synced: 11 Apr 2026

https://github.com/moustafamohamed01/car-price-prediction

Car Price Prediction using Linear Regression A machine learning model that predicts car prices based on features like name, company, year, kilometers driven, and fuel type. Built with Linear Regression and deployed as a Streamlit web app.

data-science jupyter-notebook linear-regression machine-learning numpy pandas python sickit-learn sklearn streamlit

Last synced: 11 Apr 2026

https://github.com/vibhore-sagar/stock-movement-analysis-based-on-social-media-sentiment

This project aims to develop a machine learning model that predicts stock movements by analyzing sentiment in social media discussions (Reddit posts).

joblib numpy pandas praw python scikit-learn textblob-sentiment-analysis yfinance

Last synced: 09 Apr 2026

https://github.com/reubenmathew/docscanner

📸 Document Scanner using an OpenCV Edge Detection Engine

flask numpy opencv python

Last synced: 11 Apr 2026

https://github.com/kaguya163/marketing_campaigns

Анализ маркетинговой эффективности в спортивном ритейле.

ab-testing machine-learning matplotlib numpy pandas python scikit-learn scipy sqlite3

Last synced: 11 Apr 2026

https://github.com/nauxqouh/python-for-data-science-labs

This repo contains weekly practical codes for my Python for Data Science course at university.

data-science jupyter-notebook numpy pandas python pytorch scikit-learn

Last synced: 11 Apr 2026

https://github.com/vansh-py04/convolution-net-from-scratch

A minimal NumPy-based implementation of a 3-layer convolutional neural network (CNN) from scratch — including custom forward and backward passes for conv, ReLU, pooling, affine, and softmax layers. Perfect for learning how CNNs actually work under the hood.

backpropagation cnn computer-vision convolutional-neural-networks cs231n cs231n-assignment deep-learning from-scratch from-scratch-in-python fully-connected-network machine-learning maxpool2d nueral-networks numpy softmax stanford-deep-learning

Last synced: 24 Apr 2026

https://github.com/dyarleniber/hands-on-machine-learning

This repository contains code examples, exercises, and projects related to the concepts covered in the book "Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 3rd Edition" by Aurélien Géron.

ai artificial-intelligence keras machine-learning matpolotlib numpy pandas scikit-learn tensorflow

Last synced: 11 Apr 2026

https://github.com/arserchung/ml_cnn

Basis CNN intro& project of recognize to cat or dog

glob keras keras-models numpy opencv os sklearn tensorflow

Last synced: 13 Feb 2026

https://github.com/sudarsann27/basic_machine_learning_algorithms

Basic Machine learning algorithms using scikit-learn and other fundamental libraries

data-science data-visualization ensemble-model kaggle numpy pandas scikit-learn supervised-machine-learning

Last synced: 20 Jan 2026

https://github.com/stella4444/linear-regression

learning about linear regression (currently a work in progress) ~ working with data

linear-regression machine-learning numpy scikit-learn

Last synced: 20 Jan 2026

https://github.com/abdulsamie10/pythonbasics

This repository contains few tasks, which I developed just to get a strong grip on Python Programming Langauge.

ai labtasks lambda numpy python pythonlab

Last synced: 20 Apr 2026

https://github.com/rutujaingole/jailbreaking-deep-models

This repository contains the codebase for Jailbreaking Deep Models, which investigates the vulnerability of deep convolutional neural networks to adversarial attacks. The project systematically implements and analyzes Fast Gradient Sign Method (FGSM), Projected Gradient Descent (PGD), and localized patch-based attacks on the pretrained

adversarial-attacks deep-learning densenet121 fgsm-attack imagenet-classifier jailbreak machine-learning numpy patch-based-attack pgd-adversarial-attacks torch

Last synced: 14 May 2026

https://github.com/coderjolly/employee-information-retrieval-system

An information retrieval system constructed using Flask to perform ETL operations while ingesting New York Payroll data and provide a frontend user interface for manipulating filters and queries.

etl flask mvc numpy pandas python sqlite3

Last synced: 11 Apr 2026

https://github.com/omarsaad21/shopping-cart-eda

An EDA python project focusing on getting the most out of a movies dataset (- 1- combine this data together - 2- check messing values - 3- show summary statistics - 4- deal with date time and extract features from dates - 5- answer at least three questions from this data)

eda jupyter-notebook numpy pandas plotly python

Last synced: 13 Apr 2026

https://github.com/mobinabatebi/electricity_consumption_project

The project aimed at estimating electricity consumption in administrative offices, along with analyzing the impacts of such consumption on power outages, is critical for addressing the pressing issue of summer electricity shortages.

jupyter-notebook numpy pandas python

Last synced: 28 Sep 2025

https://github.com/dpgitaccount/project---hospital-readmission-analysis

The goal of this project is to build a predictive model to estimate the likelihood of a hospital readmission based on patient data. By identifying factors that contribute to readmissions, hospitals can optimize care and reduce costs associated with repeated visits.

boxplot confusion-matrix datamodeling exploratory-data-analysis heatmap histplot numpy pandas plotly python random-forest seaborn smote-sampling visualization

Last synced: 16 Apr 2026

https://github.com/adi3042/sensor-fault-detection

🔍⚙️ Ensure Reliable Operations! Detect anomalies and prevent disruptions with our Sensor Fault Detection system. Explore advanced classification and regression techniques to identify and address sensor faults effectively. Your path to robust and accurate sensor data begins here! 🚨🔧 SensorFaultTech

classification css datetime fault-detection flask functools html ipykernel jupternotebook machine-learning numpy pandas python3 readme regression scikit-learn sensor setuptools venv

Last synced: 11 Apr 2026

https://github.com/tanmay-312/pacman-game-python

"Embark on a nostalgic journey with my Python Pac-Man game! 🎮🕹️ Crafted using pygame, tcod, numpy, and enum libraries, it's a modern twist on a classic. Navigate the maze, chomp pellets, and evade ghosts in this retro-inspired adventure! 🚀👻 #PythonGameDev #PacManRevival #ClassicGaming"

enum game game-development gamedev numpy pygame pygame-games python python-3 python3 tcod

Last synced: 22 Apr 2026

https://github.com/lorenzorottigni/ml-spam-filter

Machine Learning python bootcamp: natural language processing with spam filter purposes

machine-learning npl numpy pandas python scikit-learn seaborn spam-filtering

Last synced: 11 Apr 2026

https://github.com/csengupta1101/housing-price-prediction

Based on given features , this project deals with the prediction of house prices. From data extraction till deployment , everything is detailed in here.

jupyter-notebook matplotlib numpy pandas python python3 scikit-learn vscode

Last synced: 11 Apr 2026

https://github.com/alphan26/breastcancerclassification

This project involves a binary classification problem where we applied PyTorch to detect breast cancer.

matplotlib numpy pandas python pytorch

Last synced: 11 Apr 2026

https://github.com/hmourad2023/tweets_sentiments_emotions_analysis

Les réseaux sociaux, et en particulier Twitter, sont des plateformes essentielles où les utilisateurs expriment leurs sentiments et émotions au quotidien. Analyser les tweets peut fournir des informations précieuses sur l’humeur collective et les préoccupations des gens. Ce projet utilise un dataset de tweets disponible sur Kaggle

nltk numpy pandas sklearn

Last synced: 01 Aug 2025

https://github.com/npodlozhniy/dl-tinkoff

This is a warehouse of my homeworks for master MIPT deep learning course from Tinkoff FinTech

deep-learning numpy pytorch

Last synced: 08 May 2026

https://github.com/muhammad-rebaal/eda-with-python-and-pandas

Explore, Analyze, and Visualize Data with Python and Pandas: Dive into the world of data analysis and visualization using Python and Pandas with this repository. Discover valuable insights, clean and preprocess your data, and create stunning visualizations to make data-driven decisions.

calmap matplotlib numpy pandas python python3 seaborn

Last synced: 07 May 2026

https://github.com/frauvate/cheatsheet

This project provides ready-to-use templates for machine learning base models. This repo, which contains templates of common models in Jupyter notebook format, aims to help users quickly implement these models and easily adapt them to their own projects.

machine-learning machinelearning matplotlib numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/psycho-poodle/simple_rnn_imdb

This repository contains a simple web application for sentiment analysis of movie reviews using a pre-trained RNN model. The application is built with TensorFlow and Streamlit, making it easy to use and deploy.

numpy python streamlit tensorflow

Last synced: 11 Apr 2026

https://github.com/nishantksingh0/twitter-sentiment-analysis

A ML/DL based Twitter sentiment analysis model based on Transformer based pretrained Encoding on sentence

kaggle-dataset matplotlib-pyplot nlp numpy pandas sklearn tensorflow transformer

Last synced: 11 Apr 2026

https://github.com/dane-meister/machine-learning-algos

Repository of several ML algorithms implemented for different environments and robot control vectors.

colab-notebook gymnasium jupyter-notebook machine-learning-algorithms numpy opencv pybullet python pytorch

Last synced: 11 Apr 2026

https://github.com/blacksujit/neural_network_using_numpy_and_math

Inspired from a repo . i have implemented an some ground level maths and logic . In this neural network no tensorflow , pytorch is used it is just prepared by using numpy and some logical math.

algorithmic-composition basics-of-python calculus criticalthinking innovation machine-learning maths neural-network neuralnetworkusingnumpy numpy purebasics

Last synced: 26 Aug 2025

https://github.com/aadrianleo/book-recommendation-system

Book Recommender System using the Book-Crossing dataset. Compares content-based (TF-IDF + cosine similarity) and collaborative filtering (SVD) methods for book recommendations. Includes data cleaning, EDA, and model evaluation (Precision@5, RMSE) in Python.

artificial-intelligence collaborative-filtering content-based-filtering cosine-similarity data-preprocessing jupyter jupyter-notebook machine-learning matrix-factorization numpy pandas predictive-modeling python3 recommendation-system recommender-system scikit-learn sckit-surprise

Last synced: 10 Apr 2026

https://github.com/nickklos10/league-of-legends-match-predictor

Predicting League of Legends Match Outcomes Using Logistic Regression and PyTorch.

jupyter-notebook matplotlib numpy pandas pytorch scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/doondigangadhar30/ghiblify-animeganv3

A creative tool that generates beautiful Ghibli-style artwork from your image input.

colab-notebook jupyter-notebook numpy onnxruntime opencv python

Last synced: 01 Jul 2025

https://github.com/yahiazakaria445/sentiment-analysis-using-lstm

A Sentiment analysis NLP project using LSTM model

matplotlib nltk numpy pandas scikit-learn seaborn tensorflow

Last synced: 11 Apr 2026

https://github.com/tigureis/aula_deployment

This project not only demonstrates a machine learning pipeline but also serves as a template for deploying models into production. By leveraging the project's structure and code, you can streamline the process of deploying your own machine learning models.

numpy pandas pickle sklearn

Last synced: 11 Apr 2026

https://github.com/alaazameldev/text-based-search-engine

Implementation of a search engine using TF-IDF and Word Embedding-based vectorization techniques for efficient document retrieval

chromadb fastapi gensim-word2vec nltk numpy precision-recall python scikit-learn tf-idf-vectorizer

Last synced: 20 Jan 2026

https://github.com/dmarks84/coursework_project_ml-classification

Project for IBM Data Science course on Machine Learning -- Trained ML models for classification, evaluating based on a variety of metrics

classification communication data-modeling dataframes numpy pandas python scikit-learn supervised-ml

Last synced: 11 Apr 2026

https://github.com/2100031988/programming_for_aiml_codes

All of my descriptive code are published here and is updated everyday as I learn new concepts

dataset numpy pandas pipeline python visual-studio visualization

Last synced: 28 May 2026

https://github.com/jaygaha/ai-mastery-roadmap

A collaborative AI learning roadmap. I'm documenting my journey from grasping the fundamentals of Python to mastering Machine Learning. Feel free to fork and contribute — let's build our AI skills together!

artificial-intelligence beginners-guide data-science deep-learning education numpy python python-programming self-taught tutorial

Last synced: 18 Jan 2026

https://github.com/collins-kimotho/wrangle-and-analyze-project

Data Wrangling and Analysis Project: Analyzing WeRateDogs Twitter Account Data

explanatory-data-analysis exploratory-data-analysis matplotlib numpy pandas python twitter twitter-api

Last synced: 11 May 2026

https://github.com/lc-rezende/eqx_boston_dataset

Exploratory data analysis, clustering, and forecasting on Boston crime data (2011-2015), revealing key crime trends, hotspots, and temporal patterns to support data-driven insights for urban safety and policing strategies.

data-analysis exploratory-data-analysis jupyter-notebook kmeans matplotlib numpy pandas prophet-facebook python scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/magnitopic/ft-linear-regression

Your first implementation of a machine learning algorithm. Predicting the price of a car by it's milage

42school ai ai-algorithm ft-linear-regression linear-regression machine-learning matplotlib numpy pandas prediction-algorithm python

Last synced: 16 Apr 2026

https://github.com/shahzaibdev355/house-recommendation-backend

House Recommendation Backend is a Flask-based API that provides house recommendations based on user input. It includes endpoints to check the backend's functionality, fetch state data, and get personalized house recommendations based on criteria like price, property size, number of beds, number of baths, and state.

flask flask-api flask-cors numpy pandas scikit-learn tensorflow

Last synced: 05 Jan 2026

https://github.com/manishkumarpatel07/heartattack_risk_prediction

"Heart Attack Risk Prediction" uses machine learning to estimate the likelihood of a heart attack based on user-provided data like physical attributes, symptoms, and medical history. This system enables remote screening, identifying high-risk individuals, and easing medical system burdens by providing early, data-driven health risk assessments.

boruta knn-algorithm matplotlib numpy pandas python scikit-learn

Last synced: 09 Apr 2026

https://github.com/bhuvilol/objdetect_img

Object Detecttion from a image input

numpy opencv python3 pytorch yolov8

Last synced: 04 Feb 2026

https://github.com/niteshchawla/yulu-hypothesistesting

Yulu has recently suffered considerable dips in its revenues. They have contracted a consulting company to understand the factors on which the demand for these shared electric cycles depends. Specifically, they want to understand the factors affecting the demand for these shared electric cycles in the Indian market.

hypothesis-testing matplot numpy pandas-library scipy-stats seaborn

Last synced: 20 Apr 2026

https://github.com/rezapace/number-generator

Proyek ini adalah sebuah program yang digunakan untuk menghasilkan deretan angka dengan langkah tertentu dan menambahkan teks setelah setiap angka. Program ini mendukung berbagai metode untuk menghasilkan angka, termasuk penggunaan CuPy untuk memanfaatkan GPU, serta penggunaan threading untuk meningkatkan kinerja.

colab cupy custom generator jupiter kumal link number numpy pyhon webkumal webkumal-projek

Last synced: 11 Apr 2026

https://github.com/iwitaly/aws-lambda-docker-builder

Docker image for building packages for AWS Lambda with Python environment

aws aws-lambda docker docker-image numpy python python3

Last synced: 09 Apr 2026

https://github.com/gustavoaca1997/placa-orificio-velocidad

script en python3 para obtener la velocidad de una placa orificio mediante tanteos

chemistry numerical-calculations numerical-methods numpy python

Last synced: 01 May 2026

https://github.com/vishal-verma-96/Honey_Production_Analysis_using_Python

In this project, I analyzed U.S. honey production from 1998 to 2021 across different U.S. cities, explored yields, and examined production values using Matplotlib and Seaborn visualizations.

matplotlib numpy pandas python-3 python-visualization seaborn

Last synced: 19 Apr 2025

https://github.com/beberiche/jeon-woo-chi

지역 축제를 기반으로 하는, 게이미피케이션을 접목시킨 지역 상권 맞춤 추천 서비스

aws django docker elasticsearch grafana jenkins kibana logstash numpy pandas phaserjs prometheus react react-query recoil redis spring-boot tensorflow typescript zipkin

Last synced: 10 Mar 2026

https://github.com/pakagronglb/istanbul-restaurant-tips-prediction

This project focuses on predicting restaurant tips in Istanbul using machine learning techniques, specifically Linear Regression. The analysis includes data preprocessing, exploratory data analysis (EDA), model training, and evaluation.

kaggle matplotlib numpy pandas prediction python scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/shudhanshusaurabh001/super_market-data-analysis-using-python

This project focuses on analyzing supermarket sales data using Python. The goal is to extract meaningful insights from the dataset, such as sales trends, customer purchasing behavior, and product performance.

analysis csv data insights matplotlib numpy pandas project python seaborn

Last synced: 06 Apr 2026

https://github.com/lefteris-souflas/modern-slavery-analysis

Jupyter notebook using machine learning techniques to explore the complex drivers of modern slavery. Models from a research paper are replicated and evaluated . Actions also include filling missing data, training regression models, and analyzing feature importance.

decision-tree feature-importance grid-search-cv imputation jupyter-notebook lasso-regression linear-regression matplotlib mean-absolute-error numpy pandas preprocessing principal-component-analysis python3 random-forest ridge-regression scikit-learn seaborn

Last synced: 09 Apr 2026