An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.
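As a minimal sketch of what that description means in practice (the array contents and variable names here are illustrative assumptions, not taken from any listed project), the following builds a small two-dimensional array and applies a few of NumPy's array-wide operations:

```python
import numpy as np

# A 2-D array (matrix) with 3 rows and 4 columns holding 0.0 .. 11.0.
grid = np.arange(12, dtype=np.float64).reshape(3, 4)

# Reduce along axis 0 to get one mean per column.
col_means = grid.mean(axis=0)

# Broadcasting subtracts the per-column means from every row.
centered = grid - col_means

# Matrix product of the array with its own transpose (3x4 @ 4x3 -> 3x3).
product = grid @ grid.T

print(grid.shape, col_means, product.shape)
```

Each operation works on the whole array at once, without explicit Python loops.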

https://github.com/fatimaafzaal/car_price_prediction

Trains Random Forest and Gradient Boosting models to predict car prices based on user inputs for various car attributes, evaluating models and making predictions using the best-performing model.

car-price-prediction ensemble-learning gradient-boosting matplotlib numpy pandas random-forest regression regression-models scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/als8446/tripleten-data-science-projects

Projects Overview: projects made in the Data Scientist course from TripleTen LatAm

data data-analysis hypothesis-tests machine matplotlib numpy pandas python scipy sklearn

Last synced: 10 Apr 2026

https://github.com/roaajadaa/text-classification

Classify documents using a custom naïve Bayes model

matplotlib nlp nltk numpy pandas python sklearn

Last synced: 10 Apr 2026

https://github.com/gaball1/alzheimer-prediction-ai

Alzheimer Prediction is a machine learning-based project designed to predict the likelihood of Alzheimer’s disease using clinical and imaging data. The project features an interactive web application built with Streamlit, enabling users to input key data, view predictions, and explore visualizations.

ai data-mining gradient-boosting jupyter-notebook logistic-regression logistic-regression-algorithm machine-learning matplotlib-python models numpy pandas python random-forest-classifier streamlit xgboost-classifier

Last synced: 11 Apr 2026

https://github.com/maximlevchenko/pandas-numpy-hippocampus-analysis

This project aims to analyze the differences in hippocampus size between two groups: "Unaffect" and "Affected" using pandas, numpy and matplotlib

jypyternotebook matplotlib numpy pandas pyreadr seaborn statsmodels

Last synced: 16 Apr 2026

https://github.com/adi3042/credit-card-fault-detection

🔍💳 Secure Your Finances! Detect anomalies and safeguard transactions with our Credit Card Fault Detection system. Dive into cutting-edge classification techniques to identify fraud and protect financial data. Your journey to secure payments starts here! 🚨🔒 FraudDetectionTech

classification credit-card css datetime fault-detection flask functools html ipykernel jupyternotebooks machine-learning numpy pandas python3 readme scikit-learn setuptools venv

Last synced: 03 Apr 2026

https://github.com/pramodyasahan/car-safe-predictor

This repository contains a machine learning project that applies the K-Nearest Neighbors (KNN) classification algorithm to predict car safety ratings. The project uses a dataset of cars, with features such as buying price, maintenance cost, number of doors, persons, lug boot size, and safety.

classification k-nearest-neighbours machine-learning numpy pandas scikit-learn

Last synced: 10 Apr 2026

https://github.com/amiridk/machinelearning

machine-learning-notes

matplotlib-pyplot numpy pandas pyhton

Last synced: 04 May 2026

https://github.com/alyssonmach/machine-learning-com-python

Machine Learning applications using the Python programming language.

ia keras-tensorflow machine-learning matplotlib numpy pandas programming python scikit-learn scipy

Last synced: 10 Apr 2026

https://github.com/the-developer-306/neural-style-transfer

Neural Style Transfer project leveraging TensorFlow and VGG19 to blend artistic styles with content images. Features include style and content initialization, multi-layer feature extraction, total variation loss, and post-processing enhancements like sharpening, upscaling, and brightening.

deep-learning image-processing neural-style-transfer numpy opencv-python post-processing tensorflow transfer-learning vgg19

Last synced: 01 May 2026

https://github.com/allanotieno254/pneumonia-xray-classification

This project is a Deep Learning-based Pneumonia classification system that allows medical staff to upload chest X-ray images and quickly determine whether a patient shows signs of pneumonia.

keras machine-learning numpy pandas pil pillow python streamlit streamlit-webapp

Last synced: 10 Apr 2026

https://github.com/anuppm9917/super-store-sales-analysis-power-bi-project

Driven to find out which products, regions, categories and customer segments a company should target or avoid, I searched for and selected an appropriate dataset on Kaggle that matches a standard superstore's requirements.

data data-analysis data-visualization datacleansing excel exploratory-data-analysis jupyter-notebook numpy pandas plotly powerbi python3

Last synced: 10 Apr 2026

https://github.com/rooneyrulz/cat-dog-classifier

Building a convolutional neural network model that classifies custom cat vs. dog image data well, including visualizations, data preprocessing, future predictions, model evaluation, identifying overfitting, and more.

artificial-intelligence convolutional-neural-networks data-preprocessing data-visualizations deep-learning keras matplotlib numpy pandas predictive-modeling python3 seaborn tensorflow2

Last synced: 10 Apr 2026

https://github.com/adityakumarda/kmeans-web-analytics

Built with Python, Pandas, and Scikit-learn, this machine learning project uses K-Means to cluster website users by behavior. It reveals patterns in engagement and bounce, helping drive data-informed decisions.

cluster-analysis elbow-curves elbow-method elbow-plot jupyter-notebook kmeans-clustering machine-learning matplotlib numpy pandas python python3 relationship scikit-learn seaborn sklearn

Last synced: 10 Apr 2026

https://github.com/smirnovlad/data-science-notebooks

A collection of various data analysis approaches

data-science deep-learning kaggle machine-learning numpy pandas pytorch

Last synced: 10 Apr 2026

https://github.com/vijaykumarr1452/startup_success_predictor

This project demonstrates the use of Multiple Linear Regression to predict the profits of startups based on investment in R&D, Administration, and Marketing, using the 50_Startups.csv dataset.

machine-learning multi-linear-regression numpy pandas python regression rsquare-values scikit-learn

Last synced: 10 Apr 2026

https://github.com/khaymanii/spam_mail_detection_model

This model was built using Python and the Logistic Regression algorithm

matplotlib numpy pandas python sckiit-learn

Last synced: 10 Apr 2026

https://github.com/semihbugrasezer/rockvsmine

Rock vs Mine Prediction with Python | Machine Learning Project

numpy pandas python

Last synced: 05 May 2026

https://github.com/jaweria-b/eda-basketball

The Streamlit app analyzes NBA player stats with user-selected filters, offering data download and intercorrelation heatmap.

matplotlib numpy python streamlit

Last synced: 10 Apr 2026

https://github.com/badranalyst/titanic-survival-prediction-full-data-science-project-classification

This project predicts Titanic survivors using classification models. It includes data cleaning, pre-processing, exploratory data analysis (EDA), categorical feature conversion, model building, and evaluation. Python libraries like Pandas, NumPy, Matplotlib, and Seaborn are used to analyze and predict survival outcomes.

classification data-analysis data-science eda exploratory-data-analysis machine-learning matplo matplotlib-pyplot ml model numpy pandas predictive-modeling python seaborn

Last synced: 06 May 2026

https://github.com/rama1997/lane-line-detection

Uses computer vision to detect lane lines on the road in images/videos taken from the POV of a driving vehicle

numpy opencv opencv-python python

Last synced: 10 Apr 2026

https://github.com/ahmedabdalkreem/hotel-reservation

Our task is to classify a hotel reservation as either canceled (class1) or not canceled (class0), using more than one model to arrive at the best model.

bagging decisiontreeclassifier ensemble extra-trees-classifier logistic-regression matplotlib numpy pandas python3 random-forest sklearn-library svc-model

Last synced: 11 Apr 2026

https://github.com/matx64/birads-classifier

Breast Imaging Report and Data System (BIRADS) with Graphical User Interface.

numpy pillow skimage sklearn tkinter

Last synced: 20 Apr 2026

https://github.com/asghar-rizvi/youtube-statistics-project

This project analyzes a dataset of global YouTube statistics to uncover insights about YouTube channels, their ranks, and other attributes. The dataset used for this analysis was obtained from Kaggle.

data-analysis data-analysis-python data-science data-science-projects matplotlib numpy pandas pycharm-ide python seaborn

Last synced: 13 Jun 2026

https://github.com/tsungtsetu122/datamining-cifar10-classification

Data mining project on CIFAR-10 extracted features, applying preprocessing, classification models, and evaluation techniques to improve classification performance.

matplotlib numpy pandas python scikit-learn

Last synced: 10 Apr 2026

https://github.com/hussain-7/emotion_detection-master

Human Emotion Analysis using facial expressions in real-time from webcam feed. Based on the dataset from Kaggle's Facial Emotion Recognition Challenge.

keras-tensorflow matplotlib numpy opencv-python tensorflow

Last synced: 08 May 2026

https://github.com/paulbrenker/decision-tsp

Scripts from a university data analysis project to approximate the decision TSP problem using heuristics and machine learning

data-science machine-learning numpy poetry python

Last synced: 26 May 2026

https://github.com/paraskevi-kivroglou/rl-pong-agent

A project by Paraskevi Kivroglou as part of exploring deep reinforcement learning applications.

atari atari-games gym-environment gymnasium numpy python3 pytorch q-learning reinforcement-learning reinforcement-learning-agent

Last synced: 11 Apr 2026

https://github.com/yash-rewalia/airbnb_eda_pandas

The goal of the project is to gather and analyze detailed information on the different entries in order to provide insights about the host and price of a property in a particular area according to your preference, type of rooms and number of reviews.

data data-cleaning data-insights data-preprocessing data-visualization matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/germanocastanho/python-notes

Personal collection of Python Notes, tips, and code snippets 🐍

data-science numpy pandas programming python

Last synced: 01 Sep 2025

https://github.com/lilivalgo/ml_time_series

This project uses time series data to predict corn crop yield in Colombia

matplotlib numpy pandas scipy seaborn sklearn

Last synced: 11 Apr 2026

https://github.com/pardhuu66/college-id-validator

FastAPI-based offline College ID Validator with Docker support

base64 dnn docker easyocr fastapi mobilenetv2 numpy onnx onnxruntime opencv pillow pydantic python tensorflow uvicorn

Last synced: 11 Apr 2026

https://github.com/rkarahul/face-detection-using-opencv-

Build a face detection project using OpenCV and Haar cascades, which are the better choice for real-time detection.

haar-cascade-classifier machinelearning numpy pandas-library python3 tkinter

Last synced: 08 May 2026

https://github.com/djdhairya/crop-recommendation

Crop Recommendation System is a powerful tool for enhancing agricultural decision-making. By leveraging data-driven insights, it empowers farmers to maximize yield and ensure sustainable practices.

adaboostclassifier bagging-classifier csv decision-trees gaussian html knn-classification logistic-regression machine-learning machine-learning-algorithms matplotlib model numpy pandas random-forest random-forest-classifier scikit-learn seaborn svc

Last synced: 11 Apr 2026

https://github.com/armahdavi/analytics-data-pipelines-statistics-plotting---dust-extraction-hvac-filters---phase-1

PhD Technical Paper 1 - Phase 1 - Mahdavi & Siegel (2020) (Aerosol Science & Technology; AS&T) - Sharing all the data pipelines, processing codes, descriptive statistics, statistical modellings, and plotting/visualizations - Project Milestone: 2017 - 2020 - Full-length article is available

matplotlib numpy pandas pandas-dataframe pyplot python scipy-stats sklearn

Last synced: 13 Apr 2026

https://github.com/kirtipratihar/python_libraries_for_ds

This repository serves as a comprehensive guide to Python programming for Data Science. It covers essential topics like data manipulation, data visualization, machine learning, and statistical analysis using popular libraries such as Pandas, NumPy, Matplotlib, Seaborn, and Scikit-Learn.

artificial-intelligence machine-learning numpy pandas python scikit-learn tensorflow

Last synced: 11 Apr 2026

https://github.com/alexixrugis/perceptronvisualization

Visualization of training and operation of a perceptron written from scratch in numpy

ai machine-learning numpy python

Last synced: 11 Feb 2026

https://github.com/vidushibhadana/eda-on-nyc-taxi-data

Conducting an Exploratory Data Analysis (EDA) on New York City taxi data and visualizing it through countplots, distribution plots (displot), and histograms using Python and its libraries.

data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/mrktsm/spam-email-recognizer

Long Short-Term Memory (LSTM) network trained to classify emails as spam or non-spam. It processes email content to make accurate predictions and can be integrated into projects for efficient spam detection and email management.

data-preprocessing keras lstm-neural-network model-architecture nltk numpy pandas performance-evaluation scikit-learn spam-classification-model tenserflow training-the-model

Last synced: 09 Apr 2026

https://github.com/lmizner/grokking_data_science

Coding practice for basic data science interview questions in Python

data-science numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/gregoritsch3/ml_eda_clustering_aidassessment

An EDA and Machine Learning Clustering exercise on the Country Aid Assessment dataset demonstrating the use of PCA, KMeans and DBSCAN clustering, Elbow Methods, etc. The clustering algorithm successfully demarcates countries that are in most dire need of aid based on their GDPP and Child Mortality rate.

anova dbscan kmeans machine-learning matplotlib numpy pandas pca scikit-learn seaborn statistics

Last synced: 16 Apr 2026

https://github.com/felinjob/ibm-applied-data-science-capstone

This project, part of the IBM Data Science Professional Certificate specialization, predicts the landing success of SpaceX's Falcon 9. Using data from the SpaceX API and web scraping, the project includes data analysis and Machine Learning to generate insights about the launches.

data-analysis data-science data-visualization ibm jupyter-notebook machine-learning numpy pandas python scikit-learn seaborn sql

Last synced: 11 Apr 2026

https://github.com/talapanenivarshithchowdary/asteroid-detection-ml

This project uses Machine Learning to detect and classify asteroids based on trajectory and size, aiding in Near-Earth Object detection and planetary defense.

classification data-science decision-trees jupyter-notebook knn logistic-regression machine-lea matplotlib numpy pandas pillow prediction python3 random-forest scikit-learn

Last synced: 11 Apr 2026

https://github.com/audy21/datacamp

Learning portfolio documenting my progress, while taking Data Analyst & Data Science certifications from DataCamp.

data-analysis data-science machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/arthurdsant/dataanalysis-agricultural_raw_material

This Python project performs analysis and visualization of agricultural raw material price data using a Kaggle dataset. Based on Jupyter Notebook and Python.

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 26 Jan 2026

https://github.com/mgitrov/lifespan-x-population-status

A machine learning project aiming to predict animals' lifespan and population status.

bs4 data-science machine-learning matplotlib numpy pandas python regular-expressions requests seaborn sklearn

Last synced: 11 Apr 2026

https://github.com/rahatmoktadir03/customer-churn-prediction

A machine learning project for predicting customer churn, enabling businesses to identify at-risk customers and develop retention strategies.

business-analytics customer-churn-prediction data-science google-colab machine-learning numpy pandas python sklearn strreamlit xception-model

Last synced: 11 Apr 2026

https://github.com/nachtfeuer/covid19

Python script(s) for visualizing corona data

csv json matplotlib numpy pandas python requests tkinter

Last synced: 05 May 2026

https://github.com/amanda-ucc/market-tracker

Automated portfolio creator that generates a portfolio which mimics a benchmark index by minimizing the tracking error.

beta covariance numpy pandas scipy sharpe-ratio tracking-error variance yfinance

Last synced: 11 Apr 2026

https://github.com/bunu23/image-classification

This repository contains a notebook implementing a Convolutional Neural Network for multi-class image classification using transfer learning with a pre-trained ResNet-50 model. Covers dataset handling, model architecture customization, training, evaluation, fine-tuning, and external image prediction.

keras matplotlib numpy pil python tensorflow

Last synced: 11 Apr 2026

https://github.com/udhaya2823/dataspark-illuminating-insights-for-global-electronics

✨DataSpark✨ is a powerful analytics project transforming raw retail data into actionable insights for Global Electronics. By leveraging Python, SQL, and interactive visualizations, it uncovers trends in customer behavior, sales performance, and product popularity, driving smarter business decisions and boosting growth.

data-science data-visualization database-management datacleaning exploratory-data-analysis matplotlib numpy pandas powerbi python seaborn sql version-control

Last synced: 11 Apr 2026

https://github.com/shwetapardhi/assignment-03-q1--hypothesis-testing

Q1. A F&B manager wants to determine whether there is any significant difference in the diameter of the cutlet between two units. A randomly selected sample of cutlets was collected from both units and measured. Analyze the data and draw inferences at 5% significance level. Please state the assumptions and tests that you carried out to check validit

hypothesis-testing numpy p-value pandas python scipy significance-testing stats t-test

Last synced: 11 Apr 2026

https://github.com/emhash/numpy-tutorial

Simple practice tasks for Python's NumPy library

numpy numpy-library numpy-tutorial python

Last synced: 11 Apr 2026

https://github.com/alexsolov28/ml_course

Course "Machine Learning Technology"

colab-notebooks jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn

Last synced: 05 Apr 2025

https://github.com/kkufieta/carnd-vehicle-detection

Vehicle Detection & Tracking - Self Driving Cars

cv2 hog-features numpy self-driving-car sklearn vehicle-detection

Last synced: 11 Apr 2026

https://github.com/isabelacaldeira/chutelibre

Crashing into code. Here is a physics problem about free fall solved with Python!

freefall jupyter-notebook matplotlib numpy physics physics-simulation python3

Last synced: 11 Apr 2026

https://github.com/cfbastarz/jupyternotebooks

A collection of several Jupyter notebooks.

dask matplotlib numpy python xarray xesmf

Last synced: 18 Jan 2026

https://github.com/yoavram/numpycharm

Some examples on using PyCharm with Scientific Python

cython numpy pycharm python scientific-computing training-materials

Last synced: 11 Apr 2026

https://github.com/Gregoritsch3/Exercise_Pandas_1

A Pandas exercise demonstrating the loading, cleaning and reorganization of data in .xlsx or .csv files, descriptive statistics, data visualization, statistical approximation of data with the normal distribution, etc. Libraries include Pandas, NumPy, SciPy, SymPy, Matplotlib,

data-cleaning data-visualization descriptive-statistics matplotlib numpy pandas scipy sympy

Last synced: 01 May 2025

https://github.com/mramshaw/intro-to-ml

Intro to Machine Learning - Pattern Recognition for Fun and Profit

machine-learning matplotlib ml numpy pandas pip pip3 python scikit-learn scipy seaborn seaborn-plots sklearn statsmodels tensorflow weka

Last synced: 11 Apr 2026

https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle

Independent Project - Kaggle Competition -- I worked on the obesity classification data set as part of a Kaggle Competition of the same name, scoring (for accuracy) above 0.9

classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml

Last synced: 11 Apr 2026

https://github.com/akhileshthite/india-population

ML (simple linear regression) model for predicting India's population.

machine-learning numpy pandas python scikit-learn

Last synced: 09 Apr 2026

https://github.com/biasandrade/rede-neural-do-zero-classificacao-binaria

🧠 Artificial Neural Network built from SCRATCH (NumPy only) to detect fraud in transactions | Forward/Backward Propagation | Sigmoid | Gradient Descent

backpropagation binary-classification deep-learning fraud-detection from-scratch gradient-descent machine-learning neural-network numpy python

Last synced: 18 Nov 2025

https://github.com/guru-bharadwaj20/saliency-maps

A saliency-guided autonomous driving system using cmSalGAN heatmaps, a custom Fusion model, and Faster R-CNN detection. It generates refined saliency maps, trains a saliency-aware detector, and simulates vehicle navigation using datasets like NJUD, NLPR, and STEREO.

cmsalgan numpy opencv python pytorch torchvision

Last synced: 18 Nov 2025

https://github.com/xiaohaoo/machine-learning-examples

Examples of common machine learning models using different frameworks

keras matplotlib numpy python pytorch sklearn tensorflow

Last synced: 11 Apr 2026

https://github.com/eljandoubi/trading-with-momentum

A trading strategy based on a momentum indicator.

momentum numpy pandas scipy trading trading-algorithms trading-strategies

Last synced: 05 May 2026

https://github.com/alexquar/whatisthatdog

Mobile app built using React Native that sends API requests for classifying dog breeds to a Flask API server containing a custom-built CNN made with PyTorch and trained on the Stanford dog dataset. To take a look at some examples, check out some images from my portfolio linked here.

aws ec2 expo flask gunicorn matplotlib-pyplot numpy pandas pytorch react-native seaborn typescript

Last synced: 11 Apr 2026

https://github.com/theashishmavii/intelligent-crash-detection-and-emergency-notification

Intelligent Crash Detection: A deep learning-based system that detects vehicle accidents in real-time and sends emergency alerts with GPS location.

artificial-intelligence computer-vision deep-learning geolocation geopy keras machine-learning matplotlib numpy opencv pandas python tensorflow twilio vgg16

Last synced: 11 Apr 2026

https://github.com/dhirajmerit/diabetes-prediction-models

This project implements multiple machine learning algorithms to predict diabetes diagnoses based on medical diagnostic data. The project compares the performance of various models, providing insights into which algorithms are most effective for this task.

ai machine-learning numpy pandas tensorflow

Last synced: 11 Apr 2026

https://github.com/matheusafonseca/c111

This repository is dedicated to storing and organizing the code developed in the course C111 - Análise de Dados (Data Analysis), offered by the Instituto Nacional de Telecomunicações (INATEL).

data-analysis matplotlib numpy pandas python

Last synced: 06 May 2026

https://github.com/jo-minseok/global-warming-100year

🌡️ ML predicting global temperature, sea level, Arctic ice, and carbon through 2100 [Complete]

arima-model global-warming machine-learning matplotlib numpy pandas scikit-learn seaborn

Last synced: 11 Apr 2026