An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/niteshchawla/aerofit-casestudy

The market research team at AeroFit wants to identify the characteristics of the target audience for each type of treadmill offered by the company, to provide a better recommendation of the treadmills to the new customers. The team decides to investigate whether there are differences across the product with respect to customer characteristics.

matplotlib-pyplot numpy pandas probability python seaborn statistics

Last synced: 09 Apr 2026

https://github.com/mrmalik2512/catsvsdog.github.io

A CNN model integrated with flask backend the project is trained on image data of dogs and cats and integrated with a website predicts the given image is dog or a cat

deep-learning numpy python scikit-learn tensorflow

Last synced: 09 Apr 2026

https://github.com/lefteris-souflas/modern-slavery-analysis

Jupyter notebook using machine learning techniques to explore the complex drivers of modern slavery. Models from a research paper are replicated and evaluated . Actions also include filling missing data, training regression models, and analyzing feature importance.

decision-tree feature-importance grid-search-cv imputation jupyter-notebook lasso-regression linear-regression matplotlib mean-absolute-error numpy pandas preprocessing principal-component-analysis python3 random-forest ridge-regression scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/karthikarajagopal44/data-analysis-using-python-libraries-

The COVID-19 pandemic has significantly impacted India, necessitating a detailed analysis of the virus’s spread within the country. In this project, we explore an India-specific COVID-19 dataset, leveraging Python libraries such as Pandas, NumPy, Matplotlib, and Seaborn.

data-cleaning data-visualization matplotlib numpy pandas python python3 scikit-learn seaborn

Last synced: 07 Apr 2026

https://github.com/wbadart/simplegcn

Basic implementation of a Graph Convolutional Network

data-science gcn neural-network numpy

Last synced: 18 May 2026

https://github.com/gdg-a-coruna/inteligencia-artificial-en-la-deteccion-del-cancer

Repositorio con recursos sobre la charla Build with AI: Inteligencia Artificial en la Detección del Cáncer

ann breast-cancer breast-cancer-prediction breast-cancer-wisconsin keras lasso matplotlib ml numpy pandas python r svm tensorflow

Last synced: 17 Feb 2026

https://github.com/chernyakid/russian-film-distribution

Исследование российского кинопроката

jupyter numpy pandas python seaborn

Last synced: 15 May 2026

https://github.com/ourway/simple-cnn

A simple CNN implementation using numpy

convolutional-neural-networks numpy

Last synced: 10 May 2026

https://github.com/gitluffywd/advanced_video_stabilization_and_roi_pixel_analysis

This repository offers tools for advanced video stabilization and pixel analysis, enhancing your video processing capabilities. Explore features like optical flow tracking and ROI selection to dive deeper into your video data. 🛠️🎥

data-structures debugging english-language git github html markdown numpy opencv project-management python

Last synced: 20 Jan 2026

https://github.com/ovsidee/psm

Fundamentals of Mathematical Simulations using Python

numpy python

Last synced: 29 Apr 2026

https://github.com/alejoduarte23/fast_fdd

Fast implementation of frequency domain decomposition (FDD) in python with multiple identification techniques

numpy scipy-signal

Last synced: 10 May 2026

https://github.com/manishkumarpatel07/heartattack_risk_prediction

"Heart Attack Risk Prediction" uses machine learning to estimate the likelihood of a heart attack based on user-provided data like physical attributes, symptoms, and medical history. This system enables remote screening, identifying high-risk individuals, and easing medical system burdens by providing early, data-driven health risk assessments.

boruta knn-algorithm matplotlib numpy pandas python scikit-learn

Last synced: 09 Apr 2026

https://github.com/L-Gre/TradeFX-AI

A Python-based project exploring algorithmic trading strategies, including backtesting, real-time data integration, and predictive modeling with TensorFlow and Keras. Key topics include technical indicators, risk management, and leveraging AWS and broker APIs for automated trading.

algorithmic-trading aws backtesting broker-api keras machine-learning matplotlib numpy pandas portfolio-optimisation python real-time-data-integration risk-management technical-indicators tensorflow

Last synced: 26 Sep 2025

https://github.com/muzahid0408/student_marks_analysis_using_numpy

This project analyzes a student marks dataset to calculate key statistical measures such as mean, median, standard deviation, and more.

datasets jupyter-notebook numpy numpy-arrays pythoh

Last synced: 04 May 2026

https://github.com/paul-bokelman/nn

Basic neural network in python

machine-learning neural-networks numpy python

Last synced: 09 May 2026

https://github.com/exclowd/gg

A library for creating terminal based games in python.

numpy python

Last synced: 18 May 2026

https://github.com/ireneflorez/sf-kindergarten-lottery

Data analysis reviewing data and findings on the public school kindergarten lottery in San Francisco

ipynb-jupyter-notebook jupyter-notebook numpy pandas python sql

Last synced: 17 Feb 2026

https://github.com/magnitopic/ft-linear-regression

Your first implementation of a machine learning algorithm. Predicting the price of a car by it's milage

42school ai ai-algorithm ft-linear-regression linear-regression machine-learning matplotlib numpy pandas prediction-algorithm python

Last synced: 16 Apr 2026

https://github.com/lorenzorottigni/ml-missing-values

Machine Learning course of Piero Savastano 5: ColumnTransformer, SimpleImputer, numpy

column-transformer machine-learning missing-values numpy simple-imputer

Last synced: 05 May 2026

https://github.com/harmanveer-2546/house-price-prediction-

We all have experienced a time when we have to look up for a new house to buy. But then the journey begins with a lot of frauds, negotiating deals, researching the local areas and so on. So to deal with this kind of issues Today, I prepared a MACHINE LEARNING Based model, trained on the House Price Prediction Dataset.

catboost-classifier linear-regression machine-learning matplotlib numpy pandas python random-forest seaborn svm

Last synced: 09 Apr 2026

https://github.com/lin826/nanogpt-demo

Training and finetuning local GPTs.

gpt nanogpt numpy pytorch tqdm transformers

Last synced: 05 May 2026

https://github.com/thekartikeyamishra/object-detection-for-retailer

The AI Customer Feedback Summarizer is a Python-based tool that uses extractive summarization techniques to provide concise summaries of customer feedback.

ai computer-vision cv2 numpy pillow python torch torchvision yolov5

Last synced: 05 May 2026

https://github.com/ashishsingh789/bcg_virtual_internship

This repository showcases my BCG X virtual internship project on customer churn analysis for PowerCo, covering business understanding, EDA, feature engineering, and modeling using Python and machine learning.

data-manipulation data-science dataanalysis datavisualization eda machine-learning matplotlib numpy pandas python random-forest scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/kasraskari/tumor-predict

Streamlit app for predicting tumor malignancy using logistic regression.

logistic-regression machine-learning numpy pandas python scikit-learn streamlit tumor-detection

Last synced: 09 Apr 2026

https://github.com/kiok46/subset-sum-problem

Solving the Subset Sum Problem using Python, Pandas and Numpy.

numpy pandas python subset-sum

Last synced: 05 May 2026

https://github.com/bagusperdanay7/absa-with-bilstm-undergraduate-thesis

My undergraduate thesis program, Aspect-Based Sentiment Analysis Towards Matket Place Application Review Using Bidirectional Long Short-Term Memory used Python, Keras and Tensorflow

ai aspect-based-sentiment-analysis bilstm deep-learning gensim imbalanced-learning ipython-notebook keras machine-learning matplotlib natural-language-processing nltk numpy pandas python scikit-learn seaborn tensorflow

Last synced: 11 Apr 2026

https://github.com/iv4n-ga6l/GenderDetection

Gender detection using gender classification model

genderclassification genderdetection numpy pil python resnet18 torch torchvision

Last synced: 28 Apr 2025

https://github.com/anselmoo/useful-math-functions

useful-math-functions for Optimization, Benchmarking, Visualizing, and more ...

basic functions math mathematics numpy

Last synced: 17 Mar 2025

https://github.com/mogankumar/covid-19-classification

A deep learning-based project that classifies chest X-ray images into Normal, COVID-19, and Viral Pneumonia using a two-layer CNN architecture. Achieves high accuracy and demonstrates potential for aiding in early diagnosis and treatment of COVID-19.

keras-tensorflow matplotlib numpy python3

Last synced: 11 May 2026

https://github.com/themysterysolver/python_basics

"This repo includes simple Python programs for beginners, covering various concepts, object-oriented programming, and different module usages, to help with understanding and learning Python."

basic-python-syntax collections dsa matplotlib numpy oops-in-python pandas python re scipy

Last synced: 05 Apr 2026

https://github.com/rakibhhridoy/appliedmachinelearninghousing-regression

Let's take the Housing dataset which contains information about different houses in Boston. This data was originally a part of UCI Machine Learning Repository and has been removed now. We can also access this data from the scikit-learn library. The objective is to predict the value of prices of the house using the given features.

deep-learning housing-market housing-prices machine-learning numpy pandas python real-estate regression scikit-learn

Last synced: 05 Apr 2026

https://github.com/prajakta1321/authencheck

Amdocs Gen AI Graduate Hackathon 2024-25- A comprehensive fact-checking and misinformation detection system that leverages cutting-edge AI models and multiple news sources to verify information circulating on social media

api bert-fine-tuning flask-application matplotlib ngrok-server nlp nlp-machine-learning numpy pandas python3 scikit-learn seaborn wandb

Last synced: 05 Apr 2026

https://github.com/offthetab/statistics_homework

Практические работы по статистике МИРЭА 3 курс.

numpy pandas python statsmodels

Last synced: 05 Apr 2026

https://github.com/abolfazl-younesi/levy-walk

This repository contains a Python implementation of the Lévy Walk simulation, a random walk model with a power-law step length distribution. The code simulates multiple walkers with boundary conditions and visualizes their trajectories for different values of the power-law exponent.

levy-walks matplotlib numpy python random-walk

Last synced: 02 May 2026

https://github.com/tigureis/data-preparation-from-kickstarter-campaigns

Kickstarter Data Prep: A hands-on guide to basic data cleaning and transformation.

data-cleaning data-construction data-integration data-science data-selection numpy pandas

Last synced: 19 Apr 2026

https://github.com/ondiekelijah/data-analysis-udacity

Udacity Data Analysis Nano Degree course resources

jupyter-notebook matplotlib numpy pandas python

Last synced: 17 Mar 2025

https://github.com/clchinkc/zombie

Personal project, Python, NumPy, Matplotlib, Pygame, Scikit-learn, TensorFlow, Docker

algorithms data-analysis docker machine-learning matplotlib numpy pygame python sklearn tensorflow zombie-simulation

Last synced: 05 Apr 2026

https://github.com/niteshchawla/delhivery-featureengineering

The Data team builds intelligence and capabilities using this data that helps them to widen the gap between the quality, efficiency, and profitability of their business versus their competitors.

feature-engineering hypothesis-testing matplotlib-pyplot numpy pandas-python scipy-stats seaborn

Last synced: 14 May 2026

https://github.com/sejalmankar1012/customer_churn

The goal of this project was to develop a machine learning model that predicts customer churn based on historical customer data.

dataset deep-learning deployment machine-learning model-building modeling numpy optimization pandas seaborn streamlit tensorflow

Last synced: 05 Apr 2026

https://github.com/alphacrypto246/grape-quality-prediction

The Grape Quality Prediction project uses machine learning to predict the quality of grapes based on chemical properties like acidity, sugar content, and alcohol levels. It applies regression models to forecast the quality score, helping in wine production and quality assessment.

machine-learning numpy pandas scikit-learn scikitlearn-machine-learning

Last synced: 19 Apr 2026

https://github.com/stephnna/my_image_classifier

A deep learning-based image classifier built with PyTorch, designed to identify various flower species using architectures like VGG16, ResNet101, and AlexNet. This project leverages transfer learning for improved performance, checkpointing for seamless training, and supports both CPU and GPU devices. Perfect for exploring deep learning models and i

matplotlib numpy python pytorch

Last synced: 11 May 2026

https://github.com/cr00z/dlschool_adv

Репозиторий для сдачи домашних заданий на первом семестре продвинутого потока курса "Deep Learning" Школы глубокого обучения ФПМИ МФТИ

matplotlib numpy pandas python pytorch sklearn

Last synced: 02 Apr 2026

https://github.com/klaudiozdrava/image-analysis

A Python project that was developed as a university assignment and the goal is to colorize an grayscale image using machine learning techniques.

colorization image-processing kmeans-clustering numpy svm

Last synced: 16 May 2026

https://github.com/stkisengese/numpy-data-fundamentals

A comprehensive collection of NumPy exercises covering array manipulation, slicing, broadcasting, random data generation, and real-world data analysis applications.

data data-analysis numpy pre-processing

Last synced: 16 May 2026

https://github.com/abdelrahman-lila/seam-carving

Seam Carving for Content Aware Image Resizing Project - Computer Vision Lab

numba-jit numpy python

Last synced: 10 Jul 2025

https://github.com/ohyeasabhi/signature-recognizer

Este projeto desenvolve um sistema para verificar a similaridade de assinaturas usando redes neurais. Ele utiliza o Triplet Loss para criar embeddings, focando na avaliação biométrica com o dataset CEDAR. 🖊️🌐

backpropogation-neural-network computer-vision django image-recognition jupyter-notebook keras machine-learning-projects neural-network numpy one-shot-learning python-project siamese-neural-network signature signature-recogniton tensorflow type voice-recognition windows

Last synced: 08 Apr 2026

https://github.com/danielpancake/mlml

my little machine learning library written in NumPy

machine-learning numpy

Last synced: 05 Apr 2025

https://github.com/bishopce16/surfs_up

An analysis of weather data to determine opening of an ice cream and surf shop using SQLite, SQLAlchemy, and Python.

matplotlib numpy pandas python sqlalchemy sqlite weather

Last synced: 02 Apr 2026

https://github.com/themihirmathur/machine-failure-predictor

The 'Machine Failure Predictor' is a web application designed to enable predictive maintenance for industrial equipment. Leveraging machine learning techniques, this application predicts whether equipment will fail and identifies the type of failure based on key process parameters.

cicd digitalocean docker fastapi flask github github-actions machine-learning matplotlib mlops numpy plotly python randomforest scikit-learn seaborn streamlit

Last synced: 10 Apr 2026

https://github.com/tberchanov/clustering-k-means

Sample program where is implemented clustering by K-means algorithm, and its visualisation.

clustering-algorithm kmeans-clustering matplotlib numpy python

Last synced: 16 May 2026

https://github.com/vipinchaudhary31122002/cardiopredict

CardioPredict is a smart, ML-powered application that helps analyze your risk of heart disease using interactive forms, clinical features, and trained classification models.

machine-learning matplotlib numpy pandas python3 random-forest seaborn streamlit

Last synced: 12 Apr 2025

https://github.com/shreeparab1890/laptop-price-predictor

This notebook is trying to build a model which will predict the price of the laptop based on the given specifications.

machine-learning-algorithms matplotlib numpy pandas price-prediction python scikit-learn sklearn

Last synced: 06 Apr 2026

https://github.com/mkdirer/depression-data-analysis

This project analyzes a Kaggle depression dataset using data preprocessing, clustering, classification, and outlier detection techniques. Python libraries like pandas, numpy, matplotlib, seaborn, and scikit-learn are used to extract insights.

classification clustering matplotlib numpy pandas scikit-learn seaborn vizualization

Last synced: 12 Apr 2025

https://github.com/alejoduarte23/si_bayesianmixturemodel

Implementation of a two-stage fast Bayesian system identification for separated Modes. This repository expands the usage of this technique by adding a mixture model fit to obtain modal parameters from the posterior distribution.

matplotlib numpy scikit-learn scipy

Last synced: 20 Feb 2026

https://github.com/mohammedsaqibms/planar_data_classification_with_onehidden_layer

This repository implements a simple neural network for binary classification of 2D planar data using Python and NumPy. It compares logistic regression with neural networks and includes code for forward/backward propagation, gradient descent, and decision boundary visualization.

backpropagation binary-classification gradient-descent logistic-regression machine-learning neural-networks numpy python3

Last synced: 10 Jul 2025

https://github.com/anufrievsy/man_women

Данный скрипт был создан с целью поиска субъективно интересных серий ТВ-шоу "Мужское Женское"

bs4 clear-output fuzzywuzzy ipython ipython-display jupyter jupyter-notebook jupyter-notebooks matplotlib matplotlib-pyplot numpy pandas pandas-dataframe python requests time youtube youtube-search

Last synced: 05 Feb 2026

https://github.com/mah-22/room-occupancy-prediction-using-environmental-sensor-data

This project uses environmental sensor data to predict room occupancy, providing valuable insights for efficient energy management and space utilization in buildings. By analyzing factors like temperature, humidity, and light levels, the model aims to accurately forecast when rooms will be occupied, optimizing resources and enhancing overall buildi

classification data-science data-visualization exploratory-data-analysis machine-learning numpy pandas python seaborn time-series

Last synced: 07 May 2026

https://github.com/sadegh-khedry/pneumonia-detection-using-cnn

Classifying chest X-ray images into two categories: Pneumonia or Normal using a Convolutional Neural Network (CNN).

cnn convolutional-neural-networks cv2 deep-learning harris-corners joblib jupyter-notebook keras kmeans-clustering matplotlib numpy orb pandas python seaborn shutil sifit tensorflow

Last synced: 11 Apr 2026

https://github.com/aysh2603/credit-card-fraud-detection

The Credit Card Fraud Detection project aims to identify fraudulent transactions from a dataset of credit card transactions. The project addresses the challenge of class imbalance and employs advanced machine learning techniques to build an effective fraud detection model.

ensemble-learning hyperparameter-tuning numpy pandas python3 scikit-learn streamlit

Last synced: 04 Feb 2026

https://github.com/faraazarsath/handson-numpy

This respository is on various Numpy Tasks provided as academic assignment

numpy

Last synced: 16 May 2026

https://github.com/itzzmeakhi/ab-testing

This project was done as a part of Udacity's Data Analyst Nanodegree program in which AB-Testing is performed for the users under treatment and control groups to predict whether changes should be done for an website or it is ideal to keep the existing one, based on the performance of old and new pages. Hypothesis testing, z-test, and Logistic Regression is performed to predict results.

bootstrapping hypothesis-testing jupyter-notebook matplotlib numpy pandas python statistics z-test

Last synced: 04 May 2026

https://github.com/lorenzorottigni/ml-interview

Machine Learning python bootcamp: K Nearest Neighbors job interview simulation dataset

ipynb k-nearest-neighbours machine-learning numpy pandas python scikit-learn seaborn

Last synced: 07 Apr 2026

https://github.com/1adore1/face-analysis

An application for face recognition and prediction of gender, age range and emotional state.

numpy opencv pandas python pytorch

Last synced: 07 Apr 2026

https://github.com/sumaiyyaf/spotify-data-analysis

This project explores Spotify music data, using Python to clean, analyze, and visualize the dataset. It examines the relationship between audio features and sentiment in song lyrics, uncovering patterns and trends to provide valuable insights into music preferences.

eda numpy pandas python

Last synced: 08 May 2026

https://github.com/nik-kusanagi/jupyter

Treinamento Para Análise de Dados.

big-data jupyter jupyter-notebook matplotlib numpy pandas python python3

Last synced: 30 Apr 2026

https://github.com/ayushhhh123/diwali_sales_analysis

This project provides a comprehensive analysis of Diwali sales data using Python. The primary goal is to derive business insights from customer purchasing behavior during the Diwali festival season. This analysis can help improve marketing strategies, customer targeting, and overall sales performance.

matplotlib numpy pandas python seaborn

Last synced: 07 May 2026

https://github.com/atharvkadammm/suicide-prediction-system

A machine learning project predicting suicide risk based on multiple socio-economic and environmental factors using data mining techniques.

csv data-analysis data-science data-visualization datamining exploratory-data-analysis feature-engineering machine-learnin matplotlib mental-health numpy pandas riskassesment seaborn sklearn suicide-prediction supervised-

Last synced: 01 Jul 2025

https://github.com/fjohnson87/planning-and-feasability

Data-driven feasibility analysis using pandas, seaborn, and FAO food supply data

matplotlib numpy pandas

Last synced: 04 May 2026

https://github.com/shankhadweep/diabetes-prediction-systemv3

This project demonstrates a machine learning solution for predicting diabetes based on user-provided health data. The application uses Streamlit for an interactive web interface and advanced interpretability tools like SHAP and permutation importance to explain model predictions.

jupyter-notebook machine-learning matplotlib numpy pandas plotly randomforestclassifier sckit-learn seaborn streamlit-webapp

Last synced: 01 May 2026

https://github.com/gauravpatil04/ml

This repo is on "Basics skills required for Machine Learning"

matplotlib numpy pandas python seaborn

Last synced: 11 Jul 2025

https://github.com/edegan-furb/conwaygameoflife

This project is a simulation of Conway's Game of Life using Python with the Pygame library.

conways-game-of-life numpy pygame python

Last synced: 29 Apr 2026

https://github.com/chernyakid/game-success-analysis

Выявление закономерностей успешности компьютерных игр

jupyter numpy pandas python scipy seaborn

Last synced: 07 Apr 2026

https://github.com/vivekmodak3/multiple-disease-prediction-system--streamlit-webapp

This repository contains the codebase for "Multiple Disease Prediction Streamlit App". The training notebooks & the datasets are also provided in the respective folders.

machine-learning-algorithms numpy pandas python3 scikitlearn-machine-learning streamlit

Last synced: 07 Apr 2026

https://github.com/shekharkram/project

A collection of data analytics projects showcasing skills in data cleaning, exploration, visualization, and basic SQL queries. Designed to demonstrate entry-level data analyst competencies using real-world datasets and tools.

datacleaning excel jupyter-notebook mysql numpy pandas postgresql python sql

Last synced: 07 Apr 2026

https://github.com/manshreet27/mrs

This Movie Recommendation System is a web-based application built using Python and Streamlit, designed to provide movie recommendations based on user preferences. It utilizes TMDb API for fetching real-time movie details and Kaggle's TMDB 5000 Movies dataset for content-based filtering.

numpy pandas python scikit-learn streamlit tmdb-5000-movies-dataset-from-kaggle tmdb-api-for-fetching-real-time-movie-data

Last synced: 07 Apr 2026

https://github.com/shweta189/sms-spam-detection

Detecting Spam SMS's messages using machine learning algorithms.Utilizing NLP techniques such as tf-idf vectorization and machine learning algorithms like naive bayes and logistic regression for classifications. Integrated with Streamlit to create an interactive web app where users can input a message and instantly check if it's spam or not.

eda machine-learning-models nltk numpy pandas plotly python seaborn streamlit

Last synced: 07 Apr 2026

https://github.com/muhdhammad/machine-learning

Crafted for hands-on learning and implementation of ML with scikit-learn

data-science jupyter-notebook machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 07 Apr 2026

https://github.com/fikri-rouzan/energy-consumption-prediction

Final Project for the AI/ML Weekly Class by Google Developer Group on Campus (GDGoC) UIN Jakarta.

jupyter-notebook matplotlib numpy pandas python scikit-learn scipy seaborn

Last synced: 07 Apr 2026

https://github.com/paulo-santos-ds/analise_vendas_supermercado

Este projeto realiza o web scraping de Variação de preços de produtos do supermercado

env numpy pandas scrapy sqlite3 streamlit

Last synced: 29 Apr 2026

https://github.com/dmarks84/coursework_project_ml-classifier-eval-selection

Project for University of Michigan Applied Data Science Specialization -- Predicted viewer engagement based on features related to video metrics; evaluated a large set of classifiers under different scoring metrics to select the "optimal" one.

classification cross-validation data-modeling data-reporting data-visualization databases dataframes eda grid-search matplotlib numpy pandas python scikit-learn statistics supervised-ml

Last synced: 02 Apr 2026