An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/ameykasbe/credit-card-fraud-detection-on-imbalanced-dataset

Examined data preprocessing techniques and performance of six different predictive models in Python to credit card fraud detection problem on an imbalanced dataset. Algorithms implemented - Logistic Regression, K Nearest Neighbours, Support Vector Classification, Naïve Bayes Classifier, Decision Tree Classifier, and Random Forest Classifier.

classification machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/aneeshmurali-n/project-ml-data-preprocessing

The main objective of this project is to design and implement a robust data preprocessing system that addresses common challenges such as missing values, outliers, inconsistent formatting, and noise. By performing effective data preprocessing, the project aims to enhance the quality, reliability, and usefulness of the data for machine learning.

data-analysis data-cleaning data-encoding data-exploration feature-scaling label-encoding matplotlib minmaxscaler numpy one-hot-encoding outlier-detection pandas standardscaler

Last synced: 02 May 2026

https://github.com/semihbugrasezer/rockvsmine

Rock vs Mine Prediction with Python | Machine Learning Project

numpy pandas python

Last synced: 05 May 2026

https://github.com/elon-fask/nlp_num1

Natural Language Processing with Disaster Tweets

ai machine-learning nlp nlp-machine-learning numpy pandas python text-processing

Last synced: 10 Apr 2026

https://github.com/azaz9026/data_cleaning

Welcome to the Data Cleaning repository! This collection is dedicated to showcasing techniques and methods for cleaning and preparing datasets for analysis.

data-analysis data-engineering data-structures data-visualization eda feature-engineering machine-learning numpy outliers pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/alejoduarte23/reading_data_from_dewesoft

The following repository retrieves sensor data (acceleration and strains) from both local and cloud databases. It processes the data using classes from another repository called Modal Engine for spectral analysis, modal analysis, and signal processing.

dewesoft matplotlib modal-analysis numpy orm scipy signal-processing sql sqlalchemy

Last synced: 07 Jan 2026

https://github.com/hussain-7/emotion_detection-master

Human Emotion Analysis using facial expressions in real-time from webcam feed. Based on the dataset from Kaggle's Facial Emotion Recognition Challenge.

keras-tensorflow matplotlib numpy opencv-python tensorflow

Last synced: 08 May 2026

https://github.com/sc0v0ne/ai-discipline-work

AI Discipline Work - Movie recommendation

jupyter-notebook machine-learning numpy pandas python python3

Last synced: 15 Apr 2025

https://github.com/paulbrenker/decision-tsp

Scripts from a university data analysis project to approximate the decision tsp problem using heuristics and machine learning

data-science machine-learning numpy poetry python

Last synced: 26 May 2026

https://github.com/paraskevi-kivroglou/rl-pong-agent

A project by Paraskevi Kivroglou as part of exploring deep reinforcement learning applications.

atari atari-games gym-environment gymnasium numpy python3 pytorch q-learning reinforcement-learning reinforcement-learning-agent

Last synced: 11 Apr 2026

https://github.com/pardhuu66/college-id-validator

FastAPI-based offline College ID Validator with Docker support

base64 dnn docker easyocr fastapi mobilenetv2 numpy onnx onnxruntime opencv pillow pydantic python tensorflow uvicorn

Last synced: 11 Apr 2026

https://github.com/armahdavi/qff-evalation_code-data-processing-statistics-plotting

Data pipelines and processing codes, statistical modellings, descriptive statistics, and plot visualizations for QFF evaluation phase of for Mahdavi et al. (2021) (Environmental Pollution) Project Miestone: 2018 - 2021 Full-length article: https://www.sciencedirect.com/science/article/abs/pii/S0269749120370779

data-science data-visualization histogram matplotlib matplotlib-pyplot numpy pandas python

Last synced: 11 Apr 2026

https://github.com/djdhairya/crop-recommendation

Crop Recommendation System is a powerful tool for enhancing agricultural decision-making. By leveraging data-driven insights, it empowers farmers to maximize yield and ensure sustainable practices.

adaboostclassifier bagging-classifier csv decision-trees gaussian html knn-classification logistic-regression machine-learning machine-learning-algorithms matplotlib model numpy pandas random-forest random-forest-classifier scikit-learn seaborn svc

Last synced: 11 Apr 2026

https://github.com/kirtipratihar/python_libraries_for_ds

This repository serves as a comprehensive guide to Python programming for Data Science. It covers essential topics like data manipulation, data visualization, machine learning, and statistical analysis using popular libraries such as Pandas, NumPy, Matplotlib, Seaborn, and Scikit-Learn.

artificial-intelligence machine-learning numpy pandas python scikit-learn tensorflow

Last synced: 11 Apr 2026

https://github.com/dhanish03/credit_card_fraud_detection

Developed and implemented an advanced CCFDS using ML algorithms and pattern recognition techniques. Integrated real-time monitoring and adaptive learning capabilities into the system to dynamically adjust fraud detection parameters, ensuring effectiveness in identifying emerging fraud patterns.

kaggle-dataset numpy pandas-dataframe python3 sklearn

Last synced: 16 Apr 2026

https://github.com/iamsaniasingh/heart_disease_prediction

This is my very first machine learning project, where I used a supervised learning algorithm—logistic regression—to predict heart disease. The model was trained and tested entirely on a pre-existing dataset, with no user input involved. The goal was to understand how ML models work and how they can be applied in healthcare predictions.

logistic-regression machine-learning machine-learning-algorithms numpy pandas python sklearn

Last synced: 11 Apr 2026

https://github.com/ksharma67/anomaly-detection-on-temperature-device-failure

A typical anomaly detection task and performing KMeans, PCA, Gaussian distribution, and Isolation Forest.

eda ellipticenvelope feature-engineering gaussian-distribution isolation-forest kmeans-clustering numpy pca python sklearn

Last synced: 11 Apr 2026

https://github.com/ngangawairimu/data-validation-using-python

Agricultural dataset validated using python code for usage. Building a data pipeline that will ingest and clean data with the press of a button.

jupyter-notebook numpy pandas pytest python

Last synced: 13 Apr 2026

https://github.com/charles-l/rayboi

a raytracer written in futhark/python

futhark numpy pathtracing python3 raytracing

Last synced: 19 Apr 2026

https://github.com/nishi1612/knight-tour-problem

IT485 Logic of Inference Project on Knight's Tour. A hamiltonian path problem to determine path of knight to travel entire chessboard with every cell visited only once.

backtracking-algorithm bootstrap flask html knights-tour localhost numpy pygame python tkinter warnsdorff

Last synced: 11 Apr 2026

https://github.com/swarnabhaghosh/house-price-prediction-model

Built an end-to-end regression pipeline to predict house prices using Linear Regression with automated preprocessing (PowerTransform, StandardScaling) via Scikit-learn's Pipeline and ColumnTransformer.

column-transformer linear-regression matplotlib-pyplot numpy pandas pipeline python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/nachtfeuer/covid19

Python script(s) for visualizing corona data

csv json matplotlib numpy pandas python requests tkinter

Last synced: 05 May 2026

https://github.com/eduardoprofe666/mn-api

🐍📦 Paquete de Python con implementaciones de métodos numéricos

mn-api numerical-methods numpy pandas python scipy simpy tabulate

Last synced: 04 Jan 2026

https://github.com/amanda-ucc/market-tracker

Automated portfolio creator that generates a portoflio which mimics a bench mark index by minimizing the tracking error.

beta covariance numpy pandas scipy sharpe-ratio tracking-error variance yfinance

Last synced: 11 Apr 2026

https://github.com/allanreda/telco-customer-churn-predictor-app

A web-based machine learning application that predicts customer churn using a logistic regression model. Built with Scikit-Learn for model training, Gradio for the user interface, and deployed on Google Cloud App Engine. The app allows users to input customer data and receive predictions on churn risk to support business decision-making.

app-engine data-visualization deployment google-cloud gradio hyperparameter-tuning logistic-regression machine-learning numpy pandas scikit-learn

Last synced: 16 Apr 2026

https://github.com/bhawnamehbubani/advanced-zomato-restaurant-clustering-and-sentiment-analysis

This project leverages this data to perform clustering and sentiment analysis, providing actionable insights for customers and business growth.

matplotlib-pyplot nltk numpy pandas seaborn sklearn wordcloud

Last synced: 05 May 2026

https://github.com/project54321/stocksniper

"An interactive Stock Dashboard that visualizes historical prices, calculates key financial metrics, displays fundamental statements, and aggregates the latest news for any ticker — all in a sleek, easy-to-use Streamlit interface."

alphavantage finance numpy pandas plotly pypi python stock-market stocknews streamlit

Last synced: 11 Apr 2026

https://github.com/cfbastarz/jupyternotebooks

A collection of several Jypyter notebooks.

dask matplotlib numpy python xarray xesmf

Last synced: 18 Jan 2026

https://github.com/sofiakhutsieva/analytical_report_for_hr_department

Подготовка аналитического отчета (SQL) (Проект в skillbox), 2020 г.

analytics matplotlib numpy pandas psycopg2 seaborn sqlalchemy

Last synced: 06 May 2026

https://github.com/chokzb/covid19_vaccination_analysis

An EDA project examining global COVID-19 vaccination progress. The notebook investigates vaccination trends by country, daily vaccination rates, timeline patterns, and dose distribution. The project includes visualisations created with Matplotlib, Seaborn, and Plotly.

covid-19 data-analysis data-visualization jupyter-notebook matplotlib numpy pandas plotly python seaborn vaccination

Last synced: 07 May 2026

https://github.com/stdlib-js/blas-ext-linspace

Return a new ndarray filled with linearly spaced values over a specified interval along one or more ndarray dimensions.

arange arrange javascript linear linspace math mathematics matlab ndarray node node-js nodejs numpy seq sequence statistics stats stdlib

Last synced: 04 May 2026

https://github.com/bhuvan-s-prasad/streamlit-regression

A machine learning project that predicts housing prices in California using regression techniques. This project includes comprehensive exploratory data analysis, feature engineering, linear regression modeling, and an interactive Streamlit web application for making predictions.

california-housing-price-prediction exploratory-data-analysis linear-regression machine-learning matplotlib numpy pandas python scikit-learn seaborn streamlit supervised-learning

Last synced: 11 Apr 2026

https://github.com/ecrows/flippy

Python command-line utility for generating videos from image directories.

data-science image-processing numpy opencv python python3 video video-processing visualization

Last synced: 04 Mar 2026

https://github.com/4211421036/githubiotpy

GitHubIoT is a comprehensive toolkit designed to simplify the visualization of IoT (Internet of Things) data with seamless GitHub integration. The application provides an intuitive graphical interface for real-time data monitoring, analysis, and configuration

cli esp32 esp8266 github-actions github-iot matplotlib numpy pypi-packages python tkinter

Last synced: 16 Apr 2025

https://github.com/parbhat-cpp/suicidal-ml

A machine learning/NLP-based system to identify signs of suicidal ideation from user text inputs.

bash cicd classification docker fastapi githubactions jinja2 jupyter-notebook machine-learning natural-language-processing nlp numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/chaudharypraveen98/lungcancerdetection

To distribute the work of doctors and process the large amount of data to produce accurate results on the go

numpy pandas pillow python scipy tenserflow

Last synced: 16 Apr 2026

https://github.com/matheusafonseca/c111

Este repositório é dedicado ao armazenamento e organização dos códigos desenvolvidos na disciplina C111 - Análise de Dados, oferecida pelo Instituto Nacional de Telecomunicações (INATEL).

data-analysis matplotlib numpy pandas python

Last synced: 06 May 2026

https://github.com/vyjayanthipolapragada/marketing_statistical_analysis

Statistical analysis of customer data and their impact on the sales of products based on marketing campaigns

customer-data data-analysis dataframes marketing matplotlib numpy pandas python seaborn statistical-analysis

Last synced: 11 Apr 2026

https://github.com/hitthecodelabs/weatherforecast-lstm

TensorFlow and Python for analyzing and forecasting weather data

cnn keras lstm matplotlib numpy pandas python tensorflow weather weather-forecast

Last synced: 11 Apr 2026

https://github.com/rahul-shrivastav/digit-recognition-nn

Web application that recognizes handwritten digits using a deep learning model

deep-learning flask numpy python reactjs tensorflow

Last synced: 11 Apr 2026

https://github.com/yanglr/mlnotes-python

Machine learning Notes for Python 3 (NumPy, TensorFlow etc.), with interactive Jupyter notebook.

jupyter jupyter-notebook numpy numpy-notes

Last synced: 21 May 2026

https://github.com/dmarks84/coursework_project_text-mining-spam-analysis

Project for University of Michigan Applied Data Science Specialization -- Performed NLP in order to build features of email messages; trained various classification models to help predict if a message was spam.

classification databases eda nlp numpy pandas python scikit seaborn sentiment-analysis statistics supervised-ml text-mining unsupervised-ml visualization

Last synced: 11 Apr 2026

https://github.com/broodhoney/titanic-ml-from-disaster

This repository contains my analysis and solutions for the Titanic: Machine Learning from Disaster competition on Kaggle. The notebook explores the dataset, performs extensive Exploratory Data Analysis (EDA), applies feature engineering techniques, and builds predictive models to determine survival outcomes based on passenger data

machine-learning numpy pandas python scikit-learn scikitlearn-machine-learning

Last synced: 11 Apr 2026

https://github.com/supreme-gg-gg/beam-analysis

Fast, customisable, intuitive web application to simplify beam analysis. Certified by Bob the Builder.

civil-engineering matplotlib numpy python streamlit structural-analysis

Last synced: 11 Apr 2026

https://github.com/samuele-lolli/data-analytics-techniques

A practical approach to data analytics pipeline.

numpy pandas pytorch scikit-learn

Last synced: 11 Apr 2026

https://github.com/iaroslavbelavskii/statkemapp

Here's the code for my 'StatKemApp' app, that can be used to calculate statistics for the experimental laboratory data (e.g. titration). Feel free to use it and report any bugs!

chemical-engineering chemistry matplotlib-pyplot numpy pandas python scipy

Last synced: 11 Apr 2026

https://github.com/pd2871/streamlit_automl

Auto Exploratory Data Analysis using Streamlit App

matplotlib numpy pandas python streamlit streamlit-webapp

Last synced: 11 Apr 2026

https://github.com/vyjayanthipolapragada/fraud_detection_creditcard

Detecting the fraudulent credit card transactions by training Decision Tree model using Scikit-learn and SnapML

classification-model data-preprocessing decision-tree-classifier kaggle-dataset machine-learning numpy pandas python scikit-learn snapml time tree-model

Last synced: 11 Apr 2026

https://github.com/dharma-acha/resnet18_imageclassification_cnn

In this part of the project, we implement ResNet-18 from scratch using PyTorch and train it on an image dataset to achieve over 75% accuracy. We apply techniques to prevent overfitting and optimize performance, aiming for an accuracy of 80% or higher.

matplotlib numpy python3 pytorch scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/shwetajanwekar/hr-data-analytics

Analyze the HR data through exploratory data analysis, we identified key factors influencing employee attrition, satisfaction levels, and performance. Machine learning models enabled us to predict employee churn and classify potential candidates for promotion.

matplotlib numpy pandas seaborn

Last synced: 19 Apr 2026

https://github.com/richardbmk/datascience_machinelearning

projects related with data science and machine learning projects.

data-science machine-learning matplotlib numpy pandas scikit-learn scipy seaborn

Last synced: 11 Apr 2026

https://github.com/thorbeorn/epsi-m1-datamanagement-etude-de-cas

Ce projet implémente une chaîne complète de traitement de données RH après fusion d’entreprise

dashboard duckdb epsi etl kaleido numpy pandas plotly python pyyaml

Last synced: 11 Apr 2026

https://github.com/lorenzorottigni/dl-lending-club

Deep Learning python bootcamp: deep learning on Lending Club dataset

deep-learning ipynb keras machine-learning numpy pandas python scikit-learn seaborn tensorflow

Last synced: 11 Apr 2026

https://github.com/vansh-py04/convolution-net-from-scratch

A minimal NumPy-based implementation of a 3-layer convolutional neural network (CNN) from scratch — including custom forward and backward passes for conv, ReLU, pooling, affine, and softmax layers. Perfect for learning how CNNs actually work under the hood.

backpropagation cnn computer-vision convolutional-neural-networks cs231n cs231n-assignment deep-learning from-scratch from-scratch-in-python fully-connected-network machine-learning maxpool2d nueral-networks numpy softmax stanford-deep-learning

Last synced: 24 Apr 2026

https://github.com/sudarsann27/basic_machine_learning_algorithms

Basic Machine learning algorithms using scikit-learn and other fundamental libraries

data-science data-visualization ensemble-model kaggle numpy pandas scikit-learn supervised-machine-learning

Last synced: 20 Jan 2026

https://github.com/coderjolly/employee-information-retrieval-system

An information retrieval system constructed using Flask to perform ETL operations while ingesting New York Payroll data and provide a frontend user interface for manipulating filters and queries.

etl flask mvc numpy pandas python sqlite3

Last synced: 11 Apr 2026

https://github.com/alphan26/breastcancerclassification

This project involves a binary classification problem where we applied PyTorch to detect breast cancer.

matplotlib numpy pandas python pytorch

Last synced: 11 Apr 2026

https://github.com/dmarks84/coursework_project_ml-classification

Project for IBM Data Science course on Machine Learning -- Trained ML models for classification, evaluating based on a variety of metrics

classification communication data-modeling dataframes numpy pandas python scikit-learn supervised-ml

Last synced: 11 Apr 2026

https://github.com/jaygaha/ai-mastery-roadmap

A collaborative AI learning roadmap. I'm documenting my journey from grasping the fundamentals of Python to mastering Machine Learning. Feel free to fork and contribute — let's build our AI skills together!

artificial-intelligence beginners-guide data-science deep-learning education numpy python python-programming self-taught tutorial

Last synced: 18 Jan 2026

https://github.com/niteshchawla/yulu-hypothesistesting

Yulu has recently suffered considerable dips in its revenues. They have contracted a consulting company to understand the factors on which the demand for these shared electric cycles depends. Specifically, they want to understand the factors affecting the demand for these shared electric cycles in the Indian market.

hypothesis-testing matplot numpy pandas-library scipy-stats seaborn

Last synced: 20 Apr 2026

https://github.com/pakagronglb/istanbul-restaurant-tips-prediction

This project focuses on predicting restaurant tips in Istanbul using machine learning techniques, specifically Linear Regression. The analysis includes data preprocessing, exploratory data analysis (EDA), model training, and evaluation.

kaggle matplotlib numpy pandas prediction python scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/waynewbishop/bishop-algorithms-quiver-package

Quiver is a Swift package that provides vector operations for Swift applications.

data-science linear-algebra numpy swift vectors

Last synced: 31 Mar 2025

https://github.com/lorenzorottigni/ml-titanic

Machine Learning python bootcamp: logistic regression on titanic dataset

ipynb logistic-regression machine-learning numpy pandas python scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/amirkh777/data-science-yandex-practicum

My projects from the Yandex Practicum Data Science course.

data-science matplotlib numpy pandas python scikit-learn scipy seaborn

Last synced: 12 Apr 2026

https://github.com/shhiivvaam/traffic_prediction

This project explores the application of advanced neural network architectures, including Multi-Layer Perceptron (MLP), Long Short-Term Memory (LSTM), and Gated Recurrent Unit (GRU), to forecast traffic volume. 👍👍✅

conv1d conv2d keras lstm matplotlib numpy pandas seaborn sklearn statsmodels tensorflow

Last synced: 12 Apr 2026

https://github.com/iamkushvanth/marketing-campaign-analysis

Data analysis comparing the performance of Facebook and AdWords ad campaigns in 2019, Using Statical Methods such as A/B test focusing on clicks, conversions, cost-effectiveness, and seasonal trends.

ab-testing hypothesis-testing linear-regression matplotlib-pyplot numpy pandas python regression-analysis scikit-learn scipy stastistics

Last synced: 07 Apr 2026

https://github.com/jillmpla/pythonxmldatabase

A Python & XML database application for a book collection.

database matplotlib numpy pandas pandas-dataframe python xml

Last synced: 12 Apr 2026

https://github.com/webintellectual/coloured-image-to-b-w-converter-web-app

A Streamlit web application to convert a coloured image to black and white image.

numpy opencv pil pillow python3 streamlit streamlit-webapp

Last synced: 12 Apr 2026