An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/harmanveer2546/heart

Predicting the presence of heart disease based on several health-related factors and Performing - i.) Data Cleaning ii.) Data Pre-Processing iii.) EDA iv.) Compare 5 different classification algorithms (Logistic Regression, Decision Tree, Random Forest, KNN and SVC)

data-preprocessing decision-tree eda knn logistic-regression machine-learning numpy pandas random-forest roc-auc-curve svc

Last synced: 03 May 2026

https://github.com/lavkalsi/creditcardfrauddetector

Credit Card Fraud Detector is a React web app that predicts if a credit card transaction is fraudulent using a Python machine learning model. Users can input transaction data, and Flask facilitates communication between the backend and frontend. Backend files are located in the res folder. This app provides simple UI for user interaction.

fraud-detection machine-learning nodejs numpy pandas python reactjs sklearn-library

Last synced: 15 Apr 2026

https://github.com/sahraiidle/email-spam-detector

Email/SMS spam detector with a Flask UI/API, tuned ML models (TF‑IDF + SVM/LogReg/NB), and a ready-to-run web form plus JSON endpoint for predictions.

data machine-learning numpy pandas python randomforest scikit-learn spam-classifier spam-detection svm

Last synced: 24 Jan 2026

https://github.com/alejandro945/insurance-risk

This project aims to predict the risk of insurance claims using a dataset from Kaggle. The dataset consists of 26 columns and 205 rows, providing various features related to insurance risk. By analyzing this data, we seek to build predictive models that can help insurers assess the risk of claims.

data-analytics ipython-notebook numpy pandas python

Last synced: 06 Feb 2026

https://github.com/slovnicki/matematicki-softver

homeworks and projects for Mathematical Software course

ipython jupyter latex markdown mathplotlib numpy pandas scikit sympy

Last synced: 15 Apr 2026

https://github.com/ambrish8/bell-s-palsy

BELL’S PALSY SEVERITY DETECTION USING DEEP LEARNING

cnn-architecture imagedatagenerator keras numpy opencv python pytorch resnet-50

Last synced: 15 Apr 2026

https://github.com/guy1998/synthetic-data-generator-for-event-cameras

Automated workflow for the generation of synthetic data that can be used as an input for event cameras based on previously taken videos

computer-vision event-cameras numpy opencv synthetic-data

Last synced: 15 Apr 2026

https://github.com/hansalemaos/cv2_stack_images

Stacks 2 images (horizontal/vertical)

concat cv2 images numpy python stack

Last synced: 28 Jan 2026

https://github.com/drsartoriuss/portfolio

Учебные проекты из Яндекс Практикума "Специалист по Data Science+"

catboost computer-vision data-science deeplearning mashinelearning matplotlib nlp-machine-learning numpy pandas scipy seaborn sklearn sql torch tsf

Last synced: 28 Jan 2026

https://github.com/subh888999/car-prices--analysis-projects

This repository houses projects focused on data collection, assessment, cleaning, visualization, and analysis. It includes workflows and methodologies for handling data, from initial gathering and evaluation to processing, visualizing insights, and performing in-depth analysis

jupyter-notebook matplotlib numpy panda seaborn statistics

Last synced: 03 May 2026

https://github.com/shahaba83/airplane-ticket-cancellation

In this project, we try to predict the possibility of canceling the plane ticket by the buyer

datatime numpy pandas python scikit-learn seaborn

Last synced: 25 Feb 2026

https://github.com/asherk7/house-price-prediction

House Prices - Advanced Regression Techniques - Predict sales prices and practice feature engineering, RFs, and gradient boosting

data-science numpy pandas regression scikit-learn

Last synced: 15 Apr 2026

https://github.com/jaypanchal9/fraud-detection-case-study

A comprehensive case study applying machine learning techniques to detect fraudulent transactions effectively.

machine-learning matplotlib numpy pandas python3 scikit-learn seaborn xgboost

Last synced: 15 Apr 2026

https://github.com/hercules-peters/learn-ml-with-me

Welcome to **ML-Journey**, my personal progress tracker for mastering machine learning! This repository showcases my projects, notebooks, and explorations as I advance my skills, starting with hands-on projects.

machine-learning numpy pandas python sklearn

Last synced: 15 Apr 2026

https://github.com/martincastroalvarez/search-keras-gensim-elasticsearch

Search Engine using Word Embeddings, GloVe, Neural Networks, BART and Elasticsearch

elasticsearch gensim gensim-word2vec keras nlp numpy python scipy spacy word2vec

Last synced: 15 Apr 2026

https://github.com/muhkartal/fr-photo

face-recognition system works on images first of all you need to register valid image as recognition

face-recognition numpy opencv pillow python streamlit

Last synced: 15 Apr 2026

https://github.com/ljadhav25/decision-tree-random-forest-algorithm-data-science-

This repository contains an implementation of decision tree and random forest algorithms from scratch in Python. Decision trees and random forests are popular machine learning algorithms used for classification and regression tasks. The goal of this project is to provide a clear and understandable implementation of these algorithms

data-analysis data-science decision-trees machine-learning-algorithms matplotlib numpy pandas python random-forest-classifier

Last synced: 15 Apr 2026

https://github.com/itssahilwhat/ai-fundamentals

A curated collection of fundamental AI concepts, algorithms, and code implementations — including Machine Learning, Deep Learning, and Computer Vision — built from scratch and with practical examples.

computer-vision deep-learning machine-learning numpy pandas python pytorch scikit-learn

Last synced: 15 Apr 2026

https://github.com/stavrosarm/mathematics_algorithms

Fundamental algorithms of Linear Algebra , Numerical Analysis and Calculus in Python and Julia.

calculus julia-language linear-algebra numerical-analysis numpy python

Last synced: 30 Jan 2026

https://github.com/ayuub34/siesta-smart

This project is designed to be a AIaaS 2-Way-PMS optimized with AI and ML models. The goal is to enhance the efficiency of the system through advanced technology solutions.

ai data-analysis-python data-processing data-science jupyter-python machine-learning matplotlib numpy pandas powerbi-report pyqt5 scipy seaborn sklearn

Last synced: 25 Feb 2026

https://github.com/shwetapardhi/assignment-03-q4-hypothesis-testing

Chi2 contengency independence test Q4. TeleCall uses 4 centers around the globe to process customer order forms. They audit a certain % of the customer order forms. Any error in order form renders it defective and has to be reworked before processing. The manager wants to check whether the defective % varies by centre. Please analyze the data at 5

alternate-hypothesis chi2-contingency contingency-analysis hypothesis-testing norm null-hypothesis numpy numpy-arrays pandas python scipy significance-testing stats

Last synced: 31 Jan 2026

https://github.com/moh-27sl/image-alignment--python-edition

Alignment of an unaligned image with a base/reference image using feature detection, feature matching, and homography in OpenCV.

ai classical-computer-vision classical-image-processing computer-vision computervision image-alignment image-processing imageprocessing neural-network numpy opencv python python3

Last synced: 15 Apr 2026

https://github.com/manu-karenite/medical-insurance-cost-predictor

Medical Insurance Cost Generator is a Linear Regression based Predictor which is used to estimate and predict the Cost a person has to pay while Buying a Medical Insurance.

kaggle-dataset linear-regression machine-learning matplotlib numpy pandas python3 reactjs scikit-learn

Last synced: 15 Apr 2026

https://github.com/hajaarh/health_state_classifier

Ce projet vise à analyser un dataset contenant divers indicateurs de santé, bien-être et vie quotidienne afin de classifier les individus en trois catégories selon leur état de santé.

numpy pandas python seaborn sklearn

Last synced: 15 Apr 2026

https://github.com/sarmad426/ai

AI basic to advanced featuring Machine Learning, Deep Learning and Data Science.

ai data-science deep-learning hugging-face machine-learning numpy pandas python scikit-learn

Last synced: 15 Apr 2026

https://github.com/l-gre/data_analytics_for_finance

Comprehensive course materials for the Data Analytics for Finance - Master Programme, covering data manipulation, statistical analysis, visualisation, automation, and real-world case studies using industry-standard tools.

automation data-cleaning data-manipulation data-visualization excel hypothesis-testing industry-applications matplotlib numpy pandas python real-world-case-studies regression-analysis seaborn sql statistical-analysis tableau workflow-automation

Last synced: 15 Apr 2026

https://github.com/karishmagupta05/airbnb-data-scraping-and-analysis

This project involves web scraping Airbnb listings using Python and BeautifulSoup, followed by data analysis with Pandas. The goal is to extract meaningful insights from the data, clean and preprocess it, and visualize the results using libraries like Matplotlib and Seaborn.

beautifulsoup4 matplotlib numpy pandas python scraping seaborn visualization

Last synced: 15 Apr 2026

https://github.com/nmerka/practicum_rep

Проекты курса Аналитик данных (Яндекс.Практикум)

jupyter-notebook matplotlib numpy pandas plotly python scipy seaborn sql tableau

Last synced: 31 Jan 2026

https://github.com/farha-dahman/investigate_dataset

Investigate [No-show appointments] Dataset

data-analyst jupyter-notebook matplotlib numpy pandas python

Last synced: 15 Apr 2026

https://github.com/singhvishal003/sales-data-analysis

Data Analysis according to sale data.

matplotlib numpy pandas seborn

Last synced: 15 Apr 2026

https://github.com/ankitpundir7/group-face-emotion-recognition

This project can identify and categorize human emotions in both static and dynamic context. Duration : Jun 2023 - Jul 2023

convolutional-neural-networks deep-learning keras machine-learning-algorithms numpy opencv python tensorflow

Last synced: 07 Feb 2026

https://github.com/dmitruzik/python_projects

All python projects are here

matplotlib numpy pandas python seaborn

Last synced: 15 Apr 2026

https://github.com/samiyaalizaidi/nn-ml-homeworks

Homework solutions for CPE-4903: Neural Networks & Machine Learning at Kennesaw State University.

machine-learning machine-learning-workflow neural-networks numpy scikit-learn

Last synced: 15 Apr 2026

https://github.com/dhruvv1402/document-scanner-python-opencv

Transform smartphone photos into scanned documents in seconds! This Python-based document scanner automatically detects edges, corrects perspective, and enhances document images to produce clean, scanner-like results.

canny-edge-detection contour-detection imutils numpy opencv python scikitlearn-machine-learning warpperspective

Last synced: 15 Apr 2026

https://github.com/arv-anshul/campusx-graphs

Contains all the graphs provided by CampusX to learn Statistics and Algebra with some added feature.

algebra campusx-dsmp data-science matrix numpy statistics streamlit vizualisation

Last synced: 15 Apr 2026

https://github.com/pedrohenriquebr/pegasus

Finance Reporting Automation

excel finance financial numpy pandas python python3 report

Last synced: 15 Apr 2026

https://github.com/anshpg/exploration-in-image-processing-digit-image-generation

This project, developed by Anshuman Pattnaik, explores image processing techniques using Python libraries such as pandas, numpy, matplotlib, and cv2 (OpenCV). The primary objective of the project was to delve into image processing with a focus on creating a unique dataset and algorithm for image generation.

cv2 image-generation image-processing ipynb-jupyter-notebook matplotlib-pyplot numpy opencv pandas

Last synced: 01 Feb 2026

https://github.com/tsar123/neutral-single-nucleotide-polymorphisms-analysis

analysis of data on neutral single nucleotide polymorphisms

dna-polymorphism matplotlib numpy pandas seaborn snv

Last synced: 01 Feb 2026

https://github.com/chandkund/sentiment-analysis-on-movie-reviews

Project on Sentiment Analysis of Movie Reviews using K-means clustering. This project clusters movie reviews based on their sentiment scores to understand audience reactions. Includes data preprocessing, feature selection, model development, and visualization.

data-science eda jupyter-notebook machine-learning matplotlib numpy pandas preprocessing python sklearn

Last synced: 01 Feb 2026

https://github.com/bineet-ratna-shakya/data-science-salary-analysis

analyzing a dataset containing salaries of data science professionals from 2020 to 2023.

data-analysis data-science data-visualization jupyter numpy pandas python

Last synced: 01 Feb 2026

https://github.com/audeering/audmath

General math functions

math numpy

Last synced: 07 Feb 2026

https://github.com/nits2612/data-science-projects

Portfolio of data science projects completed by me during PGP AI/ML, self learning, and hobby purposes.

data data-science dataanalysis deep deep-learning keras machine-learning matplotlib numpy opencv pandas python scikit-learn seaborn surprise-python tensorflow transfer-learning

Last synced: 01 Feb 2026

https://github.com/mohamedelashri/lvec

Python package for seamless handling of Lorentz vectors

awkward hep hep-ex numpy physics root root-cern uproot

Last synced: 25 Feb 2026

https://github.com/luizabash/wmap-tt-eda

EDA of the WMAP 5-Year TT Power Spectrum data, the analysis includes summary statistics, data visualization, polynomial regression for model fitting, and hypothesis testing

astrophysics matplotlib numpy pandas scipy

Last synced: 01 Feb 2026

https://github.com/hygull/pyrandocs

A repository (pyrandocs i.e. Python's v2/v3 random documents/files) containing different Python v2/v3 based resources (files/documents etc.) from random topics. A project to try different Python's features.

base64 beautifulsoup class exception-handling hashlib json jwt-authentication loops matplotlib nltk numpy pandas python2 python27 python3 requests searching time-complexity urllib urllib2

Last synced: 15 Apr 2026

https://github.com/hasanyahya101/pareto-tutorial-python

This tutorial demonstrates how to create Pareto charts using Python libraries like pandas, NumPy, and Matplotlib. All code for this is given in the Jupyter Notebook.

chart charts code-blocks ipynb jupyter-notebook license markdown matplotlib mit numpy pandas pareto py python

Last synced: 07 Feb 2026

https://github.com/sarowarahmed/advertising-sales-app

📈 Advertising Sales Predictor: A web app powered by a Machine Learning model, built with Numpy, Pandas, Scikit-learn, and Streamlit, to forecast sales based on TV, Newspaper, and Online Advertising. Deployed on Streamlit Cloud for real-time, easy-to-use predictions.

advertising app machine-learning multiple-linear-regression numpy pandas sales scikit-learn streamlit

Last synced: 07 Feb 2026

https://github.com/vladimiracunadev-create/python-data-science-program

Python Data Science Program — 197 clases en 9 partes. Pauta avanzada derivada de Géron, VanderPlas, Huyen, ISLP y Barocas/Hardt/Narayanan. Recurso personal de aprendizaje, enseñanza y mejora continua.

bootcamp data-analysis data-science education jupyter machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 01 Jun 2026

https://github.com/luisall12/fraud-detection-machine-learning-python

Fraud detection system that recognises and signals strange transactions using machine learning

detection flask fruad html numpy pandas python ui

Last synced: 15 Apr 2026

https://github.com/sharkb8t/credit-risk-classification

Demonstrates my abilities to use Jupyter Notebook with scikit-learn to train and evaluate a machine learning model.

jupyter-notebook numpy pandas pathlib python scikit-learn

Last synced: 15 Apr 2026

https://github.com/mdaltamashalam/uber-fare-prediction-models

Predicts the fare amount of Uber rides based on various factors such as pickup/drop-off coordinates, passenger count, and trip distance.

catboost data-analysis data-cleaning data-visualization lgbm-regressor machine-learning matplotlib numpy pandas python random-forest regression-models skit-learn xgboost-algorithm

Last synced: 26 Feb 2026

https://github.com/coueghlani/nlp

Proyecto de Procesamiento de Lenguaje Natural y Análisis de Datos

mineria-de-datos nlp nlp-machine-learning nltk numpy procesadores-de-lenguajes sklearn spacy

Last synced: 08 Feb 2026

https://github.com/giihcastro/trabalho-fisica

3D VPython Electromagnetic Particle Motion Simulator

numpy python vpython

Last synced: 15 Apr 2026

https://github.com/smahala02/materials-science-introduction

Introduction to Materials Science concepts using Python for array manipulation and visualization with NumPy and Matplotlib.

data-visualization materials-science matplotlib numpy python scientific-computing

Last synced: 09 Feb 2026

https://github.com/rampal-punia/data-science-toolkit

Your Go-To Resource for Essential Data Science Related Commands, Concepts, Quick Overviews and Useful Functions.

artificial-intelligence data-science keras machine-learning matplotlib nlp nlp-machine-learning numpy pandas pythorch sql tensorflow

Last synced: 09 Feb 2026

https://github.com/hansalemaos/all_against_all

Each item of a list against all others

list numpy

Last synced: 02 Apr 2026

https://github.com/sarowarahmed/predicting-kolkata-house-price

🏠 Predicting Kolkata House Price: A web app powered by a Machine Learning model, built with Numpy, Pandas, Scikit-learn, and Streamlit, to predict house prices in Kolkata. Deployed on Streamlit Cloud for easy access and real-time predictions.

app kolkata linear-regression machine-learning numpy pandas scikit-learn streamlit

Last synced: 26 Feb 2026

https://github.com/sahilk12nayak/data-capstone-projects-using-python-libraries

In this data project we will focus on exploratory data analysis. This project utilizes Python libraries such as NumPy, Pandas, Matplotlib, and Seaborn for data manipulation, analysis, and visualization.

matplotlib numpy pandas seaborn

Last synced: 15 Apr 2026

https://github.com/mahmud-kandawala/simulating-covid-19-python-

This repository contains a Python program that simulates the spread of COVID-19 using Numpy and Matplotlib for calculations and visualization. The simulation offers an interactive way to understand virus dynamics and the impact of epidemiological parameters.

matplotlib numpy python

Last synced: 16 Apr 2026

https://github.com/mariam-zaidi/bike-sharing-prediction-model

Build a multiple linear regression model for the prediction of demand for shared bikes.

linear-regression matplotlib numpy pandas rfe seaborn sklearn

Last synced: 09 Feb 2026

https://github.com/harmanveer-2546/statistics-for-machine-learning

Statistical tools help you clean and organize your data. You can identify outliers, manage missing values, and ensure your data is in a format that the ML algorithms can understand.

inline matplotlib matplotlib-styles numpy pandas probability python seaborn statistics

Last synced: 18 Apr 2026

https://github.com/baschin1103/neural-network-building-in-python

In this repository is a programm with a pretrained artificial neural network from Building AI. The network predicts the price of a cabin based on it's attributes.

bias building-ai forward-pass functioning identity linear-combinations neural-network numpy python regression relu

Last synced: 16 Apr 2026

https://github.com/foggysmile/ml_labs_itmo

Лабораторные работы по машинному обучению

bert decision-trees jupyter-notebook machine-learning numpy pandas random-forest xgboost

Last synced: 11 Feb 2026

https://github.com/puureya2/catalogue-price-scraper

Scrape & store product prices with Selenium & BeautifulSoup4. Visually represent data with MatPlotLib,Personal Project

beatifulsoup4 chrome-webdriver matplotlib numpy pandas python python-csv selenium webscraping

Last synced: 16 Apr 2026

https://github.com/khinthandarkyaw98/python_for_engineers

This particular Python notebook is designed to provide Engineers with an opportunity to practice scientific computations.

engineering numpy python scientific-computing youtube

Last synced: 16 Apr 2026

https://github.com/cherukuri-thanu/conjoint-analysis-netflix

This repository contains configuration files for performing conjoint analysis on Netflix survey data.

conjoint-analysis logistic-regression matplotlib-pyplot netflix numpy pandas squantify statsmodels

Last synced: 16 Apr 2026

https://github.com/mindkerchief/baselineml

A collection of machine learning task performed during my studies in computer science major in intelligent system.

decision-tree dummy gaussian-mixture-models kmeans-clustering linear-regression logistic-regression machine-learning matplotlib numpy pandas random-forest scikit-learn seaborn tensorflow

Last synced: 16 Apr 2026

https://github.com/mustafaozvardar/image-classification-webapp

This project features an AI model for diagnosing pneumonia from chest X-ray images. Users can upload a chest X-ray image through the app, and the model will classify it.

artificial-intelligence computer-vision keras numpy python streamlit streamlit-webapp tensorflow

Last synced: 27 Feb 2026

https://github.com/adriantomin/heart-disease-prediction

Predicting Heart Disease with ML 🏥🤖 Machine learning model to predict heart disease using clinical data. Includes data analysis, feature engineering, and model training.

machine-learning matplotlib numpy pandas python sklearn

Last synced: 12 Feb 2026

https://github.com/walkerdustin/python-linear-regression

simple linear regression in python based on coursera andrew ng course

ai linear-regression machine-learning ml numpy python

Last synced: 05 May 2026

https://github.com/hansalemaos/np_asciiart

Ascii art with numpy

ascii numpy opencv python

Last synced: 16 Apr 2026

https://github.com/mouradhamzaoui/tweets_sentiments_emotions_analysis

Les réseaux sociaux, et en particulier Twitter, sont des plateformes essentielles où les utilisateurs expriment leurs sentiments et émotions au quotidien. Analyser les tweets peut fournir des informations précieuses sur l’humeur collective et les préoccupations des gens. Ce projet utilise un dataset de tweets disponible sur Kaggle

nltk numpy pandas sklearn

Last synced: 16 Apr 2026

https://github.com/bharatsharma07/ipl-analysis

This project provides insights into the IPL's historical data, helping to identify trends, player performance, and team strategies.

data-visualization ipl-data-analysis numpy pandas python

Last synced: 16 Apr 2026

https://github.com/yalai92/alfalfa_imp_exp_analysis

This repository covers data cleaning, analysis, and visualization of global alfalfa and pellet imports, focusing on trends from 2003 to 2023. It also includes a predictive analysis of global alfalfa demand for 2024-2029, using data science techniques to provide insights for stakeholders in the alfalfa industry.

data-analysis data-cleaning data-visualization matplotlib numpy pandas python sckiit-learn tableau

Last synced: 12 Feb 2026

https://github.com/shakhthi/mlproject

This project is an end-to-end ML solution that predicts math scores from student attributes. It starts with ingesting and preprocessing data, followed by feature engineering to pinpoint key predictors. A robust model is then trained and fine-tuned before being seamlessly deployed to deliver real-time predictions.

jupyter-notebook matplotlib-pyplot numpy pandas python3 seaborn sklearn

Last synced: 16 Apr 2026