An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/heshanera/paclassifier

Online Passive Aggressive Binary Classifier

binary-classifier numpy passive-aggressive python

Last synced: 06 May 2026

https://github.com/carrington-115/ppt-generator-from-excel

PPT Generator for Excel file

numpy oop pandas python python-pptx

Last synced: 10 Feb 2026

https://github.com/dipeshgoyal013/ipl_win_probability

A project which help you to check win probability of batting team in inning 2nd

machine-learning matplotlib numpy pandas python sklearn

Last synced: 05 Feb 2026

https://github.com/felixcharotte/housepredictions_tatahousing

Data science project of real estate price prediction website, deployed using a Flask server on AWS, incorporating machine learning for price prediction and a UX HTML/CSS/JavaScript for website building.

aws flask jupyter linear-regression numpy pandas python

Last synced: 09 Feb 2026

https://github.com/nafisalawalidris/tools-for-data-science

It covers popular languages (Python, R, SQL) and libraries (NumPy, Pandas) used in the field. The author shares their objectives of teaching data analysis, web development, and critical thinking skills. The repository also includes code examples, explanations of arithmetic expressions, and contact information for the author.

arithmetic-expressions data-analysis data-science data-visualization languages libraries matplotlib numpy pandas programming python r sql tools web-development

Last synced: 11 Apr 2026

https://github.com/ujstor/streamlit-working-hours

The analysis pipeline involves combining the data forms, performing data cleaning, and starning streamlit server for statistical analysis.

data-pipeline numpy pandas statistics streamlit

Last synced: 06 May 2026

https://github.com/yangeorget/clustering-algorithms

K-means and C-means experiments with Numpy

clustering clustering-algorithm numpy

Last synced: 02 Apr 2026

https://github.com/py-mp3/boston-comparing-regression-modules

A ipynb file that compares different regression modules in the sk.learnit library.

ipython-notebook numpy pandas python regression-models seaborn sklearn

Last synced: 09 Feb 2026

https://github.com/pr38/cox_ph_estimation_notebooks

Personal discovery work on estimating Cox Proportional hazards coefficients for for both breslow and efron ties, using both autograd and directly calculating the gradient and hessian

cox-regression dask data-science machine-learning numpy pytensor statistics survival-analysis

Last synced: 15 Apr 2026

https://github.com/fiercex/toynn

Python实现神经网络

mnist-classification neural-network numpy python

Last synced: 15 May 2026

https://github.com/prabath1998/rice_leaf_disease_ditection_system

This project is a Paddy Crop Disease Detection System that uses machine learning to identify diseases in paddy (rice) leaves. Farmers can upload an image of a paddy leaf, and the system will predict whether the leaf is healthy or affected by a specific disease

flask joblib numpy opencv pillow sklearn

Last synced: 15 Apr 2026

https://github.com/yc1909/ai_impact_on_jobs

Impact of Artificial Intelligence on Jobs and Workplace-Analysis

analytics exploratory-data-analysis matplotlib numpy pandas seaborn

Last synced: 06 May 2026

https://github.com/upul/indi

Small Machine Learning library written in Python

algorithms machine-learning numpy python scipy

Last synced: 10 Apr 2026

https://github.com/khaymanii/customer_segmentation_model

This model was built using Python and KMeans Clustering algorithm

matplotlib numpy pandas python sckit-learn seaborn

Last synced: 06 May 2026

https://github.com/elissorokin/data-analyst-portfolio-rus

Это репозиторий, в котором я демонстрирую свои навыки, делюсь проектами и отслеживаю прогресс в области анализа данных и Data Science.

ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis

Last synced: 25 Feb 2026

https://github.com/nguyenanhtuan1912/computer_vision_exercises

Trong này sẽ chứa các file source về bài tập Computer Vision.

computer-vision numpy opencv python

Last synced: 12 Apr 2026

https://github.com/ryantusi/ai_ml_fundamentals

Welcome to my GitHub repository showcasing projects completed during the AI/ML Foundation course from Codecademy. This repository contains a collection of projects that demonstrate my skills and knowledge in the field of artificial intelligence and machine learning.

artificial-intelligence data-science data-visualization exploratory-data-analysis foundation machine-learning numpy pandas python

Last synced: 14 Apr 2026

https://github.com/hansalemaos/permutation_no_repetition

Permutation without repetition - written in Cython - for Numpy

cython data-science no-repetitions numpy pandas permutations

Last synced: 14 Jan 2026

https://github.com/meghajbhat/photo-editor

This is a Photo Editor Application with various image processing functionalities using Tkinter for the GUI and OpenCV for image manipulation. The application allows users to apply multiple filters and transformations to images such as Black and White, Pencil Sketch, Sepia, Crop, Sharpen, Contrast, X-ray, and Brighten.

gui numpy opencv opencv-python python python3 tkinter tkinter-gui tkinter-python

Last synced: 31 Jan 2026

https://github.com/quentinwidlocher/cours-python-pandas

Jupyter Lab sur le traitement des données en python

jupyter-lab matplotlib numpy pandas python

Last synced: 13 Apr 2026

https://github.com/briankopp/fewerbytes

compress numpy arrays. binary and numeric techniques for numpy compression

compress compression numpy python python3 storage

Last synced: 18 May 2026

https://github.com/lolosoares/dataanalises

This repository shows my projects made in my journey learning about data analises, ciency, engeniering and ML.

acces data-science dax excel mysql numpy pandas powerbi powerquery python r seaborn sql vba

Last synced: 10 Apr 2026

https://github.com/datarohit/titanic-survival-data--classification

This is a Titanic Survival Data from Kaggle Competition with Multiple Classification Algorithm testing and Detailed Exploratory Data Analysis.

classification-algorithm kaggle-competition machine-learning-algorithms matplotlib-pyplot numpy pandas seaborn sklearn

Last synced: 10 Apr 2026

https://github.com/abhishek010397/logisticregression

Using Logistic Regression Classifier to Predict Target Using Three Features

csv iloc logistic-regression machine-learning numpy pandas-dataframe python3 sigmoid-function sklearn

Last synced: 06 May 2026

https://github.com/ananas1kexe/clap-to-rickroll

A Python-based utility that detects double claps via microphone to automate system actions. It forces volume to 100% via ALSA and triggers a multi-tab browser prank. Uses NumPy for real-time acoustic sharpness analysis.

audio-processing automation claps-detection cybersecurity-prank dsp numpy python rickroll

Last synced: 11 May 2026

https://github.com/sahajrajmalla/co2-calculator-api

A Web Application backend API that enables people to create events, join events, and measure the carbon footprint produced by each participant to get insights about online and onsite carbon emissions.

apis carbon-emissions carbon-footprint fastapi makefile numpy pandas precommit sqlalchemy sqlite3 uvicorn

Last synced: 09 Apr 2026

https://github.com/duoan/dl-bike-sharing-demand

ractice neural network implemented just with numpy for Kaggle Bike Sharing Demand

bike-sharing bike-sharing-demand deep-learning kaggle neural-network numpy udacity-deep-learning

Last synced: 18 Jun 2026

https://github.com/Krisseck/hki-traffic-predict

Keras neural network to predict traffic in Helsinki

avoindata helsinki keras numpy open-data python traffic

Last synced: 13 Jul 2025

https://github.com/gehad-ahmed30/data-analysis

This repository offers a guide to data visualization with Pandas and includes practical examples and two complete projects. Learn to preprocess, analyze, and visualize datasets to solve real-world problems effectively.

analysis matplotlib numpy pandas python seaborn

Last synced: 07 May 2026

https://github.com/programmer-rd-ai/moviedatascraper

Explore the cinematic universe with our IMDb web scraping project! Dive into movie data with ease, uncovering insights from cast to critical reviews. With dynamic visualizations and reliable data, let's journey through the world of movies like never before. Lights, camera, analysis!

beautifulsoup beautifulsoup4 data data-analysis jupyter-notebook matplotlib numpy pandas programming python python3 scraping seaborn software web

Last synced: 01 Mar 2025

https://github.com/miltiadiss/ceid_ne562-data-mining-and-machine-learning-algorithms

This project involves analyzing a dataset through statistics and visualizations to uncover patterns & correlations. It trains classifiers (Neural Networks, Random Forests, Bayesian Networks) to predict physical activities and evaluates their performance. Finally, it clusters participants' activities using 2 algorithms, comparing their effectivenes.

matplotlib numpy pandas seaborn sklearn sklearn-metrics

Last synced: 15 Apr 2026

https://github.com/codewitheshayoutube/corvit_python_data_analytics_course

I teach Python & Data Analytics at Corvit Systems. This repository contains all course materials, including code, assignments, projects, and resources. ⭐ Stay updated!

matplotlib numpy pandas python python3 scikitlearn-machine-learning seaborn

Last synced: 13 Apr 2026

https://github.com/m-aadhi/game-review-sentiment-analysis

This project aims to analyze and predict the sentiment of game reviews using natural language processing (NLP) and machine learning techniques. The objective is to determine whether a review is positive, negative, or neutral based on its content.

matplotlib numpy pandas-library python

Last synced: 19 Jun 2026

https://github.com/abhihirekhan/abhi-s-data-science-portfolio

A list of data science projects completed by me for academic, self learning, and creative purposes

datascience ipynb-jupyter-notebook jupyter jupyter-kernels jupyter-notebook keras-tensorflow machine-learning numpy pandas portfolio python tensorflow

Last synced: 31 Jan 2026

https://github.com/yashika-malhotra/micromobility-service-provider---hypothesis-testing

Examined factors influencing demand for micro-mobility shared electric cycles Performed exploratory analysis and hypothesis testing, revealing the distinct influence of weather-season association on hourly counts

colab-notebook data-visualization eda exploratory-data-analysis hypothesis-testing jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python scipy-library scipy-stats seaborn skit-learn

Last synced: 12 Apr 2026

https://github.com/ejw-data/proj-nba-eda

NBA exploratory data analysis project with Pandas and Tableau

nba numpy pandas python tableau

Last synced: 14 Apr 2026

https://github.com/harsh8088/py_cluster

Grouping in data values.

numpy python sklearn

Last synced: 11 Apr 2026

https://github.com/ayaanjawaid/brain_stroke_prediction

project aims to predict the likelihood of a stroke based on various health parameters using machine learning models. The dataset is preprocessed, analyzed, and multiple models are trained to achieve the best prediction accuracy.

decision-trees exploratory-data-analysis matplotlib numpy pandas python regression xgboost

Last synced: 07 May 2026

https://github.com/nirmalyabag20/crop-yield-prediction-using-machine-learning

This project uses machine learning to predict crop yields based on factors like region, crop type, rainfall, temperature, and pesticide use. By analyzing a dataset of over 28,000 records, the models provide accurate yield forecasts, helping optimize farming decisions and resource management, ultimately contributing to sustainable agriculture.

jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn

Last synced: 06 Feb 2026

https://github.com/akashash01/akash_linear-regression

This is an linear approach machine learning model used to predict the values of variable(dependent) based on other variables(independent).

anova-test autocorrelation data-science heteroscedasticity linear-regression machine-learning mape multicollinearity numpy pandas python supervised-learning

Last synced: 07 May 2026

https://github.com/mohamed3nan/udacity

Udacity Data Analysis Nanodegree Program

data-analysis data-visualization numpy pandas python

Last synced: 10 Apr 2026

https://github.com/guilhermebit/movie-suggestions-using-k-nearest-neighbors

This project implements a movie recommendation system using the K-Nearest Neighbors (KNN) machine learning algorithm.

algorithms data-science euclidean-distance knn machine-learning matplotlib numpy pandas python

Last synced: 12 Apr 2026

https://github.com/rishraks/color-object-detection

This project enables real-time detection of user-defined color objects using a webcam. Users enter a color name, and the program identifies objects matching that color by converting it to its HSV range, applying a mask, and detecting contours. It's a practical tool for object tracking, sorting, and interactive applications.

computer-vision numpy opencv pillow-library

Last synced: 11 Apr 2026

https://github.com/fayzi-dev/machin_learning

Machin Learning Full Algorithm (Linear Regression, Decision tree, Random forest, Neural network ,Logistic regression ,Support vector machine ,Naive Bayes ,Clustering, XGBoost,DBscan,KMeans)

algorithms artificial-neural-networks logistic-regression machine-learning matplotlib matplotlib-pyplot naive-bayes-classifier numpy pandas python python3 seaborn seaborn-plots sklearn sklearn-knn sklearn-library sklearn-linear-model sklearn-linear-regression sklearn-metrics sklearn-svm

Last synced: 09 Apr 2025

https://github.com/yash22222/sync-intern-s-ml-tasks

SYNC INTERN'S Machine Learning internship will offer you to enhance your skills by doing real-life example projects. This internship will increase your knowledge in the field of data and algorithms to understand how a machine learns.

bhpp boston-house-datasets boston-house-price-prediction boston-house-pricing data data-structures machine-learning machine-learning-algorithms numpy pandas sync-intern sync-interns

Last synced: 07 May 2026

https://github.com/abhishekyana/zframework

Z Framework for Easy Deep Learning

deep-learning machine-learning numpy python tensorflow

Last synced: 24 Jan 2026

https://github.com/henriqueotogami/curso-de-python3-udemy

Documentos do curso de Python 3, com arquivos das aulas, e Jupyter Notebook. Conceitos de Machine Learning.

iris-dataset jupyter-notebook linear-regression numpy pandas-python python3 udemy

Last synced: 08 May 2026

https://github.com/kunalrrathod/placement-predictor

A web application that is using machine learning technology to deliver results ensuring the students are getting placed in campus placements

bootstrap4 flask-application jquery knearest-neighbor-algorithm machine-learning numpy pandas-dataframe python3 sklearn

Last synced: 15 Apr 2026

https://github.com/SudipBishwakarma/Bulk-Remove-OneSignal-Users

Bulk remove OneSignal subscribed users blazingly faster with parallel processing.

numpy pandas python python-logger python-multiprocessing python-threading python3

Last synced: 31 Mar 2025

https://github.com/mohamedawnallah/covid19-dataanalysis-python

how to preprocess and merge datasets to calculate needed measures and prepare them for an Analysis. In this project, we are going to work with the COVID19 dataset, published by John Hopkins University, which consists of the data related to the cumulative number of confirmed cases, per day, in each Country. Also, we have another dataset consist of various life factors, scored by the people living in each country around the globe. We are going to merge these two datasets to see if there is any relationship between the spread of the virus in a country and how happy people are, living in that country.

covid19 dataanalysis datavisualization happiness-report jupyternotebook numpy pandas python

Last synced: 08 May 2026

https://github.com/ola-kaznowska/ai-digit-recognition

AI digit recognition. Artificial neural network for digit recognition

ai artificial-intelligence machine-learning neural-network numpy tensorflow

Last synced: 14 Apr 2026

https://github.com/MITHILESHK11/Steganography-App-Aicte-Internship-Cybersecurity-Edunet-Foundation

Secure Image Steganography App : This Streamlit-based Steganography App lets you securely encrypt and decrypt messages in images using the Least Significant Bit (LSB) technique. Messages remain invisible unless unlocked with the correct password.

hashlib numpy opencv3 python streamlit-webapp

Last synced: 24 Oct 2025

https://github.com/octcarp-courses/sustech_cs303-ai_f23_works

(49.8/50) Course code of CS303 2023 Fall: Artificial Intelligence, SUSTech. Taught by Prof. Ke TANG.

numpy pytorch sustech

Last synced: 08 May 2026

https://github.com/nemeslaszlo/sale-price-of-bulldozers

The goal of predicting the sale price of bulldozers. How well can we predict the future sale price of a bulldozer, given its characteristics previous examples of how much similar bulldozers have been sold for? (Archive kaggle competition)

matplotlib numpy pandas random-forest-regressor regression scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/serhatderya/fall_detection

This project implements a computer vision-based fall detection system using deep learning techniques. The system can detect whether a person has fallen in images or video frames.

ai artificial-intelligence artificial-neural-networks computer-vision deep-learning deep-neural-networks deeplearning falldetection matplo neural-network numpy opencv pil pillow pytorch tensorflow yolo yolov5 yolov8

Last synced: 11 Apr 2026

https://github.com/poornesh-v/bank-customer-churn-prediction-system

The repository is meant to predict customer churn in a banking dataset based on transaction history and demographics.

jupyter-notebook matplotlib numpy pandas-python python seaborn

Last synced: 11 Apr 2026

https://github.com/kirkalyn13/open-signal-report-generator

Script used to generate results/summary, including the trends of flagged provinces, from the raw excel data file,

data-analysis data-science data-visualization matplotlib numpy pandas python

Last synced: 19 Jun 2026

https://github.com/raulpy271/videoinascii

🖼️ this program show a video in terminal with ascii-art style

ascii-art ffmpeg numpy python video-processing

Last synced: 08 May 2026

https://github.com/mark0011astra/simplecuda

CUDAを使用したGPU演算をNumPyと同様のインターフェースで簡単行えるライブラリ。A library that allows users to easily perform GPU operations using CUDA with a NumPy-like interface.

cuda cupy gpu machine-learning numpy python vector

Last synced: 02 May 2026

https://github.com/nemeslaszlo/loan-data-decision-tree-and-random-forest

This repository covers a "Lending Club" analysis with decision tree and random forest with data visualizations and exploration.

decesion-trees matplotlib numpy pandas random-forest seaborn sklearn

Last synced: 17 Apr 2026

https://github.com/neetigyab/pdfreader

Ready to use Python application/file for parsing a specific format of pdf form, and storing relevant user data in a tabular format in excel sheet

excel forms matplotlib numpy ocr opencv-python pandas pdf pdf-converter pdfplumber pytesseract python

Last synced: 08 May 2026

https://github.com/kunalshelke90/predict-bank-credit-risk-using-south-german-credit-data

This is an end-to-end ML project, which aims at developing a classification model for the problem of classifying a given customer profile into either of the risk category (safe or not safe). The final classifier used for this project is CatBoost classifier. Deployed in AWS.

aws cassandra catboost-classifier classification credit-risk data data-science dataanalysis dockerfile finance financial-analysis flask github-actions logging machine-learning mlflow numpy pandas python

Last synced: 03 Jan 2026

https://github.com/matx64/rs-netbot

Old School Runescape bot with CNN for object identification

cuda numpy python pytorch

Last synced: 04 May 2026

https://github.com/achalesh27022003/python-assignments-iwt

This repository is objected to have all the assignments and project related to IWT (Semester - 3).

cosinewave frame grid image imagetk labels matplotlib-pyplot numpy numpy-arrays numpy-library photoimage pil python3 sinewave spyder tkinter tkinter-gui window

Last synced: 10 Apr 2026

https://github.com/harmanveer-2546/heart-failure-prediction

Heart failure is a severe condition in which the heart is unable to pump blood effectively. Early prediction of heart failure can significantly improve patient outcomes. This project aims to build a predictive model using machine learning techniques to identify patients at risk of heart failure.

exploratory-data-analysis feature-engineering jupyter-notebook matplotlib model-training-and-evaluation numpy pandas python sckiit-learn seaborn visualization-of-results

Last synced: 14 Apr 2026

https://github.com/camilajaviera91/apache-beam-pipeline-first-approach

This code demonstrates how to integrate Apache Beam with scikit-learn datasets and perform simple data transformations. It loads the Linnerud dataset from scikit-learn, converts it into a Pandas DataFrame for easier manipulation.

apache-beam dataframes glob kmeans-clustering matplotlib-pyplot mean-absolute-error mean-square-error numpy os pandas pipelines scipy-stats seaborn silhouette-score sklearn sklearn-datasets standardscaler

Last synced: 28 Apr 2026

https://github.com/idow09/ud810-problem-sets

Solutions to the problem sets of Introduction to Computer Vision course at Udacity

computer-vision numpy opencv opencv-python problem-sets python solutions udacity

Last synced: 09 May 2026

https://github.com/anishshinde01/machine-learning-exercises

Python implementations of machine learning, statistics, and mathematical foundations.

linear-algebra machine-learning machine-learning-algorithms matplotlib numerical-analysis numpy python scikit-learn scipy statistics

Last synced: 11 Jun 2026

https://github.com/michael-insights/portfolio

This repository showcases my projects and skills in Data Analytics, Data Science, and Machine Learning. It includes hands-on work in data analysis, predictive modeling, and machine learning algorithms, aimed at solving real-world problems.

data-analytics data-science data-visualization datapreprocessing jupyter-notebooks machine-learning matplotlib numpy pandas predictive-modeling python scikit-learn sql

Last synced: 11 Apr 2026

https://github.com/khaymanii/medical_insurance_cost_prediction-_model

This Model was built using Python and Linear Regression algorithm

matplotlib numpy pandas python sckit-learn seaborn

Last synced: 09 May 2026

https://github.com/haloapping/sharing-tipis-tipis

#sharingtipistipis 📗 catatan ketika belajar machine learning, deep learning dan masih banyak lagi.

caring deep-learning machine-learning numpy pandas scikit sharing

Last synced: 09 May 2026

https://github.com/rubinlake/rl-academy-data-analytics

Educational data analysis project demonstrating BMW sales data analysis with AI-powered code assistance using Cursor IDE and Jupyter notebooks

cursor-ide data-analysis educational-project jupyter langchain matplotlib numpy pandas python scipy seaborn

Last synced: 09 May 2026

https://github.com/nirmalyabag20/wine-quality-prediction-machine-learning

This project analyzes the chemical properties of wines to identify key factors influencing quality. By leveraging machine learning techniques, i aim to develop predictive models that accurately classify wine quality, providing valuable insights for producers and enthusiasts alike.

k-neighbors-classifier logistic-regression machine-learning matplotlib numpy pandas python random-forest seaborn svc

Last synced: 09 Apr 2026

https://github.com/mothnik/robust_fourier

Noise- and Outlier-Robust Fourier Transform with Hermite Functions with NumPy and Numba

chebyshev-polynomials fourier-transform hermite-functions hermite-polynomials least-square-regression logsumexp numba numpy python robust-statistics

Last synced: 04 Jan 2026

https://github.com/arjunan-k/netflix_analysis

Exploratory Data Analysis on Netflix Movies & TV Shows.

matplotlib numpy pandas plotly seaborn

Last synced: 09 May 2026

https://github.com/rixiiz/using-knn-to-predict-the-obp-of-mlb-players

Using KNN to predict the On Base Percentage (OBP) of Major League Baseball (MLB) players at the end of the season

artificial-intelligence dataset f1-score jupyter-notebook knn-regression machine-learning matplotlib mse numpy pandas python scikit-learn supervised-learning

Last synced: 05 Apr 2026

https://github.com/nemeslaszlo/dog_breed_identification

Dog breed identification model to identify and label 120 different dog breeds.

matplotlib numpy pandas tensorflow tensorflow-hub

Last synced: 13 Apr 2026

https://github.com/mithileshk11/steganography-app-aicte-internship-cybersecurity-edunet-foundation

Secure Image Steganography App : This Streamlit-based Steganography App lets you securely encrypt and decrypt messages in images using the Least Significant Bit (LSB) technique. Messages remain invisible unless unlocked with the correct password.

hashlib numpy opencv3 python streamlit-webapp

Last synced: 28 Apr 2026