An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language that adds support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/abhinav330/data-science-projects

The Data Science Projects folder contains various projects related to data analysis, machine learning, and exploratory data analysis (EDA). Each project focuses on a specific dataset and utilizes different techniques and algorithms to extract insights and make predictions.

decision-trees exploratory-data-analysis k-means-clustering k-nearest-neighbours kaggle linear-regression logistic-regression machine-learning machine-learning-algorithms matplotlib numpy pandas plotly python random-forest-classifier support-vector-machines

Last synced: 30 May 2026

https://github.com/kwanit1142/sars-cov-2-detection-radiology-based-multi-modal-multi-task-framework

Keras-based Implementation for "SARS-CoV-2 Detection: Radiology based Multi-modal Multi-task Framework" (Accepted in 45th IEEE EMBC 2k23)

gans keras matplotlib numpy opencv os pandas pillow scipy sklearn tensorflow transfer-learning wasserstein-distance

Last synced: 14 Apr 2026

https://github.com/mgobeaalcoba/survival_predictor_on_the_titanic_scikit_learn

Titanic Survival Predictor using Scikit-Learn: Machine learning model and analysis to predict passenger survival on the Titanic based on historical data.

matplotlib numpy pandas python3 scikit-learn seaborn titanic-dataset titanic-kaggle titanic-survival-prediction

Last synced: 10 Apr 2026

https://github.com/ngirimana/decision-trees-and-random-forest-

For this project we will be exploring publicly available data from [LendingClub.com](www.lendingclub.com). Lending Club connects people who need money (borrowers) with people who have money (investors). Hopefully, as an investor you would want to invest in people who showed a profile of having a high probability of paying you back. We will try to create a model that will help predict this. Lending Club had a [very interesting year in 2016](https://en.wikipedia.org/wiki/Lending_Club#2016), so let's check out some of their data and keep the context in mind. This data is from before they even went public. We will use lending data from 2007-2010 and try to classify and predict whether or not the borrower paid back their loan in full. You can download the data from [here](https://www.lendingclub.com/info/download-data.action) or just use the csv already provided. It's recommended you use the csv provided as it has been cleaned of NA values.

Here is what the columns represent:

* credit.policy: 1 if the customer meets the credit underwriting criteria of LendingClub.com, and 0 otherwise.
* purpose: The purpose of the loan (takes values "credit_card", "debt_consolidation", "educational", "major_purchase", "small_business", and "all_other").
* int.rate: The interest rate of the loan, as a proportion (a rate of 11% would be stored as 0.11). Borrowers judged by LendingClub.com to be more risky are assigned higher interest rates.
* installment: The monthly installments owed by the borrower if the loan is funded.
* log.annual.inc: The natural log of the self-reported annual income of the borrower.
* dti: The debt-to-income ratio of the borrower (amount of debt divided by annual income).
* fico: The FICO credit score of the borrower.
* days.with.cr.line: The number of days the borrower has had a credit line.
* revol.bal: The borrower's revolving balance (amount unpaid at the end of the credit card billing cycle).
* revol.util: The borrower's revolving line utilization rate (the amount of the credit line used relative to total credit available).
* inq.last.6mths: The borrower's number of inquiries by creditors in the last 6 months.
* delinq.2yrs: The number of times the borrower had been 30+ days past due on a payment in the past 2 years.
* pub.rec: The borrower's number of derogatory public records (bankruptcy filings, tax liens, or judgments).
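
To make three of the column definitions concrete, here is a small sketch that derives `int.rate`, `log.annual.inc`, and `dti` exactly as the description words them. The borrower figures and the helper name `derive_features` are hypothetical and are not taken from the dataset or the repository.

```python
import math


def derive_features(annual_income: float, total_debt: float, rate_percent: float) -> dict:
    """Compute three columns as the column descriptions define them."""
    return {
        # int.rate is stored as a proportion, so 11% becomes 0.11
        "int.rate": rate_percent / 100,
        # log.annual.inc is the natural log of the annual income
        "log.annual.inc": math.log(annual_income),
        # dti is the amount of debt divided by annual income
        "dti": total_debt / annual_income,
    }


# Made-up borrower: 60,000 income, 12,000 debt, 11% interest rate
features = derive_features(annual_income=60_000, total_debt=12_000, rate_percent=11)
print(features)
```

Storing income on a log scale compresses the long right tail of incomes, which is presumably why the dataset ships that column instead of the raw value.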

artificial-intelligence decision-tree-classifier machine-learning matplotlib numpy pandas python3 random-forest-classifier seaborn

Last synced: 11 Apr 2026

https://github.com/pradipece/insurance_data_analysis_ml

This project defines the terms machine learning and linear regression (an ML algorithm) in the context of real-time problem-solving.

data-science data-visualization database machine-learning matplotlib numpy pandas python python3

Last synced: 11 Apr 2026

https://github.com/burakkagann/pixels2genai-2.0

Free, open-source curriculum teaching generative art and AI across 15 modules, from a single pixel to diffusion models.

computer-vision creative-coding curriculum deep-learning design-based-research diffusion-models education gans generative-ai generative-art machine-learning numpy pytorch touchdesigner vae

Last synced: 19 Jun 2026

https://github.com/zazi2002/neuroscience-project-cnn

This project is part of the Neuroscience of Learning, Memory, Cognition course and focuses on the visual processing pathways in the brain, particularly the dorsal and ventral streams.

cnn-for-visual-recognition learning matplotlib numpy scipy visual-processing

Last synced: 09 May 2026

https://github.com/sergio11/breast_cancer_diagnosis_mlp

🩺🔬 MLP-based Breast Cancer Diagnosis: Predicts tumor malignancy from image features, aiding in early detection. 📊🤖

deep-learning deep-neural-networks machine-learning mlp-classifier mlp-networks numpy sklearn sklearn-classify

Last synced: 21 Feb 2026

https://github.com/thevarunsharma/similar-word-finder

A web application with Python backend which finds words similar to a given word, using pre-trained GloVe word embeddings.

flask-application machine-learning nlp numpy python word-embeddings

Last synced: 07 May 2026

https://github.com/aakashks/ct-reconstruction-2d

Python Implementation of Computed Tomography (CT) reconstruction for 2D images

computed-tomography ct-reconstruction numpy

Last synced: 09 Aug 2025

https://github.com/affromero/pixelcache

A versatile image processing library for Python with built-in support for caching, using Pillow, NumPy, and PyTorch.

image-processing numpy opencv pillow pytorch torchvision

Last synced: 07 May 2026

https://github.com/mothnik/robust_fourier

Noise- and Outlier-Robust Fourier Transform with Hermite Functions with NumPy and Numba

chebyshev-polynomials fourier-transform hermite-functions hermite-polynomials least-square-regression logsumexp numba numpy python robust-statistics

Last synced: 04 Jan 2026

https://github.com/qua77i20/ai2

A simple single-layer neural network (perceptron) built in Python. Demonstrates backpropagation training from scratch for beginners.

ai backpropagation data-science deep-learning machine-learning neural-network numpy perceptron python

Last synced: 15 May 2026

https://github.com/ahlem-phantom/django-travel-agency

Travel Agency website built using Django, designed for browsing and booking of travel packages.

beautifulsoup celery djagno django-rest-framework numpy rabbitmq weasyprint

Last synced: 02 Aug 2025

https://github.com/kaikymoura/backcourt-api

Repository for the Basketball Advanced Stats API

fastapi nba-api nba-stats numpy pandas python swagger uvicorn

Last synced: 03 May 2026

https://github.com/kientech/calories-burnt-prediction-with-machine-learning

The Calories Burnt Prediction with Machine Learning project aims to predict the number of calories burned during physical activities based on various factors such as duration, activity type, age, weight, and more.

fastapi heroku machine-learning matplotlib-pyplot modeling numpy pandas python reactjs tailwindcss

Last synced: 02 Aug 2025

https://github.com/harmanveer-2546/diabetes-prediction-using-machine-learning

Diabetes is a medical disorder that affects how the body uses food for energy. When blood sugar levels rise, the pancreas releases insulin. If diabetes is not managed, blood sugar levels can rise, increasing the risk of heart attack and stroke. We used Python machine learning to forecast diabetes.

adaboost decision-trees gradient-boosting-classifier gridsearchcv knn-classification machine-learning numpy os pandas pickle python random-forest seaborn svc-model

Last synced: 14 Apr 2026

https://github.com/dantasl/probabilidade-analise

Analysis of student performance in the course Resolução de Problemas Matemáticos Para Tecnologia da Informação (RPMTI, Mathematical Problem Solving for Information Technology) in the 2015.2, 2016.1, and 2016.2 semesters.

data-science numpy pandas probability

Last synced: 13 Apr 2026

https://github.com/rusiruchapana/blood-group-prediction

Creating a machine learning project to predict blood groups from fingerprint patterns

asp-net-web-api keras matplotlib numpy opencv-python pandas pillow scikit-learn tensorflow

Last synced: 11 Apr 2026

https://github.com/jabhij/eda_experiments

In this repo I'll use different types of datasets to explore and implement various Exploratory Data Analysis (EDA) approaches.

ames-housing analysis battery-life blackfriday-analysis data-analysis data-science data-visualization eda matplotlib-pyplot numpy pandas python seaborn visualization zomato-data-analysis

Last synced: 14 Apr 2026

https://github.com/rhazra-003/sketcher

A simple Python script that turns any given photo into a sketch using OpenCV and NumPy.

numpy opencv python3

Last synced: 11 May 2026

https://github.com/sudipbishwakarma/bulk-remove-onesignal-users

Bulk remove OneSignal subscribed users blazingly fast with parallel processing.

numpy pandas python python-logger python-multiprocessing python-threading python3

Last synced: 07 May 2026

https://github.com/akash-rajak/image-noise-remover

Python script to remove noise from an image.

cv2 image-noise-remover numpy os python3 random tkinter

Last synced: 28 Apr 2026

https://github.com/m-rishab/housing-price-prediction

The Housing Price Prediction Accuracy Improvement project is a data-driven initiative focused on enhancing the precision and reliability of housing price predictions. This project encompasses a multidisciplinary approach, combining data science, machine learning, and real estate insights to optimize the accuracy of forecasts in the housing market.

gridsearchcv hyperparameter-tuning matplotlib matplotlib-pyplot numpy pandas pickle python3 randomforestregressor scipy sklearn

Last synced: 11 Apr 2026

https://github.com/amorehead/ml-fundamentals

A self-educational series of Python scripts made to explore the vast library of machine learning knowledge.

machine-learning numpy pandas python scipy sentdex tensorflow tflearn

Last synced: 13 Apr 2026

https://github.com/yash22222/sync-intern-s-ml-tasks

SYNC INTERN'S Machine Learning internship offers you the chance to enhance your skills through real-life example projects. This internship will increase your knowledge in the field of data and algorithms to understand how a machine learns.

bhpp boston-house-datasets boston-house-price-prediction boston-house-pricing data data-structures machine-learning machine-learning-algorithms numpy pandas sync-intern sync-interns

Last synced: 07 May 2026

https://github.com/gourab-sinha/machine_learning

This repository consists of Machine Learning Concepts and Projects.

classification machine-learning neural-network numpy pandas sklearn-library

Last synced: 09 Apr 2026

https://github.com/dmarks84/coursework_coursework_project_automobile-sales-visualization

Project for IBM Data Science course on Visualization & Dashboards -- Analyzed historical sales data, performing EDA and setting up an interactive dashboard

communication dash dashboards data-modeling elt etl folium matplotlib numpy pandas pipelines plotly python scipy seaborn visualization

Last synced: 10 Apr 2026

https://github.com/hansalemaos/numpycythonpermutations

Permutations, Combinations and Product for Numpy - written in Cython - 20x faster than itertools

c combinations cython fast itertools numpy permutations product python

Last synced: 08 May 2026

https://github.com/anvesham/machinelearning_models_using_matlab

Stepwise Multiple Linear Regression (With Interactions) and Random Forest Regression on predicting the Productivity of the Garment Factory Workers

bagging-ensemble bayesian-optimization decision-trees jupyter-notebook machine-learning matlab multiple-linear-regression-model numpy python random-forest-regression

Last synced: 11 Apr 2026

https://github.com/mattdelaune/saas_cohort_analysis

This project analyzes customer segmentation and behavior using data science and cohort analysis. Key metrics like CRR, NRR, CLR, and CLV are examined through detailed charts, including the cohort layer cake and CLR vs. CLV cost efficiency analysis. Exploratory Data Analysis and systematic data manipulation reveal actionable insights.

anaconda git jupyter-notebook matplotlib numpy pandas python saas-data sales-data sales-data-analysis-project seaborn

Last synced: 03 Jan 2026

https://github.com/savernish/forgenn

forgeNN is an in-development, purpose-built neural network framework combining a transparent NumPy autograd engine with a Keras-like API and performance-oriented primitives. Developed by a college student with an ambitious feature pipeline.

artificial-intelligence deep-learning keras machine-learning mlp-networks neural-network numpy pytorch tensorflow tensors

Last synced: 08 Oct 2025

https://github.com/camilajaviera91/prediction-of-housing-prices-using-linear-regression

This project provides tools to search for datasets on Kaggle, download and preprocess them, and perform predictions using a Linear Regression model. It includes interactive text-based user interfaces built with `curses`.

curses kaggle linear-regression matplotlib-pyplot mean-absolute-error mean-square-error numpy pandas pathlib python scikit-learn train-test-split

Last synced: 10 Apr 2026

https://github.com/gandhasiri-rahul-mohan/random-forests-q2-company_data

About the data: Let’s consider a Company dataset with around 10 variables and 400 records.

data-science machine-learning numpy pandas python random-forest seaborn-plots

Last synced: 07 Apr 2026

https://github.com/harmanveer2546/movie-industry

Investigate the film industry to gain a sufficient understanding of what contributes to success, and in turn use this analysis to create actionable recommendations for companies entering the industry.

business business-analytics dataanalysis datetime matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/harmanveer-2546/dust-detection-on-solar-panel-using

The accumulation of dust, snow, bird droppings, etc. on the surface of solar panels reduces the efficiency of the solar modules and hence the amount of energy produced. Monitoring and cleaning solar panels is a crucial task, so developing an optimal procedure to monitor and clean these panels is very important in order to increase module efficiency.

classification dirty inceptionv3-model keras numpy pandas plotly python regularizer sequential solar-panels tensorflow visualization

Last synced: 13 Apr 2026

https://github.com/aymen016/emotion-recognition-using-face-detection

A deep learning-based project that detects human faces and predicts their emotions in real-time using OpenCV and a trained neural network model.

gpu-acceleration haar-cascade-classifier keras-tensorflow matplotlib numpy opencv python

Last synced: 13 Apr 2026

https://github.com/faisal-khann/comprehensive_banking_portfolio

This analysis explores the banking customer dataset to uncover key patterns in account holdings, loan portfolios, income profiles, and transactional behavior. By integrating both financial and demographic variables, we examine customer engagement, lending trends, and risk exposure across different segments.

data-manipulation eda matplotlib numpy pandas powerbi python seaborn

Last synced: 20 Apr 2026

https://github.com/cusyio/datenverarbeitung-mit-numpy

Course on writing and optimizing your own data-processing programs with NumPy.

numpy

Last synced: 08 Jun 2026

https://github.com/prathameshdhande22/data-visualization-tutorial

Repository containing everything required for data visualization: NumPy, Pandas, Matplotlib, and Seaborn tutorials.

data-visualization jupyter-notebook matplolib numpy pandas pandas-dataframe python seaborn tutorial

Last synced: 09 Apr 2026

https://github.com/prasadlokhande-880/imagetransformationtool

This script demonstrates image transformations using OpenCV, including shifting, resizing, rotating with scaling, and skewing for various image processing tasks.

ai datamanipulation ml numpy opencv python

Last synced: 08 May 2026

https://github.com/ki-seki/set_determ

Set deterministic modes of `random`, `torch` and `numpy` to ensure reproducibility. Idea is inspired by https://darinabal.medium.com/deep-learning-reproducible-results-using-pytorch-42034da5ad7
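
A minimal sketch of the general idea, not the repository's actual code: seed Python's `random`, and seed NumPy and PyTorch only when they are installed. The helper name `seed_everything` is hypothetical, and the cuDNN flags shown are one common way to request deterministic behavior in PyTorch.

```python
import importlib.util
import random


def seed_everything(seed: int) -> None:
    """Seed the random number generators available in this environment."""
    random.seed(seed)  # Python's built-in generator

    # NumPy and PyTorch are treated as optional; seed them only if installed.
    if importlib.util.find_spec("numpy") is not None:
        import numpy as np
        np.random.seed(seed)  # legacy global NumPy generator

    if importlib.util.find_spec("torch") is not None:
        import torch
        torch.manual_seed(seed)
        # Ask cuDNN for deterministic kernels; this may slow training down.
        torch.backends.cudnn.deterministic = True
        torch.backends.cudnn.benchmark = False


seed_everything(42)
first = [random.random() for _ in range(3)]
seed_everything(42)
second = [random.random() for _ in range(3)]
print(first == second)  # True: the same seed reproduces the same sequence
```

Seeding alone may not guarantee identical results across hardware or library versions, so treat this as a starting point for reproducibility, not a guarantee.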

deterministic numpy pytorch random-generation reproducibility

Last synced: 08 May 2026

https://github.com/akashash01/logistic-regresion

In contrast to a linear regression model, this model is used to examine the association between independent variables (categorical or continuous) and a binary dependent variable, which is a discrete outcome.

auc-roc-curve classification confusion-matrix data-science kolmogorov-smirnov machine-learning nonlinear-regression numpy pandas python supervised-learning vif woe-and-iv

Last synced: 08 May 2026

https://github.com/vidhi1290/hr_employee_prediction

"Welcome to the HR Employee Promotion Prediction project! This repository contains the code and resources for a machine learning project that focuses on predicting employee promotions. By analyzing various employee attributes, this project aims to provide valuable insights for HR decision-making and talent recognition within organizations.

data-exploration data-science data-visualization docker hr-employee-prediction hyperparameter-tuning machine-learning matplot model-building numpy pandas scikit-learn seaborn streamlit streamlit-webapp

Last synced: 13 Apr 2026

https://github.com/neetigyab/pdfreader

Ready-to-use Python application for parsing a specific PDF form format and storing the relevant user data in tabular form in an Excel sheet.

excel forms matplotlib numpy ocr opencv-python pandas pdf pdf-converter pdfplumber pytesseract python

Last synced: 08 May 2026

https://github.com/nemeslaszlo/titanic-logistic-regression

This notebook covers a simple analysis with logistic regression on the popular Titanic dataset.

cufflinks jupyter-notebook logistic-regression matplotlib numpy pandas plotly seaborn sklearn

Last synced: 09 Apr 2026

https://github.com/gauranshgoel123/predictive-demand-analysis

Demand Forecasting Project: a web application for predicting future demand for part numbers based on historical data. Built with React for the frontend and FastAPI with Python for the backend, this application visualizes demand trends and allows users to input additional data for improved accuracy. In Render, analyzer is the frontend and analysis is the backend.

chartjs data-analysis data-science data-visualization dataset deployment full-stack machine-learning numpy pandas predictive-analysis prophet-model python reactjs render

Last synced: 13 Apr 2026

https://github.com/c0sogi/python-dcm

A high-performance Python package for handling ETAS DCM (Data Conversion Format) files used in engine calibration tools like INCA, MDA, EHANDBOOK, and CANape.

automotive dcm ecu-calibration etas inca numpy pandas python

Last synced: 12 Aug 2025

https://github.com/gauravsakure02/python_libraries

Data Analysis using Python libraries

matplotlib numpy pandas-library streamlit

Last synced: 14 Apr 2026

https://github.com/607011/1dollar

Simulation of a counterintuitive distribution problem

animation go golang matplotlib numpy plot python3 simulation statistics

Last synced: 09 May 2026

https://github.com/lgeiger/mypy-einsum

MyPy Type Checking for NumPy/Jax/PyTorch Einsum Operations

deep-learning einsum jax mypy mypy-plugins numpy pytorch

Last synced: 09 May 2026

https://github.com/khaymanii/wine-quality-prediction-model

A model built using Python and a Random Forest classifier, which is an ensemble supervised learning algorithm.

matplotlib numpy pandas python sckit-learn

Last synced: 09 Apr 2026

https://github.com/muhammad-fiaz/gpt

A simple implementation based on the "Attention is All You Need" paper, using GPT-2 for text generation.

attention-is-all-you-need gpt gpt-2 gpt-3 gpt-implementation gpt-using-pytorch gpt2 numpy open-source paper-implementations python pytorch pytorch-implementation

Last synced: 13 Apr 2026

https://github.com/arshitarora/image-overlaying-prespective

Perform perspective warping and image fusion in Python using OpenCV and NumPy. Select four points on the base image to define the region where the subject image will be warped and fused. The script creates a fused image and saves it as Final_Output.png. 🖼️🌌🔮

image numpy opencv python swap

Last synced: 09 May 2026

https://github.com/sreejabethu/cost-of-living-index-globally

This Streamlit app is a data visualization tool that allows users to explore and compare the cost of living indices across different countries. The app takes in a dataset of cost of living indices for various countries and provides an interactive interface for users to select specific indices and countries to visualize.

bargraphs costofliving costoflivingindex csv datavisualization index matplotlib-pyplot numpy pandas-dataframe plotly python seaborn streamlit-webapp

Last synced: 19 Apr 2026

https://github.com/rhazra-003/fake_news_detector

A Machine Learning model to detect fake news with more than 95% accuracy

fake-news numpy pandas scikit-learn

Last synced: 18 Apr 2026

https://github.com/bielstela/connect-four

NumPy implementation of the game Connect Four (also known as four in a row).

game-2d numpy numpy-exercises python

Last synced: 19 Apr 2026

https://github.com/khaymanii/heart-disease-prediction-model

This repository contains a model built using Python and the logistic regression algorithm.

matplotlib numpy pandas python sckit-learn

Last synced: 18 Apr 2026

https://github.com/assamirzafar/learning

My roadmaps and challenges are in this repo. I will add my Colab and Kaggle notebook links here, along with Python script files.

calculus convolutional-neural-networks deep-learning deep-neural-networks keras linear-algebra machine-learning numpy opencv probability python3 pytorch scikit-learn scipy statistics

Last synced: 05 Apr 2026

https://github.com/ankushmallick1100/Diabetes-Prediction-of-Females-using-Maching-Learning-Techniques

This machine learning project uses various algorithms to predict whether a patient is diabetic: SVM, RF Classifier, DT Classifier, KNN, LR, LR with CV, NB Classifier, and XGB. A website for this work was built with the Python Streamlit library. A paper is in progress.

classification cross-validation decision-tree-classifier diabetes diabetes-prediction jupyter jupyter-notebook logistic-regression machine-learning machine-learning-algorithms numpy pandas python random-forest random-forest-classifier regression streamlit support-vector-machine xgboost xgboost-classifier

Last synced: 29 Sep 2025

https://github.com/rhazra-003/indiebot

A basic chatbot that answers questions about the history of India.

chatbot jupyter-notebook nlp nltk numpy python3 sckit-learn

Last synced: 20 Apr 2026

https://github.com/shreeparab1890/movie-recommender-system

This notebook builds a model that recommends movies based on a given movie and genre. It uses popularity-based recommendation, content-based recommendation, and collaborative-filtering-based recommendation.

bag-of-words cosine-similarity matplotlib numpy pandas python scikit-learn sklearn vectorization

Last synced: 09 Apr 2026

https://github.com/SudipBishwakarma/Bulk-Remove-OneSignal-Users

Bulk remove OneSignal subscribed users blazingly fast with parallel processing.

numpy pandas python python-logger python-multiprocessing python-threading python3

Last synced: 31 Mar 2025

https://github.com/walkerdustin/vergleich-von-messmethoden-fuer-punktwolken

When a physical space is surveyed, the result is a point cloud. This point cloud describes selected points in the space, for example on the walls and the ceiling. When these points are measured in two separate measurements, perhaps even by different devices, the goal afterwards is to determine how closely the point clouds match. There are two fundamentally different methods for this, which are compared here.

3d-models accuracy-metrics data-analysis data-visualization kaggle measure-distance numpy point-cloud pointcloudprocessing punkte python science-research simulation statistics

Last synced: 11 Apr 2026

https://github.com/mdaiyub/covid-19-detection

The emergence of Coronavirus Disease 2019 (COVID-19) in early December 2019 has caused immense damage to health and global well-being. Currently, there are approximately five million confirmed cases and the novel virus is still spreading rapidly all over the world. Chest X-ray images, along with emerging Artificial Intelligence (AI) methodologies, in particular Deep Learning (DL) algorithms, have recently become a worthy choice for early COVID-19 screening. This work proposes a DL-assisted automated method using X-ray images for the early diagnosis of COVID-19 infection. We evaluate the effectiveness of a pre-trained Convolutional Neural Network (CNN) model. The model has been validated on publicly available chest X-ray images, on which the CNN achieves an accuracy of 97.26%. This study will be useful for researchers designing more effective CNN-based models for early COVID-19 detection.

backend early-stopping global keras matplotlib modelcheckpoint numpy os pandas tensorflow

Last synced: 02 Apr 2026