An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/shivendrra/Seeker

This project will help to write new scripts for YouTube videos from scratch using fine-tuned Llama-7B

ai artificial-intelligence finetuning google-custom-search-api google-custom-search-engine llama2 llm numpy pandas python pytorch react-native reactjs transformers youtube-api

Last synced: 30 Dec 2025

https://github.com/camilajaviera91/apache-beam-pipeline-first-approach

This code demonstrates how to integrate Apache Beam with scikit-learn datasets and perform simple data transformations. It loads the Linnerud dataset from scikit-learn, converts it into a Pandas DataFrame for easier manipulation.

apache-beam dataframes glob kmeans-clustering matplotlib-pyplot mean-absolute-error mean-square-error numpy os pandas pipelines scipy-stats seaborn silhouette-score sklearn sklearn-datasets standardscaler

Last synced: 28 Apr 2026

https://github.com/aymen016/data-visualization

A collection of data analysis projects using Python, covering various datasets and domains. Includes EDA, preprocessing, data visualization (Matplotlib/Seaborn), and Power BI dashboards.

dashboard eda jupyter-notebook matplotlib-pyplot matplotlib-python numpy pandas pandas-dataframe powerbi python seaborn seaborn-python visualization

Last synced: 15 Jun 2026

https://github.com/parmeetbhamrah/air-quality-india-analysis

Exploratory data analysis of real-time air quality data from Indian cities using Python, Pandas, Matplotlib, and Seaborn.

air-quality data-analysis eda exploratory-data-analysis government-data india matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/tasninanika/mammographic-masses-analysis-dt

This project uses a Decision Tree Classifier to predict whether a detected mammographic mass is benign (0) or malignant (1) based on input features.

decision-tree-classifier numpy pandas pyhton3 scikit-learn

Last synced: 11 Apr 2026

https://github.com/shanmukhsrisaivedullapalli/cifar10

This project uses TensorFlow to classify images from the CIFAR-10 dataset. It compares the performance of an Artificial Neural Network (ANN) and a Convolutional Neural Network (CNN), covering data preprocessing, model training, evaluation, and prediction on new images.

cifar10-classification cnn-classification matplotlib neural-networks numpy pickle pillow python3 tensorflow

Last synced: 03 Feb 2026

https://github.com/burakkagann/pixels2genai-2.0

Free, open-source curriculum teaching generative art and AI across 15 modules, from a single pixel to diffusion models.

computer-vision creative-coding curriculum deep-learning design-based-research diffusion-models education gans generative-ai generative-art machine-learning numpy pytorch touchdesigner vae

Last synced: 19 Jun 2026

https://github.com/jswong65/machine_learning_nanodegree

Projects of Udacity Machine Learning nanodegree

machine-learning numpy pandas python scikit-learn scipy

Last synced: 09 Apr 2026

https://github.com/virajbhutada/google-stock-price-forecasting-lstm

Analyzing and predicting Google's stock prices through detailed data exploration and advanced LSTM models. This project involves data preprocessing, creating time-series sequences, constructing and training LSTM networks, and evaluating their performance to forecast future stock prices utilizing Python and Machine Learning libraries.

data-analysis data-science data-visualization future-prediction google-dataset google-stock-price-prediction google-stocks lstm-model lstm-neural-network machine-learning machine-learning-models matplotlib model-building model-training numpy python stock-forecasting

Last synced: 27 Feb 2025

https://github.com/rahulvictor12/the-movie-database-data-scrapper

A Python web scraper that collects movie data from The Movie Database (TMDB). It uses `requests`, `BeautifulSoup`, and `pandas` to extract titles, ratings, genres, and cast details from multiple pages. The data is structured into DataFrames and saved as a CSV, perfect for analysis or integration into projects.

beautifulsoup colab-notebook dataframes numpy pandas python requests testing webscraping

Last synced: 11 Apr 2026

https://github.com/vitroid/tilecycles

Tessellate a graph into cycles.

genice2 graph numpy tiling

Last synced: 27 Feb 2025

https://github.com/jayanthsd2003/dms_sarathi_-driver-monitoring_system-

This is a project which can be seen on TATA Trucks and other premium edition cars based and focussed on the safety and monitoring driver Consciousness state

dlib dlib-face-detection numpy opencv-python pygame python pyttsx3 scipy

Last synced: 13 Apr 2026

https://github.com/nikita620/phishing-website-detection-by-machine-learning-techniques

Phishing is one of the familiar attacks that trick users to access malicious content and gain their information. In terms of website interface and uniform resource locator (URL), most phishing webpages look identical to the actual webpages. Various strategies for detecting phishing websites, such as blacklist, heuristic, Etc.

bagging boosting decision-trees numpy pandas python random-forest sckiit-learn sql

Last synced: 10 Apr 2026

https://github.com/rixiiz/knn-model-to-predict-the-obp-of-mlb-players

A machine learning model I built that can predict the On Base Percentage (OBP) of Major League Baseball (MLB) players at the end of the season

artificial-intelligence dataset f1-score jupyter-notebook knn-regression machine-learning matplotlib mse numpy pandas python scikit-learn supervised-learning

Last synced: 01 Jul 2025

https://github.com/ashar97/financial-engineering

Financial Engineering Practicum Assignments

financial-engineering matlab numpy pandas python r yahoo-finance

Last synced: 11 Apr 2026

https://github.com/akash-rajak/image-noise-remover

Python Script to remove noise from image.

cv2 image-noise-remover numpy os python3 random tkinter

Last synced: 28 Apr 2026

https://github.com/yessasvini23/deepfake_immunization_toolkit

🛡️ AI-powered toolkit to detect deepfakes, educate users, and verify content authenticity using federated learning and blockchain. Built for election security, media integrity, and digital literacy.

blockchain matplotlib numpy opencv python pytorch scikit-learn

Last synced: 11 Apr 2026

https://github.com/mtgsoftworks/blink-detecion-player

It is a software that has the ability to stop and play the videos watched on youtube and rewind when the eyes of this software are closed and opened.

dlib numpy opencv-python python3 scipy selenium-webdriver

Last synced: 13 Apr 2026

https://github.com/williyam-m/movie-recommendation-system

Developed a web app with a cosine similarity machine learning model for personalized recommendations based on user history, likes, bookmarks, and activity. Implemented user auth and CRUD operations for movies.

django machine-learning numpy pandas prediction-model python scikit-learn

Last synced: 10 Apr 2026

https://github.com/muhammad-fiaz/gpt

A simple implementation based on the "Attention is All You Need" paper, using GPT-2 for text generation.

attention-is-all-you-need gpt gpt-2 gpt-3 gpt-implementation gpt-using-pytorch gpt2 numpy open-source paper-implementations python pytorch pytorch-implementation

Last synced: 13 Apr 2026

https://github.com/hansalemaos/cythonunique

Fast implementation of unique elements in an array - up to 30x faster than NumPy

faster numpy unique

Last synced: 28 May 2026

https://github.com/iv4n-ga6l/Realtime-Car-Traffic-Tracking

Realtime car traffic tracking using yolov8 model

cartracking numpy opencv python ultralytics yolov8 yolov8n

Last synced: 28 Apr 2025

https://github.com/iv4n-ga6l/OpenPose-Prediction

Person's body parts or joint position detection using openCV

detection numpy opencv prediction prediction-model python

Last synced: 28 Apr 2025

https://github.com/ngirimana/decision-trees-and-random-forest-

For this project we will be exploring publicly available data from [LendingClub.com](www.lendingclub.com). Lending Club connects people who need money (borrowers) with people who have money (investors). Hopefully, as an investor you would want to invest in people who showed a profile of having a high probability of paying you back. We will try to create a model that will help predict this. Lending club had a [very interesting year in 2016](https://en.wikipedia.org/wiki/Lending_Club#2016), so let's check out some of their data and keep the context in mind. This data is from before they even went public. We will use lending data from 2007-2010 and be trying to classify and predict whether or not the borrower paid back their loan in full. You can download the data from [here](https://www.lendingclub.com/info/download-data.action) or just use the csv already provided. It's recommended you use the csv provided as it has been cleaned of NA values. Here are what the columns represent: * credit.policy: 1 if the customer meets the credit underwriting criteria of LendingClub.com, and 0 otherwise. * purpose: The purpose of the loan (takes values "credit_card", "debt_consolidation", "educational", "major_purchase", "small_business", and "all_other"). * int.rate: The interest rate of the loan, as a proportion (a rate of 11% would be stored as 0.11). Borrowers judged by LendingClub.com to be more risky are assigned higher interest rates. * installment: The monthly installments owed by the borrower if the loan is funded. * log.annual.inc: The natural log of the self-reported annual income of the borrower. * dti: The debt-to-income ratio of the borrower (amount of debt divided by annual income). * fico: The FICO credit score of the borrower. * days.with.cr.line: The number of days the borrower has had a credit line. * revol.bal: The borrower's revolving balance (amount unpaid at the end of the credit card billing cycle). * revol.util: The borrower's revolving line utilization rate (the amount of the credit line used relative to total credit available). * inq.last.6mths: The borrower's number of inquiries by creditors in the last 6 months. * delinq.2yrs: The number of times the borrower had been 30+ days past due on a payment in the past 2 years. * pub.rec: The borrower's number of derogatory public records (bankruptcy filings, tax liens, or judgments).

artificial-intelligence decision-tree-classifier machine-learning matplotlib numpy pandas python3 random-forest-classifier seaborn

Last synced: 11 Apr 2026

https://github.com/avijit-jana/used-car-price-prediction

A Streamlit-based application utilizing machine learning to predict used car prices based on features like make, model, year, fuel type, and transmission, enhancing customer experience and streamlining the pricing process.

carpriceprediction data-science jupyter-notebooks machine-learning numpy pandas pickle pricing-data-science python3 streamlit streamlit-webapp webapplication

Last synced: 11 Apr 2026

https://github.com/3rd-son/knn-breast-cancer-prediction-model-

A breast cancer prediction model using KNN with an accuracy of 96%

jupyter-notebook knn-classifier matplotlib numpy pandas python scikit-learn scipy search

Last synced: 11 Apr 2026

https://github.com/alisonmitchell/boston-housing

Investigation of the Boston housing dataset to evaluate, train and test a regression model to predict house prices.

data-science machine-learning matplotlib numpy pandas python scikit-learn scipy seaborn

Last synced: 10 Apr 2026

https://github.com/leabrodyheine/water-pump-status-prediction

This project implements machine learning models to predict the status of water pumps in Tanzania using data from DrivenData's competition. The project includes preprocessing steps, model evaluation using cross-validation, and hyperparameter optimization with Optuna.

argparse cross-validation gradient-boosting-classifier logistic-regression machine-learning multilayer-perceptron numpy optuna pandas random-forest-classifier scikit-learn

Last synced: 11 Apr 2026

https://github.com/suraj-sedai/lstm-deep-learning

A step-by-step guide to mastering sequence prediction using TensorFlow and LSTM. This repository covers everything from basic linear predictions to advanced multi-step forecasting, many-to-many LSTM models, and real-world applications like stock price forecasting and sales prediction.

lstm-neural-networks lstm-sentiment-analysis numpy pandas python sklearn tensorflow

Last synced: 11 Apr 2026

https://github.com/shibam120302/plant_deseases_prediction

Plant disease forecasting is a management system used to predict the occurrence or change in severity of plant diseases. At the field scale, these systems are used by growers to make economic decisions about disease treatments for control.

keras numpy opencv python

Last synced: 13 Apr 2026

https://github.com/paulo-santos-ds/dados-telefonia

Este projeto tem como objetivo desenvolver um modelo que possa analisar o comportamento do cliente e recomendar um dos planos mais recentes de uma companhia telefônica fictícia.

matplotlib numpy pandas python3 sklearn

Last synced: 12 Apr 2026

https://github.com/achuth-0908/hemoguard-anemia-predictor

A Web App incorporated with a Gradient Boosting Classifier Model, to predict Anemia with given data.

css flask html matplotlib numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/anvesham/machinelearning_models_using_matlab

Stepwise Multiple Linear Regression (With Interactions) and Random Forest Regression on predicting the Productivity of the Garment Factory Workers

bagging-ensemble bayesian-optimization decision-trees jupyter-notebook machine-learning matlab multiple-linear-regression-model numpy python random-forest-regression

Last synced: 11 Apr 2026

https://github.com/raju-2003/indiaai-cyberguard-ai-hackathon

An NLP-powered system to simplify cybercrime reporting by analyzing descriptions, categorizing incidents, and providing actionable insights.

matplotlib nltk numpy pandas python random-forest-classifier re scikit-learn seaborn shap spacy wordcloud

Last synced: 11 Apr 2026

https://github.com/nemeslaszlo/financial-crisis-effects-on-some-stocks

Focusing on bank stocks to see how they progressed throughout the 2008-09 financial crisis all the way to early 2016.

cufflinks jupyter-notebook matplotlib numpy pandas pandas-datareader plotly seaborn

Last synced: 03 May 2026

https://github.com/tasninanika/k-means-clustering

An interactive and insightful customer segmentation project using K-Means Clustering.

matplotlib numpy pandas plotly python3 scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/harshita2234/breast-cancer-prediction

A machine learning project to predict breast cancer using logistic regression. This project includes data preprocessing, feature scaling, model training, and evaluation, based on a guided project from Coursera.

breast-cancer-prediction jupyter-notebook logistic-regression machine-learning matplotlib numpy pandas scikit-learn

Last synced: 12 Apr 2026

https://github.com/aarryasutar/credit_eda

This project focuses on cleaning and analyzing a loan application dataset to gain insights into the factors influencing loan defaults. Through systematic data cleaning, visualization, and merging with previous application data, it provides a robust foundation for further predictive modeling.

binning boxplot correlation-matrix data-cleaning data-splitting dataframe feature-engineering heatmap jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn

Last synced: 13 Apr 2026

https://github.com/huangcongqing/python-libraries

python库(numpy,pandas...)学习

matplotlib numpy pandas python3 xgboost

Last synced: 09 Mar 2026

https://github.com/uea-geral/rna-perceptron-exercise

🤖Disciplina de RNA: treinamento de um neurônio Perceptron.

jupyter-notebook neural-network numpy perceptron python scikit-learn

Last synced: 13 Apr 2026

https://github.com/gauravsakure02/pubg_win_prediction

Predicting PUBG game wins using a dataset involves analyzing various in-game features such as player kills, distance traveled, and weapons acquired to build a model that forecasts the likelihood of winning a match based on player performance, helping in winning game

matplotlib numpy pandas sklearn

Last synced: 29 Apr 2026

https://github.com/fearless-spider/stooq-dax-seasonal

Parsing seasonality from time series data can often be useful in data analytics. It helps with analyzing seasonality for decision making as well as for more accurate forecasts. Python can be used to separate out these trend and seasonal components.

numpy pandas python statistics

Last synced: 13 Apr 2026

https://github.com/ladityagogoi/shadowguard

The ShadowGuard Browser Extension is a powerful tool designed to enhance user experience by identifying and highlighting potential dark patterns on websites. Our extension employs a combination of machine learning algorithms and natural language processing (NLP) models to detect and classify various deceptive design practices

css flask html javascript joblib numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/singhmnprt01/Custom-Deep-Neural-Network-Package

A custom deep neural network package that gives the liberty to design your deep neural network using only NumPy

customized-deep-neural-network deep-learning-framework maths-behind-neural-network numpy

Last synced: 09 Jul 2025

https://github.com/prakashjha1/stock-trend-prediction

Deep learning for predicting stock market prices and trends has become even more popular than before. I have used yahoo finance to collect the data and LSTM to build the stock trend model.

deep-learning keras lstm lstm-neural-networks machine-learning numpy pandas pandas-datareader prediction python rnn tensorflow

Last synced: 11 Apr 2026

https://github.com/pacatro/lse2text

Deep learning program that translates Spanish Sign Language (LSE) to text in real time.

ai cnn computer-vision deep-learning lse matplotlib numpy pandas python pytorch pytorch-lightning scikit-learn torchmetrics translation

Last synced: 11 Apr 2026

https://github.com/mahnoorsheikh16/Credit-Card-Default-Prediction

This project focuses on predicting whether a customer will default on their credit card payment in the upcoming month. Utilizing historical transaction data and customer demographics, the project employs various machine learning algorithms to distinguish between risky and non-risky customers for better credit risk management.

chi-square-test encoding hiplot imblearn json knn-imputer matplotlib numpy pandas pca-analysis pillow plotly robust-scalar scipy seaborn sklearn smote streamlit ttest visualization

Last synced: 01 Mar 2025

https://github.com/abhinav330/data-science-projects

The Data Science Projects folder contains various projects related to data analysis, machine learning, and exploratory data analysis (EDA). Each project focuses on a specific dataset and utilizes different techniques and algorithms to extract insights and make predictions.

decision-trees exploratory-data-analysis k-means-clustering k-nearest-neighbours kaggle linear-regression logistic-regression machine-learning machine-learning-algorithms matplotlib numpy pandas plotly python random-forest-classifier support-vector-machines

Last synced: 30 May 2026

https://github.com/balaji24092001/python

All python related content: notes, problems, progress archive.

matplotlib numpy opencv pandas python3 scipy-stats seaborn sklearn sqllite

Last synced: 12 Apr 2026

https://github.com/kanishk1420/renewabled-data-helper

An AI-powered assistant designed to help analysts, developers, and investors efficiently analyze and extract critical insights from renewable energy project documents. Built using Python, Google Gemini, FAISS, and Streamlit

gemini-api langchain mongodb numpy python streamlit

Last synced: 11 Apr 2026

https://github.com/nguyenanhtuan1912/computer_vision_exercises

Trong này sẽ chứa các file source về bài tập Computer Vision.

computer-vision numpy opencv python

Last synced: 12 Apr 2026

https://github.com/htlinh0604/invoice-data-extraction

This project demonstrates a classic OCR pipeline. This Flask app takes an image, applies an OpenCV preprocessing pipeline, and uses Tesseract OCR to digitize Vietnamese invoices (Bách Hóa Xanh)..

flask numpy ocr-recognition opencv pandas python regex tesseract-ocr

Last synced: 10 Nov 2025

https://github.com/dpgitaccount/file-handling-data-analytics-toolkits

In this repo File handling, Inventory management system and data toolkits resources are provided these are essential for managing, storing, and manipulating data in various formats, such as JSON files. JSON (JavaScript Object Notation), is widely used for data exchange due to its simplicity and readability. also File Handling included in this repo.

filehandling inventory-management json jupyter-notebook numpy os

Last synced: 13 Apr 2026

https://github.com/haidermalikk/neural-network-circuit-optimizer

The Neural Network Circuit Optimizer, built on my custom digital circuit simulator, uses AI and machine learning to optimize simple digital circuits. It leverages neural networks, circuit simulation, and optimization algorithms to reduce component count which improves: power usage, and execution time and efficiency. *Under Dev see Readme*

ai jupyter-notebook matplotlib ml numpy pandas python pytorch sqlalchemy tensorflow

Last synced: 10 Apr 2026

https://github.com/dipeshgoyal013/ipl_win_probability

A project which help you to check win probability of batting team in inning 2nd

machine-learning matplotlib numpy pandas python sklearn

Last synced: 05 Feb 2026

https://github.com/amari-calipso/papers-please-tasbot

A "Papers, Please" TASbot that offers a programmable interface with the game

cython game-bot games image-processing numpy papers-please papersplease python python3 tas tool-assisted-speedrun

Last synced: 13 Feb 2026

https://github.com/nafisalawalidris/tools-for-data-science

It covers popular languages (Python, R, SQL) and libraries (NumPy, Pandas) used in the field. The author shares their objectives of teaching data analysis, web development, and critical thinking skills. The repository also includes code examples, explanations of arithmetic expressions, and contact information for the author.

arithmetic-expressions data-analysis data-science data-visualization languages libraries matplotlib numpy pandas programming python r sql tools web-development

Last synced: 11 Apr 2026

https://github.com/aymen016/image-processing-algorithms

Explore Python-based image processing from scratch with core algorithms like interpolation, convolution, and filtering!

matplotlib numpy pillow python

Last synced: 13 Apr 2026

https://github.com/amishmathur1/resumeats

✨ With Resume ATS Analyzer, users can improve their resume quality, boost ATS compatibility, and increase their chances of landing interviews — all with smart AI assistance.

express-python matplotlib-python nltk nltk-python numpy openai-api pandas plotly plotly-dash plotly-express pypdf2 python streamlit wordcloud

Last synced: 11 Apr 2026

https://github.com/itancio/realestateprice

This is an application of deep learning to train and predict real estate price dataset in King County, USA.

datat-visualization machine-learning numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/nvinayvarma189/deep-learning-with-tensorflow

This repo consists all the basics of tensorflow and all major Deep Learning algorithms and techniques implemented in tensorflow. It will explain you the basics of tensorflow. You can visualize how tensors flow through a grapg while you create one

autoencoder cnn deep-learning gan matplotlib neural-network numpy pandas python3 reinforcement-learning rnn tensorflow

Last synced: 11 Apr 2026

https://github.com/kumartusha/data_scientist-ai-ml-_training_anudip_foundation

Anudip Foundation Data Analytics Training 📊 A repository containing projects, assignments, and coursework from the Anudip Foundation Data Analytics training program. This includes work on: 🧹 Data Cleaning & Preprocessing 📉 Data Visualization & EDA 📊 Statistical Analysis

excel matplotlib numpy pandas powerbi python scipy sql

Last synced: 10 Apr 2026

https://github.com/kartikeya443/handgesture_detection_opencv

A Computer Vision based model trained on data sets of over 300 inputs for a particular gesture is able to detect and identify human hands gestures effortlessly and is able to track the coordinates for further modifications in its usage for controlling devices like mouse and windows (control features are in development).

deep-learning mediapipe numpy opencv tensorflow

Last synced: 11 Apr 2026

https://github.com/mihirh19/python

a repository for collaborative development and version control using Git that houses Python code files, documentation, tests, and other project-related files.

algorithms data-structures django fastapi flask machine-learning numpy pandas python

Last synced: 11 Apr 2026

https://github.com/solrikk/optichrom

This repository provides a collection of Python tools and applications for comparing images using various algorithms such as ORB (Oriented FAST and Rotated BRIEF) and SSIM (Structural Similarity Index). The tools include both command-line scripts and graphical user interface (GUI) applications for ease of use

computer-vision cv2 feature-matching image-comparison image-comparison-tool machine-learning numpy object-detection opencv orb python scikit-image ssim tkinter visualization yolo yolov5

Last synced: 18 Jan 2026

https://github.com/thamirisq/data_analysis_culinary_journey

Project to analyze key information from Culinary Jorney, a marketplace Company, and develop an initial strategy to lead the entire team effectively.

folium-maps groupby-method jupyter-notebook markercluster marketplace-ds matplotlib-pyplot numpy pandas pandas-dataframe streamlit

Last synced: 10 Apr 2026

https://github.com/fiercex/toynn

Python实现神经网络

mnist-classification neural-network numpy python

Last synced: 15 May 2026

https://github.com/bogdang888g/jojo_stands

Анализ параметров стендов из ДжоДжо

numpy pandas plotly python scipy

Last synced: 11 Apr 2026

https://github.com/sivatsk26/university-admit-eligibility-predictor

This project is created using Machine Learning and Regression methods- a statistical technique to predict the outcome of event which is to verify the users’ admission eligibility level, considering the universities they have chosen. This is achieved based on the algorithms implemented, when is user feed the application with the required information

html-css-javascript ibm-cloud ibm-watson linear-regression machine-learning matplotlib numpy pandas python python-flask random-forest scikit-learn

Last synced: 13 Apr 2026

https://github.com/bhattbhavesh91/tf-numpy-example

Faster NumPy using Tensorflow's tf.experimental.numpy module

keras numpy tensorflow

Last synced: 01 May 2026

https://github.com/kishore-sr/creditworthy

AI-powered web app built at "Codeathon 2024" IIIT Bangalore

clerk-auth node-js numpy pandas python reactjs streamlit

Last synced: 11 Apr 2026