An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/nirmaldeepponnada/codeclauseinternshipproject2

Python, NLTK, Scikit-Learn, Pandas, NumPy, Pickle, SciPy, and JSON are used for text preprocessing, feature engineering, multi-label classification, and model persistence.

nltk numpy pandas pickle python scikit-learn scipy

Last synced: 07 Apr 2026

https://github.com/jcm-ai/robust-mlops-elevating-machine-learning-to-production-scale

This project aims to address the challenge faced by the Office of Foreign Labor Certification in managing a high volume of job certification applications for employers seeking to hire foreign workers in the United States. By leveraging ML models, this project seeks to streamline the process of shortlisting visa applicants based on historical data.

aws-ec2 aws-ecr aws-s3 cicd-pipeline data-science docker-image end-to-end exploratory-data-analysis git github-actions html machine-learning-models mlops mlops-project model-deployment mongodb numpy pandas python yaml

Last synced: 09 Apr 2026

https://github.com/hemanthsaikumarbethapudi/last-mile_delivery_optimization.py-

This project implements an AI-powered last-mile delivery optimization system using Python. It leverages graph-based route optimization, demand prediction, and cost analysis to improve delivery efficiency in urban areas.

heapq networkx numpy python

Last synced: 27 Aug 2025

https://github.com/chawthinn/car-price-prediction-regression-ml

This project predicts used car prices using regression models including Linear, Ridge, Random Forest, XGBoost, and LightGBM. It covers preprocessing, EDA, model evaluation, hyperparameter tuning, and model persistence using Scikit-learn and related libraries.

car-price-prediction lgbmregressor linearregression numpy pandas python scikit-learn xgbregressor

Last synced: 08 Apr 2026

https://github.com/aymen016/cosmic-mystery-challenge-2912

"Explore the depths of space and unravel cosmic mysteries in the year 2912 with our Cosmic Mystery Challenge repository. Dive into data science adventures as you predict the fate of passengers aboard the Spaceship Titanic after a collision with a spacetime anomaly. Join us in reshaping history and saving lives across the universe!"

kaggle matplotlib matplotlib-pyplot numpy pandas pandas-dataframe python scikit-learn scikitlearn-machine-learning seaborn

Last synced: 10 Apr 2026

https://github.com/marknature/machine-learning-intern

Machine Learning tasks involving the Titanic Dataset and Breast Cancer Wisconsin (Diagnostic) dataset

data-analysis github jupiter-notebook machine-learning matplotlib numpy pandas python scikit-learn sklearn

Last synced: 10 Apr 2026

https://github.com/dmarks84/coursework_project_network-analysis-node-link-prediction

Project for University of Michigan Applied Data Science Specialization -- Analyzed network nodes and edges, developing custom features based on various scoring metrics; used features to train classifier model to predict node attribute (employee salary type) and future edges (employee connections)

classification cross-validation data-reporting databases eda grid-search matplotlib network-analysis numpy pandas python scikit-learn statistics supervised-ml visualization

Last synced: 13 Apr 2026

https://github.com/fabriziomusacchio/ann_with_numpy

A how-to for building an artificial neural network from scratch using NumPy

ann artificial-neural-networks neural-network numpy

Last synced: 08 May 2026

https://github.com/shreeparab1890/heart-attack-analysis-prediction-accuracy-85.24

This iPython Notebook implements the Analysis and Predictions for Heart Attack. The Analysis is done on the dataset to identify the key features which has a impact on Heart Attack. After the analysis and data cleaning, the dataset is used to train the model using different ML algorithm.

eda exploratory-data-analysis matplotlib numpy pandas plotly prediction python scikit sklearn

Last synced: 13 Apr 2026

https://github.com/supernovasatsangi23/modifying-biomarker-gene-identification-for-effective-cancer-categorization

A project that focuses on implementing a hybrid approach that modifies the identification of biomarker genes for better categorization of cancer. The methodology is a fusion of MRMR filter method for feature selection, steady state genetic algorithm and a MLP classifier.

dataset deep-learning deep-neural-networks feature-selection genetic-algorithm machine-learning machine-learning-algorithms mlp-classifier mrmr neural-network numpy pandas-dataframe python python3 scikit-learn scikit-learn-python tkinter-gui tkinter-python

Last synced: 05 May 2026

https://github.com/nyx1311/toxicity-detector-using-bilstm

🚀 What we built: An AI-powered Women’s Safety & Well-Being Detector — a web app that flags multiple forms of online abuse in real time and offers tools for emotional recovery. 📊 Under the hood: BiLSTM + Word2Vec embeddings for deep, context-aware detection Trained on 21K+ labeled comments across 7 toxicity categories Built with Python, Tensor

epoch genism gpu keras-tensorflow matplotlib model nlp nlp-machine-learning nlpaug-textual numpy pandas pandas-library python3 streamlit tensorflow word2vec

Last synced: 14 Apr 2026

https://github.com/lesiaukr/mathematic_for_cs_colab_hw

Master's degree | HW | Mathematics for Computer Science course | Homeworks

goit google-colab home-works latex linear-algebra mathematics-for-computerscience matrices numpy python vectors

Last synced: 13 Apr 2026

https://github.com/ahmedabdalkreem/healty-care

The Complete project using neural network to arrive the best healt know he is stoke or not.

matplotlib neural-network numpy pandas python3 sklearn-library

Last synced: 13 Apr 2026

https://github.com/tigureis/analise_dataset_campanha_marketing

O projeto visa melhorar a eficácia das campanhas de marketing através da análise de dados e da construção de um modelo preditivo.

numpy pandas plotly seaborn sklearn xgbo

Last synced: 13 Apr 2026

https://github.com/kunalshelke90/wine-quality-testing

This project is about creating a machine learning algorithm that can predict the quality of wine based on the given dataset. Different machine learning algorithms such as logistic regression, decision tree and random forest are used in this project.

eda feature-engineering flask machine-learning numpy pandas python

Last synced: 10 Apr 2026

https://github.com/mmerlyn/analysis-of-tomato-prices

Forecasting tomato prices in Karnataka using machine learning to help farmers make better crop planning and selling decisions.

css flask html matplotlib numpy pandas python scikit-learn seaborn

Last synced: 06 Jul 2025

https://github.com/1adityakadam/computer-vision-approach-for-quality-inspection-of-steel-sheets

Machine Learning and Computer Vision project improves steel defect inspection using computer vision. A segmentation technique partitions steel sheet images into regions, enabling better defect analysis. U-Net is used for segmentation, and MobileNetV2 for classification. Performance is evaluated with the Dice Coefficient.

computer-vision machine-learning numpy pandas unet-image-segmentation

Last synced: 10 May 2026

https://github.com/hms75/movie_rating_analysis

A movie rating analysis which identifies trends amongst a dataset of 5000 movies.

data-analysis data-visualization matplotlib-pyplot numpy pandas python

Last synced: 05 May 2026

https://github.com/cyberlument/opencv-colormasking-red-

This mini project opencv is for educational purpose.

numpy opencv opencv-python pycharm vscode

Last synced: 20 Jan 2026

https://github.com/dhaase-de/dito

Yet another toolbox for the daily work with OpenCV under Python.

numpy opencv python

Last synced: 22 Aug 2025

https://github.com/beyzabasarir/northwind-traders-analysis

Northwind dataset analysis using PostgreSQL, Python, and Power BI. Focused on sales, customers, shipping, and performance insights.

dashboard data-analysis data-visualization jupyter-notebook matplotlib numpy pandas postgresql powerbi python seaborn

Last synced: 10 Apr 2026

https://github.com/danny-1k/nnpy

Neural network library written from scratch in numpy

deep-learning neural-networks-from-scratch numpy python

Last synced: 17 May 2026

https://github.com/legalaspro/rnn_gru_lstm_experiments

RNN, GRU, LSTM implementation using PyTorch and Numpy

gru ltsm numpy pytorch rnn

Last synced: 21 Apr 2026

https://github.com/harshs472/real-time-object-detection

Real time object detection that detects approximately 80 objects!!!!

coco numpy python yolov3

Last synced: 27 Apr 2026

https://gitlab.com/DLR-SY/mcodac

MCODAC is a Fortran library for the numerical evaluation of fiber composite damage. The library contains analysis methods specifically tailored to fiber composites, from micromechanical homogenization approaches to macroscopic fatigue models of orthotropic multilayer composites.

fortran numpy python

Last synced: 02 Apr 2025

https://github.com/mpbeatriz/santander_coders_2024

Repositório dos exercícios e projetos do curso de Engenharia de Dados da Ada em parceria com o Santander (em andamento).

apache-kafka apache-spark api data-engineering data-extraction fastapi flask numpy oop pandas python webhooks

Last synced: 10 Apr 2026

https://github.com/parthshah02/customer_churn_dashboard

This repository features a comprehensive project showcasing data analysis and interactive dashboard using Python

data-analysis matplotlib numpy pandas python

Last synced: 13 Apr 2026

https://github.com/lingesh81051/similar-template-document-matching-and-fraud-detection

An automated system for a health insurance company to streamline document processing, including template matching and fraud detection, resulting in reduction of processing time.

numpy opencv opencv-python pillow pymupdf pytesseract pytesseract-ocr python tkinter

Last synced: 13 Apr 2026

https://github.com/hariprasath-v/hackerearth_transunion-data-science-analytics-hiring-challenge_2022

Machine learning model to classify the credit score based on people bank details and credit related information.

catboost exploratory-data-analysis klib machine-learning matplotlib numpy optuna pandas python seaborn sklearn

Last synced: 10 Apr 2026

https://github.com/chandkund/customer-segmentation-using-k-means-clustering

Implemented K-Means Clustering to segment customers based on purchasing behavior, enabling targeted marketing strategies. Analyzed data, optimized clusters using the Elbow Method, and derived insights to enhance customer engagement and retention.

data-science kmeans-clustering machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 10 Apr 2026

https://github.com/suap-dev/python-essentials

a repository intended for people who want to know some python basics

lambda-functions numba numpy oop pandas teaching

Last synced: 17 Mar 2025

https://github.com/jersonrn/termviewer3d

Interactive terminal environment that allows users to explore 3D models in real-time.

3d ascii cupy gpu linear-algebra numpy python

Last synced: 19 May 2026

https://github.com/alimiheb/morphotype-web-app

A modern web application that analyzes body morphotypes (Ectomorph, Mesomorph, Endomorph) using MediaPipe AI and provides personalized fitness and nutrition plans.

body-tracking flask mediapipe numpy opencv pandas pillow python3

Last synced: 10 Apr 2026

https://github.com/patex1987/ghcn-temperature-dataset

Analysis of GHCN (Global Historical Climatology Network) temperature dataset using Numpy

climate-data jupyter-notebook numpy python temperature-data

Last synced: 10 May 2026

https://github.com/vikkiezdev/ai-global-index-analysis

This project analyzes the AI readiness of 62 countries using key indicators like government strategy, commercial activity, research, development, and infrastructure. Through data cleaning, EDA, and visualization, it identifies key drivers of AI adoption and competitiveness.

cleaning-data correlation-analysis eda matplotlib numpy pandas python3 seaborn statistical-analysis

Last synced: 06 May 2026

https://github.com/csengupta1101/career-survey---data-scientist

This repository holds an in depth analysis of Scrapped data from naukri.com. which showcases in in demand skills and opportunities in Data science across the subcontinent.

analysis career data-science matplotlib naukri numpy pandas python3

Last synced: 07 May 2026

https://github.com/fimbres/matplotlib-start

A crash project with the most important matplotlib functions and methods, used in data science, AI and machine learning.

data-science matplotlib matplotlib-pyplot numpy pandas python python-script

Last synced: 13 Apr 2026

https://github.com/kostadinlambov/time-series-forecasting

This project evaluates the predictive performance of a CNN-LSTM Hybrid deep learning model for Bitcoin price movement prediction.

keras-tensorflow matplotlib-pyplot mlflow numpy optuna pandas python scikit-learn seaborn statsmodels ta-lib tensorflow

Last synced: 07 Apr 2026

https://github.com/elvisrock/government_analysis

Esse projeto, tem como objetivo, analisar, explorar algumas técnicas e propor uma sugestão ao governo de forma geral, sem qualquer viés politico.

data-science machine-learning numpy pandas python sklearn

Last synced: 05 Apr 2026

https://github.com/avin27/credit-card-fraud-detection

A python script to analyse credit card data to detect fraudulent transactions

data-science numpy pandas sklearn

Last synced: 10 Apr 2026

https://github.com/ksganni/diabetes-prediction-using-machine-learning

🔬 A machine learning project that predicts diabetes likelihood using patient health metrics and multiple classification algorithms.

jupyter-notebook matplotlib numpy pandas python scikitlearn-machine-learning seaborn

Last synced: 28 Apr 2026

https://github.com/sweta-kaundilya/finance-data-project

This data project focuses on exploratory data analysis of bank stock prices throughout financial crisis all the way to early 2016.

financial-analysis jupyter-notebook matplotlib numpy pandas-python seaborn

Last synced: 16 May 2026

https://github.com/shwetapardhi/assignment-1-q9_b--basic-statistics-level-1

Q9) Calculate Skewness, Kurtosis & draw inferences on the following data SP and Weight(WT) Use Q9_b.csv

kurtosis matplotlib-pyplot numpy pandas python seaborn skewness

Last synced: 06 May 2026

https://github.com/farhad-here/height-distribution-analysis

Statistical comparison of height distributions in two groups using mean, standard deviation, and boxplots.

coefficient-of-variation data-analysis interquartile-ranges matplotlib mean numpy python scipy standard-deviation variance

Last synced: 13 Apr 2026

https://github.com/kashinathbiradar/bangalore-housing-price-prediction

The objective of the project is to create a machine learning model. We are doing a supervised learning and our aim is to do predictive analysis to predict housing price.

css deta doc html machine-learning matplotlib numpy pandas preprocessing seaborn sklearn sql

Last synced: 06 May 2026

https://github.com/ejw-data/python-bitwise-operators

Difference between bitwise and logical operators when using pandas

numpy pandas

Last synced: 08 May 2026

https://github.com/christabelsakyi/tsec

This project implements a time series forecasting solution for predicting electricity consumption based on historical data. It provides multiple model options, a Flask API for deployment, and visualization tools.

facebook numpy pandas python sklearn time-series-analysis

Last synced: 07 Oct 2025

https://github.com/zolppy/recommendation-system

This project demonstrates the creation of a content-based image recommendation system. It leverages a pre-trained VGG16 deep learning model to extract meaningful feature vectors from images. These features are then compared using cosine similarity to identify and recommend visually similar images.

computer-vision deep-learning keras machine-learning numpy recommendation-system sklearn tensorflow vgg16

Last synced: 12 Apr 2026

https://github.com/erikaax08/numerical_methods

This repository hosts the project developed for the Numerical Methods class at the Benemérita Universidad Autónoma de Puebla (BUAP)

django maths matplotlib numerical-methods numpy plotly

Last synced: 19 Aug 2025

https://github.com/chandkund/movie-recommender-system

Unlocking the magic of personalized movie recommendations! This project uses the Cosine Similarity Algorithm to suggest movies based on user-selected titles, making movie discovery more enjoyable.

cosine-similarity matplotlib nltk numpy pandas-python seaborn

Last synced: 12 May 2026

https://github.com/hanannazri/predictive-customer-churn-modelling-with-price-sensitivity-insights-

As a part of BCG Data Science Project, I developed a predictive churn-risk model for XYZ energy utility by engineering price, consumption and contract features and training XGBoost and Random Forest models to identify customers most likely to churn at current prices.

bcgx data-visualization exploratory-data-analysis feature-engineering jupyter jupyter-notebook model-training-and-evaluation numpy pandas random-forest scikit-learn xgboost

Last synced: 19 Aug 2025

https://github.com/thkox/uni_2023-content-based-image-retrieval

This project is a team assignment for the "Image Analysis" course, offered in the 7th semester of the 2023 academic year at the University of Piraeus, Department of Informatics.

7th-semester cartesian-product content-based-image-retrieval department-of-informatics hyperedge-similarity-calculation hypergraph image-analysis matplotlib networkx numpy python rank-normalization scipy torchvision university-of-piraeus

Last synced: 08 Oct 2025

https://github.com/ianjure/simple-neural-network

Implementing a simple neural network using only Numpy.

deep-learning neural-network notebook numpy scratch-implementation

Last synced: 06 May 2026

https://github.com/izhaan0/predict-marks-based-on-study-hours

Student Marks Predictor is a machine learning project that predicts a student’s exam scores based on the number of study hours. It uses Linear Regression to learn the relationship between study hours and marks, and provides both command-line and interactive Streamlit web interfaces for prediction and visualization.

data-visualization joblib jupyter-notebook machine-learning machine-learning-algorithms matplotlib-pyplot numpy pandas pandas-dataframe pickle python scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/ben854719/autonomous-health-monitoring-emergency-response

This app uses Bluetooth to link a worker’s smartphone with a supervisor’s laptop for live health monitoring. PySyft handles secure biometric authentication, while machine learning detects injuries and visualizes them on a dashboard. Agentic AI responds to critical events by triggering emergency protocols and contacting 911 with location data.

agentic-ai bluetooth cryptography cybersecurity error-handling gemini langgraph machine-learning-algorithms matplotlib mcpserver nltk numpy pysyft python pytorch socket tcp tenseal

Last synced: 28 Apr 2026

https://github.com/ankitsharma-tech/classification-of-arrhythmia-using-ecg-data

A machine learning project to detect and classify arrhythmias from ECG signals using Python, scikit-learn, and TensorFlow. Includes data preprocessing, model training, and evaluation.

arrhythmia biomedical-signal-processing cardiology classification deep-learning ecg ecg-classification healthcare machine-learning mit-bih-dataset numpy python scikit-learn scipy signal-processing tensorflow time-series-analysis

Last synced: 07 Apr 2026

https://github.com/subhadipsinha722133/credit-card-fraud-dection

Web application for detecting fraudulent credit card transactions using machine learning

data-visualization fraud-detection machine-learning matplotlib numpy pandas seborn sklearn streamlit

Last synced: 10 Apr 2026

https://github.com/davityak03/object-detection-in-images-using-yolo

This project demonstrates object detection using the YOLO (You Only Look Once) model, providing a solution for detecting and labeling multiple objects within images or video streams. This implementation covers image preprocessing, model inference, and visualization of detected objects.

computer-vision matplotlib-pyplot numpy object-detection opencv-python pandas python yolo

Last synced: 18 Jan 2026

https://github.com/eiichiroito/numpharo

NumPy like tools for Pharo Smalltalk

matplotlib-pyplot numpy pharo pharo-smalltalk smalltalk

Last synced: 18 Jan 2026

https://github.com/harsha-yuvaraj/data-compression-and-decompression-tool

A specialized lossless file compression and decompression tool designed mainly for plain text, including programming files.

data-structures huffman-compression-algorithm lossless-compression-algorithm numpy python tkinter

Last synced: 15 Apr 2026

https://github.com/rachakondaganesh/superstore-sales-data-analysis-project

Analyzed retail sales data to uncover key business insights using Python and Power BI. Explored patterns in profit, sales, and customer segments across regions. Built interactive dashboards to visualize trends, identify top-performing categories, and highlight areas for improvement in shipping and discount strategies.

dashboard exploratory-data-analysis matplotlib-pyplot numpy pandas powerbi seaborn

Last synced: 02 May 2026

https://github.com/amish5ingh/cricket-data-analytics-ipl

Data analysis and visualization of IPL 2022 matches using Python, Pandas, Matplotlib, and Seaborn. Includes insights on match outcomes, player performances, toss trends, and venue stats with 12+ charts.

data-analysis data-visualization ipl-data-analysis ipl-data-visualization jupiter-notebook matplotlib-pyplot numpy pandas python seaborn

Last synced: 09 May 2026

https://github.com/nikhleshshukla123/web-scraping-using-python

scrapes multiple pages of Amazon search results using python.

beautifulsoup4 numpy pandas python

Last synced: 03 May 2026

https://github.com/abdullah-masood-05/eye-balll-detection

This repository provides advanced eye gaze tracking using MediaPipe Face Mesh for both video files and live camera feeds.

mediapipe numpy opencv scipy

Last synced: 10 Apr 2026

https://github.com/yappy2000d/pps-ndarray

Multi-dimensional array implementation

cpp numpy

Last synced: 18 Jan 2026

https://github.com/yashpaneliya/bank-loan-default-analysis

Analyze and understand the driving factors (or driver variables) behind loan default, i.e. the variables which are strong indicators of default.

data-analysis loan-default-analysis matplotlib numpy pandas python

Last synced: 06 May 2026

https://github.com/netbr3ak/planetary-system-simulation

Planetary System Simulation enables users to simulate the trajectories of two planets in a gravitational system. Through a simple web interface, users can predict planetary movements using technologies such as Flask, NumPy, and SciPy.

astronomy flask gravitational-dynamics numerical-analysis numpy physics planetary-systems scipy simulation web-application

Last synced: 14 Apr 2026

https://github.com/sukanyadutta52/topic_modeling

What are the most pressing concerns regarding ‘Climate Change’ among tweeters according to Topic Modeling?

climate-change gensim matplotlib nltk numpy pandas pyldavis regular-expression spacy

Last synced: 09 Apr 2026

https://github.com/fuodorov/python

Python Numpy Tutorial

numpy python

Last synced: 09 Oct 2025

https://github.com/bhavnanahar/coral_project

ChatGPT said: This project analyzes and predicts the health of coral reefs in the Florida Keys by examining coral cover and species richness trends over time.

colab-notebook matplotlib numpy pandas plotly python random-forest scikit-learn seabo

Last synced: 14 Apr 2026

https://github.com/sankaran-s2001/us-traffic-accidents-analysis-python-eda

Exploratory data analysis of US traffic accidents from 2016-2023, analyzing patterns by time, location, weather, and severity using Python data science libraries.

data-analysis data-science data-visualization eda matplolib numpy pandas python

Last synced: 06 May 2026

https://github.com/gojoel/brockton-data-analysis

Analyzes police logs from the city of Brockton, MA and attempts predictive analysis.

data-science exploratory-data-analysis geopy machine-learning matplotlib numpy pandas seaborn

Last synced: 09 May 2026

https://github.com/glbter/data-analysis-py

data-analysis labs. 50/50 made with Oleh Tonkoshkur

numpy pandas python3 scipy-stats

Last synced: 12 Jun 2026

https://github.com/andrewsy1004/logistic-regression-spam-classifier

This project implements a spam email classifier using Logistic Regression.

numpy pandas scikit-learn

Last synced: 06 May 2026

https://github.com/vishalgaud17/stroke

A simple Streamlit web app that predicts stroke risk based on user input features like age, BMI, glucose level, and lifestyle factors, using a pre-trained machine learning model.

machine-learning numpy pandas python scikit-learn streamlit

Last synced: 14 Apr 2026

https://github.com/shoaib1522/introduction-to-data-science

Unlock the world of data science with hands-on tutorials, interactive notebooks, and essential resources! Whether you're a beginner or looking to refine your skills, this repo offers a comprehensive guide to mastering Python, Git, NumPy, Pandas, and more—empowering you to analyze, visualize, and transform data like a pro! 🚀📊💻

confusion-matrix github-config kaggle matplotlib numpy pandas python3 script visualization

Last synced: 14 Apr 2026