An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/sunnyrao07/water-quality-analysis

A machine learning project that predicts water potability based on chemical and physical attributes, using models like Logistic Regression, Random Forest, and XGBoost.

data-cleaning label-encoding logistic-regression matplotlib model-evaluation numpy pandas pyhton random-forest sckiit-learn seaborn smote standard-scaler xgboost

Last synced: 08 May 2026

https://github.com/chdl17/credit-eda-casestudy

Credit EDA is a GitHub repository designed to provide educational materials and tools for exploring credit risk data. The repository contains notebooks, datasets, and code samples to help users learn and apply these concepts to credit data.

exploratory-data-analysis numpy pandas

Last synced: 08 May 2026

https://github.com/saurabh274/aerofit-statistics-and-probability

The market research team at AeroFit wants to identify the characteristics of the target audience for each type of treadmill offered by the company, to provide a better recommendation of the treadmills to the new customers. The team decides to investigate whether there are differences across the product with respect to customer characteristics.

matplotlib-pyplot numpy pandas probability python seaborn statistics

Last synced: 08 May 2026

https://github.com/jupfu/beck-view-digitalize

Digitalize 16mm films with Python & reactivex & FT232H

cython ft232h libpng multiprocessing numpy opencv python3 reactivex um232h

Last synced: 09 May 2026

https://github.com/kemalcalak/developing-driverless-vehicles

This project includes the software required for an autonomous vehicle

numpy object-detection opencv segmentation yolo

Last synced: 09 May 2026

https://github.com/mahmood-anaam/deterministic-dynamic-programming

This repository explores deterministic dynamic programming solutions for the Shortest Path Problem (SPP) and the Traveling Salesperson Problem (TSP). It includes graphical representations, a detailed Jupyter notebook, and practical implementations for optimizing travel routes and minimizing costs in various city-based scenarios.

dynamic-programming graph-theory jupyter-notebook numpy python shortest-path-problem traveling-salesperson-problem

Last synced: 09 May 2026

https://github.com/dhruvv1402/x-twitter-bot-python

A Python-powered Twitter bot for content generation, analytics, and automation using Twitter API v2.

huggingface-transformers matplotlib nltk numpy pandas pillow python requests schedule textblob threading tweepy twitter-api wordcloud

Last synced: 09 May 2026

https://github.com/mattkevan/data-science

Notes from learning various data science topics.

data-science matplotlib numpy python3

Last synced: 09 May 2026

https://github.com/omogbolahan94/nobel-prize-winner-analysis

Data exploration, cleaning, visualization and presentation

markdown matplotlib numpy pandas plotly seaborn

Last synced: 09 May 2026

https://github.com/moonshallow5/food_vision_mini

A program which can detect if an image contains either a pizza, sushi or steak: Has now been deployed on Flutter :)

huggingface matplotlib numpy pytorch

Last synced: 09 May 2026

https://github.com/venkat-0706/titanic-survival-prediction

A machine learning project predicting Titanic passenger survival using data preprocessing, feature engineering, and model optimization with Logistic Regression, Random Forest, and XGBoost.

classification-report confusion-matrix gridsearchcv matplotlib numpy onehot-encoder pandas sckit-learn seaborn train-test-split xgboost

Last synced: 09 May 2026

https://github.com/sivasurya681/financial-expense-tracker

The Financial Expense Tracker is a Python-based app built with Flet, Pandas, NumPy, and Matplotlib.pyplot for seamless expense management. It enables real-time tracking, categorization, and data visualization through bar and pie charts. Users can generate summaries, export/import data, and make informed financial decisions easily.

calculator-application differntial expense-tracker flet matplotlib-pyplot numpy pandas python salary-calculator total

Last synced: 09 May 2026

https://github.com/aroojzahra908/machine-learning

This repository contains helpful guides and materials for easily understanding Pandas, NumPy, and Matplotlib libraries, which are essential tools for creating visual representations of data.

algorithms machine-learning-algorithms matplotlib numpy pandas

Last synced: 09 May 2026

https://github.com/betiniakarandut/portfolio-project

Sukkar and Cornell Integral Approach to Estimate Static Bottom Hole Pressure(SBHP)

numpy pandas python3

Last synced: 09 May 2026

https://github.com/vviveksharma/barcodescanner

The Bar Code scanning Application built using Computer Vision library such as Opencv in python the main objective of the project is to read the barcode that is being shown in the camera and decoding it and telling if the Database or Text file that is attached to it has that permitted decoded number if yes then it shows "Permitted" is green else "Not-Permitted" in red

numpy opencv python3

Last synced: 10 May 2026

https://github.com/1adityakadam/computer-vision-approach-for-quality-inspection-of-steel-sheets

Machine Learning and Computer Vision project improves steel defect inspection using computer vision. A segmentation technique partitions steel sheet images into regions, enabling better defect analysis. U-Net is used for segmentation, and MobileNetV2 for classification. Performance is evaluated with the Dice Coefficient.

computer-vision machine-learning numpy pandas unet-image-segmentation

Last synced: 10 May 2026

https://github.com/dawoudtormos/rain_predicition_ml

This repository contains a machine learning project for rain prediction using historical meteorological data. The goal is to predict whether it will rain the next day based on weather measurements like atmospheric water vapor, cloud formation, and past rainfall records.

ai machine-learning machine-learning-algorithms matplotlib numpy seaborn sklearn

Last synced: 10 May 2026

https://github.com/dzakarias/waves

Fun with the sine function inspired by Chladni plates

numpy streamlit

Last synced: 10 May 2026

https://github.com/prakashjha1/email-sms-spam-classifier

Many email services today provide spam filters that are able to classify emails into spam and non-spam email with high accuracy.

machine-learning natural-language-processing numpy pandas python

Last synced: 10 May 2026

https://github.com/jjerphan/joml

A minimalist numpy-baked Neural Network API

deep-learning fun minimalist neural-network neural-networks numpy python wheel-implementing

Last synced: 10 May 2026

https://github.com/kartmaan/plotly-intervals

Group values from a Pandas Series according to given intervals and represent them graphically with Plotly

bar-chart grouping grouping-plots intervals intervals-chart numpy numpy-arrays orca pandas pandas-series percentile pie-chart plot plotly plotting values

Last synced: 10 May 2026

https://github.com/anshpg/exploring-ipl-rivalries-an-analysis-of-matches-from-2008-to-2022

In this project, I delved into an extensive analysis of IPL matches spanning from 2008 to 2022, utilizing a dataset sourced from Kaggle.com. My aim for this IPL season was to explore something novel, steering away from conventional analyses.

csv-files ipl kaggle-dataset matplotlib numpy pandas

Last synced: 10 May 2026

https://github.com/andersoncrs/aprendizaje_no_supervisado_kmeans_customers

Este repositorio contiene un análisis de datos de clientes de un centro comercial utilizando técnicas de aprendizaje no supervisado, específicamente K Means y clustering jerárquico. El objetivo del proyecto es segmentar a los clientes en grupos homogéneos para entender mejor sus comportamientos y características.

data-analysis kmeans-clustering matplotlib numpy seaborn visualization

Last synced: 10 May 2026

https://github.com/velascort/image_equalizer_gui

Image equalizer using LUT interface.

image-processing lut numpy python scikit-image ttk

Last synced: 10 May 2026

https://github.com/shriansh2002/stock-prediction

This project predicts Apple stock prices using linear regression. It's based on historical stock price data and uses Python and popular data science libraries like Pandas, NumPy, Matplotlib, and scikit-learn.

apple datascience finance linearregression machinelearning matplotlib ml numpy pandas prediction python scikitlearn stock-prediction stockmarket

Last synced: 10 May 2026

https://github.com/jsnevt/regressao_logistica

Cálculos de probabilidades sobre uma eleição

matplotlib numpy pandas-dataframe python

Last synced: 11 May 2026

https://github.com/chaganti-reddy/heart_disease_prediction

Machine Model creation for Heart Disease Prediction

data-science deep-learning machine-learning numpy python

Last synced: 11 May 2026

https://github.com/samuel-schlemper-schlemuel/calculadora-de-media-variancia-e-desvio-padrao

Esse é um projeto do currículo do curso de analise de dados do FreeCodeCamp

numpy python

Last synced: 11 May 2026

https://github.com/chandkund/movie-recommender-system

Unlocking the magic of personalized movie recommendations! This project uses the Cosine Similarity Algorithm to suggest movies based on user-selected titles, making movie discovery more enjoyable.

cosine-similarity matplotlib nltk numpy pandas-python seaborn

Last synced: 12 May 2026

https://github.com/tfiers/unitlib

📐 Physical units for NumPy arrays \ ⏱ Fast • Simple • High voltage

numpy python scientific-computing unitlib units

Last synced: 13 May 2026

https://github.com/maroshmka/np-mlp-clf

Raw implementation of mlp classifier in python using numpy.

classification example-code learning machine-learning multilayer-perceptron neural-network numpy

Last synced: 14 May 2026

https://github.com/pnguyen215/pycelize

Pycelize is a Flask application designed for processing Excel and CSV files. It provides RESTful APIs for common data operations including extraction, normalization, mapping, SQL generation, and file binding.

excel lib numpy openpyxl pandas py-app pycelize python-service python3

Last synced: 17 Jun 2026

https://github.com/williyam-m/sentiment_analysis

A Django-based Sentiment Analysis web app leveraging the power of the BERT transformer model to classify text sentiment accurately.

beautifulsoup bert django numpy pandas python torch transformers

Last synced: 20 Jun 2026

https://github.com/takana671/fluidcube

Particle Movement Using Curl Noise

curl-noise cython numpy panda3d python

Last synced: 22 Jun 2026

https://github.com/abhijeet-bhambere/exploratory_data_analysis

A collection of several projects for conducting EDA on datasets from various sources

exploratory-data-analysis matplotlib numpy pandas python

Last synced: 04 May 2026

https://github.com/micmurawski/cloud-array

cloud-array is an open-source Python library for storing and streaming large Numpy Arrays on local file systems and major cloud proviers CDNs.

aws azure big-data bigarray blob-storage cloud data-structures digitalocean-spaces gcp gcp-cloud-storage ibm-cloud-object-storage numpy s3 stream-processing streaming zadara

Last synced: 12 Aug 2025

https://github.com/mani-prakash-n-r/book_recommendation_system_using_hybrid_technique

The Book Recommendation System uses collaborative and content filtering techniques to suggest books based on user preferences and content similarity.

csv html numpy pandas python sckiit-learn

Last synced: 09 Apr 2026

https://github.com/shoaib1522/credit-card-fraud-analysis

"A comprehensive project on Credit Card Fraud Detection combining Exploratory Data Analysis, Machine Learning, and an interactive Streamlit web app for real-world applicability."

eda epochs mlp-classifier neural-network numpy pandas-python perceptron-learning-algorithm python regression sckiit-learn seaborn streamlit streamlit-webapp tensorflow

Last synced: 09 Apr 2026

https://github.com/amandeep-gupta19/chatbot

Created a custom chatbot using Langchain. Here's a summary of what I did: Data Extraction: I gathered data about technical courses from the Brainlox website using Langchain’s URL loaders. Embedding Creation & Storage: I converted this data into embeddings and stored it in a vector store for efficient searching. API Development: I built a Flask

data-extraction faiss-vector-database flask-restful langchain numpy scikit-learn vector-database webbaseloader

Last synced: 09 Apr 2026

https://github.com/prithvi45/machine-learning-prerequisites

Basics to Advance concepts on Numpy, Pandas, Matplotlib

matplotlib numpy pandas python

Last synced: 09 Apr 2026

https://github.com/baranwaljii/data-science-projects

This Repository contains projects related to the domain of data science based on python and jupyter notebook

data-science jupyter-notebook machine-learning numpy pandas python

Last synced: 09 Apr 2026

https://github.com/tasninanika/australian-credit-approval-analysis-svm

This project uses a Support Vector Machine (SVM) Classifier to predict whether a credit application is approved (1) or denied (0) based on applicant features.

numpy pandas python3 scikit-learn svm-classifier

Last synced: 10 Apr 2026

https://github.com/ashraf-khabar/bank-marketing-data-analysis

This project is focused on analyzing bank marketing data using PyTorch, pandas, numpy, and scikit-learn. The goal is to build a predictive model that can help identify potential customers who are more likely to subscribe to a bank's term deposit.

data-cleaning data-science data-visualization dataset deep-learning deep-neural-networks feedforward-neural-network learning neural-networks numpy pandas python pytorch sklearn

Last synced: 09 Apr 2026

https://github.com/amruta33/melanoma-detection

To build a CNN based model which can accurately detect melanoma. Melanoma is a type of cancer that can be deadly if not detected early. It accounts for 75% of skin cancer deaths. A solution that can evaluate images and alert dermatologists about the presence of melanoma has the potential to reduce a lot of manual effort needed in diagnosis.

keras machine-learning numpy pandas python3 sckiit-learn tensorflow

Last synced: 09 Apr 2026

https://gitlab.com/DLR-SY/beos

Beos is a legacy Fortran tool used to calculate the static and dynamic buckling behavior of flat/curved fiber composite structures.

fortran numpy python

Last synced: 16 Aug 2025

https://github.com/chandkund/loan-eligibility-prediction

This project is designed to predict the eligibility of loan applicants based on various factors such as income, credit history, and marital status. By analyzing historical loan application data, the model helps to determine whether a loan application should be approved or not.

data-analysis data-science data-visualization machine-learning-algorithms matplotlib numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/glbter/data-analysis-py

data-analysis labs. 50/50 made with Oleh Tonkoshkur

numpy pandas python3 scipy-stats

Last synced: 12 Jun 2026

https://github.com/sukanyadutta52/topic_modeling

What are the most pressing concerns regarding ‘Climate Change’ among tweeters according to Topic Modeling?

climate-change gensim matplotlib nltk numpy pandas pyldavis regular-expression spacy

Last synced: 09 Apr 2026

https://github.com/balajig-24/titanic_data_analysics-

Project Title: Titanic Survival Prediction Project Overview The Titanic Survival Prediction project is a classic machine learning problem that aims to predict whether a passenger survived the Titanic disaster based on various features such as age, gender, passenger class, and more. This project demonstrates my ability to clean, analyze, and model.

jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/nas-research/knowledge-model

Our knowledge system systematically ingests, processes, and indexes open-access life science publications. It supports internal research by providing precise question-answering and efficient retrieval from a continuously updated repository of scientific literature

accelerate aws boto3 dataingestion keras lifesciences llama llama3 llm numpy pymupdf pytorch researchsupport sqlalchemy tensorflow textextraction

Last synced: 10 Apr 2026

https://github.com/subhadipsinha722133/credit-card-fraud-dection

Web application for detecting fraudulent credit card transactions using machine learning

data-visualization fraud-detection machine-learning matplotlib numpy pandas seborn sklearn streamlit

Last synced: 10 Apr 2026

https://github.com/ledsouza/medphys-bi

Aplicativo desenvolvido para gestão e análise de dados do Serviço de Física Médica em Medicina Nuclear do HCPA

aws boto3 cloud-computing cloud-run docker dotenv gcp mongodb numpy os pandas plotly pymongo python s3 streamlit terraform vitrinedev yaml

Last synced: 10 Apr 2026

https://github.com/kylehperez/mushroomnet

This API is for research purposes ONLY and is NOT to be used for food-safety or medical advice. Mushroomnet is a machine learning neural network for classifying mushrooms as poisonous or edible. The model was trained on data from uni.edu: over 5,000 instances of mushrooms, with 22 qualitative features and a determination of poisonous or edible.

artificial-intelligence botany flask-api machine-learning mycology neural-network numpy python3 pytorch scikit-learn

Last synced: 09 Apr 2026

https://github.com/ankitsharma-tech/classification-of-arrhythmia-using-ecg-data

A machine learning project to detect and classify arrhythmias from ECG signals using Python, scikit-learn, and TensorFlow. Includes data preprocessing, model training, and evaluation.

arrhythmia biomedical-signal-processing cardiology classification deep-learning ecg ecg-classification healthcare machine-learning mit-bih-dataset numpy python scikit-learn scipy signal-processing tensorflow time-series-analysis

Last synced: 07 Apr 2026

https://github.com/amithjoseph777/gourmethaven-case-competition

Predictive analytics case competition project from the Master of Science in Business Analytics course at the College of Business, University of Louisville. Our team analyzed customer response trends for Gourmet Haven, developed a predictive model, and ranked in the Top 5 out of 25 teams.

google-colab jupyter-notebook numpy pandas r scikit-learn

Last synced: 10 Apr 2026

https://github.com/izhaan0/predict-marks-based-on-study-hours

Student Marks Predictor is a machine learning project that predicts a student’s exam scores based on the number of study hours. It uses Linear Regression to learn the relationship between study hours and marks, and provides both command-line and interactive Streamlit web interfaces for prediction and visualization.

data-visualization joblib jupyter-notebook machine-learning machine-learning-algorithms matplotlib-pyplot numpy pandas pandas-dataframe pickle python scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/hanannazri/predictive-customer-churn-modelling-with-price-sensitivity-insights-

As a part of BCG Data Science Project, I developed a predictive churn-risk model for XYZ energy utility by engineering price, consumption and contract features and training XGBoost and Random Forest models to identify customers most likely to churn at current prices.

bcgx data-visualization exploratory-data-analysis feature-engineering jupyter jupyter-notebook model-training-and-evaluation numpy pandas random-forest scikit-learn xgboost

Last synced: 19 Aug 2025

https://github.com/ahmedabdalkreem/naive-bayes

Show what types of naive-bayes and how to use model and deal with data ( Numerical, Categorical and Mixed between Numerical and Categorical )

numpy pandas python3

Last synced: 10 Apr 2026

https://github.com/ialam085/super_store_sales_analysis_python

The project is indeed focused on performing an exploratory data analysis (EDA) of Super Store Sales data from various perspectives, using comprehensive visualizations.

charts matplotlib numpy pandas python seaborn visualization

Last synced: 10 Apr 2026

https://github.com/ksganni/diabetes-prediction-using-machine-learning

🔬 A machine learning project that predicts diabetes likelihood using patient health metrics and multiple classification algorithms.

jupyter-notebook matplotlib numpy pandas python scikitlearn-machine-learning seaborn

Last synced: 28 Apr 2026

https://github.com/cyberoctane29/epa-air-quality-aqi-analysis

This project involved analyzing air quality data from the EPA, focusing on the Air Quality Index (AQI). I used Python data structures like dictionaries and sets to manage and process the data, simulating real-world data analysis to assess pollution levels and their health implications.

data-analysis numpy pandas python statistics

Last synced: 10 Apr 2026

https://github.com/avin27/credit-card-fraud-detection

A python script to analyse credit card data to detect fraudulent transactions

data-science numpy pandas sklearn

Last synced: 10 Apr 2026

https://github.com/oguzhansarigol/Expected-Goals-xG-Data-Analysis

We will analyze in our code the expected goal locations of football players based on which areas of the field, at what times, with which feet, using which parts of their bodies, and from which angles and distances they are most likely to score.

data-analysis-python expected-goals numpy pandas-python python

Last synced: 20 Aug 2025

https://github.com/lorenzorottigni/ml-houses

Machine Learning python bootcamp: linear regression on houses model

ipynb linear-regression machine-learning numpy pandas python scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/alimiheb/morphotype-web-app

A modern web application that analyzes body morphotypes (Ectomorph, Mesomorph, Endomorph) using MediaPipe AI and provides personalized fitness and nutrition plans.

body-tracking flask mediapipe numpy opencv pandas pillow python3

Last synced: 10 Apr 2026

https://github.com/mayankmittal29/pagesage-ai_urls_content_scraper_webapp

PageSage is a modular web scraping and semantic retrieval system using BeautifulSoup for extraction, SentenceTransformers for dense embeddings, and ChromaDB for scalable vector search—enabling efficient data indexing for advanced RAG pipelines.

beautifulsoup chromadb fastapi gpt-turbo numpy openai-api python3 question-answering rag react requests sentence-transformers tailwindcss web-scraping

Last synced: 10 Apr 2026

https://github.com/kaladabrio2020/calculusnumerics

Trabalho da cadeira de Métodos Numérico I

metodos-numericos numpy python tkinter-gui

Last synced: 18 May 2026

https://github.com/mpbeatriz/santander_coders_2024

Repositório dos exercícios e projetos do curso de Engenharia de Dados da Ada em parceria com o Santander (em andamento).

apache-kafka apache-spark api data-engineering data-extraction fastapi flask numpy oop pandas python webhooks

Last synced: 10 Apr 2026

https://github.com/beyzabasarir/northwind-traders-analysis

Northwind dataset analysis using PostgreSQL, Python, and Power BI. Focused on sales, customers, shipping, and performance insights.

dashboard data-analysis data-visualization jupyter-notebook matplotlib numpy pandas postgresql powerbi python seaborn

Last synced: 10 Apr 2026

https://github.com/trivediayush/Accident-and-Drowsiness-Detection

Our college minor project is a comprehensive solution aimed at enhancing road safety. The project comprises two main components: an Accident Detection Model and a Real-Time Driver Drowsiness System.

accident-detection accidents-analysis college-project cvzone dlib-face-detection dlib-face-recognition face-detection face-recognition imutils minor-project numpy opencv pandas python python3 yolov8

Last synced: 22 Aug 2025

https://github.com/oyebamiji-micheal/data-analysis-with-python-zero-to-pandas

This repository contains all assignments and project completed when I took a course, "Data Analysis with Python: Zero to Pandas", on Jovian

data-analysis numpy pandas python

Last synced: 10 Apr 2026

https://github.com/kunalshelke90/wine-quality-testing

This project is about creating a machine learning algorithm that can predict the quality of wine based on the given dataset. Different machine learning algorithms such as logistic regression, decision tree and random forest are used in this project.

eda feature-engineering flask machine-learning numpy pandas python

Last synced: 10 Apr 2026

https://github.com/ahmedabdalkreem/covid-19

In this project we build complete project conatin tree part first part build deep leanring model using CNN second part build simple mobile app using Flutter and the last part is to connect mobile app with model using Flask API.

cnn dart deep-learning flask-api flutter matplotlib mobile-app numpy pandas python tensorflow

Last synced: 10 Apr 2026

https://github.com/balakrish001/python-course

The Python course provides a thorough foundation in Python programming, starting with core concepts and progressing to OOP. It covers essential libraries like NumPy and Pandas for data manipulation and analysis. You'll also learn how to send emails using Python's SMTP library and interact with APIs for real-world applications,

api numpy oops-in-python pandas problem-solving python smtp

Last synced: 10 Apr 2026

https://github.com/aymen016/cosmic-mystery-challenge-2912

"Explore the depths of space and unravel cosmic mysteries in the year 2912 with our Cosmic Mystery Challenge repository. Dive into data science adventures as you predict the fate of passengers aboard the Spaceship Titanic after a collision with a spacetime anomaly. Join us in reshaping history and saving lives across the universe!"

kaggle matplotlib matplotlib-pyplot numpy pandas pandas-dataframe python scikit-learn scikitlearn-machine-learning seaborn

Last synced: 10 Apr 2026

https://github.com/kwasi-dankwa/game-sales

This project investigates why Nintendo’s global market share fluctuated between 2010–2016, identifies top-performing genres and regions, and builds predictive models to forecast sales and regional performance. It culminates in specific, actionable recommendations to grow market share.

business-intelligence gaming juptyernotebook-language matplotlib numpy pandas postgresql python3

Last synced: 09 Apr 2026