An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/arrmansa/leetcode-nice-solutions

Some solutions for leetcode problems that I found fun - either the algorithm or using numpy bruteforce where it shouldn't be

leetcode leetcode-python leetcode-python3 leetcode-solutions numpy python python3

Last synced: 16 Mar 2025

https://github.com/rahulsm20/insurance-data

A data analytics project dealing with risk assessment and it's effects in health insurance.

data-analysis data-analytics machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/hrolive/fundamentals-of-accelerated-data-science

How to use GPU-accelerated tools to conduct data science faster, leading to more scalable, reliable, and cost-effective results.

accelerated-computing bokeh cudf cugraph cuml cupy dask data-science jupyter numpy pandas python rapids xgboost

Last synced: 12 Apr 2026

https://github.com/kishankrishna1/spam-classifier

Developed a Machine Learning-based Spam Classifier using Multinomial Naive Bayes to identify and filter spam messages with high precision

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 02 Apr 2026

https://github.com/9keystrokes/photorealistic-person-scene-composition

This computer vision project implements an advanced image processing pipeline to seamlessly integrate a person into any background scene with photorealistic results.

numpy opencv python rembg skimage

Last synced: 04 May 2026

https://github.com/naveen88112/healthcare

HealthCare Data Analysis and Forecasting This project examines healthcare data by processing missing values with KNN imputation, preprocessing features, and training classification models (Logistic Regression and Random Forest). The output includes performance metrics such as accuracy, confusion matrix, precision, recall, and ROC analysis.

data-visualization feature-engineering machine-learning model-evaluation numpy pandas python scikitlearn-machine-learning

Last synced: 12 Apr 2026

https://github.com/chea-young/financial-statistics-practice-using-python

파이썬을 활용한 금융통계실습 정리 repository (matplotlib, numpy, pandas, seaborn) 😀

matplotlib numpy pandas seaborn yfinance

Last synced: 11 May 2026

https://github.com/rahulchouhan1/car-theft-data-in-india

This project analyzes car theft incidents in India using Python and data visualization techniques. It provides insights into theft trends, frequently stolen car models, recovery statistics, and the impact of security measures like GPS and CCTV.

matplotlib numpy pandas python

Last synced: 13 Apr 2026

https://github.com/hannah-aji/predictive-modeling-to-identify-high-risk-areas

Predictive Modeling to Identify High-Risk Areas for Targeted Interventions in Chicago, Illinois

machine-learning matplotlib numpy supervised-learning visualization

Last synced: 17 Jul 2025

https://github.com/mahmoudnamnam/superstore-analysis

This project explores the SuperStore dataset to uncover insights into sales, profit, and customer behavior. It identifies key trends, regional variations, and product performance, using data analysis and machine learning techniques to guide business strategy and optimize performance.

clustering data-analysis data-science data-visualization geopandas jupyter-notebook machine-learning numpy pandas plotly regression seaborn sklearn

Last synced: 12 Apr 2026

https://github.com/mostafa-ghorab/global-happiness-analysis

An analysis of global happiness rankings based on various factors like GDP, family support, health, and freedom from the World Happiness Report (2015-2017). This project provides data visualizations and statistical insights into how these factors influence happiness scores in different regions.

business-analysis data-analysis data-visualization matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/thariniselvakumar/kidney-disease-prediction

This project is about the kidney disease prediction using machine learning algorithms

machine-learning matplotlib numpy pandas scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/sohamkul16/excel-sheet-comparator-by-python

The Excel Sheet Comparator project is a Python-based application designed to compare two Excel spreadsheets to identify differences between them. These differences are then highlighted in other downloadable .csv format.

html-css numpy pandas python

Last synced: 17 Jul 2025

https://github.com/muhammadadilnaeem/kidney-disease-classification-using-mlflow-and-dvc

Kidney disease is a significant health issue worldwide. Early and accurate classification of kidney diseases can help in better treatment planning and patient management. This project focuses on building a robust classification model, ensuring reproducibility and scalability.

github numpy python streamlit tensorflow

Last synced: 07 Apr 2026

https://github.com/ledsouza/python-powerbi

Desenvolver uma dashboard para análise do mercado financeiro utilizando Python e Power BI

etl matplotlib numpy pandas powerbi python seaborn

Last synced: 12 Apr 2026

https://github.com/mnitin-reddy/summer-olympics-data-analysis-web-app

An interactive web app for exploring trends in Olympic Games history, analyzing overall medal tallies, country-wise performance, and athlete demographics. Built with Python and Streamlit, this app offers insights through visualizations and data-driven statistics.

dataanalytics matplotlib numpy pandas python seaborn streamlit

Last synced: 12 Apr 2026

https://github.com/jpcano/boston_housing

Predicting Boston Housing Prices using supervised Machine Learning algorithms

cross-validation machine-learning numpy pandas python regression-models scikit-learn

Last synced: 12 Apr 2026

https://github.com/luona-zhang/kaggle-data-science-competitions

This repository contains code developed for participating in Kaggle Data Science competitions.

fitting-algorithm machine-learning model-evaluation numpy pandas scikit-learn seaborn tensorflow

Last synced: 07 Apr 2026

https://github.com/emhash/machine-learning-and-ai

Personal tasks or codes of Machine Learning and Artificial Intelligence. Practice codes and projects of Ml/AI will be share here.

ai datasets ipynb jupyter-notebook keras linear-regression logistic-regression machine-learning machine-learning-algorithms numpy opencv pandas python seaborn sklearn tensorflow

Last synced: 05 Jan 2026

https://github.com/kiok46/subset-sum-problem

Solving the Subset Sum Problem using Python, Pandas and Numpy.

numpy pandas python subset-sum

Last synced: 05 May 2026

https://github.com/hoyirul/flask-starter

Flask-Starter provides a streamlined foundation for Flask web development, ensuring a quick setup and optimal configuration. Simplify your Flask project initiation with Flask-Starter.

flask numpy pandas python rest-api

Last synced: 07 Apr 2026

https://github.com/yash-dave/creditcard-fraud-detector

An end-to-end Streamlit app for detecting credit card fraud using unsupervised anomaly detection techniques like Isolation Forest, One-Class SVM, LOF, and Autoencoders.

numpy pandas plotly python seaborn streamlit tensorflow

Last synced: 07 Apr 2026

https://github.com/nadahamdy217/movies-data-etl-using-python-gcp

Developed a comprehensive ETL pipeline for movie data using Python, Docker, and a GCP Pub/Sub emulator. Successfully processed and published the data in a local Docker environment, showcasing advanced data engineering skills.

analytics data data-engineering data-ingestion data-preparation data-preprocessing data-processing data-project docker etl etl-pipeline gcp matplotlib matplotlib-pyplot numpy pandas pubsub python scipy seaborn

Last synced: 06 Jan 2026

https://github.com/fabricioarendtorres/streamauc

Light-weight package for classification metrics computed on streams or minibatches of data. Mainly for area under the curve (AUC) of precision-recall (PR) or receiver operating characteristic (ROC) curves. Supports multi-class setting with either macro- or micro aggregation..

classification-model machine-learning metrics numpy precision-recall-curve receiver-operating-characteristic

Last synced: 06 Jan 2026

https://github.com/pushtogithub23/projectilemotion-simulation

This Jupyter notebook explores projectile motion through mathematics, visualizations, and interactive simulations.

jupyter-notebook matplotlib-pyplot numpy physics-simulation projectile-motion-simulation

Last synced: 28 Jan 2026

https://github.com/ksharma67/partial-dependent-plots-and-individual-conditional-expectation-plots

Individual Conditional Expectation (ICE) plots display one line per instance that shows how the instance's prediction changes when a feature changes. The Partial Dependence Plot (PDP) for the average effect of a feature is a global method because it does not focus on specific instances, but on an overall average.

eda gradient-boosting individual-conditional-expectation linear-regression matplotlib numpy pandas partial-dependence-plot python seaborn sklearn xgboost

Last synced: 12 Apr 2026

https://github.com/artuk009/numericalprogramming

Notes and Code examples for Numerical Programming.

matplotlib numpy pandas python scipy sympy

Last synced: 07 Apr 2026

https://github.com/lorenzorottigni/ml-random

Machine Learning course of Piero Savastano 7: matplotlib, accuracy_score, precision_score, recall_score, f1_score, classification_report

machine-learning matplotlib metrics numpy

Last synced: 04 May 2026

https://github.com/alisonmitchell/countries-of-the-world

Exploratory Data Analysis of US Government data from the World Factbook to investigate GDP ($ per capita), Literacy (%) and Phones (per 1000).

data-science machine-learning matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/aleksandarbuk/machine-learning

The Machine Learning Library repository provides a collection of scripts and tools leveraging Scikit-Learn, Pandas, and NumPy for various machine learning tasks and data analysis.

matplotlib numpy python scikit-learn tensorflow

Last synced: 16 Apr 2026

https://github.com/fioraz/ml-project

Salary Prediction Using Multiple Machine Learning Approaches

machine-learning ml natural-language-processing numpy pandas prediction pytorch tensorflow

Last synced: 12 Apr 2026

https://github.com/sakhileln/multimodal-agent

A beginner-friendly project to build a simple multimodal AI agent. 🦾

agent agents ai keras nlp nlp-machine-learning numpy opencv pillow spacy-nlp tensorflow

Last synced: 07 Apr 2026

https://github.com/ewakamoto/03.-python-projects

This is my collection of Python Projects

jupyter jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/sonatipek/matris_transactions_py

My "Python Programlama-I" lesson's homework. Some matris transactions with numpy library in Python

matrix numpy numpy-arrays python sonatipek

Last synced: 24 Mar 2025

https://github.com/anthippi/naive-bayes-imdb-classification

A custom Naive Bayes classifier for sentiment analysis of movie reviews from the IMDb dataset, utilizing feature selection based on Information Gain and comparing its performance with scikit-learn's BernoulliNB.

classification imdb matplotlib naive-bayes-classifier numpy pandas scikit-learn sklearn

Last synced: 09 Apr 2026

https://github.com/debjyotisaha/generative-ai-projects

A set of Generative AI projects which can be shown in your portfolio

chatgpt flask genai langchain nlp numpy pandas python streamlit

Last synced: 07 Apr 2026

https://github.com/prarthana-singh/potato-disease-classification-using-deep-learning

🌱 Potato Disease Classification using Deep Learning – A deep learning-based model to detect and classify potato plant diseases using image processing and CNNs. Built with TensorFlow/Keras & Python.

agriculture cnn deep-learning image-classification keras machine-learning neural-network numpy pandas plant-disease-detection python3 streaming tensorflow

Last synced: 12 Apr 2026

https://github.com/aneeshmurali-n/ann-diabetes-prediction

Predicting diabetes progression using an Artificial Neural Network (ANN). This project leverages the scikit-learn diabetes dataset for training and evaluation. Includes data preprocessing, model building, and performance visualization.

ann data-preprocessing data-visualization deep-learning diabetes-prediction exploratory-data-analysis keras machine-learning matplotlib neural-network numpy pandas regression scikit-learn seaborn tensorflow visualization

Last synced: 07 Apr 2026

https://github.com/alejoduarte23/bs6472

Serviceability assessment as per BS 6472 for human exposure to vibration in buildings Vibration sources other than blasting

matplotlib numpy pydantic scipy

Last synced: 07 Jan 2026

https://github.com/paulo-santos-ds/analise_vendas_supermercado

Este projeto realiza o web scraping de Variação de preços de produtos do supermercado

env numpy pandas scrapy sqlite3 streamlit

Last synced: 29 Apr 2026

https://github.com/otuemre/housepricingml

A machine learning project predicting house prices using regression models. Covers data preprocessing, feature engineering, and model comparison to achieve accurate results. Developed for a Kaggle competition, focusing on effective ML workflows and model interpretability.

eda encoding evaluation-metrics kaggle-competition lightgbm-regressor machine-learning matplotlib-pyplot neural-networks numpy pandas preprocessing python ridge-regression scikit-learn seaborn tensorflow xgboost-regression

Last synced: 13 Apr 2026

https://github.com/ericchw/youth_discord_nlp_chatbot

A python AI chatbot with emotion detection model. Frontend using PHP, API using Flask and database using PostgreSQL. Collaborate with CyberYouth from SJS. @HKMU 2022-2023 FYP

ai bootstrap chatbot chinese-nlp docker emontion-detection flask jquery machine-learning nlp nlp-machine-learning nltk numpy pgsql php postgresql pycord torch transformers

Last synced: 21 Jan 2026

https://github.com/athiyaman-m/brain-tumor-classification-using-deep-learning

• This deep learning model is developed to classify and identify different types of brain tumors, as well as determine their existence, using Convolutional Neural Networks. This is achieved through the training and testing of 3264 MRI images.

keras numpy opencv pandas sklearn tensorflow

Last synced: 08 Apr 2026

https://github.com/fikri-rouzan/energy-consumption-prediction

Final Project for the AI/ML Weekly Class by Google Developer Group on Campus (GDGoC) UIN Jakarta.

jupyter-notebook matplotlib numpy pandas python scikit-learn scipy seaborn

Last synced: 07 Apr 2026

https://github.com/rajatnai49/data-science-practices

In this, I am adding all the Python exercises with the solutions related to data handling, analysis, and visualization using libraries such as NumPy, Pandas, and Matplotlib for data analysis and visualization.

jupyter-notebook matplotlib numpy pandas python

Last synced: 09 Apr 2026

https://github.com/soumyapro/heart-disease-prediction

This project is aimed at predicting the likelihood of coronary heart disease (CHD) in individuals over the next ten years using Logistic Regression.

logistic-regression numpy pandas simpleimputer sklearn

Last synced: 21 Jan 2026

https://github.com/muhdhammad/machine-learning

Crafted for hands-on learning and implementation of ML with scikit-learn

data-science jupyter-notebook machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 07 Apr 2026

https://github.com/gusenov/max-empty-rect-py

:black_square_button: Реализация на Питоне алгоритма поиска на изображении пустого прямоугольника максимальной площади.

algorithm empty-spot graphics numpy python python-image-library python-library rectangle-detection

Last synced: 06 Feb 2026

https://github.com/gregoritsch3/exercise_pandas_1

A Pandas exercise demonstrating the loading, cleaning and reorganization of data in .xlsx or .csv files, descriptive statistics, data visualization, statistical approximation of data with the normal distribution, etc.). Libraries include Pandas, NumPy, Scipy, SymPy, MatplotLib,

data-cleaning data-visualization descriptive-statistics matplotlib numpy pandas scipy sympy

Last synced: 12 Apr 2026

https://github.com/shweta189/sms-spam-detection

Detecting Spam SMS's messages using machine learning algorithms.Utilizing NLP techniques such as tf-idf vectorization and machine learning algorithms like naive bayes and logistic regression for classifications. Integrated with Streamlit to create an interactive web app where users can input a message and instantly check if it's spam or not.

eda machine-learning-models nltk numpy pandas plotly python seaborn streamlit

Last synced: 07 Apr 2026

https://github.com/manshreet27/mrs

This Movie Recommendation System is a web-based application built using Python and Streamlit, designed to provide movie recommendations based on user preferences. It utilizes TMDb API for fetching real-time movie details and Kaggle's TMDB 5000 Movies dataset for content-based filtering.

numpy pandas python scikit-learn streamlit tmdb-5000-movies-dataset-from-kaggle tmdb-api-for-fetching-real-time-movie-data

Last synced: 07 Apr 2026

https://github.com/raghavendra317/brain-tumor-detection

A deep learning-based project for classifying brain MRI images into Glioma, Meningioma, No Tumor, and Pituitary Tumor using Convolutional Neural Networks (CNNs). This project aims to assist in early tumor detection, improving diagnostic accuracy and healthcare outcomes.

cnn-classification keras numpy opencv scikit-learn tensorflow

Last synced: 12 Apr 2026

https://github.com/shekharkram/project

A collection of data analytics projects showcasing skills in data cleaning, exploration, visualization, and basic SQL queries. Designed to demonstrate entry-level data analyst competencies using real-world datasets and tools.

datacleaning excel jupyter-notebook mysql numpy pandas postgresql python sql

Last synced: 07 Apr 2026

https://github.com/edegan-furb/conwaygameoflife

This project is a simulation of Conway's Game of Life using Python with the Pygame library.

conways-game-of-life numpy pygame python

Last synced: 29 Apr 2026

https://github.com/sandipkushwaha20/brain_tumour_detector

This project utilizes deep learning to detect and classify brain tumors from MRI images. The primary objective is to determine whether a tumor is present and, if so, classify its type with high accuracy.

deeplearning keras numpy pandas python tensorflow

Last synced: 12 Apr 2026

https://github.com/bagusperdanay7/absa-with-bilstm-undergraduate-thesis

My undergraduate thesis program, Aspect-Based Sentiment Analysis Towards Matket Place Application Review Using Bidirectional Long Short-Term Memory used Python, Keras and Tensorflow

ai aspect-based-sentiment-analysis bilstm deep-learning gensim imbalanced-learning ipython-notebook keras machine-learning matplotlib natural-language-processing nltk numpy pandas python scikit-learn seaborn tensorflow

Last synced: 11 Apr 2026

https://github.com/gauravpatil04/ml

This repo is on "Basics skills required for Machine Learning"

matplotlib numpy pandas python seaborn

Last synced: 11 Jul 2025

https://github.com/shankhadweep/diabetes-prediction-systemv3

This project demonstrates a machine learning solution for predicting diabetes based on user-provided health data. The application uses Streamlit for an interactive web interface and advanced interpretability tools like SHAP and permutation importance to explain model predictions.

jupyter-notebook machine-learning matplotlib numpy pandas plotly randomforestclassifier sckit-learn seaborn streamlit-webapp

Last synced: 01 May 2026

https://github.com/konynour/facedectection

This project demonstrates real-time face detection using OpenCV and a pre-trained SSD (Single Shot MultiBox Detector) model. The system captures video from a camera, processes the frames, detects faces, and displays the results with bounding boxes and confidence scores. This application showcases the use of deep learning models for object detection

cv2 jupyter-notebook lambda-functions matplotlib numpy opencv python3

Last synced: 12 Apr 2026

https://github.com/fjohnson87/planning-and-feasability

Data-driven feasibility analysis using pandas, seaborn, and FAO food supply data

matplotlib numpy pandas

Last synced: 04 May 2026

https://github.com/aqueeqazam/numpy-for-mathematical-computing

You're at the perfect spot if you're interested in developing your career in data science or machine learning, or if you just enjoy having fun with mathematical operations.

mathematics numpy numpy-arrays

Last synced: 19 Apr 2026

https://github.com/chenshuliu/python-guide-to-numpy

This repository is dedicated to explaining the functionalities of Numpy in Python

data-structures numpy python

Last synced: 18 Jan 2026

https://github.com/atharvkadammm/calmlytic

An end-to-end machine learning project that predicts anxiety severity using classification models (Naive Bayes, Decision Tree, SVM, Logistic Regression, XGBoost), based on lifestyle, health, and behavioral features.

anxiety-prediction classification csv data-analysis data-preprocessing-and-cleaning data-science data-visualization ensemble-learning logistic-regression machine-learning-algorithms matplotlib mental-health numpy pandas python sci-kit-learn seaborn supervised-learning svm xgboost

Last synced: 21 Jun 2025

https://github.com/AsuquoAA/CDC_2017_Immunization_Data_Analysis

This project analyzes data from the CDC's 2017 National Immunization Survey Public Use File (NISPUF17) to answer research questions about immunization, child health, and vaccine-related behaviors.

numpy

Last synced: 21 Jul 2025

https://github.com/jpgiant/training_project

Analyzing whether there is a difference between the average death ages of left handers and right handers using Bayesian Conditional Probability Theorem.

bayesian-statistics data-analysis data-visualization numpy pandas-dataframe python

Last synced: 30 Apr 2026

https://github.com/rubada/machine-learning-with-ruba-dabbas

Advance your skills and start your career here, by taking the online courses on Intuidemy.

course learning machine machine-learning matplotlib matplotlib-pyplot models numpy pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/ayushhhh123/diwali_sales_analysis

This project provides a comprehensive analysis of Diwali sales data using Python. The primary goal is to derive business insights from customer purchasing behavior during the Diwali festival season. This analysis can help improve marketing strategies, customer targeting, and overall sales performance.

matplotlib numpy pandas python seaborn

Last synced: 07 May 2026

https://github.com/pedronjorge/average-face-python

Generate an average face from a given set of images by detecting facial landmarks, aligning features, and blending the images using Delaunay Triangulation. Libraries used: Dlib, OpenCV, NumPy

dlib image-generation jupyter-notebook numpy opencv python3

Last synced: 21 Jun 2025

https://github.com/nik-kusanagi/jupyter

Treinamento Para Análise de Dados.

big-data jupyter jupyter-notebook matplotlib numpy pandas python python3

Last synced: 30 Apr 2026

https://github.com/sumaiyyaf/spotify-data-analysis

This project explores Spotify music data, using Python to clean, analyze, and visualize the dataset. It examines the relationship between audio features and sentiment in song lyrics, uncovering patterns and trends to provide valuable insights into music preferences.

eda numpy pandas python

Last synced: 08 May 2026

https://github.com/ahmedabdalkreem/skin-cancer

In this project we work to extraction features from Images using CNN and build the Neural Network to arrive the patient have skin-cancer Malignant or Benign.

cnn deep-learning matplotlib neural-network numpy pandas python3 sklearn-library

Last synced: 12 Apr 2026

https://github.com/marcelovilla/fire-congress-2019-tdf-talk

This repository contains the code used for the project presented in the "Fire activity on the Colombian Tropical Dry Forest: an environmental and social perspective" talk during the 8th International Fire Ecology and Management Congress.

earth-observation ecology fire-activity gdal latex matplotlib modis numpy pandas python remote-sensing requests seaborn tikz trmm xarray

Last synced: 12 Apr 2026