An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/ebadshabbir/decision_tree_algorithm

Decision Tree Classifier for Social Network Ads: a Python implementation of a Decision Tree Classifier to predict user purchasing behavior based on age and estimated salary. Includes feature scaling, model evaluation (confusion matrix and accuracy), and visualizations of decision boundaries for both training and test sets.

decision-tree-classifier jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/alefair/ds

Data Science projects

ipython jupyter matplotlib numpy pandas python sklearn

Last synced: 15 Mar 2026

https://github.com/pradeep31747/smartsuggest-personalized_product_recommendations

This project implements a personalized product recommendation system using machine learning techniques to enhance user experience and drive engagement.

jupyter-notebook keras numpy pandas pyhton scikit-learn sql tensorflow vscode

Last synced: 28 Jan 2026

https://github.com/tugot17/neural-network-from-scratch

Simple neural network implemented from scratch using the NumPy package

from-scratch neural-network numpy python

Last synced: 26 Oct 2025

https://github.com/ggrbill/phd-plot-scripts

My personal plot scripts used to generate graphs for my PhD Thesis

hacktoberfest matplotlib numpy python

Last synced: 18 Apr 2026

https://github.com/sonaligill/olympics-analysis

The outcome of this project is an interactive Streamlit web application that visualizes an analysis of Olympic data, letting users explore different aspects of Olympic history, compare country performances, and gain insights into athlete demographics.

numpy plotly python scikit-learn scipy streamlit

Last synced: 28 Jan 2026

https://github.com/Gregoritsch3/Exercise_Pandas_1

A Pandas exercise demonstrating the loading, cleaning and reorganization of data in .xlsx or .csv files, descriptive statistics, data visualization, statistical approximation of data with the normal distribution, etc. Libraries include Pandas, NumPy, SciPy, SymPy, Matplotlib,

data-cleaning data-visualization descriptive-statistics matplotlib numpy pandas scipy sympy

Last synced: 01 May 2025

https://github.com/yoavram/numpycharm

Some examples on using PyCharm with Scientific Python

cython numpy pycharm python scientific-computing training-materials

Last synced: 11 Apr 2026

https://github.com/lintangwisesa/python_pillow_image_array

Convert 🖼 image to 📜 Numpy array using Pillow [Python]

image numpy pillow python

Last synced: 07 May 2026

https://github.com/normyee/chatbot-pytorch_nlp

ChatBot utilizing neural networks, NLP techniques, and the Bag of Words model. Implements tokenization and stemming for efficient language processing.

bag-of-words chatbotai neural-network nlp-machine-learning nltk-python numpy python pytorch-nlp

Last synced: 09 May 2026

https://github.com/ambrish8/bell-s-palsy

BELL’S PALSY SEVERITY DETECTION USING DEEP LEARNING

cnn-architecture imagedatagenerator keras numpy opencv python pytorch resnet-50

Last synced: 15 Apr 2026

https://github.com/sa001gar/password-manager-on-the-top-of-streamlit

Secure Password Manager is a Python-based application developed by Sagar Kundu, designed to provide users with a secure and convenient way to manage their passwords. With features like user authentication, password generation, and secure storage using bcrypt and MongoDB, it ensures the safety of sensitive information while offering an intuitive use

bcrypt gui-application mongodb numpy numpy-library pandas-dataframe pandas-library pandas-python password-generator password-manager pymongo python python3 streamlit-application streamlit-webapp urllib

Last synced: 15 Apr 2026

https://github.com/djdurga/google_play_store_apps_analysis

This data analysis project focuses on exploring and understanding the Google Play Store Apps dataset.

numpy pandas python

Last synced: 11 Apr 2026

https://github.com/deepankkartikey/ml-projects

A repository of projects done while learning and exploring ML

classification csv machine-learning numpy pandas regression sklearn

Last synced: 15 Apr 2026

https://github.com/jaypanchal9/fraud-detection-case-study

A comprehensive case study applying machine learning techniques to detect fraudulent transactions effectively.

machine-learning matplotlib numpy pandas python3 scikit-learn seaborn xgboost

Last synced: 15 Apr 2026

https://github.com/ljadhav25/decision-tree-random-forest-algorithm-data-science-

This repository contains an implementation of decision tree and random forest algorithms from scratch in Python. Decision trees and random forests are popular machine learning algorithms used for classification and regression tasks. The goal of this project is to provide a clear and understandable implementation of these algorithms

data-analysis data-science decision-trees machine-learning-algorithms matplotlib numpy pandas python random-forest-classifier

Last synced: 15 Apr 2026

https://github.com/jupfu/beck-view-digitalize

Digitalize 16mm films with Python & reactivex & FT232H

cython ft232h libpng multiprocessing numpy opencv python3 reactivex um232h

Last synced: 09 May 2026

https://github.com/arn4/labtools

Package with useful scripts for Physics Laboratory Courses

numpy python3 scipy uncertainties

Last synced: 05 Jan 2026

https://github.com/keyurparalkar/breast-cancer-detection

Predict whether the cancer is benign or malignant

gradient-descent logistic-regression machine-learning numpy

Last synced: 26 Apr 2026

https://github.com/omogbolahan94/lego-analysis

Analysis of LEGO data from 1949 to 2021.

matplotlib merge numpy pandas

Last synced: 09 May 2026

https://github.com/bdurga26/fake-news-detection

This project aims to determine whether a given news item is real or fake using machine learning algorithms.

ds ml numpy pandas python sklearn

Last synced: 14 Apr 2026

https://github.com/erikbrinkman/hilbert-bytes

A Python library for converting between d-dimensional points and indices on a Hilbert curve

hilbert-curve numba numpy python

Last synced: 08 May 2025

https://github.com/franciscomartinez45/social-network-analysis

A project analyzing how misinformation propagates within communities. With the goal of addressing health disparities and improving health literacy, particularly in minority populations, the project explores both supervised and unsupervised learning approaches to understand patterns in graph-structured data using a custom Graph Attention Network

matplotlib ml networkx numpy pytorch

Last synced: 09 May 2026

https://github.com/farha-dahman/investigate_dataset

Investigate [No-show appointments] Dataset

data-analyst jupyter-notebook matplotlib numpy pandas python

Last synced: 15 Apr 2026

https://github.com/codehub001/age-and-gender-detection

This AI-powered project detects faces and predicts age & gender from images or video streams using OpenCV and deep learning. 🧠💡 🔹 Face Detection 👀 🔹 Gender Classification 🏷️ (Male/Female) 🔹 Age Estimation 📊 (from predefined age groups) 🔹 Live Video & Image Support 🎥🖼️

argparse deep-learning numpy opencv pretrained-model python

Last synced: 15 Apr 2026

https://github.com/samiyaalizaidi/nn-ml-homeworks

Homework solutions for CPE-4903: Neural Networks & Machine Learning at Kennesaw State University.

machine-learning machine-learning-workflow neural-networks numpy scikit-learn

Last synced: 15 Apr 2026

https://github.com/vyask21/opencv-drowsy

Drowsiness detection can be done quickly using the shape predictor face landmarks.dat model file, which marks the essential landmarks on the face, together with OpenCV modules.

numpy opencv-python scipy

Last synced: 08 Feb 2026

https://github.com/tsar123/neutral-single-nucleotide-polymorphisms-analysis

Analysis of data on neutral single nucleotide polymorphisms

dna-polymorphism matplotlib numpy pandas seaborn snv

Last synced: 01 Feb 2026

https://github.com/rishieeee/spam-email-classifier

A simple machine learning project that classifies emails as spam or ham using TF-IDF and a Multinomial Naive Bayes model. The project covers data cleaning, text preprocessing, feature extraction, model training, and evaluation. A great beginner-friendly introduction to NLP and ML workflows.

multinomial-naive-bayes numpy pandas python sckit-learn tf-idf

Last synced: 09 May 2026

https://github.com/amanda-ucc/market-tracker

Automated portfolio creator that generates a portfolio which mimics a benchmark index by minimizing the tracking error.

beta covariance numpy pandas scipy sharpe-ratio tracking-error variance yfinance

Last synced: 11 Apr 2026

https://github.com/winterwind/ecg_signal_classification

Two-part project that involves detecting the R-peaks in an ECG signal to extract the individual ECG beats and making a machine learning model to classify them

csv csv-files data-science decision-trees ecg ecg-classification ecg-signal jupyter jupyter-notebook knearest-neighbors knn machine-learning matplotlib matplotlib-pyplot numpy pandas pyplot python random-forest scipy

Last synced: 11 Apr 2026

https://github.com/bursasha/pandas-numpy-matplotlib-cavies-analysis

Complete statistical analysis of cavy lifetime dataset using Python, Pandas, NumPy, Matplotlib, and SciPy to explore, visualize, and infer the impact of bacilli infection on cavy lifetimes 🦫

cavy dataset-analysis distribution-fitting hypothesis-testing jupyter-notebook matplotlib numpy pandas python-analysis scipy statistical-analysis statistical-methods

Last synced: 09 Feb 2026

https://github.com/aksoni07/movie-recommendation

A hybrid movie recommendation system designed to deliver personalized and accurate suggestions by combining user preferences, item attributes, and collaborative patterns, ensuring a seamless and engaging experience.

clustering content-based-filtering data-analysis embeddings jupyter-notebook numpy ollaborative-filtering pandas personalization python recommendation-systems scikit-learn user-item-interactions

Last synced: 11 Apr 2026

https://github.com/sarowarahmed/predicting-kolkata-house-price

🏠 Predicting Kolkata House Price: A web app powered by a Machine Learning model, built with Numpy, Pandas, Scikit-learn, and Streamlit, to predict house prices in Kolkata. Deployed on Streamlit Cloud for easy access and real-time predictions.

app kolkata linear-regression machine-learning numpy pandas scikit-learn streamlit

Last synced: 26 Feb 2026

https://github.com/carterbox/libimage

Provides large (2k) test images as NumPy arrays.

images numpy python

Last synced: 15 Apr 2026

https://github.com/swarnabhaghosh/house-price-prediction-model

Built an end-to-end regression pipeline to predict house prices using Linear Regression with automated preprocessing (PowerTransform, StandardScaling) via Scikit-learn's Pipeline and ColumnTransformer.

column-transformer linear-regression matplotlib-pyplot numpy pandas pipeline python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/mattkevan/data-science

Notes from learning various data science topics.

data-science matplotlib numpy python3

Last synced: 09 May 2026

https://github.com/puureya2/catalogue-price-scraper

Scrape & store product prices with Selenium & BeautifulSoup4. Visually represent data with Matplotlib. Personal project.

beatifulsoup4 chrome-webdriver matplotlib numpy pandas python python-csv selenium webscraping

Last synced: 16 Apr 2026

https://github.com/cherukuri-thanu/conjoint-analysis-netflix

This repository contains configuration files for performing conjoint analysis on Netflix survey data.

conjoint-analysis logistic-regression matplotlib-pyplot netflix numpy pandas squantify statsmodels

Last synced: 16 Apr 2026

https://github.com/andydevs/optipy

Optimization algorithms written in python

numpy optimization optimization-algorithms

Last synced: 13 May 2026

https://github.com/adriantomin/heart-disease-prediction

Predicting Heart Disease with ML 🏥🤖 Machine learning model to predict heart disease using clinical data. Includes data analysis, feature engineering, and model training.

machine-learning matplotlib numpy pandas python sklearn

Last synced: 12 Feb 2026

https://github.com/ngangawairimu/data-validation-using-python

An agricultural dataset validated with Python code before use. Builds a data pipeline that ingests and cleans data at the press of a button.

jupyter-notebook numpy pandas pytest python

Last synced: 13 Apr 2026

https://github.com/asuquoaa/big_4_sports_teams_and_city_population_analysis-2018-

Analysis of sports teams' win/loss ratios vs. metro area populations across NFL, NBA, MLB, and NHL.

data-cleaning-and-preprocessing numpy pandas

Last synced: 13 May 2026

https://github.com/gmcgibbon/udacity-dlnd-project-1

Udacity Deep Learning Nanodegree - Project 1

deep-learning jupyter-notebook nn numpy udacity

Last synced: 13 May 2026

https://github.com/ishaj72/analysis_model2

Dataset analysis model using matplotlib and other libraries

machine-learning matplotlib numpy pandas

Last synced: 09 May 2026

https://github.com/kingsley-ezenwaka/mean-var-std

This is a (mini) Python project, completed as part of the requirements for the Data Analysis with Python certification from freeCodeCamp.com.

numpy python

Last synced: 09 May 2026

https://github.com/lexxai/goit_python_ds_hw_09

Module 9. Hyperparameter tuning for neural networks. Deep learning. TensorFlow. Keras.

adam-optimizer data-science google-colab keras keras-tensorflow numpy python

Last synced: 16 Apr 2026

https://github.com/felinjob/ibm-applied-data-science-capstone

This project, part of the IBM Data Science Professional Certificate specialization, predicts the landing success of SpaceX's Falcon 9. Using data from the SpaceX API and web scraping, the project includes data analysis and machine learning to generate insights about the launches.

data-analysis data-science data-visualization ibm jupyter-notebook machine-learning numpy pandas python scikit-learn seaborn sql

Last synced: 11 Apr 2026

https://github.com/venkat-0706/titanic-survival-prediction

A machine learning project predicting Titanic passenger survival using data preprocessing, feature engineering, and model optimization with Logistic Regression, Random Forest, and XGBoost.

classification-report confusion-matrix gridsearchcv matplotlib numpy onehot-encoder pandas sckit-learn seaborn train-test-split xgboost

Last synced: 09 May 2026

https://github.com/saurabh274/netflix-eda

Analyze the data and generate insights that could help Netflix in deciding which type of shows/movies to produce and how they can grow the business in different countries.

matplotlib numpy pandas python seaborn

Last synced: 28 Feb 2026

https://github.com/killervardhan8/gesturedecode

The Sign Language Interpretation project focuses on recognizing and interpreting hand gestures to facilitate communication for individuals who use sign language. This project leverages computer vision and machine learning techniques to accurately identify and translate hand signs into text

csv mediapipe numpy python tensorflow

Last synced: 28 Feb 2026

https://github.com/ben854719/trade-transfer-workflow-optimizer

This AI portfolio tool securely analyzes asset allocation, sector exposure, and performance. Its graph compares real and predicted close prices, enhanced with sentiment indicators such as “Confident” and “Surprised.” An avatar assistant provides user guidance, combining technical accuracy with intuitive explanations.

agentic-ai avatar cryptography emotion-analysis encryption gemini json langgraph langraph machine-learning-algorithms matplotlib mcp-server numpy opencv-python pandas python pytorch rs256 seaborn

Last synced: 15 Feb 2026

https://github.com/nagar2nd/ml-regressionmodel---cardekho-price-prediction

This repository features a machine learning model for predicting used car prices using data from CarDekho.com. The project leverages exploratory data analysis and regression techniques to empower sellers and buyers with actionable insights in the Indian used car market.

analytics cleaning-data data linear-regression machine-learning matplotlib numpy pandas python seaborn

Last synced: 16 Apr 2026

https://github.com/type0-1/salary-truth-predictor

A supervised machine learning regression model. Includes problem statement, approach to solution, code, images, dataset, and Jupyter Notebook for interactive analysis.

linear-regression machine-learning matplotlib-pyplot ml numpy pandas polynomial-regression projects scikitlearn-machine-learning support-vector-regression

Last synced: 16 Apr 2026

https://github.com/claby2/ascii-generator

Python program to convert images into ASCII art.

ascii numpy optparse pillow python

Last synced: 13 May 2026

https://github.com/pramodyasahan/health-insurance-cost-prediction

This project focuses on predicting health insurance costs using a polynomial regression model. By employing machine learning techniques in Python, the project aims to accurately estimate insurance costs based on various personal attributes. The model takes into account several features including age, sex, BMI, number of children, smoking status etc

machine-learning matplotlib numpy pandas python3 scikit-learn

Last synced: 16 Apr 2026

https://github.com/gregoritsch3/ml_eda_clustering_aidassessment

An EDA and Machine Learning Clustering exercise on the Country Aid Assessment dataset demonstrating the use of PCA, KMeans and DBSCAN clustering, Elbow Methods, etc. The clustering algorithm successfully demarcates countries that are in most dire need of aid based on their GDPP and Child Mortality rate.

anova dbscan kmeans machine-learning matplotlib numpy pandas pca scikit-learn seaborn statistics

Last synced: 16 Apr 2026

https://github.com/lmizner/grokking_data_science

Coding practice for basic data science interview questions in Python

data-science numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/iamsaniasingh/heart_disease_prediction

This is my very first machine learning project, where I used a supervised learning algorithm—logistic regression—to predict heart disease. The model was trained and tested entirely on a pre-existing dataset, with no user input involved. The goal was to understand how ML models work and how they can be applied in healthcare predictions.

logistic-regression machine-learning machine-learning-algorithms numpy pandas python sklearn

Last synced: 11 Apr 2026

https://github.com/balakrish001/python-course

The Python course provides a thorough foundation in Python programming, starting with core concepts and progressing to OOP. It covers essential libraries like NumPy and Pandas for data manipulation and analysis. You'll also learn how to send emails using Python's SMTP library and interact with APIs for real-world applications,

api numpy oops-in-python pandas problem-solving python smtp

Last synced: 10 Apr 2026

https://github.com/shreeparab1890/indian-cricketer-classifier

This notebook builds a model that predicts which Indian cricketer appears in a given image. The project covers 8 Indian cricketers and builds a model to classify a given image among these 8 cricketers.

image-classification matplotlib numpy opencv pandas python random-forest-classifier scikit-learn sklearn streamlit

Last synced: 01 Apr 2026

https://github.com/kunalshelke90/wine-quality-testing

This project is about creating a machine learning algorithm that can predict the quality of wine based on the given dataset. Different machine learning algorithms such as logistic regression, decision tree and random forest are used in this project.

eda feature-engineering flask machine-learning numpy pandas python

Last synced: 10 Apr 2026

https://github.com/shriansh8619/eda_customer_behavior

This project analyzes Nielsen transaction data using Python to uncover sales trends, customer preferences, and purchasing patterns. It provides insights to optimize inventory, create targeted promotions, and improve store performance. The goal is to help supermarkets enhance strategies and boost profitability based on data-driven insights

matplotlib numpy pandas python seaborn

Last synced: 16 Apr 2026

https://github.com/alexixrugis/perceptronvisualization

Visualization of training and operation of a perceptron written from scratch in numpy

ai machine-learning numpy python

Last synced: 11 Feb 2026

https://github.com/dmarks84/coursework_project_apache-airflow-kafka-on-toll-booth-data

Project for IBM Data Engineering & Python course on ETL & Big Data: reads in live toll booth data, wrangles and transforms it, and writes it into a SQL database

apache-airflow apache-kafka automation dags data-modeling databases eda elt etl mysql numpy pandas pipelines python sql

Last synced: 11 Apr 2026

https://github.com/kunalkumar2001/data-analyst-python-and-its-library

Python (basics to advanced) and its libraries for data analysts

dataanalyst matplotlib numpy pandas python seaborn

Last synced: 17 Apr 2026

https://github.com/mairagalvao/final_grades

An analysis of the final grades of students using Python

matplotlib numpy pandas python3

Last synced: 09 May 2026

https://github.com/djdhairya/crop-recommendation

Crop Recommendation System is a powerful tool for enhancing agricultural decision-making. By leveraging data-driven insights, it empowers farmers to maximize yield and ensure sustainable practices.

adaboostclassifier bagging-classifier csv decision-trees gaussian html knn-classification logistic-regression machine-learning machine-learning-algorithms matplotlib model numpy pandas random-forest random-forest-classifier scikit-learn seaborn svc

Last synced: 11 Apr 2026

https://github.com/miozilla/treebasedmodel

treebasedmodel 🌲✈️💃: Invistico Airline | Tree-based Modeling, Machine Learning (ML), Predictive, Decision Making, GridSearchCV, sklearn

accuracy decisiontreeclassifier exemplar experience f1-score flight hyperparameter logistic-regression matplotlib model numpy pandas plot-tree precision pyplot recall seaborn selection train-test-split tuning

Last synced: 03 Jun 2026