An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/nik-kusanagi/jupyter

Treinamento Para Análise de Dados.

big-data jupyter jupyter-notebook matplotlib numpy pandas python python3

Last synced: 30 Apr 2026

https://github.com/sumaiyyaf/spotify-data-analysis

This project explores Spotify music data, using Python to clean, analyze, and visualize the dataset. It examines the relationship between audio features and sentiment in song lyrics, uncovering patterns and trends to provide valuable insights into music preferences.

eda numpy pandas python

Last synced: 08 May 2026

https://github.com/vjpaij/datascience-udemy

Various basic codes on datascience

machine-learning matplotlib numpy pandas python seaborn

Last synced: 15 Apr 2026

https://github.com/claby2/mouse-track

Track, display, and graph the position of a mouse cursor.

c cpp matplotlib numpy python sdl2 windows

Last synced: 12 Apr 2026

https://github.com/abynxv/automatedexamsheetgrader-ocr

This Python project automates grading of handwritten or printed answer sheets using OCR (Optical Character Recognition). It extracts student answers from scanned images, compares them to an answer key, and generates a detailed report with class statistics.

numpy ocr-recognition opencv pandas python3 tesseract

Last synced: 27 Jun 2025

https://github.com/baschin1103/principal_component_analysis

In this repository you find a python program and the prints and 3D-visualization of it. After the KNN-Classification I wanted to know which variables have the most relevance for the results. One approach for this is the Principal-Component-Analysis (PCA). More details in the python program as comments.

3d-printing labelencoder matplotlib numpy pandas principal-component-analysis sklearn-library sqlalchemy standardization variable-importance

Last synced: 13 May 2026

https://github.com/aritrakar/ml-from-scratch

🤖Common ML implementations from scratch.

machine-learning numpy pandas python pytorch

Last synced: 07 Apr 2026

https://github.com/1adore1/face-analysis

An application for face recognition and prediction of gender, age range and emotional state.

numpy opencv pandas python pytorch

Last synced: 07 Apr 2026

https://github.com/abdelmajidlh/fifa19

Ce projet conciste à analyser et visualiser les données FIFA19. Il est destiné aux jeunes data analystes ou ceux et celles qui souhaite apprendre la programmation Python. Le projet sera mis à jour au fur et à mesure.

fifa matplotlib numpy pandas python seaborn

Last synced: 16 Apr 2026

https://github.com/michaeldehaney94/opencv

Python OpenCV computer vision projects

matplotlib numpy opencv-python python

Last synced: 20 Jun 2025

https://github.com/lorenzorottigni/ml-interview

Machine Learning python bootcamp: K Nearest Neighbors job interview simulation dataset

ipynb k-nearest-neighbours machine-learning numpy pandas python scikit-learn seaborn

Last synced: 07 Apr 2026

https://github.com/khaja-shaik-21/heart-disease-prediction-system

This form allows users to enter key health details like age, blood pressure, cholesterol levels, and exercise results to predict the likelihood of heart disease. The data is submitted to the backend for processing, where a machine learning model provides a prediction. The form is styled for a clean and responsive user experience.

css3 flask-application git html5 logestic-regression numpy pandas python3 scikit-learn

Last synced: 12 Apr 2026

https://github.com/abidhasanrafi/pharma-sales-analytics

A Streamlit-powered web application for analyzing pharmaceutical sales performance across teams, products, and territories.

matplotlib numpy pandas plotly sales-analysis scikit-learn seaborn streamlit

Last synced: 08 Apr 2026

https://github.com/kumpatlapavankumar/recognizing-flowers-with-convolutional-neural-networks

This project uses a Convolutional Neural Network (CNN) to classify flower images into five categories: Daisy, Dandelion, Rose, Sunflower, and Tulip. The model is trained on the Flowers Recognition dataset with image augmentation techniques to enhance generalization.

cnn-keras coding data-science deep-learning enjoying flower-classification machine-learning numpy opencv2 pandas pathlib pillow python tensorflow testing training

Last synced: 08 Apr 2026

https://github.com/vyjayanthipolapragada/human_detection_hog

Detect humans in real-time, image or video and count the detected people using OpenCV and HOG Descriptor

computer-vision easygui hog-descriptor human-detection image-processing imutils numpy opencv os python support-vector-machine

Last synced: 06 May 2026

https://github.com/suryavamsi-p/youtube-trends-and-tracks-engagement

This project aims to leverage Python’s robust visualization libraries to derive deep insights from YouTube video trends data. Using the 'youtube.csv' dataset, this analysis explores the dynamics of YouTube video attributes such as views, likes, dislikes, and comment counts to understand user engagement and their emotional insights.

dash datavisualization hypothesis-testing matplotlib normality-test numpy pandas python3 scipy seaborn visual-studio-code

Last synced: 03 Nov 2025

https://github.com/itzzmeakhi/ab-testing

This project was done as a part of Udacity's Data Analyst Nanodegree program in which AB-Testing is performed for the users under treatment and control groups to predict whether changes should be done for an website or it is ideal to keep the existing one, based on the performance of old and new pages. Hypothesis testing, z-test, and Logistic Regression is performed to predict results.

bootstrapping hypothesis-testing jupyter-notebook matplotlib numpy pandas python statistics z-test

Last synced: 04 May 2026

https://github.com/praju-1/data_science_projects

It contains the necessary code, datasets, and documentation to understand, replicate, and build upon the project's findings and methodologies.

data-science datasets deep-learning exploratory-data-analysis machine-learning matplotlib numpy pandas predictive-analytics python seaborn sklearn statistics visualization

Last synced: 08 Apr 2026

https://github.com/faraazarsath/handson-numpy

This respository is on various Numpy Tasks provided as academic assignment

numpy

Last synced: 16 May 2026

https://github.com/lorenzorottigni/ml-yelp-reviews

Machine Learning python bootcamp: natural language processing on Yelp reviews

machine-learning nlp numpy pandas python scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/singhvishal003/email-spam-detector-model

Email Spam Detector model to detect spam messages.

numpy pandas sklearn

Last synced: 08 Apr 2026

https://github.com/abhishekbagdiya01/movies-recommendation-system

This repository contains the code for a movie recommendation system built using Jupyter Notebook.

aiml jupyter-notebook numpy pandas python scikit-learn

Last synced: 08 Apr 2026

https://github.com/ricomeetsworld/ml-roadmap

Opinionated roadmap to machine learning in 2023

conda huggingface machine-learning matplotlib numpy pandas python pytorch

Last synced: 06 Apr 2025

https://github.com/aysh2603/credit-card-fraud-detection

The Credit Card Fraud Detection project aims to identify fraudulent transactions from a dataset of credit card transactions. The project addresses the challenge of class imbalance and employs advanced machine learning techniques to build an effective fraud detection model.

ensemble-learning hyperparameter-tuning numpy pandas python3 scikit-learn streamlit

Last synced: 04 Feb 2026

https://github.com/marknature/opencv-bootcamp

If you are looking to take your first steps towards learning Computer Vision and AI using OpenCV, this is the best OpenCV course to jumpstart your career.

anaconda computer-vision cv2-library ffmpeg jupyter-notebook matplotlib numpy opencv os urllib zipfile

Last synced: 02 Mar 2025

https://github.com/aqueeqazam/a-deep-dive-introduction-to-tensorflow

TensorFlow is a deep learning system that provides an interface for expressing deep learning algorithms and implementing them. It uses dataflow graphs to represent computation, shared state, and the operations that mutate that state.

deep-learning neural-network numpy tensor tensorflow

Last synced: 03 Aug 2025

https://github.com/sadegh-khedry/pneumonia-detection-using-cnn

Classifying chest X-ray images into two categories: Pneumonia or Normal using a Convolutional Neural Network (CNN).

cnn convolutional-neural-networks cv2 deep-learning harris-corners joblib jupyter-notebook keras kmeans-clustering matplotlib numpy orb pandas python seaborn shutil sifit tensorflow

Last synced: 11 Apr 2026

https://github.com/subhashpolisetti/crispdm-semma-kdd-workflows

This repository demonstrates three data mining methodologies applied to various real-world datasets: CRISP-DM (Weather Analysis), KDD (Social Media Ads Analysis), and SEMMA (Spotify Recommendation System). Each project includes data exploration, preprocessing, modeling, and evaluation steps, along with comprehensive documentation, supporting files,

clustering-algorithm crisp-dm kdd latex-template machine-learning numpy pandas random-forest research-paper semma

Last synced: 07 May 2026

https://github.com/mohammedrashithkp/drlforsnakegame

This project utilises a Deep Reinforcement Learning specifically Q learning implemented using PyTorch which learns the game by interacting with the pygame environment .I have followed a tutorial on Freecodecamp YouTube Channel and made the initial model.

deep-reinforcement-learning matplotlib-pyplot numpy pygame python

Last synced: 16 May 2026

https://github.com/omarsaad21/credit-train-data-science-project

This a full web application to predict the credit score of clients plus I did many visulizations to express many insights in chart

eda matplotlib ml numpy pandas python sklearn streamlit-webapp

Last synced: 09 Apr 2026

https://github.com/mah-22/room-occupancy-prediction-using-environmental-sensor-data

This project uses environmental sensor data to predict room occupancy, providing valuable insights for efficient energy management and space utilization in buildings. By analyzing factors like temperature, humidity, and light levels, the model aims to accurately forecast when rooms will be occupied, optimizing resources and enhancing overall buildi

classification data-science data-visualization exploratory-data-analysis machine-learning numpy pandas python seaborn time-series

Last synced: 07 May 2026

https://github.com/evernext10/coursera_capstone

This repository is created for sending the IBM Data Science Lab and will be used primarily for the Capstone project.

coursera-data-science ibm numpy pandas

Last synced: 10 May 2026

https://github.com/anufrievsy/man_women

Данный скрипт был создан с целью поиска субъективно интересных серий ТВ-шоу "Мужское Женское"

bs4 clear-output fuzzywuzzy ipython ipython-display jupyter jupyter-notebook jupyter-notebooks matplotlib matplotlib-pyplot numpy pandas pandas-dataframe python requests time youtube youtube-search

Last synced: 05 Feb 2026

https://github.com/shrutiii1109/diwali-sales-analysis-through-python

Data analysis project on Diwali sales using Python (Pandas, NumPy, Matplotlib, Seaborn). The goal is to analyze customer behavior, identify sales trends, and provide insights to improve marketing and business strategies.

data-analysis jupyer-notebook matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/sahilmaurya28/youtube-data-analysis

YouTube Data Analysis using Python — uncovering trends, engagement patterns, and correlations between likes, comments, views, and categories to understand what drives content success.

analysis data-analysis data-visualization matplotlib-pyplot numpy pandas portfolio-project python seaborn youtube

Last synced: 13 Apr 2026

https://github.com/emredemirbas/google-playstore-eda

Exploratory Data Analysis (EDA) of the Google Play Store dataset — examining trends in app ratings, categories, pricing, and user engagement using Python and LaTeX.

exploratory-data-analysis matplotlib numpy pandas python seaborn

Last synced: 03 May 2026

https://github.com/nskamaleshmani/exoseeker

🌌 Discover and analyze exoplanets with ExoSeeker, a tool designed for efficient world-hunting using Python and data visualization techniques.

exoplanet-transits exoplanets gradient-boosting machine-learning matplotlib nasa nasa-data nasa-spaceapps-challenge numpy pandas perceptron-neural-networks random-forest streamlit

Last synced: 03 Nov 2025

https://github.com/pradyumnakr/facial-recognition-using-opencv

Uses haarcascades to recognize facial features, patterns and performs feature extraction followed by labelling the data to recognize the faces on a live webcam.

numpy opencv python3

Last synced: 09 May 2026

https://github.com/moritzsalla/cci-advanced-visualization

Classwork for Mick Grierson's module at the Creative Computing Institute

matplotlib numpy openframeworks python tensorflow

Last synced: 06 May 2026

https://github.com/ngangawairimu/data-validation-using-python

Agricultural dataset validated using python code for usage. Building a data pipeline that will ingest and clean data with the press of a button.

jupyter-notebook numpy pandas pytest python

Last synced: 13 Apr 2026

https://github.com/isk-daniar/-contrast-improvements-on-pil

Contrast improvements on PIL

numpy pillow python

Last synced: 10 Jun 2026

https://github.com/nishthasharma-22/binary-black-hole-merger-gravitational-waves-simulation

Repository of all astronomy related projects, including: Gravitational Waves graph from binary black hole merger

astrophysics blackhole-merger gravity-simulator matplotlib numpy python scipy

Last synced: 05 May 2026

https://github.com/vishnu-vamshii/heart-disease-prediction-using-ml

This project presents an end-to-end data analysis and machine learning pipeline for predicting heart disease using a publicly available dataset. The project includes data exploration, visualization, and implementation of various machine learning models to predict the likelihood of heart disease based on a set of clinical attributes.

machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 11 Apr 2026

https://github.com/mohammedsaqibms/planar_data_classification_with_onehidden_layer

This repository implements a simple neural network for binary classification of 2D planar data using Python and NumPy. It compares logistic regression with neural networks and includes code for forward/backward propagation, gradient descent, and decision boundary visualization.

backpropagation binary-classification gradient-descent logistic-regression machine-learning neural-networks numpy python3

Last synced: 10 Jul 2025

https://github.com/eljandoubi/genre_classification

Create an ML pipeline for Genre Classification using MLflow.

hydra machine-learning mlflow numpy pandas pandas-profiling pytest scikit-learn scipy wandb

Last synced: 11 Apr 2026

https://github.com/hawkharsh1/house-price-pridiction-model-using-ann

A deep learning-based regression model built using Artificial Neural Networks (ANN) in PyTorch to predict house prices from structured data. This project demonstrates the application of machine learning and deep learning techniques for solving real-world problems in the housing domain.

artificial-neural-networks deep-neural-networks machine-learning numpy pandas python3 pytorch scikit-learn

Last synced: 08 Apr 2026

https://github.com/audy21/datacamp

Learning portfolio documenting my progress, while taking Data Analyst & Data Science certifications from DataCamp.

data-analysis data-science machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/ijproject/calculate-absorption-rate

大気を構成する分子ごとの赤外線吸収割合を計算するプログラム。

numpy python

Last synced: 11 May 2026

https://github.com/alejoduarte23/si_bayesianmixturemodel

Implementation of a two-stage fast Bayesian system identification for separated Modes. This repository expands the usage of this technique by adding a mixture model fit to obtain modal parameters from the posterior distribution.

matplotlib numpy scikit-learn scipy

Last synced: 20 Feb 2026

https://github.com/cyberlument/opencv-colormasking-red-

This mini project opencv is for educational purpose.

numpy opencv opencv-python pycharm vscode

Last synced: 20 Jan 2026

https://github.com/priyasingh26/financial_document-data_extraction

This project extracts key information from financial documents like invoices and receipts using text recognition. It processes images, classifies documents, and extracts data, which is then stored in a CSV file. The aim is to automate data collection from scanned documents, reducing manual work and increasing accuracy.

data-extraction numpy ocr pandas pillow preprocessing pytesseract-ocr python sklearn torch transformers

Last synced: 08 Apr 2026

https://github.com/amanyadav-07/customer-churn-prediction

Machine Learning project to predict customer churn using Logistic Regression, Random Forest, and XGBoost. Includes data preprocessing, feature engineering, SMOTE balancing, model training, evaluation, and business insights.

accuracy-metrics data-analysis data-visualization logistic-regression machine-learning matplotlib numpy pandas python3 random-forest-classifier seaborn sklearn xgboost-classifier

Last synced: 11 Apr 2026

https://github.com/mkdirer/depression-data-analysis

This project analyzes a Kaggle depression dataset using data preprocessing, clustering, classification, and outlier detection techniques. Python libraries like pandas, numpy, matplotlib, seaborn, and scikit-learn are used to extract insights.

classification clustering matplotlib numpy pandas scikit-learn seaborn vizualization

Last synced: 12 Apr 2025

https://github.com/arthurdsant/dataanalysis-agricultural_raw_material

This Python project performs analysis and visualization of agricultural raw material price data using a Kaggle dataset. Based on Jupiter Notebook and Python.

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 26 Jan 2026

https://github.com/poziloi/-image_processing-

Методы и алгоритмы цифровой обработки изображений, задания

cv2 matplotlib numpy python

Last synced: 20 Jan 2026

https://github.com/offthetab/statistics_homework

Практические работы по статистике МИРЭА 3 курс.

numpy pandas python statsmodels

Last synced: 05 Apr 2026

https://github.com/michelenana/projet-7

PRODUISEZ UNE ETUDE DE MARCHE AVEC R OU PYTHON

acp boxplots cah kmeans matplotlib numpy pandas python scipy seaborn sklearn

Last synced: 08 Apr 2026

https://github.com/shreeparab1890/laptop-price-predictor

This notebook is trying to build a model which will predict the price of the laptop based on the given specifications.

machine-learning-algorithms matplotlib numpy pandas price-prediction python scikit-learn sklearn

Last synced: 06 Apr 2026

https://github.com/jancervenka/genalg

Solver for traveling salesman problem

genetic-algorithm numpy python

Last synced: 12 Apr 2025

https://github.com/asut00/python-piscine_42ai

Python Bootcamp: A one-week intensive course with 42AI at École 42, covering Python fundamentals, data manipulation, and introductory AI concepts.

matplotlib numpy pandas python

Last synced: 07 May 2026

https://github.com/haydencordeiro/terafeed

Terafeed - Addressing Zero Hunger in Africa (Sustainability Goal SDG 2)

javscript numpy pandas powerbi python scikit-learn tableau vuejs

Last synced: 08 Apr 2026

https://github.com/aksoni07/movie-recommendation

A hybrid movie recommendation system designed to deliver personalized and accurate suggestions by combining user preferences, item attributes, and collaborative patterns, ensuring a seamless and engaging experience.

clustering content-based-filtering data-analysis embeddings jupyter-notebook numpy ollaborative-filtering pandas personalization python recommendation-systems scikit-learn user-item-interactions

Last synced: 11 Apr 2026

https://github.com/chandkund/image-classification-using-the-mnist-dataset

Image Classification using the MNIST dataset. This project leverages a Convolutional Neural Network (CNN) to recognize and classify handwritten digits with high accuracy. Includes data preprocessing, model architecture, and evaluation. Explore the code and results here!

computer-vision data-science machine-learning matplotlib numpy pandas python

Last synced: 08 Apr 2026

https://github.com/rahatmoktadir03/customer-churn-prediction

A machine learning project for predicting customer churn, enabling businesses to identify at-risk customers and develop retention strategies.

business-analytics customer-churn-prediction data-science google-colab machine-learning numpy pandas python sklearn strreamlit xception-model

Last synced: 11 Apr 2026

https://github.com/allenvox/neural

Workspace for Neural Networks class

jupyter-notebook neural-networks numpy python pytorch tensorflow

Last synced: 02 Jan 2026

https://github.com/zborovskaanna/grosery_store_sales_analysis

Python data analysis project. Analysis of grocery store sales using visualizations and reporting in Tableau

data-analysis data-visualization matplotlib numpy pandas python seaborn tableau

Last synced: 08 Apr 2026

https://github.com/raghavendranhp/airbnb-data-analysis

The Airbnb Data Analysis project focuses on analyzing Airbnb data using MongoDB Atlas, Python scripting, data preprocessing, visualization, and interactive geospatial insights. We delve into the world of property management and tourism to uncover trends, pricing variations, and location-based analysis.

eda jupyter-notebook mongodb numpy pandas powerbi preprocessing

Last synced: 08 Apr 2026

https://github.com/jaffarkeikei/hackbio

This repository contains a comprehensive toolkit for analyzing single-cell perturbation data from the NeurIPS 2023 Competition. The project focuses on understanding how different drugs (small molecules) affect various cell types at the gene expression level.

csv-files jupyter-notebook matplotlib numpy python tensors

Last synced: 11 May 2026

https://github.com/vipinchaudhary31122002/cardiopredict

CardioPredict is a smart, ML-powered application that helps analyze your risk of heart disease using interactive forms, clinical features, and trained classification models.

machine-learning matplotlib numpy pandas python3 random-forest seaborn streamlit

Last synced: 12 Apr 2025

https://github.com/vipinchaudhary31122002/bytephone

A data science project that explores, visualizes, and models smartphone features from a futuristic 2025 dataset to predict mobile prices using machine learning.

machine-learning matplotlib numpy pandas python3 scikit-learn seaborn streamlit

Last synced: 13 Apr 2026

https://github.com/tberchanov/clustering-k-means

Sample program where is implemented clustering by K-means algorithm, and its visualisation.

clustering-algorithm kmeans-clustering matplotlib numpy python

Last synced: 16 May 2026

https://github.com/themihirmathur/machine-failure-predictor

The 'Machine Failure Predictor' is a web application designed to enable predictive maintenance for industrial equipment. Leveraging machine learning techniques, this application predicts whether equipment will fail and identifies the type of failure based on key process parameters.

cicd digitalocean docker fastapi flask github github-actions machine-learning matplotlib mlops numpy plotly python randomforest scikit-learn seaborn streamlit

Last synced: 10 Apr 2026

https://github.com/winstxnhdw/pyalign

An over-engineered website for aligning multiline strings to a common character with PyScript.

numpy pyscript uv

Last synced: 13 Jun 2025

https://github.com/bishopce16/surfs_up

An analysis of weather data to determine opening of an ice cream and surf shop using SQLite, SQLAlchemy, and Python.

matplotlib numpy pandas python sqlalchemy sqlite weather

Last synced: 02 Apr 2026

https://github.com/rdvdev2/tf-test

Mostres de ML i IA amb TensorFlow basades en els tutorials oficials per al PR

numpy python research-project tensorflow tensorflow-tutorials

Last synced: 22 Mar 2025

https://github.com/danielpancake/mlml

my little machine learning library written in NumPy

machine-learning numpy

Last synced: 05 Apr 2025

https://github.com/omraj0/covid19-data-analysis

Analysis of COVID-19 infection rates in various countries, correlating them with factors such as GDP per capita and social support.

covid-19 google-colab matplotlib numpy pandas python

Last synced: 19 Apr 2026

https://github.com/chintanboghara/rocket-simulation

A comprehensive web-based orbital mechanics simulator with advanced mission planning, real-time tracking, and educational features.

docker flask html javascript numpy plotly python

Last synced: 11 Apr 2026

https://github.com/nachtfeuer/covid19

Python script(s) for visualizing corona data

csv json matplotlib numpy pandas python requests tkinter

Last synced: 05 May 2026