An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/ohyeasabhi/signature-recognizer

Este projeto desenvolve um sistema para verificar a similaridade de assinaturas usando redes neurais. Ele utiliza o Triplet Loss para criar embeddings, focando na avaliação biométrica com o dataset CEDAR. 🖊️🌐

backpropogation-neural-network computer-vision django image-recognition jupyter-notebook keras machine-learning-projects neural-network numpy one-shot-learning python-project siamese-neural-network signature signature-recogniton tensorflow type voice-recognition windows

Last synced: 08 Apr 2026

https://github.com/abdelrahman-lila/seam-carving

Seam Carving for Content Aware Image Resizing Project - Computer Vision Lab

numba-jit numpy python

Last synced: 10 Jul 2025

https://github.com/joeylemon/python-kmeans

an implementation of the K-means clustering algorithm in Python

k-means-clustering matplotlib ml numpy python

Last synced: 21 May 2026

https://github.com/abdelmajidlh/fifa19

Ce projet conciste à analyser et visualiser les données FIFA19. Il est destiné aux jeunes data analystes ou ceux et celles qui souhaite apprendre la programmation Python. Le projet sera mis à jour au fur et à mesure.

fifa matplotlib numpy pandas python seaborn

Last synced: 16 Apr 2026

https://github.com/rtmigo/markov_walk_py

🔢 Python module that calculates probabilities for a random walk in 1-dimensional discrete state space

absorbing-markov-chains absorbing-states markov-chain mathematics numpy probability probability-theory random-walk stochastic-matrix stochastic-models

Last synced: 21 May 2026

https://github.com/srikarveluvali/dataanalysis

The "Dataset - Extraction, Analysis, and Visualization" project is a Python-based data analysis venture that focuses on exploring and interpreting the "Video Game Sales Analysis" dataset.

css data-analysis html javascript matplotlib numpy pandas python seaborn tableau

Last synced: 09 Apr 2026

https://github.com/pramodyasahan/titanic-survival

This repository contains a machine learning project focused on predicting the survival of passengers on the Titanic. The project uses a Support Vector Regression (SVR) model from the sklearn library and involves data preprocessing and prediction.

data-preprocessing matplotlib numpy pandas python scikit-learn support-vector-regression

Last synced: 08 Apr 2026

https://github.com/hanannazri/predictive-customer-churn-modelling-with-price-sensitivity-insights-

As a part of BCG Data Science Project, I developed a predictive churn-risk model for XYZ energy utility by engineering price, consumption and contract features and training XGBoost and Random Forest models to identify customers most likely to churn at current prices.

bcgx data-visualization exploratory-data-analysis feature-engineering jupyter jupyter-notebook model-training-and-evaluation numpy pandas random-forest scikit-learn xgboost

Last synced: 19 Aug 2025

https://github.com/lemonsoda9013/easy-watermark-tool

A batch image watermarking tool based on Python and Tkinter.

batch-processing gui image-processing numpy pillow python tkinter watermark

Last synced: 15 May 2026

https://github.com/hansalemaos/numpy_choices

Some useful np.random functions

choices numpy python random

Last synced: 04 Apr 2025

https://github.com/tushar365/titanic---machine-learning-from-disaster

"Titanic: Machine Learning from Disaster" is a classic Kaggle competition for beginners https://www.kaggle.com/competitions/titanic. The goal is to use machine learning to predict which passengers survived the sinking of the Titanic based on historical data. This teaches data analysis and model building skills in a real-world context.

data-science dicision-tree logistic-regression machine-learning matplotlib-pyplot numpy pandas python seaborn

Last synced: 19 Apr 2026

https://github.com/khaja-shaik-21/heart-disease-prediction-system

This form allows users to enter key health details like age, blood pressure, cholesterol levels, and exercise results to predict the likelihood of heart disease. The data is submitted to the backend for processing, where a machine learning model provides a prediction. The form is styled for a clean and responsive user experience.

css3 flask-application git html5 logestic-regression numpy pandas python3 scikit-learn

Last synced: 12 Apr 2026

https://github.com/hac8hi/ppgmonitoring

A PyQt generated app who display PPG signal, heart rate and breath rate

numpy pca ppg ppg-signal principal-component-analysis pyqt5 pyqtgraph scipy signal-analysis signal-processing

Last synced: 06 May 2026

https://github.com/abidhasanrafi/pharma-sales-analytics

A Streamlit-powered web application for analyzing pharmaceutical sales performance across teams, products, and territories.

matplotlib numpy pandas plotly sales-analysis scikit-learn seaborn streamlit

Last synced: 08 Apr 2026

https://github.com/kumpatlapavankumar/recognizing-flowers-with-convolutional-neural-networks

This project uses a Convolutional Neural Network (CNN) to classify flower images into five categories: Daisy, Dandelion, Rose, Sunflower, and Tulip. The model is trained on the Flowers Recognition dataset with image augmentation techniques to enhance generalization.

cnn-keras coding data-science deep-learning enjoying flower-classification machine-learning numpy opencv2 pandas pathlib pillow python tensorflow testing training

Last synced: 08 Apr 2026

https://github.com/natiellitorres/python-data-science-alura

Repositório contendo meus estudos e projetos realizados durante o curso de Python para Data Science da Alura. Inclui notebooks, datasets e análises aplicadas com bibliotecas como Pandas, NumPy, Matplotlib, entre outras.

data-science jupyter-notebook matplotlib numpy pandas python

Last synced: 13 Apr 2026

https://github.com/heyitsjoealongi/fantasy-football-qbwr-model

Fantasy Football: Quarterback / Wide Receiver - Gaussian Process Regression (GPR) Machine Learning Model

machine-learning matplotlib model numpy python scikit-learn

Last synced: 01 Apr 2025

https://github.com/vyjayanthipolapragada/human_detection_hog

Detect humans in real-time, image or video and count the detected people using OpenCV and HOG Descriptor

computer-vision easygui hog-descriptor human-detection image-processing imutils numpy opencv os python support-vector-machine

Last synced: 06 May 2026

https://github.com/rkschroeder/portfolio

This repository contains a data science project.

matplotlib numpy pandas sckit-learn seaborn

Last synced: 17 Feb 2026

https://github.com/suryavamsi-p/youtube-trends-and-tracks-engagement

This project aims to leverage Python’s robust visualization libraries to derive deep insights from YouTube video trends data. Using the 'youtube.csv' dataset, this analysis explores the dynamics of YouTube video attributes such as views, likes, dislikes, and comment counts to understand user engagement and their emotional insights.

dash datavisualization hypothesis-testing matplotlib normality-test numpy pandas python3 scipy seaborn visual-studio-code

Last synced: 03 Nov 2025

https://github.com/kkKaan/tensor-library-403-hw1

A numpy like library for tensor (gergen) operations.

deep-learning numpy python tensor

Last synced: 09 Sep 2025

https://github.com/lexxai/goit_python_ds_hw_02

Модуль 2. Знайомство з Pandas, Аналіз файлів, Аналіз датасет c Kaggle.com.  Python Data Science. Homework.

datascience google-colab homework jupyter-nootebook matplotlib numpy numpy-arrays pandas python seaborn seaborn-plots

Last synced: 17 Apr 2026

https://github.com/night-fury-me/deep-learning-fau

A repository that contains implementation of DNN, CNN and RNN from Scracth using numpy only. ResNet implementation using PyTorch

cnn deep-learning dnn machine-learning numpy pytorch rnn

Last synced: 10 Jul 2025

https://github.com/mhmudfzli/loan-approval-prediction

This project demonstrates a comprehensive approach to solving a regression problem using various machine learning models. The notebook includes: Data Preprocessing, Exploratory Data Analysis (EDA), Model Training, Hyperparameter Tuning, Model Evaluation, Feature Importance

automl catboost numpy pandas python scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/praju-1/data_science_projects

It contains the necessary code, datasets, and documentation to understand, replicate, and build upon the project's findings and methodologies.

data-science datasets deep-learning exploratory-data-analysis machine-learning matplotlib numpy pandas predictive-analytics python seaborn sklearn statistics visualization

Last synced: 08 Apr 2026

https://github.com/wisskirchenj/matrix-processor

Python app to process matrices using numpy

matrix-calculations numpy python3

Last synced: 16 May 2026

https://github.com/stkisengese/numpy-data-fundamentals

A comprehensive collection of NumPy exercises covering array manipulation, slicing, broadcasting, random data generation, and real-world data analysis applications.

data data-analysis numpy pre-processing

Last synced: 16 May 2026

https://github.com/klaudiozdrava/image-analysis

A Python project that was developed as a university assignment and the goal is to colorize an grayscale image using machine learning techniques.

colorization image-processing kmeans-clustering numpy svm

Last synced: 16 May 2026

https://github.com/lorenzorottigni/ml-yelp-reviews

Machine Learning python bootcamp: natural language processing on Yelp reviews

machine-learning nlp numpy pandas python scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/singhvishal003/email-spam-detector-model

Email Spam Detector model to detect spam messages.

numpy pandas sklearn

Last synced: 08 Apr 2026

https://github.com/paulomppatricio/projeto_challenge_telecomx-br_parte-2

Projeto Challenge TelecomX-BR_Parte-2 - Formação Data Science do programa ONE - Oracle Next Education em parceria com a Alura.

data-science imblearn joblib machine-learning matplotlib modelos-preditivos numpy pandas python scipy seaborn sklearn statsmodels xgboost

Last synced: 12 Apr 2026

https://github.com/abhishekbagdiya01/movies-recommendation-system

This repository contains the code for a movie recommendation system built using Jupyter Notebook.

aiml jupyter-notebook numpy pandas python scikit-learn

Last synced: 08 Apr 2026

https://github.com/faisal-khann/banking_portfolio_risk_analysis

This analysis explores the banking customer dataset to uncover key patterns in account holdings, loan portfolios, income profiles, and transactional behavior. By integrating both financial and demographic variables, we examine customer engagement, lending trends, and risk exposure across different segments.

data-manipulation eda matplotlib numpy pandas powerbi python seaborn

Last synced: 21 Aug 2025

https://github.com/cr00z/dlschool_adv

Репозиторий для сдачи домашних заданий на первом семестре продвинутого потока курса "Deep Learning" Школы глубокого обучения ФПМИ МФТИ

matplotlib numpy pandas python pytorch sklearn

Last synced: 02 Apr 2026

https://github.com/ahmedabdalkreem/sentiment-analysis

This project performs sentiment analysis on a Twitter dataset, aiming to classify tweets into positive, negative, or neutral sentiments. Sentiment analysis is crucial for understanding public opinion on various topics, brands, or events based on social media data.

bert-model lematization matplotlib nlp nltk numpy pandas python3 sentiment-analysis stopwords streamlit

Last synced: 08 Apr 2026

https://github.com/ricomeetsworld/ml-roadmap

Opinionated roadmap to machine learning in 2023

conda huggingface machine-learning matplotlib numpy pandas python pytorch

Last synced: 06 Apr 2025

https://github.com/mr-chang95/datascience_airbnb

Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.

airbnb data-analysis data-science data-visualization jupyter-notebook numpy pandas python sklearn

Last synced: 08 Apr 2026

https://github.com/marknature/opencv-bootcamp

If you are looking to take your first steps towards learning Computer Vision and AI using OpenCV, this is the best OpenCV course to jumpstart your career.

anaconda computer-vision cv2-library ffmpeg jupyter-notebook matplotlib numpy opencv os urllib zipfile

Last synced: 02 Mar 2025

https://github.com/abhishek-k-git/feature-engineering-using-stock-exchange-data

Creating quantitative trading strategies requires historical and real-time data and involves comparing models against future returns post-training.

lightgbm matplotlib numpy pandas plotly seaborn

Last synced: 12 Apr 2026

https://github.com/stephnna/my_image_classifier

A deep learning-based image classifier built with PyTorch, designed to identify various flower species using architectures like VGG16, ResNet101, and AlexNet. This project leverages transfer learning for improved performance, checkpointing for seamless training, and supports both CPU and GPU devices. Perfect for exploring deep learning models and i

matplotlib numpy python pytorch

Last synced: 11 May 2026

https://github.com/alphacrypto246/grape-quality-prediction

The Grape Quality Prediction project uses machine learning to predict the quality of grapes based on chemical properties like acidity, sugar content, and alcohol levels. It applies regression models to forecast the quality score, helping in wine production and quality assessment.

machine-learning numpy pandas scikit-learn scikitlearn-machine-learning

Last synced: 19 Apr 2026

https://github.com/crispengari/effects-of-covid19-on-trade

This repository visulises the effects of covid19 on trade within these years:

100daysofcode datascience datascience-machinelearning matplotlib matplotlib-pyplot numpy pandas python python3

Last synced: 02 May 2026

https://github.com/sutterseba/des-python

A simple DES implementation in Python

cryptography numpy python

Last synced: 15 Jun 2026

https://github.com/shreedata/data-analysis-using-python-libraries-

The COVID-19 pandemic has significantly impacted India, necessitating a detailed analysis of the virus’s spread within the country. In this project, we explore an India-specific COVID-19 dataset, leveraging Python libraries such as Pandas, NumPy, Matplotlib, and Seaborn.

covid-19 data data-cleaning data-visualization datana kaggle-dataset matplotlib numpy pandas-python python3 pythonlibrarires scikit seaborn

Last synced: 28 Mar 2025

https://github.com/omarsaad21/credit-train-data-science-project

This a full web application to predict the credit score of clients plus I did many visulizations to express many insights in chart

eda matplotlib ml numpy pandas python sklearn streamlit-webapp

Last synced: 09 Apr 2026

https://github.com/rahul-404/full_stack_data_science_masters

Welcome to the repository for the course "Full Stack Data Science Masters". This repository is designed to accompany the course and provide resources, exercises, and projects related to the study of data science techniques.

computer-vision data-science database deep-learning exploratory-data-analysis flask machine-learning natural-language-processing numpy pandas python statistics time-series visualization

Last synced: 10 Apr 2026

https://github.com/ramyacp14/salesforecasting

Forecasts future sales for a retail company using time series analysis with Facebook Prophet. The project involves data preprocessing, exploratory data analysis (EDA), and forecasting with holiday effects considered.

data-preprocessing data-visualization exploratory-data-analysis fbprophet machine-learning matplotlib model-evaluation numpy pandas python seaborn time-series-forecasting

Last synced: 06 Apr 2026

https://github.com/caioandrian/data-analyst-procon

Análise dos dados do Procon, período de 2013 à 2016.

dataset numpy pandas python

Last synced: 11 Apr 2026

https://github.com/shrutiii1109/diwali-sales-analysis-through-python

Data analysis project on Diwali sales using Python (Pandas, NumPy, Matplotlib, Seaborn). The goal is to analyze customer behavior, identify sales trends, and provide insights to improve marketing and business strategies.

data-analysis jupyer-notebook matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/jovicdev97/Financial-Loan-DataScience-Notebook

using numpy and pandas to analyze a synthetic loan dataset with python

data-analysis matlabplot numpy pandas plotting python seaborn

Last synced: 12 Mar 2025

https://github.com/sahilmaurya28/youtube-data-analysis

YouTube Data Analysis using Python — uncovering trends, engagement patterns, and correlations between likes, comments, views, and categories to understand what drives content success.

analysis data-analysis data-visualization matplotlib-pyplot numpy pandas portfolio-project python seaborn youtube

Last synced: 13 Apr 2026

https://github.com/om-kanabar/sciencefair2025

This is my project for Chicago Public School's student science fair 2025.

chicago-public-schools matplotlib-pyplot neural-networks numpy python science-fair tensorflow

Last synced: 03 Nov 2025

https://github.com/emredemirbas/google-playstore-eda

Exploratory Data Analysis (EDA) of the Google Play Store dataset — examining trends in app ratings, categories, pricing, and user engagement using Python and LaTeX.

exploratory-data-analysis matplotlib numpy pandas python seaborn

Last synced: 03 May 2026

https://github.com/nskamaleshmani/exoseeker

🌌 Discover and analyze exoplanets with ExoSeeker, a tool designed for efficient world-hunting using Python and data visualization techniques.

exoplanet-transits exoplanets gradient-boosting machine-learning matplotlib nasa nasa-data nasa-spaceapps-challenge numpy pandas perceptron-neural-networks random-forest streamlit

Last synced: 03 Nov 2025

https://github.com/alejandrolara11/desafio_latam_introduccion_analisis_de_datos

Repositorio del curso "Introducción al Análisis de Datos" de Desafío Latam. Ejercicios prácticos realizados durante el curso, enfocados en análisis de datos con Python, Pandas, y visualización básica.

data-analysis data-science data-visualization matplotlib numpy pandas python seaborn statsmodels

Last synced: 29 Apr 2026

https://github.com/shibam120302/dog_breed_prediction_app

The Dog Scanner app will identify your dog's breed reliably in just a few seconds! Besides taking a picture, you can also record a video or upload an image from your gallery. Got a mixed breed? No problem, the Dog Scanner app also recognizes mixed breeds!

keras numpy opencv python

Last synced: 06 Apr 2026

https://github.com/utkarsh251106/tracking-with-yolo

This project uses YOLOv8 and DeepSORT to detect and track children and adults in video streams. It assigns unique IDs, handles re-tracking after occlusions, and outputs an annotated video with labeled bounding boxes.

computer-vision deep-learning deepsort machine-learning numpy opencv python torch ultralytics yolov8

Last synced: 30 Jan 2026

https://github.com/terrelbrinkley/python-projects

Python Scripts & Data Analyst Projects

aws numpy pandas python selenium

Last synced: 10 Apr 2026

https://github.com/manuethomas/traffic-accident-analysis-us

The project provides a comprehensive analysis of traffic accidents in the US from 2016-2023 aiming to identify key factors contributing to accidents. The analysis also focussed on finding features that could be used to develop a predictive model

exploratory-data-analysis feature-engineering feature-selection matpllotlib numpy pandas seaborn

Last synced: 20 Mar 2025

https://github.com/mukhtarmid/data-science

This repository is for the knowledge of data science.

datascience eda numpy pandas

Last synced: 10 May 2026

https://github.com/magzybog12/real-time-object-distance-measurement-using-opencv-python

Using an A4 paper as a guide, performing the distance measurements to find the height and width of objects placed in the region

numpy opencv-python pycharm-ide

Last synced: 05 May 2026

https://github.com/isk-daniar/-contrast-improvements-on-pil

Contrast improvements on PIL

numpy pillow python

Last synced: 10 Jun 2026

https://github.com/shibbir-ahmad24/a-data-driven-approach-to-food-security-and-supermarket-accessibility

A Data-Driven Approach to Food Security and Supermarket Accessibility

data-analysis matplotlib numpy pandas python3 seaborn

Last synced: 05 Apr 2025

https://github.com/nishthasharma-22/binary-black-hole-merger-gravitational-waves-simulation

Repository of all astronomy related projects, including: Gravitational Waves graph from binary black hole merger

astrophysics blackhole-merger gravity-simulator matplotlib numpy python scipy

Last synced: 05 May 2026

https://github.com/nazmul-1117/100-days-of-machine-learning

I'm Nazmul so exited to start a new journey to learn 100 Days of Machine Learning. It's February 8, 2025. I'm so exited, let's see what happened insha'Allah

data-science machine-learning numpy pandas-dataframe python3 scikit-learn statistics

Last synced: 11 Aug 2025

https://github.com/thavinduushan/object-detection

Application developed for real-time object detection and counting utilizing COCO dataset

numpy opencv python

Last synced: 15 Apr 2026

https://github.com/yalai92/cava_imp_exp_analysis

Analysis, visualization, preprocessing and clustering of global sparkling wine trade (2017–2024) using Python in Colab and ML to reveal trends and country profiles.

clustering colab-notebook data-science data-visualization machine-learning matplotlib numpy pandas preprocessing python sklearn tableau-public

Last synced: 25 Jan 2026

https://github.com/apuravdivekar2032/real-estate-price-prediction

A real estate price prediction website developed using Python with Numpy and Pandas for data cleaning Matplotlib for data visualization and scikit-learn for model building, featuring a Flask server to handle HTTP requests and integrates a user-friendly UI built with HTML, CSS, and JavaScript

flask html-css-javascript jupyter-notebook matplotlib numpy pandas pycharm python3 sklearn vscode

Last synced: 20 Jan 2026

https://github.com/chaitanyac22/cross_platform_product_mapping_algorithm_for_products

This repository contains a product ID mapping solution using TF-IDF vectorizer for weighted text vectors, Facebook AI Similarity Search (FAISS) for coarse filtering with cosine similarity, and Levenshtein distance for refined matching against the Blinkit catalog. Achieved 11.45% match for Zepto and 11.48% for Instamart.

exploratory-data-analysis faiss levenshtein-distance nlp numpy pandas similarity-search tf-idf-vectorizer

Last synced: 20 Mar 2025

https://github.com/ansh2709/customer-segmentation-ml-project

Project segregates the customers on the basis of their spending score and annual income using K-Means Clustering that is a part of unsupervised learning

clustering-algorithm k-means-clustering machine-learning matplotlib-pyplot numpy pandas python unsupervised-machine-learning wcss

Last synced: 01 May 2026

https://github.com/hawkharsh1/house-price-pridiction-model-using-ann

A deep learning-based regression model built using Artificial Neural Networks (ANN) in PyTorch to predict house prices from structured data. This project demonstrates the application of machine learning and deep learning techniques for solving real-world problems in the housing domain.

artificial-neural-networks deep-neural-networks machine-learning numpy pandas python3 pytorch scikit-learn

Last synced: 08 Apr 2026

https://github.com/hadson0/dip-candy-classification

Candy classification project using digital image processing. Includes dataset creation, model development, and evaluation tools with TensorFlow, OpenCV, and Albumentations. Complete setup and instructions provided for building and training the model.

albumentations bounding-boxes cnn computer-vision cvat image-classification image-processing keras machine-learning matplotlib numpy opencv python seaborn sklearn tensorflow

Last synced: 29 Sep 2025

https://github.com/amardeepsingh-06/smart_attendance_system

The Smart Attendance System is designed to automate the attendance process using facial recognition. Traditional methods of taking attendance can be time-consuming and prone to errors. This project aims to streamline the process by leveraging computer vision technology, allowing for efficient, contactless, and accurate attendance tracking.

asgiref django mysql numpy opencv pillow-library python sqlparse

Last synced: 05 Apr 2026

https://github.com/ijproject/calculate-absorption-rate

大気を構成する分子ごとの赤外線吸収割合を計算するプログラム。

numpy python

Last synced: 11 May 2026

https://github.com/dwija12903/mentorness-internship

Developed and applied technical skills in areas such as programming languages, data analysis, and machine learning methodologies.

matplotlib numpy pandas python scikit-learn

Last synced: 08 Apr 2026

https://github.com/cyberlument/opencv-colormasking-red-

This mini project opencv is for educational purpose.

numpy opencv opencv-python pycharm vscode

Last synced: 20 Jan 2026

https://github.com/priyasingh26/financial_document-data_extraction

This project extracts key information from financial documents like invoices and receipts using text recognition. It processes images, classifies documents, and extracts data, which is then stored in a CSV file. The aim is to automate data collection from scanned documents, reducing manual work and increasing accuracy.

data-extraction numpy ocr pandas pillow preprocessing pytesseract-ocr python sklearn torch transformers

Last synced: 08 Apr 2026

https://github.com/caefleury/cis-ieee-difusion-model

Repositório teste para códigos do projeto de Modelo de Difusão do branch CIS-IEEE

docker docker-compose numpy python3 tensorflow

Last synced: 08 Apr 2026

https://github.com/yash-3-bit/human-activity-recognition-using-smartphone-data

Human Activity Recognition (HAR) Using Smartphone Data This project leverages smartphone sensor data to recognize human activities such as walking, running, sitting, and standing.

numpy pandas python scikitlearn-machine-learning seaborn

Last synced: 09 Apr 2026