An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/mnitin-reddy/content-based-recommendation-system-using-deep-learning

A content-based movie recommendation system using deep learning to predict user ratings by leveraging user and movie features. The system integrates neural networks for feature extraction, utility scripts for data processing, and supports both new and existing user recommendations.

deep-learning keras neural-networks numpy pandas python scikit-learn tensorflow

Last synced: 03 Apr 2026

https://github.com/rosieoh/emergency_dataanalysis

오픈데이터분석-응급의료체계 방안 정책 제안 데이터 분석

ipython matplotlib numpy pandas python scikit-learn scipy

Last synced: 04 Apr 2026

https://github.com/neetigyab/pdf_form_parser

Older Test version of designated PDFReader project

excel forms matplotlib numpy ocr opencv pandas pdf pdf-converter pdfplumber pytesseract python

Last synced: 17 Apr 2026

https://github.com/anag0es/projetos-ml

Este é um lugar onde guardo vários projetos para entender como funciona a aprendizagem de máquina. Cada projeto é uma forma de praticar o que estou aprendendo nos meus estudos.

artificial-intelligence crisp-dm data-science deep-learning learning machine-learning neural-network numpy pandas prediction-model python

Last synced: 17 Apr 2026

https://github.com/gregoritsch3/project_python_jobs_data_analysis

A multi-library Data Analysis Python project that draws useful insights from existing job data. It provides useful exploratory insight about job postings in the Data sector, such as the distribution of remote/non-remote jobs, jobs with/without healthcare, top paying and most demanded data roles, and more.

matplotlib numpy pandas python scipy seaborn

Last synced: 04 Apr 2026

https://github.com/datamallo/python101

The ressources you need to learn or get better in Python. Covering the basics of Python3, NumPy and Pandas for

numpy pandas python3

Last synced: 18 Apr 2026

https://github.com/jinhx128/data_analysis

基于Matplotlib+Numpy+Pandas搭建的数据分析项目。

matpolotlib numpy pandas python

Last synced: 18 Apr 2026

https://github.com/compcode1/nfl-analysis

The objective of this analysis was to explore potential correlations between player attributes (such as average age and weight across different positions) and key team performance metrics from the 2023 NFL season.

datacleaning datanormalization numpy pandas pearson-correlation-coefficient pvalues

Last synced: 18 Apr 2026

https://github.com/yassin522/music-genre-classification

This repository is dedicated to the Music Genre Classification project, an endeavor that leverages machine learning techniques to accurately classify music into various genres

catboost matplotlib numpy pandas python sklearn

Last synced: 04 Apr 2026

https://github.com/awanraskall/retail-demand-analysis

Data analysis of retail meal orders, fulfillment centers, and product demand using Python

data-analysis data-visualization jupyter-notebook numpy pandas python

Last synced: 18 Apr 2026

https://github.com/bvsam/nnfs

Basic neural network built using Python and Numpy. Created to better understand neural networks.

artificial-intelligence deep-learning machine-learning neural-network neural-networks numpy python3

Last synced: 18 Apr 2026

https://github.com/gregoritsch3/ml_clustering_eda_customersegmentation

An EDA and Machine Learning Clustering exercise on the Mall Customer Segmentation synthetic dataset demonstrating the use of KMeans Clustering and the Elbow Method. The clustering algorithm successfully segments the customer base into groups distinguishable by their annual income and spending score.

clustering kmeans-clustering machine-learning matplotlib numpy pandas scikit-learn scipy seaborn

Last synced: 04 Apr 2026

https://github.com/abdul-rafay19/california-housing-price-prediction

This project predicts California housing prices using machine learning regression models, including Random Forests and Decision Trees. It covers data preprocessing, exploratory analysis, model training, and hyperparameter tuning to optimize performance.

decision-trees gridsearchcv linear-regression matplotlib numpy pandas python random-forest randomsearch-cv scikit-learn scipy seaborn

Last synced: 04 Apr 2026

https://github.com/mnitin-reddy/a-b-testing-and-regression-analysis-for-ad-performance-optimization

Analyzed the performance of Facebook and AdWords ads using A/B testing and regression analysis to identify trends, correlations, and cost-effectiveness. Key insights included distribution of clicks and conversions, monthly trends, and cost-per-conversion analysis to optimize ROI.

abtesting data-science hypothesis-testing machine-learning matplotlib numpy pandas scikit-learn scipy seaborn statsmodels

Last synced: 04 Apr 2026

https://github.com/ccolpasm/pageviewanalyzer

PageViewAnalyzer analyzes and visualizes time series data of forum visits (2016-2019). It uses Pandas, Matplotlib, and Seaborn to create line plots, bar charts, and box plots, highlighting trends, monthly averages, and seasonality in page views. Perfect for data exploration with Python.

data-science matplotlib numpy pandas python

Last synced: 18 Apr 2026

https://github.com/yasinefeee/parkspotter_pretest_enviroment

The ParkSpotter project is designed to detect the occupancy status of parking spots in a simulation environment. Using a toy model, a camera system, and a machine learning model, this system identifies whether a parking space is EMPTY or NOT EMPTY in real-time.

ai-systems classifier computer-vision cv2 numpy opencv parking-spot-detection parking-spots python sckit-image sckit-learn simulation-environment svm svm-classifier

Last synced: 18 Apr 2026

https://github.com/sultanazhari/decisive-model-with-high-profit-potential-and-low-risk

OilyGiant mining company finding the best place for 200 new well points, As an Data Scientist we're creating a model who can choose the best 200 point by profit and risk.

accuracy-score chi2-contingency confusion-matrix logistic-regression matplotlib-pyplot numpy pandas python3 random-forest roc-auc-score seaborn train-test-using-sklearn

Last synced: 18 Apr 2026

https://github.com/hariprasath-v/machinehack-analytics-olympiad-2022

Create a machine learning model to help an insurance company understand which claims are worth rejecting and the claims which should be accepted for reimbursement.

catboost-classifier exploratory-data-analysis logloss machinehack numpy optuna pandas python scikit-learn shap

Last synced: 18 Apr 2026

https://github.com/alezoon/movie-revenue-prediction

Sk-learn practice using Linear Regression, ML workflow practice.

jupyter machine-learning matplotlib-pyplot numpy pandas python scikit-learn

Last synced: 05 Apr 2026

https://github.com/murugavl/flower-prediction

Flower Prediction is a machine learning project that uses the Iris dataset to classify iris flowers into three species: Setosa, Versicolor, and Virginica. The project includes data analysis, model training with various algorithms, and deployment via a Flask web application for user-friendly predictions.

flask machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/stffns/snapvec

Fast compressed ANN search via randomized Hadamard transform + Lloyd-Max quantization. Pure NumPy.

ann embeddings hadamard numpy quantization rag vector-search

Last synced: 07 Apr 2026

https://github.com/iv4n-ga6l/titanic-survival-prediction

a predictive model in Python that answers the following question: “using data relating to passengers (name, age, gender, socio-economic class, etc.), who has the greatest chance to survive?"

matplotlib numpy pandas python seaborn sklearn

Last synced: 18 Apr 2026

https://github.com/nksmanya/virtual_canvas

A hand gesture-based drawing app that lets you create art in the air using hand movements

mediapipe numpy opencv opencv-python python

Last synced: 18 Apr 2026

https://github.com/manalisbhavsar/mall-customers-clustering

K-Means clustering to mall customer data, segmenting customers based on their annual income and spending score. To identify patterns and group customers for targeted marketing.

data-analysis data-visualization matplotlib numpy pandas python scikit-learn

Last synced: 18 Apr 2026

https://github.com/alexkhype/04python-carpricepredictor

Modelo de machine learning para predecir el valor de mercado de autos usados, optimizando precisión y eficiencia.

catboost data-science lightgbm machine-learning numpy pandas python sklearn statistics xgboost

Last synced: 05 Apr 2026

https://github.com/jai0212/ai-virtual-painter

Real time finger and object painting using advanced computer vision technology with custom hand gestures and customizable colours

artificial-intelligence cmake computer-vision cpp mediapipe numpy opencv python

Last synced: 05 Apr 2026

https://github.com/lexxai/goit_python_ds_hw_04

Модуль 4. Класифікація та оцінка роботи моделі. Лінійна регресія: перенавчання та регуляризація

lasso-regression linear-regression numpy pandas python red regression ridge-regression scikit-learn

Last synced: 05 Apr 2026

https://github.com/nowon1/insurance-claim-prediction_version

This project aims to predict the insurance claim amounts based on various customer attributes using machine learning techniques. The project involves data preprocessing, exploratory data analysis, feature engineering, and model training and evaluation.

data-preprocessing data-science data-visualization exploratory-data-analysis feature-engineering insurance jupyter-notebook machine-learning numpy pandas predictive-modeling python random-forest regression-analysis scikit-learn

Last synced: 05 Apr 2026

https://github.com/paulo-santos-ds/visao_computacional

O sistema utiliza inteligência artificial para analisar documentos e características físicas dos visitantes, garantindo que apenas maiores de 18 anos possam acessar as áreas de jogos

keras numpy pandas plotly python restnet50 sklearn tensorflow

Last synced: 05 Apr 2026

https://github.com/al-ghaly/prosper-loans-analysis

A statistical Analysis Project, to analyze the data of a finance company’s loans Using Python packages (pandas – NumPy – seaborn – matplotlib)

data-analysis matplotlib numpy pandas python python-data-analysis seaborn statistical-analysis statistics

Last synced: 18 Apr 2026

https://github.com/faraazarsath/guvi-task-4

This repository contains Python scripts for assessing and categorizing student performance data from two CSV files. The tasks include categorizing students based on their CodeKata scores.

data-visualization matplotlib numpy pandas

Last synced: 18 Apr 2026

https://github.com/ahmaruff/kmeans-img-compress

Simple python image compression library using K-Means clustering

color-quantization image-compression image-processing kmeans kmeans-clustering numpy python

Last synced: 18 Apr 2026

https://github.com/znreza/paradoxes

Q-learning algorithm from scratch in python for optimal path finding

grid grid-search matrix numpy python qlearning-algorithm reinforcement-learning

Last synced: 18 Apr 2026

https://github.com/noopur-phadkar/botnavigator

BotNavigator utilizes the A* (A-Star) algorithm for pathfinding, leveraging Python along with libraries such as Matplotlib and Numpy for efficient array operations and visualization. This innovative solution optimizes robotic navigation within warehouse environments, enhancing operational efficiency and scalability.

a-star-path-finding matplotlib numpy pathfinding-algorithms python robotics

Last synced: 18 Apr 2026

https://github.com/md-emranhossen/deep-learning-hub

Welcome to Deep Learning Playground, a personal repository for exploring and experimenting with deep learning concepts, architectures, and techniques. This repo serves as a hands-on space for implementing and understanding deep learning models, from fundamental neural networks to advanced architectures.

cnn-model deep-learning deeplearning gan generative-adversarial-network neaural-network numpy pandas pthon tensorflow

Last synced: 18 Apr 2026

https://github.com/zombietdv/rfl-13_8

Implementaion for Genetic algorithm and Q-Learning.

genetic-algorithm grid-world machine-learning numpy pygame q-learning

Last synced: 18 Apr 2026

https://github.com/amdjadouxx/my_neural_network

implementation of a neural network library

deep-neural-networks ia machine-learning numpy python3

Last synced: 18 Apr 2026

https://github.com/mayankmittal29/criticalphenomena-exploring_phase_transitions_and_self-organized_criticality

A computational physics project exploring critical phenomena through percolation theory and the Bak-Tang-Wiesenfeld sandpile model.

criticality-c gif-animation jupyter-notebook matplotlib numpy percolation-theory python3 sandpile-model

Last synced: 18 Apr 2026

https://github.com/shortthirdman/distributedlagmodels

Distributed Lag Models in Dynamic Model Time Series

disributed-log-models numpy pandas python statsmodels time-series

Last synced: 19 Apr 2026

https://github.com/aniket2021448/movie-recommender-system

A Machine Learning Project implemented from scratch which involves web scraping, data engineering, exploratory data analysis, NLP processing and ML, achieving the functionality of a Content based movie recommender system

data-exploration data-exploration-and-preprocessing free-hosting-service machine-learning natural-language-processing nltk-python numpy pandas streamlit-webapp

Last synced: 19 Apr 2026

https://github.com/jcardonamde/reto_cobranzas_sector_financiero

This project analyzes data provided by the organization iData, focusing on the collection process of a financial company. The objective is to develop segmentation strategies that enhance portfolio recovery and improve debt management.

data-science kmeans-clustering machine-learning machine-learning-algorithms matplotlib numpy pandas python seaborn

Last synced: 19 Apr 2026

https://github.com/mwoss/opencv-raspberry-pi

Video capture on Raspberry Pi/PC using OpenCV

numpy opencv python rapsberrypi video-capture

Last synced: 19 Apr 2026

https://github.com/simonusher/multi-layer-perceptron

My solution to an assignment on multi-layer perceptron. Implemented in python and numpy. Contains the forward and backward pass code, as well as some utility functions.

multi-layer-perceptron neural-network numpy python

Last synced: 19 Apr 2026

https://github.com/crazy-dot/zomato-data-analysis

This project analyzes 50k Bengaluru restaurants from Zomato, focusing on 17 features like location and ratings. It cleans, explores, and visualizes data to improve services. Key visualizations include delivery, booking, location, and cost. The goal is to provide insights for better customer experiences.

data-cleaning-and-preprocessing data-manipulation-with-pandas inferential-statistics kaggle-dataset numpy pandas-python python zomato-data-analysis

Last synced: 19 Apr 2026

https://github.com/emerald-geomodelling/downfile

A safe & version independent pickle alternative that integrates setuptool entry points for extensions

feather json numpy pandas python serialization

Last synced: 19 Apr 2026

https://github.com/renukadhule/airbnb_listings_and_reviews_python_analysis

Explore Airbnb listings and reviews from over 250,000 properties across 10 major cities. Dive into data on listing details, host information, pricing, and customer reviews for better insights

matplotlib-pyplot numpy pandas python seaborn

Last synced: 19 Apr 2026

https://github.com/abdoufermat5/gps_statistiques

Calculs statistiques pour le cloud personnel

geolife gps numpy pandas python statistics trajectory-analysis

Last synced: 20 Apr 2026

https://github.com/gemmus/tictactoe-ai

Built with minimax algorithm, using pygame.

minimax-algorithm numpy oop pygame python tic-tac-toe

Last synced: 20 Apr 2026

https://github.com/vyjayanthipolapragada/car_mileage_prediction

Predicting the mileage of car using the linear regression model with Scikit-learn

kaggle-titanic linear-regression machine-learning numpy pandas predictive-modeling python scikit-learn

Last synced: 20 Apr 2026

https://github.com/montanaz0r/suicide-rate-analysis

Testing a significance of the correlation between a suicide rate and a number of psychiatrists and psychologists working in the mental health sector

analysis correlation data data-analysis data-science jupyter-notebook jupyter-notebooks matplotlib numpy pandas psychology python python-3 seaborn statistics suicide-rate

Last synced: 20 Apr 2026

https://github.com/saurabh9136/data-analysis_using_pandas-numpy

A beginner-friendly repository exploring data analysis using NumPy and Pandas. Covers fundamental operations, data manipulation, and real-world dataset analysis.

data-analysis-python numpy pandas python scipy

Last synced: 20 Apr 2026

https://github.com/levieyal/oop-course-exercises

The first exercises in the OOP in Java and python course that took place in the first semester of 2021

docs git github graph-algorithms interfaces numpy python solid-principles unit-testing

Last synced: 20 Apr 2026

https://github.com/prithvikings/zomato_data_anaysis

This analysis provides insights into customer preferences and restaurant performance on Zomato. The visualizations and findings can help Zomato make informed decisions to improve customer experience and tailor their offerings.

matplotlib numpy pandas python seaborn

Last synced: 20 Apr 2026

https://github.com/jbalooshie/school_district_analysis

Analysis of standardized testing results using NumPy and Pandas, executed in Jupyter Notebook. Summaries of the testing results are provided based on school, test type, and grade level.

data-analysis data-science dataframes jupyter-notebook numpy pandas python

Last synced: 20 Apr 2026

https://github.com/vtorres09/projetomonitoriaip

📋 Data analysis of a dataset created to select students to be teacher assistant (Introduction to Programming).

numpy object-oriented-programming pandas python

Last synced: 20 Apr 2026

https://github.com/mkhekare/bikesharing_ml

Provides insights into the bike sharing trends based on various factors such as time of day, season, and weather. The findings can be used to improve bike sharing services and understand user behavior.

matplotlib numpy numpy-library pandas seaborn

Last synced: 20 Apr 2026

https://github.com/anselmoo/useful-math-functions

useful-math-functions for Optimization, Benchmarking, Visualizing, and more ...

basic functions math mathematics numpy

Last synced: 17 Mar 2025

https://github.com/mogankumar/covid-19-classification

A deep learning-based project that classifies chest X-ray images into Normal, COVID-19, and Viral Pneumonia using a two-layer CNN architecture. Achieves high accuracy and demonstrates potential for aiding in early diagnosis and treatment of COVID-19.

keras-tensorflow matplotlib numpy python3

Last synced: 11 May 2026

https://github.com/themysterysolver/python_basics

"This repo includes simple Python programs for beginners, covering various concepts, object-oriented programming, and different module usages, to help with understanding and learning Python."

basic-python-syntax collections dsa matplotlib numpy oops-in-python pandas python re scipy

Last synced: 05 Apr 2026

https://github.com/amardeepsingh-06/smart_attendance_system

The Smart Attendance System is designed to automate the attendance process using facial recognition. Traditional methods of taking attendance can be time-consuming and prone to errors. This project aims to streamline the process by leveraging computer vision technology, allowing for efficient, contactless, and accurate attendance tracking.

asgiref django mysql numpy opencv pillow-library python sqlparse

Last synced: 05 Apr 2026

https://github.com/rakibhhridoy/appliedmachinelearninghousing-regression

Let's take the Housing dataset which contains information about different houses in Boston. This data was originally a part of UCI Machine Learning Repository and has been removed now. We can also access this data from the scikit-learn library. The objective is to predict the value of prices of the house using the given features.

deep-learning housing-market housing-prices machine-learning numpy pandas python real-estate regression scikit-learn

Last synced: 05 Apr 2026

https://github.com/prajakta1321/authencheck

Amdocs Gen AI Graduate Hackathon 2024-25- A comprehensive fact-checking and misinformation detection system that leverages cutting-edge AI models and multiple news sources to verify information circulating on social media

api bert-fine-tuning flask-application matplotlib ngrok-server nlp nlp-machine-learning numpy pandas python3 scikit-learn seaborn wandb

Last synced: 05 Apr 2026

https://github.com/offthetab/statistics_homework

Практические работы по статистике МИРЭА 3 курс.

numpy pandas python statsmodels

Last synced: 05 Apr 2026

https://github.com/abolfazl-younesi/levy-walk

This repository contains a Python implementation of the Lévy Walk simulation, a random walk model with a power-law step length distribution. The code simulates multiple walkers with boundary conditions and visualizes their trajectories for different values of the power-law exponent.

levy-walks matplotlib numpy python random-walk

Last synced: 02 May 2026

https://github.com/tigureis/data-preparation-from-kickstarter-campaigns

Kickstarter Data Prep: A hands-on guide to basic data cleaning and transformation.

data-cleaning data-construction data-integration data-science data-selection numpy pandas

Last synced: 19 Apr 2026

https://github.com/ondiekelijah/data-analysis-udacity

Udacity Data Analysis Nano Degree course resources

jupyter-notebook matplotlib numpy pandas python

Last synced: 17 Mar 2025

https://github.com/clchinkc/zombie

Personal project, Python, NumPy, Matplotlib, Pygame, Scikit-learn, TensorFlow, Docker

algorithms data-analysis docker machine-learning matplotlib numpy pygame python sklearn tensorflow zombie-simulation

Last synced: 05 Apr 2026

https://github.com/iv4n-ga6l/simple-image-segmentation

Simple implementation of image segmentation using the Watershed algorithm

imagesegmentation matplotlib numpy opencv python watershed-algorithm

Last synced: 13 May 2026

https://github.com/niteshchawla/delhivery-featureengineering

The Data team builds intelligence and capabilities using this data that helps them to widen the gap between the quality, efficiency, and profitability of their business versus their competitors.

feature-engineering hypothesis-testing matplotlib-pyplot numpy pandas-python scipy-stats seaborn

Last synced: 14 May 2026

https://github.com/sejalmankar1012/customer_churn

The goal of this project was to develop a machine learning model that predicts customer churn based on historical customer data.

dataset deep-learning deployment machine-learning model-building modeling numpy optimization pandas seaborn streamlit tensorflow

Last synced: 05 Apr 2026

https://github.com/Marvin-VW/3dengine_python

3D engine written in Python using OpenCV and NumPy

3dengine numpy opencv python

Last synced: 29 Sep 2025

https://github.com/hadson0/dip-candy-classification

Candy classification project using digital image processing. Includes dataset creation, model development, and evaluation tools with TensorFlow, OpenCV, and Albumentations. Complete setup and instructions provided for building and training the model.

albumentations bounding-boxes cnn computer-vision cvat image-classification image-processing keras machine-learning matplotlib numpy opencv python seaborn sklearn tensorflow

Last synced: 29 Sep 2025

https://github.com/yalai92/cava_imp_exp_analysis

Analysis, visualization, preprocessing and clustering of global sparkling wine trade (2017–2024) using Python in Colab and ML to reveal trends and country profiles.

clustering colab-notebook data-science data-visualization machine-learning matplotlib numpy pandas preprocessing python sklearn tableau-public

Last synced: 25 Jan 2026