An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/laurencewarne/eratosthenes

Random implementations of the sieve of Eratosthenes

algorithm eratosthenes numpy python3

Last synced: 16 Apr 2026

https://github.com/marcow2812/zuse-projekt

Python-basierte Software zur Projektion von 3D-Objekten auf einen Würfel

augmented-reality numpy opencv-contrib python

Last synced: 16 Apr 2026

https://github.com/littie28/hazy-frames

Hierarchical coordinate frames - crystal clear transforms

3d coordinate-systems geometry numpy python reference-frames scipy transformations

Last synced: 18 Apr 2026

https://github.com/bishopce16/school_district_analysis

The school board requested an analysis on the various performance metrics for the school district.

data-analysis jupyter-notebook numpy pandas python visual-studio-code

Last synced: 16 Apr 2026

https://github.com/dr-rompecabezas/page-view-time-series-visualizer

A data science project built as part of the freeCodeCamp curriculum.

matplotlib numpy pandas python seaborn

Last synced: 16 Apr 2026

https://github.com/jay6430/python-for-data-science

This Repository includes all the Python concepts that are necessary for Data science.

csv data-visualization datascience-machinelearning html-css ipynb-jupyter-notebook matplotlib numpy pandas python3 xml

Last synced: 04 Mar 2026

https://github.com/kunalkumar2001/data-analyst-python-and-its-library

Python (Basics to Advance) and Its Library for Data Analyst

dataanalyst matplotlib numpy pandas python seaborn

Last synced: 17 Apr 2026

https://github.com/priyanshul28/ml_eda_regression_energyconsumptionforecasting

An EDA and Machine Learning Time-Series Regression Forecasting exercise on the PMJE Energy Consumption dataset demonstrating time-series analysis and the use of Time-Series Split, XGBoost, etc. The model is optimized using hyperparameter tuning through GridSearchCV. A Rob Mulla guided exercise.

forcasting machine-learning numpy pandas scikit-learn time-series-analysis

Last synced: 17 Apr 2026

https://github.com/miozilla/treebasedmodel

treebasedmodel :evergreen_tree::airplane::dancer: : Invistico Airline | Tree-based Modeling, Machine Learning (ML), Predictive, Decision Making, GridSearchCV, sklearn |

accuracy decisiontreeclassifier exemplar experience f1-score flight hyperparameter logistic-regression matplotlib model numpy pandas plot-tree precision pyplot recall seaborn selection train-test-split tuning

Last synced: 03 Jun 2026

https://github.com/syedzaheerabbas/risk-analytics-with-python

This project focuses on developing a basic understanding of risk analytics in banking and financial services and understand how data is used to minimize the risk of losing money while lending to customers.

eda hypothesis-testing numpy pandas python risk-analysis seaborn

Last synced: 17 Apr 2026

https://github.com/shashwat9kumar/us-accidents-data-analysis

Analysis of the US accidents using the US-Accidents dataset (4.2 million entries) from Kaggle

accidents accidents-analysis data-analysis data-analytics data-visualisation data-visualization matplotlib numpy pandas python

Last synced: 17 Apr 2026

https://github.com/ricobuilds/ml-roadmap

Opinionated roadmap to machine learning in 2023

conda huggingface machine-learning matplotlib numpy pandas python pytorch

Last synced: 06 Mar 2026

https://github.com/gauravtrivedi967/lane_detection

Lane detection and tracking are the advanced key features of the advanced driver assistance system. Lane detection is the process of detecting white lines on the roads. In this we I have used prebuild models for the development of the code

ipython-notebook numpy python thresholding

Last synced: 06 Mar 2026

https://github.com/radinshahdaei/ce40215-nc

Theoretical and practical assignments for "Numerical Computation".

jupyter-notebook numpy python sympy

Last synced: 17 Apr 2026

https://github.com/kheriberto/knn_project

This is a simple project that uses dummie data to practice and demonstrate my knowledge of the KNN algorithm.

data-analysis knn-classifier numpy python scikit-learn seaborn

Last synced: 02 Apr 2026

https://github.com/psavarmattas/speechtotext

we shall build a very simple speech recognition system that takes our voice as input and produces the corresponding text by hearing the input.

facebook-api ipython librosa machine-learning numpy python pytorch soundfile transformers

Last synced: 02 Apr 2026

https://github.com/nathadriele/ifood-data-governance-pipeline

Este projeto demonstra uma solução completa de Data Governance com foco em qualidade, rastreabilidade, segurança e conformidade com LGPD. Utiliza tecnologias modernas como Streamlit, Airflow, dbt e Pydantic para implementar um ecossistema funcional e interativo com dashboard de governança de dados.

airflow dashboard data-analysis data-catalog data-engineering data-governance data-quality data-visualization dbt ifood lgpd matplotlib numpy observability-data pandas pipeline pyspark redis seaborn streamlit

Last synced: 02 Apr 2026

https://github.com/amirmohammadgholampour/mall-customer-segmentation

Project for segmenting customers in a shopping mall using the Clustering algorithm.

numpy pandas python scikit-learn

Last synced: 02 Apr 2026

https://github.com/akshitvats026/heart_disease_prediction

An ML-based Heart Disease Prediction System that predicts the likelihood of heart disease based on user health parameters. Built using Python, Pandas, and Scikit-learn, the system performs data preprocessing, trains a predictive model, and provides real-time predictions with high accuracy.

accuracy-score logistic-regression machine-learning matplotlib-pyplot numpy pandas python scikit-learn

Last synced: 02 Apr 2026

https://github.com/loong64/pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

autograd deep-learning gpu loong64 loongarch64 machine-learning neural-network numpy python tensor

Last synced: 02 Apr 2026

https://github.com/mac2bua/autoalgo

Autonomous algorithm optimization research - 19x speedup achieved through self-driven iterations

algorithm-optimization autoresearch numpy performance python

Last synced: 02 Apr 2026

https://github.com/prajapatsur/data-analytics-1

Wanted to revise all the basic concepts of the data cleaning, transformation and analysis, so worked on this project. Took a little bit help of AI tool to implement some fresh concepts.

matplotlib numpy pandas sklearn

Last synced: 17 Apr 2026

https://github.com/mnagaharshithrao/diamondpriceprediction

Diamond Price Prediction is a machine learning project that predicts the price of diamonds based on features like carat, cut, color, clarity, and more. Using models such as XGBoost, Random Forest, and Gradient Boosting, the project aims to accurately forecast diamond prices, helping buyers and sellers in the diamond industry make informed decisions

machine-learning numpy pandas sckiit-learn xgboost

Last synced: 17 Apr 2026

https://github.com/kernel-loophole/numerical-computing

Numerical Computing use python(Numpy,metplotlib).

numpy python quantum-computing

Last synced: 17 Apr 2026

https://github.com/renan-siqueira/reinforcement-learning-cart-pole

This repository provides implementations of a Q-learning agent to balance a pole on a cart.

artificial-intelligence gym-environment machine-learning numpy openai-gym pygame python qlearning-algorithm reinforcement-learning

Last synced: 17 Apr 2026

https://github.com/kiarashyavari/python_interview

Practice intermediate to advance topics of Python here

decorators generator multiprocessing multithreading numpy pandas python python3

Last synced: 17 Apr 2026

https://github.com/orliluq/inmersion-datos-python

Desarrollar modelos de machine learning para predecir la probabilidad de incumplimiento crediticio de los clientes, utilizando diferentes algoritmos de clasificación (Regresión Logística, Árboles de Decisión, Random Forest, Naive Bayes).

colab-notebook numpy pandas python scikit-learn

Last synced: 02 Apr 2026

https://github.com/rakibhhridoy/predictingsequence-timeseriesanalysis

Time series data easy handling purposes in TensorFlow and python functional programming. Time Series data is handled and pre-processed by different steps,like making window of the data,splitting data for model purpose is also different than other processing methods. All has combined for easy access.

forecasting functional-programming keras lstm lstm-neural-networks moving-average numpy python tensorflow time-series windowed

Last synced: 02 Apr 2026

https://github.com/nikhilgugwad/sentiment-analysis

Sentiment analysis for the Kannada language to classify Kannada sentences into different emotions.

numpy pandas scikit-learn

Last synced: 17 Apr 2026

https://github.com/ffatahillah7/eda-dsf-dibimbing-titanic-accident

Data Science Fair 3.0 Dibimbing Portofolio - Analyctics and Learning from titanic dataset

data numpy pandas python science seaborn

Last synced: 17 Apr 2026

https://github.com/christiemj/clock-drift-simulation

A Monte Carlo simulation in Python to model vacuum-induced frequency drifts in atomic clocks, with code and visuals (experimental setup, simulated vs. real-world comparison, and drift graph), as described in "Probing Vacuum-Induced Clock Drifts via Quantum Metrology: A Testable Hypothesis" (DOI: 10.5281/zenodo.15163879). Licensed under GPL 3.0.

allan-variance atomic-clocks cesium-ion clock-drift matplotlib monte-carlo-simulation numpy open-science pandas physics precision-metrology python quantum-metrology research-software scipy simulation vacuum-fluctuations vacuum-noise

Last synced: 17 Apr 2026

https://github.com/nikhilsree5/aerofitcasestudy

Customer Profiling and Market Segmentation for AeroFit Treadmills: A Data-Driven Approach

customerprofile eda numpy pandas python visualization

Last synced: 17 Apr 2026

https://github.com/theakashshukla/diabetes-prediction

Diabetes Prediction using futuristic tool

ai jupiter-notebook ml numpy pandas sklearn

Last synced: 17 Apr 2026

https://github.com/sofiakhutsieva/analytical_report_for_producers_of_educational_programs

Подготовка аналитического отчета (Проект в skillbox )

analytics matplotlib numpy pandas seaborn

Last synced: 17 Apr 2026

https://github.com/hansalemaos/npfastsortcpp

Sorts int/float 4x faster than numpy by using parallel_sort from C++ (Windows only)

cpp fast numpy python sort

Last synced: 17 Apr 2026

https://github.com/tashi-2004/deep-learning-grid-world-q-learning

Deep Learning Grid World Q-Learning . Implement Q-learning in a 5x5 grid where an agent navigates obstacles and rewards. Train the agent with varying learning rates, visualize its progress, and see Q-values as heatmaps. Run the script to start training and view results. Contributions are welcome!

agent-based-modeling artificial-intelligence deep-learning deep-q-learning exploitation exploration machine-learning machine-learning-algorithms matplotlib-pyplot numpy python q-learning q-learning-algorithm reinforcement-learning reinforcement-learning-algorithms state-value-function training

Last synced: 17 Apr 2026

https://github.com/prahaladhchandrahasan/object-localization

this is a blue colour detector in keras and tensorflow

computer-vision keras numpy pkl tensorflow

Last synced: 17 Apr 2026

https://github.com/quinnphil/math-for-programmers

Work-alone code for Math for Programmers book by Paul Orland

geometry mathematics matplotlib numpy python

Last synced: 17 Apr 2026

https://github.com/kaushik-puttaswamy/exploratory-data-analysis-using-python

This project applies Exploratory Data Analysis (EDA) using visualizations like box plots, scatter plots, and correlation matrices. It helps identify patterns, select ideal functions via least squares error, and map test data while evaluating deviations.

bokeh expolatory-data-analysis matplotlib numpy pandas python seaborn sqlalchemy

Last synced: 17 Apr 2026

https://github.com/smartpottech/smartpot-datagenerator

SmartPot-DataGenerator simulates environmental and plant data (temperature, humidity, luminosity, pH, TDS, soil moisture) for hydroponic garden management. This data is used for testing, analysis, and visualization in the SmartPot web portal, helping optimize plant care without actual hardware sensors.

data-model matplotlib numpy python simulated

Last synced: 17 Apr 2026

https://github.com/rosieoh/emergency_dataanalysis

오픈데이터분석-응급의료체계 방안 정책 제안 데이터 분석

ipython matplotlib numpy pandas python scikit-learn scipy

Last synced: 04 Apr 2026

https://github.com/neetigyab/pdf_form_parser

Older Test version of designated PDFReader project

excel forms matplotlib numpy ocr opencv pandas pdf pdf-converter pdfplumber pytesseract python

Last synced: 17 Apr 2026

https://github.com/noluyorabi/bravais-pearson-interactive

Bravais-Pearson Interactive – A web app for visualizing Pearson correlation. Generate datasets, analyze correlations, and explore regression lines, r², and p-values interactively.

data-visualization interactive-analysis matplotlib numpy pearson-correlation scipy seaborn statistics streamlit

Last synced: 17 Apr 2026

https://github.com/gregoritsch3/project_python_jobs_data_analysis

A multi-library Data Analysis Python project that draws useful insights from existing job data. It provides useful exploratory insight about job postings in the Data sector, such as the distribution of remote/non-remote jobs, jobs with/without healthcare, top paying and most demanded data roles, and more.

matplotlib numpy pandas python scipy seaborn

Last synced: 04 Apr 2026

https://github.com/prcharan592/social-media-sentiment-analysis

Social media sentiment analysis using tweets involves analyzing tweet data to determine public sentiment (positive, negative, or neutral) using natural language processing (NLP) and machine learning techniques.

data-visualization machine-learning matplotlib nlp nltk numpy pandas python3 sentiment-analysis spacy tweets

Last synced: 04 Apr 2026

https://github.com/datamallo/python101

The ressources you need to learn or get better in Python. Covering the basics of Python3, NumPy and Pandas for

numpy pandas python3

Last synced: 18 Apr 2026

https://github.com/lexxai/goit_python_ds_hw_01

GoIT, Python Data Science, Homework number 1.

data-science google-colab jupyter-nootebook numpy numpy-arrays python

Last synced: 18 Apr 2026

https://github.com/compcode1/nfl-analysis

The objective of this analysis was to explore potential correlations between player attributes (such as average age and weight across different positions) and key team performance metrics from the 2023 NFL season.

datacleaning datanormalization numpy pandas pearson-correlation-coefficient pvalues

Last synced: 18 Apr 2026

https://github.com/yassin522/music-genre-classification

This repository is dedicated to the Music Genre Classification project, an endeavor that leverages machine learning techniques to accurately classify music into various genres

catboost matplotlib numpy pandas python sklearn

Last synced: 04 Apr 2026

https://github.com/jass024/bitcoin-historical-analysis

Performed comprehensive time-series analysis on 14+ years of Bitcoin historical data using Python (Pandas, NumPy, Plotly). Calculated key technical indicators (SMA, EMA, RSI, volatility) to identify market trends, potential anomalies, and areas of regulatory concern. Visualized results using interactive Plotly charts.

numpy pandas python-3

Last synced: 18 Apr 2026

https://github.com/awanraskall/retail-demand-analysis

Data analysis of retail meal orders, fulfillment centers, and product demand using Python

data-analysis data-visualization jupyter-notebook numpy pandas python

Last synced: 18 Apr 2026

https://github.com/kasraskari/python-for-data-science

Foundational concepts and libraries for data science with Python.

data-science jupyter-notebook matplotlib numpy pandas python scipy

Last synced: 04 Apr 2026

https://github.com/pedroteixeiraw/variational_quantum_circuit_binary_classification

This project focuses on developing a Variational Quantum Circuit capable of performing Binary Classification between two classes: red wine and white wine, based on their characteristics using machine learning.

binary-classification cost-function json machine-learning matplotlib numpy pandas qiskit qiskit-machine-learning quantum-machine-learning scikit-learn training-data variational-circuit

Last synced: 04 Apr 2026

https://github.com/abdul-rafay19/california-housing-price-prediction

This project predicts California housing prices using machine learning regression models, including Random Forests and Decision Trees. It covers data preprocessing, exploratory analysis, model training, and hyperparameter tuning to optimize performance.

decision-trees gridsearchcv linear-regression matplotlib numpy pandas python random-forest randomsearch-cv scikit-learn scipy seaborn

Last synced: 04 Apr 2026

https://github.com/yasinefeee/parkspotter_pretest_enviroment

The ParkSpotter project is designed to detect the occupancy status of parking spots in a simulation environment. Using a toy model, a camera system, and a machine learning model, this system identifies whether a parking space is EMPTY or NOT EMPTY in real-time.

ai-systems classifier computer-vision cv2 numpy opencv parking-spot-detection parking-spots python sckit-image sckit-learn simulation-environment svm svm-classifier

Last synced: 18 Apr 2026

https://github.com/sultanazhari/decisive-model-with-high-profit-potential-and-low-risk

OilyGiant mining company finding the best place for 200 new well points, As an Data Scientist we're creating a model who can choose the best 200 point by profit and risk.

accuracy-score chi2-contingency confusion-matrix logistic-regression matplotlib-pyplot numpy pandas python3 random-forest roc-auc-score seaborn train-test-using-sklearn

Last synced: 18 Apr 2026

https://github.com/sundanc/weatherprediction

This project implements a weather prediction system that predicts the temperature based on real-time weather data, including features like humidity, wind speed, and day-related features (day of the week, month

machine-learning machinelearning numpy pandas programming python scikit-learn scikitlearn-machine-learning weather-prediction

Last synced: 18 Apr 2026

https://github.com/hariprasath-v/machinehack-analytics-olympiad-2022

Create a machine learning model to help an insurance company understand which claims are worth rejecting and the claims which should be accepted for reimbursement.

catboost-classifier exploratory-data-analysis logloss machinehack numpy optuna pandas python scikit-learn shap

Last synced: 18 Apr 2026

https://github.com/ricardorobledo/spamemailclassification

Spam email classification using machine learning (Random Forest, SVC, Logistic Regression, etc.) with data balancing techniques (SMOTE, BorderlineSMOTE, ADASYN). Final calibrated Random Forest model achieves ROC-AUC 0.982 and PR-AUC 0.979 on the Spam Email Classification dataset.

imbalanced-data imbalanced-learning numpy pandas python3 sklearn

Last synced: 05 Apr 2026

https://github.com/stffns/snapvec

Fast compressed ANN search via randomized Hadamard transform + Lloyd-Max quantization. Pure NumPy.

ann embeddings hadamard numpy quantization rag vector-search

Last synced: 07 Apr 2026

https://github.com/nksmanya/virtual_canvas

A hand gesture-based drawing app that lets you create art in the air using hand movements

mediapipe numpy opencv opencv-python python

Last synced: 18 Apr 2026

https://github.com/manalisbhavsar/mall-customers-clustering

K-Means clustering to mall customer data, segmenting customers based on their annual income and spending score. To identify patterns and group customers for targeted marketing.

data-analysis data-visualization matplotlib numpy pandas python scikit-learn

Last synced: 18 Apr 2026

https://github.com/alexkhype/04python-carpricepredictor

Modelo de machine learning para predecir el valor de mercado de autos usados, optimizando precisión y eficiencia.

catboost data-science lightgbm machine-learning numpy pandas python sklearn statistics xgboost

Last synced: 05 Apr 2026

https://github.com/jigyasag18/iit-guhawati-final-capstone-project

Smart Dynamic Parking Price Optimization System that adjusts parking fees in real-time based on demand, traffic, and competition. It employs adaptive pricing models and rerouting logic to enhance parking utilization and reduce congestion. The system is visualized via an interactive Streamlit dashboard, enabling users to simulate dynamic pricing.

bokeh bokeh-server bokehplots capstone-project data dataset deployment machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot mlproject normalisation numpy pandas pathway python streamlit

Last synced: 05 Apr 2026

https://github.com/ben854719/nvidia---gen-ai-

The goal of the project is to analyze and predict with the use of AI for the revenue of Nvidia

matplotlib numpy pandas python pytorch

Last synced: 05 Apr 2026

https://github.com/shdrn2402/incomesresearch

The project aims to study the influence of various characteristics on a person’s income. In this project, I practiced data preprocessing and visualization, and also touched on the basics of machine learning.

jupyter-notebook matplotlib numpy pandas python seaborn sklearn visualization

Last synced: 05 Apr 2026

https://github.com/lexxai/goit_python_ds_hw_04

Модуль 4. Класифікація та оцінка роботи моделі. Лінійна регресія: перенавчання та регуляризація

lasso-regression linear-regression numpy pandas python red regression ridge-regression scikit-learn

Last synced: 05 Apr 2026

https://github.com/nowon1/insurance-claim-prediction_version

This project aims to predict the insurance claim amounts based on various customer attributes using machine learning techniques. The project involves data preprocessing, exploratory data analysis, feature engineering, and model training and evaluation.

data-preprocessing data-science data-visualization exploratory-data-analysis feature-engineering insurance jupyter-notebook machine-learning numpy pandas predictive-modeling python random-forest regression-analysis scikit-learn

Last synced: 05 Apr 2026

https://github.com/lorenzorottigni/ml-movies

Machine Learning python bootcamp: Recommender Systems on movies dataset

ipynb machine-learning numpy pandas python recommender-system scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/paulo-santos-ds/visao_computacional

O sistema utiliza inteligência artificial para analisar documentos e características físicas dos visitantes, garantindo que apenas maiores de 18 anos possam acessar as áreas de jogos

keras numpy pandas plotly python restnet50 sklearn tensorflow

Last synced: 05 Apr 2026

https://github.com/faraazarsath/guvi-task-4

This repository contains Python scripts for assessing and categorizing student performance data from two CSV files. The tasks include categorizing students based on their CodeKata scores.

data-visualization matplotlib numpy pandas

Last synced: 18 Apr 2026

https://github.com/matiassenia/galaxypy

La aplicación carga una imagen en escala de grises y utiliza los puntos de menor luminosidad (más oscuros) para dibujar líneas de colores que crean una estructura parecida a una galaxia.

numpy python

Last synced: 18 Apr 2026