An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language that adds support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions that operate on these arrays.
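
As a rough illustration of the description above, here is a minimal sketch of array creation, element-wise math, and a matrix product. The variable names and values are made up for the example and do not come from any listed project.

```python
import numpy as np

# A 2-D array (matrix) built from nested Python lists; values are illustrative.
matrix = np.array([[1.0, 2.0], [3.0, 4.0]])

# Element-wise math applies to the whole array without an explicit loop.
squared = matrix ** 2

# Reductions take an axis: axis=0 averages down each column.
column_means = matrix.mean(axis=0)

# The @ operator performs matrix multiplication (here with the transpose).
gram = matrix @ matrix.T

print(squared)
print(column_means)
print(gram)
```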

https://github.com/syedzaheerabbas/risk-analytics-with-python

This project focuses on developing a basic understanding of risk analytics in banking and financial services, and on understanding how data is used to minimize the risk of losing money when lending to customers.

eda hypothesis-testing numpy pandas python risk-analysis seaborn

Last synced: 17 Apr 2026

https://github.com/ricobuilds/ml-roadmap

Opinionated roadmap to machine learning in 2023

conda huggingface machine-learning matplotlib numpy pandas python pytorch

Last synced: 06 Mar 2026

https://github.com/gauravtrivedi967/lane_detection

Lane detection and tracking are key features of advanced driver assistance systems. Lane detection is the process of detecting white lines on the road. In this project I have used prebuilt models to develop the code.

ipython-notebook numpy python thresholding

Last synced: 06 Mar 2026

https://github.com/kheriberto/knn_project

This is a simple project that uses dummy data to practice and demonstrate my knowledge of the KNN algorithm.

data-analysis knn-classifier numpy python scikit-learn seaborn

Last synced: 02 Apr 2026

https://github.com/nathadriele/ifood-data-governance-pipeline

This project demonstrates a complete Data Governance solution focused on quality, traceability, security, and LGPD compliance. It uses modern technologies such as Streamlit, Airflow, dbt, and Pydantic to implement a functional, interactive ecosystem with a data governance dashboard.

airflow dashboard data-analysis data-catalog data-engineering data-governance data-quality data-visualization dbt ifood lgpd matplotlib numpy observability-data pandas pipeline pyspark redis seaborn streamlit

Last synced: 02 Apr 2026

https://github.com/iamwatchdogs/cardiovascular-risk-prediction

This mini-project uses machine learning algorithms to predict possible risks of heart disease by analyzing given data.

jupyter-notebook machine-learning-algorithms matplotlib numpy pandas python scikit-learn seaborn

Last synced: 02 Apr 2026

https://github.com/broodhoney/blue-book-for-bulldozers

This repository holds a project that solves a regression problem: predicting the future sales of bulldozers. It is based on a Kaggle competition.

matplotlib numpy pandas python scikit-learn

Last synced: 02 Apr 2026

https://github.com/satyas567/weatherdataanalysis

Comprehensive Weather Data Analysis with Python: Explore trends, visualize patterns, detect outliers, and predict temperature using humidity and wind speed

jupyter-notebook linear-regression matplotlib numpy pandas python scikit-learn seaborn

Last synced: 02 Apr 2026

https://github.com/offthetab/ml_homework

Practical assignments in data analysis, MIREA, 3rd year.

ml numpy pandas python shap sklearn xgboost

Last synced: 02 Apr 2026

https://github.com/pranavv34/customer-segmentation

Predictive modeling and customer segmentation project using neural networks to forecast sales and categorize customers for targeted marketing in the online retail sector.

keras matplotlib numpy pandas python seaborn sklearn tensorflow

Last synced: 02 Apr 2026

https://github.com/niteshchawla/clustering-ml

Analyzing the vast data of learners can uncover patterns in their professional backgrounds and preferences, allowing Scaler to make tailored content recommendations and provide specialized mentorship.

cluster-analysis clustering hierarchical-clustering k-means-clustering machine-learning numpy pca-analysis visualisation

Last synced: 17 Apr 2026

https://github.com/ledsouza/curso_de_estatistica_parte_4

Statistics project for studying correlation and regression

data-science data-visualization numpy pandas seaborn statsmodels vitrinedev

Last synced: 17 Apr 2026

https://github.com/renan-siqueira/reinforcement-learning-cart-pole

This repository provides implementations of a Q-learning agent to balance a pole on a cart.

artificial-intelligence gym-environment machine-learning numpy openai-gym pygame python qlearning-algorithm reinforcement-learning

Last synced: 17 Apr 2026

https://github.com/prashver/end-to-end-model-deployment-on-aws

Student Performance Analysis with Machine Learning analyzes factors impacting student outcomes using a robust machine learning pipeline. Achieving an impressive R2 score, it predicts student performance effectively. With extensive data preprocessing and deployment on AWS Elastic Beanstalk, it ensures scalability and high availability.

amazon-web-services aws-elastic-beanstalk end-to-end-deployment flask machine-learning-algorithms matplotlib numpy pandas scikit-learn seaborn

Last synced: 02 Apr 2026

https://github.com/cosmicsid004/air_draw

Digital drawing....

handtracking mediapipe numpy opencv

Last synced: 30 Apr 2026

https://github.com/tharushi11/titanic-data-analysis

This repository contains a simple prediction model and survival predictions based on the Titanic dataset from Kaggle, along with detailed visualizations using Power BI.

dax machine-learning numpy pandas powerbi sklearn titanic-survival-prediction visualization

Last synced: 17 Apr 2026

https://github.com/lmizner/codecademy_nba_trends

Investigate whether there is an association between two variables

chi2-contingency jupyter-notebook matplotlib numpy pandas pearson-correlation pyplot python seaborn

Last synced: 17 Apr 2026

https://github.com/sofiakhutsieva/analytical_report_for_producers_of_educational_programs

Preparation of an analytical report (Skillbox project)

analytics matplotlib numpy pandas seaborn

Last synced: 17 Apr 2026

https://github.com/hansalemaos/npfastsortcpp

Sorts int/float 4x faster than numpy by using parallel_sort from C++ (Windows only)

cpp fast numpy python sort

Last synced: 17 Apr 2026

https://github.com/quinnphil/math-for-programmers

Work-alone code for Math for Programmers book by Paul Orland

geometry mathematics matplotlib numpy python

Last synced: 17 Apr 2026

https://github.com/kaushik-puttaswamy/exploratory-data-analysis-using-python

This project applies Exploratory Data Analysis (EDA) using visualizations like box plots, scatter plots, and correlation matrices. It helps identify patterns, select ideal functions via least squares error, and map test data while evaluating deviations.

bokeh expolatory-data-analysis matplotlib numpy pandas python seaborn sqlalchemy

Last synced: 17 Apr 2026

https://github.com/smartpottech/smartpot-datagenerator

SmartPot-DataGenerator simulates environmental and plant data (temperature, humidity, luminosity, pH, TDS, soil moisture) for hydroponic garden management. This data is used for testing, analysis, and visualization in the SmartPot web portal, helping optimize plant care without actual hardware sensors.

data-model matplotlib numpy python simulated

Last synced: 17 Apr 2026

https://github.com/mugambi645/classify-song-genre-from-audio-data

Classify song genres using supervised learning models and PCA for dimensionality reduction

decision-tree-classifier logistic-regression numpy pandas pca-analysis

Last synced: 17 Apr 2026

https://github.com/jjwizardmp/a-start-search-python

Python project implementing the A* (A-star) search algorithm, together with an animated simulation of the agent's path.

cv2 ipython iterto jupyter-lab jupyter-notebook mathplotlib numpy pip python python-env python3

Last synced: 04 Apr 2026

https://github.com/prcharan592/social-media-sentiment-analysis

Social media sentiment analysis using tweets involves analyzing tweet data to determine public sentiment (positive, negative, or neutral) using natural language processing (NLP) and machine learning techniques.

data-visualization machine-learning matplotlib nlp nltk numpy pandas python3 sentiment-analysis spacy tweets

Last synced: 04 Apr 2026

https://github.com/jinhx128/data_analysis

A data analysis project built with Matplotlib, NumPy, and Pandas.

matpolotlib numpy pandas python

Last synced: 18 Apr 2026

https://github.com/yuvrajsaraogi/sales-prediction-using-python

Sales prediction involves estimating future product sales based on factors like advertising spend, target audience, and platform. Businesses rely on data scientists to forecast sales and optimize advertising costs. Machine learning in Python can be used for this task.

data data-analysis data-science data-visualization machine-learning matplotlib natural-language-processing numpy pandas prediction python sales-prediction-using-python sql

Last synced: 19 Apr 2026

https://github.com/lexxai/goit_python_ds_hw_01

GoIT, Python Data Science, Homework number 1.

data-science google-colab jupyter-nootebook numpy numpy-arrays python

Last synced: 18 Apr 2026

https://github.com/yassin522/music-genre-classification

This repository is dedicated to the Music Genre Classification project, an endeavor that leverages machine learning techniques to accurately classify music into various genres

catboost matplotlib numpy pandas python sklearn

Last synced: 04 Apr 2026

https://github.com/thoughtscript/python_api_2023

Some fun with Flask, Python, and Machine Learning

angular docker docker-compose flask machine-learning mysql numpy

Last synced: 04 Apr 2026

https://github.com/arnoldchrisoduor1/potato_disease

Trained a CNN to detect the presence and the current stage of Blight disease in potatoes, WebApp in FastAPI and React

fastapi keras numpy react serving-tensors tensorflow

Last synced: 04 Apr 2026

https://github.com/kasraskari/python-for-data-science

Foundational concepts and libraries for data science with Python.

data-science jupyter-notebook matplotlib numpy pandas python scipy

Last synced: 04 Apr 2026

https://github.com/pedroteixeiraw/variational_quantum_circuit_binary_classification

This project focuses on developing a Variational Quantum Circuit capable of performing Binary Classification between two classes: red wine and white wine, based on their characteristics using machine learning.

binary-classification cost-function json machine-learning matplotlib numpy pandas qiskit qiskit-machine-learning quantum-machine-learning scikit-learn training-data variational-circuit

Last synced: 04 Apr 2026

https://github.com/alainlebret/python-et-ia-1

Personal resources for the "Python & IA" (Python and AI) course, second-year GPSE at ENSICAEN

artificial-intelligence image-processing machine-learning matplotlib numpy python scikit-image scikit-learn

Last synced: 04 Apr 2026

https://github.com/yashsonaar/machine-learning-tasks

This repository has machine learning tasks which include classification, recommendation system, fraud detection system

classification jupyter-notebook machine-learning numpy pandas prediction python scikit-learn testing

Last synced: 04 Apr 2026

https://github.com/ccolpasm/pageviewanalyzer

PageViewAnalyzer analyzes and visualizes time series data of forum visits (2016-2019). It uses Pandas, Matplotlib, and Seaborn to create line plots, bar charts, and box plots, highlighting trends, monthly averages, and seasonality in page views. Perfect for data exploration with Python.

data-science matplotlib numpy pandas python

Last synced: 18 Apr 2026

https://github.com/sultanazhari/decisive-model-with-high-profit-potential-and-low-risk

The OilyGiant mining company is looking for the best locations for 200 new wells. As data scientists, we build a model that selects the best 200 points by profit and risk.

accuracy-score chi2-contingency confusion-matrix logistic-regression matplotlib-pyplot numpy pandas python3 random-forest roc-auc-score seaborn train-test-using-sklearn

Last synced: 18 Apr 2026

https://github.com/sundanc/weatherprediction

This project implements a weather prediction system that predicts the temperature based on real-time weather data, including features like humidity, wind speed, and day-related features (day of the week, month

machine-learning machinelearning numpy pandas programming python scikit-learn scikitlearn-machine-learning weather-prediction

Last synced: 18 Apr 2026

https://github.com/hariprasath-v/machinehack-analytics-olympiad-2022

Create a machine learning model to help an insurance company understand which claims are worth rejecting and the claims which should be accepted for reimbursement.

catboost-classifier exploratory-data-analysis logloss machinehack numpy optuna pandas python scikit-learn shap

Last synced: 18 Apr 2026

https://github.com/ricardorobledo/spamemailclassification

Spam email classification using machine learning (Random Forest, SVC, Logistic Regression, etc.) with data balancing techniques (SMOTE, BorderlineSMOTE, ADASYN). Final calibrated Random Forest model achieves ROC-AUC 0.982 and PR-AUC 0.979 on the Spam Email Classification dataset.

imbalanced-data imbalanced-learning numpy pandas python3 sklearn

Last synced: 05 Apr 2026

https://github.com/dyceszn/i.duruohacos102

First-year deep dive into problem-solving with Python—featuring class practices and projects using NumPy, Pandas, Tkinter, and psycopg2.

numpy psycopg2 python tkinter

Last synced: 18 Apr 2026

https://github.com/ayushmaan-xd/exploratory-spotify-data-analysis

Exploratory Spotify Data Analysis is a project where I analyzed Spotify’s music dataset to uncover trends in audio features and song popularity. Using Python and data visualization tools

jupyter-notebook matplotlib numpy pandas python seaborn spotify-dataset

Last synced: 18 Apr 2026

https://github.com/ashithapallath/feature-engineering

This repository contains a range of examples and techniques for feature engineering, aimed at improving dataset quality and boosting model performance. It covers essential methods such as Exploratory Data Analysis (EDA) and Interquartile Range (IQR) analysis for detecting and handling outliers.

exploratory-data-analysis feature-engineering iqr-method matplotlib numpy outlier-detection pandas python seaborn

Last synced: 18 Apr 2026
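
The IQR method mentioned in the entry above can be sketched roughly as follows. This is a generic illustration, not code from that repository: the helper name `iqr_outlier_mask`, the sample data, and the conventional 1.5 multiplier are all assumptions made for the example.

```python
import numpy as np

def iqr_outlier_mask(values, k=1.5):
    """Return a boolean mask that is True where a value is an IQR outlier.

    Hypothetical helper: k=1.5 is the common rule of thumb, not a value
    taken from the listed project.
    """
    values = np.asarray(values, dtype=float)
    q1, q3 = np.percentile(values, [25, 75])  # first and third quartiles
    iqr = q3 - q1                             # interquartile range
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return (values < lower) | (values > upper)

# Made-up sample with one obviously extreme value.
data = np.array([10, 12, 11, 13, 12, 95])
mask = iqr_outlier_mask(data)
cleaned = data[~mask]  # keep only the values inside the fences

print(mask)
print(cleaned)
```

Returning a mask rather than the filtered array keeps the choice between dropping, capping, or flagging the outliers with the caller.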

https://github.com/alexkhype/04python-carpricepredictor

Machine learning model for predicting the market value of used cars, optimizing accuracy and efficiency.

catboost data-science lightgbm machine-learning numpy pandas python sklearn statistics xgboost

Last synced: 05 Apr 2026

https://github.com/jai0212/ai-virtual-painter

Real-time finger and object painting using advanced computer vision technology, with custom hand gestures and customizable colours

artificial-intelligence cmake computer-vision cpp mediapipe numpy opencv python

Last synced: 05 Apr 2026

https://github.com/lexxai/goit_python_ds_hw_04

Module 4. Classification and model performance evaluation. Linear regression: overfitting and regularization

lasso-regression linear-regression numpy pandas python red regression ridge-regression scikit-learn

Last synced: 05 Apr 2026

https://github.com/perpendicooler/elementary-research-for-steamboat-willie-s-store-in-poland

Elementary research for a company opening a store in a city, using Gurobi and PuLP optimization.

christofides-algorithm gurobipy numpy pandas pulp python3 scikit-learn travelling-salesman-problem

Last synced: 05 Apr 2026

https://github.com/paulo-santos-ds/visao_computacional

The system uses artificial intelligence to analyze visitors' documents and physical characteristics, ensuring that only people over 18 can access the gaming areas

keras numpy pandas plotly python restnet50 sklearn tensorflow

Last synced: 05 Apr 2026

https://github.com/faraazarsath/guvi-task-4

This repository contains Python scripts for assessing and categorizing student performance data from two CSV files. The tasks include categorizing students based on their CodeKata scores.

data-visualization matplotlib numpy pandas

Last synced: 18 Apr 2026

https://github.com/noopur-phadkar/botnavigator

BotNavigator utilizes the A* (A-Star) algorithm for pathfinding, leveraging Python along with libraries such as Matplotlib and Numpy for efficient array operations and visualization. This innovative solution optimizes robotic navigation within warehouse environments, enhancing operational efficiency and scalability.

a-star-path-finding matplotlib numpy pathfinding-algorithms python robotics

Last synced: 18 Apr 2026

https://github.com/autokmg/ai-foundations

Practice exercises related to the Foundations of Artificial Intelligence university course. The code examples are designed to help students gain a deeper understanding of these topics and to prepare for exams and assignments. The repository is open to contributions from other students.

numpy python

Last synced: 18 Apr 2026

https://github.com/rhazra-003/modelinglab

Assignment - Solution of Modeling Lab of 7th Semester (Batch of 2023) of NIT Durgapur

jupyter-notebook matplotlib-pyplot numpy pandas python3 shapely

Last synced: 19 Apr 2026

https://github.com/jcardonamde/reto_cobranzas_sector_financiero

This project analyzes data provided by the organization iData, focusing on the collection process of a financial company. The objective is to develop segmentation strategies that enhance portfolio recovery and improve debt management.

data-science kmeans-clustering machine-learning machine-learning-algorithms matplotlib numpy pandas python seaborn

Last synced: 19 Apr 2026

https://github.com/mwoss/opencv-raspberry-pi

Video capture on Raspberry Pi/PC using OpenCV

numpy opencv python rapsberrypi video-capture

Last synced: 19 Apr 2026

https://github.com/simonusher/multi-layer-perceptron

My solution to an assignment on multi-layer perceptron. Implemented in python and numpy. Contains the forward and backward pass code, as well as some utility functions.

multi-layer-perceptron neural-network numpy python

Last synced: 19 Apr 2026

https://github.com/crazy-dot/zomato-data-analysis

This project analyzes 50k Bengaluru restaurants from Zomato, focusing on 17 features like location and ratings. It cleans, explores, and visualizes data to improve services. Key visualizations include delivery, booking, location, and cost. The goal is to provide insights for better customer experiences.

data-cleaning-and-preprocessing data-manipulation-with-pandas inferential-statistics kaggle-dataset numpy pandas-python python zomato-data-analysis

Last synced: 19 Apr 2026

https://github.com/renukadhule/airbnb_listings_and_reviews_python_analysis

Explore Airbnb listings and reviews from over 250,000 properties across 10 major cities. Dive into data on listing details, host information, pricing, and customer reviews for better insights

matplotlib-pyplot numpy pandas python seaborn

Last synced: 19 Apr 2026

https://github.com/shreeya-methuku/predictive_analysis

This project leverages machine learning to perform time-series forecasting on key financial metrics, including sales (income), expenses, cash flow, and profit.

arima financial-forecasting numpy pandas sarima xgboost

Last synced: 20 Apr 2026

https://github.com/vyjayanthipolapragada/car_mileage_prediction

Predicting car mileage using a linear regression model with scikit-learn

kaggle-titanic linear-regression machine-learning numpy pandas predictive-modeling python scikit-learn

Last synced: 20 Apr 2026

https://github.com/prahaladhchandrahasan/housingprices_adavanced_regression

A machine learning model for the "House Prices: Advanced Regression Techniques" Kaggle competition.

machine-learning-algorithms matplotlib-pyplot numpy pandas python3 scikit-learn

Last synced: 20 Apr 2026

https://github.com/saurabh9136/data-analysis_using_pandas-numpy

A beginner-friendly repository exploring data analysis using NumPy and Pandas. Covers fundamental operations, data manipulation, and real-world dataset analysis.

data-analysis-python numpy pandas python scipy

Last synced: 20 Apr 2026

https://github.com/prithvikings/zomato_data_anaysis

This analysis provides insights into customer preferences and restaurant performance on Zomato. The visualizations and findings can help Zomato make informed decisions to improve customer experience and tailor their offerings.

matplotlib numpy pandas python seaborn

Last synced: 20 Apr 2026

https://github.com/jbalooshie/school_district_analysis

Analysis of standardized testing results using NumPy and Pandas, executed in Jupyter Notebook. Summaries of the testing results are provided based on school, test type, and grade level.

data-analysis data-science dataframes jupyter-notebook numpy pandas python

Last synced: 20 Apr 2026

https://github.com/vtorres09/projetomonitoriaip

📋 Data analysis of a dataset created to select students to be teaching assistants (Introduction to Programming).

numpy object-oriented-programming pandas python

Last synced: 20 Apr 2026

https://github.com/who-else-but-arjun/pyspyce-cs

Circuit simulator for R, L, C components and AC/DC voltage and current sources, built using Python as part of the EE204 circuit theory course project.

circuit-simulator numpy python streamlit sympy

Last synced: 20 Apr 2026

https://github.com/abyss-s/data-analysis-project

[Computational Thinking and Data Analysis Basics] Final project (2024-1)

matplotlib numpy pandas python sckiit-learn

Last synced: 20 Apr 2026

https://github.com/zombietdv/n_gram-model

n_gram model implementation with numpy only

backpropagation numpy

Last synced: 21 Apr 2026

https://github.com/souvikmajumder26/neural-network-from-scratch

🚀 Building a simple Neural Network to classify the images of the Fashion-MNIST dataset, using only numpy and no TensorFlow, Keras, or PyTorch.

config fashion-mnist logger neural-network neural-network-from-scratch notebook-jupyter numpy project python

Last synced: 21 Apr 2026

https://github.com/aleklukanen/systemsofsprings

This repository contains a few helpful pieces of Python code that can be used to compute time-dependent positions of springs in 2 and 3 dimensions.

finite-element-methods finite-elements jupyter-notebook numerical numpy python

Last synced: 21 Apr 2026

https://github.com/danpoynor/pet-shelter-data-analysis-notebook

Demonstration of skills analyzing data from a pet shelter. The CSV data contains tables detailing the incoming and outgoing animals and I use my knowledge of Pandas to gather and present the requested information.

csv data-analysis data-cleaning data-science jupyter-notebook matplotlib numpy pandas pet-shelter tabular-data

Last synced: 21 Apr 2026

https://github.com/star2dust/numpy-notes

A tutorial to learn numpy and matplotlib.

matplotlib numpy tutorial

Last synced: 16 Jun 2026