An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/dmarks84/ind_project_european-soccer-top-points-contributors--kaggle

Independent Project - Kaggle Dataset-- I worked on the European Soccer Dataset, using SQL (SQLite) to read in the data and then data wrangling before running statistical analysis and hypothesis testing on questions of who helps earn the most points for their team.

data-wrangling hypothesis-testing numpy p-values pandas python scipy-stats statistics t-test

Last synced: 09 Apr 2026

https://github.com/rushilsharma1/iphonesales_eda

An exploratory data analysis of an Iphone sales in jupyter notebook with the help of python and its libraries

data-science exploratory-data-analysis jupyter-notebook numpy pandas python seaborn

Last synced: 20 Jan 2026

https://github.com/blinksta1ker/co2-prediction

EDA and Supervised ML model using Regression to predict Co2 Emissions from Vehicles

jupyter-notebook machine-learning numpy pandas python regression seaborn sklearn supervised-learning

Last synced: 09 Apr 2026

https://github.com/jeniljani-4444/end-to-end-car-price-prediction-model

Predict car prices effortlessly using this machine learning model. Built with Python and Scikit-learn it analyzes features like mileage age brand and more to estimate accurate prices. Perfect for buyers sellers and dealerships.

machine-learning matplotlib numpy pandas python scikit-learn seaborn streamlit

Last synced: 10 Apr 2026

https://github.com/hansalemaos/npsearchsequence

Sequence / string search with NumPy

fast numpy search sequence string

Last synced: 19 May 2026

https://github.com/themysterysolver/python_basics

"This repo includes simple Python programs for beginners, covering various concepts, object-oriented programming, and different module usages, to help with understanding and learning Python."

basic-python-syntax collections dsa matplotlib numpy oops-in-python pandas python re scipy

Last synced: 05 Apr 2026

https://github.com/ashar97/advanced-machine-learning

Advanced Machine Learning MOOC Specialisation Assignments

keras numpy python3 tensorflow

Last synced: 09 Apr 2026

https://github.com/jjwizardmp/gallery-art-problem

Resolve the Gallery Art Problem implement BFS and Delaunay Triangulation

jupyter jupyter-notebook jupyterlab mathplotlib numpy pandas python python-env python3 scipy shapely

Last synced: 09 Apr 2026

https://github.com/sunilvarma-l/zomato-data-analysis-using-python

its a Python-based project to analyze and visualize Zomato restaurant data. This project explores various aspects of restaurant data, such as customer reviews, restaurant ratings, locations, and cuisines, with the aim of uncovering insights and trends in the food industry.

googlecollab matplotlib numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/mrandolph95/grad-projects

Projects completed during my ASU grad program for Social Data Science.

beautifulsoup nltk numpy pandas regression-analysis sckiit-learn seaborn selenium

Last synced: 09 Apr 2026

https://github.com/kgruiz/linalg-practice

LinAlg-Practice is a Python library developed to deepen my understanding of linear algebra through hands-on implementation of various matrix operations. It includes comprehensive tests that compare the results with established libraries like NumPy to ensure accuracy and reliability.

algorithms data-science linear-algebra math matrix-operations numpy python sympy

Last synced: 21 Apr 2026

https://github.com/waghraj1699/codeclauseinternship_customer_segmentation_tool

Implementing customer segmentation using ML, covering exploring advantages, preprocessing, K-means clustering, and visualization.

data-preprocessing kmeans-clustering machine-learning numpy pandas visualization

Last synced: 08 May 2026

https://github.com/mogankumar/covid-19-classification

A deep learning-based project that classifies chest X-ray images into Normal, COVID-19, and Viral Pneumonia using a two-layer CNN architecture. Achieves high accuracy and demonstrates potential for aiding in early diagnosis and treatment of COVID-19.

keras-tensorflow matplotlib numpy python3

Last synced: 11 May 2026

https://github.com/vetrivel07/flight-price-prediction

Developed a flight price prediction model using Python, analyzing historical data to forecast airfare prices and help travelers make informed booking decisions

data-analysis data-visualization jupyter-notebook numpy pandas python

Last synced: 15 Jun 2025

https://github.com/niteshchawla/aerofit-casestudy

The market research team at AeroFit wants to identify the characteristics of the target audience for each type of treadmill offered by the company, to provide a better recommendation of the treadmills to the new customers. The team decides to investigate whether there are differences across the product with respect to customer characteristics.

matplotlib-pyplot numpy pandas probability python seaborn statistics

Last synced: 09 Apr 2026

https://github.com/mrmalik2512/catsvsdog.github.io

A CNN model integrated with flask backend the project is trained on image data of dogs and cats and integrated with a website predicts the given image is dog or a cat

deep-learning numpy python scikit-learn tensorflow

Last synced: 09 Apr 2026

https://github.com/lexxai/goit_python_ds_hw_02

Модуль 2. Знайомство з Pandas, Аналіз файлів, Аналіз датасет c Kaggle.com.  Python Data Science. Homework.

datascience google-colab homework jupyter-nootebook matplotlib numpy numpy-arrays pandas python seaborn seaborn-plots

Last synced: 17 Apr 2026

https://github.com/jovicdev97/Financial-Loan-DataScience-Notebook

using numpy and pandas to analyze a synthetic loan dataset with python

data-analysis matlabplot numpy pandas plotting python seaborn

Last synced: 12 Mar 2025

https://github.com/vinicius999/eda-imdb-top1000-films

Análise exploratória dos Top 1000 filmes no IMDB até 2020

eda numpy pandas python

Last synced: 07 May 2026

https://github.com/sawaira-iqbal/data-visualization-project-on-car-sales-data

Explore Sales Data Visualization with Interactive Charts & Insights! 📊 Uncover trends and patterns to drive smarter automotive decisions.

bivariate-analysis data-science data-visualization interactive-visualizations matplotlib multivariate-analysis numpy plotly python seaborn univariate-analysis

Last synced: 12 Apr 2026

https://github.com/hemanthsaikumarbethapudi/last-mile_delivery_optimization.py-

This project implements an AI-powered last-mile delivery optimization system using Python. It leverages graph-based route optimization, demand prediction, and cost analysis to improve delivery efficiency in urban areas.

heapq networkx numpy python

Last synced: 27 Aug 2025

https://github.com/estaheri7/dotawinpredictor

This project aims to predict the outcome of Dota 2 matches based on the heroes picked by both the Radiant and Dire teams

data-science dota2 machine-learning numpy pandas python random-forest sklearn support-vector-machines

Last synced: 09 Mar 2026

https://github.com/gitluffywd/advanced_video_stabilization_and_roi_pixel_analysis

This repository offers tools for advanced video stabilization and pixel analysis, enhancing your video processing capabilities. Explore features like optical flow tracking and ROI selection to dive deeper into your video data. 🛠️🎥

data-structures debugging english-language git github html markdown numpy opencv project-management python

Last synced: 20 Jan 2026

https://github.com/buildwithlal/introduction-to-data-science-in-python-coursera

introduction to data science in python, part of Applied Data Science using Python Specialization from University of Michigan offered by Coursera

data-analysis matplotlib numpy pandas

Last synced: 03 May 2026

https://github.com/als8446/tripleten-data-science-projects

Projects Overview Projects made in the Data Scientist course from TripleTen LatAm

data data-analysis hypothesis-tests machine matplotlib numpy pandas python scipy sklearn

Last synced: 10 Apr 2026

https://github.com/nahiyanhkhan/stock-market-data-analysis_capstone-project

In this course, learned and solved assignments on SQL and Python. Final capstone project was on analyzing "Stock Market Data". Achieved 100% score in every assignment.

data-analysis data-analytics matplotlib mysql mysql-database numpy pandas python sql

Last synced: 09 Apr 2026

https://github.com/abhisek-13/multi-commodity-price-predictor

The Multi-Commodity Price Predictor is a machine learning project that predicts the prices of cars, bikes, and laptops. Using advanced techniques, it provides accurate price predictions based on user inputs. The project features an easy-to-use interface built with Streamlit, enabling users to get price predictions quickly and easily.

machine-learning machine-learning-algorithms matplotlib numpy pandas python seaborn sklearn sklearn-library streamlit

Last synced: 09 Apr 2026

https://github.com/yalai92/cava_imp_exp_analysis

Analysis, visualization, preprocessing and clustering of global sparkling wine trade (2017–2024) using Python in Colab and ML to reveal trends and country profiles.

clustering colab-notebook data-science data-visualization machine-learning matplotlib numpy pandas preprocessing python sklearn tableau-public

Last synced: 25 Jan 2026

https://github.com/abdullahashfaqvirk/sms-spam-detection

A machine learning application designed to classify SMS messages as spam or non-spam, offering real-time analysis to identify potentially harmful content.

css3 docker flask html5 javascript matplotlib nltk numpy pandas python scikit-learn seaborn tailwindcss xgboost

Last synced: 02 Apr 2026

https://github.com/dmarks84/coursework_project_ml-model-eval-refine

Project for IBM Data Science course on ML Models & Analysis -- Read in large dataset of home sales and utilized polynomial linear regression analysis to make predictions of future home sales prices

classification communication data-modeling dataframes machine-learning matplotlib numpy pandas programming python regression scikit-learn scipy seaborn supervised-ml visualization

Last synced: 09 Apr 2026

https://github.com/L-Gre/TradeFX-AI

A Python-based project exploring algorithmic trading strategies, including backtesting, real-time data integration, and predictive modeling with TensorFlow and Keras. Key topics include technical indicators, risk management, and leveraging AWS and broker APIs for automated trading.

algorithmic-trading aws backtesting broker-api keras machine-learning matplotlib numpy pandas portfolio-optimisation python real-time-data-integration risk-management technical-indicators tensorflow

Last synced: 26 Sep 2025

https://github.com/marcinz20/sortingalgorithms

This is a basic console program which contains just a few, basic sorting algorithms and showcases their usage

algorithms numpy object-oriented-programming python

Last synced: 15 May 2026

https://github.com/qanastek/parseur-pdf

https://trello.com/b/SbT2XGyF/g%C3%A9nie-logiciel-scrum

beautifulsoup4 numpy python

Last synced: 15 May 2026

https://github.com/hadson0/dip-candy-classification

Candy classification project using digital image processing. Includes dataset creation, model development, and evaluation tools with TensorFlow, OpenCV, and Albumentations. Complete setup and instructions provided for building and training the model.

albumentations bounding-boxes cnn computer-vision cvat image-classification image-processing keras machine-learning matplotlib numpy opencv python seaborn sklearn tensorflow

Last synced: 29 Sep 2025

https://github.com/sandyherho/buku_metnum_python

Repositori ini memuat kode dan gambar yang digunakan pada buku "Pengantar Metode Numerik Terapan Menggunakan Python".

belajar-pemrograman belajar-python indonesia indonesian-language matematika-terapan matplotlib metode-numerik numerical-methods numpy python scipy

Last synced: 16 Jan 2026

https://github.com/wasifsohail5/virtual_mouse

Virtual Mouse is an AI-powered, gesture-based control system that replaces traditional peripherals with hand movements. Using MediaPipe for hand tracking and PyAutoGUI for cursor control, it enables seamless clicking, scrolling, and navigation through simple gestures. Experience a futuristic, touchless interaction with your PC! 🚀🖱️

cv2 mediapipe mediapipe-hands numpy opencv pyautogui python

Last synced: 02 Aug 2025

https://github.com/mustafadanabasi/python-linearregression-evfiyatlari

Ev Fiyatlarını Linear Regrasyon ile tahminleme çalışması.

linear-regression numpy pandas python

Last synced: 06 Apr 2026

https://github.com/saif-gitreps/face-mask-detection

A CNN classification model scoring 94% accuracy built from scratch using a dataset that detects if an image of a person has Mask on.

cnn-classification image-classification image-processing kaggle keras numpy pandas

Last synced: 09 Apr 2026

https://github.com/magnitopic/ft-linear-regression

Your first implementation of a machine learning algorithm. Predicting the price of a car by it's milage

42school ai ai-algorithm ft-linear-regression linear-regression machine-learning matplotlib numpy pandas prediction-algorithm python

Last synced: 16 Apr 2026

https://github.com/azaz9026/eda

Exploratory Data Analysis (EDA) refers to the method of studying and exploring record sets to apprehend their predominant traits, discover patterns, locate outliers, and identify relationships between variables. EDA is normally carried out as a preliminary step before undertaking extra formal statistical analyses or modeling.

data-cleaning data-visualization encoding machine-learning matplotlib numpy pandas plotly python3 seaborn sklearn-library

Last synced: 15 Apr 2026

https://github.com/lc-rezende/eqx_boston_dataset

Exploratory data analysis, clustering, and forecasting on Boston crime data (2011-2015), revealing key crime trends, hotspots, and temporal patterns to support data-driven insights for urban safety and policing strategies.

data-analysis exploratory-data-analysis jupyter-notebook kmeans matplotlib numpy pandas prophet-facebook python scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/tomdewildt/google-foobar

Solutions for the Google Foobar problems implemented in Python

foobar google numpy pylint python

Last synced: 22 Feb 2026

https://github.com/Marvin-VW/3dengine_python

3D engine written in Python using OpenCV and NumPy

3dengine numpy opencv python

Last synced: 29 Sep 2025

https://github.com/akshatkmistry/parkinsons_disease_predictor-voice_measures

This project implements a machine learning system to detect Parkinson's disease using voice measurements. The application uses a Random Forest classifier trained on voice feature data to predict the likelihood of Parkinson's disease with high accuracy (94%).

machine-learning matplotlib numpy pandas random-forest-classifier seaborn sklearn streamlit

Last synced: 09 Apr 2026

https://github.com/franciscomartinez45/Social-Network-Analysis

Applied to analyze how misinformation propagates within communities. With the goal of addressing health disparities and improving health literacy particularly in minority populations, the project explores both supervised and unsupervised learning approaches to understand patterns in graph-structured data using a custom Graph Attention Network

matplotlib ml networkx numpy pytorch

Last synced: 11 Apr 2025

https://github.com/anselmoo/useful-math-functions

useful-math-functions for Optimization, Benchmarking, Visualizing, and more ...

basic functions math mathematics numpy

Last synced: 17 Mar 2025

https://github.com/kasraskari/tumor-predict

Streamlit app for predicting tumor malignancy using logistic regression.

logistic-regression machine-learning numpy pandas python scikit-learn streamlit tumor-detection

Last synced: 09 Apr 2026

https://github.com/sultanazhari/the-effect-of-weather-on-travel-frequency

Zuber Company need to know passenger preferences and external effect on the trip. As an Data Analyst we're provide actual data who has affect on travel frequency.

matplotlib-pyplot numpy python3 scipy seaborn

Last synced: 03 Aug 2025

https://github.com/mays-m/deeplearning

In deep learning, with the help of data and training, neural networks are created and trained, which create better predictions than before with the help of neural networks.

anaconda artificial-intelligence conda-environment matplotlib neural-networks numpy pandas python3 tensorflow tensorflow-gpu transortfow-

Last synced: 11 Apr 2026

https://github.com/roaajadaa/text-classification

Classify documents using custom naïve bayes model

matplotlib nlp nltk numpy pandas python sklearn

Last synced: 10 Apr 2026

https://github.com/nischalkshaj/image-identification

This is a repository for AI image training for beginners.

express mongodb node numpy pillow python3 pytorch reactjs

Last synced: 09 Apr 2026

https://github.com/sejalmankar1012/customer_churn

The goal of this project was to develop a machine learning model that predicts customer churn based on historical customer data.

dataset deep-learning deployment machine-learning model-building modeling numpy optimization pandas seaborn streamlit tensorflow

Last synced: 05 Apr 2026

https://github.com/kevin-rsj/real-estate-investments

Sistema de scoring que clasifica ciudades francesas para inversión en segundas viviendas según perfil de riesgo(alto, moderado y bajo). Evalúa ratios clave en áreas como demanda, disponibilidad, infraestructura, demografía y precios.

bigquery data-analytics looker-studio numpy pandas python sklearn-library sql visualization

Last synced: 09 Apr 2026

https://github.com/mobinabatebi/electricity_consumption_project

The project aimed at estimating electricity consumption in administrative offices, along with analyzing the impacts of such consumption on power outages, is critical for addressing the pressing issue of summer electricity shortages.

jupyter-notebook numpy pandas python

Last synced: 28 Sep 2025

https://github.com/dmarks84/ind_project_data-science-london-scikit-learn--kaggle

Independent Project - Kaggle Competition -- I worked on the Data Science London data set for the Data Science London + Scikit-learn competition.

classification cross-validation data-modeling data-reporting data-visualization dataframes eda grid-search matplotlib numpy pandas python sklearn statistics supervised-ml

Last synced: 06 Apr 2026

https://github.com/niteshchawla/delhivery-featureengineering

The Data team builds intelligence and capabilities using this data that helps them to widen the gap between the quality, efficiency, and profitability of their business versus their competitors.

feature-engineering hypothesis-testing matplotlib-pyplot numpy pandas-python scipy-stats seaborn

Last synced: 14 May 2026

https://github.com/jose-zothner-meyer/finance-individual-loans

A comprehensive analysis of a financial institution's loan portfolio using Python, focusing on data preprocessing, exploratory data analysis (EDA), and feature engineering.

credit data-science data-visualization eda exploratory-data-analysis finance jupyter-notebook matplotlib numpy pandas plotly python scipy seaborn

Last synced: 09 Apr 2026

https://github.com/abdulsamie10/pythonbasics

This repository contains few tasks, which I developed just to get a strong grip on Python Programming Langauge.

ai labtasks lambda numpy python pythonlab

Last synced: 20 Apr 2026

https://github.com/christabelsakyi/employee_attrition_prediction

This project implements a machine learning pipeline for predicting employee attrition using HR data. The system analyzes various factors that contribute to employee turnover and provides predictions to help HR departments take preventive measures.

machine-learning numpy pandas plotly python sklearn streamlit

Last synced: 09 Apr 2026

https://github.com/danish-jamil-01/image-dehazer

Remove the haze from the image using dark channel prior and guided filter

dark-channel-prior guided-filter guided-filtering ndimage numpy open-cv opencv opencv-python pillow python scipy skimage tkinter

Last synced: 09 Apr 2026

https://github.com/thavinduushan/linear-regression

A linear regression model to predict premium price of an insurance. The dataset includes around 1,000 observations. The project, aimed at learning, was implemented in Jupyter notebooks using NumPy, Matplotlib, Pandas, Keras, and TensorFlow.

matplotlib numpy pandas python tensorflow

Last synced: 09 Apr 2026

https://github.com/iv4n-ga6l/simple-image-segmentation

Simple implementation of image segmentation using the Watershed algorithm

imagesegmentation matplotlib numpy opencv python watershed-algorithm

Last synced: 13 May 2026

https://github.com/rahulsm20/car-data

A data analytics project that involves analyzing a car dataset that includes information on various car brands, years, prices, mileage, and fuel types, in order to gain insights into the car market.

data-analysis data-analytics matplotlib numpy pandas python

Last synced: 09 Apr 2026

https://github.com/fazal-coder/data-science-business-analytics-tsf

I have completed two tasks in this repository , task 3 and task 4

matplotlib numpy pandas python scipy

Last synced: 09 Apr 2026

https://github.com/kenwuqianghao/ml-zoomcamp

Code and homework for ML Zoomcamp

machinelearning numpy pandas python3 tensorflow

Last synced: 06 Apr 2026

https://github.com/RedInfinityPro/Python-Task-Manager

Rating: (7/10): The Python Task Manager, is a user-friendly interface using tkinter library for system monitoring and management. It displays detailed system process information using psutil, with interactive features.

matplotlib numpy psutil pygetwindow tkinter

Last synced: 30 Sep 2025

https://github.com/RedInfinityPro/PygameCicruit

Rating: (6/10) The code combines a Pygame GUI application with a PySpice circuit simulation, providing a user-friendly interface with buttons, menu options, dropdown menus, input boxes, sliders, and text elements.

matplotlib numpy os pygame pyspice random sys

Last synced: 30 Sep 2025

https://github.com/RedInfinityPro/CircuitPythonProjects

Rating: (4/10) This Python script simulates circuits using PySpice, importing libraries like NumPy and Matplotlib. It defines voltage divider, diode circuits, subcircuit definitions, DC sweep analysis, and simulates each circuit using the simulator.

matplotlib numpy os pyspice sys

Last synced: 30 Sep 2025

https://github.com/mzayles/vendas_ficticias_dataprep

💻📊✅ Curso de Programação em Python para Data Science | Analisando e tratando dados fictícios.

numpy pandas python

Last synced: 15 Jun 2025

https://github.com/mgitrov/coce

A deep learning-based project aiming to classify images out of 10 classes.

computer-vision convolutional-neural-networks deep-learning docker fastapi keras matplotlib numpy pillow regularization

Last synced: 06 Apr 2026

https://github.com/eesunmoon/genai_cor-recom

[Project] Outfit Coordination Recommender System using KoAlpaca

data-analysis fine-tuning generative-ai huggingface keyword llm numpy pandas python python3 selenium

Last synced: 06 Apr 2026

https://github.com/elissorokin/data-analyst-portfolio

Это репозиторий, в котором я демонстрирую свои навыки, делюсь проектами и отслеживаю прогресс в области анализа данных и Data Science.

ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis

Last synced: 09 Apr 2026

https://github.com/clchinkc/zombie

Personal project, Python, NumPy, Matplotlib, Pygame, Scikit-learn, TensorFlow, Docker

algorithms data-analysis docker machine-learning matplotlib numpy pygame python sklearn tensorflow zombie-simulation

Last synced: 05 Apr 2026

https://github.com/subhas-pramanik-09/mediscan-ai

A smart and scalable ML-powered health prediction system that can help detect the risk of three major diseases: Diabetes + Heart Disease + Parkinsons Disease

jupyter-notebook logistic-regression machine-learning numpy pandas scikit-learn streamlit svm-classifier

Last synced: 09 Apr 2026

https://github.com/prat555/python_practice

These files contain the Python exercises I completed as practice.

deep-learning numpy pyhton3 regex

Last synced: 01 Oct 2025

https://github.com/lfgodoi/rpm-deep-guesser

A deep learning-based RPM estimator based on spectral features extracted from vibration signals of rotating machines.

condition-monitoring deep-learning docker flask machine-learning neural-networks numpy python pytorch scipy signal-processing spectral-analysis

Last synced: 02 Mar 2025

https://github.com/shreyasdankhade/portfolio_optimatization_project

The Portfolio Optimization Project uses optimization techniques to balance risk and return, helping investors make efficient asset allocation decisions.

flask flask-application matplotlib numpy pandas pandas-python porfolio-optimization portfolio python

Last synced: 06 Apr 2026

https://github.com/ondiekelijah/data-analysis-udacity

Udacity Data Analysis Nano Degree course resources

jupyter-notebook matplotlib numpy pandas python

Last synced: 17 Mar 2025