An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/rachakondaganesh/superstore-sales-data-analysis-project

Analyzed retail sales data to uncover key business insights using Python and Power BI. Explored patterns in profit, sales, and customer segments across regions. Built interactive dashboards to visualize trends, identify top-performing categories, and highlight areas for improvement in shipping and discount strategies.

dashboard exploratory-data-analysis matplotlib-pyplot numpy pandas powerbi seaborn

Last synced: 02 May 2026

https://github.com/ahmedabdalkreem/lung-cancer

Helps determine whether a patient has lung cancer and shows which factors the disease depends on.

matplotlib numpy pandas python sklearn-library

Last synced: 14 Apr 2026

https://github.com/nikhleshshukla123/web-scraping-using-python

Scrapes multiple pages of Amazon search results using Python.

beautifulsoup4 numpy pandas python

Last synced: 03 May 2026

https://github.com/shahriarha/programming

Python problem solving, HR-30 days of code, DSA, NumPy

algorithms-and-data-structures cpp numpy programming python

Last synced: 06 May 2026

https://github.com/faisal-khann/ipl-analysis

The IPL Analysis project is a comprehensive data-driven exploration of the Indian Premier League (IPL), analyzing historical match data to uncover patterns in team performance, player statistics, and match outcomes.

data-analysis exploratory-data-analysis jupyter-notebook matplotlib numpy pandas seaborn

Last synced: 08 May 2026

https://github.com/armahdavi/analytics_statistics_ml_plotting_dust_extraction_hvac_filters_ph2

PhD Technical Paper 1 - Phase 2 - Mahdavi & Siegel (2020) (Aerosol Science & Technology; AS&T) - Sharing all the data pipelines, processing code, descriptive statistics, statistical models, and plotting/visualizations - Project Milestone: 2017 - 2020 - Full-length article is available

data-pipelines data-science data-visualization machine-learning matplotlib-pyplot numpy pandas-dataframe python scipy-stats sklearn statistics

Last synced: 14 Apr 2026

https://github.com/abdulhadi806/muffin-or-cupcake-model

Machine learning model that checks whether the item I input is a muffin or a cupcake. I am in a learning phase; today is 1/25/2024, let's see how far I can go in this field. Wish me luck :)

ai jupyter-notebook machine-learning numpy pandas python sklearn support-vector-machine svm

Last synced: 09 Apr 2026

https://github.com/arsh-pixel-cmd/ai-image-classifier

AI Image Classifier is a web app that uses AI and deep learning to classify images in real-time. Built with Streamlit, TensorFlow (MobileNetV2), OpenCV, and NumPy, it allows users to upload single or multiple images and get the top-3 predictions with confidence scores instantly in the browser.

docker numpy opencv python3 streamlit tensorflow2 uv

Last synced: 10 Apr 2026

https://github.com/naveen88112/vehicle-routing-problem

Vehicle Routing Problem (VRP) Solver. This project applies a Genetic Algorithm (GA) to the Vehicle Routing Problem (VRP) for more than one vehicle, seeking delivery routes that reduce travel distance and increase efficiency.

deap genetic-algorithm matplotlib numpy python vehicle-routing-problem

Last synced: 13 Apr 2026

https://github.com/pem-humboldt/wiutils

Exploration utilities for Wildlife Insights projects.

biodiversity camera-trap images matplotlib numpy pandas python seaborn trail-cameras wildlife-insights

Last synced: 09 Aug 2025

https://github.com/shoaib1522/introduction-to-data-science

Unlock the world of data science with hands-on tutorials, interactive notebooks, and essential resources! Whether you're a beginner or looking to refine your skills, this repo offers a comprehensive guide to mastering Python, Git, NumPy, Pandas, and more—empowering you to analyze, visualize, and transform data like a pro! 🚀📊💻

confusion-matrix github-config kaggle matplotlib numpy pandas python3 script visualization

Last synced: 14 Apr 2026

https://github.com/ianhaggerty/final-capstone

This represents the final capstone project in my HyperionDev Data Science (fundamentals) course. A dataset of Amazon customer reviews is analysed using natural language processing.

amazon data-analytics data-science data-visualization dataset matplotlib nlp nlp-machine-learning numpy pandas plotly reviews seaborn spacy tabulate textblob wordcloud

Last synced: 19 Jan 2026

https://github.com/cinnaavox/loan-prediction

Machine Learning project predicting loan approvals using Decision Tree Classification. Includes data cleaning, feature engineering, model evaluation and key business insights.

decision-trees matplotlib numpy pandas python scikit-learn seaborn

Last synced: 14 Apr 2026

https://github.com/farhannirzhor/python_eda_project_on_diwali_sales_analysis

In this project, I was given sales data, analyzed it, and shared a summary.

data-cleaning data-manipulation exploratory-data-analysis matplotlib numpy pandas python seaborn

Last synced: 04 May 2026

https://github.com/sabdikay/analysis-of-biodiversity

This project analyzes biodiversity data from the National Parks Service, focusing on species in various park locations. Conducted in Jupyter Notebook, it uses pandas, matplotlib, NumPy, seaborn, and chi2_contingency for analysis and visualization.

data-analysis data-analysis-python data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 14 Apr 2026

https://github.com/tnel7/data-science-yandex-practicum

Projects from the Yandex Practicum "Data Science Specialist" course

data-science machine-learning nlp-machine-learning numpy pandas seaborn sklearn

Last synced: 19 Jan 2026

https://github.com/amiridk/statistics-and-algebra-in-python

Simple and important statistics operations and famous statistical problems, plus algebra operations

algebra itertools matplotlib numpy pandas python scipy seaborn static

Last synced: 14 Apr 2026

https://github.com/adi3042/diamond_price_prediction

🔮💎 Explore the Diamond Price Oracle! Predicting diamond values based on your preferences and market trends. Your ultimate tool for informed decisions in the diamond world. Join us for the future of pricing! 💎✨🚀 #DiamondPredictor

anaconda datetime diamond flask functools html ipykernel jupyter-notebook matplotlib numpy pandas price-prediction readme regressor sckiit-learn setuptools venv

Last synced: 14 Apr 2026

https://github.com/jcm-ai/Robust-MLOps-Elevating-Machine-Learning-to-Production-Scale

This project aims to address the challenge faced by the Office of Foreign Labor Certification in managing a high volume of job certification applications for employers seeking to hire foreign workers in the United States. By leveraging ML models, this project seeks to streamline the process of shortlisting visa applicants based on historical data.

aws-ec2 aws-ecr aws-s3 cicd-pipeline data-science docker-image end-to-end exploratory-data-analysis git github-actions html machine-learning-models mlops mlops-project model-deployment mongodb numpy pandas python yaml

Last synced: 09 Aug 2025

https://github.com/sahil210695/convolution-evolution

Shows how a kernel convolves over an image and what the output looks like after convolution

cnn convolutional-neural-networks data-science image-kernel image-processing numpy numpy-neural-network python streamlit

Last synced: 10 May 2026

https://github.com/raghulrajn/machine-learning-d-r-y

This repository contains quick Python scripts that are used repeatedly in EDA on datasets

data-science numpy pandas python

Last synced: 09 Apr 2026

https://github.com/akshay-kamath/anomaly_detection

This project addresses the challenge of identifying anomalies in sales data, aiming to optimize sales strategies by uncovering deviations from expected profit patterns.

anomaly anomaly-detection cblof flask isolation-forest knn matplotlib numpy pandas pyod python sklearn

Last synced: 19 Jan 2026

https://github.com/massimilianovisintainer/rock-vs-mine-prediction

Rock vs. Mine classification using Logistic Regression

machine-learning model numpy pandas sklearn

Last synced: 21 Jan 2026

https://github.com/lorenzorottigni/ml-breast-cancer

Machine Learning python bootcamp: Support Vector Machines using breast cancer dataset

ipynb machine-learning numpy pandas python scikit-learn seaborn support-vector-machines

Last synced: 14 Apr 2026

https://github.com/sudarshanc00/smishing

This project aims to classify text messages to detect potential smishing (SMS phishing) attacks. Using machine learning, the project provides a classifier that can differentiate between legitimate messages and smishing attempts, helping to prevent scams.

nltk numpy pandas python scikit-learn scipy

Last synced: 14 Apr 2026

https://github.com/lmizner/codecademy_life_expectancy

Calculate quartiles, quantiles, and the inter-quartile range (IQR) for a variable

histogram jupyter-notebook matplotlib-pyplot numpy pandas python quantiles quartiles

Last synced: 09 Apr 2026

https://github.com/chaakshay/heartdrive

A Streamlit-based tool that analyzes cardiovascular health data, predicts population risk using ML, and suggests targeted government actions like awareness campaigns, health checkups, and policy changes.

csv mathplotlib numpy pandas pandas-dataframe python scikit-learn seaborn streamlit

Last synced: 05 Apr 2026

https://github.com/mr-chang95/udacity-starbucks-challenge

Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.

data data-science data-visualization numpy pandas sklearn

Last synced: 14 Apr 2026

https://github.com/sabbir45ali/cam-board

A Python-based virtual board: you write on it with your finger, clear it, and change colors.

numpy pandas python3

Last synced: 14 Apr 2026

https://github.com/dwija12903/ai-lab

A collection of practical implementations from my AI Labs course

keras numpy sckit-learn tensorflow

Last synced: 19 Jan 2026

https://github.com/billgewrgoulas/recommendation-systems

Algorithms for joke rating prediction using the joke data-set from Kaggle.

algorithm clustering collaborative-filtering machine-learning numpy pandas recommender-system scikit-learn scypi

Last synced: 06 May 2026

https://github.com/zahediparsa/nlp_ta_uok

This repository contains exercises designed for students in the Natural Language Processing (NLP) course at the University of Kurdistan, taught by Dr. Fatemeh Daneshfar. The course took place from February 2024 to July 2024.

cbow jupyter-notebook matplotlib nlp nltk numpy pandas python tf-idf

Last synced: 19 Jan 2026

https://github.com/chandkund/ev-market-segmentation

This project focuses on segmenting the Electric Vehicle (EV) market in India to identify key customer groups based on their preferences, behaviors, and demographics.

machine-learning matplotlib numpy pandas seaborn sklearn visualization

Last synced: 19 Jan 2026

https://github.com/charlenry/python_data_science

My hands-on practice notebooks on Python for Data Science

analysis data dataframe jupyter kaggle matplotlib notebook numpy pandas pyplot python science seaborn visualisation

Last synced: 25 Jun 2026

https://github.com/prashhhant213/exploratory-data-analysis-for-multinational-retail-corporation

Analysis via the CLT and visualization of a multinational retail corporation's data to provide insights and recommendations to improve its user base.

math matplotlib numpy pandas python scipy-stats seaborn stats

Last synced: 09 Apr 2026

https://gitlab.com/DLR-SY/boxbeam

Boxbeam is a legacy Fortran tool translated to Python. It calculates effective beam properties of composite cross sections comprised of rod-like elements.

fortran numpy python

Last synced: 12 Oct 2025

https://github.com/BiocPy/mopsy

Matrix operations

matrix numpy scipy

Last synced: 03 Oct 2025

https://github.com/harmanveer-2546/sentiment-analysis-of-amazon-fine-food

For this project, machine learning algorithms are applied to the Amazon Fine Food Reviews dataset to determine whether a given review is positive or negative.

mathplotlib nlp-machine-learning ntlk numpy pandas pipeline python roberta-model seaborn transformer vader-sentiment-analysis

Last synced: 09 Apr 2026

https://github.com/jsimell/sleepanalysis

A Python data analysis project examining the factors that affect sleep quality and the temporal patterns in the sleep data of a single subject.

data-analysis matplotlib numpy pandas python scikit-learn seaborn

Last synced: 14 Apr 2026

https://github.com/achronus/data-graphing-tool

A tool for finding the perfect graph that fits your CSV data.

data-visualization matplotlib numpy pandas python3

Last synced: 13 Oct 2025

https://github.com/szymon-budziak/real_estate_house_prices_prediction

Predicting real estate house prices using various machine learning algorithms, including data exploration, preprocessing, model training, and evaluation.

data-analysis data-preprocessing data-science eda jupyter-notebook machine-learning matplotlib numpy optuna pandas predictive-modeling price-prediction python random-forest regression scikit-learn seaborn

Last synced: 21 Jan 2026

https://github.com/omdoshi13/pricing-of-laptops-using-ml

Data Analysis, training Machine Learning models, and Model Evaluation and Refinement for Pricing of Laptops dataset.

data-analysis data-analysis-project datascience google-colab jupyter-notebook machine-learning matplotlib model-evaluation model-refinement numpy pandas python scikit-learn

Last synced: 09 Apr 2026

https://github.com/subhas-pramanik-09/mediscan-ai

A smart and scalable ML-powered health prediction system that can help detect the risk of three major diseases: diabetes, heart disease, and Parkinson's disease

jupyter-notebook logistic-regression machine-learning numpy pandas scikit-learn streamlit svm-classifier

Last synced: 09 Apr 2026

https://github.com/velascort/image_equalizer_gui

Image equalizer using LUT interface.

image-processing lut numpy python scikit-image ttk

Last synced: 10 May 2026

https://github.com/wasifsohail5/amusic-ai_powered_musicrecommendationsystem

AMUSIC is an AI-driven music recommendation system that helps users discover personalized songs. Using Python, Streamlit, and Scikit-learn, it offers smart recommendations, advanced search, and interactive music insights. Users can save favorites, create playlists, and export data for a seamless music discovery experience.

joblib k-nearest-neighbours matplotlib minmaxscaler numpy pandas pickle plotly python scikit-learn seaborn streamlit

Last synced: 14 Oct 2025

https://github.com/ayorick23/python-data-science-cheat-sheet

A quick, practical guide to essential Python syntax, commands, and functions for data science. Perfect for remembering how to use the most common libraries, such as NumPy, Pandas, Matplotlib, and Scikit-learn, in your day-to-day analyses.

cheat-sheet data-analysis data-science data-visualization deep-learning jupyter-notebook machine-learning matplotlib ml numpy pandas python scikit-learn scipy seaborn statistics sympy tensorflow

Last synced: 07 Apr 2026

https://github.com/abhinav330/customer-behavior-analysis-linear-regression

This repository explores customer behavior data for an NYC clothing company with both a mobile app and website. They want to understand which platform drives higher sales.

data-analysis data-science data-visualization eda exploratory-data-analysis jupyter jupyter-notebook linear-regression machine-learning machine-learning-algorithms machinelearning-python numpy pandas python regression-analysis

Last synced: 06 May 2026

https://github.com/polyee99/kaggle-titanic-data-analytics

Jupyter notebook that predicts whether passengers died or survived in the tragic Titanic event.

data eda jupiter-notebook matplotlib numpy pandas python regression-analysis test-train-split visualization

Last synced: 05 Feb 2026

https://github.com/elissorokin/data-analyst-portfolio

This is a repository where I showcase my skills, share projects, and track my progress in data analysis and Data Science.

ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis

Last synced: 09 Apr 2026

https://github.com/hoyirul/flask-starter-fs

Flask-Starter-FS is a concise full-stack starter kit for Flask web development, offering quick setup and pre-configured settings for seamless integration of front-end and back-end components. Accelerate your full-stack web application development with Flask-Starter-FS.

flask machine-learning numpy pandas python sklearn

Last synced: 14 Apr 2026

https://github.com/kunalpisolkar24/gradientdescent

A Python implementation of gradient descent for linear regression, visualized with cost function history, parameter trajectory, and a 3D cost surface.

cost-function gradient-descent numpy visualization

Last synced: 06 May 2026

https://github.com/hansalemaos/locate_pixelcolor_cython_multi_auto_compile

Detects colors in images 8x faster than NumPy. Uses Cython; returns a dict

bot c color colors colour colours cpp cython fast find multiprocessing numpy numpy-array opencv rgb rgba search

Last synced: 24 Feb 2026

https://github.com/relostar-devil/census-income-prediction

An end-to-end ML project using 1994 US Census data to classify income (>50K/<50K). The Jupyter Notebook covers data preprocessing, EDA, and model evaluation with multiple classifiers.

matplotlib numpy pandas scikitlearn-machine-learning seaborn

Last synced: 10 May 2026

https://github.com/5hraddha/ice-video-games-sales-analysis

Analysis of video game sales data from the online store Ice, which sells video games all over the world, to identify patterns that determine whether a game succeeds or not.

data-visualization matplotlib numpy pandas requests-module scipy seaborn

Last synced: 14 Apr 2026

https://github.com/RedInfinityPro/DrawImages

Rating: (7/10) The code uses Python's Turtle module to create a canvas image, converts it to grayscale, and then places dots at each pixel's location based on its intensity.

numpy pil random turtle

Last synced: 30 Sep 2025

https://github.com/hossein-cod/parking-space-detection

Identifying empty spaces in the parking lot with OpenCV

cv2 jupyter-notebook numpy opencv pickle python

Last synced: 14 Apr 2026

https://github.com/RedInfinityPro/PygameCicruit

Rating: (6/10) The code combines a Pygame GUI application with a PySpice circuit simulation, providing a user-friendly interface with buttons, menu options, dropdown menus, input boxes, sliders, and text elements.

matplotlib numpy os pygame pyspice random sys

Last synced: 30 Sep 2025

https://github.com/mizcausevic-dev/embedding-drift-graph

Track how entity embeddings drift across encoder model versions. SQLite store + Strawberry GraphQL API. Cosine drift events computed automatically on every record. Reference impl for RAG/eval pipelines re-encoding their corpus.

ai-governance drift-detection embeddings graphql llm numpy python rag sqlite strawberry-graphql vector-search

Last synced: 01 Jun 2026

https://github.com/RedInfinityPro/Python-Task-Manager

Rating: (7/10) The Python Task Manager is a user-friendly interface built with the tkinter library for system monitoring and management. It displays detailed system process information using psutil, with interactive features.

matplotlib numpy psutil pygetwindow tkinter

Last synced: 30 Sep 2025

https://github.com/sralter/potential_talents

Using NLP techniques (word and sentence embedding tools like SBERT and Learning-to-Rank systems like RankNet and LambdaRank) to rank candidates.

lambdarank learning-to-rank lightgbm matplotlib nlp numpy pandas python pytorch ranknet

Last synced: 09 Apr 2026

https://github.com/karthik9273/revolutionizing-gold-rate-forecasting-for-small-businesses-with-machine-learning

The "Gold Price Prediction" project focuses on predicting the prices of gold using machine learning techniques. By leveraging popular Python libraries such as NumPy, Pandas, Scikit-learn (sklearn), Matplotlib, Seaborn, Random Forest Regressor, and others, this project provides a comprehensive solution for accurate price estimation.

data-science google-colab-notebook jupyter-notebook machine-learning matplotlib numpy pandas-dataframe python seaborn sklearn

Last synced: 06 May 2026

https://github.com/lyubov0406/data_analyst_portfolio

This repository collects pet projects that demonstrate my data analytics skills

data-analysis matplotlib numpy pandas portfolio python scipy seaborn sql tableau visualization

Last synced: 09 Apr 2026

https://github.com/mjul/scipy-lab

Scientific computation with Python

matplotlib numpy python scipy

Last synced: 04 May 2026

https://github.com/dulajkavinda/handwritten-recognition

🛢Handwritten Digit Recognition using OpenCV

added machinelearning numpy opencv python

Last synced: 09 Apr 2026

https://github.com/cosmoquester/numpy-neural-net

Neural network samples implemented with NumPy, without machine learning libraries.

implementation neural-network numpy

Last synced: 22 Jan 2026

https://github.com/thavinduushan/linear-regression

A linear regression model to predict insurance premium prices. The dataset includes around 1,000 observations. The project, aimed at learning, was implemented in Jupyter notebooks using NumPy, Matplotlib, Pandas, Keras, and TensorFlow.

matplotlib numpy pandas python tensorflow

Last synced: 09 Apr 2026

https://github.com/danish-jamil-01/image-dehazer

Remove the haze from the image using dark channel prior and guided filter

dark-channel-prior guided-filter guided-filtering ndimage numpy open-cv opencv opencv-python pillow python scipy skimage tkinter

Last synced: 09 Apr 2026

https://github.com/sunilvarma-l/liverdiseaseprediction

"Streamlit app to predict liver disease risk using a machine learning model based on patient input data."

machine-learning matplotlib numpy pandas pickle python scikit-learn seaborn streamlit

Last synced: 13 Apr 2026

https://github.com/christabelsakyi/employee_attrition_prediction

This project implements a machine learning pipeline for predicting employee attrition using HR data. The system analyzes various factors that contribute to employee turnover and provides predictions to help HR departments take preventive measures.

machine-learning numpy pandas plotly python sklearn streamlit

Last synced: 09 Apr 2026

https://github.com/abdulsamie10/pythonbasics

This repository contains a few tasks that I developed to get a strong grip on the Python programming language.

ai labtasks lambda numpy python pythonlab

Last synced: 20 Apr 2026

https://github.com/sharif-minhaz/rag-system

Ingest and vectorize content upon publication, store the embeddings, then retrieve and augment user queries with context to generate high-quality responses.

faiss flask mysql2 nodejs numpy rag react transformers

Last synced: 14 Apr 2026

https://github.com/navindafernando/lane-detection

Lane Detection with Image Feature Extraction Techniques 🚡

image-feature-extraction kernel-filters machine-learning neighborhood numpy opencv sobel-filter

Last synced: 13 Apr 2026

https://github.com/muhammadshavaiz/hand-sketch-recognition--inceptionv3

The Hand Drawn Sketch Classification project uses PyTorch to classify hand-drawn sketches. It evaluates the Inception_v3 model, which achieves the highest accuracy of 57%. The repository features scripts for dataset management, model training, and evaluation.

inception-v3 matplotlib numpy pandas python pytorch

Last synced: 14 Apr 2026

https://github.com/jim-by/scrape_analysis_books

Scraping book data from the website books.toscrape.com and performing analysis on the collected data.

beautifulsoup json matplotlib numpy pandas python scipy scraping seaborn

Last synced: 09 Apr 2026

https://github.com/nischalkshaj/image-identification

This is a repository for AI image training for beginners.

express mongodb node numpy pillow python3 pytorch reactjs

Last synced: 09 Apr 2026

https://github.com/sun-data/ndoptimize

Numba-accelerated minimization of objective functions.

astropy minimization numba numpy optimization scientific-computing

Last synced: 27 Jan 2026