An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/hifly81/photox

photo and album organizer for gtk3

gtk3 numpy opencv-python photography pil pillow python

Last synced: 24 Oct 2025

https://github.com/harmanveer-2546/reducing-data-entries

A way to delete data entries from a CSV or Excel file. For an Excel file, use excel instead of csv in the code.

csv data data-entry delete-data excel numpy pandas python

Last synced: 05 May 2026

https://github.com/atul-maurya-30/galaxy

Galaxy Classification is a machine learning project focused on classifying galaxies into two subclasses: 'STARFORMING' and 'STARBURST'. This project demonstrates data preprocessing, model training, and evaluation using advanced machine learning techniques and Python libraries.

flask machine-learning matplotlib numpy pandas python regression-classification seaborn sklearn

Last synced: 09 Mar 2026

https://github.com/apbetioli/neural-network

A simple neural network implementation

machine-learning neural-network numpy python3

Last synced: 29 Apr 2026

https://github.com/adriantomin/bulldozer-price-prediction

Predicting the Sale Price of Bulldozers Using Machine Learning 🚜💰 This project uses machine learning to predict bulldozer sale prices based on historical data from the Kaggle Bluebook for Bulldozers competition. The goal is to minimize the RMSLE between actual and predicted prices.

data-science jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn

Last synced: 23 Jan 2026

https://github.com/gusenov/max-empty-rect-py

:black_square_button: Python implementation of an algorithm for finding the largest-area empty rectangle in an image.

algorithm empty-spot graphics numpy python python-image-library python-library rectangle-detection

Last synced: 06 Feb 2026

https://github.com/sahraiidle/email-spam-detector

Email/SMS spam detector with a Flask UI/API, tuned ML models (TF‑IDF + SVM/LogReg/NB), and a ready-to-run web form plus JSON endpoint for predictions.

data machine-learning numpy pandas python randomforest scikit-learn spam-classifier spam-detection svm

Last synced: 24 Jan 2026

https://github.com/drsartoriuss/portfolio

Course projects from the Yandex Practicum "Data Science Specialist+" program

catboost computer-vision data-science deeplearning mashinelearning matplotlib nlp-machine-learning numpy pandas scipy seaborn sklearn sql torch tsf

Last synced: 28 Jan 2026

https://github.com/asherk7/house-price-prediction

House Prices - Advanced Regression Techniques - Predict sales prices and practice feature engineering, RFs, and gradient boosting

data-science numpy pandas regression scikit-learn

Last synced: 15 Apr 2026

https://github.com/itssahilwhat/ai-fundamentals

A curated collection of fundamental AI concepts, algorithms, and code implementations — including Machine Learning, Deep Learning, and Computer Vision — built from scratch and with practical examples.

computer-vision deep-learning machine-learning numpy pandas python pytorch scikit-learn

Last synced: 15 Apr 2026

https://github.com/stavrosarm/mathematics_algorithms

Fundamental algorithms of Linear Algebra, Numerical Analysis and Calculus in Python and Julia.

calculus julia-language linear-algebra numerical-analysis numpy python

Last synced: 30 Jan 2026

https://github.com/keyurparalkar/breast-cancer-detection

Predict whether the cancer is benign or malignant

gradient-descent logistic-regression machine-learning numpy

Last synced: 26 Apr 2026

https://github.com/hansalemaos/npzigloc

Zig for Numpy

numpy python zig

Last synced: 31 Jan 2026

https://github.com/arv-anshul/campusx-graphs

Contains all the graphs provided by CampusX to learn Statistics and Algebra, with some added features.

algebra campusx-dsmp data-science matrix numpy statistics streamlit vizualisation

Last synced: 15 Apr 2026

https://github.com/chandkund/sentiment-analysis-on-movie-reviews

Project on Sentiment Analysis of Movie Reviews using K-means clustering. This project clusters movie reviews based on their sentiment scores to understand audience reactions. Includes data preprocessing, feature selection, model development, and visualization.

data-science eda jupyter-notebook machine-learning matplotlib numpy pandas preprocessing python sklearn

Last synced: 01 Feb 2026

https://github.com/hasanyahya101/pareto-tutorial-python

This tutorial demonstrates how to create Pareto charts using Python libraries like pandas, NumPy, and Matplotlib. All code for this is given in the Jupyter Notebook.

chart charts code-blocks ipynb jupyter-notebook license markdown matplotlib mit numpy pandas pareto py python

Last synced: 07 Feb 2026

https://github.com/sarowarahmed/advertising-sales-app

📈 Advertising Sales Predictor: A web app powered by a Machine Learning model, built with Numpy, Pandas, Scikit-learn, and Streamlit, to forecast sales based on TV, Newspaper, and Online Advertising. Deployed on Streamlit Cloud for real-time, easy-to-use predictions.

advertising app machine-learning multiple-linear-regression numpy pandas sales scikit-learn streamlit

Last synced: 07 Feb 2026

https://github.com/sharkb8t/credit-risk-classification

Demonstrates my abilities to use Jupyter Notebook with scikit-learn to train and evaluate a machine learning model.

jupyter-notebook numpy pandas pathlib python scikit-learn

Last synced: 15 Apr 2026

https://github.com/chandkund/customer-segmentation

Customer segmentation divides customers into distinct groups based on characteristics and behaviors. This project uses K-Means clustering, an unsupervised machine learning algorithm, to segment customers and provide insights for targeted marketing strategies

kmeans-clustering matplotlib numpy pandas python seaborn

Last synced: 15 Apr 2026

https://github.com/rampal-punia/data-science-toolkit

Your Go-To Resource for Essential Data Science Related Commands, Concepts, Quick Overviews and Useful Functions.

artificial-intelligence data-science keras machine-learning matplotlib nlp nlp-machine-learning numpy pandas pythorch sql tensorflow

Last synced: 09 Feb 2026

https://github.com/murugavl/forage_tata_genai_powered_data_analytics

This repository showcases my work from the TATA GenAI Powered Data Analytics Virtual Experience on Forage, where I applied data cleaning, analysis, and visualization techniques to derive actionable business insights.

excel genai matplotlib numpy pandas powerbi python

Last synced: 09 Feb 2026

https://github.com/cherukuri-thanu/conjoint-analysis-netflix

This repository contains configuration files for performing conjoint analysis on Netflix survey data.

conjoint-analysis logistic-regression matplotlib-pyplot netflix numpy pandas squantify statsmodels

Last synced: 16 Apr 2026

https://github.com/mindkerchief/baselineml

A collection of machine learning tasks performed during my computer science studies, majoring in intelligent systems.

decision-tree dummy gaussian-mixture-models kmeans-clustering linear-regression logistic-regression machine-learning matplotlib numpy pandas random-forest scikit-learn seaborn tensorflow

Last synced: 16 Apr 2026

https://github.com/mustafaozvardar/image-classification-webapp

This project features an AI model for diagnosing pneumonia from chest X-ray images. Users can upload a chest X-ray image through the app, and the model will classify it.

artificial-intelligence computer-vision keras numpy python streamlit streamlit-webapp tensorflow

Last synced: 27 Feb 2026

https://github.com/hansalemaos/np_asciiart

Ascii art with numpy

ascii numpy opencv python

Last synced: 16 Apr 2026

https://github.com/shakhthi/mlproject

This project is an end-to-end ML solution that predicts math scores from student attributes. It starts with ingesting and preprocessing data, followed by feature engineering to pinpoint key predictors. A robust model is then trained and fine-tuned before being seamlessly deployed to deliver real-time predictions.

jupyter-notebook matplotlib-pyplot numpy pandas python3 seaborn sklearn

Last synced: 16 Apr 2026

https://github.com/rahulsm20/storedata

A data analysis project aimed at analyzing the sales data of a superstore and providing useful insights into customer preferences.

data-analysis matplotlib numpy pandas python streamlit

Last synced: 16 Apr 2026

https://github.com/separatrixxx/nm_labs_6_7_sem

👓 Numerical methods laboratory work for semesters 6 and 7 at MAI

numerical-methods numpy python

Last synced: 16 Apr 2026

https://github.com/vgvr0/analisis-de-datos-con-streamlit-numpy-pandas-y-matplotlib

A complete system for analyzing and visualizing film data that provides detailed insights about movies, including financial analysis, ratings, temporal trends, and a recommendation system. Built with Python and Streamlit, it offers an interactive, user-friendly interface for exploring movie data.

matplotlib numpy pandas plotly plotly-dash recommendation-system streamlit

Last synced: 16 Apr 2026

https://github.com/awaisfraaz/python-libraries

A repository dedicated to exploring and experimenting with Python’s main libraries, featuring sample code, documentation, and practical examples for learning and reference.

matplotlib numpy pandas python seaborn

Last synced: 15 Feb 2026

https://github.com/justin-marian/numpy-mlp

From-scratch MLP implementation using only NumPy; it supports regression and classification tasks.

classification lasso mlp numpy r2-score regression ridge

Last synced: 16 Feb 2026

https://github.com/jessicahora/studies-on-linear-algebra

Repository with studies on linear algebra.

linalg linear-algebra matplotlib-pyplot matrix numpy python scipy

Last synced: 01 Mar 2026

https://github.com/hafidaso/predicting-industrial-machine-downtime-level-3

This project aims to develop a predictive model using machine learning techniques to forecast machine failures based on historical operational data.

imbalanced-learning numpy pandas python scikit-learn seaborn xgboost

Last synced: 16 Apr 2026

https://github.com/lorenzorottigni/ml-advertising

Machine Learning python bootcamp: logistic regression on advertising dataset

ipynb logistic-regression machine-learning numpy pandas python scikit-learn seaborn

Last synced: 16 Apr 2026

https://github.com/aadityasikder/neural-network-from-scratch-

This notebook demonstrates a neural network implementation using NumPy, without TensorFlow or PyTorch. Trained on the MNIST dataset, it features an architecture with an input layer (784 neurons), two hidden layers (132 and 40 neurons), and an output layer (10 neurons) with sigmoid activation.

backward-propagation evaluation forward-propagation mnist-dataset neural-network neural-networks-from-scratch numpy training

Last synced: 16 Apr 2026

https://github.com/ejw-data/tableau-songs

Brief analysis of songs from 1890 - 2015 using pandas and Tableau

numpy pandas python songs tableau

Last synced: 16 Apr 2026

https://github.com/thekartikeyamishra/customer-retention-predictor

The Customer Retention Predictor is a Python-based tool designed to help businesses predict customer churn using historical data. This project is particularly beneficial for small businesses and MSMEs in India, allowing them to identify customers at risk of leaving and take proactive measures to retain them.

joblib machine-learning numpy pandas python scikit-learn tinker

Last synced: 16 Apr 2026

https://github.com/iankitnegi/python_projects

Data Analyst Toolkit: A comprehensive collection of Python scripts and notebooks designed for data analysis tasks. Features data cleaning, visualization, statistical analysis, and machine learning models. Ideal for analysts seeking efficient, reproducible workflows.

matplotlib numpy pandas python seaborn

Last synced: 20 Apr 2026

https://github.com/shreeparab1890/indian-cricketer-classifier

This notebook builds a model that predicts an Indian cricketer from a given image. The project covers 8 Indian cricketers and builds a model to classify a given image among these 8 cricketers.

image-classification matplotlib numpy opencv pandas python random-forest-classifier scikit-learn sklearn streamlit

Last synced: 01 Apr 2026

https://github.com/leftcoastnerdgirl/supervised_learning

This project demonstrates supervised machine learning using scikit-learn.

classification-reports confusion-matrix jupyter-notebook numpy pandas-python pathlib scikit-learn sklearn

Last synced: 16 Apr 2026

https://github.com/dr-rompecabezas/page-view-time-series-visualizer

A data science project built as part of the freeCodeCamp curriculum.

matplotlib numpy pandas python seaborn

Last synced: 16 Apr 2026

https://github.com/kunalkumar2001/data-analyst-python-and-its-library

Python (basics to advanced) and its libraries for data analysts

dataanalyst matplotlib numpy pandas python seaborn

Last synced: 17 Apr 2026

https://github.com/gautam-cs/data_science

In this project, I tried to extract the most relevant information from the data using libraries like numpy and pandas. I worked with data such as Aadhar data and Olympics data.

aadhar-data csv data-science json numpy olympics-data pandas python35

Last synced: 17 Apr 2026

https://github.com/vaishnavis03/finlatics_ml_program

This repository contains the .ipynb files for 3 datasets, along with a PPT for each. The datasets included are Facebook Marketplace Data, Sales Prediction Data, and Wine Quality data.

correlation data-analysis data-science data-visualization knn linear-regression machine-learning matplotlib numpy pandas random-forest-classifier scikit-learn

Last synced: 17 Apr 2026

https://github.com/kodemartin/np-pd-portfolio

Portfolio of numpy, pandas, and matplotlib solutions

matplotlib numpy pandas

Last synced: 17 Apr 2026

https://github.com/nathadriele/ifood-data-governance-pipeline

This project demonstrates a complete Data Governance solution focused on quality, traceability, security, and LGPD compliance. It uses modern technologies such as Streamlit, Airflow, dbt, and Pydantic to implement a functional, interactive ecosystem with a data governance dashboard.

airflow dashboard data-analysis data-catalog data-engineering data-governance data-quality data-visualization dbt ifood lgpd matplotlib numpy observability-data pandas pipeline pyspark redis seaborn streamlit

Last synced: 02 Apr 2026

https://github.com/loong64/pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

autograd deep-learning gpu loong64 loongarch64 machine-learning neural-network numpy python tensor

Last synced: 02 Apr 2026

https://github.com/mnagaharshithrao/diamondpriceprediction

Diamond Price Prediction is a machine learning project that predicts the price of diamonds based on features like carat, cut, color, clarity, and more. Using models such as XGBoost, Random Forest, and Gradient Boosting, the project aims to accurately forecast diamond prices, helping buyers and sellers in the diamond industry make informed decisions

machine-learning numpy pandas sckiit-learn xgboost

Last synced: 17 Apr 2026

https://github.com/prashver/end-to-end-model-deployment-on-aws

Student Performance Analysis with Machine Learning analyzes factors impacting student outcomes using a robust machine learning pipeline. Achieving an impressive R2 score, it predicts student performance effectively. With extensive data preprocessing and deployment on AWS Elastic Beanstalk, it ensures scalability and high availability.

amazon-web-services aws-elastic-beanstalk end-to-end-deployment flask machine-learning-algorithms matplotlib numpy pandas scikit-learn seaborn

Last synced: 02 Apr 2026

https://github.com/cosmicsid004/air_draw

Digital drawing....

handtracking mediapipe numpy opencv

Last synced: 30 Apr 2026

https://github.com/tharushi11/titanic-data-analysis

This repository contains a simple prediction model and survival predictions based on the Titanic dataset from Kaggle, along with detailed visualizations using Power BI.

dax machine-learning numpy pandas powerbi sklearn titanic-survival-prediction visualization

Last synced: 17 Apr 2026

https://github.com/ffatahillah7/eda-dsf-dibimbing-titanic-accident

Data Science Fair 3.0 Dibimbing portfolio - analytics and learning from the Titanic dataset

data numpy pandas python science seaborn

Last synced: 17 Apr 2026

https://github.com/hansalemaos/npfastsortcpp

Sorts int/float 4x faster than numpy by using parallel_sort from C++ (Windows only)

cpp fast numpy python sort

Last synced: 17 Apr 2026

https://github.com/quinnphil/math-for-programmers

Work-along code for the Math for Programmers book by Paul Orland

geometry mathematics matplotlib numpy python

Last synced: 17 Apr 2026

https://github.com/shhiivvaam/regression

Linear Regression Model

linear-regression numpy python

Last synced: 10 May 2026

https://github.com/neetigyab/pdf_form_parser

Older Test version of designated PDFReader project

excel forms matplotlib numpy ocr opencv pandas pdf pdf-converter pdfplumber pytesseract python

Last synced: 17 Apr 2026

https://github.com/gregoritsch3/project_python_jobs_data_analysis

A multi-library Data Analysis Python project that draws useful insights from existing job data. It provides useful exploratory insight about job postings in the Data sector, such as the distribution of remote/non-remote jobs, jobs with/without healthcare, top paying and most demanded data roles, and more.

matplotlib numpy pandas python scipy seaborn

Last synced: 04 Apr 2026

https://github.com/jinhx128/data_analysis

A data analysis project built on Matplotlib, Numpy, and Pandas.

matpolotlib numpy pandas python

Last synced: 18 Apr 2026

https://github.com/yuvrajsaraogi/sales-prediction-using-python

Sales prediction involves estimating future product sales based on factors like advertising spend, target audience, and platform. Businesses rely on data scientists to forecast sales and optimize advertising costs. Machine learning in Python can be used for this task.

data data-analysis data-science data-visualization machine-learning matplotlib natural-language-processing numpy pandas prediction python sales-prediction-using-python sql

Last synced: 19 Apr 2026

https://github.com/compcode1/nfl-analysis

The objective of this analysis was to explore potential correlations between player attributes (such as average age and weight across different positions) and key team performance metrics from the 2023 NFL season.

datacleaning datanormalization numpy pandas pearson-correlation-coefficient pvalues

Last synced: 18 Apr 2026

https://github.com/jass024/bitcoin-historical-analysis

Performed comprehensive time-series analysis on 14+ years of Bitcoin historical data using Python (Pandas, NumPy, Plotly). Calculated key technical indicators (SMA, EMA, RSI, volatility) to identify market trends, potential anomalies, and areas of regulatory concern. Visualized results using interactive Plotly charts.

numpy pandas python-3

Last synced: 18 Apr 2026

https://github.com/awanraskall/retail-demand-analysis

Data analysis of retail meal orders, fulfillment centers, and product demand using Python

data-analysis data-visualization jupyter-notebook numpy pandas python

Last synced: 18 Apr 2026

https://github.com/bvsam/nnfs

Basic neural network built using Python and Numpy. Created to better understand neural networks.

artificial-intelligence deep-learning machine-learning neural-network neural-networks numpy python3

Last synced: 18 Apr 2026

https://github.com/gregoritsch3/ml_clustering_eda_customersegmentation

An EDA and Machine Learning Clustering exercise on the Mall Customer Segmentation synthetic dataset demonstrating the use of KMeans Clustering and the Elbow Method. The clustering algorithm successfully segments the customer base into groups distinguishable by their annual income and spending score.

clustering kmeans-clustering machine-learning matplotlib numpy pandas scikit-learn scipy seaborn

Last synced: 04 Apr 2026

https://github.com/mnitin-reddy/a-b-testing-and-regression-analysis-for-ad-performance-optimization

Analyzed the performance of Facebook and AdWords ads using A/B testing and regression analysis to identify trends, correlations, and cost-effectiveness. Key insights included distribution of clicks and conversions, monthly trends, and cost-per-conversion analysis to optimize ROI.

abtesting data-science hypothesis-testing machine-learning matplotlib numpy pandas scikit-learn scipy seaborn statsmodels

Last synced: 04 Apr 2026

https://github.com/sultanazhari/decisive-model-with-high-profit-potential-and-low-risk

The OilyGiant mining company is looking for the best locations for 200 new wells. As data scientists, we build a model that selects the best 200 points by profit and risk.

accuracy-score chi2-contingency confusion-matrix logistic-regression matplotlib-pyplot numpy pandas python3 random-forest roc-auc-score seaborn train-test-using-sklearn

Last synced: 18 Apr 2026

https://github.com/murugavl/flower-prediction

Flower Prediction is a machine learning project that uses the Iris dataset to classify iris flowers into three species: Setosa, Versicolor, and Virginica. The project includes data analysis, model training with various algorithms, and deployment via a Flask web application for user-friendly predictions.

flask machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/shdrn2402/incomesresearch

The project aims to study the influence of various characteristics on a person’s income. In this project, I practiced data preprocessing and visualization, and also touched on the basics of machine learning.

jupyter-notebook matplotlib numpy pandas python seaborn sklearn visualization

Last synced: 05 Apr 2026

https://github.com/lexxai/goit_python_ds_hw_04

Module 4. Classification and model performance evaluation. Linear regression: overfitting and regularization

lasso-regression linear-regression numpy pandas python red regression ridge-regression scikit-learn

Last synced: 05 Apr 2026

https://github.com/ahmaruff/kmeans-img-compress

Simple python image compression library using K-Means clustering

color-quantization image-compression image-processing kmeans kmeans-clustering numpy python

Last synced: 18 Apr 2026

https://github.com/matiassenia/galaxypy

The application loads a grayscale image and uses the points of lowest luminosity (the darkest) to draw colored lines that create a galaxy-like structure.

numpy python

Last synced: 18 Apr 2026

https://github.com/linggarm/lvq-and-elm-with-iris-dataset

Implementation of Learning Vector Quantization (LVQ) and Extreme Learning Machine (ELM) with the Iris dataset

artificial-intelligence deep-learning elm extreme-learning-machine learning-vector-quantization lvq machine-learning numpy python soft-computing

Last synced: 18 Apr 2026