An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/parthshah02/customer_churn_dashboard

This repository features a comprehensive project showcasing data analysis and interactive dashboard using Python

data-analysis matplotlib numpy pandas python

Last synced: 13 Apr 2026

https://github.com/lingesh81051/similar-template-document-matching-and-fraud-detection

An automated system for a health insurance company to streamline document processing, including template matching and fraud detection, resulting in reduction of processing time.

numpy opencv opencv-python pillow pymupdf pytesseract pytesseract-ocr python tkinter

Last synced: 13 Apr 2026

https://github.com/sonatipek/matris_transactions_py

My "Python Programlama-I" lesson's homework. Some matris transactions with numpy library in Python

matrix numpy numpy-arrays python sonatipek

Last synced: 24 Mar 2025

https://github.com/ola-kaznowska/numpy-matrix-arrays

Practice working with the NumPy library and multiple arrays. Copy() and view() functions

numpy python python3

Last synced: 18 Apr 2026

https://github.com/suap-dev/python-essentials

a repository intended for people who want to know some python basics

lambda-functions numba numpy oop pandas teaching

Last synced: 17 Mar 2025

https://github.com/ewakamoto/03.-python-projects

This is my collection of Python Projects

jupyter jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/22p31a0512/sentimental-analysis

Build a model to classify text as positive, negative, or neutral. Apply NLP techniques for preprocessing and machine learning for classification. Aim for accurate sentiment prediction on various text formats.

feature-engineering machine-learning natural-language-processing numpy pandas python supervised-learning text-classification text-processing tokenization

Last synced: 12 Apr 2026

https://github.com/skorupcia/winequality-white

winequality-white.csv visualization

matplotlib numpy pandas seaborn sklearn

Last synced: 06 May 2026

https://github.com/patex1987/ghcn-temperature-dataset

Analysis of GHCN (Global Historical Climatology Network) temperature dataset using Numpy

climate-data jupyter-notebook numpy python temperature-data

Last synced: 10 May 2026

https://github.com/mat-kubiak/manual-ml

DIY ML framework (using only NumPy!) + various demos

artificial-intelligence examples machine-learning ml ml-framework neural-networks numpy

Last synced: 06 May 2026

https://github.com/jsnevt/regressao_logistica

Cálculos de probabilidades sobre uma eleição

matplotlib numpy pandas-dataframe python

Last synced: 11 May 2026

https://github.com/aleksandarbuk/machine-learning

The Machine Learning Library repository provides a collection of scripts and tools leveraging Scikit-Learn, Pandas, and NumPy for various machine learning tasks and data analysis.

matplotlib numpy python scikit-learn tensorflow

Last synced: 16 Apr 2026

https://github.com/anikdascodes/fantasy-cricket-project

🏏 Python simulation for generating 20K+ unique fantasy cricket teams with probability-based player selection and constraint optimization. Advanced algorithms, data analysis & visualizations included.

algorithm constraint-satisfaction cricket data-science fantasy-sports jupyter machine-learning numpy optimization pandas plotly probability python simulation sports-analytics

Last synced: 28 Apr 2026

https://github.com/erick957/saleprice-prediction-dataset-analysis-and-cleaning-advance-regression

🏠 Predict house prices using advanced regression techniques with this comprehensive analysis and cleaning project, from data loading to model deployment.

data-analysis data-science eda google-colab machine-learning numpy pandas python scikit-learn scikit-learn-python

Last synced: 06 May 2026

https://github.com/bhushan148/ipl-stats-analysis

This project explores IPL data to show team and player performances. 📈 Key highlights include analyzing match results, highest scores, and performance at different venues. 🏟️ It uses charts to reveal trends like win margins and average scores by venue. 📉 Discover important insights into what affects match outcomes and team success.

jupyter-notebook markdown matplotlib-pyplot numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/andrewsy1004/logistic-regression-spam-classifier

This project implements a spam email classifier using Logistic Regression.

numpy pandas scikit-learn

Last synced: 06 May 2026

https://github.com/fimbres/matplotlib-start

A crash project with the most important matplotlib functions and methods, used in data science, AI and machine learning.

data-science matplotlib matplotlib-pyplot numpy pandas python python-script

Last synced: 13 Apr 2026

https://github.com/williyam-m/sentiment_analysis

A Django-based Sentiment Analysis web app leveraging the power of the BERT transformer model to classify text sentiment accurately.

beautifulsoup bert django numpy pandas python torch transformers

Last synced: 20 Jun 2026

https://github.com/rizkyrajitha/ann_101

A really simple neural network for binary classification written from scratch using python

jupyter-notebook numpy pandas

Last synced: 08 May 2026

https://github.com/lijesh010/employeeanalysis-

This is a simple data analysis project developed as part of my Entri Elevate Data Science and Machine Learning course. It serves as a practical demonstration of my skills in data analysis.

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/disney35/stock-prices-dashboard

A dashboard to analyze, predict, and visualize stock prices using Python & LSTM

ema jupyter-notebook keras macd matplotlib-pyplot mfi numpy pandas python rsi scikit-learn sma streamlit tenserflow yfinance

Last synced: 12 Apr 2026

https://github.com/elvisrock/government_analysis

Esse projeto, tem como objetivo, analisar, explorar algumas técnicas e propor uma sugestão ao governo de forma geral, sem qualquer viés politico.

data-science machine-learning numpy pandas python sklearn

Last synced: 05 Apr 2026

https://github.com/filip-kustura/python-covid-19-behaviors-analysis

Using Jupyter Notebook, this university project analyzes attitudes and behaviors related to the COVID-19 pandemic using a two-year survey from Imperial College London and YouGov research company. Utilizing Pandas, NumPy and Matplotlib, the data analysis focuses on three countries, exploring trends and insights throughout the pandemic.

covid-19 data-analysis data-visualization jupyter-notebook matplotlib numpy pandas python university-project

Last synced: 12 Apr 2026

https://github.com/armahdavi/mlops

Productionizing ML Models using a variety of tools including FastAPI, Flask, Doocker, AWS, GCP, TensorFlow Extended (TFX), and TF.js.

apache-beam docker fastapi logistic-regression mlops mlops-workflow numpy random-forest sklearn tensorflow-extended unicorn xgboost

Last synced: 12 Apr 2026

https://github.com/jpcano/boston_housing

Predicting Boston Housing Prices using supervised Machine Learning algorithms

cross-validation machine-learning numpy pandas python regression-models scikit-learn

Last synced: 12 Apr 2026

https://github.com/akashash01/support_vector_machine

It's an linear model for classification and regression problems. It can solve non-linear problems by creating a hyperplane which separates the data into classes using KERNAL .

data-science datavisualization machine-learning numpy pandas python sklearn svm-classifier

Last synced: 13 Apr 2026

https://github.com/mnitin-reddy/summer-olympics-data-analysis-web-app

An interactive web app for exploring trends in Olympic Games history, analyzing overall medal tallies, country-wise performance, and athlete demographics. Built with Python and Streamlit, this app offers insights through visualizations and data-driven statistics.

dataanalytics matplotlib numpy pandas python seaborn streamlit

Last synced: 12 Apr 2026

https://github.com/adityarohatgi11/regression_analysis_london_crime

This project examines how police presence impacts crime rates in London using regression analysis, panel data, and a natural experiment from the 2005 London attacks.

data-wrangling numpy pandas regression-analysis statsmodels

Last synced: 11 May 2026

https://github.com/mehediehteshum/minesweeper

Building classic 'Minesweeper' game from scratch using Pygame with VS Code.

numpy pygame python python3

Last synced: 06 Oct 2025

https://github.com/farhad-here/height-distribution-analysis

Statistical comparison of height distributions in two groups using mean, standard deviation, and boxplots.

coefficient-of-variation data-analysis interquartile-ranges matplotlib mean numpy python scipy standard-deviation variance

Last synced: 13 Apr 2026

https://github.com/ledsouza/python-powerbi

Desenvolver uma dashboard para análise do mercado financeiro utilizando Python e Power BI

etl matplotlib numpy pandas powerbi python seaborn

Last synced: 12 Apr 2026

https://github.com/minhtungonep/android-traffic-analysis

Android malware detection project analyzing network traffic patterns in a telecommunications context. Uses statistical hypothesis testing and data visualization to evaluate traffic features like DNS query times, TCP packets, and volume bytes for distinguishing between benign and malicious Android applications.

android-library bachelor-project bachelor-thesis cybersecurity cyface data-visualization hypothesis-testing malware-detection matplotlib network-traffic numpy pandas python scipy sdk statistical-analysis telecommunications voip-security

Last synced: 09 May 2026

https://github.com/swatisinghit/treadmill-customer-profiling-for-aerofit

Create comprehensive customer profiles for each AeroFit treadmill product through descriptive analytics. Develop two-way contingency tables and analyze conditional and marginal probabilities to discern customer characteristics, facilitating improved product recommendations and informed business decisions.

analytics conditional-probability data-analysis data-science data-visualization eda numpy pandas probability statistics

Last synced: 08 May 2026

https://github.com/kunalpisolkar24/gradientdescent

A Python implementation of gradient descent for linear regression, visualized with cost function history, parameter trajectory, and a 3D cost surface.

cost-function gradient-descent numpy visualization

Last synced: 06 May 2026

https://github.com/gatuno1/details_meson_numpy_error

Documents for exploration of meson error on numpy installation over PyPy on Windows

meson numpy pypy unicode windows

Last synced: 11 May 2026

https://github.com/ejw-data/python-bitwise-operators

Difference between bitwise and logical operators when using pandas

numpy pandas

Last synced: 08 May 2026

https://github.com/mostafa-ghorab/global-happiness-analysis

An analysis of global happiness rankings based on various factors like GDP, family support, health, and freedom from the World Happiness Report (2015-2017). This project provides data visualizations and statistical insights into how these factors influence happiness scores in different regions.

business-analysis data-analysis data-visualization matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/apal21/tensorflow-linear-regression-fit-data

Fit a line passing through data with some random noise using Tensorflow linear regression with Gradient Descent Optimizer.

gradient-descent linear-regression numpy python3 tensorflow

Last synced: 12 Apr 2026

https://github.com/christabelsakyi/tsec

This project implements a time series forecasting solution for predicting electricity consumption based on historical data. It provides multiple model options, a Flask API for deployment, and visualization tools.

facebook numpy pandas python sklearn time-series-analysis

Last synced: 07 Oct 2025

https://github.com/18mahi/tweet-sentiment-analysis

Classify tweets into happy, sad, angry, excited, and neutral with this interactive Python model. Combining TF-IDF text features with engineered numeric features like emoji sentiment, polarity, subjectivity, and punctuation counts, it demonstrates intermediate-level NLP, feature engineering, and visualization skills.

data-science data-visualization jupyter-notebook machine-learning matplotlib nlp numpy pandas sklearn string textblob

Last synced: 30 Apr 2026

https://github.com/christabelsakyi/product_purchase_prediction

As a newly hired AI Engineer, your task is to predict customer behavior based on various features such as age, income, and gender. This exercise involves cleaning the data, training a decision tree model, and evaluating the model's performance to understand the key factors influencing customer purchasing decisions.

machine-learning matplotlib-pyplot numpy python seaborn sklearn

Last synced: 08 May 2026

https://github.com/umer-farooq-cs/california-housing-regression

Linear Regression experiments on the California Housing dataset across five phases, using NumPy and scikit-learn only (no pandas). Includes EDA, polynomial features, SGD with scaling, residuals, 5-fold CV, and an LNCS-style report with figures.

california-housing cross-validation deep-learning-for-perception linear-regression numpy polynomial-features reproducible-research residual-analysis sgd sklearn

Last synced: 09 May 2026

https://github.com/raveenaross/bulldozer-price-prediction

Predicting the sale price of bulldozers using machine learning

matplotlib numpy pandas sklearn time

Last synced: 07 Oct 2025

https://github.com/zolppy/recommendation-system

This project demonstrates the creation of a content-based image recommendation system. It leverages a pre-trained VGG16 deep learning model to extract meaningful feature vectors from images. These features are then compared using cosine similarity to identify and recommend visually similar images.

computer-vision deep-learning keras machine-learning numpy recommendation-system sklearn tensorflow vgg16

Last synced: 12 Apr 2026

https://github.com/prarthana-singh/bangalore-house-price-predictor

🏡 Bangalore House Price Prediction – A Machine Learning model to predict house prices in Bangalore using real estate data. Built with Linear Regression, Python, Pandas, NumPy, and Scikit-Learn.

data-analysis eda house-price-prediction linear-regression machine-learning numpy pandas python real-estate regression scikit-learn

Last synced: 19 Apr 2026

https://github.com/sumitkundu102022/ecobuddy

EcoBuddy: Personalized Carbon Footprint Tracker

bcrypt numpy pandas plotly pymongo python-dotenv requests streamlit

Last synced: 12 Apr 2026

https://github.com/rahulsm20/insurance-data

A data analytics project dealing with risk assessment and it's effects in health insurance.

data-analysis data-analytics machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/pedasoft-consult/employee-attrition-prediction

This project implements a machine learning pipeline for predicting employee attrition using HR data. The system analyzes various factors that contribute to employee turnover and provides predictions to help HR departments take preventive measures.

joblib numpy pandas python3 sklearn streamlit

Last synced: 10 Apr 2026

https://github.com/techcsispit/algorithmic-trading

This project focuses on algorithmic training using Python in Jupyter Notebooks. It covers various data structures and algorithms, providing a hands-on approach to solving algorithmic challenges. The notebooks are designed to be interactive and educational for learners of all levels.

matplotlib numpy pandas python sklearn

Last synced: 04 Sep 2025

https://github.com/prashhhant213/customer-behavior-analysis-for-walmart-black-friday-sales

This project is a Walmart case study analyzing customer purchase behavior by gender and demographics to inform business decisions, especially around spending habits during events like Black Friday.

matplotlib numpy pandas python scipy seaborn

Last synced: 06 Jan 2026

https://github.com/rohitinu6/image_cartoonify

To Cartoonify an Image using opencv-python

image-processing matplotlib numpy opencv opencv-python python

Last synced: 12 Apr 2026

https://github.com/urvee1810/market_basket_analysis

A data mining project analyzing Instacart's 3 million grocery orders to uncover customer shopping patterns and product associations. Using market basket analysis and the Apriori algorithm, the project reveals key insights about shopping behavior, product combinations, and temporal patterns, providing valuable recommendations for retail strategy

apriori-algorithm data-mining data-visualization machine-learning market-basket-analysis matplotlib mlxtend numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/NetBr3ak/Planetary-System-Simulation

Planetary System Simulation enables users to simulate the trajectories of two planets in a gravitational system. Through a simple web interface, users can predict planetary movements using technologies such as Flask, NumPy, and SciPy.

astronomy flask gravitational-dynamics numerical-analysis numpy physics planetary-systems scipy simulation web-application

Last synced: 08 Oct 2025

https://github.com/sayan520/titanic-data-insights

Conducting data analysis on Kaggle's Titanic: Machine Learning from Disaster dataset using essential data wrangling, exploratory data analysis (EDA), and visualization techniques to uncover insights, identify patterns, and explore factors influencing passenger survival.

jupyter-notebook kaggle matplotlib numpy pandas python scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/akashash01/random_forest

It enable learning method for classification, regression and other task. Operates by building multiple decision tree ay training time. It is much more robust than Decision Tree algorithm.

data-science machine-learning numpy pandas python random-forest-classifier sklearn supervised-learning

Last synced: 01 Apr 2025

https://github.com/lluispp/hunting_for_exoplanets_with_ai

An AI-powered tool that classifies exoplanets using NASA datasets and provides an interactive web interface for real-time exploration and discovery.

ai challenge exoplanets nasa numpy python spaceappchallenge-2025 spaceapps swai web

Last synced: 18 May 2026

https://github.com/byteshiva/getmaplocation

Automate Dummy data from the open data source. Parse International Address, Geopy to automate map locations using lat and lon, Cache geopy map location calls using sqlite3, Use Pandas to parse and convert CSV to JSON

csv-parser geopy json numpy pandas pandas-dataframe parser pickle python3 sqlite3

Last synced: 18 Apr 2026

https://github.com/kareem74x/data-mining-uni-project

A project focused on applying data mining techniques to extract insights from datasets

data-mining matplotlib numpy pandas python

Last synced: 25 Feb 2026

https://github.com/darkusphantom/newton-method

7 ejercicios practicos del método de Newton en la programación computacional

computational-maths numpy python

Last synced: 25 Mar 2025

https://github.com/khushneetsingh/datasanity

DataSanity is a AI-powered web application for dataset cleaning, synthetic data generation, vectorization, and data enrichment using natural language prompts.

cerebrus exa faiss-vector-database llm nextjs numpy pandas serperdev sqlite tailwindcss

Last synced: 08 Apr 2026

https://github.com/c-mohammed-zaid/python

A diverse collection of Python projects showcasing skills in game development, data manipulation, and utility tools, including classic games, encryption, and data analysis.

jupyter-notebook matplotlib numpy pandas python scipy turtle

Last synced: 06 Jan 2026

https://github.com/alex-s2222/mem_chat_bot

телеграмм бот для создания gif из текста введённого пользователем

cv2 google-api moviepy numpy python python-telegram-bot requests

Last synced: 08 Oct 2025

https://github.com/harshavardhanbommalata/knn_tutorial

K-Nearest Neighbors (KNN) is a simple yet powerful machine learning algorithm. Unlike models that learn parameters during training, KNN uses lazy learning—it stores the dataset and predicts by finding the closest neighbors, making decisions through majority voting.

machine-learning-algorithms mathematical-modelling matplotlib numpy pandas

Last synced: 20 Jun 2026

https://github.com/chandkund/stock-price-prediction-

Predict Tesla stock prices using historical data. The project preprocesses data, applies normalization and standardization, trains a linear regression model, and evaluates its performance. Visualizations include box plots and time series plots.

data-science linear-regression numpy pandas plotly python sklearn sklearn-model

Last synced: 21 Jan 2026

https://github.com/chopikus/raytracing-bench

Benchmarking ray tracers in Python, Java, Numpy, Cupy, Pypy

cupy java numpy pypy python raytracing

Last synced: 06 May 2026

https://github.com/davityak03/object-detection-in-images-using-yolo

This project demonstrates object detection using the YOLO (You Only Look Once) model, providing a solution for detecting and labeling multiple objects within images or video streams. This implementation covers image preprocessing, model inference, and visualization of detected objects.

computer-vision matplotlib-pyplot numpy object-detection opencv-python pandas python yolo

Last synced: 18 Jan 2026

https://github.com/harshavardh8n/machine-learning-projects

Machine Learning Projects: A collection of machine learning projects developed as part of the curriculum, utilizing libraries such as NumPy, Pandas, Seaborn, and Matplotlib. These projects demonstrate various algorithms, including Apriori and Decision Trees, showcasing practical applications and data analysis techniques.

apriori-algorithm decision-trees jupyter-notebook machine-learning numpy pandas python

Last synced: 18 Jan 2026

https://github.com/mkdirer/disaster-tweet-nlp-adaboost-transformers

A machine learning project that classifies tweets as disaster-related or non-disaster-related using transformers and the AdaBoost technique. Achieved an accuracy of approximately 80% on the validation set

geopy keras matplotlib numpy pandas python sklearn tensorflow

Last synced: 12 Apr 2026

https://github.com/khushi130404/catnet

CatNet is a simple machine learning project that classifies images as either a cat or not a cat using logistic regression. The dataset consists of labeled images of cats and non-cats, preprocessed and used to train a binary classification model.

h5py matplotlib numpy pillow scipy

Last synced: 12 Apr 2026

https://github.com/rachakondaganesh/superstore-sales-data-analysis-project

Analyzed retail sales data to uncover key business insights using Python and Power BI. Explored patterns in profit, sales, and customer segments across regions. Built interactive dashboards to visualize trends, identify top-performing categories, and highlight areas for improvement in shipping and discount strategies.

dashboard exploratory-data-analysis matplotlib-pyplot numpy pandas powerbi seaborn

Last synced: 02 May 2026

https://github.com/shreyasmehta05/sortsync

A custom sorting algorithm combining parallel merge and count sort, with detailed performance comparisons against standard sorting techniques.

c matplotlib numpy python3

Last synced: 04 Feb 2026

https://github.com/amish5ingh/cricket-data-analytics-ipl

Data analysis and visualization of IPL 2022 matches using Python, Pandas, Matplotlib, and Seaborn. Includes insights on match outcomes, player performances, toss trends, and venue stats with 12+ charts.

data-analysis data-visualization ipl-data-analysis ipl-data-visualization jupiter-notebook matplotlib-pyplot numpy pandas python seaborn

Last synced: 09 May 2026

https://github.com/samuelpuerta/modelosiasaberprocolombia

PROYECTO KAGGLE - Crear un modelo para predecir el redimiento de los estudiantes en las pruebas Saber Pro

ai ia jupyter-notebook machine-learning numpy pandas prediction-model predictive-modeling python

Last synced: 06 May 2026