An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/hariprasath-v/hackerearth_transunion-data-science-analytics-hiring-challenge_2022

Machine learning model to classify the credit score based on people bank details and credit related information.

catboost exploratory-data-analysis klib machine-learning matplotlib numpy optuna pandas python seaborn sklearn

Last synced: 10 Apr 2026

https://github.com/chandkund/customer-segmentation-using-k-means-clustering

Implemented K-Means Clustering to segment customers based on purchasing behavior, enabling targeted marketing strategies. Analyzed data, optimized clusters using the Elbow Method, and derived insights to enhance customer engagement and retention.

data-science kmeans-clustering machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 10 Apr 2026

https://github.com/sharkb8t/credit-risk-classification

Demonstrates my abilities to use Jupyter Notebook with scikit-learn to train and evaluate a machine learning model.

jupyter-notebook numpy pandas pathlib python scikit-learn

Last synced: 15 Apr 2026

https://github.com/pedasoft-consult/sentiment_analysis

This project involves analyzing customer reviews to classify them as positive or negative using Logistic Regression. The workflow includes text preprocessing, feature extraction, training a model, making predictions, and evaluating its performance.

nltk numpy pandas sklearn

Last synced: 30 Apr 2026

https://github.com/chandkund/customer-segmentation

Customer segmentation divides customers into distinct groups based on characteristics and behaviors. This project uses K-Means clustering, an unsupervised machine learning algorithm, to segment customers and provide insights for targeted marketing strategies

kmeans-clustering matplotlib numpy pandas python seaborn

Last synced: 15 Apr 2026

https://github.com/sayginsaman/hand-gesture-desktop-controller

A comprehensive Python application that transforms your webcam into a powerful desktop control interface using computer vision and hand gesture recognition. Control media playback, volume, scrolling, and navigation with simple hand gestures - no mouse or keyboard required!

chmod gesture gesture-control gesture-detection gesture-recognition gestures media mediapipe numpy opencv pip pyautogui python

Last synced: 30 Apr 2026

https://github.com/darinochka/course_python_iad

ДЗ\Семинары майнора "Интеллектуальный анализ данных"

machine-learning numpy pandas python3

Last synced: 30 Apr 2026

https://github.com/alimiheb/morphotype-web-app

A modern web application that analyzes body morphotypes (Ectomorph, Mesomorph, Endomorph) using MediaPipe AI and provides personalized fitness and nutrition plans.

body-tracking flask mediapipe numpy opencv pandas pillow python3

Last synced: 10 Apr 2026

https://github.com/coueghlani/nlp

Proyecto de Procesamiento de Lenguaje Natural y Análisis de Datos

mineria-de-datos nlp nlp-machine-learning nltk numpy procesadores-de-lenguajes sklearn spacy

Last synced: 08 Feb 2026

https://github.com/bursasha/pandas-numpy-matplotlib-cavies-analysis

Complete statistical analysis of cavy lifetime dataset using Python, Pandas, NumPy, Matplotlib, and SciPy to explore, visualize, and infer the impact of bacilli infection on cavy lifetimes 🦫

cavy dataset-analysis distribution-fitting hypothesis-testing jupyter-notebook matplotlib numpy pandas python-analysis scipy statistical-analysis statistical-methods

Last synced: 09 Feb 2026

https://github.com/ryancswallace/npdb

Parallel NumPy-like interface for large n-dimensional arrays on disk.

mapreduce-designpatterns numpy numpy-arrays parallel-computing python

Last synced: 09 Feb 2026

https://github.com/rampal-punia/data-science-toolkit

Your Go-To Resource for Essential Data Science Related Commands, Concepts, Quick Overviews and Useful Functions.

artificial-intelligence data-science keras machine-learning matplotlib nlp nlp-machine-learning numpy pandas pythorch sql tensorflow

Last synced: 09 Feb 2026

https://github.com/schukark/intervalpylib

Interval solver of systems of equations to find the workspace area of a robot

krawczyk matplotlib numpy python3 robotics solving-algorithm sympy systems-of-equations visualization

Last synced: 03 May 2026

https://github.com/hansalemaos/all_against_all

Each item of a list against all others

list numpy

Last synced: 02 Apr 2026

https://github.com/alphan26/airbnb-data-visualization

I've used the airbnb new york dataset from kaggle to make a comprehensive data visualization using pandas, seaborn, matplotlib.

machine-learning matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/nickklos10/compressive-strenght-prediction

This project predicts concrete compressive strength using a neural network regression model built with Keras.

jupyter-notebook keras matplotlib numpy pandas python scikit-learn

Last synced: 10 Apr 2026

https://github.com/mahmud-kandawala/simulating-covid-19-python-

This repository contains a Python program that simulates the spread of COVID-19 using Numpy and Matplotlib for calculations and visualization. The simulation offers an interactive way to understand virus dynamics and the impact of epidemiological parameters.

matplotlib numpy python

Last synced: 16 Apr 2026

https://github.com/carterbox/libimage

Provides large (2k) test images as NumPy arrays.

images numpy python

Last synced: 15 Apr 2026

https://github.com/feguedi/clasificador-numeros

Proyecto final de la materia de Inteligencia Artificial donde se desarrolla un software que pueda clasificar los números de la base de datos MNIST.

artificial-intelligence numpy python3

Last synced: 03 May 2026

https://github.com/baschin1103/neural-network-building-in-python

In this repository is a programm with a pretrained artificial neural network from Building AI. The network predicts the price of a cabin based on it's attributes.

bias building-ai forward-pass functioning identity linear-combinations neural-network numpy python regression relu

Last synced: 16 Apr 2026

https://github.com/lorenzorottigni/ml-houses

Machine Learning python bootcamp: linear regression on houses model

ipynb linear-regression machine-learning numpy pandas python scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/lmizner/codecademy_biodiversity

Using knowledge of data analysis to interpret data about endangered animals for the National Park Service

barplot jupyter-notebook matplotlib-pyplot numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/foggysmile/ml_labs_itmo

Лабораторные работы по машинному обучению

bert decision-trees jupyter-notebook machine-learning numpy pandas random-forest xgboost

Last synced: 11 Feb 2026

https://github.com/oguzhansarigol/Expected-Goals-xG-Data-Analysis

We will analyze in our code the expected goal locations of football players based on which areas of the field, at what times, with which feet, using which parts of their bodies, and from which angles and distances they are most likely to score.

data-analysis-python expected-goals numpy pandas-python python

Last synced: 20 Aug 2025

https://github.com/johnrobertsantos/exploratory-data-analysis-using-python

This project represents the culmination of my Python learning journey.

jupyter matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/csengupta1101/career-survey---data-scientist

This repository holds an in depth analysis of Scrapped data from naukri.com. which showcases in in demand skills and opportunities in Data science across the subcontinent.

analysis career data-science matplotlib naukri numpy pandas python3

Last synced: 07 May 2026

https://github.com/mogankumar/bird-flock-simulation-with-pyspark

A PySpark-based bird flock simulation modeling alignment, cohesion, and separation forces with a leader guiding the flock. Efficiently handles large-scale simulations and generates a visualized GIF of the flock’s movement.

argparse matplotlib numpy pyspark python3

Last synced: 01 May 2026

https://github.com/rtmigo/skifts_py

Search for the most relevant documents containing words from a query. Uses Scikit-learn and Numpy

cosine-similarity information-retrieval numpy python scikit-learn text-mining tf-idf

Last synced: 19 Aug 2025

https://github.com/harmanveer-2546/covid-19-chances-detection

The main goal of this project is to build a machine learning model that, given a Covid-19 patient's current symptom, status, and medical history, will predict whether the patient is in high risk or not.

covid-19 covid-data logistic-regression matplotlib mlp-classifier numpy pandas python random-forest seaborn sklearn-library xgboost

Last synced: 10 Apr 2026

https://github.com/ywatanabe1989/scitex-io

Universal scientific data I/O with plugin registry — save/load 30+ formats with one API. Part of SciTeX.

cli csv data-io hdf5 mcp numpy openscience pandas plugin-registry python research scientific-computing scitex

Last synced: 07 Jun 2026

https://github.com/nicolashaubricht/analise-dados-pib-per-capita

Este projeto tem como objetivo a análise de dados sobre as variações do PIB per capita em diversos países ao longo dos anos.

numpy pandas plotly python

Last synced: 03 May 2026

https://github.com/mouradhamzaoui/tweets_sentiments_emotions_analysis

Les réseaux sociaux, et en particulier Twitter, sont des plateformes essentielles où les utilisateurs expriment leurs sentiments et émotions au quotidien. Analyser les tweets peut fournir des informations précieuses sur l’humeur collective et les préoccupations des gens. Ce projet utilise un dataset de tweets disponible sur Kaggle

nltk numpy pandas sklearn

Last synced: 16 Apr 2026

https://github.com/yalai92/alfalfa_imp_exp_analysis

This repository covers data cleaning, analysis, and visualization of global alfalfa and pellet imports, focusing on trends from 2003 to 2023. It also includes a predictive analysis of global alfalfa demand for 2024-2029, using data science techniques to provide insights for stakeholders in the alfalfa industry.

data-analysis data-cleaning data-visualization matplotlib numpy pandas python sckiit-learn tableau

Last synced: 12 Feb 2026

https://github.com/vsrc/project-cardionaut

AI predictor for Heart Attack likelihood

numpy pandas pickle python streamlit

Last synced: 01 May 2026

https://github.com/liblaf/array-typing

🧮 Type-safe array operations for NumPy, JAX, and PyTorch

array code-quality documentation jax numpy python static-analysis torch type-hints typing

Last synced: 19 Aug 2025

https://github.com/arserchung/ml_cnn

Basis CNN intro& project of recognize to cat or dog

glob keras keras-models numpy opencv os sklearn tensorflow

Last synced: 13 Feb 2026

https://github.com/avin27/credit-card-fraud-detection

A python script to analyse credit card data to detect fraudulent transactions

data-science numpy pandas sklearn

Last synced: 10 Apr 2026

https://github.com/cyberoctane29/epa-air-quality-aqi-analysis

This project involved analyzing air quality data from the EPA, focusing on the Air Quality Index (AQI). I used Python data structures like dictionaries and sets to manage and process the data, simulating real-world data analysis to assess pollution levels and their health implications.

data-analysis numpy pandas python statistics

Last synced: 10 Apr 2026

https://github.com/harmonicode/signal_equalizer

Signal Equalizer is a Python-based application designed to process and analyze various types of signal data, including audio and ECG signals. It provides functionalities for reading, processing, visualizing, and enhancing signal data using techniques such as Wiener filtering.

audio-processing numpy pyqt5 scipy signal-processing wiener-filter

Last synced: 13 Feb 2026

https://github.com/ksganni/diabetes-prediction-using-machine-learning

🔬 A machine learning project that predicts diabetes likelihood using patient health metrics and multiple classification algorithms.

jupyter-notebook matplotlib numpy pandas python scikitlearn-machine-learning seaborn

Last synced: 28 Apr 2026

https://github.com/lexxai/goit_python_ds_hw_09

Модуль 9. Підбір гіперпараметрів НМ. Глибоке навчання. Tensorflow. Keras.

adam-optimizer data-science google-colab keras keras-tensorflow numpy python

Last synced: 16 Apr 2026

https://github.com/rakesh3596/tutedude_projects

It consist of 3 major project 1:Covid19 Analysis 2:Rainfall Prediction 3:Tumor Detection

csv-files matplotlib numpy pandas plotly python sklearn-library

Last synced: 16 Apr 2026

https://github.com/mjshubham21/ny_yellow_taxi_python_da_project

A data analysis project of New York Yellow Taxi (Feb of 2025) using Python and its libraries for analytics like : NumPy, MatPlotLib, Pandas and Seaborn.

data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 04 May 2026

https://github.com/codersales/machine-learning-classification

machine learning jupyter notebooks | data-science | priority | relevant | significant | green-light | 1 | may-2023-filtered | may-2023-filtered-2 | may-2023-filtered-3 | filtered-4 | frequent

authorized classification current decision-tree ensemble-techniques jupyter machine-learning more-than-100-commits more-than-300-commits numpy pandas python3 ranked repository-5 seaborn sklearn stacking sub-critical supervised-learning workstation

Last synced: 07 Mar 2026

https://github.com/harmanveer-2546/movie-industry

Investigate the film industry to gain sufficient understanding of what attributes to success and in turn utilize this analysis to create actionable recommendations for companies to enter the industry.

business business-analytics data-analysis datatime film-industry graphs matplotlib movie-database numpy pandas python scraping-websites seaborn visualization web-scraping-python

Last synced: 10 Apr 2026

https://github.com/splithor1zon/edu-barriermethodlp

Education material for learning/teaching of Linear Progrmming using Barrier Method. Presentation with commentary and python code available.

barrier-method convex-optimization cvxpy educational-project linear-programming numpy presentation python

Last synced: 01 May 2026

https://github.com/ialam085/super_store_sales_analysis_python

The project is indeed focused on performing an exploratory data analysis (EDA) of Super Store Sales data from various perspectives, using comprehensive visualizations.

charts matplotlib numpy pandas python seaborn visualization

Last synced: 10 Apr 2026

https://github.com/sairupeshl/portfolio-risk-calculator

A portfolio risk engine using Python, integrating the Yahoo Finance API to ingest and preprocess historical multi-asset price data

numpy pandas python yfinance

Last synced: 08 Jun 2026

https://github.com/erikaax08/numerical_methods

This repository hosts the project developed for the Numerical Methods class at the Benemérita Universidad Autónoma de Puebla (BUAP)

django maths matplotlib numerical-methods numpy plotly

Last synced: 19 Aug 2025

https://github.com/killervardhan8/gesturedecode

The Sign Language Interpretation project focuses on recognizing and interpreting hand gestures to facilitate communication for individuals who use sign language. This project leverages computer vision and machine learning techniques to accurately identify and translate hand signs into text

csv mediapipe numpy python tensorflow

Last synced: 28 Feb 2026

https://github.com/mchenryspagg/economic-variable-correlation-and-productivity-model

This report provides an in-depth analysis of Nigeria's economic performance over the past decade, spanning from 2014 to 2024.

correlation exchangerates inflation moneysupply numpy productivity python regression regression-analysis regression-models visualization

Last synced: 15 Feb 2026

https://github.com/hghyhghy/notebook

Repo consist numpy , matplotlib and pymongo aggression

jupyter-notebook matplotlib numpy pymongo python-3

Last synced: 01 May 2026

https://github.com/ahmedabdalkreem/naive-bayes

Show what types of naive-bayes and how to use model and deal with data ( Numerical, Categorical and Mixed between Numerical and Categorical )

numpy pandas python3

Last synced: 10 Apr 2026

https://github.com/terrelbrinkley/python-projects

Python Scripts & Data Analyst Projects

aws numpy pandas python selenium

Last synced: 10 Apr 2026

https://github.com/ahmedsamir45/demograghic-data-analyzer

answer some questions about the data

demographic-data-analyzer numpy pandas

Last synced: 01 May 2026

https://github.com/hafidaso/predicting-industrial-machine-downtime-level-3

This project aims to develop a predictive model using machine learning techniques to forecast machine failures based on historical operational data.

imbalanced-learning numpy pandas python scikit-learn seaborn xgboost

Last synced: 16 Apr 2026

https://github.com/sasanka14/water_quality_predictions

Water Quality Prediction - College Project 🌊💧 Predicts water potability (safe/unsafe) using ML models like XGBoost & Random Forest. Features data preprocessing, feature importance, model evaluation, and visualizations. Built with Python, Pandas, Scikit-learn & Seaborn for analysis. 🚀

anaconda jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn xgboost

Last synced: 16 Apr 2026

https://github.com/mohhasan1/machine-learning

This repository is where I learn and store information about various machine learning libraries, such as Pandas, scikit-learn, and others. It serves as a personal reference for exploring and experimenting with different ML tools and techniques.

machine-learning matplotlib numpy pandas seaborn sklearn

Last synced: 01 May 2026

https://github.com/lorenzorottigni/ml-advertising

Machine Learning python bootcamp: logistic regression on advertising dataset

ipynb logistic-regression machine-learning numpy pandas python scikit-learn seaborn

Last synced: 16 Apr 2026

https://github.com/williyam-m/movie_booking_app

A Django-based movie booking app featuring an admin dashboard with CRUD functionality for movies, theatres, screens, shows, and bookings. Includes an ML-powered movie recommendation system.

django machine-learning numpy orm pandas python sckiit-learn sqlite3

Last synced: 16 Apr 2026

https://github.com/5hraddha/sda-megaline-the-best-plan

Megaline is a telecom operator and it offers its clients two prepaid plans, Surf and Ultimate. The commercial department wants to know which of the plans brings in more revenue in order to adjust the advertising budget.

data-visualization hypothesis-testing matplotlib numpy pandas scipy seaborn statistical-data-analysis

Last synced: 16 Apr 2026

https://github.com/hygull/trypandas

A repository containing Python pandas and other data science related sources and other resources used to work with pandas, numpy, matplotlib etc.

matplotlib numpy pandas python3

Last synced: 16 Apr 2026

https://github.com/urvee1810/air-quality-prediction-using-arima

Analyzed Beijing's air quality data using time series analysis and ARIMA modeling to forecast PM2.5 pollution levels. Identified seasonal patterns and correlations between pollutants, revealing insights into urban air quality trends and potential contributing factors.

arima-model dickey-fuller-test matplotlib numpy pandas python seaborn statistical-analysis time-series-analysis

Last synced: 16 Apr 2026

https://github.com/aadityasikder/neural-network-from-scratch-

This notebook demonstrates a neural network implementation using NumPy, without TensorFlow or PyTorch. Trained on the MNIST dataset, it features an architecture with input layer (784 neurons), two hidden layers (132 and 40 neurons), and an output layer (10 neurons) with sigmoid activation.

backward-propagation evaluation forward-propagation mnist-dataset neural-network neural-networks-from-scratch numpy training

Last synced: 16 Apr 2026

https://github.com/costrouc/xnd-notebooks

A collections of notebooks that demonstrate xnd, ndtypes, and gumath features

binder gumath ndtypes notebooks numba numpy xnd

Last synced: 16 Apr 2026

https://github.com/shoaib1522/credit-card-fraud-analysis

"A comprehensive project on Credit Card Fraud Detection combining Exploratory Data Analysis, Machine Learning, and an interactive Streamlit web app for real-world applicability."

eda epochs mlp-classifier neural-network numpy pandas-python perceptron-learning-algorithm python regression sckiit-learn seaborn streamlit streamlit-webapp tensorflow

Last synced: 09 Apr 2026

https://github.com/magnusrodseth/disaster-tweets

📚 Assignments in the course IT3212 - Data Driven Software at NTNU. Our task is to classify whether a tweet is related to a disaster or not.

adaboost jupyter-notebook logistic-regression numpy pandas python python3 random-forest support-vector-machines xgboost

Last synced: 16 Apr 2026

https://github.com/ashwin331133/liver_disease_detection

This dataset consists of 416 liver patient records and 167 non-liver patient records collected from North East of Andhra Pradesh, India. And The main objective of this project is to use classification algorithms to detect liver patients from healthy individuals.

data-visualization machine-learning numpy pandas python

Last synced: 16 Apr 2026

https://github.com/hanannazri/predictive-customer-churn-modelling-with-price-sensitivity-insights-

As a part of BCG Data Science Project, I developed a predictive churn-risk model for XYZ energy utility by engineering price, consumption and contract features and training XGBoost and Random Forest models to identify customers most likely to churn at current prices.

bcgx data-visualization exploratory-data-analysis feature-engineering jupyter jupyter-notebook model-training-and-evaluation numpy pandas random-forest scikit-learn xgboost

Last synced: 19 Aug 2025