An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language that adds support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.
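As a brief illustration of that description, the sketch below builds a small two-dimensional array and applies a few of NumPy's mathematical functions to it. The array values and variable names are made up for the example.

```python
import numpy as np

# A small 2-D array (2 rows, 3 columns); the values are arbitrary.
a = np.array([[1.0, 2.0, 3.0],
              [4.0, 5.0, 6.0]])

# Element-wise math runs over the whole array without an explicit loop.
roots = np.sqrt(a)

# Reductions can run over the whole array or along a single axis.
total = a.sum()              # sum of every element
col_means = a.mean(axis=0)   # mean of each column

# Matrix product of the array with its transpose gives a 2x2 result.
gram = a @ a.T

print(a.shape, total, col_means, gram.shape)
```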

https://github.com/abhinav330/credit-score-classification-problem

This project aims to analyze credit card customer data, clean the dataset using libraries such as Pandas and NumPy, and then create a machine learning model using sklearn.

credit-card credit-card-fraud credit-card-fraud-detection credit-risk credit-scoring data-science machinelearning numpy pandas python sklearn

Last synced: 08 Sep 2025

https://github.com/kelvynamaral/rolling-stones-spotify-eda

Exploratory Data Analysis of the Rolling Stones on Spotify. An EDA project focused on the Rolling Stones' songs and albums on Spotify. It explores musical characteristics, popularity, and the band's evolution over time, using data analysis techniques and visualizations to reveal insights into the iconic rock band's discography.

matplotlib numpy pandas seaborn

Last synced: 15 May 2026

https://github.com/abeertechcamus/documentdata

The dataset was cleaned and queried using Python inside Jupyter Notebook and visualized using PowerBI. Document Data Analysis Projects.

dax jupyter-notebook numpy pandas powerbi python

Last synced: 04 May 2026

https://github.com/fandreuz/python-serialization-benchmark

Low-effort benchmark of several serialization libraries for scientific applications

avro json numpy protobuf python serialization

Last synced: 12 Apr 2026

https://github.com/jovicdev97/Financial-Loan-DataScience-Notebook

Using NumPy and pandas to analyze a synthetic loan dataset with Python.

data-analysis matlabplot numpy pandas plotting python seaborn

Last synced: 12 Mar 2025

https://github.com/dakshsammi/arkaid

Arkaid is a game performance analytics platform developed for the Information Integration Architecture Course - CSE656 (IIIT Delhi). It uses a data warehouse approach to analyze gaming data from multiple sources and provides insights via an AI-driven interface.

ai airflow aws database-management datawarehouse docker etl flask information-integration numpy openai postgresql python togetherai

Last synced: 09 Apr 2026

https://github.com/harmanveer-2546/campus-placement-prediction

Student Campus-Placement Prediction ML Project using Python

decision-trees machine-learning matplotlib numpy python random-forest scipy seaborn

Last synced: 08 Mar 2026

https://github.com/roluochke/real-estate-price-prediction-app

This is a data science project for a real estate price prediction website.

flask html-css-javascript jupyter-notebook matplotlib numpy python sklearn

Last synced: 09 Apr 2026

https://github.com/abdelrahman-amen/face_recognition

This project implements a real-time face recognition system using Python, OpenCV, and the face_recognition library. It identifies individuals by comparing facial features from live video streams with pre-stored encodings, enabling fast and accurate recognition.

cmake cv2 dlib-face-detection face-recognition numpy opencv os python

Last synced: 10 May 2026

https://github.com/dhavltharkaaar/portfolioprojects

This repository contains the projects related to my portfolio.

css flask flask-api html5 idle jupyter-notebook matplotlib numpy pandas python python3 sklearn spyder

Last synced: 06 Apr 2026

https://github.com/dsowles/miniproject1davidsowles2025

INF601 Mini Project 1

numpy yfinance

Last synced: 16 May 2026

https://github.com/riyajain255/customer-segmentation-for-e-commerce

This project analyzes online retail data to segment customers using K-Means clustering and build classification models to predict those segments based on purchasing behavior.

customer-segmentation data-analysis kmeans-clustering logistic-regression machine-learning matplotlib numpy pandas python random-forest scikit-learn seaborn-plots

Last synced: 02 Apr 2026

https://github.com/abyshergill/ultralytics_yolo_object_detection_testing_gui

This is a desktop application for real-time object detection using the YOLO (You Only Look Once) model, built with the ultralytics library. It uses PyQt5 for the graphical user interface and OpenCV for video and image processing. The application allows users to load a YOLO model, adjust detection settings, and perform object detection on media.

numpy opencv-python pytorch qt5-gui threading ultralytics yolo

Last synced: 08 May 2026

https://github.com/prachi005748/website-performance-data-analysis-project

Briefly describe the objective of the project—e.g., analyzing website performance metrics over time, uncovering trends in user engagement, or evaluating channel-wise traffic quality.

data-analyst data-cleaning data-preprocessing data-visualization data-visualization-python exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn storytelling

Last synced: 01 May 2026

https://github.com/samiksha29-patil/e-commerce-sales-insights-dashboard

This project focuses on analyzing e-commerce sales data through data visualization. It highlights customer behavior, popular sales channels, product category trends, and city-wise performance to provide actionable business insights.

analytics customer-insights data-visualization ecommerce matplotlib numpy pandas python sales-analysis seaborn

Last synced: 03 May 2026

https://github.com/princeoncada/quant-pca-risk

Applies Principal Component Analysis (PCA) to daily returns of 20 US equities (2015–2025) to uncover hidden risk factors. Explores variance explained, scree, loadings, factor returns, covariance reconstruction, and Varimax rotation. Results show 3–5 PCs capture ~75% of portfolio risk.

correlation-analysis covariance-matrix dimensionality-reduction factor-models matplotlib numpy pandas pca portfolio-risk principal-component-analysis python quantitative-finance time-series-analysis variance varimax

Last synced: 06 May 2026
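The "variance explained" idea in the quant-pca-risk entry above can be sketched with plain NumPy. This is a minimal illustration on synthetic random returns, not the repository's code or data; the factor structure, array sizes, and seed are assumptions chosen only so the example runs.

```python
import numpy as np

rng = np.random.default_rng(0)  # fixed seed; synthetic data only

# Hypothetical setup: 500 days of returns for 20 assets driven by
# 3 common factors plus idiosyncratic noise.
n_days, n_assets, n_factors = 500, 20, 3
factors = rng.normal(size=(n_days, n_factors))
loadings = rng.normal(size=(n_factors, n_assets))
returns = factors @ loadings + 0.5 * rng.normal(size=(n_days, n_assets))

# PCA via eigendecomposition of the sample covariance matrix.
cov = np.cov(returns, rowvar=False)      # shape (20, 20)
eigvals, eigvecs = np.linalg.eigh(cov)   # eigenvalues in ascending order

# Reorder components from largest to smallest variance.
order = np.argsort(eigvals)[::-1]
eigvals, eigvecs = eigvals[order], eigvecs[:, order]

# Fraction of total variance captured by each principal component.
explained = eigvals / eigvals.sum()
cumulative = np.cumsum(explained)

print(cumulative[:5])
```

Plotting `explained` against the component index gives a scree plot, and `eigvecs @ np.diag(eigvals) @ eigvecs.T` reconstructs the covariance matrix.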

https://github.com/kushalagarwalla/netflix-movie-data-analysis

🚀 Netflix Data Analytics Project 🎬📊 | Analyzed 9K+ movies to uncover insights on genres, popularity, votes & release trends. Includes EDA, KPIs & visualizations using Python (Pandas, NumPy, Matplotlib, Seaborn). Supports data-driven content & engagement strategy.

data-analysis data-visualization jupyter-notebook numpy pandas python seaborn

Last synced: 06 May 2026

https://github.com/samiksha29-patil/flipkart-mobiles-data-analysis-visualization-in-python

This project analyzes Flipkart Mobiles Dataset to extract useful insights about mobile phones, their pricing, ratings, discounts, and customer reviews. The analysis and visualization are done using Python to understand market trends and customer preferences.

data-analysis data-visualization matplotlib numpy pandas python seaborn

Last synced: 04 May 2026

https://github.com/makcoder-2004/exploratory-data-analysis-projects

This repository features Exploratory Data Analysis (EDA) projects using Python, Pandas, NumPy, Matplotlib, and Seaborn to clean, explore, and visualize data. The goal is to uncover patterns, trends, and insights through statistical analysis and clear visualizations, preparing datasets for deeper analysis or modeling.

csv-files dataframes jyputer-notebook matplotlib numpy pandas seaborn

Last synced: 07 May 2026

https://github.com/kandil2001/lid-cavity-evolution

Lid Cavity Evolution is an open-source CFD suite for the lid-driven cavity problem, featuring MATLAB, Python, and parallel solvers with benchmark comparisons.

benchmarking cfd cfd-simulation lid-driven-cavity matlab mpi numpy open-source openfoam openmp parallel-computing python scientific-computing simple-algorithm star-ccm

Last synced: 06 May 2026

https://github.com/4ngelojr/ai-ml-cheatsheets

🗂️ Access essential AI and ML concepts with quick-reference cheatsheets for effective learning and project implementation.

bigdata cheatsheet computer-science convolutional-neural-networks data-science deep-learning dl excel hadoop keras linux neural-networks numpy pig plotly probability sql statistics

Last synced: 01 May 2026

https://github.com/labex-labs/numpy-for-beginners

This comprehensive course covers the fundamental concepts and practical techniques of NumPy, the essential library for numerical computing in Python. Learn to create, manipulate, and analyze arrays efficiently.

array-manipulation array-slicing beginner-friendly course data-analysis data-science data-structures fast-computation hands-on labex labs linear-algebra matrix-operations numerical-computing numpy programming python python-programming scientific-computing vectorized-operations

Last synced: 20 Jun 2026

https://github.com/vitor-ace/sunspots-data-analysis

This is a Jupyter Notebook that demonstrates data analysis logic and library usage in Python.

data-analysis data-visualization debbuging error-handling file-handling matplotlib-pyplot numpy pandas python

Last synced: 06 May 2026

https://github.com/jaketherealone/machinelearningtoolset

Collection of primary tools used in the field of machine learning - Written for numpy projects

machine-learning numpy python toolset

Last synced: 19 Jun 2026

https://github.com/rohanbanerjee1234567-cell/prediction-of-expected-salary-using-machine-learning

This is my first project repository, a machine learning project written in Python. The problem statement was to train a model on the given dataset and use it to predict the expected salary of an employee with a similar profile.

exploratory-data-analysis linearregression matplotlib-pyplot numpy pandas randomforest randomforestregressor scikit-learn scikitlearn-machine-learning searborn visualization

Last synced: 27 Apr 2026

https://github.com/ars-sg/python_projects

A collection of diverse Python projects. Each project is meant to highlight different aspects of Python, from data analysis and web development to automation and machine learning. Feel free to explore the code, contribute, or reach out if you have any questions or collaboration ideas!

machine-learning matplotlib numpy pandas python seaborn streamlit

Last synced: 09 Apr 2026

https://github.com/radithsandeepa/robot-path-visualizer

Welcome to Robot Path Visualizer! This tiny project uses Python, Pygame, and NumPy to replay a precomputed trajectory (x, y, θ). It rotates a robot sprite frame-by-frame and draws a trail, making it handy for diff-drive path playback, simulation demos, and quick visual debugging of robotics experiments.

animation numpy pygame python robot-simulation robotics trajectory-visualization

Last synced: 18 Sep 2025

https://github.com/shotadft/vecposlib

This is a library that enables vector calculations in Python.

math math-library numpy python python-lib python-library python3

Last synced: 18 Sep 2025

https://github.com/imnotannamaria/ia-ml-scikit-learn

Learning traditional machine learning with scikit-learn.

numpy pandas python scikit-learn seaborn

Last synced: 06 Apr 2026

https://github.com/ayushtiwari134/stock_price_predictor_dl

This is a full-stack, end-to-end project with the model trained in a Jupyter notebook, the backend written in Python, and, for simplicity, the frontend created using Streamlit.

deep-learning eda keras lstm-model machine-learning matplotlib numpy pandas python rnn streamlit tensorflow yfinance-library

Last synced: 06 Apr 2026

https://github.com/abhishek-patidar066/synthetic-datasets

Synthetic datasets are artificially generated data used for training machine learning models, simulating real-world data while ensuring privacy.

clustering datasets jupyter-notebook libraries matplotlib-pyplot numpy pandas-dataframe python random sklearn

Last synced: 15 Apr 2026

https://github.com/crvargasm/metnumun2021i

Universidad Nacional de Colombia - Numerical Methods ~ Group 1, 2021-I

matplotlib numerical-analysis numpy pandas python quantopian

Last synced: 12 Apr 2026

https://github.com/anushkapal28/maatri-care

MaatriCare is an AI-based healthcare application designed to provide personalized nutrition advice and maternal care recommendations for pregnant women, lactating mothers, and women in general.

gemini-api jupyter-notebook labelencoder numpy pandas python3 random-forest streamlit svm-training

Last synced: 18 Sep 2025

https://github.com/akash-207/ecommerce-monetization-analysis

Revenue and monetization analysis of Brazilian e-commerce data (Olist dataset). Includes KPIs, trends, and business insights.

business-analytics matplotlib mysql numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/noxs1d/cv-emotion-classification

This project is a deep learning-based system for detecting human emotions from facial images using TensorFlow, OpenCV, and NumPy.

cnn cnn-classification dockerfile keras numpy opencv tensorflow transfer-learning

Last synced: 09 Apr 2026

https://github.com/spacebakery/crunchie-munchies-project

Analyze Data with Python

numpy python statistics

Last synced: 27 Jul 2025

https://github.com/reshiadavan/musica

A music composer AI that produces music and associated transcripts when given MIDI files.

keras lstm-ann music21 numpy pickle python

Last synced: 13 Apr 2026

https://github.com/vishal-verma-96/Pre-Owned-Car-Price-prediction-using-Streamlit-App

Capstone project by Skill Academy: exploratory analysis, visualization, and prediction of used car prices. Deploys the highest-scoring model as a Streamlit web app.

data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms matplotlib numpy pandas python3 regression-algorithms scikit-learn seaborn streamlit

Last synced: 02 Mar 2025

https://github.com/ayan6943/employee-attrition-prediction-with-machine-learning

Employee Attrition Prediction with Machine Learning | Analyzing HR data to predict employee turnover using Random Forest. Includes EDA, feature engineering, model training, and evaluation. Achieved 90% accuracy.

attrition employee machine-learning matplotlib numpy pandas python randomforestclassifier scikit-learn seaborn smote

Last synced: 09 Apr 2026

https://github.com/doonse/ml-handwritten

Neural network that recognizes numbers from 8x8 images. Built using NumPy.

handwritten-digit-recognition machine-learning-algorithms neural-network numpy

Last synced: 15 May 2026

https://github.com/abdulhadi806/muffin-or-cupcake-model

Machine learning model that checks whether the input I provide is a muffin or a cupcake. I am in a learning phase; today is 1/25/2024, so let's see how far I can go in this field. Wish me luck :)

ai jupyter-notebook machine-learning numpy pandas python sklearn support-vector-machine svm

Last synced: 09 Apr 2026

https://github.com/jain1shh/solar-flare-prediction

This repository contains code and data for predicting solar flare energy ranges using machine learning, based on NASA's RHESSI mission data. It includes preprocessing of FITS files into a unified CSV dataset and implements models like Gradient Boosting, Random Forest, and Decision Tree classifiers, achieving accuracies up to 87%.

data-visualization machine-learning numpy pandas python scikit-learn solar-flare-prediction

Last synced: 09 Apr 2026

https://github.com/azaz9026/loan_approval_prediction

Welcome to the Loan Approval Prediction repository! This project aims to build a predictive model that can determine whether a loan application should be approved or denied based on various features. Purpose: The goal of this repository is to develop a machine learning model that can accurately predict loan approval decisio

data data-analysis data-visualization eda machine-learning numpy pandas python statistics

Last synced: 06 Apr 2026

https://github.com/mayankmittal29/tensortinker_statistical_methods_in_ai

This repository contains implementations of various machine learning algorithms from scratch, including Multi-Layer Perceptron (MLP), Gaussian Mixture Models (GMM), Principal Component Analysis (PCA), Autoencoders, and Variational Autoencoders.

autoencoder-mnist cupy gmm-clustering image-segmentation matplotlib-pyplot mlp-classifier mlp-regressor mnist-dataset numpy pandas pca python3 pytorch roc-auc seaborn torch variational-autoencoder

Last synced: 09 Apr 2026

https://github.com/emv271828/calculadora-rpn

Reverse Polish notation calculator developed in Python.

numpy pyside6 python

Last synced: 15 May 2026
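Reverse Polish notation, named in the calculadora-rpn entry above, places each operator after its operands, so an expression can be evaluated with a single stack. The sketch below is a generic illustration of that technique, not the repository's implementation; the function name `eval_rpn` and the set of supported operators are assumptions.

```python
import operator

# Supported binary operators (an assumption for this sketch).
OPS = {
    "+": operator.add,
    "-": operator.sub,
    "*": operator.mul,
    "/": operator.truediv,
}

def eval_rpn(expression: str) -> float:
    """Evaluate a space-separated RPN expression using a stack."""
    stack: list[float] = []
    for token in expression.split():
        if token in OPS:
            if len(stack) < 2:
                raise ValueError("not enough operands")
            right = stack.pop()  # the most recent operand is the right-hand side
            left = stack.pop()
            stack.append(OPS[token](left, right))
        else:
            stack.append(float(token))  # anything else is treated as a number
    if len(stack) != 1:
        raise ValueError("malformed expression")
    return stack[0]

print(eval_rpn("3 4 + 2 *"))  # evaluates (3 + 4) * 2
```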

https://github.com/nguyenanhtuan1912/datatable-image-to-text

This repository is a course project for Computer Vision.

computer-vision javascript nodejs numpy opencv python tesseract

Last synced: 09 Apr 2026

https://github.com/blackcub3s/msc-finalthesis

The most important programming files, code functions and data processing pipelines for the Machine learning final thesis of my Master's degree. Also, the LaTeX code of the thesis.

data-analysis latex machine-learning numpy python sklearn

Last synced: 09 Apr 2026

https://github.com/cimedl/connect4-ai

Testing and exploring AI methods and functionality such as Minimax with search trees / game trees

ai ml numpy pygame python3

Last synced: 15 May 2026

https://github.com/prakharagr2001/whatsappchatanalyzer

WhatsAppChat Analyzer is a Python-powered website leveraging the capabilities of NumPy, Pandas, and Streamlit. It provides comprehensive analysis of WhatsApp conversations, extracting valuable insights and statistics for a user-friendly interpretation of chat data.

chat-analysis data-visualization numpy pandas python streamlit whatsapp-chat

Last synced: 20 Jan 2026

https://github.com/vyjayanthipolapragada/image_classifier_cnn_data_augmentation

A deep learning project using Convolutional Neural Networks (CNNs) to classify CIFAR-10 images. The model leverages data augmentation, batch normalization, and ReLU activation to improve performance and generalization. Includes training and evaluation scripts for multi-class image classification.

adam-optimizer convolutional-neural-networks data-augmentation deep-learning image-classification jupyter-notebook neural-networks numpy optimizer pandas python pytorch relu-layer

Last synced: 09 Apr 2026

https://github.com/hansalemaos/numpyslicesplit

Splits a numpy array or a list based on the given indices or ranges and returns the split arrays.

numpy split

Last synced: 16 Feb 2026
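Splitting an array at given indices, as described in the numpyslicesplit entry above, can be illustrated with NumPy's built-in `np.split`. This shows the general idea only; it is not the package's own API, which this listing does not document.

```python
import numpy as np

arr = np.arange(10)  # the integers 0 through 9

# Split before positions 3 and 7, giving three pieces:
# arr[:3], arr[3:7], and arr[7:]
parts = np.split(arr, [3, 7])

for part in parts:
    print(part)
```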

https://github.com/v-mayya/resale-revolution

Empower Hacks Hackathon Submission

html matplotlib numpy pandas python sqlite streamlit

Last synced: 09 Apr 2026

https://github.com/mirzaazwad/tymbert

TYMBert is our submission for NCIM 2025, a spam classifier that makes use of knowledge distillation to compress the model while preserving accuracy

bert huggingface-transformers knowledge-distillation machine-learning matplotlib numpy pandas python3 scikit-learn tiny-bert torch

Last synced: 09 Apr 2026

https://github.com/jcardonamde/datasets_ml

This project analyzes cab and limousine travel data in New York City with the goal of predicting the total duration of trips within the city using machine learning models.

data-science machine-learning machine-learning-algorithms matplotlib numpy pandas pipelines python seaborn sklearn

Last synced: 09 Apr 2026

https://github.com/shafaq-aslam/predicting-heart-disease-risk-with-logistic-regression-techniques

Develop a predictive model using logistic regression techniques to assess heart disease risk based on patient health metrics and data analysis.

data-analysis heart-disease logistic-regression machine-learning machine-learning-models matplotlib numpy pandas python scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/stefagnone/text_adventure_game

A text-based adventure game project using Python fundamentals

matplotlib numpy pandas python r scikit-learn seaborn sql

Last synced: 09 Apr 2026

https://github.com/Khushi130404/CatNet

CatNet is a simple machine learning project that classifies images as either a cat or not a cat using logistic regression. The dataset consists of labeled images of cats and non-cats, preprocessed and used to train a binary classification model.

h5py matplotlib numpy pillow scipy

Last synced: 19 Sep 2025

https://github.com/n-t-raghava/the_sweet_16

This project detects faces in real-time from a webcam feed or an uploaded image and predicts age (in bins of 5 years) and gender (Male/Female). The model is based on OpenCV’s Deep Neural Network (DNN) module and pre-trained models for face, age, and gender detection.

caffemodel deep-learning flask numpy opencv python

Last synced: 09 Apr 2026

https://github.com/rushilsharma1/iphonesales_eda

An exploratory data analysis of iPhone sales in a Jupyter notebook, using Python and its libraries.

data-science exploratory-data-analysis jupyter-notebook numpy pandas python seaborn

Last synced: 20 Jan 2026

https://github.com/shauryashaurya/marty_mcfly

Code, text, and notebooks for a tutorial introducing machine learning with open source tools

anaconda jupyter-notebooks machine-learning machine-learning-tutorials notebook numpy python regression scikit-learn scipy tutorial

Last synced: 09 Apr 2026

https://github.com/nandit123/python_on_excel

Data analysis of Excel data using Python libraries

csv data-analysis data-science fill fluctuations graph numpy python python-library

Last synced: 16 May 2026

https://github.com/haseebulhassan437/lstm-next-word-predictor

A deep learning-based LSTM model for predicting the next word in a sequence using natural language processing techniques.

keras-tensorflow lstm numpy tensorflow

Last synced: 18 Apr 2026

https://github.com/rajireddy15/employee-attrition-prediction-hr-analytics-

Employee Attrition Prediction (HR Analytics) helps organizations analyze employee data, identify factors driving turnover, and predict attrition using machine learning and visual dashboards, enabling data-driven HR decisions and retention strategies.

data-cleaning data-collection data-manipulation data-preprocessing data-science data-visualization eda feature-engineering imbalanced-data machine-learning mysql-database numpy pandas

Last synced: 04 May 2026

https://github.com/ghaniketrajputp005/adobe-gensolve-hackathon-2024

This project aims to identify, regularize, and beautify curves in 2D Euclidean space.

cv2 keras-tensorflow numpy os sklearn

Last synced: 07 Feb 2026

https://github.com/kgruiz/linalg-practice

LinAlg-Practice is a Python library developed to deepen my understanding of linear algebra through hands-on implementation of various matrix operations. It includes comprehensive tests that compare the results with established libraries like NumPy to ensure accuracy and reliability.

algorithms data-science linear-algebra math matrix-operations numpy python sympy

Last synced: 21 Apr 2026

https://github.com/vetrivel07/flight-price-prediction

Developed a flight price prediction model using Python, analyzing historical data to forecast airfare prices and help travelers make informed booking decisions

data-analysis data-visualization jupyter-notebook numpy pandas python

Last synced: 15 Jun 2025

https://github.com/vinicius999/eda-imdb-top1000-films

Exploratory analysis of the top 1000 films on IMDB through 2020

eda numpy pandas python

Last synced: 07 May 2026

https://github.com/buildwithlal/introduction-to-data-science-in-python-coursera

Introduction to Data Science in Python, part of the Applied Data Science using Python Specialization from the University of Michigan, offered by Coursera

data-analysis matplotlib numpy pandas

Last synced: 03 May 2026

https://github.com/ajxxxs/spotify-music-analysis

Spotify music analysis of web-scraped playlists (over 3 states): trends, features, and a music recommendation system.

matplotlib numpy panda scikit-learn seaborn

Last synced: 28 Jul 2025

https://github.com/yalai92/cava_imp_exp_analysis

Analysis, visualization, preprocessing and clustering of global sparkling wine trade (2017–2024) using Python in Colab and ML to reveal trends and country profiles.

clustering colab-notebook data-science data-visualization machine-learning matplotlib numpy pandas preprocessing python sklearn tableau-public

Last synced: 25 Jan 2026

https://github.com/abdullahashfaqvirk/sms-spam-detection

A machine learning application designed to classify SMS messages as spam or non-spam, offering real-time analysis to identify potentially harmful content.

css3 docker flask html5 javascript matplotlib nltk numpy pandas python scikit-learn seaborn tailwindcss xgboost

Last synced: 02 Apr 2026

https://github.com/dmarks84/coursework_project_ml-model-eval-refine

Project for IBM Data Science course on ML Models & Analysis -- read in a large dataset of home sales and used polynomial linear regression analysis to predict future home sale prices

classification communication data-modeling dataframes machine-learning matplotlib numpy pandas programming python regression scikit-learn scipy seaborn supervised-ml visualization

Last synced: 09 Apr 2026

https://github.com/mdarshad1000/ml-libraries

Code for the scientific and visualization libraries in Python for Machine Learning.

matplotlib numpy pandas seaborn

Last synced: 09 Apr 2026

https://github.com/marcinz20/sortingalgorithms

This is a basic console program that contains a few basic sorting algorithms and showcases their usage

algorithms numpy object-oriented-programming python

Last synced: 15 May 2026

https://github.com/kikoveiga/feup-ia1

Artificial Intelligence (IA) First Project (2023/2024): BSc in Informatics and Computing Engineering @ FEUP

feup feup-ia feup-leic genetic-algorithms hill-climbing numpy pandas python simulated-annealing tabu-search

Last synced: 06 May 2026

https://github.com/qanastek/parseur-pdf

https://trello.com/b/SbT2XGyF/g%C3%A9nie-logiciel-scrum

beautifulsoup4 numpy python

Last synced: 15 May 2026

https://github.com/mustafadanabasi/python-linearregression-evfiyatlari

A study on predicting house prices with linear regression.

linear-regression numpy pandas python

Last synced: 06 Apr 2026

https://github.com/franciscomartinez45/Social-Network-Analysis

Analyzes how misinformation propagates within communities. With the goal of addressing health disparities and improving health literacy, particularly in minority populations, the project explores both supervised and unsupervised learning approaches to understand patterns in graph-structured data using a custom Graph Attention Network

matplotlib ml networkx numpy pytorch

Last synced: 11 Apr 2025