An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.
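
As a minimal sketch of what that description means in practice, the example below builds a small two-dimensional array and applies a few of NumPy's vectorized operations to it. The array values and variable names are illustrative assumptions and are not taken from any project listed here.

```python
import numpy as np

# A 2 x 3 array (values are illustrative only).
a = np.array([[1.0, 2.0, 3.0],
              [4.0, 5.0, 6.0]])

# Element-wise math operates on the whole array at once.
squared = a ** 2

# Reductions along an axis: column sums and row means.
col_sums = a.sum(axis=0)    # shape (3,)
row_means = a.mean(axis=1)  # shape (2,)

# Matrix product of a (2 x 3) with its transpose (3 x 2) gives a 2 x 2 matrix.
gram = a @ a.T

print(squared, col_sums, row_means, gram, sep="\n")
```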

https://github.com/abeertechcamus/documentdata

Document Data Analysis Projects. The dataset was cleaned and queried using Python inside Jupyter Notebook and visualized using Power BI.

dax jupyter-notebook numpy pandas powerbi python

Last synced: 04 May 2026

https://github.com/harmanveer-2546/campus-placement-prediction

Student Campus-Placement Prediction ML Project using Python

decision-trees machine-learning matplotlib numpy python random-forest scipy seaborn

Last synced: 08 Mar 2026

https://github.com/dsowles/miniproject1davidsowles2025

INF601 Mini Project 1

numpy yfinance

Last synced: 16 May 2026

https://github.com/kushalagarwalla/netflix-movie-data-analysis

🚀 Netflix Data Analytics Project 🎬📊 | Analyzed 9K+ movies to uncover insights on genres, popularity, votes & release trends. Includes EDA, KPIs & visualizations using Python (Pandas, NumPy, Matplotlib, Seaborn). Supports data-driven content & engagement strategy.

data-analysis data-visualization jupyter-notebook numpy pandas python seaborn

Last synced: 06 May 2026

https://github.com/4ngelojr/ai-ml-cheatsheets

🗂️ Access essential AI and ML concepts with quick-reference cheatsheets for effective learning and project implementation.

bigdata cheatsheet computer-science convolutional-neural-networks data-science deep-learning dl excel hadoop keras linux neural-networks numpy pig plotly probability sql statistics

Last synced: 01 May 2026

https://github.com/jaketherealone/machinelearningtoolset

Collection of primary tools used in the field of machine learning - Written for numpy projects

machine-learning numpy python toolset

Last synced: 19 Jun 2026

https://github.com/radithsandeepa/robot-path-visualizer

Welcome to Robot Path Visualizer! This tiny project uses Python, Pygame, and NumPy to replay a precomputed trajectory (x, y, θ). It rotates a robot sprite frame-by-frame and draws a trail, making it handy for diff-drive path playback, simulation demos, and quick visual debugging of robotics experiments.

animation numpy pygame python robot-simulation robotics trajectory-visualization

Last synced: 18 Sep 2025

https://github.com/abhishek-patidar066/synthetic-datasets

Synthetic datasets are artificially generated data used for training machine learning models, simulating real-world data while ensuring privacy.

clustering datasets jupyter-notebook libraries matplotlib-pyplot numpy pandas-dataframe python random sklearn

Last synced: 15 Apr 2026

https://github.com/anushkapal28/maatri-care

MaatriCare is an AI-based healthcare application designed to provide personalized nutrition advice and maternal care recommendations for pregnant women, lactating mothers, and women in general.

gemini-api jupyter-notebook labelencoder numpy pandas python3 random-forest streamlit svm-training

Last synced: 18 Sep 2025

https://github.com/mayankmittal29/tensortinker_statistical_methods_in_ai

This repository contains implementations of various machine learning algorithms from scratch, including Multi-Layer Perceptron (MLP), Gaussian Mixture Models (GMM), Principal Component Analysis (PCA), Autoencoders, and Variational Autoencoders.

autoencoder-mnist cupy gmm-clustering image-segmentation matplotlib-pyplot mlp-classifier mlp-regressor mnist-dataset numpy pandas pca python3 pytorch roc-auc seaborn torch variational-autoencoder

Last synced: 09 Apr 2026

https://github.com/vyjayanthipolapragada/image_classifier_cnn_data_augmentation

A deep learning project using Convolutional Neural Networks (CNNs) to classify CIFAR-10 images. The model leverages data augmentation, batch normalization, and ReLU activation to improve performance and generalization. Includes training and evaluation scripts for multi-class image classification.

adam-optimizer convolutional-neural-networks data-augmentation deep-learning image-classification jupyter-notebook neural-networks numpy optimizer pandas python pytorch relu-layer

Last synced: 09 Apr 2026

https://github.com/v-mayya/resale-revolution

Empower Hacks Hackathon Submission

html matplotlib numpy pandas python sqlite streamlit

Last synced: 09 Apr 2026

https://github.com/shafaq-aslam/predicting-heart-disease-risk-with-logistic-regression-techniques

Develop a predictive model using logistic regression techniques to assess heart disease risk based on patient health metrics and data analysis.

data-analysis heart-disease logistic-regression machine-learning machine-learning-models matplotlib numpy pandas python scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/n-t-raghava/the_sweet_16

This project detects faces in real-time from a webcam feed or an uploaded image and predicts age (in bins of 5 years) and gender (Male/Female). The model is based on OpenCV’s Deep Neural Network (DNN) module and pre-trained models for face, age, and gender detection.

caffemodel deep-learning flask numpy opencv python

Last synced: 09 Apr 2026

https://github.com/haseebulhassan437/lstm-next-word-predictor

A deep learning-based LSTM model for predicting the next word in a sequence using natural language processing techniques.

keras-tensorflow lstm numpy tensorflow

Last synced: 18 Apr 2026

https://github.com/ghaniketrajputp005/adobe-gensolve-hackathon-2024

This Project aims to identify, regularize, and beautify curves in 2D Euclidean space.

cv2 keras-tensorflow numpy os sklearn

Last synced: 07 Feb 2026

https://github.com/mallocode300/colour_palette_generator

Upload an image and immediately obtain a colour palette of its 10 most common colours as HEX codes and RGB values.

numpy python

Last synced: 29 Jul 2025

https://github.com/ledsouza/MedPhys-BI

Application developed for managing and analyzing data from the Medical Physics Service in Nuclear Medicine at HCPA.

aws boto3 cloud-computing cloud-run docker dotenv gcp mongodb numpy os pandas plotly pymongo python s3 streamlit terraform vitrinedev yaml

Last synced: 22 Sep 2025

https://github.com/werctfourth/python-border-autocrop2

Python script that crops borders from images v2

border crop image-processing libvips numba numpy python python3

Last synced: 17 May 2026

https://github.com/sakshithbillava/expense-manager

A web-based expense tracking app built with Python and Streamlit, featuring real-time updates, data visualization, user authentication, and MongoDB integration.

authentication data-visualization expense-manager matplotlib mongodb numpy pandas personal-finance python streamlit webapp

Last synced: 09 Apr 2026

https://github.com/mrandolph95/grad-projects

Projects completed during my ASU grad program for Social Data Science.

beautifulsoup nltk numpy pandas regression-analysis sckiit-learn seaborn selenium

Last synced: 09 Apr 2026

https://github.com/niteshchawla/aerofit-casestudy

The market research team at AeroFit wants to identify the characteristics of the target audience for each type of treadmill offered by the company, to provide a better recommendation of the treadmills to the new customers. The team decides to investigate whether there are differences across the product with respect to customer characteristics.

matplotlib-pyplot numpy pandas probability python seaborn statistics

Last synced: 09 Apr 2026

https://github.com/estaheri7/dotawinpredictor

This project aims to predict the outcome of Dota 2 matches based on the heroes picked by both the Radiant and Dire teams

data-science dota2 machine-learning numpy pandas python random-forest sklearn support-vector-machines

Last synced: 09 Mar 2026

https://github.com/manishkumarpatel07/heartattack_risk_prediction

"Heart Attack Risk Prediction" uses machine learning to estimate the likelihood of a heart attack based on user-provided data like physical attributes, symptoms, and medical history. This system enables remote screening, identifying high-risk individuals, and easing medical system burdens by providing early, data-driven health risk assessments.

boruta knn-algorithm matplotlib numpy pandas python scikit-learn

Last synced: 09 Apr 2026

https://github.com/L-Gre/TradeFX-AI

A Python-based project exploring algorithmic trading strategies, including backtesting, real-time data integration, and predictive modeling with TensorFlow and Keras. Key topics include technical indicators, risk management, and leveraging AWS and broker APIs for automated trading.

algorithmic-trading aws backtesting broker-api keras machine-learning matplotlib numpy pandas portfolio-optimisation python real-time-data-integration risk-management technical-indicators tensorflow

Last synced: 26 Sep 2025

https://github.com/ashishsingh789/bcg_virtual_internship

This repository showcases my BCG X virtual internship project on customer churn analysis for PowerCo, covering business understanding, EDA, feature engineering, and modeling using Python and machine learning.

data-manipulation data-science dataanalysis datavisualization eda machine-learning matplotlib numpy pandas python random-forest scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/sultanazhari/the-effect-of-weather-on-travel-frequency

The Zuber company needs to understand passenger preferences and the effect of external factors on trips. As data analysts, we provide actual data on what affects travel frequency.

matplotlib-pyplot numpy python3 scipy seaborn

Last synced: 03 Aug 2025

https://github.com/kevin-rsj/real-estate-investments

Scoring system that ranks French cities for second-home investment by risk profile (high, moderate, and low). It evaluates key ratios in areas such as demand, availability, infrastructure, demographics, and prices.

bigquery data-analytics looker-studio numpy pandas python sklearn-library sql visualization

Last synced: 09 Apr 2026

https://github.com/jim-by/scrape_analysis_books

Scraping book data from the website books.toscrape.com and performing analysis on the collected data.

beautifulsoup json matplotlib numpy pandas python scipy scraping seaborn

Last synced: 09 Apr 2026

https://github.com/jose-zothner-meyer/finance-individual-loans

A comprehensive analysis of a financial institution's loan portfolio using Python, focusing on data preprocessing, exploratory data analysis (EDA), and feature engineering.

credit data-science data-visualization eda exploratory-data-analysis finance jupyter-notebook matplotlib numpy pandas plotly python scipy seaborn

Last synced: 09 Apr 2026

https://github.com/lyubov0406/data_analyst_portfolio

This repository collects pet projects that demonstrate my data analytics skills.

data-analysis matplotlib numpy pandas portfolio python scipy seaborn sql tableau visualization

Last synced: 09 Apr 2026

https://github.com/RedInfinityPro/PygameCicruit

Rating: (6/10) The code combines a Pygame GUI application with a PySpice circuit simulation, providing a user-friendly interface with buttons, menu options, dropdown menus, input boxes, sliders, and text elements.

matplotlib numpy os pygame pyspice random sys

Last synced: 30 Sep 2025

https://github.com/RedInfinityPro/DrawImages

Rating: (7/10) The code uses Python's Turtle module to create a canvas image, converting it to grayscale, and then placing dots at each pixel's location based on its intensity.

numpy pil random turtle

Last synced: 30 Sep 2025

https://github.com/mandeep04sharma/ev-cars-analysis---python

This project dives deep into electric vehicle specifications and performance using Python. It involves data cleaning, filtering, statistical analysis, outlier detection, and hypothesis testing to uncover meaningful insights.

matplotlib numpy pandas python scipy

Last synced: 09 Apr 2026

https://github.com/sebastianofazzino/ibm-data-science-professional-certificate

In this repository I've stored exercises and projects I've been working on while attending IBM Data Science Professional Certificate, using Python and its libraries.

data-analysis data-mining data-science data-structures data-visualization database machine-learning matplotlib numpy pandas python regression seaborn sql

Last synced: 09 Apr 2026

https://github.com/prashhhant213/exploratory-data-analysis-for-multinational-retail-corporation

Analysis via CLT and Visualization on Multinational Retail Corporation's data to provide insights and recommendations to improve their userbase.

math matplotlib numpy pandas python scipy-stats seaborn stats

Last synced: 09 Apr 2026

https://github.com/bearddan2000/python-web-3d-matplotlib-stem-graph

A demo of creating a 3d scatter parametric curve and line graph.

3d graph matplotlib numpy pandas python stem web

Last synced: 09 Apr 2026

https://github.com/raghulrajn/machine-learning-d-r-y

This repository contains quick Python scripts that are used repeatedly for EDA on datasets.

data-science numpy pandas python

Last synced: 09 Apr 2026

https://github.com/samanthalang/samanthalang_portfolio

A data analyst with a consumer's perspective and a marketer's strategy.

bigquery excel figma mysql notebook numpy pandas postgresql powerbi powerquery python sql sqlite wordpress

Last synced: 09 Apr 2026

https://github.com/pem-humboldt/wiutils

Exploration utilities for Wildlife Insights projects.

biodiversity camera-trap images matplotlib numpy pandas python seaborn trail-cameras wildlife-insights

Last synced: 09 Aug 2025

https://github.com/ayan6943/employee-attrition-prediction-with-machine-learning

Employee Attrition Prediction with Machine Learning | Analyzing HR data to predict employee turnover using Random Forest. Includes EDA, feature engineering, model training, and evaluation. Achieved 90% accuracy.

attrition employee machine-learning matplotlib numpy pandas python randomforestclassifier scikit-learn seaborn smote

Last synced: 09 Apr 2026

https://github.com/blackcub3s/msc-finalthesis

The most important programming files, code functions and data processing pipelines for the Machine learning final thesis of my Master's degree. Also, the LaTeX code of the thesis.

data-analysis latex machine-learning numpy python sklearn

Last synced: 09 Apr 2026

https://github.com/ananupam/eda-stock_market

An exploratory data analysis project on financial data read from a search engine (Yahoo) to predict stock trends for 4 tech companies.

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 06 May 2026

https://github.com/paulomppatricio/projeto_challenge_telecomx-br_parte-2

Challenge TelecomX-BR_Parte-2 project, from the Data Science track of the ONE (Oracle Next Education) program in partnership with Alura.

data-science imblearn joblib machine-learning matplotlib modelos-preditivos numpy pandas python scipy seaborn sklearn statsmodels xgboost

Last synced: 12 Apr 2026

https://github.com/nazmul-1117/100-days-of-machine-learning

I'm Nazmul, and I'm so excited to start a new journey: 100 Days of Machine Learning. It's February 8, 2025. Let's see what happens, insha'Allah.

data-science machine-learning numpy pandas-dataframe python3 scikit-learn statistics

Last synced: 11 Aug 2025

https://github.com/ashish-kr-srivastava/olympic-games-eda---python

Exploratory data analysis of a historical Olympic Games dataset, including all the Games from Athens 1896 to Rio 2016.

data-visualization datacleaning eda matpotlib numpy pandas python seaborn seaborn-python

Last synced: 09 Apr 2026

https://github.com/hansalemaos/group_by_continuous_sequence

Groups lists/NumPy arrays by continuous sequence

group list numpy python

Last synced: 19 Feb 2026
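
The idea of grouping by continuous sequence can be sketched with plain NumPy: split the array wherever the gap between neighbours is not exactly 1. The function name `group_consecutive` is hypothetical and is not the API of the package above. This is only one possible approach, assuming a one-dimensional sequence of integers.

```python
import numpy as np

def group_consecutive(values):
    """Split a 1-D integer sequence into runs of consecutive values.

    Hypothetical helper for illustration; not the listed package's API.
    """
    arr = np.asarray(values)
    if arr.size == 0:
        return []
    # Positions where the step to the next element is not exactly +1.
    breaks = np.where(np.diff(arr) != 1)[0] + 1
    return [chunk.tolist() for chunk in np.split(arr, breaks)]

groups = group_consecutive([1, 2, 3, 7, 8, 10, 11, 12, 20])
print(groups)  # [[1, 2, 3], [7, 8], [10, 11, 12], [20]]
```

`np.diff` yields the step between neighbours, so each index where the step differs from 1 marks the start of a new group, and `np.split` cuts the array at those indices.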

https://github.com/micmurawski/cloud-array

cloud-array is an open-source Python library for storing and streaming large NumPy arrays on local file systems and major cloud providers' CDNs.

aws azure big-data bigarray blob-storage cloud data-structures digitalocean-spaces gcp gcp-cloud-storage ibm-cloud-object-storage numpy s3 stream-processing streaming zadara

Last synced: 12 Aug 2025

https://github.com/mani-prakash-n-r/book_recommendation_system_using_hybrid_technique

The Book Recommendation System uses collaborative and content filtering techniques to suggest books based on user preferences and content similarity.

csv html numpy pandas python sckiit-learn

Last synced: 09 Apr 2026

https://github.com/adi3042/diamond-price-prediction

🔮💎 Explore the Diamond Price Oracle! Predicting diamond values based on your preferences and market trends. Your ultimate tool for informed decisions in the diamond world. Join us for the future of pricing! 💎✨🚀 #DiamondPredictor

anaconda datetime diamond flask functools html ipykernel jupytrnotebooks matplotlib numpy pandas price-prediction readme regression sckiit-learn setuptools venv

Last synced: 09 Apr 2026

https://github.com/niteshchawla/netflix-case-study

Analyze the data and generate insights that could help Netflix in deciding which type of shows/movies to produce and how they can grow the business in different countries.

matplotlib numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/alphacrypto246/old-car-price-prediction

The Old Car Price Prediction project predicts used car prices using features like age, mileage, and fuel type. It includes data preprocessing, model training, and visualization of trends, with easy customization for additional features or models.

machine-learning numpy pandas scikit-learn scikitlearn-machine-learning

Last synced: 09 Apr 2026

https://github.com/ashraf-khabar/bank-marketing-data-analysis

This project is focused on analyzing bank marketing data using PyTorch, pandas, numpy, and scikit-learn. The goal is to build a predictive model that can help identify potential customers who are more likely to subscribe to a bank's term deposit.

data-cleaning data-science data-visualization dataset deep-learning deep-neural-networks feedforward-neural-network learning neural-networks numpy pandas python pytorch sklearn

Last synced: 09 Apr 2026

https://github.com/chandkund/loan-eligibility-prediction

This project is designed to predict the eligibility of loan applicants based on various factors such as income, credit history, and marital status. By analyzing historical loan application data, the model helps to determine whether a loan application should be approved or not.

data-analysis data-science data-visualization machine-learning-algorithms matplotlib numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/sukanyadutta52/topic_modeling

What are the most pressing concerns regarding ‘Climate Change’ among tweeters according to Topic Modeling?

climate-change gensim matplotlib nltk numpy pandas pyldavis regular-expression spacy

Last synced: 09 Apr 2026

https://github.com/abdullah-masood-05/eye-balll-detection

This repository provides advanced eye gaze tracking using MediaPipe Face Mesh for both video files and live camera feeds.

mediapipe numpy opencv scipy

Last synced: 10 Apr 2026

https://github.com/harsha-yuvaraj/data-compression-and-decompression-tool

A specialized lossless file compression and decompression tool designed mainly for plain text, including programming files.

data-structures huffman-compression-algorithm lossless-compression-algorithm numpy python tkinter

Last synced: 15 Apr 2026

https://github.com/subhadipsinha722133/credit-card-fraud-dection

Web application for detecting fraudulent credit card transactions using machine learning

data-visualization fraud-detection machine-learning matplotlib numpy pandas seborn sklearn streamlit

Last synced: 10 Apr 2026

https://github.com/kylehperez/mushroomnet

This API is for research purposes ONLY and is NOT to be used for food-safety or medical advice. Mushroomnet is a machine learning neural network for classifying mushrooms as poisonous or edible. The model was trained on data from uni.edu: over 5,000 instances of mushrooms, with 22 qualitative features and a determination of poisonous or edible.

artificial-intelligence botany flask-api machine-learning mycology neural-network numpy python3 pytorch scikit-learn

Last synced: 09 Apr 2026

https://github.com/izhaan0/predict-marks-based-on-study-hours

Student Marks Predictor is a machine learning project that predicts a student’s exam scores based on the number of study hours. It uses Linear Regression to learn the relationship between study hours and marks, and provides both command-line and interactive Streamlit web interfaces for prediction and visualization.

data-visualization joblib jupyter-notebook machine-learning machine-learning-algorithms matplotlib-pyplot numpy pandas pandas-dataframe pickle python scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/hanannazri/predictive-customer-churn-modelling-with-price-sensitivity-insights-

As part of a BCG data science project, I developed a predictive churn-risk model for XYZ energy utility by engineering price, consumption, and contract features and training XGBoost and Random Forest models to identify customers most likely to churn at current prices.

bcgx data-visualization exploratory-data-analysis feature-engineering jupyter jupyter-notebook model-training-and-evaluation numpy pandas random-forest scikit-learn xgboost

Last synced: 19 Aug 2025

https://github.com/erikaax08/numerical_methods

This repository hosts the project developed for the Numerical Methods class at the Benemérita Universidad Autónoma de Puebla (BUAP)

django maths matplotlib numerical-methods numpy plotly

Last synced: 19 Aug 2025

https://github.com/avin27/credit-card-fraud-detection

A python script to analyse credit card data to detect fraudulent transactions

data-science numpy pandas sklearn

Last synced: 10 Apr 2026

https://github.com/rtmigo/skifts_py

Search for the most relevant documents containing words from a query. Uses scikit-learn and NumPy.

cosine-similarity information-retrieval numpy python scikit-learn text-mining tf-idf

Last synced: 19 Aug 2025
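
The retrieval approach suggested by the topic tags above (TF-IDF weighting plus cosine similarity) can be sketched as follows. This is a toy version in plain NumPy under stated assumptions, not the code of the package above, which its description says uses scikit-learn. The corpus, the `counts`, `normalize`, and `rank` helpers, and the smoothed IDF formula are all illustrative choices.

```python
import numpy as np

# Toy corpus (illustrative only).
docs = ["the cat sat on the mat", "dogs chase cats", "numpy arrays are fast"]
tokenized = [d.split() for d in docs]
vocab = sorted({w for doc in tokenized for w in doc})
index = {w: i for i, w in enumerate(vocab)}

def counts(tokens):
    """Term-count vector over the vocabulary; unknown words are ignored."""
    v = np.zeros(len(vocab))
    for w in tokens:
        if w in index:
            v[index[w]] += 1
    return v

tf = np.array([counts(t) for t in tokenized])
df = (tf > 0).sum(axis=0)                      # document frequency per term
idf = np.log((1 + len(docs)) / (1 + df)) + 1   # smoothed IDF (one common variant)

def normalize(m):
    """Scale vectors to unit length, leaving all-zero vectors unchanged."""
    norms = np.linalg.norm(m, axis=-1, keepdims=True)
    return m / np.where(norms == 0, 1, norms)

doc_vecs = normalize(tf * idf)

def rank(query):
    """Return document indices sorted by cosine similarity, plus the scores."""
    q = normalize(counts(query.split()) * idf)
    scores = doc_vecs @ q
    return np.argsort(-scores).tolist(), scores

order, scores = rank("fast numpy")
print(order[0], scores)
```

Because both the document vectors and the query vector are normalized to unit length, the dot product is the cosine similarity, so the best match is simply the document with the largest score.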

https://github.com/nickklos10/compressive-strenght-prediction

This project predicts concrete compressive strength using a neural network regression model built with Keras.

jupyter-notebook keras matplotlib numpy pandas python scikit-learn

Last synced: 10 Apr 2026

https://github.com/hariprasath-v/hackerearth_transunion-data-science-analytics-hiring-challenge_2022

Machine learning model to classify credit scores based on people's bank details and credit-related information.

catboost exploratory-data-analysis klib machine-learning matplotlib numpy optuna pandas python seaborn sklearn

Last synced: 10 Apr 2026

https://github.com/mayankmittal29/pagesage-ai_urls_content_scraper_webapp

PageSage is a modular web scraping and semantic retrieval system using BeautifulSoup for extraction, SentenceTransformers for dense embeddings, and ChromaDB for scalable vector search—enabling efficient data indexing for advanced RAG pipelines.

beautifulsoup chromadb fastapi gpt-turbo numpy openai-api python3 question-answering rag react requests sentence-transformers tailwindcss web-scraping

Last synced: 10 Apr 2026

https://github.com/ledsouza/machine-learning-introducao-a-classificacao-com-sklearn

Explores classification with machine learning, analyzing algorithms on everyday examples, comparing techniques, and understanding the machine learning behind them, with a scientific approach.

graphviz matplotlib numpy pandas python seaborn sklearn

Last synced: 10 Apr 2026

https://github.com/oyebamiji-micheal/data-analysis-with-python-zero-to-pandas

This repository contains all the assignments and the project I completed for the course "Data Analysis with Python: Zero to Pandas" on Jovian.

data-analysis numpy pandas python

Last synced: 10 Apr 2026

https://github.com/rotantos/ml-projects

Includes various ML projects created for educational purposes.

classification-model jupyter-notebook machine-learning numpy pandas python sklearn-library

Last synced: 10 Apr 2026

https://github.com/sc0v0ne/blueflix

Simple application that recommends movies and TV shows.

csv k-means k-means-clustering kaggle movies numpy pandas python recommends-movies sklearn streamlit

Last synced: 10 Apr 2026

https://github.com/sawaira-iqbal/data-visualization-project-on-car-sales-data

Explore Sales Data Visualization with Interactive Charts & Insights! 📊 Uncover trends and patterns to drive smarter automotive decisions.

bivariate-analysis data-science data-visualization interactive-visualizations matplotlib multivariate-analysis numpy plotly python seaborn univariate-analysis

Last synced: 12 Apr 2026

https://github.com/bdurga26/fake-news-detection

This project aims to determine whether a given news item is real or fake using machine learning algorithms.

ds ml numpy pandas python sklearn

Last synced: 14 Apr 2026

https://github.com/tomdewildt/google-foobar

Solutions for the Google Foobar problems implemented in Python

foobar google numpy pylint python

Last synced: 22 Feb 2026