An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/manoj-kumar-c/data_analyst_projects

Live Data Analyst Projects.

excel numpy pandas powerbi

Last synced: 07 Feb 2026

https://github.com/chandkund/iris-classification

Iris Classification project using machine learning to classify iris flowers into Setosa, Versicolour, and Virginica species. Includes data exploration, model training with SVM, Logistic Regression, and Decision Tree, and performance evaluation. Contributions welcome! Python, Pandas, Scikit-learn, Seaborn, Matplotlib.

machine-learning matplotlib-pyplot numpy pandas python sklearn

Last synced: 21 Jan 2026

https://github.com/nambukeerthi/youtube_dataproject

Build a Streamlit App – A user-friendly interface. Use Google API – Fetch YouTube channel data. Store in SQL Database – Save channel details for easy access. Search & Join Tables – Allow users to query and view data

api numpy pandas python sql streamlit

Last synced: 09 Apr 2026

https://github.com/yosrak5/predictive_maintenance

End-To-End Predictive Analytics to predict Hardware Maintenance Before it occurs using data preprocessing and Machine Learning classification modeling (XGBOOST , Random Forest )

data-factory data-perspective machine-learning-workbench matplotlib numpy pandas python random-forest seaborn xgboost

Last synced: 25 Feb 2026

https://github.com/shwetapardhi/assignment-1-q11--basic-statistics-level-1

Q11) Suppose we want to estimate the average weight of an adult male in Mexico. We draw a random sample of 2,000 men from a population of 3,000,000 men and weigh them. We find that the average person in our sample weighs 200 pounds, and the standard deviation of the sample is 30 pounds. Calculate 94%,98%,96% confidence interval?

confidence-interval normalization numpy pandas python scipy stats

Last synced: 13 Apr 2026

https://github.com/daksh777/expressando

Expressando is a rock, paper, scissors game made using Python (TensorFlow, Keras and OpenCV)

keras keras-tensorflow machine-learning numpy opencv opencv-python tensorflow

Last synced: 13 Apr 2026

https://github.com/murugavl/loan_approval_prediction

This project is a Loan Approval Prediction System that uses Machine Learning to determine whether a loan application should be approved or rejected based on various factors. It is deployed using Streamlit for an interactive user experience.

jupyter-notebook machine-learning numpy pandas python random-forest-classifier scikit-learn stremlit

Last synced: 13 Apr 2026

https://github.com/pedasoft-consult/house-price-prediction

This project is a machine learning-powered web application for predicting house prices based on various features such as area, bedrooms, bathrooms, and more. It uses Flask for the backend and HTML/CSS/JavaScript for the frontend.

flask joblib matplotlib numpy pandas r2 random-forest regression seaborn sklearn

Last synced: 13 Apr 2026

https://github.com/gustavo-venceslau/housing-price

A Machine Learning project to create a model to predict housing prices. The project is based on the example in chapter 2 of the book Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow by Aurélion Géron

conda machine-learning matplotlib numpy pandas python sklearn

Last synced: 13 Apr 2026

https://github.com/bachboy0/montecarlo

Python-based Monte Carlo simulations for statistical analysis and modeling.

monte-carlo-methods numpy python

Last synced: 21 Apr 2026

https://github.com/barkerbg001/framefusion

Python script that creates a video from a set of images and adds a random audio clip to each video

glob moviepy numpy opencv pygt5 python python-3 random video video-editor

Last synced: 16 Apr 2026

https://github.com/ayhem18/tinyad

A simple Python library to better understand automatic Differentiation

automatic-differentiation mathematical-programming numpy python3

Last synced: 29 Apr 2026

https://github.com/dineshdhamodharan24/industrial-copper-modeling

It seems like you have a project that involves modeling industrial copper data using Python and several libraries such as pandas, numpy, and scikit-learn. This is a common and practical approach, as these libraries are widely used for data manipulation, analysis, and machine learning tasks.

bussiness-solution numpy pandas pickle pickle-file seaborn sklearn-library streamlit

Last synced: 13 Apr 2026

https://github.com/hansalemaos/cinpy

C/C++ in Python for Dummies

c cpp ctypes numpy python

Last synced: 13 Apr 2026

https://github.com/companyakis/numpy

Python Numpy

data-science numpy python

Last synced: 04 Sep 2025

https://github.com/alphacrypto246/stock-price-movement-prediction

A project leveraging Polynomial Regression to predict stock price movements based on historical data. Includes data preprocessing, feature engineering, visualization, and model evaluation to provide insights for informed trading decisions.

machine-learning numpy pandas polynomial-regression scikit-learn yfinance

Last synced: 13 Apr 2026

https://github.com/jim-by/tweet-sentiment-analysis

Sentiment analysis of tweets using TextBlob for labeling and RandomForest for classification.

nltk nltk-tokenizer numpy pandas python random-forest-classifier sklearn textblob-sentiment-analysis

Last synced: 13 Apr 2026

https://github.com/ellennewerkla/apriori-analise_cesta_de_mercado

Análise de Cesta de Mercado (Market Basket Analysis) utilizando o algoritmo Apriori para identificar regras de associação entre produtos em um conjunto de transações de supermercado.

apriori-algorithm jupyternotebook marketbasketanalysis mathplotlib mba numpy pandas python3

Last synced: 13 Apr 2026

https://github.com/williamfridh/umu_5tf078_paper_keyboard_tracker

Project exploring how a CNN alongside computer vision can be used to detect drawn keys and track finger input,.

cnn-classification computer-vision keras-tensorflow numpy python tensorflow

Last synced: 07 Jan 2026

https://github.com/karanyeole/movie-rating-analysis

analyzing movie ratings from various sources to gain insights into audience preferences, trends, and patterns in movie ratings. The project utilizes data analysis and visualization techniques to explore and interpret movie rating data, providing valuable information for movie enthusiasts, filmmakers, and industry professionals.

matplotlib numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/shahiakhilesh1304/fitbitcasestudy

This is a case study based on data retrieved from a Fitbit band, and we are making predictions about human behavior based on their mood.

case-study data-visualization fitbit jupyter-notebook numpy python3

Last synced: 13 Apr 2026

https://github.com/rkarahul/ok.win-big-small-predictor

Predict the next “Big” or “Small” outcome on the OK.Win lottery-style game using OCR + time-series features + ML.

joblib numpy opencv-python paddleocr paddlepaddle paddlepaddle-gpu pandas python sckit-learn

Last synced: 13 Apr 2026

https://github.com/preethiphilip/prosper-loan-data-analysis

This project explores the Prosper loan dataset to uncover trends, borrower behaviors, and factors influencing loan performance. Using Python and data visualization techniques, the analysis examines key variables such as credit scores, loan status, income ranges, and debt-to-income ratios to gain insights into credit risk and lending patterns.

jupyter-notebook matplotlib numpy pandas python3 seaborn

Last synced: 13 Apr 2026

https://github.com/miss-mhv/udacityproject_dataviz

Data analysis of a bicycle rental system. The goal is to understand how trips are made according to age, time of year, day of the week, time of day

jupyter-notebook matplotlib numpy pandas python3 seaborn-plots

Last synced: 13 Apr 2026

https://github.com/sunilsm7/numpy-starter

IPython Notebook(s) demonstrating NumPy functionality.

numpy python

Last synced: 13 Apr 2026

https://github.com/rajsinha7/hand-gesture-recognition

This project serves as a foundation for integrating hand gesture controls into interactive applications like games, presentations, or robotic controls. It features: Real-time hand tracking with MediaPipe Hands and Live visualization of hand landmarks and gesture labels

mediapipe numpy opencv python

Last synced: 13 Apr 2026

https://github.com/alinenog/desenvolve_gb_2022

Formação Desenvolve 2022 do Grupo Boticário na área de dados

data-analysis data-science googlesheet machine-learning numpy pandas python

Last synced: 13 Apr 2026

https://github.com/kartikeya443/handgesture_vol_control

This OpenCV project enables remote control of sound playback using wireless communication. It lets users adjust the master volume of the central device using finger gestures, enhancing convenience and user experience.

mediapipe numpy opencv pycaw wireless-communication

Last synced: 13 Apr 2026

https://github.com/allanotieno254/pinecone-index-creation-vector-store-database-

This project demonstrates how to create an index in Pinecone, a vector database optimized for similarity search and machine learning applications. The notebook covers setting up a Pinecone client, creating an index, and managing vector data efficiently.

index jupyter-notebook numpy pinecone python

Last synced: 13 Apr 2026

https://github.com/mike014/soundbot

This Repository will contain the prototype of a ChatBot created for the playback of audio, or everything related to audio in the world of video games and not.

chatbot dsp nlp nlp-library nlp-machine-learning numpy python scipy sound-processing

Last synced: 13 Apr 2026

https://github.com/vasu7052/recognizing-handwritten-digits

This is a machine learning project created in Python using Neural Networks and Supervised Learning Algorithms.

machine-learning machine-learning-algorithms numpy python scikit-learn

Last synced: 13 Apr 2026

https://github.com/mahdibehoftadeh/polynomial-regression-co2-emissions

A simple machine learning polynomial regression using a large dataset to learn and predict CO2 emission of a car by its built features like engine size and cylinders

machine-learning matplotlib numpy nural-network pandas polynomial-regression python scikit-learn

Last synced: 22 Feb 2026

https://github.com/goessl/adjugate

A package for calculating submatricies, minors, adjugate- and cofactor matrices.

adjugate cofactor determinant inverse linear-algebra matrix minor numpy python submatrix

Last synced: 07 Jan 2026

https://github.com/debjyotisaha/web-application-projects

Web Applications developed with the help Python libraries and ML algorithms

algorithms cufflinks machine-learning matplotlib numpy pandas prediction python scikit-learn seaborn sklearn streamlit

Last synced: 13 Apr 2026

https://github.com/siddhi-c19/tourism_sid_tournaut

A comprehensive analysis of tourism data of various countries

data-science dataset mathplot mathplotlib numpy plotly statistics streamlit

Last synced: 06 May 2026

https://github.com/mehwishferoz/data-analysis-with-python-zero-to-pandas

This repository contains the Python code and projects I created while learning the Data Analysis with Python: Zero to Pandas course. The course covers essential topics such as data cleaning, analysis, and visualization using powerful Python libraries like Pandas, NumPy, Matplotlib, and Seaborn.

exploratory-data-analysis jovian matplotlib numpy pandas seaborn

Last synced: 10 May 2026

https://github.com/athul64/tmdb-dataset-analysis

This data set contains information about 10,000 movies extracted from TMDB. The dataset contains movies from 1960 to 2015. Including user ratings and revenue. Original data from Kaggle.

data-visualization dataframe eda numpy pandas python

Last synced: 14 Apr 2026

https://github.com/prateekrajsrivastav/financial-transition-classification

This project focuses on classifying financial transactions using machine learning techniques. By leveraging labeled data, the model aims to categorize transactions into predefined categories (e.g., "Food," "Transport," "Shopping," etc.).

matplotlib numpy pandas-python scikitlearn-machine-learning seaborn

Last synced: 07 Sep 2025

https://github.com/murugavl/customer_churn_prediction

Customer churn prediction is a crucial task in business analytics, helping companies identify customers who are likely to leave and take preventive actions. This project utilizes Machine Learning (ML) techniques to analyze customer behavior and predict churn.

jupyter-notebook machine-learning matplotlib numpy pandas pickle python scikit-learn streamlit

Last synced: 13 Apr 2026

https://github.com/ljadhav25/data-engineering-poc

This repository contains a beginner-level Data Engineering Proof of Concept (POC) project designed for practice. The objective is to provide hands-on experience with data engineering concepts, including data extraction, transformation, loading (ETL), and basic data analysis. This project is ideal for those looking to build foundational skills in da

data-analysis etl matplotlib numpy pandas python

Last synced: 13 Apr 2026

https://github.com/paulo-santos-ds/guia_de_precos_de_veiculos_com_machine_learn

Sistema de predição de preços de carros usados desenvolvido para a Empresa Rusty Bargain

catboost lgbm numpy pandas pyplot python seaborn sklearn time

Last synced: 13 Apr 2026

https://github.com/cezlul/analyse-ventes-immobilier

Solution ML d'analyse immobilière parisienne : classification automatique appartements vs commerces (K-means, 91%) et prédiction prix (régression, R²=0.98) sur 26K transactions. Valorise portefeuille 169M€ avec recommandations stratégiques data-driven.

data-analysis jupyter-notebook machine-learning matplotlib numpy pandas python sklearn

Last synced: 13 Apr 2026

https://github.com/lexiortiz/ibm-data-engineering-fundamentals

Notes, exercises, and projects from the IBM Data Engineering Fundamentals path via Verizon Skill Forward.

data-engineering numpy pandas postegresql python sql

Last synced: 13 Apr 2026

https://github.com/samestrin/image-manipulation-api-digitalocean

A Python DigitalOcean App Platform based REST image manipulation API using Flask, NumPy, and OpenCV that runs in a Docker container.

api docker flask image image-processing numpy python python3 rest rest-api

Last synced: 13 Apr 2026

https://github.com/bbhargavpanchal/brain-tumor-segmentation-unet

U-Net based Convolutional Neural Network for brain tumor segmentation using MRI images. Implemented in Python with TensorFlow, OpenCV, and Scikit-learn.

keras-tensorflow numpy opencv pandas python scikit-learn sklearn tensorflow

Last synced: 13 Apr 2026

https://github.com/oleksiilatypov/titanic_ml

Titanic Survival Prediction using ML and Flask Deployment.

data-science docker flask machine-learning numpy pandas skit-learn

Last synced: 13 Apr 2026

https://github.com/ksprugevics/magic-square

Some simple algorithm implementations for solving (ordinary) magic squares using NumPy

algorithms mathematics numpy python

Last synced: 21 Apr 2026

https://github.com/danhenriquex/data_science_and_machine_learning

A.I, Data Science & Machine Learning and Deep Learning with Tensorflow

deep-learning deep-neural-networks machine-learning matplotlib numpy pandas tensorflow

Last synced: 13 Apr 2026

https://github.com/ivancaez/analisis_dades_microbit

Data analysis of Micro:bit with maplotlib, numpy and pandas

csv jupyter-notebook matplotlib microbit numpy pandas python

Last synced: 13 Apr 2026

https://github.com/debjyotisaha/deep-learning-projects-phase-1

Deep Learning Projects which demonstrate the use of Python and its complex ML algorithms

deep-learning machine-learning neural-network numpy opencv pandas python seaborn streamlit

Last synced: 13 Apr 2026

https://github.com/chandkund/predicting-diabetes-onset

The "Predicting Diabetes Onset" project aims to build a machine learning model that predicts whether an individual has diabetes based on various health-related features. The dataset used for this project includes attributes related to medical history and physical measurements.

deep-learning numpy pandas python seaborn sklearn-library sklearn-metrics visualization

Last synced: 13 Apr 2026

https://github.com/armahdavi/data_analytics_statistics_plotting_pm_airborne_sampling

All codes for the data pipelines processing, statistical modellings, descriptive statistics and plot visualizations from airborne phase of Mahdavi et al. (2021) (Environmental Pollution) Project Miestone: 2018 - 2021

data-science data-visualization machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats statistics

Last synced: 13 Apr 2026

https://github.com/hilarionengarejr/movie-recommender-app

Sentiment analysis on user reviews for movie recommendations using Content Based Filtering.

docker flask nltk numpy pandas python3 scikit-learn selenium

Last synced: 10 Apr 2026

https://github.com/anarya22/accenture-north-america-data-analytics-and-visualization-job-simulation-on-forage

Completed a simulation focused on advising a hypothetical social media client as a Data Analyst at Accenture. Cleaned, modelled and analyzed 7 datasets to uncover insights into content trends to inform strategic decisions. Prepared a PowerPoint deck and video presentation to communicate key insights for the client and internal stakeholders.

analyzing-visualization data-cleaning data-visualization numpy pandas powerbi powerpoint-presentations

Last synced: 09 May 2026

https://github.com/pratyush905/farecast-nyc-taxifare-predictor

Machine learning models to predict nyc taxi fare based on given dataset

jupiter-notebook kaggle machine-learning matplotlib numpy python regression-models scikit-learn

Last synced: 13 Apr 2026

https://github.com/athari22/investigating-netflix-movies-and-guest-stars-in-the-office

Apply basic Python skills in Introduction to Python and Intermediate Python by processing and visualizing film and television data.

data-analysis data-science data-visualization loop loops matplotlib matplotlib-pyplot netflix numpy office pandas python

Last synced: 11 Apr 2026

https://github.com/sanjaiveeran44/shopping-intent-analasys

This project predicts whether a visitor to an e-commerce website will make a purchase based on their browsing behavior. It includes data analysis, preprocessing, and machine learning modeling to understand online shopping patterns.

matplotlib matplotlib-pyplot numpy pandas python3 sklearn tensorflow

Last synced: 13 Apr 2026

https://github.com/eesunmoon/aml_project

[Project] Analyzing Fitness Patterns & Building Predictive Models

matplotlib numpy pandas python3 recommender-system scikit-learn seaborn tensorflow

Last synced: 13 Apr 2026

https://github.com/dmarks84/coursework_project_text-mining-topic-modeling

Project for University of Michigan Applied Data Science Specialization -- Developed functions to score similarity between text passages.

data-modeling data-reporting data-visualization databases eda nlp numpy pandas python statistics text-mining

Last synced: 12 Apr 2026

https://github.com/pinedah/sleep-data-analysis-exercise

Análisis de un dataset médico sobre el sueño, explorando duración, calidad y factores relacionados. Incluye limpieza de datos, EDA y visualizaciones con Python (pandas, numpy, matplotlib, seaborn, scipy).

data-analysis data-science escom numpy pandas python school-project scipy

Last synced: 13 Apr 2026

https://github.com/caterinatasinato/python-projects

Projects I worked on as Trainee in Data Analytics at ProfessionAI

matplotlib numpy pandas python3

Last synced: 13 Apr 2026

https://github.com/evan-dg31/data-science

Exploratory Data Analysis (EDA), Predictive Modeling (Supervised and Unsupervised), Regression, Classification, Clustering

classification clustering data-analysis data-science data-visualization machine-learning matplotlib numpy pandas python regression-analysis seaborn

Last synced: 13 Apr 2026

https://github.com/tharushiimasha/python-practice

This repository contains Python programs and exercises from practice sessions. It covers Python basics, file handling, and libraries like NumPy and pandas for data manipulation.

api-integration csv-files file-handling json-files jupyter-notebook numpy oops-in-python pandas python web-scraping

Last synced: 13 Apr 2026

https://github.com/joiceo/python

Projetos e exercícios em Python

eda machine-learning numpy pandas python seaborn sklearn

Last synced: 13 Apr 2026

https://github.com/danaelshrbiny10/gold-prices

The Egypt Gold Prices project is a data analysis and visualization initiative that focuses on tracking and understanding the daily gold prices in Egyptian pounds per gram.

data-visualization docker docker-compose matplotlib mongodb numpy pandas powerbi python3 webscraping

Last synced: 13 Apr 2026

https://github.com/darshanpakhale250-gif/customer-churn-prediction-ml

A machine learning project to predict customer churn using regression and classification models including logistic regression, decision tree, and random forest. Performed EDA, visualizations, and model evaluation. The dataset is taken from Kaggle and implemented in Google Colab.

colab-notebook customer-churn-analysis data-science decision-trees kaggle logistic-regression machine ml numpy pandas python random-forest

Last synced: 13 Apr 2026

https://github.com/raghavendranhp/youtube_data_harvesting

The "YouTube Data Analyzer" is a versatile tool for businesses and content creators, enabling them to gather, analyze, and harness valuable insights from multiple YouTube channels. With streamlined data collection, storage in MongoDB, migration to SQL, and a user-friendly Streamlit interface, it empowers users to make data-driven decisions

apiintegration data datacollection eda googleapi googleapiclient matplotlib mongodb mysql mysqlconnector numpy oops pandas pymongo python pythonoops sql sqlalchemy streamlit youtube-api

Last synced: 13 Apr 2026

https://github.com/kizman-23/supervised_models

Classical prediction of future data using models trained by labeled data

numpy pandas scikit-learn supervised-machine-learning

Last synced: 13 Apr 2026

https://github.com/blairnangle/python3-numpy-ta-lib

Docker image with NumPy and TA-Lib installed. Built and uploaded to Docker Hub using GitHub Actions.

docker numpy python3 ta-lib

Last synced: 13 Apr 2026

https://github.com/venkat-0706/accenture-hackathon

Developing an e-commerce recommendation system involves utilizing technologies such as Python for programming, Pandas for data manipulation, SQL for database management, FastAPI for building APIs, PostgreSQL for data storage, and Docker for containerization.

alembic api docker fastapi machinelearningalgorithms matplotlib numpy postgresql pydantic python3 scipy seaborn sqlmodel

Last synced: 13 Apr 2026

https://github.com/debjyotisaha/data-analytics-projects-phase-1

A set of projects on Data Analysis using Python and its libraries

matplotlib numpy pandas python seaborn sklearn streamlit

Last synced: 13 Apr 2026

https://github.com/asier-ortiz/python-for-data-science-and-machine-learning-bootcamp

Python for Data Science and Machine Learning Bootcamp: NumPy, Pandas, Seaborn, Matplotlib, Plotly, Scikit-Learn, TensorFlow, and more

matplotlib numpy pandas plotty python scikit-learn seaborn tensorflow

Last synced: 13 Apr 2026

https://github.com/badranalyst/tips-dataset-analysis-dashboard-with-streamlit-and-plotly

Interactive Streamlit dashboard analyzing the Seaborn 'tips' dataset, which records information on restaurant bills, including total bill amounts, tips, customer demographics (e.g., gender, smoking status), and dining details (e.g., day, time). Visualized with Plotly for insights into tipping patterns.

data-analysis data-analytics data-visualization dataset eda exploratory-data-analysis matplotlib matplotlib-pyplot numpy pandas plotly python seaborn streamlit

Last synced: 13 Apr 2026

https://github.com/annygutierrez/bank-ann

Basic Artificial Neural Network

keras matplotlib numpy pandas sklearn

Last synced: 13 Apr 2026

https://github.com/anujdutt9/language-model-using-rnn

Text Generation using basic RNN from scratch using Python and Numpy.

language-model numpy python rnn text-generation

Last synced: 08 May 2026

https://github.com/fearless-spider/spideroptic-cli

Spider Optic CLI is a command line application.

cement cli cli-app command-line makefile numpy oanda pypi python

Last synced: 01 May 2026

https://github.com/archishmansengupta/dnn

Digit Neural Network is a digit recognition network based on MNIST data set using numpy, pandas and matplotlib

matplotlib mnist neural-network numpy pandas python

Last synced: 13 Apr 2026

https://github.com/somyaaaaaa2004/travelling-sales-man-problem

This project is a soft computing-based solution to the Traveling Salesman Problem (TSP) using a Genetic Algorithm (GA), implemented entirely from scratch in Python on Google Colab. The goal of the TSP is to find the shortest possible route that visits each city exactly once and returns to the starting point. It's a classic NP-hard combinatorial

genetic-algorithm jupyter-notebook matplotlib-pyplot numpy pandas python travelling-salesman-problem

Last synced: 06 May 2026

https://github.com/harmanveer-2546/eda-on-indian-railways

Indian Railways is a statutory body under the ownership of the Ministry of Railways of the Government of India that operates India's national railway system. As of 2023, it manages the fourth largest national railway system by size with a track length of 132,310 km, running track length of 106,493 km and route length of 68,584 km.

clean-data eda exploratory-data-analysis geometry geopandas indian-railways json linestring matplotlib numpy os pandas plotly python railway seaborn shapely train visualization

Last synced: 09 Apr 2026

https://github.com/harmanveer-2546/tweets-cleaning-with-python

Twitter is one of the most used data sources for data analysis. The reason is that it’s open and free to collect unless you subscribe to the paid version one. Besides, it’s pretty simple to collect data from it.

filtering function-calling nlptk numpy os pandas punctuation python re removing-links tokenization tweet-cleaning twitter-data-analysis twitter-sentiment-analysis

Last synced: 10 Apr 2026