An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/gattsu001/telecom-churn-predictor

Predicts which telecom customers are likely to churn with 95% accuracy using engineered features from usage, billing, and support data. Implements Sturges-based binning, one-hot encoding, stratified 80/20 train-test split, and a two-level ensemble pipeline with soft voting. Achieves 94.60% accuracy, 0.8968 AUC, 0.8675 precision, 0.7423 recall.

churn-prediction classification classification-algorithm customer-retention data-science data-visualization feature-engineering joblib jupyter-notebook machine-learning pandas scikit-learn supervised-learning svm

Last synced: 18 Apr 2026
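
As a rough illustration of two techniques named in the telecom-churn-predictor description above, the sketch below computes a Sturges bin count and a soft-voting decision in plain Python. The function names and the sample probabilities are hypothetical and are not taken from the repository.

```python
import math


def sturges_bins(n_samples):
    """Number of histogram bins by Sturges' rule: ceil(log2(n)) + 1."""
    if n_samples < 1:
        raise ValueError("need at least one sample")
    return math.ceil(math.log2(n_samples)) + 1


def soft_vote(probabilities):
    """Average per-class probabilities across models.

    Returns (index of the class with the highest average, list of averages).
    """
    n_models = len(probabilities)
    averaged = [sum(column) / n_models for column in zip(*probabilities)]
    winner = max(range(len(averaged)), key=averaged.__getitem__)
    return winner, averaged


# Hypothetical outputs of three classifiers for one customer: [P(stay), P(churn)]
model_probs = [[0.6, 0.4], [0.3, 0.7], [0.2, 0.8]]
predicted_class, avg_probs = soft_vote(model_probs)
```

Soft voting differs from hard (majority) voting in that it averages probabilities before picking a class, so one confident model can outweigh two marginal ones.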

https://github.com/awanraskall/retail-demand-analysis

Data analysis of retail meal orders, fulfillment centers, and product demand using Python

data-analysis data-visualization jupyter-notebook numpy pandas python

Last synced: 18 Apr 2026

https://github.com/amal-prasad/healthcare-insights

Healthcare insights drawn from a dataset provided by a particular hospital.

css data-preprocessing data-visualization html ipynb-jupyter-notebook mysql python3 streamlit

Last synced: 04 Apr 2026

https://github.com/leandrocollares/urbanization-versus-income

A responsive scatter plot that shows urban population percentages and GDP per capita in the Americas.

d3 data-visualization svelte

Last synced: 18 Apr 2026

https://github.com/manalisbhavsar/mall-customers-clustering

Applies K-Means clustering to mall customer data, segmenting customers by annual income and spending score to identify patterns and group customers for targeted marketing.

data-analysis data-visualization matplotlib numpy pandas python scikit-learn

Last synced: 18 Apr 2026
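
For readers unfamiliar with the technique in the mall-customers-clustering entry above, here is a minimal sketch of K-Means (Lloyd's algorithm) on two features. It is a plain-Python illustration, not the repository's code; the `kmeans` function, the first-k-points initialisation, and the sample (income, spending score) pairs are all assumptions made for the example.

```python
import math


def kmeans(points, k, iterations=20):
    """Plain Lloyd's algorithm; returns (centroids, labels)."""
    # Deterministic initialisation (an assumption): first k points as centroids.
    centroids = [tuple(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Update step: each centroid moves to the mean of its members.
        new_centroids = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                new_centroids.append(
                    tuple(sum(axis) / len(members) for axis in zip(*members))
                )
            else:
                new_centroids.append(centroids[c])  # keep an empty cluster in place
        if new_centroids == centroids:
            break  # converged
        centroids = new_centroids
    return centroids, labels


# Made-up (annual income, spending score) pairs for six customers.
customers = [(15, 80), (16, 85), (18, 78), (90, 20), (95, 15), (88, 25)]
centroids, labels = kmeans(customers, k=2)
```

In practice a library implementation with randomised restarts is usually preferable, since the result of Lloyd's algorithm depends on the starting centroids.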

https://github.com/mtimma001/clinical-trial-data-tool

Clinical Trial Data Analysis Tool is a Flask-based web app for healthcare professionals to manage and analyze clinical trial data. It features full CRUD functionality, interactive visualizations (Plotly/Matplotlib), a responsive Bootstrap UI, MySQL database integration, and Heroku deployment for accessible, scalable use.

bootstrap5 clinical-trials crud data-analysis data-visualization flask healthcare heroku mysql pandas plotly python

Last synced: 05 Apr 2026

https://github.com/p1n2o/ngx-oracle-dv

Angular component to embed visualizations from Oracle Analytics Cloud into your Angular application.

angular data-visualization embed oracle-analytics-cloud oracle-visualization

Last synced: 18 Apr 2026

https://github.com/prakashjha1/loan-eligibility-prediction

This repository contains the codebase and resources for a machine learning-based project aimed at predicting loan eligibility for individuals. The project utilizes various algorithms and data preprocessing techniques to build predictive models that assess the likelihood of an applicant being eligible for a loan based on historical data.

data data-visualization exploratory-data-analysis loan-prediction-analysis machine-learning-algorithms naive-bayes-classification parameter-tuning python random-forest

Last synced: 19 Apr 2026

https://github.com/reallyabdullah/machine-learning-portfolio

Explore my Machine Learning Portfolio Repository for impactful projects showcasing my expertise in fraud detection, MLOps, and cloud deployment. Dive into innovative solutions and let's shape the future of AI together! 🚀

artificial-intelligence cloud-deployment data-analytics data-science data-visualization llms ml mlops nlp python

Last synced: 19 Apr 2026

https://github.com/vyjayanthipolapragada/data_analytics_medical_appointments

Analyzes a data set of medical appointments to draw insights about patient no-shows.

data-analysis data-analytics data-cleaning data-visualization data-wrangling jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 19 Apr 2026

https://github.com/samwhaaa/superfoodsmax

A customer demographic & spending trend analysis on the fictional SuperFoodsMax grocery chain

data-analysis data-analytics data-visualization jupyter jupyter-notebook python

Last synced: 20 Apr 2026

https://github.com/syncfusionexamples/integrate-.net-maui-maps-with-android-native-embedding

This repository contains a sample that explains how to integrate .NET MAUI SfMaps with Android native embedding.

android data-visualization dotnet map maui native-embedding sfmaps

Last synced: 20 Apr 2026

https://github.com/bitkeks/hn-vis

Visualization of the Hacker News frontpage with Python and R.

data-visualization ffmpeg scraping

Last synced: 20 Apr 2026

https://github.com/robinmillford/hr-analytics-employee-performance-analysis

HR Analytics: Unveiling Employee Performance - A comprehensive exploration of employee data using SQL and Power BI, uncovering key insights for strategic HR decision-making.

data-analysis data-visualization jupyter-notebook powerbi python3 sql

Last synced: 20 Apr 2026

https://github.com/timjjting/nba-teams-pts

The NBA Teams’ Performance Tracking System is a project that aims to help users track the performance of NBA teams from 2000 to 2009.

bootstrap d3js data-visualization jquery nba nba-visualization

Last synced: 20 Apr 2026

https://github.com/chauxvive/fccscatter

An interactive D3.js scatter plot visualizing doping allegations in professional cycling. Built as part of FreeCodeCamp’s Data Visualization certification

d3 d3js data-visualization dataviz

Last synced: 21 Apr 2026

https://github.com/rorrell/lifeexpectancy

A Jupyter Notebook where I create a chart with two line plots on it to check out the life expectancy of men vs. women from 1900-2018

data-analysis data-visualization jupyter-notebook python3

Last synced: 22 Apr 2026

https://github.com/teditae/us-states-guessing-game

A Python game built with Turtle and Pandas where the player guesses the names of U.S. states and sees them appear on a U.S. map. A fun way to learn geography while coding!

data-visualization geography-game pandas python turtle

Last synced: 22 Apr 2026

https://github.com/syed-nihaal/car-price-prediction-and-performance-analysis

A data science notebook project focused on analyzing car features and building a model for car price prediction.

data data-analysis data-visualization jupyter-notebook python

Last synced: 23 Apr 2026

https://github.com/ppatrzyk/heatmap

Display CSV as a heatmap in terminal

csv data data-visualization terminal

Last synced: 24 Apr 2026

https://github.com/gowhale/daily-spend-analysis

Python script to analyse spending habits.

data-visualization pandas python

Last synced: 24 Apr 2026

https://github.com/avnigoyal25/ipl_eda

Exploratory Data Analysis on IPL datasets

data-visualization eda python

Last synced: 24 Apr 2026

https://github.com/dpb24/datakind-2025

📊 Data Analytics: Identifying Actionable Insights to Improve Financial Inclusion in Kenya

data-analytics data-visualization databricks datakind exploratory-data-analysis financial-data geopandas jupyter-notebook kenya matplotlib numpy python seaborn

Last synced: 24 Apr 2026

https://github.com/pyrypp/taxipoint_streamlit

The front-end for the taxi demand prediction service

data-visualization streamlit

Last synced: 24 Apr 2026

https://github.com/pedrohdosanjos/economic-data-analysis

This project aims to analyze the export data from various states in the United States to Brazil over time. The data is sourced from the FRED (Federal Reserve Economic Data) API and processed to identify the top 5 exporting states for each year, as well as the states with the highest total export value across all years.

api data-analysis data-visualization jupyter-notebook python

Last synced: 24 Apr 2026
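
The two rankings described in the economic-data-analysis entry above (top exporters per year, and top exporters by total value) can be sketched as follows. This is an assumption-laden illustration: the function names are hypothetical, the records are made-up placeholder values rather than FRED data, and no FRED API call is shown.

```python
from collections import defaultdict


def top_exporters_by_year(records, n=5):
    """Map each year to its n largest (state, value) pairs."""
    by_year = defaultdict(list)
    for year, state, value in records:
        by_year[year].append((state, value))
    return {
        year: sorted(rows, key=lambda row: row[1], reverse=True)[:n]
        for year, rows in by_year.items()
    }


def top_exporters_overall(records, n=5):
    """Return the n states with the highest total value across all years."""
    totals = defaultdict(float)
    for _, state, value in records:
        totals[state] += value
    return sorted(totals.items(), key=lambda row: row[1], reverse=True)[:n]


# Placeholder (year, state, export value) records, not real data.
records = [
    (2020, "TX", 30.0), (2020, "FL", 12.0), (2020, "CA", 8.0),
    (2021, "TX", 35.0), (2021, "FL", 15.0), (2021, "CA", 16.0),
]
yearly = top_exporters_by_year(records, n=2)
overall = top_exporters_overall(records, n=2)
```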

https://github.com/mehmetkahya0/gallstone_dataset_analysis_project

Gallstone Disease (Gallstone-1) Dataset Analysis (https://archive.ics.uci.edu/dataset/1150/gallstone-1)

analysis analytics data data-analysis data-science data-visualization database graph matplotlib python

Last synced: 25 Apr 2026

https://github.com/kruthiktr/crop-recommendation-system-using-machine-learning

A machine learning-based system recommending crops based on soil, climate, and environmental conditions to optimize agricultural yields.

ai-in-agriculture crop-recommendation data-visualization machine-learning prediction python python3 recommendation-system

Last synced: 25 Apr 2026

https://github.com/m-biriulova/python-job-market-analysis

Web scraping, data analysis, and visualization of Python developer vacancies in the Czech Republic.

automation beautifulsoup data-analysis data-visualization portfolio-project python selenium web-scraping

Last synced: 25 Apr 2026

https://github.com/bkamapantula/chart-recommender-gui

A rule-based chart recommendation service. Useful for teams to self-host or as an internal service.

data-visualization python

Last synced: 25 Apr 2026

https://github.com/dulajkavinda/matplotlib-ml

📊Data visualisation with matplotlib library.

data-visualization jupyter-notebook matplotlib python seaborn

Last synced: 25 Apr 2026

https://github.com/danilowskic/queue-initialisation

An app that demonstrates the queue data structure, describing how it works and what it is based on.

data-structures data-visualization java-8 queue queue-simulation

Last synced: 26 Apr 2026

https://github.com/odinleepro/airbnbnewyorkcityanalysis

AirbnbNewYorkCityAnalysis is a comprehensive data analysis and visualization project exploring short-term Airbnb rental trends across New York City (2008–2022). Using open source Airbnb data, the project combines data cleaning, statistical summaries, and Tableau dashboards to uncover pricing patterns, borough level distribution, and insights.

airbnb analytics-project data-analysis data-cleaning data-science data-visualization new-york-city real-estate-analytics tableau urban-analysis

Last synced: 27 Apr 2026

https://github.com/zonggen/uiuc-cs416-a2

Data visualization assignment with D3.js

d3 data-visualization

Last synced: 27 Apr 2026

https://github.com/gabrieldiem/data_visualization_lifespan_wealth

A small Python script that visualizes life span and wealth worldwide.

data-visualization pandas plotly python script

Last synced: 27 Apr 2026

https://github.com/benzerinsio/floralspecies-eda

📊 Exploratory Data Analysis (EDA) - Iris Flowers | Pattern exploration and clustering with K-Means

analise-de-dados analise-exploratoria analise-exploratoria-de-dados botany clustering data-visualization eda exploratory-analysis exploratory-data-analysis python seaborn

Last synced: 27 Apr 2026

https://github.com/natanast/tidytuesday_python

This repository contains my submissions for the TidyTuesday Python Challenge.

data-science data-visualization posit python quarto tidytuesday

Last synced: 07 Jun 2026

https://github.com/sungj921028/data-analysis-for-aqi

A project that uses Python to analyze AQI (Air Quality Index) data.

aqi data-science data-visualization jupyter-notebook

Last synced: 07 Jun 2026

https://github.com/oguzhanfatihkucuk/data-analytics-project-kafka-spark

The data in this project was collected in a database using Apache Kafka and processed with Apache Spark Streaming. The project aims to create a forecasting model and analyze sales forecasts per customer.

big-data data data-visualization hadoop kafka ml mlpipeline plt pyhton spark

Last synced: 28 Apr 2026

https://github.com/jgohel9902/comprehensive-healthcare-analytics

An end-to-end healthcare analytics project integrating SQL, Python, and Power BI to analyze patient data, billing information, and doctor performance. This project showcases skills in data cleaning, advanced querying, visualization, and comprehensive insights generation to support data-driven decision-making in the healthcare industry.

data-visualization pandas powerbi python pythonfordatascience sql

Last synced: 28 Apr 2026

https://github.com/hadson0/chess-live-ratings-data

A study project focused on web scraping the live chess ratings from chess.com, with data analysis and visualization on nearly 5000 players in the classical world ranking.

beautifulsoup chess data-analysis data-visualization numpy pandas python seaborn web-scraping

Last synced: 28 Apr 2026

https://github.com/tyfiero/ada-priceart-explorer

Generative art NFT project on Cardano that uses real ADA/USD price data to create 3D candlestick art

ada cardano cnft data-visualization generative-art javascript nft p5js

Last synced: 28 Apr 2026

https://github.com/al-chris/whatsapp-dashboard-web

A client-side only web application for analyzing and visualizing WhatsApp chat exports. This version runs entirely in your browser without requiring any server or backend - your data never leaves your device!

data-visualization javascript whatsapp

Last synced: 28 Apr 2026

https://github.com/shreeparab1890/indian-elections-2019-analysis-eda

This ipython notebook is the Exploratory data analysis (EDA) of the Indian Lok Sabha Elections 2019.

data data-analysis data-science data-visualization eda exploratory-data-analysis matplotlib numpy pandas plotly python python3 visualization

Last synced: 28 Apr 2026

https://github.com/sawaira-iqbal/used-cars-price-prediction-ml-project

🚗 The Used Car Price Prediction project uses advanced ML models like Random Forest 🌲, Decision Tree 🌳, XGBoost 🚀, and SVR 🔍 to predict used car prices, enhancing buying and selling decisions.

data-visualization decision-tree machine-learning price-prediction python random-forest-regressor support-vector-machine xgboost

Last synced: 28 Apr 2026

https://github.com/malbiruk/salesflow-data-pipeline

End-to-end data engineering pipeline using Azure Blob, Data Factory, dbt, Snowflake, and Streamlit for interactive business analytics. (WIP)

azure-data-factory cloud-data-engineering data-visualization dbt etl snowflake streamlit

Last synced: 08 Jun 2026

https://github.com/alexquilis1/news-sentiment-analyzer

A Flask web app that analyzes sentiment in news articles and generates word clouds to visualize emotional trends in current events

data-visualization flask natural-language-processing news-api nlp nltk python sentiment-analysis vader-sentiment wordcloud

Last synced: 19 May 2026

https://github.com/marcusrprojects/stock-return-analyzer

Analyze and visualize cumulative stock returns against a benchmark (e.g., S&P 500) across multiple time scopes using Python, yfinance, and Matplotlib.

cumulative-return data-visualization matplotlib pandas python stock-analysis yfinance

Last synced: 29 Apr 2026
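
To make the idea in the stock-return-analyzer entry above concrete, the sketch below computes cumulative returns from a price series and the gap against a benchmark. It is only an illustration: the function names and the price values are hypothetical, and no yfinance or Matplotlib calls are used.

```python
def cumulative_returns(prices):
    """Cumulative return at each step, relative to the first price."""
    if not prices or prices[0] == 0:
        raise ValueError("need a non-empty series with a non-zero first price")
    base = prices[0]
    return [price / base - 1.0 for price in prices]


def excess_over_benchmark(asset_prices, benchmark_prices):
    """Asset cumulative return minus benchmark cumulative return, step by step.

    If the series differ in length, the result is truncated to the shorter one.
    """
    asset = cumulative_returns(asset_prices)
    benchmark = cumulative_returns(benchmark_prices)
    return [a - b for a, b in zip(asset, benchmark)]


# Made-up closing prices for a stock and a benchmark index.
stock = [100.0, 110.0, 121.0]
index = [200.0, 210.0, 220.0]
excess = excess_over_benchmark(stock, index)
```

A positive final value in `excess` means the asset outperformed the benchmark over the chosen window; changing the window changes the base price and therefore the result.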

https://github.com/tynandebold/daylight

Amount of daylight in select locations around the world.

data-visualization data-viz daylight javascript react time

Last synced: 29 Apr 2026

https://github.com/anilyigitsel/istanbul-rental-apartments-analysis

This project analyzes the Istanbul Rental Apartments Dataset (2025), which includes rental apartment listings from Istanbul, Turkey.

data-analysis data-visualization jupyter-notebook matplotlib pandas python rental-housing

Last synced: 29 Apr 2026

https://github.com/mr-dhan/eda-sales-customer-transactions

In the competitive world of retail business, a deep understanding of customer behavior is an essential foundation for strategic decision-making. However, customer transaction data is often large and complex, so it requires an effective analysis process to uncover valuable insights.

dashboard data data-analysis data-analysis-python data-science data-visualization eda python

Last synced: 29 Apr 2026

https://github.com/jofaval/melbourne-temperature-timeseries

Timeseries Data Analysis and Forecasting of the daily min temperature in Melbourne from 1981 to 1990

data-analysis data-science data-visualization deep-learning google-colab melbourne python temperature tensorflow timeseries timeseries-analysis

Last synced: 29 Apr 2026

https://github.com/aykutsahinn/carpredictapp

Analysis of Used Cars | Jupyter Notebook

analysis data-visualization jupyter-notebook pyhton streamlit

Last synced: 29 Apr 2026

https://github.com/muhammadusman-khan/e-commerce-store-eda

Exploratory Data Analysis on E-commerce store data to uncover insights about sales trends, customer behavior, and product performance using Python libraries like Pandas, NumPy, and Matplotlib/Seaborn.

data-analysis data-science data-visualization e-commerce eda exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/r-mahesh45/fraud-detection-and-sales-analysis-using-random-forest

This project uses Random Forest to classify fraud risk based on taxable income and analyze key factors driving high sales for a cloth manufacturing company.

classification data-visualization extract-transform-load python3 random-forest

Last synced: 30 Apr 2026

https://github.com/samiksha29-patil/hr-employee-data-analysis-visualization-in-python

This project focuses on analyzing an HR Employee Dataset that contains details about employees such as demographics, job status, salaries, performance reviews, satisfaction levels, and attrition reasons.

csv-files data data-visualization dataanalysis matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/mxagar/eda_fe_summary

An 80/20 guide for Data Processing: Data Cleaning, Exploratory Data Analysis, Feature Engineering, Feature Selection.

data-analysis data-cleaning data-modeling data-science data-visualization eda exploratory-data-analysis feature-engineering feature-selection machine-learning pandas

Last synced: 30 Apr 2026

https://github.com/cagandemirmr/airbnb_available_houses

In this repo, I create a dashboard using Tableau. In the process, I use SQL and Python.

dashboard data-visualization dataprocessing python sql tableau

Last synced: 30 Apr 2026

https://github.com/samuelpillai/machine-learning-classification-regression-nlp

A curated collection of machine learning mini-projects covering classification, regression, and natural language processing (NLP). This project demonstrates model training, evaluation, feature engineering, and pipeline integration using real-world datasets and Python tools like Scikit-learn, pandas, and NLTK.

classification data-analysis data-science data-visualization feature-engineering jupyter-notebook machine-learning ml-pipeline model-evaluation nlp python regression-models scikit-learn supervised-learning text-mining

Last synced: 30 Apr 2026

https://github.com/mmartin46/county-health-findings-project

Analyzes the data set provided by United Health Group (UHG) to determine the impact of race, social, and demographic factors on health, survival, and mortality.

analysis data-science data-visualization linear-regression machine-learning pandas

Last synced: 30 Apr 2026

https://github.com/abhi227070/ipl-2024-sold-player-data-analysis

This project analyzes IPL 2024 auctioned players' data, including name, team, cricket type, nationality, and price. Users input a player's name to access team, style, nationality, and auction price, aiding research and fantasy leagues. It offers insights into player dynamics, serving cricket enthusiasts with comprehensive data exploration.

data-analysis data-visualization dataanalytics machine-learning machine-learning-algorithms python3

Last synced: 30 Apr 2026

https://github.com/cdeweyx/bryce-harper-2016-analysis

Notebook analyzing Bryce Harper's disappointing 2016 campaign in historical context through data analytics.

data-analysis data-visualization python

Last synced: 01 May 2026

https://github.com/gerhynes/d3-movie-quotes

A simple page built to practice binding data to elements using D3. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 01 May 2026

https://github.com/kevinandersontech/ecommerce_dashboard_streamlit

A Streamlit dashboard that reads daily revenue metrics from the data pipeline. Provides date filters, summary KPIs, line charts, and a table to explore revenue over time across different statuses (e.g. paid, refunded, failed).

charts dashboard data-visualization duckdb filters metrics python streamlit

Last synced: 01 May 2026

https://github.com/codesaadumair/data-science-monorepo

Comprehensive Data Science monorepo featuring EDA, Machine Learning, Preprocessing, Feature Engineering, and Visualization projects with Jupyter notebooks and Python.

data-analysis data-science data-science-projects data-visualization eda jupyter-notebook jupyterlab machine-learning python

Last synced: 01 May 2026

https://github.com/rafath0ssain/predihome

Data analysis using economic factors affecting living conditions across Canadian provinces.

data-analysis data-visualization dplyr ggplot2 graph kaggle linear-regression prediction-model r shiny tidyr

Last synced: 01 May 2026

https://github.com/kushshriv/onlinejobpostings-infographic

The Python Data Cleaning Code and Input Dataset For My Telling Stories With Data Project

data-visualization pandas python

Last synced: 01 May 2026

https://github.com/ujjwalll/get-that-flair

A repository for a project that detects the flair of Reddit posts from their links. A working model is available at https://get-that-flair.herokuapp.com/

data-analysis data-visualization django-application herokuapp machine-learning naive-bayes-classifier praw-reddit python3 random-forest reddit-api sentiment-analysis topic-modeling

Last synced: 01 May 2026

https://github.com/mrwilbroad/react-data-visualizaion

Data visualization with ReactJS and react-chartjs-2

chartjs data-visualization graphs react-chartjs-2 reactjs

Last synced: 02 May 2026

https://github.com/harshindcoder/salifort_motors_project

This people analytics project analyzes factors influencing employee turnover and predicts whether an employee is likely to leave. It aims to uncover patterns behind departures, helping Salifort improve retention, workplace culture, and professional growth strategies.

data-analysis data-science data-visualization hr-analytics machine-learning tree-models

Last synced: 02 May 2026

https://github.com/faithererer/haokanvideo_spider

Haokan Video scraping and data analysis

data-analysis data-visualization python spider

Last synced: 02 May 2026

https://github.com/rorrell/employmentdata

A Jupyter Notebook where I use group by to analyze the average unemployment rate by year

data-analysis data-visualization jupyter-notebook python3

Last synced: 02 May 2026

https://github.com/gerhynes/d3-notes-app

A simple notes app built to practice D3 selection methods. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 03 May 2026

https://github.com/inevolin/multivariate-data-analysis

Showcases of modern multivariate & multidimensional data analysis in industrial and high-tech settings.

analytics data-analysis data-science data-visualization javascript

Last synced: 09 Jun 2026

https://github.com/monteirooscar98/tarifas-publicas-sp-dieese

Data extraction via web scraping of the Dieese website, and analysis of public tariffs in the Municipality of São Paulo.

data-analysis data-visualization python webscraping

Last synced: 03 May 2026

https://github.com/james-julius/latent-space-explorer

A flythrough 3D map of meaning — type any concept and watch it land near related ideas. In-browser embeddings (no install, no key), pre-seeded scenes, and a multi-model knowledge explorer.

3d-visualization ai claude data-visualization embeddings gemini latent-space llm machine-learning nextjs openai react-three-fiber semantic-search text-embeddings threejs transformers-js typescript umap vector-search webgpu

Last synced: 09 Jun 2026

https://github.com/ronitjariwala/prodigy_ds_05

Prodigy InfoTech Data Science Internship Task-5

data-science data-visualization python

Last synced: 03 May 2026