An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/leotrja/my-book-hands-on-machine-learning-with-scikit-learn-keras-and-tensorflow

📘 Explore the digital translation of "Practical Machine Learning" covering machine learning, deep learning, and neural networks in Persian.

computer-vision data-visualization deep-learning keras keras-tensorflow machi machine-learning neural-networks nlp num panda python reinforcement-learning sci tensorflow2

Last synced: 28 Apr 2026

https://github.com/rorrell/coviddeaths

A Jupyter Notebook where I create several visualizations based on data about COVID-19 deaths from 2020 to 2024

data-analysis data-visualization jupyter-notebook python3

Last synced: 28 Apr 2026

https://github.com/razalkr70/customer-segmentation-using-dataset

A data science project that segments mall customers using K-Means clustering. Based on age, income, and spending score, it identifies customer groups and visualizes them with 2D and 3D plots for targeted marketing insights.

clustering customer-segmentation data-science data-visualization kmeans machine-learning pca python scikit-learn

Last synced: 28 Apr 2026
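
The K-Means segmentation described in the entry above can be illustrated with a minimal pure-Python sketch of Lloyd's algorithm. This is not the repository's code: the `customers` rows (age, income, spending score) are made-up values, the function names are hypothetical, and features are left unscaled for brevity.

```python
import random


def sq_dist(p, q):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def nearest(point, centroids):
    """Index of the centroid closest to `point`."""
    return min(range(len(centroids)), key=lambda i: sq_dist(point, centroids[i]))


def kmeans(points, k, iterations=20, seed=0):
    """Lloyd's algorithm: alternate assignment and centroid-update steps."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # start from k distinct data points
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            clusters[nearest(p, centroids)].append(p)
        # Update step: move each centroid to the mean of its members.
        for i, members in enumerate(clusters):
            if members:  # keep the old centroid if a cluster is empty
                centroids[i] = tuple(sum(dim) / len(members) for dim in zip(*members))
    labels = [nearest(p, centroids) for p in points]
    return centroids, labels


# Hypothetical (age, annual income, spending score) rows, for illustration only.
customers = [
    (22, 15, 80), (25, 18, 75), (23, 16, 82),
    (55, 90, 20), (60, 95, 15), (58, 88, 18),
]
centroids, labels = kmeans(customers, k=2)
print(labels)
```

In practice the three features would usually be standardized first so that income does not dominate the distance.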

https://github.com/ezrahsieh/narrativevisualization

This project is an interactive narrative visualization designed to illustrate the impact of the COVID-19 pandemic on global life expectancy. The visualization is implemented using D3.js and follows the Martini glass narrative structure. This serves as the final project for CS416 at UIUC.

d3 data-visualization interactive-visualizations javascript narrative-visualization

Last synced: 28 Apr 2026

https://github.com/robertovicario/uninsubria-datavisualization-project-work

Project Work for the Data Visualization module in the MSc in Computer Science program in Varese.

data-visualization dogecoin elonmusk python

Last synced: 28 Apr 2026

https://github.com/joshuadch/customer-churn-prediction

Predicting customer churn with Python (ETL, feature engineering, ML models, AUC/ROC) and business insights.

classification customer-churn data-science data-visualization feature-engineering machine-learning pandas python sklearn xgboost

Last synced: 28 Apr 2026
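
As a rough illustration of the AUC/ROC metric mentioned in the entry above, the sketch below computes AUC as the fraction of (positive, negative) pairs that the model ranks correctly. The labels and scores are invented for the example and the function name `roc_auc` is hypothetical; it is not taken from the repository.

```python
def roc_auc(y_true, scores):
    """AUC as the probability that a random positive outranks a random negative."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half a win
    return wins / (len(pos) * len(neg))


# Hypothetical churn labels (1 = churned) and model scores.
y_true = [1, 0, 1, 0, 1, 0]
scores = [0.9, 0.2, 0.7, 0.6, 0.4, 0.1]
auc = roc_auc(y_true, scores)
print(round(auc, 3))
```

The pairwise loop is quadratic in the number of samples, which is fine for a demonstration; a sort-based rank formula would be the usual choice for larger data.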

https://github.com/vanshuchaudhary/zomato

This Jupyter Notebook contains an exploratory data analysis (EDA) of Zomato restaurant data. It includes data cleaning, visualization, and insights into restaurant ratings, pricing, cuisine distribution, and location-based trends.

business-analytics data-analysis data-mining data-science data-visualization datascience matplotlib pandas-dataframe pandas-python python python-3 python-library

Last synced: 29 Apr 2026

https://github.com/anilyigitsel/istanbul-rental-apartments-analysis

This project analyzes the Istanbul Rental Apartments Dataset (2025), which includes rental apartment listings from Istanbul, Turkey.

data-analysis data-visualization jupyter-notebook matplotlib pandas python rental-housing

Last synced: 29 Apr 2026

https://github.com/frammenti/knowledge-sake

Documentation and code for the course project in Open Access and Digital Ethics, University of Bologna, a.y. 2024/2025.

data-visualization dcat-ap education eurostat observable-plot oecd open-data

Last synced: 29 Apr 2026

https://github.com/chauxvive/fccchoropleth

A D3.js-based choropleth map visualizing U.S. census data on education levels by state, built as part of the FreeCodeCamp Data Visualization certification. It enables users to explore state-wise educational attainment data interactively.

d3 d3js data-visualization dataviz

Last synced: 29 Apr 2026

https://github.com/mfakhriazhar/python-data-analyst-tutorial

A collection of my Python learning files for data analysis. Covers fundamental to advanced topics such as data exploration, visualization, statistical analysis, and the use of popular libraries like Pandas, NumPy, Matplotlib, and Seaborn. Suitable for personal documentation or shared learning references.

data-analysis data-science data-visualization exploratory-data-analysis portfolio python

Last synced: 29 Apr 2026

https://github.com/varshan1123/sql-tableau-project

We analyze key indicators for our pizza sales data to gain insights into our business performance - A Data Analysis Project performed on Tableau & SQL.

analysis data-analysis data-science data-visualization excel mysql powerbi sql sql-server tableau tableau-dashboards

Last synced: 29 Apr 2026

https://github.com/prithviraj-2003/cognifyz-data-science-internship

🎓 Data Science Internship at Cognifyz Technologies 📅 Duration: 2 Months 🧠 Worked on real-world restaurant data 🗂️ Completed structured tasks across 3 levels 📌 Tasks focused on EDA, data preprocessing, visualization, and analysis 📎 Task descriptions provided in an attached PDF

data-analysis data-science data-visualization matplotlib numpy pandas python3

Last synced: 29 Apr 2026

https://github.com/mominurr/amazon-best-sellers-data-analysis

Exploring trends and product insights in Amazon Best Sellers data.

data-analysis data-visualization python scraping selenium tableau

Last synced: 29 Apr 2026

https://github.com/laipching/sprint6_module1

Exploratory Data Analysis with Python (Pandas/Matplotlib/Seaborn). Business questions, metrics and clear visualizations.

data-visualization eda matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/gvatsal60/ds-on-kaggle

A collection of data science projects, experiments, and insights from Kaggle competitions and datasets

data data-science data-visualization numpy pandas python3

Last synced: 29 Apr 2026

https://github.com/chrka/d3-chessboard-count

Plot per-square frequencies on a chessboard

chess d3 data-visualization

Last synced: 30 Apr 2026

https://github.com/edgarhtt/uber_freight_data_analysis

Uber Freight interview homework. It consisted of solving a two-warehouse problem and an ETL task.

data-analysis data-science data-visualization python

Last synced: 30 Apr 2026

https://github.com/bachtiarashidiqy/ecommercedashboard

An interactive e-commerce analytics dashboard built with Streamlit, providing visualizations for sales performance, product analysis, geographic insights, and delivery status. Includes date filtering, company branding, and comprehensive documentation.

analytics dashboard data-analysis data-visualization e-commerce matplotlib pandas python seaborn streamlit

Last synced: 30 Apr 2026

https://github.com/diogojorgebasso/dataanalysis_r_minesnancy

Code and materials for the data analysis courses in R at Mines de Nancy. You will also find R scripts, notebooks, and other resources for each lesson.

analyse-data data-analysis data-science data-visualization estatistics r statistiques statistiques-descriptives

Last synced: 30 Apr 2026

https://github.com/cagandemirmr/airbnb_available_houses

In this repo, I create a dashboard using Tableau. In the process, I use SQL and Python.

dashboard data-visualization dataprocessing python sql tableau

Last synced: 30 Apr 2026

https://github.com/priyam-hub/covid-19-data-analysis

Explore COVID19 case numbers and deaths related to Coronavirus outbreak 2019/2020 in Pandas and in Jupyter notebook

analysis data data-visualization jupyter-notebook machine-learning python

Last synced: 08 Jun 2026

https://github.com/tolumie/aviva-insurance-statistics-hypothesis-abtesting-modelling

This project explores the impact of demographic and lifestyle factors on insurance charges, using statistical hypothesis testing (ANOVA, Chi-Square, T-tests) and predictive modeling (Elastic Net, Random Forest, Gradient Boosting). The analysis is deployed using Streamlit.

anova chi-square-test data-visualization eda gradient-boosting hypothesis-testing insurance-dataset machine-learning predictive-modeling python random-forest statistical-analysis streamlit

Last synced: 30 Apr 2026
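
To make the T-test step named in the entry above concrete, here is a minimal sketch of Welch's two-sample t statistic using only the standard library. The two groups of charges are invented numbers, the function name `welch_t` is hypothetical, and no p-value is computed because the standard library has no Student's t distribution.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(a, b):
    """Welch's t statistic and Welch-Satterthwaite degrees of freedom."""
    na, nb = len(a), len(b)
    va, vb = variance(a), variance(b)          # sample variances (n - 1)
    se = sqrt(va / na + vb / nb)               # standard error of the difference
    t = (mean(a) - mean(b)) / se
    df = (va / na + vb / nb) ** 2 / (
        (va / na) ** 2 / (na - 1) + (vb / nb) ** 2 / (nb - 1)
    )
    return t, df


# Hypothetical insurance charges (in thousands) for two groups.
group_a = [30, 32, 34]
group_b = [10, 12, 14]
t_stat, dof = welch_t(group_a, group_b)
print(round(t_stat, 3), round(dof, 1))
```

A large absolute t value relative to the degrees of freedom suggests the group means differ; turning it into a p-value would need a statistics library.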

https://github.com/01110011011101010110010001101111/tigercosmosbootstrapdash

Sample repository for visualising TigerGraph data with Cosmos in a Bootstrap dashboard

bootstrap cosmos data-visualization graph-visualization tigergraph

Last synced: 30 Apr 2026

https://github.com/the-clone-xyz/stats-lapas-pakam

Visualization of inmate data by gender at Lapas Lubuk Pakam (the Lubuk Pakam correctional facility), using BPS Deli Serdang data and updated automatically via GitHub Actions.

bps-api data-visualization github-actions lubuk-pakam statistics

Last synced: 30 Apr 2026

https://github.com/gerhynes/d3-movie-quotes

A simple page built to practice binding data to elements using D3. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 01 May 2026

https://github.com/fbarffmann/project1

Analyzed factors influencing movie profitability using Python. Cleaned and visualized film industry data to uncover trends in budgets, sales, genres, and ratings.

box-office-analysis data-analysis data-visualization matplotlib movie-industry pandas python regression seaborn

Last synced: 01 May 2026

https://github.com/corezen/graph-voyager

Interactive React app for visualizing and exploring graph data structures. Built with Vite and Tailwind CSS.

data-visualization github-pages graph-visualization react tailwindcss vite

Last synced: 01 May 2026

https://github.com/treyhamilton/baccarat-betting-simulator

Streamlit app simulating a baccarat betting strategy with full results visualization and CSV output.

baccarat betting data-visualization gambling machine-learning simulator streamlit

Last synced: 01 May 2026

https://github.com/kushshriv/onlinejobpostings-infographic

The Python Data Cleaning Code and Input Dataset For My Telling Stories With Data Project

data-visualization pandas python

Last synced: 01 May 2026

https://github.com/vedantshi/stock-price-prediction-for-maang-companies

This project utilizes Long Short-Term Memory (LSTM) networks to forecast stock prices. It includes steps for data preprocessing, model training, and visualization of predictions using Python in Jupyter Notebook. The project demonstrates proficiency in machine learning, data analysis, and Python programming.

data-analysis data-visualization lstm machine-learning python stock-price-prediction

Last synced: 01 May 2026

https://github.com/ujjwalll/get-that-flair

A repository for a project that detects the flair of Reddit posts from their links. A working model is available at https://get-that-flair.herokuapp.com/

data-analysis data-visualization django-application herokuapp machine-learning naive-bayes-classifier praw-reddit python3 random-forest reddit-api sentiment-analysis topic-modeling

Last synced: 01 May 2026

https://github.com/mrwilbroad/react-data-visualizaion

Data visualization with React and react-chartjs-2

chartjs data-visualization graphs react-chartjs-2 reactjs

Last synced: 02 May 2026

https://github.com/vaxdata22/redfin-analytics-etl-using-amazon-emr-by-airflow-on-ec2

This is an end-to-end AWS Cloud ETL project. The data pipeline uses an Amazon EMR cluster managed by Apache Airflow running on an AWS EC2 instance. It demonstrates how to build orchestration that performs data transformation using Amazon EMR as well as automatic data ingestion into Snowflake via Snowpipe. It also features Power BI.

amazon-emr-cluster apache-airflow apache-spark aws-ec2 aws-s3 business-intelligence dags data-visualization etl-pipeline google-colab-notebook orchestration power-bi pyspark redfin snowflake snowpipe sqs-queue

Last synced: 02 May 2026

https://github.com/harshindcoder/salifort_motors_project

This people analytics project analyzes factors influencing employee turnover and predicts whether an employee is likely to leave. It aims to uncover patterns behind departures, helping Salifort improve retention, workplace culture, and professional growth strategies.

data-analysis data-science data-visualization hr-analytics machine-learning tree-models

Last synced: 02 May 2026

https://github.com/faithererer/haokanvideo_spider

Haokan Video scraping and data analysis

data-analysis data-visualization python spider

Last synced: 02 May 2026

https://github.com/shridhar1504/milk-production-time-series-forecasting-datascience-project

This project uses time series forecasting to predict future milk production. The data is monthly milk production from January 1962 to December 1975. An ARIMA (autoregressive integrated moving average) model is used to produce the forecast, and the model is evaluated using various metrics.

adf arima-model augmented-dickey-fuller-test data-analysis data-analytics data-science data-visualization eda exploratory-data-analysis machine-learning machine-learning-algorithms python python3 residuals sarimax seasonality time-series time-series-forecasting trends

Last synced: 02 May 2026

https://github.com/kimaruthagna/segmente

A journey through understanding customer segmentation using Python, with the general goal of encouraging data-driven decision making

clustering crosstab customer-segmentation data-science data-visualization knn-classification lifetime-value pandas rfm-analysis seaborn

Last synced: 02 May 2026

https://github.com/s1dewalker/electric-future

Visual Analysis: Future of Automotive Industry

data data-visualization machine-learning python3 regression-analysis tableau

Last synced: 02 May 2026

https://github.com/gerhynes/d3-notes-app

A simple notes app built to practice D3 selection methods. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 03 May 2026

https://github.com/inevolin/multivariate-data-analysis

Showcases of modern multivariate & multidimensional data analysis in industrial and high-tech settings.

analytics data-analysis data-science data-visualization javascript

Last synced: 09 Jun 2026

https://github.com/vincenzopalazzo/visualsars2chart

Visual analytics of COVID-19 (SARS 2) data with Python and Tableau

covd-19 covid-2019 covid19 data-visualization datacleaning dataset python3

Last synced: 03 May 2026

https://github.com/james-julius/latent-space-explorer

A flythrough 3D map of meaning — type any concept and watch it land near related ideas. In-browser embeddings (no install, no key), pre-seeded scenes, and a multi-model knowledge explorer.

3d-visualization ai claude data-visualization embeddings gemini latent-space llm machine-learning nextjs openai react-three-fiber semantic-search text-embeddings threejs transformers-js typescript umap vector-search webgpu

Last synced: 09 Jun 2026

https://github.com/ronitjariwala/prodigy_ds_05

Prodigy InfoTech Data Science Internship Task-5

data-science data-visualization python

Last synced: 03 May 2026

https://github.com/dvmorozov/arxiv

ArxivExpress - arxiv.org client for Android and iOS, ArxivNavigator - interactive arxiv.org metadata visualization. I would appreciate any way of contributing: GitHub issue, email or pull request.

android-application arxiv arxiv-api arxiv-org csharp d3-js d3-visualization d3js data-science data-visualization datascience ios-app javascript pyhton3 xamarin xamarin-forms

Last synced: 03 May 2026

https://github.com/nathadriele/diabetes-clinical-etl-pipeline

This Public Health Data Engineering project implements a complete pipeline to collect, clean, standardize, validate, integrate, and visualize public SUS data related to Diabetes Mellitus in Brazil, filtering by ICD-10 codes E10 to E14.

cid data-analysis data-extraction data-pipeline data-science data-structures data-visualization datasus diabetes-detection diabetes-prediction epidemiology-analysis etl-pipeline healthcare-analytics ibge logger pytest sih streamlit sus

Last synced: 09 Jun 2026

https://github.com/akhil-krishnan-r/super_market_analysis

Supermarkets are growing in most populated cities, and market competition is high. This dataset contains the historical sales of a supermarket company, recorded in 3 different branches over 3 months. Predictive data analytics methods are easy to apply to this dataset.

data-visualization matplotlib numpy pandas seaborn

Last synced: 03 May 2026

https://github.com/gerhynes/d3-birth-chart

An interactive UN birth rate bar chart built to practice manipulating SVGs with D3. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 03 May 2026

https://github.com/eikospartners/ngcatalyst

Repo for visualizations for Angular

angular d3 data-visualization

Last synced: 03 May 2026

https://github.com/pejpero/multi-class_classifier

This project demonstrates multiclass classification using Perceptron and Logistic Regression, implemented from scratch without using built-in libraries. It includes techniques like One-Versus-The-Rest and One-Versus-One for Perceptron, and Softmax for Logistic Regression, with a focus on understanding core ML concepts.

data-visualization logistic-regression machine-learning multiclass-classification one-vs-one one-vs-rest perceptron python scratch-implementation softmax

Last synced: 09 Jun 2026
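
The Softmax step for multiclass logistic regression mentioned in the entry above can be sketched from scratch as follows. This is an illustrative sketch, not the repository's implementation: the weights, biases, and input are made-up values, and the function names are hypothetical.

```python
from math import exp


def softmax(logits):
    """Turn raw class scores into probabilities that sum to 1."""
    m = max(logits)                        # subtract the max for numerical stability
    exps = [exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def predict(weights, biases, x):
    """Score each class with a linear model, then apply softmax."""
    logits = [
        sum(w_i * x_i for w_i, x_i in zip(w, x)) + b
        for w, b in zip(weights, biases)
    ]
    probs = softmax(logits)
    return probs.index(max(probs)), probs


# Hypothetical parameters for 3 classes and 2 features.
weights = [[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]]
biases = [0.0, 0.0, 0.0]
label, probs = predict(weights, biases, (2.0, 0.5))
print(label, [round(p, 3) for p in probs])
```

Training would add a gradient step on the cross-entropy loss; only the forward pass is shown here.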

https://github.com/parmaster/logserver

Collecting telemetry data posted by IoT devices into MQTT, filtering, processing and visualizing results

data-visualization golang-template iot-application mqtt mqtt-client template

Last synced: 04 May 2026

https://github.com/joaopalmeiro/altair-playground-app

An application powered by Streamlit to test different configurations on an Altair chart.

altair data-visualization streamlit

Last synced: 04 May 2026

https://github.com/xiaohan2012/myunisport

Visualize your Unisport annual training records

data-analysis data-visualization pandas pygal sports-stats tikzposter

Last synced: 04 May 2026

https://github.com/damisparks/become_data_analyst

Are you new to data analysis? Here you will find simple notebooks to help you on your journey. These are personal projects that I am still working on.

data data-analysis data-visualization matplotlib numpy pandas-tutorial

Last synced: 04 May 2026

https://github.com/bishopce16/pyber_analysis

The purpose of this project was to complete an exploratory analysis and create visualizations of the 2019 ride sharing data from PyBer.

data-analysis data-visualization jupyter-notebook matplotlib pandas python

Last synced: 04 May 2026

https://github.com/flytomarsz/bike-sharing-system-analysis

This analysis project aims to identify bike rental behavior in 2012 in the Capital Bikeshare system, Washington D.C., USA. This project is part of my Data Analysis study at Dicoding.

data-analysis data-visualization jupyter-notebook python streamlit

Last synced: 04 May 2026

https://github.com/riyouuyt/investigate-hotel-business-using-data-visualization

Explore hospitality data, visualizing customer behavior in hotel reservations and its impact on cancellations for strategic insights.

businessreporting data-visualization hotel-booking jupyter-notebook matplotlib-pyplot presentation python

Last synced: 05 May 2026

https://github.com/codewithmayank-py/box-office-analysis-with-seaborn-and-python

This repository contains Python code and datasets for analyzing box office data. Explore trends, patterns, and factors influencing movie performance.

analysis box-office-data-analysis data-analysis data-visualization dataset jupyter-notebook matplotlib pandas python3 seaborn

Last synced: 05 May 2026

https://github.com/sundarmd/digital_twin_for_li-ion_batteries

Digital Twin for Li-ion batteries on AWS built using S3, EC2, SageMaker, Redshift, Terraform, QuickSight

aws-ec2 aws-s3 data-visualization iot python3 pytorch sql terraform

Last synced: 05 May 2026

https://github.com/shruthin4/ipl-cricket-analysis-2007-2024

In-depth IPL Cricket Data Analysis (2007–2024) with visual insights on teams, players, and match outcomes.

analysis cricket data-visualization eda ipl python sports-analytics

Last synced: 05 May 2026

https://github.com/donmaruko/python-eda-toolkit

CLI-run EDA with 30 commands covering text-related functions, statistical calculations, data visualization, and data manipulation.

data data-analysis data-science data-visualization matplotlib pandas scipy seaborn statistical-analysis statistics wordcloud

Last synced: 06 May 2026

https://github.com/benjaminrose/data-analysis-book

A Jupyter Book for my Spring 2025 PHY 5381 class on Data Analysis

book data-analysis data-science data-visualization jupyter-book open-book python r statistics-course

Last synced: 06 May 2026

https://github.com/hashkanna/youtube-wrapped-2025

🎬 Spotify Wrapped-style dashboard for YouTube watch history. Built with React + Tessl spec-driven development for EF x Tessl x Anthropic Hackathon.

analytics data-visualization hackathon react recharts tessl vite wrapped youtube

Last synced: 06 May 2026

https://github.com/sankaran-s2001/us-traffic-accidents-analysis-python-eda

Exploratory data analysis of US traffic accidents from 2016-2023, analyzing patterns by time, location, weather, and severity using Python data science libraries.

data-analysis data-science data-visualization eda matplolib numpy pandas python

Last synced: 06 May 2026

https://github.com/arsh-jafri/econostats

A real-time economic data visualization platform that helps track and analyze key economic indicators through interactive charts, custom datasets, and FRED API integration.

aws data-visualization economic-data economics elasticbeanstalk federal-reserve flask fred-api

Last synced: 06 May 2026

https://github.com/heyhaiden/mcp-ag-grid

Headless AG Grid server for advanced data visualization, manipulation, and export, seamlessly integrated with Claude Desktop.

ag-grid claude-desktop data-grid data-visualization headless-browser mcp open-source puppeteer

Last synced: 06 May 2026

https://github.com/bartosz-ziolkowski/social_data

Data analysis of San Francisco's crime data set from 2003 to 2018

data-science data-visualization jupyter-notebook numpy pandas python

Last synced: 06 May 2026

https://github.com/urbanekda/upwork_dashboard

A data analysis project examining trends and patterns in the data science job market on Upwork. This project analyzes job postings, requirements, and market demands to provide insights into the freelance data science ecosystem.

data-analysis data-science data-science-projects data-visualization freelance jupyter-notebook python streamlit

Last synced: 07 May 2026

https://github.com/karlyndiary/coffee-shop-sales-analysis

Comprehensive analysis of coffee shop sales utilizing Pandas for data cleaning and exploratory data analysis (EDA), complemented by Streamlit for creating interactive data visualization dashboards.

data-analysis data-cleaning data-preprocessing data-visualization eda pandas streamlit streamlit-dashboard

Last synced: 07 May 2026

https://github.com/badr-moufad/dashboard-agri-edge-frontend

Dashboard of Moroccan weather data adapted to the wheat calendar. This is part of my research internship.

clustering dashboard data-visualization morocco-regions plotlyjs reactjs redux tailwindcss weather

Last synced: 07 May 2026

https://github.com/ganesh774218/eda-book-store

Exploratory data analysis on a book store dataset to uncover sales trends, popular genres, and top publishers.

data-visualization datacleaning datamanipulation eda matplotlib numpy pandas python pythonp pythonproject seaborn

Last synced: 07 May 2026

https://github.com/sitek94/react-interactive-scatter-plot

An interactive scatter plot made with React and D3

d3 data-visualization react

Last synced: 07 May 2026

https://github.com/nicovandenhooff/wids-datathon-2022

This repository contains my solution for the 2022 Women in Data Science Kaggle competition, which obtained a top 10% leaderboard standing.

catboost data-visualization datascience energy-consumption ensemble-learning exploratory-data-analysis kaggle lightgbm machine-learning scikit-learn women-in-data-science xgboost

Last synced: 07 May 2026

https://github.com/tjas/postgrad-ai-ddv-plotly

Jupyter Notebook that analyzes the salaries of Federal District government public servants, using Python, Pandas and Plotly Express, to solve the exercise proposed in the "Data Discovery and Visualization" course.

analysis analytics data data-analytics data-discovery data-science data-visualization graph graphs jupyter-notebook jupyter-notebooks pandas plotly plotly-express python

Last synced: 07 May 2026

https://github.com/amarlearning/exploring-67-years-of-lego

In this project, I explore a database of every LEGO set ever built.

data-manipulation data-visualization importing-and-cleaning-data jupyter-notebook pandas python

Last synced: 07 May 2026

https://github.com/satyam4229/identify-employee-attrition

This model predicts employee attrition at a company from employee records. The dataset includes features such as salary, environment, age, gender, and experience.

data-analysis data-science data-visualization jupyter-notebook kaggle python

Last synced: 08 May 2026

https://github.com/ropaxyz/octobot-octopus-energy-discord-bot

A Discord bot for Octopus Energy users to track and visualize their energy consumption. Integrates with Octopus Energy's API to fetch and display personalized energy data, costs, and usage charts.

asyncio data-visualization discord-bot energy-monitoring graphql matplotlib octopus-energy octopus-energy-api python rest-api sqlite

Last synced: 08 May 2026

https://github.com/femincan/d3-scatterplot-graph

My solution for the Visualize Data with a Scatterplot Graph project on FCC.

css3 d3js data-visualization html5 javascript

Last synced: 08 May 2026

https://github.com/dsaikiran01/seismomap

An interactive React + Leaflet web app that visualizes real-time global earthquakes from the USGS API with live map markers, magnitude filtering, dark mode, and responsive UI.

data-visualization earthquakes geoscience material-ui react react-leaflet tailwindcss usgs-api vite

Last synced: 08 May 2026

https://github.com/js-konda/naturaldisasterseda

The project repository for the exploratory data analysis of natural disasters done as part of the ECE143 course at UCSD

data-science data-visualization pandas python visualization

Last synced: 08 May 2026

https://github.com/iyashwantsaini/911_capstone

For this capstone project we will be analyzing some 911 call data from Kaggle.

capstone data-science data-visualization python3

Last synced: 10 Jun 2026