An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/ishmal793/bi-dummy-

An interactive and beginner-friendly data dashboard built using Streamlit. Upload your own CSV or Excel file, apply filters, view key statistics, and generate beautiful visualizations with no coding required.

data-analytics data-visualization eda pandas plotly python-dashboard streamlit

Last synced: 01 May 2026

https://github.com/corezen/graph-voyager

Interactive React app for visualizing and exploring graph data structures. Built with Vite and Tailwind CSS.

data-visualization github-pages graph-visualization react tailwindcss vite

Last synced: 01 May 2026

https://github.com/abdoomohamedd/python-data-analysis-projects

A collection of data analysis projects that I have worked on. Each project involves cleaning data, performing exploratory data analysis (EDA), and creating visualizations to extract insights. The projects utilize various Python libraries, including pandas, numpy, matp

data-analysis data-analysis-python data-cleaning data-visualization matplotlib matplotlib-pyplot numpy pandas python

Last synced: 01 May 2026

https://github.com/treyhamilton/baccarat-betting-simulator

Streamlit app simulating a baccarat betting strategy with full results visualization and CSV output.

baccarat betting data-visualization gambling machine-learning simulator streamlit

Last synced: 01 May 2026

https://github.com/vedantshi/stock-price-prediction-for-maang-companies

This project utilizes Long Short-Term Memory (LSTM) networks to forecast stock prices. It includes steps for data preprocessing, model training, and visualization of predictions using Python in Jupyter Notebook. The project demonstrates proficiency in machine learning, data analysis, and Python programming.

data-analysis data-visualization lstm machine-learning python stock-price-prediction

Last synced: 01 May 2026

https://github.com/guptakushal03/whatsapp-chat-analyser

The WhatsApp Chat Analyzer is a Python-based tool built with Streamlit for analyzing WhatsApp chat data. It provides insights such as total messages, word count, media shared, links shared, monthly activity timeline, most active users, activity maps, and word clouds.

chat-analysis data-analysis data-visualization python streamlit text-processing whatsapp word-cloud

Last synced: 01 May 2026

https://github.com/mrwilbroad/react-data-visualizaion

Data visualization with reactjs , react-chartjs-2

chartjs data-visualization graphs react-chartjs-2 reactjs

Last synced: 02 May 2026

https://github.com/vaxdata22/redfin-analytics-etl-using-amazon-emr-by-airflow-on-ec2

This is an end-to-end AWS Cloud ETL project. This data pipeline uses an Amazon EMR cluster managed by Apache Airflow that is running on an AWS EC2 instance. It demonstrates how to build orchestration that would perform data transformation using Amazon EMR as well as automatic data ingestion into a Snowflake via Snowpipe. It also features Power BI.

amazon-emr-cluster apache-airflow apache-spark aws-ec2 aws-s3 business-intelligence dags data-visualization etl-pipeline google-colab-notebook orchestration power-bi pyspark redfin snowflake snowpipe sqs-queue

Last synced: 02 May 2026

https://github.com/harshindcoder/salifort_motors_project

This people analytics project analyzes factors influencing employee turnover and predicts whether an employee is likely to leave. It aims to uncover patterns behind departures, helping Salifort improve retention, workplace culture, and professional growth strategies.

data-analysis data-science data-visualization hr-analytics machine-learning tree-models

Last synced: 02 May 2026

https://github.com/rbreeze/dashboard

My personal health dashboard, with daily stats on food and sleep. Undergone several redesigns since 2015.

css dashboard data data-visualization design front-end google-sheets google-sheets-api health html javascript personal-health-record personal-website running static static-site visualization

Last synced: 02 May 2026

https://github.com/teja-1403/ignosis-tech-ml-assignment

Analysis of transaction data to identify the most profitable products and key customer segments, providing insights for targeted marketing strategies.

customer-segmentation data-analysis data-visualization machine-learning marketing-strategy python

Last synced: 02 May 2026

https://github.com/kimaruthagna/segmente

A journey through understanding customer segmentation using python with the general goal of encouraging data driven decision making

clustering crosstab customer-segmentation data-science data-visualization knn-classification lifetime-value pandas rfm-analysis seaborn

Last synced: 02 May 2026

https://github.com/bhawnagoyal18/ai-doctor-a-symptom-checker-disease-predictor

AI Doctor is an intelligent healthcare application that utilizes machine learning (ML) and Python to predict potential diseases based on user-input symptoms. The project integrates data from multiple medical datasets and provides an interactive web-based UI for an intuitive user experience.

data-analysis data-engineering data-visualization dataset flask html5 machine-learning python sql stacking statistics

Last synced: 02 May 2026

https://github.com/rorrell/employmentdata

A Jupyter Notebook where I use group by to analyze the average unemployment rate by year

data-analysis data-visualization jupyter-notebook python3

Last synced: 02 May 2026

https://github.com/holy-angel-university/global-cost-index-analysis

This analysis explores the cost of living across various countries, aiming to provide insights into economic disparities and living standards on a global scale. Utilizing a dataset that includes indices for overall cost of living, groceries, restaurant prices, and rent, we investigate the top and least expensive countries worldwide.

data-science data-visualization exploratory-data-analysis jupyter-notebook python3

Last synced: 02 May 2026

https://github.com/debjyotisaha/web-application-projects-streamlit-phase-2

This repository showcases interactive web applications built using the Streamlit framework.

dashboard data-visualization python streamlit

Last synced: 02 May 2026

https://github.com/femincan/d3-treemap-diagram

My solution for the Visualize Data with a Treemap Diagram project on FCC.

css3 d3js data-visualization html5 javascript

Last synced: 02 May 2026

https://github.com/gkar90/gdp-vs-life-expectancy

Statistical analysis on GDP vs Life Expectancy

data-science data-visualization statistical-analysis

Last synced: 09 Jun 2026

https://github.com/davityak03/stock-value-prediction-and-forecasting

This project predicts stock market trends using an LSTM neural network, focusing on Apple Inc.'s historical data for accurate future price forecasting. It includes data retrieval, preprocessing, model training, and evaluation.

data-visualization datareader lstm pandas python tensorflow tiingo

Last synced: 03 May 2026

https://github.com/ronaldkanyepi/southern-africa-health-indicators-analysis

Visualizing Health Trends in Southern Africa: Analyzing World Bank data (2013–2020) to uncover patterns in life expectancy, infant mortality, HIV prevalence, and maternal mortality.

botswana data-visualization dplyr ggplot2 ggtext health healthcare mozambique patchwork r south-africa tidyverse viridis zimbabwe

Last synced: 09 Jun 2026

https://github.com/fatihilhan42/tourist_analysis_in_turkey_with_python

In this project, the number of tourists coming to Turkey between 2008-2021 was analyzed. The data from the data set you can find in the warehouse was first organized using data cleaning algorithms. These cleaned data were then output graphically using data visualization algorithms.

data-analysis data-cleaning data-science data-visualization jupyter-notebook python

Last synced: 03 May 2026

https://github.com/james-julius/latent-space-explorer

A flythrough 3D map of meaning — type any concept and watch it land near related ideas. In-browser embeddings (no install, no key), pre-seeded scenes, and a multi-model knowledge explorer.

3d-visualization ai claude data-visualization embeddings gemini latent-space llm machine-learning nextjs openai react-three-fiber semantic-search text-embeddings threejs transformers-js typescript umap vector-search webgpu

Last synced: 09 Jun 2026

https://github.com/nix7amcm/fcc-data-viz-cert-projects

These are my projects for the freeCodeCamp Data Visualization certification.

d3 d3-visualization d3js data-visualization data-viz freecodecamp freecodecamp-project html-css-javascript

Last synced: 03 May 2026

https://github.com/ronitjariwala/prodigy_ds_05

Prodigy InfoTech Data Science Internship Task-5

data-science data-visualization python

Last synced: 03 May 2026

https://github.com/baggiponte/ta-business-statistics-2023

🎓 Business Statistics [UniMi • AY 2022/2023]

data-mining data-science data-visualization r tidyverse

Last synced: 03 May 2026

https://github.com/nathadriele/diabetes-clinical-etl-pipeline

Este projeto de Engenharia de Dados em Saúde Pública implementa um pipeline completo para coletar, tratar, padronizar, validar, integrar e visualizar dados públicos do SUS relacionados ao Diabetes Mellitus no Brasil, filtrando pelos códigos CID-10 E10 a E14.

cid data-analysis data-extraction data-pipeline data-science data-structures data-visualization datasus diabetes-detection diabetes-prediction epidemiology-analysis etl-pipeline healthcare-analytics ibge logger pytest sih streamlit sus

Last synced: 09 Jun 2026

https://github.com/ledsouza/reg-linear-ii

Projeto de estudo de técnicas avançadas de modelagem em regressão linear

data-science data-visualization linear-regression pandas seaborn sklearn statsmodels vitrinedev

Last synced: 03 May 2026

https://github.com/leandrocollares/foreign-born-population-in-canada

Responsive bar chart that shows the percentage of foreign-born population in Canada between 1871 and 2011

d3 data-visualization svelte

Last synced: 03 May 2026

https://github.com/ljadhav25/swiggy-restaurant-analysis

This repository contains data and analysis related to restaurants listed on Swiggy, one of India's largest online food ordering and delivery platforms. The objective is to explore restaurant trends, customer reviews, pricing strategies, and delivery metrics to gain insights into the food delivery industry.

data-analysis data-visualization matplotlib-pyplot numpy-library pandas-library python seaborn-plots

Last synced: 03 May 2026

https://github.com/joaopalmeiro/b2-presentation

Materials for the presentation of the paper: "B2: Bridging Code and Interactive Visualization in Computational Notebooks" (Yifan Wu, Joseph M. Hellerstein, Arvind Satyanarayan — UIST 2020).

data-visualization interactive-visualizations jupyter jupyter-notebook jupyter-notebook-extension python r vega-lite xaringan

Last synced: 03 May 2026

https://github.com/fnfkkengine/website-performance-data-analysis-project

🌐 Analyze website performance data to gain insights on user engagement, traffic trends, and optimize your digital presence effectively.

analysis analytics dashboard data-analyst data-collection data-preprocessing data-visualization dax funnel-analysis jupyter-notebook matplotlib mysql numpy paid-search power-query project-management python storytelling

Last synced: 04 May 2026

https://github.com/pejpero/multi-class_classifier

This project demonstrates multiclass classification using Perceptron and Logistic Regression, implemented from scratch without using built-in libraries. It includes techniques like One-Versus-The-Rest and One-Versus-One for Perceptron, and Softmax for Logistic Regression, with a focus on understanding core ML concepts.

data-visualization logistic-regression machine-learning multiclass-classification one-vs-one one-vs-rest perceptron python scratch-implementation softmax

Last synced: 09 Jun 2026

https://github.com/rumana-amin/sql-data-warehouse-and-analytics-project

Building a data warehouse using with SQL Server, including ETL processes, data modeling, and analytics

data-cleaning data-visualization data-warehouse etl etl-pipeline sql-queries sql-server

Last synced: 09 Jun 2026

https://github.com/gerhynes/d3-pie-chart

A D3 pie chart showing UN birth data grouped by continent. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 04 May 2026

https://github.com/leosimoes/udacity-communicate-data-findings-titanic

Project 5 of the Udacity Data Analyst Nanodegree Program. Data analysis with views of the titanic dataset.

data-science data-visualization python

Last synced: 04 May 2026

https://github.com/xiaohan2012/myunisport

Visualize your Unisport annual training records

data-analysis data-visualization pandas pygal sports-stats tikzposter

Last synced: 04 May 2026

https://github.com/furkalb/global-power-plant-analysis

Global Power Plant Database Analysis is a Streamlit-based interactive web app designed to explore and analyze worldwide energy generation data

analysis data-science data-visualization energy-analysis exploratory-data-analysis gis global-energy machine-learning mapbox plotly predictive-modeling python streamlit streamlit-dashboard

Last synced: 04 May 2026

https://github.com/mr-chang95/sf_data_visualization

In this personal project, I am interested in examining all of the active businesses in the San Francisco Bay Area while performing some simple data visualizations, mainly on categorical variables.

business data-analysis data-visualization jupyter-notebook pandas python san-francisco

Last synced: 04 May 2026

https://github.com/ljadhav25/logistic-regression-data-science-

Logistic regression estimates the probability of an event occurring, such as voted or didn’t vote, based on a given data set of independent variables.

data-analysis data-science data-visualization logestic-regression machine-learning

Last synced: 04 May 2026

https://github.com/mugilan1309/csv_analyzer

📊 A simple Streamlit-based CSV Analysis & Preprocessing Tool for quick data insights.

csv-processing data-analysis data-visualization machine-learning python streamlit

Last synced: 04 May 2026

https://github.com/bishopce16/pyber_analysis

The purpose of this project was to complete an exploratory analysis and create visualizations of the 2019 ride sharing data from PyBer.

data-analysis data-visualization jupyter-notebook matplotlib pandas python

Last synced: 04 May 2026

https://github.com/thuongtruong1009/charts

📊 Visualize data with common charts

chart chartjs csv d3 data-visualization nextjs react reactjs typescript

Last synced: 05 May 2026

https://github.com/tanyagarg25/project_covidanalysis

This repository is a project for analyzing COVID-19 data using SQL and visualizing it with Tableau. Technologies used include SQL for querying and Tableau for data visualization.

analysis dashboard data data-visualization sql tableau

Last synced: 08 Feb 2026

https://github.com/athari22/extracting-and-visualizing-stock-data

The idea of the project is to work for a new startup investment firm that helps customers invest their money in stocks.

amazon dashboard data-science data-visualization database ibm ibm-watson pandas python sql tesla visualization

Last synced: 13 Apr 2026

https://github.com/sroman0/data-analytics

Data Analytics Exercises is a collection of comprehensive university-level exercises aimed at enhancing skills in data analytics. The repository includes practical notebooks covering data manipulation, exploratory data analysis (EDA), statistical analysis, data visualization, and machine learning fundamentals.

data-analysis data-analytics data-science data-visualization education exercises exploratory-data-analysis hands-on-practice jupyter-notebook machine-learning python statistics

Last synced: 15 Apr 2026

https://github.com/mdaltamashalam/uber-fare-prediction-models

Predicts the fare amount of Uber rides based on various factors such as pickup/drop-off coordinates, passenger count, and trip distance.

catboost data-analysis data-cleaning data-visualization lgbm-regressor machine-learning matplotlib numpy pandas python random-forest regression-models skit-learn xgboost-algorithm

Last synced: 26 Feb 2026

https://github.com/jacekkala/world_road_safety_report

World Road Safety Analysis - Power BI Report

data-visualization power-bi public-health road-safety who-data

Last synced: 08 Feb 2026

https://github.com/siddhant2105s/airline-performance-analysis-dashboard

Enhancing Airline Performance Analysis for the Department of Transport

data-analysis data-visualization tableau

Last synced: 08 Feb 2026

https://github.com/urvee1810/market_basket_analysis

A data mining project analyzing Instacart's 3 million grocery orders to uncover customer shopping patterns and product associations. Using market basket analysis and the Apriori algorithm, the project reveals key insights about shopping behavior, product combinations, and temporal patterns, providing valuable recommendations for retail strategy

apriori-algorithm data-mining data-visualization machine-learning market-basket-analysis matplotlib mlxtend numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/saisurajmatta/data-professionals-survey-data-visualization-project

Transformed survey data of data professionals using Power BI, showcasing diverse visualizations and analytics, with emphasis on job roles, programming languages, and geographical insights.

data-cleaning data-visualization power-query powerbi powerbi-dashboards powerbi-visuals

Last synced: 08 Feb 2026

https://github.com/teditae/us-states-guessing-game

A Python game built with Turtle and Pandas where the player guesses the names of U.S. states and sees them appear on a U.S. map. A fun way to learn geography while coding!

data-visualization geography-game pandas python turtle

Last synced: 22 Apr 2026

https://github.com/leftcoastnerdgirl/visualizations_with_tableau

This projects gives the student an opportunity to demonstrate their skills in data visualization.

data-analytics data-visualization storytelling storytelling-with-data tableau

Last synced: 09 Feb 2026

https://github.com/amish5ingh/cricket-data-analytics-ipl

Data analysis and visualization of IPL 2022 matches using Python, Pandas, Matplotlib, and Seaborn. Includes insights on match outcomes, player performances, toss trends, and venue stats with 12+ charts.

data-analysis data-visualization ipl-data-analysis ipl-data-visualization jupiter-notebook matplotlib-pyplot numpy pandas python seaborn

Last synced: 09 May 2026

https://github.com/jordansrowles/rowles.winforms.vizmorph

A versatile WinForms chart control that smoothly transitions between line and bar chart modes, featuring customizable colors, axes, animations, and persistent value labels. Ideal for dynamic data visualization with support for negative values and real-time updates.

data-visualisation data-visualization winforms winforms-controls

Last synced: 23 Apr 2026

https://github.com/HarmoniCode/Filtra

Digital Filter Designer is a powerful application built using PyQt5 and Matplotlib. It allows users to design and visualize digital filters, including standard filters and all-pass filters, and generate corresponding C code. Ideal for students, researchers, and engineers in digital signal processing.

data-visualization digital-signal-processing filter-design pyqt5 real real-time-processing

Last synced: 09 Oct 2025

https://github.com/smahala02/materials-science-introduction

Introduction to Materials Science concepts using Python for array manipulation and visualization with NumPy and Matplotlib.

data-visualization materials-science matplotlib numpy python scientific-computing

Last synced: 09 Feb 2026

https://github.com/jlee9503/telecommunication-churn

Analyze key factors influencing customer churn using Python data analytics technique. Explore key factors through data preprocessing, exploratory data analysis (EDA), and predictive modeling.

data-analysis data-visualization matplotlib pandas python scikit-learn

Last synced: 18 Jan 2026

https://github.com/tyriek-cloud/power-bi-nyc-housing-financial-report

This report was conducted to provide a comprehensive analysis of various NYC housing and financial data.

dashboard data-analysis data-visualization financial-analysis powerbi statistics

Last synced: 21 Jan 2026

https://github.com/ayushi-gajendra/buenos-aires-subway-statistics

A comprehensive data analysis of the Buenos Aires subway system ridership using Python and Pandas. This project identifies peak-hour congestion patterns, explores hourly passenger distributions, and utilizes the 95th percentile to isolate extreme traffic conditions for urban mobility insights.

95th-percentile buenos-aires data-analysis data-science-portfolio data-visualization matplotlib pandas python statistical-analysis subway-ridership transit-data urban-mobility

Last synced: 05 Jun 2026

https://github.com/alexquilis1/spanish-fuel-stations-analysis

Real-time analysis of Spanish fuel prices using government API data with interactive maps and regional comparisons

data-analysis data-visualization fuel-prices geospatial-analysis ggplot2 government-data leaflet open-data r shiny spain tidyverse

Last synced: 08 Oct 2025

https://github.com/mayurasandakalum/ipl-data-engineering-spark-databricks

Comprehensive data engineering and analytics project using IPL dataset with Amazon S3, Apache Spark, Databricks, and SQL. Includes data storage, transformation, analysis, and visualization.

amazon-s3 apache-spark aws big-data cricket-analytics data-analytics data-engineering data-visualization databricks etl-pipeline ipl-dataset machine-learning python sql

Last synced: 09 Feb 2026

https://github.com/tranngoca5039/bigquery-a5y

📊 Streamline your data analysis with bigquery-a5y, a powerful tool for optimizing BigQuery performance and improving query efficiency.

analytics api big-data bigquery cloud-computing data-analysis data-integration data-management data-pipeline data-visualization data-warehouse google-cloud machine-learning serverless sql

Last synced: 05 Jun 2026

https://github.com/shubham200137/spotify-listening-habits-analytics

Spotify Listening Habits Analytics is a project aimed at analyzing personalized Spotify listening habits and music trends. It involves Exploratory Data Analysis (EDA) with Python Pandas, data processing using SQL Server, and creating visualizations with Power BI. The goal is to uncover insights into listening patterns, track popularity, and artist.

data-analysis data-visualization exploratory-data-analysis jupyter-notebook pandas power-bi-dashboard sqlserver

Last synced: 18 Mar 2026

https://github.com/syncfusionexamples/how-to-add-arrows-to-the-chart-axis-in-wpf-chart

Learn how to enhance WPF charts by adding arrows to the chart axes using annotations for improved visualization and clarity.

axis-with-arrows chart-annotations chart-axis chart-customization charting-library charts data-visualization line-annotation wpf-char wpf-sfcharts

Last synced: 08 Oct 2025

https://github.com/franckalbinet/teuvo

Self-Organising Map implemented as Literate Programming

data-visualization dimensionality-reduction neural-network

Last synced: 09 Feb 2026

https://github.com/shivani0126/resturant_rating_analysis

Restaurant ratings Analysis is a project where real consumers from 2012, including additional information about each restaurant and their cuisines, and each consumer and their preferences are visualised through Power BI dashboard.

dashboard data-visualization dataanalysis datamodeling dataprep dax-functions powerbi

Last synced: 27 Jan 2026

https://github.com/syed-nihaal/car-price-prediction-and-performance-analysis

A data science notebook project focused on analyzing car features and building a model for car price prediction.

data data-analysis data-visualization jupyter-notebook python

Last synced: 23 Apr 2026

https://github.com/pedromarquetti/spotify-analyser

Data analyser for Spotify account usage

data-visualization python spotify

Last synced: 18 Mar 2026

https://github.com/pranavsp108/market_basket_analysis-instacart

Customer segmentation and market basket analysis using the Instacart dataset with Python, Pandas, and K-Means clustering.

customer-segmentation-and-buying-behavior data-analysis data-visualization instacart jupyter-notebook kmeans-clustering market-basket-analysis pandas python scikit-learn

Last synced: 05 May 2026

https://github.com/lgibson7/stat_651

Coursework for STAT 651 Data Visualization, Cal State East Bay Fall 2022

data-visualization leaflet shiny-apps tableau

Last synced: 05 Feb 2026

https://github.com/rickyarians/scrappy-do

Projek ini dikembangkan sebagai salah satu capstone project dari Algoritma Academy Data Analytics Specialization. Deliverables yang diharapkan dari projek ini adalah melakukan simple webscrapping untuk mendapatkan informasi.

beautifulsoup4 data-science data-visualization flask flask-application pandas webscraping

Last synced: 15 Apr 2026

https://github.com/mardavsj/matplotlib-in-python

The fundamentals of Python Matplotlib Library.

data-visualization matplotlib python

Last synced: 15 May 2026

https://github.com/ammahmoudi/statisticallearning

Homework Solutions for Statistical Learning Course as Computer Science B.Sc. Student at Department of Mathematical Sciences, Sharif University of Technology

data-visualization feature-selection logistic-regression random-forest

Last synced: 26 Feb 2026

https://github.com/pejpero/neural_network_regression_and_classification

This repository contains neural network regression models built from scratch and using Keras for comparison. It visualizes training and testing performance, analyzing MSE, R², and decision boundaries. The project demonstrates learning techniques and optimization for regression tasks.

data-visualization keras machine-learning neural-network regression

Last synced: 09 Feb 2026

https://github.com/ninadpatil09/heart_disease_detection_analysis

The Heart Disease Detection Analysis aims to create a predictive model for identifying individuals at risk of heart disease. Using a dataset with attributes like age, sex, and health metrics, the project focuses on distinguishing patients with and without heart disease.

data-analysis data-cleaning data-science data-visualization machine-learning

Last synced: 15 Apr 2026

https://github.com/mackenly/suicide-rate-map-explorer

An interactive map that plots U.S. CDC suicide rate and population data on a county level built with React and Python.

cdc data-visualization react suicide-data suicide-prevention

Last synced: 01 Apr 2025