An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/pragmaai/yelp-datapipeline

🍽️ Yelp Data Pipeline & Analytics Dashboard End-to-end data engineering pipeline processing Yelp dataset with Rust transforms, Apache Airflow orchestration, and interactive Streamlit analytics. Features business insights, user engagement analysis, and city performance comparisons. 🚀 Docker-ready • 📊 Interactive Dashboard • ⚡ High-performance R

airflow data-engineering data-pipeline data-visualization datafusion docker rust streamlit yelp yelp-dataset

Last synced: 04 May 2026

https://github.com/udipta14/historical-olympic-games-eda-python

Exploratory Data Analysis of a Historical Olympic Games Dataset, including all the games from Athens 1896 to Rio 2016.

data-cleaning data-visualization eda matplotlib numpy pandas python3 seaborn

Last synced: 28 Apr 2026

https://github.com/tyfiero/ada-priceart-explorer

Generative art NFT project on Cardano that uses real ADA/USD price data to create 3D candlestick art

ada cardano cnft data-visualization generative-art javascript nft p5js

Last synced: 28 Apr 2026

https://github.com/al-chris/whatsapp-dashboard-web

A client-side only web application for analyzing and visualizing WhatsApp chat exports. This version runs entirely in your browser without requiring any server or backend - your data never leaves your device!

data-visualization javascript whatsapp

Last synced: 28 Apr 2026

https://github.com/incalculable-driverslicence975/data-projects-portfolio

📊 Showcase data projects that highlight analytics, machine learning, and MLOps with reproducible code and clear business insights.

ai computer-vision dashboard data-science-projects data-visualization deep-learning etl excel finance hadoop hiveq keras machine-learning nlp pandas portfolio-project scikit-learn tableau-dashboards

Last synced: 28 Apr 2026

https://github.com/pejpero/multi-class_classifier

This project demonstrates multiclass classification using Perceptron and Logistic Regression, implemented from scratch without using built-in libraries. It includes techniques like One-Versus-The-Rest and One-Versus-One for Perceptron, and Softmax for Logistic Regression, with a focus on understanding core ML concepts.

data-visualization logistic-regression machine-learning multiclass-classification one-vs-one one-vs-rest perceptron python scratch-implementation softmax

Last synced: 09 Jun 2026

https://github.com/rorrell/coviddeaths

A Jupyter Notebook where I create several visualizations based on data about COVID-19 deaths from 2020 to 2024

data-analysis data-visualization jupyter-notebook python3

Last synced: 28 Apr 2026

https://github.com/razalkr70/customer-segmentation-using-dataset

A data science project that segments mall customers using K-Means clustering. Based on age, income, and spending score, it identifies customer groups and visualizes them with 2D and 3D plots for targeted marketing insights.

clustering customer-segmentation data-science data-visualization kmeans machine-learning pca python scikit-learn

Last synced: 28 Apr 2026

https://github.com/sawaira-iqbal/used-cars-price-prediction-ml-project

🚗 The Used Car Price Prediction project uses advanced ML models like Random Forest 🌲, Decision Tree 🌳, XGBoost 🚀, and SVR 🔍 to predict used car prices, enhancing buying and selling decisions.

data-visualization decision-tree machine-learning price-prediction python random-forest-regressor support-vector-machine xgboost

Last synced: 28 Apr 2026

https://github.com/parmaster/logserver

Collecting telemetry data posted by IoT devices into MQTT, filtering, processing and visualizing results

data-visualization golang-template iot-application mqtt mqtt-client template

Last synced: 04 May 2026

https://github.com/benzerinsio/winequality-eda

📊 Análise Exploratória de Dados (EDA) - Vinhos Tintos | Exploração de características físico-químicas e sua relação com qualidade

analise-de-dados analise-exploratoria analise-exploratoria-de-dados data-visualization eda exploratory-analysis exploratory-data-analysis food-science python quality-control seaborn wine wine-quality

Last synced: 28 Apr 2026

https://github.com/ezrahsieh/narrativevisualization

This project is an interactive narrative visualization designed to illustrate the impact of the COVID-19 pandemic on global life expectancy. The visualization is implemented using D3.js and follows the Martini glass narrative structure. This serves as the final project for CS416 at UIUC.

d3 data-visualization interactive-visualizations javascript narrative-visualization

Last synced: 28 Apr 2026

https://github.com/rumana-amin/sql-data-warehouse-and-analytics-project

Building a data warehouse using with SQL Server, including ETL processes, data modeling, and analytics

data-cleaning data-visualization data-warehouse etl etl-pipeline sql-queries sql-server

Last synced: 09 Jun 2026

https://github.com/robertovicario/uninsubria-datavisualization-project-work

Project Work for the Data Visualization module in the MSc in Computer Science program in Varese.

data-visualization dogecoin elonmusk python

Last synced: 28 Apr 2026

https://github.com/ryanga09/digitalent_fundamentaldatascience-selfpractice

A repository of hands-on projects from DigiTalent’s Fundamental Data Science training, covering web scraping, data exploration, data cleaning, and data annotation. Includes Jupyter notebooks and example code for practical learning.

data data-analysis data-science data-visualization dataset digitalent komdigi notebook-jupyter notebooks

Last synced: 02 Aug 2025

https://github.com/jigyasag18/airline-performance-and-passenger-satisfaction-project-using-big-data-analytics

This project analyzes 10 years of U.S. domestic airline data (~3GB) using Hadoop (Cloudera) and Hive for data processing. Power BI dashboards visualize key metrics like delays, on-time rates, air time, and diversions. The solution includes Hive queries, DAX measures, HDFS ingestion scripts, and year-wise insights with recommendations.

big-data big-data-analytics bigdata cloudera cloudera-hadoop cloudera-hadoop-framework data data-analysis data-visualization database hadoop hive power-bi powerbi powerbi-dashboard powerbi-dashboards powerbi-report powerbi-visuals powerbi-visuals-tools powerbidashboard

Last synced: 01 Aug 2025

https://github.com/angchekar28/valorant-gameplay-analysis

This project analyzes Valorant gameplay data to understand key factors affecting match outcomes. It compares various machine learning models to predict player performance, rank classification, and match success.

data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook machine-learning python

Last synced: 12 Apr 2026

https://github.com/syncfusionexamples/ej2-angular-7-heatmap

A quick start project that helps you to create an Angular 7 Heatmap with minimal code configuration.

angular-heatmap angular7 data-visualization ej2-heatmap

Last synced: 03 Apr 2025

https://github.com/shridhar1504/loan-classification-datascience-project

This project uses machine learning algorithms to predict the classification of loan status. The dataset is loaded and some transformation is done using SQL for getting a proper dataset with some valid informations.

classification data-analysis data-cleaning data-science data-visualization eda loan-prediction loan-status machine-learning predictive-modeling sql supervised-learning

Last synced: 09 Apr 2025

https://github.com/soajala/shopify-sales-analysis-powerbi

End-to-end Power BI dashboard project analyzing Shopify sales data with real-time metrics, DAX, and business insights.

business-intelligence data-analysis data-visualization dax interactive-dashboard powerbi sales-analysis shopify

Last synced: 05 Sep 2025

https://github.com/camara94/data_analyse_series_temporelles

Dans ce tutoriel, nous allons répondre aux questions suivantes: 1. Lire les données Microsoft à l'aide du package **Pandas Data reader** 2. Obtenez le **prix maximum** de l'action de **2017 à 2022** 3. Quelle est la **date du cours le plus élevé** de l'action ? 4. Quelle est la **date du cours le plus bas** de l'action ?

data-analysis data-analysis-python data-science data-structures-and-algorithms data-visualization serie series-forecasting

Last synced: 09 Apr 2025

https://github.com/soumyajiitdas/My-GenAICapstoneProject

A Generative AI-powered journaling assistant that analyzes daily entries to extract emotions, stress levels, and mood trends — built using Google Gemini API for mental wellness insights.

ai-assistant data-visualization generative-ai machine-learning mental-health prompt-engineering python

Last synced: 04 Jul 2025

https://github.com/rmitsch/paella

Web application for visual parameter space analysis of topic models utilizing word embeddigs.

data-visualization latent-dirichlet-allocation natural-language-processing topic-modeling word2vec

Last synced: 07 Jan 2026

https://github.com/faysalalmahmud/bd-med-professional-analysis

Analysis of healthcare professionals in Bangladesh through web scraping, data processing, and interactive visualization.

data-analysis data-visualization jupyter-notebook python scraper selenium selenium-webdriver tableau

Last synced: 04 Sep 2025

https://github.com/joaopalmeiro/altair-playground-app

An application powered by Streamlit to test different configurations on an Altair chart.

altair data-visualization streamlit

Last synced: 04 May 2026

https://github.com/neyhere07/music_popularity_prediction

Music popularity prediction involves building machine learning models to estimate the popularity of tracks based on their audio features.

data-science data-visualization eda jupyter-notebook machine-learning python

Last synced: 29 Apr 2026

https://github.com/shrinidhi857/simpledataanalysisonstartups

The Indian startup ecosystem has experienced remarkable growth over the past decade, becoming a hotbed of innovation and entrepreneurship. In this data analysis we are segregating fields ,finding new insights.

data-analysis data-science data-visualization indian-startups

Last synced: 17 Sep 2025

https://github.com/teja-1403/game-of-thrones-analysis

Demonstrate Exploratory Data Analysis on GOT Dataset using plots and graphs and using the information extracted from text.

analysis data-visualization datascience machine-learning python

Last synced: 12 Apr 2026

https://github.com/busesimsek/dataanalysisportfolio

A compilation of my data analysis projects using SQL, Python, and Tableau.

data-analysis data-visualization python sql tableau

Last synced: 12 Jun 2025

https://github.com/emircanakyuzz/veri_gorsellestirilmesi_ve_analizi-analysis_and_visualization_of_dataset

Bu çalışmada numpy, pandas, seaborn ve matplotlib gibi veri biliminde çokca bilinen modülleri kullanarak analiz ve görselleştirme işlemleri gerçekleştirdim.

data-analysis data-science data-visualization jupyter-notebook python

Last synced: 29 Apr 2026

https://github.com/carcesar/salariogovernadores2023

Visualização dos salários dos governadores em 2023

data-science data-visualization politics

Last synced: 24 Apr 2025

https://github.com/carcesar/mg2020

Visualização do mapa com os partidos dos prefeitos eleitos em 2020 - Minas Gerais

altair data-visualization infoviz minas-gerais politics python

Last synced: 24 Apr 2025

https://github.com/sanjana-bongale/cta_ridership_data_visualization_using_tableau

Tableau-based analysis of Chicago Transit Authority (CTA) ridership trends (2015-2024). It includes interactive dashboards, heatmaps, and comparative visualizations to explore bus and rail boarding data, COVID-19 impact, and long-term trends.

customer-analysis dashbaord data-visualization tableau

Last synced: 16 Feb 2026

https://github.com/hyoaru/prime-number-forest-3d

A data visualization of how prime numbers from 1 - 200 would look like if it was a forest

data-visualization plotly python

Last synced: 31 Mar 2025

https://github.com/archanakokate/ml_cardiovascular-disease-prediction-

EDA and Model building to predict the risk of a heart attack using a Logistic Regression and Random Forest Classifier

data-engineering data-visualization exploratory-data-analysis machine-learning-algorithms

Last synced: 17 Mar 2025

https://github.com/chanmeng666/customer-insight

AI-powered customer review analysis platform — sentiment analysis, keyword extraction, topic modeling, and anomaly detection

chinese-nlp customer-feedback customer-insights data-visualization machine-learning nlp python review-analysis sentiment-analysis streamlit text-analysis text-mining topic-modeling

Last synced: 29 Apr 2026

https://github.com/itskshitija/lego-set-explorer

As a part of the Maven Analytics Lego challenge, I developed an interactive Power BI dashboard exploring the evolution of LEGO sets from 1970 to 2022.

data-analysis data-science data-visualization dataanalysis dataset powerbi powerbi-desktop powerbi-report

Last synced: 12 Jun 2025

https://github.com/sanjiban08/coffee-sales-dashboard

Explore your coffee sales like never before with our Interactive Excel Dashboard—unlock insights, track trends, and enhance decision-making for a robust and caffeinated business strategy. ☕📈

data-cleaning data-visualization excel pivot-tables

Last synced: 26 Jan 2026

https://github.com/leosimoes/udacity-communicate-data-findings-titanic

Project 5 of the Udacity Data Analyst Nanodegree Program. Data analysis with views of the titanic dataset.

data-science data-visualization python

Last synced: 04 May 2026

https://github.com/tclzcja/spark-comparison-visualization

A data visualization project I made for Blue Telescope/HP.

client-project data-visualization

Last synced: 15 Jun 2025

https://github.com/shahiakhilesh1304/fitbitcasestudy

This is a case study based on data retrieved from a Fitbit band, and we are making predictions about human behavior based on their mood.

case-study data-visualization fitbit jupyter-notebook numpy python3

Last synced: 13 Apr 2026

https://github.com/martingit2/aiportal-frontend

Frontend for Aracanix. Et interaktivt dashboard bygget med React og TypeScript for å visualisere data og administrere en analyseplattform. Se README for lenker til backend og ML-tjeneste.

clerk dashboard data-visualization fullstack react recharts typescript vite

Last synced: 13 Apr 2026

https://github.com/ashwin331133/powerbi-data_professional_survey_breakdown

This project analyzes survey data from individuals interested in transitioning to the data field. The survey aims to understand their backgrounds, motivations, and the challenges they face. Using Power BI for data visualization, the project provides insights into the demographics and preferences of these aspirants.

data-analysis data-visualization powerbi

Last synced: 03 Jan 2026

https://github.com/afifahhadie/wine-clasificassion

This project focuses on classifying different types of wine using machine learning techniques. The dataset contains various chemical properties of wines, which are used as features to predict the wine class.

classification data-science data-visualization jupyter-notebook machine-learning machine-learning-algorithms pandas scikit-learn wine-dataset

Last synced: 13 Apr 2026

https://github.com/petarran/gun-violence-usa

Data Science project comparing USA gun violence cases to its causes.

data-science data-visualization r

Last synced: 05 Sep 2025

https://github.com/misha-mayskiy/lootbox_analytics

Lootbox Analytics: Your personal dashboard for tracking and analyzing lootbox/gacha opening statistics from popular games. Currently supports Genshin Impact with detailed Pity/luck analysis. (Python, Flask, SQLAlchemy)

chartjs data-visualization flask gacha game-analytics genshin-impact pity-tracker python sqlalchemy statistics

Last synced: 29 Apr 2026

https://github.com/nafiealhilaly/first-dash-app

A simple dash plotly app to explore and analyze imagined students assessment dataset

data-analysis data-analytics data-visualization eda plotly-dash python

Last synced: 02 Apr 2025

https://github.com/kathyreid/geelong-council-elections-2017

Chord diagram of distributed preferences based on Victorian Electoral Commission data

chord-diagram d3js data-visualization

Last synced: 13 Mar 2025

https://github.com/victorlcastro-dsa/pbl-datacamp

This repository features projects from DataCamp's Project-Based Learning (PBL) courses, showcasing practical applications of data analysis, machine learning, and visualization. Explore real-world datasets and interactive results that highlight the skills gained through hands-on learning.

data-analysis data-science data-visualization datacamp-projects hypothesis-testing machine-learning project-based-learning

Last synced: 30 Jun 2026

https://github.com/amg-ai-labs/petrol_station_finder

A Python script to find nearby petrol stations and fuel prices using UK government data.

api data-visualization fuel geo python uk

Last synced: 13 Jun 2025

https://github.com/eduardorodriguesf/youtube-trending-scraper

Scraper program that searches youtube trending videos categories

data-visualization matplotlib pandas seaborn selenium

Last synced: 05 May 2026

https://github.com/analysisbyvivek/Road-Accident

Analyzes road accident patterns, exploring factors like lighting, weather, speed limits, time of day, and road conditions to uncover trends in severity and frequency.

data-analysis data-visualization eda jupyter-notebook kaggle tableau-public

Last synced: 29 Jan 2026

https://github.com/ginga1402/data_visualization_on_honey_production_dataset

Data Visualization using Matplotlib & Seaborn Libraries

college-project data data-visualization

Last synced: 25 Aug 2025

https://github.com/wilkerhop/vanguard-anime-critique

Neo-Brutalist web application demonstrating the Vanguard Analytical Framework for anime critique with interactive data visualizations and comparative analysis.

anime article chartjs critical-analysis css data-visualization github-pages neo-brutalism web-design

Last synced: 29 May 2026

https://github.com/wilkerhop/linestream

A dynamic line visualization using HTML, JavaScript, and SVG. Each point has a vertical position based on its currentPosition, and all points are connected. New points can be added dynamically, updating the visual representation in real time. This project explores JavaScript, DOM manipulation, and SVG rendering.

data-visualization dynamic-graphics frontend html interactive-ui javascript proof-of-concept svg web-development

Last synced: 29 May 2026

https://github.com/erabossid/d3js-treemap

Data visualization with D3js Treemap

d3js data-science data-visualisation data-visualization reactjs

Last synced: 10 Mar 2025

https://github.com/kirby-b/assorted-r-files

Mainly files from learning to use datasets and do data analysis with R

barchart data-visualization r-language r-programming

Last synced: 25 Mar 2025

https://github.com/nature40/casestudies

Case studies for testing the functionality of database systems, sensors, etc

casestudies data-analysis data-visualization database

Last synced: 02 May 2026

https://github.com/hassanislam463/data-cleaning-and-modelling-top-5-categories-analysis-forage

This project involves cleaning, merging, and analyzing datasets to identify the top 5 performing categories based on aggregate popularity scores. It includes cleaned datasets, a final merged dataset, visualizations, and a presentation summarizing the tasks and results. Tools used: Microsoft Excel, Python, and PowerPoint.

data-analysis data-visualization microsoft-excel

Last synced: 07 Jan 2026

https://github.com/matheusafonseca/deploy-ml-models-with-streamlit-udemy

This repository is dedicated to storing the code developed during the "Machine Learning Model Deployment with Streamlit" course on Udemy. The course covers basic to advanced techniques for deploying machine learning models using Streamlit.

data data-science data-visualization interface joblib layout machine-learning optimization-algorithms python python3 sklearn sklearn-datasets sklearn-library sklearn-pipeline streamlit

Last synced: 19 Apr 2026

https://github.com/yanny-alt/banking-customer-retention-analysis

The objective of this analysis is to identify factors contributing to the increased customer churn rate at the bank. The insights gained from this analysis will help business users make informed decisions and develop strategies to improve customer retention and reduce churn.

data-visualization power-bi powerbi-customer-churn-analysis

Last synced: 07 Jan 2026

https://github.com/orbulant/hourlyweatherdata

Using R, i have created a comprehensive R tool (with Shiny and etc.) to analyse hourly weather data from 2 Airport Stations.

analysis data-visualization r rmarkdown rmarkdown-document weather

Last synced: 04 Apr 2025

https://github.com/salvof88/raspberry-sensor-kit-demo

A lightweight Raspberry Pi sensor logger in Python for HC-SR04 (ultrasound) and DHT11 (temperature/humidity), exporting data to CSV or Google Sheets. Perfect for IoT experiments, smart home logging, or Raspberry Pi Zero DIY kits.

automation csv-logger data-logging data-visualization dht11 google-sheets gpio hc-sr04 hc-sr04-ultrasonic-sensor iot python python-sensors python3 raspberry-pi raspberry-pi-3 raspberry-pi-4 raspberry-pi-gpio raspberry-pi-zero sensor-data

Last synced: 29 Apr 2026

https://github.com/athul64/tmdb-dataset-analysis

This data set contains information about 10,000 movies extracted from TMDB. The dataset contains movies from 1960 to 2015. Including user ratings and revenue. Original data from Kaggle.

data-visualization dataframe eda numpy pandas python

Last synced: 14 Apr 2026

https://github.com/sandravizz/analytical-system-design

Teaching material for bachelor course at Arcada

d3-js data-structures data-visualization system-design

Last synced: 24 Jan 2026

https://github.com/supriya811106/whatsapp-chat-analyzer-app

Analyze WhatsApp chats with Python, Streamlit, and data visualization. Explore messaging patterns, content trends, and emoji usage to uncover insights from your conversations.

analyzer-web-app chat-analytics chat-analyzer data-preprocessing data-visualization emojis machine-learning matplotlib natural-language-processing nltk numpy pandas plotly python3 seaborn sentiment-analysis streamlit-webapp text-analysis user-engagement

Last synced: 30 Dec 2025

https://github.com/keshavg125/whatsapp-chat-analyzer

WhatsApp Chat Analyzer extracts insights from chat data, visualizing activity trends, emoji usage, and sentiment analysis using "ganeshkharad/gk-hinglish-sentiment". Built with Streamlit, Pandas, and Matplotlib for interactive analysis. 🚀

data-visualization emoji-analysis huggingface matplotlib nlp pandas python seaborn streamlit whatsapp-chat-analysis wordcloud

Last synced: 07 May 2026

https://github.com/jbalooshie/surfs_up

Project to perform analysis of weather data in Hawaii, including a Flask app to view the findings.

data-science data-visualization flask python sqlalchemy sqlite

Last synced: 06 May 2026

https://github.com/muthukumar0908/imdb_movie_analysis_with_powerbi

The project aim is to analyze the dataset using Power Bi, The dataset is related to IMDB Movies.

data-analysis data-visualization powerbi

Last synced: 12 Jun 2025

https://github.com/andrewobwocha/titanicsurvival

🚢 End-to-end Python pipeline for Titanic survival classification. Demonstrates EDA, preprocessing, feature engineering, and Logistic Regression evaluation using Scikit-learn.

classification data-preprocessing data-visualization exploratory-data-analysis feature-engineering machine-learning pandas python scikit-learn titanic

Last synced: 13 Jun 2025

https://github.com/tynandebold/daylight

Amount of daylight in select locations around the world.

data-visualization data-viz daylight javascript react time

Last synced: 29 Apr 2026

https://github.com/joekakone/db-analytics-tools

Databases Analytics Tools - Data Integration - Data Visualization - Machine Learning

data-engineering data-integration data-visualization etl machine-learning pipeline sql

Last synced: 01 Aug 2025

https://github.com/fatihilhan42/the-office-eda

Data analysis study of my favorite sitcom, The Office (US).

data-analysis data-science data-visualization fatihilhan office python sitcom

Last synced: 04 May 2026

https://github.com/getconversio/dig-the-data

Data visualizations for the Conversio blog

d3 data data-visualization

Last synced: 12 Apr 2026

https://github.com/jansim/nicknames

Specify human readable names for the columns in your data once and then reuse them across your project to rename plots axes, dataframe columns, tables and anything else.

data-cleaning data-visualization r r-package

Last synced: 04 Sep 2025