An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/codeslash21/tmdb_data_analysis

An analysis of the TMDB dataset, which contains details of around 11,000 movies, to find interesting facts about the data.

data-analysis data-visualization matplotlib nanodegree-project numpy pandas python tmdb-movie

Last synced: 03 May 2026

https://github.com/leandrocollares/foreign-born-population-in-canada

Responsive bar chart that shows the percentage of foreign-born population in Canada between 1871 and 2011

d3 data-visualization svelte

Last synced: 03 May 2026

https://github.com/imgios/dependensee

📊 Minimal relaxed functional dependencies set visualization webapp.

angular d3js data-visualization ionic rfd typescript visual-analytics webapp

Last synced: 03 May 2026

https://github.com/ljadhav25/swiggy-restaurant-analysis

This repository contains data and analysis related to restaurants listed on Swiggy, one of India's largest online food ordering and delivery platforms. The objective is to explore restaurant trends, customer reviews, pricing strategies, and delivery metrics to gain insights into the food delivery industry.

data-analysis data-visualization matplotlib-pyplot numpy-library pandas-library python seaborn-plots

Last synced: 03 May 2026

https://github.com/joaopalmeiro/b2-presentation

Materials for the presentation of the paper: "B2: Bridging Code and Interactive Visualization in Computational Notebooks" (Yifan Wu, Joseph M. Hellerstein, Arvind Satyanarayan — UIST 2020).

data-visualization interactive-visualizations jupyter jupyter-notebook jupyter-notebook-extension python r vega-lite xaringan

Last synced: 03 May 2026

https://github.com/pragmaai/yelp-datapipeline

🍽️ Yelp Data Pipeline & Analytics Dashboard. End-to-end data engineering pipeline processing the Yelp dataset with Rust transforms, Apache Airflow orchestration, and interactive Streamlit analytics. Features business insights, user engagement analysis, and city performance comparisons. 🚀 Docker-ready • 📊 Interactive Dashboard • ⚡ High-performance R

airflow data-engineering data-pipeline data-visualization datafusion docker rust streamlit yelp yelp-dataset

Last synced: 04 May 2026

https://github.com/rumana-amin/sql-data-warehouse-and-analytics-project

Building a data warehouse with SQL Server, including ETL processes, data modeling, and analytics.

data-cleaning data-visualization data-warehouse etl etl-pipeline sql-queries sql-server

Last synced: 09 Jun 2026

https://github.com/gerhynes/d3-pie-chart

A D3 pie chart showing UN birth data grouped by continent. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 04 May 2026

https://github.com/leosimoes/udacity-communicate-data-findings-titanic

Project 5 of the Udacity Data Analyst Nanodegree Program. Data analysis with views of the Titanic dataset.

data-science data-visualization python

Last synced: 04 May 2026

https://github.com/xiaohan2012/myunisport

Visualize your Unisport annual training records

data-analysis data-visualization pandas pygal sports-stats tikzposter

Last synced: 04 May 2026

https://github.com/furkalb/global-power-plant-analysis

Global Power Plant Database Analysis is a Streamlit-based interactive web app designed to explore and analyze worldwide energy generation data

analysis data-science data-visualization energy-analysis exploratory-data-analysis gis global-energy machine-learning mapbox plotly predictive-modeling python streamlit streamlit-dashboard

Last synced: 04 May 2026

https://github.com/mr-chang95/sf_data_visualization

In this personal project, I am interested in examining all of the active businesses in the San Francisco Bay Area while performing some simple data visualizations, mainly on categorical variables.

business data-analysis data-visualization jupyter-notebook pandas python san-francisco

Last synced: 04 May 2026

https://github.com/halyusa16/e-commerce-analysis

This project analyzes a public e-commerce dataset to uncover valuable insights and answer critical business questions. The dataset contains customer, product, order, and transaction details, providing a comprehensive view of the e-commerce platform's operations.

data-analysis data-cleaning data-exploration data-visualization self-project

Last synced: 09 Jun 2026

https://github.com/abhinav330/911-emergency-calls-analysis

This Python Notebook analyzes emergency call data from the '911.csv' dataset. It uses various data visualization techniques to explore and gain insights into the emergency call data, including the types of calls, reasons for calls, and call patterns over time.

data-analysis data-science data-visualization eda exploratory-data-analysis exploratory-data-visualizations numpy pandas python

Last synced: 09 Jun 2026

https://github.com/balajimohan18/sales-forecasting-datascience-project

Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.

data-analytics data-science data-testing data-visualization forecasting forecasting-models machine-learning model-evaluation predictive-modeling python regression-algorithms salesforecast scipy sklearn-library supervised-learning

Last synced: 04 May 2026

https://github.com/bishopce16/pyber_analysis

The purpose of this project was to complete an exploratory analysis and create visualizations of the 2019 ride sharing data from PyBer.

data-analysis data-visualization jupyter-notebook matplotlib pandas python

Last synced: 04 May 2026

https://github.com/drod75/nyc-arrests-analysis

This is a simple Data Science Project made to analyze and display data and trends found within the NYC Arrests Year to Date Dataset.

data-analysis data-visualization folium jupyter-notebook matplotlib-pyplot nyc-opendata nypd python scikit-learn seaborn

Last synced: 04 May 2026

https://github.com/thuongtruong1009/charts

📊 Visualize data with common charts

chart chartjs csv d3 data-visualization nextjs react reactjs typescript

Last synced: 05 May 2026

https://github.com/dhruvsrikanth/basic-data-science

A short Data Science Project I took up for fun! This is a data analysis based on a dataset I created to predict the distribution of wealth within an economy as well as several characteristics of each class within society!

analysis data-analysis data-pipeline data-science data-visualization machine-learning matplotlib pandas python seaborn sklearn

Last synced: 05 May 2026

https://github.com/stephenombuya/automation_scripts

A collection of Python scripts and tools designed to automate various tasks, improve productivity, and simplify repetitive actions. Each script is well-documented and serves a specific purpose, ranging from data visualization to smart home control.

automation-with-python data-visualization productivity python3 smart-home-automation webautomation

Last synced: 13 Jun 2026

https://github.com/benedart/interactive-information-visualization

Interactive Information Visualization Project

d3 data-visualization information-visualization

Last synced: 05 May 2026

https://github.com/badranalyst/residential-unit-prices-data-analysis-application

Python-based analysis of residential unit prices, focusing on data cleaning, visualization, and exploratory data analysis (EDA). Key features include price distribution and correlation analysis between factors like size, location, and pricing.

data-analysis data-visualization dataset matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/sundarmd/digital_twin_for_li-ion_batteries

Digital Twin for Li-ion batteries on AWS built using S3, EC2, SageMaker, Redshift, Terraform, QuickSight

aws-ec2 aws-s3 data-visualization iot python3 pytorch sql terraform

Last synced: 05 May 2026

https://github.com/kevinwood15/python_ml_neuralnetwork_project

I build a neural network to evaluate the CIFAR-10 dataset and achieve above 50% accuracy

cifar10 data-science data-visualization deep-learning neural-network python pytorch

Last synced: 05 May 2026

https://github.com/hms75/movie_rating_analysis

A movie rating analysis which identifies trends amongst a dataset of 5000 movies.

data-analysis data-visualization matplotlib-pyplot numpy pandas python

Last synced: 05 May 2026

https://github.com/lkasym/smart-dynamic-pricing

An AI-powered dynamic pricing system using Dueling DQN and customer behavior simulation, with a full-stack React + Flask dashboard for real-time insights and performance benchmarking.

ai-project data-visualization deep-learning dqn-tensorflow ecommerce full-stack-ai machine-learning reinforcement-learning tensorflow

Last synced: 05 May 2026

https://github.com/priyapuranik/diwali-sales-analysis_with-data-visualization

Diwali sales data analysis with visualizations, highlighting customer demographics, product categories, and purchase trends.

data-visualization matplotlib-pyplot pandas python seaborn

Last synced: 06 May 2026

https://github.com/rehanvhora778/bibtex-extraction

📄 Extract BibTeX entries from PDFs automatically, generating a complete bibliography without manual input or reliance on external APIs.

academic-writing analysis automation bibliometric-analysis bibliometrics bibtex data-visualization langchain latex metadata-extraction pdf pyhton pypdf reference-management research-tools

Last synced: 06 May 2026

https://github.com/deaneeth/aqi-monitoring-system

Real-time air quality monitoring system with interactive maps, role-based access control, and automated AQI data simulation. Built with Laravel, MySQL, and Leaflet.js.

air-quality alerts-system data-visualization fullstack geospatial interactive-maps laravel leaflet mysql php phpunit simulation web-application

Last synced: 06 May 2026

https://github.com/sankaran-s2001/us-traffic-accidents-analysis-python-eda

Exploratory data analysis of US traffic accidents from 2016-2023, analyzing patterns by time, location, weather, and severity using Python data science libraries.

data-analysis data-science data-visualization eda matplolib numpy pandas python

Last synced: 06 May 2026

https://github.com/abhinav330/customer-behavior-analysis-linear-regression

This repository explores customer behavior data for an NYC clothing company with both a mobile app and website. They want to understand which platform drives higher sales.

data-analysis data-science data-visualization eda exploratory-data-analysis jupyter jupyter-notebook linear-regression machine-learning machine-learning-algorithms machinelearning-python numpy pandas python regression-analysis

Last synced: 06 May 2026

https://github.com/swapnil-jain/whatsapp-chat-statistics

Web application showing common statistics, graphs, and various charts about an uploaded WhatsApp chat.

data-analytics data-science data-visualization heroku-deployment streamlit whatsapp

Last synced: 06 May 2026

https://github.com/arsh-jafri/econostats

A real-time economic data visualization platform that helps track and analyze key economic indicators through interactive charts, custom datasets, and FRED API integration.

aws data-visualization economic-data economics elasticbeanstalk federal-reserve flask fred-api

Last synced: 06 May 2026

https://github.com/bartosz-ziolkowski/social_data

Data analysis of San Francisco's crime data set from 2003 to 2018

data-science data-visualization jupyter-notebook numpy pandas python

Last synced: 06 May 2026

https://github.com/leosimoes/alura-streamlit-dashboard

Demonstration project for the Alura course "Streamlit: construindo um dashboard interativo" (building an interactive dashboard). A web application with two pages for displaying the data.

dashboards data-visualization python streamlit

Last synced: 07 May 2026

https://github.com/parnika-singh/oncovision

An intelligent machine learning model for classifying breast cancer cells as benign or malignant using the UCI Breast Cancer Wisconsin dataset.

breast-cancer-prediction cancer-detection classification data-visualization decision-tree healthcare knn logistic-regression machine-learning medical-ai-project python3 sklearn svm-model xgboost

Last synced: 07 May 2026

https://github.com/sitek94/react-interactive-scatter-plot

An interactive scatter plot made with React and D3

d3 data-visualization react

Last synced: 07 May 2026

https://github.com/jgohel9902/property-analytics-u.s.-owned-and-leased-properties

This project focuses on analyzing the U.S. Inventory of Owned and Leased Properties using datasets from Data.gov. It includes SQL queries for data cleaning and trend analysis, Excel for manipulation and reporting, Python for automated workflows and exploratory data analysis, and Power BI for creating interactive dashboards to visualize key insights

data-visualization dataanalysis excel jupyter-notebook pandas powerbi python sql

Last synced: 07 May 2026

https://github.com/bryanhe24/data_analysis_app

A full-stack web application that allows users to upload CSV datasets, analyze the data with statistical summaries and visualizations, and interact with an AI-powered assistant for querying the dataset.

ai data data-analysis data-visualization fullstack-development javascript math python reactjs

Last synced: 07 May 2026

https://github.com/sumit-sinha9/t20wc_2022-best-11

Selecting the best 11 players from the 2022 T20 World Cup using Python web scraping, pandas, and Power BI.

data-visualization pandas powerbi python webscraping

Last synced: 07 May 2026

https://github.com/tjas/postgrad-ai-ddv-plotly

Jupyter Notebook that analyzes the salaries of Federal District government public servants, using Python, pandas, and Plotly Express, to solve the exercise proposed in the "Data Discovery and Visualization" course.

analysis analytics data data-analytics data-discovery data-science data-visualization graph graphs jupyter-notebook jupyter-notebooks pandas plotly plotly-express python

Last synced: 07 May 2026

https://github.com/athenyx04/arion

Smart animal weighing module for Demeter

data-visualization firebase livestock nextjs

Last synced: 07 May 2026

https://github.com/satyam4229/identify-employee-attrition

A model that predicts employee attrition at a company from employee records. The dataset includes features such as salary, environment, age, gender, and experience.

data-analysis data-science data-visualization jupyter-notebook kaggle python

Last synced: 08 May 2026

https://github.com/alz-11/ftoc

Fahrenheit to Celsius conversion analysis, comparing accurate and approximate methods with visualizations and error analysis.

approximation celsius conversion data-visualization error-analysis fahrenheit fahrenheit-to-celsius jupyter jupyter-notebook matplotlib numpy python python-3 python3

Last synced: 08 May 2026

https://github.com/vanshuchaudhary/flightpriceanalysis-

The uploaded file is a Jupyter Notebook titled "Flight Analysis". It likely involves analyzing flight-related data, potentially exploring trends, patterns, or insights using data science techniques. The analysis might include data visualization, statistical analysis, or predictive modeling.

business-analytics data data-analysis data-visualization datainsights datascience matplotlib-pyplot python seaborn seaborn-plots seaborn-python sns statistical-analysis

Last synced: 08 May 2026

https://github.com/vinit714/player-retention-analysis

A complete Streamlit + Machine Learning + SHAP + NLP project to analyze, predict, and improve player retention in games. This project simulates a game environment, models churn behavior, and provides insights using SHAP, NLP word clouds, and strategy simulators.

churn-prediction classification data-visualization eda feature-engineering game-analytics game-data-analysis gaming-analytics machine-learning model-interpretability nlp pandas player-retention python retention-analysis sckiit-learn shap streamlit wordcloud

Last synced: 08 May 2026

https://github.com/satvikpraveen/numpymasterpro

A hands-on, production-ready toolkit to master NumPy — from first principles to real-world applications. Includes modular Jupyter notebooks, reusable utility scripts, cheatsheets, and advanced projects like K-Means clustering from scratch.

broadcasting data-analysis data-science data-source data-visualization jupyter-notebook kmeans-clustering linear-algebra machine-learning matrix-algebra numerical-computation numpy numpy-broadcasting numpy-examples numpy-tutorial open-source python scientific-computing standardization vectorization

Last synced: 08 May 2026

https://github.com/koushikphy/covid-19-visualizer

A python plotly-dash app showing different statistics regarding Coronavirus 2019

covid-19 covid19-data covid19-tracker dash data-visualization plotly-dash webapp

Last synced: 08 May 2026

https://github.com/tsbarr/toronto-open-data

Analysis of Toronto's open data initiatives. 🌆 Exploring Toronto's urban systems through data science 📊 Python-based analyses of public datasets 🔍 Focus on community impact and urban patterns 🎓 Academic rigour meets practical insights 🔄 Regularly updated with new analyses

api-integration civic-tech ckan-api data-analysis data-cleaning data-science data-visualization exploratory-data-analysis jupyter-notebook open-data pandas public-data python tableau toronto urban-analytics

Last synced: 09 May 2026

https://github.com/erikad88/belly-button-challenge

This project is an interactive dashboard that visualizes the Belly Button Biodiversity dataset, which catalogs microbes found in human navels.

css d3js dashboard data-visualization html javascript json plotly

Last synced: 09 May 2026

https://github.com/abhroroy365/market_analysis

This project explores customer segmentation and market analysis in the context of online retail using an online retail dataset. By applying advanced analytics, we aim to uncover insights that can drive strategic decisions and enhance business performance.

clustering data data-analysis data-visualization kmeans-clustering machine-learning market-analysis python silhouette-analysis

Last synced: 09 May 2026

https://github.com/zxjahid/matplotlib

A comprehensive guide to mastering data visualization with Matplotlib through hands-on examples and advanced techniques. 🚀📊

candlestick candlestick-chart cheatsheet data-analysis data-visualization gtk jupyter-notebook maps matplotlib-python pandas thesis-template tk tutorial wx

Last synced: 09 May 2026

https://github.com/prishabhanot/skin_cancer_classification_model

Classifies 7 types of skin cancer lesions using a deep learning CNN model. Processes and balances the dataset, trains the model, and evaluates its accuracy with visualizations.

cnn confusion-matrix data-visualization keras machine-learning medical-imaging python tensorflow

Last synced: 09 May 2026

https://github.com/jingvu/global-water-title-grid-r-project

This is a visualization from my assignment, where I used a world tile grid map to address issues in the original design, making it more informative and effective.

data-visualization r water-quality world-title-grid-map

Last synced: 02 Apr 2025

https://github.com/piras-s/braincancerclassifier

Classifying brain tumors using Gaussian Naive Bayes with MRI-derived features. Includes feature selection, model evaluation, prediction uncertainty, and probability calibration.

baysian-inference calibrated-classification classification data-visualization feature-selection machine-learning medical-imaging naive-bayes-classifier python scikit-learn uncertainty-estimation

Last synced: 09 May 2026

https://github.com/youssef-saaed/zc-dashboard

The ZC Dashboard is a comprehensive data visualization tool designed to provide insights into the academic landscape of Zewail City.

amcharts dashboard data-visualization flask sqlite3

Last synced: 09 May 2026

https://github.com/rfonod/narrative-visualization

Explores the relationships between countries' GDP, population, and cumulative Olympic medals. Features a narrative visualization of changes over time, critically examining the modern Olympic Games' original vision.

css d3 d3-visualization d3js data-visualization html javascript visualization

Last synced: 09 May 2026

https://github.com/shrutiijoshi/corporate-campus-hiring-analysis

This project analyzes corporate campus hiring trends for fresh graduates in India.

dashboard data-analysis data-visualization excel powerbi

Last synced: 09 Mar 2026

https://github.com/justfifi17/nyc-real-estate-sales-prediction

Analyzes real estate sales data in NYC by performing exploratory data analysis and building models to predict sale prices based on various features.

data-visualization exploratory-data-analysis gradient-boosting linear-regression random-forest xgboost-regression

Last synced: 05 Aug 2025

https://github.com/luarmen/covid-19-testing-forecast-iceland-2020-2022-

Time series forecasting of daily COVID-19 testing in Iceland using R. Models compared include ETS, SARIMA, and Auto ARIMA with cross-country validation on UAE data.

arima covid-19 data-visualization ets forecasting pandemic-data r sarima statistical-modeling statistical-modeling-techniques time-series

Last synced: 07 Aug 2025

https://github.com/gmasson/datadash

DataDash is a JavaScript and CSS library for creating interactive dashboards that visualize dynamic data on web pages.

dashboard dashboard-application dashboards data-analysis data-science data-visualization javascript

Last synced: 08 Aug 2025

https://github.com/albertofaraujo/pbi_dashboard_anatel

Analyzes the performance of telecom operators in handling complaints registered by telecommunications users in the Anatel Consumidor system.

data-visualization dax-studio power-query powerbi

Last synced: 19 Feb 2026

https://github.com/syncfusionexamples/how-to-collapse-the-visibility-of-specific-data-label-in-.net-maui-cartesian-chart

This article in the Syncfusion Knowledge Base explains how to collapse the visibility of a specific data label in a .NET MAUI Cartesian chart.

charting-library charts column-chart data-label data-label-customization data-label-visibility data-visualization hide-data-label maui-charts

Last synced: 03 Oct 2025

https://github.com/oncs21/expenai

An open source modern and simple expense tracking app with numerous integrations

bootstrap css data-visualization database-management ejs express mindee mongodb mongoose node-js server-security

Last synced: 09 Apr 2026

https://github.com/jawwad-fida/stroke-prediction

Predicts whether a patient is likely to have a stroke using machine learning classification algorithms, with a performance comparison of the algorithms.

classification data-science data-visualization machine-learning python stroke-prediction

Last synced: 19 Feb 2026

https://github.com/sayamalt/flight-price-prediction

Successfully established a machine learning model to accurately predict the price of a flight in India based on several features such as duration, days left, arrival time, departure time and so on.

data-visualization exploratory-data-analysis feature-engineering hyperparameter-tuning machine-learning model-deployment model-training-and-evaluation regression-models

Last synced: 09 Nov 2025

https://github.com/dcostachar/telco-customer-churn-dashboard

An interactive Tableau dashboard using the Telco Customer Churn dataset to analyze key drivers of customer churn and develop data-driven retention strategies for the telecommunications industry.

business-intelligence customer-churn-analysis data-analysis data-visualization marketing-analytics tableau

Last synced: 09 Mar 2026

https://github.com/vincent-tran-94/Dataviz_Tweets_ChatGPT

A Streamlit application for analyzing and visualizing data and tweets about the release of ChatGPT. The project covers data management, sentiment analysis, emerging trends, and potential applications of ChatGPT.

data-management data-visualization sentiment-analysis streamlit text-mining twitter

Last synced: 10 Aug 2025

https://github.com/sayamalt/quora-duplicate-question-pairs-identification

Successfully developed a machine learning model that can accurately detect whether any given pair of Quora questions are duplicates.

data-visualization machine-learning natural-language-processing nltk paraphrase-detection text-preprocessing

Last synced: 09 Nov 2025

https://github.com/hari00887/analysis-of-global-terrorism

Analysis of Global Terrorism Using AHP. A quantitative study of GTD data to assess attack severity and evolution across time and space.

data-analysis data-visualization powerbi

Last synced: 02 Mar 2026

https://github.com/hemangsharma/hotel-revenue-booking-analysis

This project provides a comprehensive revenue and reservation analysis for Highfield Hotel using historical data exported from booking systems and internal revenue reports. The goal is to derive actionable insights to improve room profitability, understand booking patterns, and support data-driven decision-making.

analysis data-analysis data-visualization hotel

Last synced: 10 Aug 2025

https://github.com/nafisrayan/decentai

A comprehensive platform built using ReactJS and Flask, combining blockchain technology with AI to create a secure and intelligent space for community engagement and policy discussions. Leverages NLP and LLM for meaningful interactions and sentiment analysis while ensuring data security and user privacy.

chatbot data-analysis data-visualization flask gemini gemini-ai gemini-ai-chatbot gemini-api government government-tech llm mongodb nlp polls python react tailwind voting-systems winknlp

Last synced: 12 Apr 2026

https://github.com/cecoeco/networks-r-project

Visualizing static networks with R (Coursera)

data-visualization igraph network-analysis r

Last synced: 04 Aug 2025

https://github.com/1ayanabil1/100-days-of-python-bootcamp

Join me on my journey to code in Python every day for 100 days! 🐍 This challenge is designed to sharpen my programming skills, explore Python libraries, and build cool projects along the way.

data-structures data-structures-and-algorithms data-visualization django flask machine-learning matplotlib numpy pandas python seaborn web-development

Last synced: 09 Apr 2026

https://github.com/0xhericles/ufcg-geojson

GeoJSON file containing the blocks and buildings of the Federal University of Campina Grande.

data data-visualization geojson map open-source ufcg university

Last synced: 09 Feb 2026

https://github.com/tashi-2004/apache-flink-spark-data-streaming

This project showcases a real-time data streaming pipeline using Apache Flink, Apache Spark, and Grafana. It streams data, stores it in Parquet format, and performs aggregations for insights, with seamless visualization via Grafana dashboards.

apache-flink apache-spark data-aggregation data-analysis data-science data-streaming data-visualization flink flink-stream-processing flink-streaming grafana-dashboard grafana-plugin pyflink python3

Last synced: 09 Feb 2026

https://github.com/ashish-kr-srivastava/olympic-games-eda---python

Exploratory data analysis of a historical Olympic Games dataset, covering all the Games from Athens 1896 to Rio 2016.

data-visualization datacleaning eda matpotlib numpy pandas python seaborn seaborn-python

Last synced: 09 Apr 2026

https://github.com/r12habh/canada-imigration-data-analysis

Dataset: Immigration to Canada from 1980 to 2013, taken from "International migration flows to and from selected countries: The 2015 revision" on the United Nations website. (Cognitive Class Data Analysis with Python)

canada data-analysis data-science data-visualization datascience python python3

Last synced: 23 May 2026