An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/rightfulcode/retail-sales-breakdown

Time Series Analysis of Walmart Retail Sales – Internship project analyzing sales trends, seasonal patterns, and revenue breakdowns using Pandas, Matplotlib, and Seaborn.

data-analytics data-visualization elevvo-internship matplotlib pandas python retail-sales seaborn time-series-analysis

Last synced: 08 May 2026

https://github.com/vinit714/player-retention-analysis

A complete Streamlit + Machine Learning + SHAP + NLP project to analyze, predict, and improve player retention in games. This project simulates a game environment, models churn behavior, and provides insights using SHAP, NLP word clouds, and strategy simulators.

churn-prediction classification data-visualization eda feature-engineering game-analytics game-data-analysis gaming-analytics machine-learning model-interpretability nlp pandas player-retention python retention-analysis sckiit-learn shap streamlit wordcloud

Last synced: 08 May 2026

https://github.com/iyashwantsaini/911_capstone

For this capstone project we will be analyzing some 911 call data from Kaggle.

capstone data-science data-visualization python3

Last synced: 10 Jun 2026

https://github.com/koushikphy/covid-19-visualizer

A python plotly-dash app showing different statistics regarding Coronavirus 2019

covid-19 covid19-data covid19-tracker dash data-visualization plotly-dash webapp

Last synced: 08 May 2026

https://github.com/tsbarr/toronto-open-data

Analysis of Toronto's open data initiatives. 🌆 Exploring Toronto's urban systems through data science 📊 Python-based analyses of public datasets 🔍 Focus on community impact and urban patterns 🎓 Academic rigour meets practical insights 🔄 Regularly updated with new analyses

api-integration civic-tech ckan-api data-analysis data-cleaning data-science data-visualization exploratory-data-analysis jupyter-notebook open-data pandas public-data python tableau toronto urban-analytics

Last synced: 09 May 2026

https://github.com/arif-rachmat/powershell-wpf-serialmonplot

A modern, WPF-based Serial Port Monitor and Real-time Data Plotter Powershell script

data-visualization powershell powershell-script serial-communication windows wpf

Last synced: 09 May 2026

https://github.com/cgoliver/nlplotlib

Web-server for natural language based data visualization.

data-visualization flask matplotlib ml nlp reinforcement-learning web-app

Last synced: 09 May 2026

https://github.com/erikad88/belly-button-challenge

This project is an interactive dashboard that visualizes the Belly Button Biodiversity dataset, which catalogs microbes found in human navels.

css d3js dashboard data-visualization html javascript json plotly

Last synced: 09 May 2026

https://github.com/abhroroy365/market_analysis

This project explores customer segmentation and market analysis in the context of online retail using an online retail dataset. By applying advanced analytics, we aim to uncover insights that can drive strategic decisions and enhance business performance.

clustering data data-analysis data-visualization kmeans-clustering machine-learning market-analysis python silhouette-analysis

Last synced: 09 May 2026

https://github.com/sasai-lab/statplay-opensource

統計を視覚的に理解できるツール。Interactive statistics visualizer ‐ learn by doing. Vanilla JS, Canvas 2D, zero dependencies

bayesian bilingual canvas data-visualization educational interactive-visualization javascript oer open-educational-resources probability pwa regression statistics vanilla-js

Last synced: 30 May 2026

https://github.com/master-helix/ibm-data-analyst-certification-stock-analysis-project

This is a mini project repository of my IBM Certification involving stock analysis and plotting of Tesla and GameStop

analytics data data-analysis data-visualization ibm matplotlib pandas python web-scraping

Last synced: 09 May 2026

https://github.com/digitaboss/d3js-heatmap

Global Heatmap presentation with D3js and Reactjs

d3js data-science data-visualization heatmap javascript reactjs

Last synced: 22 Aug 2025

https://github.com/mohamedsaid-sd/3

A repository dedicated to exploring the significance of the number 3 in various cultures, mathematics, and symbolism. Delve into the mystical and mathematical properties of this enigmatic digit through code, analysis, and creative interpretations.

androidstudio clion cpp data-visualization django gemma jetbrains llama lua nodejs opengl trinitycore vue webstorm

Last synced: 10 Apr 2026

https://github.com/smahala02/materials-science-image-analysis

Image analysis for materials science with a focus on particle diameter measurement and image scaling using Python.

data-visualization image-analysis materials-science particle-measurement python

Last synced: 09 May 2026

https://github.com/priyanshul28/exercise_pandas

A Pandas exercise demonstrating the loading, cleaning and reorganization of data in .xlsx or .csv files, descriptive statistics, data visualization, statistical approximation of data with the normal distribution, etc.). Libraries include Pandas, NumPy, Scipy, SymPy, MatplotLib.

data-cleaning data-visualization matplotlib numpy pandas

Last synced: 09 May 2026

https://github.com/shyamkumarnagilla/big-sales-prediction

The "Big Sales Prediction" model is a machine learning project that aims to accurately forecast sales for a given period. The model utilizes the Random Forest Regressor algorithm, a powerful ensemble learning technique, to analyze historical sales data and make predictions. It can be valuable for businesses looking to optimize sales forecasting.

data-analytics data-preprocessing data-science data-visualization machine-learning model-evaluation model-training

Last synced: 09 May 2026

https://github.com/rfonod/narrative-visualization

Explores the relationships between countries' GDP, population, and cumulative Olympic medals. Features a narrative visualization of changes over time, critically examining the modern Olympic Games' original vision.

css d3 d3-visualization d3js data-visualization html javascript visualization

Last synced: 09 May 2026

https://github.com/fbarffmann/python-api-challenge

Accessed and analyzed real-world weather and location data using Python and public APIs. Automated data collection, cleaned API responses, and visualized geographic trends to support business-ready insights.

api automation data-analysis data-visualization google-places-api mapping openweathermap-api pandas python weather-analysis

Last synced: 10 May 2026

https://github.com/chauxvive/fcctreemap

A responsive treemap visualization built with D3.js to display hierarchical data in an interactive format. Created as part of the FreeCodeCamp Data Visualization Certification.

d3 d3js data-visualization dataviz treemap

Last synced: 10 May 2026

https://github.com/monarch1108/customerinsights-kmeans

understanding customers using KMeans and RFM(recency, frequency & monetary) analysis

data-analysis data-visualization kmeans-clustering machine-learning matplotlib numpy pandas scikit-learn

Last synced: 11 May 2026

https://github.com/souravsuvarna/whatsapp-chat-analyzer-and-visualizer-web-application

The WhatsApp chat analyzer and visualizer uses NLP algorithms to analyze chat data, tracking usage patterns and presenting insights through visually appealing charts and graphs. It helps users understand communication patterns and behaviors on WhatsApp.

data-analysis data-science data-visualization python python3 streamlit

Last synced: 18 Apr 2026

https://github.com/parthds02/customer-segmentation-with-kmeans-clustering

Analyze customer behavior using Python and KMeans Clustering on transactional data. Features RFM analysis, data cleaning, clustering insights, and actionable visualizations to support business decision-making.

data-analysis data-visualization feature-engineering kmeans-clustering numpy pandas vscode

Last synced: 11 May 2026

https://github.com/ericgio/r-d3

Low-level components and examples for rendering data with D3 + React

d3 data-visualization react

Last synced: 11 May 2026

https://github.com/sammccord/data-ops-meltano-cube-sample

This is a sample repository demonstrating a data ops pipeline loading DefiLlama data into BigQuery using Meltano and self-hosted Cube for data visualization, with configuration for deployment to Render

cube data-visualization docker meltano render

Last synced: 11 May 2026

https://github.com/deliprofesor/amazon-movie-analysis-and-visualization

"Amazon Movie Analysis and Visualization" is a Python project that analyzes and visualizes movie data from Amazon.com, including ratings, directors, actors, release years, MPAA ratings, and pricing. The project provides insights into movie trends and popular films, helping users explore key patterns through interactive visualizations.

data-analysis data-visualization matplotlib pandas python

Last synced: 12 May 2026

https://github.com/jbalooshie/pyber_analysis

Analysis of ride share data using Matplotlib and pandas, executed in Jupyter Notebook. Breakdowns are provided based on the city size, average fare, and number of rides taken.

data-analysis data-science data-visualization jupyter-notebook matplotlib pandas python

Last synced: 12 May 2026

https://github.com/sahilmate/personal-finance-tracker

A Python application for tracking personal finances. It allows you to log income and expenses, view summaries within date ranges, and visualize financial trends with graphs. Ideal for managing and analyzing your financial data easily.

data-visualization matplotlib-pyplot pandas python3

Last synced: 12 May 2026

https://github.com/magnus0969/gdp-analysis

An in-depth exploration of global GDP trends using Python and data science techniques. This project involves data preprocessing, exploratory data analysis (EDA), statistical insights, and interactive visualizations to understand economic patterns and correlations.

data-science data-visualization gdp-analysis plotly python3

Last synced: 12 May 2026

https://github.com/sakan811/honkai-star-rail-a-few-fun-insights-with-data-analysis

The project gives insights that delve into the Honkai Star Rail's character's stats of all available characters as of the given date.

data data-analysis data-science data-visualization docker flask game honkai honkai-star-rail honkai-starrail seaborn webscraping webscraping-data webscraping-selenium

Last synced: 10 Jun 2026

https://github.com/pascalx-git/randomcharts

Generate charts with random data.

data-visualization design javascript

Last synced: 12 May 2026

https://github.com/alexgenovese/react-charts-covid-19-data

Examples on COVID-19 data using different library charts: G2, G2Plot, Plotly, ApexCharts

data-analysis data-science data-visualization react reactjs

Last synced: 13 May 2026

https://github.com/silkiemoth/eds-240-class-examples

Repository for in-class work assignments and notes in EDS-240 Data Visualization and Communication at UCSB.

classwork data-visualization r ucsb-meds

Last synced: 13 May 2026

https://github.com/deliprofesor/joblocationmapper

JobLocationMapper is a Python tool that visualizes job listings on an interactive map. It uses city and state data to place job markers accurately and color-codes them by occupation (Software, Marketing, Design). The map clusters markers for better organization, and users can click on them to view job details.

clustrered-markers data-analysis data-visualization folium geocoding geographical-visualization interactive-map job-listings map-visualization pandas python

Last synced: 14 May 2026

https://github.com/satvikpraveen/matplotlibmasterpro

📷 MatplotlibMasterPro is a complete, portfolio-ready project to master data visualization using matplotlib. Includes 16 notebooks, real datasets, exportable plots, custom themes, Streamlit dashboard, and Docker support. Ideal for learners and data professionals.

charts custom-plots dashboarding data-analysis data-science data-visualization educational-project interactive-visualizations jupyter-notebook matplotlib notebooks open-source plotting portfolio-project python python-utilities reproducible-research subplots time-series-analysis visualization-tools

Last synced: 14 May 2026

https://github.com/ewels/contributor-graphs

Contributor timelines for any git or GitHub repo: a publication-ready SVG and an interactive HTML page

cli contributors data-visualization git github open-source rust svg timeline visualization

Last synced: 11 Jun 2026

https://github.com/mogalina/graph-rank-dynamics

Computational engine for analyzing how importance propagates through directed graphs.

data-visualization education graph-theory page-rank statistics

Last synced: 12 Jun 2026

https://github.com/danielvartan/estela

🧒🏽🍎 Analysis and Visualization of Food Consumption Data for Brazilian Children Aged 2 to 4, as Monitored by SISVAN in 2019

brazil child-health child-nutrition data-science data-visualization malnutrition nutritional-epidemiology sisvan sus

Last synced: 12 Jun 2026

https://github.com/alvitachen/breathe-retrospective

A dashboard that visualizes AQI and PM job openings across target cities.

aqi beginner charts data-visualization personal-project react vite

Last synced: 10 Apr 2026

https://github.com/ganesh774218/students-health-predictor

An end-to-end machine learning project that applies logistic regression to identify students at risk of depression based on demographic, academic, and lifestyle features. This repository includes data preprocessing, feature engineering, model training, evaluation metrics, and visualizations to provide actionable insights.

data-science data-visualization jupyter-notebook logistic-regression machine-learning machine-learning-algorithms python regression-models

Last synced: 19 May 2026

https://github.com/jatinnxn/diabetes-prediction

this repository showcases a machine learning model built to predict diabetes using Diabetes dataset. The project walks through data preprocessing, model training, and evaluation, offering a Decision Tree-based solution to classify individuals as diabetic or non-diabetic based on various health metrics. It also supports real-time predictions.

data-cleaning data-preprocessing data-visualization decision-tree-classifier machine-learning

Last synced: 13 Jun 2026

https://github.com/rahmamohammad/retail_project

Retail & Data analytics: KPIs, sales trends, Excel planning pack, forecasting & inventory tracking.

data-analysis data-visualization ecommerce excel jupyter-notebook matplotlib python retail-analytics storytelling

Last synced: 17 May 2026

https://github.com/anderson-andre-p/uber-data-analysis

This repository contains a comprehensive data analysis project focused on Uber rides. The dataset used in this project is a spreadsheet obtained from Uber, containing data related to ride details, such as pick-up and drop-off locations, date and time of the ride, and the fare amount.

data-analysis data-science data-visualization python

Last synced: 15 Jun 2026

https://github.com/v-mayya/quantitative-analysis-data-dashboard

Quantitative survey data analysis using R

data data-analysis data-visualization flourish r

Last synced: 01 Apr 2025

https://github.com/nrobledosagredo/covid19-dashboard

Dashboard to visualize and compare COVID-19 cases across different countries over time.

data-visualization

Last synced: 11 Sep 2025

https://github.com/jabonsote/financial-anomaly-detection-with-deepseek-and-isolation-forest

🚀 Financial Anomaly Detection with DeepSeek and Isolation Forest – A powerful, locally-run tool for detecting financial anomalies using Isolation Forest and DeepSeek LLM. Features AI-powered insights, interactive time-series visualization, and automated PDF audit reports. 🔍📊

anomaly-detection chatbot data-visualization deepseek financial-analysis financial-data isolation-forest llm machienlearning ollama report-generator streamlit

Last synced: 12 Apr 2026

https://github.com/vbhatsaccnt/softdrinktrendsanalysis

A Tableau dashboard project providing comprehensive insights into soft drink sales trends, allowing for detailed analysis and informed decision-making within the beverage industry.

dashboard data-visualization food-products marketing tableau trend-analysis

Last synced: 01 Mar 2026

https://github.com/Sparsh7082/Data-Analysis-Portfolio

This repository is dedicated to showcasing my skills, sharing projects, and tracking my progress in Data Analytics and Data Science.

canva data-analytics data-manipulation data-visualization database-querying google-slides power-bi power-point presentation-tools programming-language python r spreadsheets sql tableau

Last synced: 10 Mar 2025

https://github.com/sanjana-bongale/walmart_retail_data_visualization_using_powerbi

Interactive Power BI dashboard analyzing Walmart sales data. Covers sales trends, customer insights, and branch performance using charts, KPIs, and filters for age, gender, year, and category. Includes a presentation for business storytelling and insights.

customer-analysis-for-retail dashboard data-storytelling data-visualization powerbi sales-analysis

Last synced: 04 Feb 2026

https://github.com/albertofaraujo/excel_dashboard_prev_fraudes

O objetivo da análise é extrair informações de performances individuais dos colabores de uma empresa fictícia para tomadas de decisão. (Dashbord em Excel)

analise-de-dados dashboard data-visualization excel

Last synced: 06 Jan 2026

https://github.com/urvee1810/market_basket_analysis

A data mining project analyzing Instacart's 3 million grocery orders to uncover customer shopping patterns and product associations. Using market basket analysis and the Apriori algorithm, the project reveals key insights about shopping behavior, product combinations, and temporal patterns, providing valuable recommendations for retail strategy

apriori-algorithm data-mining data-visualization machine-learning market-basket-analysis matplotlib mlxtend numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/davidchocholaty/bithack_hackathon_2024

This repository contains my personal code tasks for the BIT_Hack hackathon, created in 2024.

data-mining data-science data-visualization exploratory-data-analysis hackaton hackaton-project machine-learning

Last synced: 06 May 2026

https://github.com/datastalker/survival-cox

This repository contains an R script for performing survival analysis on breast cancer surgery data from the University of Chicago's Billings Hospital. The analysis includes Kaplan-Meier estimation and Cox Proportional Hazards modeling to assess patient survival.

breast-cancer-prediction cox-model data-analysis data-science data-visualization epidemiology kaplan-meier r survival-analysis

Last synced: 02 Apr 2025

https://github.com/zulfachafidz/telco_churn_insight_customer_loss_prediction_with_random_forest_and_decision_tree-algorithms

The main problem in the business world is customer churn, or losing customers, especially in the telecommunications industry, which experiences very tight competition. To overcome this problem, an analysis was carried out to help the company understand how many customers have the potential to switch providers.

data data-science data-visualization dataanalysis dataanalyst dataanalytics datadrivenwithdataprovider decision-tree decision-tree-classifier decision-trees random-forest random-forest-classifier

Last synced: 01 May 2026

https://github.com/toodef/light-engine

Lightweight and fast 3D visualisation engine

cpp data-visualization linux python visualization windows

Last synced: 11 Feb 2026

https://github.com/anshajk/covid-vaccinations

A repository to track the rate of covid vaccinations in India

covid-19 data-visualization streamlit

Last synced: 17 May 2026

https://github.com/vikpires/ds_tips-dataset

Projeto individual do bootcamp de ciência de dados avanti 2024.2, com o objetivo de analisar e observar padrões no conjunto de dados "Tips".

data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn tips

Last synced: 17 Sep 2025

https://github.com/acdh-oeaw/visartist

Visual Artwork Analysis and Collection Tool

color-clustering color-space data-visualization visual-analysis

Last synced: 13 Jul 2025

https://github.com/mackenly/suicide-rate-map-explorer

An interactive map that plots U.S. CDC suicide rate and population data on a county level built with React and Python.

cdc data-visualization react suicide-data suicide-prevention

Last synced: 01 Apr 2025

https://github.com/filip-kustura/python-covid-19-behaviors-analysis

Using Jupyter Notebook, this university project analyzes attitudes and behaviors related to the COVID-19 pandemic using a two-year survey from Imperial College London and YouGov research company. Utilizing Pandas, NumPy and Matplotlib, the data analysis focuses on three countries, exploring trends and insights throughout the pandemic.

covid-19 data-analysis data-visualization jupyter-notebook matplotlib numpy pandas python university-project

Last synced: 12 Apr 2026

https://github.com/quocduyenanhnguyen/roi-modeling-and-analysis-of-sports-dataset

In this project, you will find my ROI model for retirement savings and PowerPoint presentation of my ROI model, as well as my data analysis/visualization of Sports Ticket Sales dataset that I concluded with a PDF group written report

data-analysis data-visualization microsoft-excel rate-of-return-modeling sports-ticket-sales-dataset

Last synced: 08 Feb 2026

https://github.com/gustavo-victor/scatter-plot

Scatter Plot Graph in JS and D3

css d3 data-visualization html js scatter-plot

Last synced: 12 Apr 2026

https://github.com/architj6/cancerguardian

CancerGuardian is a machine learning-powered web app that helps predict breast cancer diagnoses based on cytology measurements. 🩺✨ Built with Streamlit, Scikit-Learn, and Plotly, this tool visualizes tumor characteristics and provides predictions using a trained model. 🚀

binary-classification breast-cancer-prediction classification-models data-science data-visualization deep-learning healthcare healthcare-ai machine-learning medical-ai medical-diagnostics predictive-analytics python streamlit supervised-learning

Last synced: 01 May 2026

https://github.com/saro0307/voronoi-diagram-for-classification

Using Voronoi diagram to map random points scattered on a plane subdivides in exactly n cells enclosing a portion of the plane that is closest to each point

artificial-intelligence data-visualization dataanalytics graph machine-learning matplotlib plot plotting pyplot python python3 voronoi voronoi-diagram

Last synced: 08 Jun 2026

https://github.com/tashi-2004/apache-spark-geospatial-air-quality-analysis

This project analyzes air quality data across regions to identify improvement areas, track trends, and classify similar regions using clustering. Leveraging PySpark, it processes sensor data, calculates Air Quality Index (AQI), and visualizes results with histograms and geographic maps to highlight areas with good air quality.

aqi aqi-prediction clustering data-science data-visualization geospatial-visualization kmeans-clustering predictive-modeling sensor-data time-series-analysis

Last synced: 25 Mar 2025

https://github.com/arthurdanjou/studies

💼 This is the repository containing all my projects done during my studies in Python and R.

ai data data-science data-visualization jupyter jupyter-notebook ml python r

Last synced: 08 Apr 2025

https://github.com/armahdavi/data_pipeline_analytics_statistics_ml_pm_psd_residential_qff

Sharing all the data pipelines and processing codes, statistical modellings, descriptive statistics, plot visualizations, and machine learning from Mahdavi & Siegel (2021) (Indoor Air) Project Miestone: 2017 - 2020 Full-length article: https://onlinelibrary.wiley.com/doi/abs/10.1111/ina.12782

data-science data-visualization dust hvac indoor-air-quality jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats spyder spyder-python-ide statistics

Last synced: 11 Apr 2026

https://github.com/abdelrahmanbayoumi/titanic-machine-learning-from-disasters

Knowing from a training set of samples listing passengers who survived or did not survive the Titanic disaster, can our model determine based on a given test dataset not containing the survival information, if these passengers in the test dataset survived or not.

data-analysis data-science data-visualization machine-learning pandas

Last synced: 09 Apr 2025

https://github.com/hossamAhmedSalah/Computer-Vision-

contains my training projects in this field

computer-vision data-visualization detection edge opencv

Last synced: 10 Mar 2025

https://github.com/fahadnasir13/financial-data-parser-application

“Financial Data Parser – Advanced Next.js & TypeScript app for parsing and visualizing complex financial datasets with confidence scoring and exportability.”

data-parsing data-visualization financial-parser framer-motion framer-motion-3d nextjsdata-parsing react-testing-library shadcn-ui

Last synced: 17 Jun 2026

https://github.com/mindlessmuse666/train-test-splitter

Анализ данных о пассажирах Титаника и разбиение на обучающую и тестовую выборки. Практическое задание по дисциплине "Основы применения методов искусственного интеллекта в программировании".

data-analysis data-preprocessing data-visualization machine-learning pandas python scikit-learn seaborn titanic train-test-split

Last synced: 12 Apr 2026

https://github.com/mumtaz4118/nlp-course

Programming Assignments and Lectures for Stanford's CS 224: Natural Language Processing with Deep Learning

course data data-analysis data-analytics data-science data-visualization deep-learning education machine-learning natural-language-processing neural-network transfer-learning

Last synced: 24 Nov 2025

https://github.com/gaurav0502/router-traffic-analysis

Exploratory Analysis of the different kinds of traffics being experienced by a router.

data-analytics data-visualization network-analysis python

Last synced: 06 Apr 2025

https://github.com/mattsebastianh/Making-a-Visual-Argument

Data Visualization with Matplotlib | Making a Visual Argument in Matplotlib

data-visualization matplotlib python

Last synced: 18 Jun 2026

https://github.com/katrinleinweber/leaving-the-bar

A less-code variant of Joachim Goedhart's "Leaving the bar in five steps"

barchart boxplot boxplots data-visualisation data-visualization ggplot

Last synced: 20 Aug 2025

https://github.com/shrutiijoshi/e-commerce

The dataset contains various attributes related to orders, customers, and products, providing a comprehensive view of the sales process.

analysis data-visualization tableau-public visualization

Last synced: 07 Jan 2026

https://github.com/dianaow/fifa22-svelte

Svelte/D3.js: Visualizing club attributes in FIFA games from 2015 to 2022

d3-visualization d3js data-visualization fifa sports-analytics sports-visualisation svelte

Last synced: 29 May 2026

https://github.com/saba-gul/google_data_analystics_belabeat_fitness_capstone_project

This project focuses on leveraging Fitbit user data to derive valuable insights and facilitate data-driven decision-making for Bellabeat, a leading wellness company. The objective is to harness the wealth of information captured by Fitbit devices to enhance the wellness offerings provided by Bellabeat.

bellabeat-case-study bellabeat-eda data-analytics data-visualization fitbit google-casestudy

Last synced: 08 Jun 2026