An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/zoliqua/venn-diagram-lab

🟢 🟤 Interactive Venn diagram viewer & editor — 44 SVG models (2–9 sets), pre-computed region paths, React + TypeScript + Vite

adelaide bioinformatics data-visualization edwards-venn euler hamilton interactive manawatu massey palmerton-north python-package react set-theory svg typescript upsetplot venn venn-diagram venndiagram victoria

Last synced: 05 May 2026

https://github.com/baggiponte/ta-statistics-for-big-data-2022

🎓 Introduction to Python and Machine Learning [UniMi • AY 2021/2022]

clustering data-science data-visualization machine-learning python scikit-learn

Last synced: 03 May 2026

https://github.com/rohithay/customer-segmentation

Cleans real online transaction data and applies the RFM technique to rank and group customers into clusters, identifying the best customers for targeted marketing campaigns.

data-cleaning data-science data-visualization datetime marketing-analytics pandas python3 user-segmentation

Last synced: 03 May 2026

https://github.com/zeynepcol/data-analysis-visualization

Data visualization and interactive analytics - Olympics Dataset

data-analysis data-science data-visualization matplotlib pandas plotly python scipy seaborn streamlit

Last synced: 03 May 2026

https://github.com/nicholas-miklaucic/rho_plus

The Python data viz nitro canister you didn't know you needed

aesthetics bokeh colormap data-visualization matplotlib plotly python

Last synced: 05 May 2026

https://github.com/shuddha2021/stellar-candidate-selector

A sophisticated candidate selection algorithm leveraging multi-criteria analysis and machine learning to identify top software engineering candidates. This tool features flexible filtering, score adjustment, and detailed visualizations to streamline the recruitment process.

candidate-selection data-analysis data-visualization machine-learning pandas plotting-in-python python python-data-analysis recruitment scikit-learn

Last synced: 05 May 2026

https://github.com/md-emon-hasan/ml-projects-telcom-customer-churn-prediction

📱 Predicts which customers are likely to leave a telecom service, enabling companies to take retention measures and build accurate churn prediction models.

boostrap5 customer-churn customer-segmentation data-engineering data-science data-visualization logestic-regression machine-learning telco-customer-churn-prediction telcom-churn

Last synced: 05 May 2026

https://github.com/kirkalyn13/opensignal_autogenerate_report

Script used to generate results and a summary, including the trends of flagged provinces, from the raw Excel data file.

data-analysis data-science data-visualization matplotlib numpy pandas python

Last synced: 06 May 2026

https://github.com/namratha2301/best-selling-books

Comprehensive examination of best-selling books, focusing on understanding sales patterns, genre distributions, and the impact of various features on book performance. This project aims to predict book sales and classify genres, providing valuable insights for authors, publishers, and readers.

data-analysis data-visualization matplotlib pandas sckiit-learn seaborn

Last synced: 06 May 2026

https://github.com/mrankitgupta/titanic-survival-prediction-93-xgboost

Titanic Survival Prediction Project (93% Accuracy) 🛳️ In this notebook, the goal is to correctly predict whether someone survived the Titanic shipwreck using different machine learning models and hyperparameter tuning.

classification data-analysis data-science data-visualization gradient-boosting kaggle-competition linear-regression logistic-regression machine-learning machine-learning-algorithms ml ml-models nlp prediction predictive-modeling random-forest titanic titanic-kaggle titanic-survival-prediction xgboost

Last synced: 06 May 2026

https://github.com/basedrhys/global-stock-vis

Visualisation of financial stock data using CesiumJS

cesiumjs data-visualization nodejs python

Last synced: 06 May 2026

https://github.com/scarblase/portfolioprojects

A collection of data analysis and business intelligence projects using SQL, Python, and visualization tools to uncover insights from real-world datasets. 🚀📊

csv data-analysis data-engineering data-mining data-science data-visualization matplotlib matplotlib-pyplot pandas python python3 seaborn sql

Last synced: 06 May 2026

https://github.com/himanchalchandra/science-canvas

Repo containing projects I did during a four-month bootcamp on Data Science and Machine Learning organized by Science Canvas India.

data-mining data-science data-visualization machine-learning-algorithms mysql nlp-machine-learning

Last synced: 06 May 2026

https://github.com/vietdoo/real-estate-marketplace

A web app that visualizes homes for sale on a cluster map. Data is continuously fetched from MongoDB, with filtering functions and APIs for finding homes.

api bigdata data-visualization flask maps mongodb reactjs webscraping

Last synced: 07 May 2026

https://github.com/1ayanabil1/iris-visualization

This repository focuses on visualizing the Iris dataset using various data visualization techniques. It includes histograms, scatter plots, box plots, pie charts, bubble charts, and KDE plots to provide insights into the dataset’s structure. The project utilizes Matplotlib, Seaborn, Plotly, and Scikit-learn to generate insightful visualizations.

analytics clustering data-analysis data-science data-visualization datavisualization-project datavisualizations eda exploratory-data-analysis machine-learning machinelearning-python python

Last synced: 07 May 2026

https://github.com/pm25/youbike-station-finder

🚲 Display and visualize real-time information for Taipei YouBikes.

css3 d3js data-visualization html5 javascript visualization website youbike

Last synced: 08 May 2026

https://github.com/manhtdxxx/batch-and-stream-pipeline-via-lakehouse

This project demonstrates a modern Lakehouse architecture supporting both streaming and batch data pipelines, built on Apache Iceberg tables.

airflow batch-processing data-engineering data-visualization docker elt-pipeline hive-metastore iceberg kafka lakehouse medallion-architecture spark stream-processing superset trino

Last synced: 08 May 2026

https://github.com/shridhar1504/loan-clustering-datascience-project

This project uses machine learning to cluster loans based on their similarities. It uses a dataset of loan applications that includes the loan amount and balance, then applies a clustering algorithm to group similar loans together.

clustering-algorithm data-analysis data-science data-visualization datanalysis eda kmeans-clustering machine-learning python sql sql-server unsupervised-learning

Last synced: 08 May 2026

https://github.com/jethronap/jstat-gui

Web-based GUI application for data analysis

data-analysis data-visualization java jstat mongodb

Last synced: 08 May 2026

https://github.com/md-emon-hasan/4-eda-football-ml-app

An ML application focused on exploratory data analysis and football analytics, featuring data visualization and insights using Python and relevant libraries.

data-science-projects data-visualization eda exploratory-data-analysis football-analytics sports-analytics webapp

Last synced: 08 May 2026

https://github.com/avijit-jana/redbus-data-scraper-dashboard

A Streamlit-based application leveraging Selenium to automate data scraping from Redbus, enabling efficient collection, analysis, and visualization of bus travel data for improved operational efficiency and strategic planning in the transportation industry.

automation dashboard data-analysis data-visualisation data-visualization datadrivendecisions filtering python3 redbus selenium selenium-python streamlit streamlit-application travel web-scraping webscrapping

Last synced: 09 May 2026

https://github.com/chauxvive/uschildpoverty

An interactive choropleth map visualizing U.S. state-level child poverty data using D3.js. Compare child poverty rates over time with data from KIDS COUNT and the US Census Bureau.

choropleth-map d3 d3js data-visualization dataviz

Last synced: 09 May 2026

https://github.com/douglasvolcato/brazilian-stock-market-analysis

Brazilian stock analysis focused on dividend yield, diversification, and minimum price variation

data-science data-visualization pandas python scraping scraping-websites

Last synced: 09 May 2026

https://github.com/mariam-badr-mb/gtc-ml-project2-diabetes-prediction

This project is part of the GTC Machine Learning Program. It demonstrates the end-to-end ML workflow by building a predictive model for diabetes detection

classification-algorithm data-analysis data-visualization diabetes-prediction gridsearchcv hyperparameter-tuning machine-learning python

Last synced: 09 May 2026

https://github.com/vrostbyte/budget-app

Web app to manage personal finances: track expenses, income, bills, and visualize budgets with charts.

bills-management budget css data-visualization expense- finance html income-tracker javascript json personal-finance web-app

Last synced: 10 May 2026

https://github.com/dimitryzub/walmart-stores-coffee-analysis

Walmart Coffee Exploratory Data Analysis. Data Extracted with SerpApi 🧡

analysis analytics data data-visualization matplotlib pandas python pythonanalysis seaborn

Last synced: 10 May 2026

https://github.com/is-leeroy-jenkins/sherpa

A budget execution & data analysis tool for EPA analysts, built on WinForms and .NET 6 and written in C#

budget-management data-analysis data-science data-visualization federal-government

Last synced: 13 May 2026

https://github.com/ali-el-badry/machine-learning-algorithm

A repo containing different types of machine learning algorithms, such as regression, classification, and clustering, with more to be added soon

ai data-science data-visualization decision-tree feature-selection knn linear-regression logestic-regression machine-learning modelling random-forest svc svm titanic-kaggle xgboost xgboost-classifier

Last synced: 12 Jun 2026

https://github.com/mohini1403/road_accident_data_analytics

This project aims to analyze road accident data to gain insights into the factors contributing to accidents, identify patterns, and propose data-driven recommendations for improving road safety. The dataset used in this project contains information about various aspects of road accidents, such as location, time, weather conditions, and severity.

analytics data-visualization pandas powerbi

Last synced: 14 Jun 2026

https://github.com/akshadk7/exploratory-data-analysis

Implementing EDA and Machine Learning Algorithms on Kaggle Car Dataset

data-visualization exploratory-data-analysis machine-learning-algorithms predictive-modeling

Last synced: 17 Jun 2026

https://github.com/kirkalyn13/open-signal-report-generator

Script used to generate results and a summary, including the trends of flagged provinces, from the raw Excel data file.

data-analysis data-science data-visualization matplotlib numpy pandas python

Last synced: 19 Jun 2026

https://github.com/dorukalkan/3a-superstore-analysis

End-to-end data analysis, machine learning, and visualization project on a Turkish supermarket dataset

data-science data-visualization dbt machine-learning power-bi python sql

Last synced: 20 Jun 2026

https://github.com/alicankaya192/world-happiness-report-2025

Comprehensive exploratory data analysis (EDA) and visualization of the World Happiness Report 2025. Analyzes global rankings, regional distributions, key happiness factors, and detects wealth-happiness paradox outliers using Python (Pandas, Matplotlib, SciPy).

correlation-analysis data-analysis data-science data-visualization eda exploratory-data-analysis global-happiness happiness-index matplotlib pandas python scipy statistics whr-2025 world-happiness-report

Last synced: 21 Jun 2026

https://github.com/rogernet/desafio-profissional-produto-data-driven

Helps train Product Analysts, PMs, and Business Managers capable of making strategic, data-driven decisions.

data-analysis data-science data-visualization product

Last synced: 23 Jun 2026

https://github.com/matusf/glasgow_wifi

Script that plots Wi-Fi access points on a map and labels them by their protection

data data-visualization folium python python3

Last synced: 24 Jun 2026

https://github.com/jeugregg/coronavirusmodel

Coronavirus Visualization & Modeling

coronavirus covid-19 data-visualization

Last synced: 24 Jun 2026

https://github.com/shuklayash02/complete_data_analysis_project

A full data analysis project in which sales data goes through the ask, prepare, process, analyze, share, and act phases of the data analysis process

data data-visualization dataanalysis database datacleaning powerbi sql

Last synced: 16 Jul 2025

https://github.com/ahmednurabdii/data-analytics-portfolio-superstore

My first portfolio project showcasing data cleaning, analysis, and visualization of Superstore sales data.

data-analysis data-visualization jupyter-notebook matplotlib numpy pandas portfolio-project python sales-analysis scipy seaborn superstore-dataset

Last synced: 07 Apr 2026

https://github.com/callmequant/visualization-with-r

Create nice, transparent plots and avoid misleading figures (Use R!)

data-visualization

Last synced: 24 Jul 2025

https://github.com/buddhilive/solias-js

Web Component Library for data visualization made using StencilJS

charts data-visualization stenciljs visualization webcomponents

Last synced: 17 May 2026

https://github.com/scilifelabdatacentre/dash-covid-in-sweden

Dashboard showing the impact of COVID-19 in Sweden: number of cases, admissions to ICU, and deaths.

covid-19 dash data-visualization plotly-dash

Last synced: 18 Mar 2025

https://github.com/mustika-putri-m/-tableu-laporan-data-karyawan-growian

I am currently pursuing a data analysis certification at GROWIA, where I've learned to use tools such as Python, SQL, Google Big Query, Google Data Studio, Advanced Microsoft Excel, and Tableau. This course has enhanced my ability to analyze data using KPIs and business metrics, enabling me to solve business problems more effectively

data data-visualization tableau

Last synced: 17 Feb 2026

https://github.com/teja-1403/breast-cancer-detection-using-python

A machine learning project for breast cancer detection, classifying images as Benign, Malignant, or Normal using models like SVM and Random Forest. Includes pre-processing and performance evaluation, with a focus on advancing medical imaging through classification and analysis techniques.

data-science data-visualization internship-project machine-learning-algorithms python

Last synced: 17 May 2026

https://github.com/michaelpgalen/CVE-DataVis-Prototype-JS

A vanilla JavaScript prototype for a React data visualization project.

cve data-visualization vanilla-javascript vanilla-js

Last synced: 10 Mar 2025

https://github.com/pinedah/escom_development-of-applications-for-data-analysis

This repository is a personal collection of programs, exercises, and notes from the Development of Applications for Data Analysis course at Instituto Politécnico Nacional (IPN). As part of the Bachelor's in Data Science, the course focuses on developing practical skills in Python for data analysis.

data-analysis data-science data-visualization jupyter-notebook python python-data-analysis

Last synced: 20 Jan 2026

https://github.com/tom474/data_pipeline_with_docker

[RMIT 2024C] EEET2574 - Big Data for Engineering - Data Pipeline with Docker

cassandra data-engineering data-visualization docker jupyter-notebook kafka python

Last synced: 26 Jun 2025

https://github.com/devandrenicolas/analise-de-vendas

This project is a comprehensive data analysis tool designed to analyze sales performance data. It includes modules for generating fake sales data, cleaning and preprocessing the data, and performing exploratory data analysis (EDA) with advanced visualizations.

data-analysis data-visualization faker-generator matplotlib pandas python

Last synced: 07 May 2026

https://github.com/RoryDungan/music-fandom-map

Interactive maps showing the popularity of musicians around the world

choropleth data-visualization map music spotify

Last synced: 18 Jul 2025

https://github.com/boss294/user-analytics-dashboard

This User Analytics Dashboard is a real-time, interactive web application designed to track and visualize active, inactive, and total users with dynamic data updates.

analytics chartjs countdown-timer data-visualization dynamic html-css-javascript js livedata

Last synced: 28 Jun 2025

https://github.com/amirhosseinhonardoust/how-streamlit-makes-ai-accessible

A detailed educational article exploring how Streamlit revolutionizes AI app development. Learn how this Python framework bridges the gap between data science and usability, empowering anyone to deploy interactive machine learning models without front-end coding or complex infrastructure.

ai ai-accessibility dashboard data-engineering data-science data-visualization educational machine-learning mlops nlp open-source python streamlit tutorial web-app

Last synced: 02 May 2026

https://github.com/nick-peter-marcus/job-market-dashboard

Build a dashboard exploring data from the Data Science job market in Germany

chatgpt dashboard data-cleaning data-visualization levenshtein-distance regex streamlit streamlit-webapp

Last synced: 16 May 2026

https://github.com/marked01one/rio-airbnb-web-portal

A web portal to display data visualization results from the RIO Airbnb Predictive Model research project at Cal Poly Pomona

data-visualization flask plotly-dash research-project

Last synced: 19 May 2026

https://github.com/sn2606/breast-cancer-wisconsin-diagnostic

Data Visualization of the Breast Cancer Wisconsin diagnostic dataset

breast-cancer-wisconsin data-visualization matplotlib plotting python-3 seaborn-plots

Last synced: 21 Mar 2025

https://github.com/lonnygomes/sankey-diagram-poc

D3 sankey diagram data visualization mapping people to projects

d3 data-visualization sankey sankey-diagram

Last synced: 17 May 2026

https://github.com/rvalla/covid-19

Some code to analyze international data from COVID-19 Data Repository by Johns Hopkins CSSE and Argentina outbreak status.

argentina argentina-data covid-19 data-visualization johns-hopkins-csse plotting python3

Last synced: 24 Jun 2025

https://github.com/rauhanahmed/financialanalystaiagent

Agentic AI–driven stock analytics leveraging Phidata, Google Gemini 2.0 Flash, and Yahoo Finance. Features real-time data, interactive Plotly charts, and a Streamlit dashboard for comprehensive, actionable market insights.

agentic-ai data-visualization finance google-gemini investment phidata plotly stock-analysis streamlit yfinance

Last synced: 18 Jul 2025

https://github.com/furkantsnb/coalclassifie

CoalClassifier: A deep learning model for classifying coal types using EfficientNetB0-based transfer learning and fine-tuning techniques. This project is designed to accurately distinguish between Anthracite, Bituminous, Lignite, and Peat classes and is developed using TensorFlow/Keras

coal-classification computer-vision data-visualization efficientnetb0 image-classification keras python streamlit tensorflow

Last synced: 05 Apr 2026

https://github.com/nick-peter-marcus/marketing-data-analysis

Analyzing Marketing Analytics Data on Purchase Behavior and Campaign Responses - Customer Segmentation, Data Visualization, Regression Analysis, Random Forest

data-visualization k-means-clustering linear-regression logistic-regression pca random-forest segmentation

Last synced: 09 Sep 2025

https://github.com/cianhub/issue-tracker

This application is an issue tracker/new feature request platform for a theoretical app. The 'Great Idea Issue Tracker' is a responsive web application that allows users to create, upvote, pay for, comment on, update, view progress on, delete and read tickets containing bugs or new feature suggestions. The app was developed as a platform from which an existing platform can be improved and a team can receive constructive criticism and suggestions from their customers.

bootstrap css3 data-visualization django full-stack-web-development heroku-deployment html5 javascript jquery python3 user-management

Last synced: 07 Apr 2026

https://github.com/arction/lcjs-example-1111-coviddrilldowndashboard

In-depth example of a map dashboard with data drill-down. Visualizes relations between COVID vaccinations and cases

covid-19 data-visualization demo example lightningchart-js map-chart maps template

Last synced: 12 Mar 2025

https://github.com/csinva/global-sports-analysis

Analyzing how different factors influence global sports rankings

data-visualization plotly python sports-data sports-stats

Last synced: 02 Apr 2025

https://github.com/am-i-groot/summer-intern-iitguwahati-spml

Developed an automated Water Quality Monitoring System (WQMS) at IIT Guwahati, using the pH-W218 sensor and K-Means Clustering to assess water potability. The project enhances water quality evaluation through machine learning-based classification.

algorithm data data-visualization kmeans-clustering machine-learning python report sensor signal-processing

Last synced: 17 May 2026

https://github.com/giordano-lucas/tesco-extension

Products clustering and interactive visualization

clustering data-analysis data-visualization tesco

Last synced: 17 Jun 2026

https://github.com/30lima/gym-members-exercise-dataset

Dataset for analyzing calorie burn in relation to male and female gender

analysis data-science data-visualization jupyter-notebook python

Last synced: 18 May 2026

https://github.com/bytraembedded/Laptop-Price-Prediction-with-Machine-Learning

The Laptop Price Prediction with Machine Learning project provides a system to predict the price of laptops based on various features such as processor type, RAM size, storage capacity, and more.

airflow data data-science data-visualization fastapi heroku-deployment machine-learning-algorithms matplotlib-pyplot numpy pandas python reactjs seaborn

Last synced: 30 Dec 2025

https://github.com/rdrahul123/sales-dashboard

The Sales Analysis Dashboard was developed to provide insights into sales, profits, and product performance across different categories, timeframes, and geographic locations. By leveraging Power BI, the project aimed to transform raw data into actionable visualizations, facilitating better decision-making for stakeholders.

data-analysis data-science data-visualization dax powerbi

Last synced: 06 Jan 2026

https://github.com/inspirate789/bmstu-db

:open_file_folder: Lectures, seminars, and lab work for the "Databases" course at Bauman Moscow State Technical University.

5sem bmstu data-engineering data-science data-visualization database db etl golang grafana ics7 iu7 nifi notes papers plpgsql postgres postgresql redis sql

Last synced: 02 Mar 2026

https://github.com/rob-med/data-visualizations-for-python

A collection of useful snippets for clean data visualizations in Python (with matplotlib)

academic-publishing data data-science data-visualization dataviz matplotlib python scientific-publications storytelling visualization

Last synced: 08 May 2026

https://github.com/elitay152/the_expanse_nlp_character_map

Natural language processing project. Created an interactive character map with NetworkX based on The Expanse book series.

data-visualization natural-language-processing nlp-machine-learning relationship-graph

Last synced: 10 Apr 2025

https://github.com/arction/lcjs-example-0909-3drealtimeline

A demo application showcasing LightningChart JS displaying a 3D line chart in real time.

3d-visualization chart data-visualization lcjs lightningchart-js line-chart

Last synced: 12 Mar 2025

https://github.com/juniors90/flask-plots

Flask-Plots is a library for creating and rendering static visualizations using Matplotlib in Python.

data-visualization datavisualization flask jinja2 matplotlib-figures python

Last synced: 19 May 2026