An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/hms-homelab/hms-cpapdash-charts

Shared Angular chart component library for CPAP therapy data visualization — used across the CpapDash apps.

angular charts cpap data-visualization sleep-apnea typescript

Last synced: 21 Jun 2026

https://github.com/shariqayan/diwali_sales_analysis_python

The Diwali Sales Analysis project focuses on analyzing sales data during the Diwali festival to gain insights into customer behavior, improve customer experience, and optimize sales strategies.

data-visualization matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/prithviraj-2003/cognifyz-data-science-internship

🎓 Data Science Internship at Cognifyz Technologies 📅 Duration: 2 Months 🧠 Worked on real-world restaurant data 🗂️ Completed structured tasks across 3 levels 📌 Tasks focused on EDA, data preprocessing, visualization, and analysis 📎 Task descriptions provided in an attached PDF

data-analysis data-science data-visualization matplotlib numpy pandas python3

Last synced: 29 Apr 2026

https://github.com/sukitsubaki/image-color-scheme

Extract dominant colors from images and create beautiful color palettes with minimal dependencies. Supports various palette types: monochromatic, analogous, complementary, triadic, and tetradic.

color-extraction color-palette data-visualization design-tools image-analysis minimal python python-library

Last synced: 29 Apr 2026

https://github.com/amirrezaskh/nyc-taxi-dashboard

A comprehensive data analytics platform that processes NYC taxi trip data from Google BigQuery and visualizes insights through an interactive React dashboard. Features real-time heatmaps, temporal analysis, and geographic intelligence across 263 NYC taxi zones.

bigquery dashboard data-analytics data-science data-visualization geospatial leaflet material-ui nyc-taxi plotly react typescript

Last synced: 29 Apr 2026

https://github.com/muhammadusman-khan/e-commerce-store-eda

Exploratory Data Analysis on E-commerce store data to uncover insights about sales trends, customer behavior, and product performance using Python libraries like Pandas, NumPy, and Matplotlib/Seaborn.

data-analysis data-science data-visualization e-commerce eda exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/machinelearningzuu/data-engineering-projects

This repository is a curated collection of projects and tools that exemplify best practices in data engineering. It serves as a resource for data professionals seeking to enhance their data infrastructure, optimize data pipelines, and implement cutting-edge data processing techniques.

airflow bigquery data-engineering data-science data-visualization data-warehouse

Last synced: 30 Apr 2026

https://github.com/chrka/d3-chessboard-count

Plot per-square frequencies on a chessboard

chess d3 data-visualization

Last synced: 30 Apr 2026

https://github.com/abhinav330/instagram-influencers-analysis

This Jupyter Notebook focuses on preprocessing and visualizing data from an Instagram profiles dataset. It includes data loading, inspection, visualization, and some data preprocessing steps.

data data-science data-visualization exploratory-data-analysis exploratory-data-visualizations influncer-products instagram scikit-learn sklearn

Last synced: 08 Jun 2026

https://github.com/tashi-2004/global-ecommerce-retail-trends-analysis

The Global E-commerce & Retail Analysis project involves data preprocessing, dimensionality reduction with PCA, CLV calculation, and What-If analysis. Key insights include effective PCA for data reduction, detailed CLV analysis across segments, and the impact of pricing strategies on sales.

boxplot clv-analysis data-science data-visualization dataintegration deep-learning dimensionality-reduction ecommerce heatmap machine-learning normalization outlier-detection outlier-removal pca-analysis preprocessing python scatter-plot whatif-analysis

Last synced: 30 Apr 2026

https://github.com/dina-hosny/import-preprocess-and-visualize-a-dataset-project

A simple project to practice importing a dataset, cleaning and preparing the data, and visualizing the results to answer some given questions.

data-cleaning data-engineering data-science data-visualization jupyter-notebook matplotlib numpy pandas python

Last synced: 30 Apr 2026

https://github.com/mauriciovazquezm/caso_cas_fall2024

Repository for the CAS (Casualty Actuarial Society) Case Competition, Fall 2024.

data-visualization eda exploratory-data-analysis python

Last synced: 22 Jun 2026

https://github.com/diogojorgebasso/dataanalysis_r_minesnancy

Code and materials for the data analysis courses in R at Mines de Nancy. You will also find R scripts, notebooks, and other resources for each lesson.

analyse-data data-analysis data-science data-visualization estatistics r statistiques statistiques-descriptives

Last synced: 30 Apr 2026

https://github.com/srinibas-masanta/ibm-applied-data-science-capstone

This repository contains the work completed for the Applied Data Science Capstone Project offered by IBM on Coursera. The capstone project is the final course in the IBM Data Science Professional Certificate series and serves as an opportunity to apply the skills and knowledge gained throughout the series to a real-world data science problem.

capstone-project data-analysis data-science data-visualization machine-learning python web-scraping

Last synced: 30 Apr 2026

https://github.com/priyam-hub/covid-19-data-analysis

Explore COVID-19 case numbers and deaths from the 2019/2020 coronavirus outbreak using pandas in a Jupyter notebook.

analysis data data-visualization jupyter-notebook machine-learning python

Last synced: 08 Jun 2026

https://github.com/tolumie/aviva-insurance-statistics-hypothesis-abtesting-modelling

This project explores the impact of demographic and lifestyle factors on insurance charges using statistical hypothesis testing (ANOVA, Chi-Square, T-tests) and predictive modeling (Elastic Net, Random Forest, Gradient Boosting). The analysis is deployed using Streamlit.

anova chi-square-test data-visualization eda gradient-boosting hypothesis-testing insurance-dataset machine-learning predictive-modeling python random-forest statistical-analysis streamlit

Last synced: 30 Apr 2026

https://github.com/rayxiang03/indeed-job-scraping

Python toolkit for scraping Indeed job listings, preprocessing data, and generating visualizations for market analysis.

cloudscraper data-visualization indeed job-analysis nlp pandas python web-scraping

Last synced: 30 Apr 2026

https://github.com/mitchellharrison/mitchellharrison.github.io

Welcome to my slice of the internet, where I share the knowledge that Duke gave me, so you don't have to spend the mortgage-sized amount to access it. Built with R, Python, Quarto, and love.

ai algorithms-and-data-structures blog data-analysis data-science data-visualization educational machine-learning portfolio portfolio-website quarto r r-language statistics tutorials

Last synced: 30 Apr 2026

https://github.com/erishen/langgraph-csv-analyst

CSV → Multi-Agent Analysis Pipeline → Visual HTML Report. Built with LangGraph StateGraph: data profiling, trend analysis, anomaly detection, and investment portfolio analysis.

asset-lens csv-analysis data-visualization investment-analysis langchain langgraph multi-agent plotly python

Last synced: 23 Jun 2026

https://github.com/markalbrand56/ds-laboratorio-9

Data Exploration in different datasets

data-exploration data-visualization matplotlib pandas

Last synced: 23 Jun 2026

https://github.com/ddeepanshu-997/datascience-e-commerce-shopping-details-

In this project, I apply data preprocessing techniques to clean the dataset, derive insights to find the hot picks among shopping items, and use data visualization libraries to surface trends and other aspects, as a small contribution to the field of data science.

cleaning-data data data-science data-visualization dataframe datapreprocessing dataset libraries matplotlib-pyplot numpy pandas plots python visualization

Last synced: 30 Apr 2026

https://github.com/miguelmedinacastro/trabalho-dados-r

Final project for the Exploratory Data Analysis course.

data data-science data-science-projects data-visualization database r rstudio

Last synced: 01 May 2026

https://github.com/gitchaell/computer-scrapping

A tool that extracts data from the websites of companies that sell computers in the city of Trujillo, Peru, exports it to an XLSX file according to a relational data model, and displays it on a Power BI dashboard.

data-analysis data-structures data-visualization database dbdiagram export-excel powerbi scrapper-script scrapping xlsx

Last synced: 01 May 2026

https://github.com/fazatholomew/marlboroplan

To contribute to a more inclusive sustainable energy program in Massachusetts, this project is part of my work for a nonprofit organization called All In Energy and of my undergraduate thesis.

data-analysis data-visualization energy jupyter-notebook massachusetts python

Last synced: 01 May 2026

https://github.com/akhdandann/productdashboard-powerbi

A Power BI dashboard to monitor engineering effectiveness at PT. XL Axiata Tbk. It tracks release frequency, production defects, cycle time, developer activity, and team happiness. Note: All names have been changed and the data has been modified - this is dummy data for demonstration purposes only.

business-intelligence dashboard data-visualization power-bi reporting

Last synced: 26 Jan 2026

https://github.com/cdeweyx/bryce-harper-2016-analysis

Notebook analyzing Bryce Harper's disappointing 2016 campaign in historical context through data analytics.

data-analysis data-visualization python

Last synced: 01 May 2026

https://github.com/gerhynes/d3-movie-quotes

A simple page built to practice binding data to elements using D3. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 01 May 2026

https://github.com/kevinandersontech/ecommerce_dashboard_streamlit

A Streamlit dashboard that reads daily revenue metrics from the data pipeline. Provides date filters, summary KPIs, line charts, and a table to explore revenue over time across different statuses (e.g. paid, refunded, failed).

charts dashboard data-visualization duckdb filters metrics python streamlit

Last synced: 01 May 2026

https://github.com/kristishqau/apartmentregressionanalysis

This data science project aims to predict apartment prices through regression analysis. The dataset used contains information about apartments, and the project involves various steps such as data preprocessing, exploratory data analysis, feature engineering, and building a decision tree regression model.

apartment-prices data-preprocessing data-science data-visualization decision-tree-regression jupyter-notebook prediction python3

Last synced: 01 May 2026

https://github.com/ishmal793/bi-dummy-

An interactive and beginner-friendly data dashboard built using Streamlit. Upload your own CSV or Excel file, apply filters, view key statistics, and generate beautiful visualizations with no coding required.

data-analytics data-visualization eda pandas plotly python-dashboard streamlit

Last synced: 01 May 2026

https://github.com/fbarffmann/project1

Analyzed factors influencing movie profitability using Python. Cleaned and visualized film industry data to uncover trends in budgets, sales, genres, and ratings.

box-office-analysis data-analysis data-visualization matplotlib movie-industry pandas python regression seaborn

Last synced: 01 May 2026

https://github.com/anandvai/ai_rag_chatbot_multi_pdf_support

RAG (Retrieval-Augmented Generation) Chatbot built with Streamlit and LangChain, powered by Groq's blazing-fast LLaMA3-8B. It allows you to upload multiple PDFs, ask questions, and get precise, context-aware answers in a conversational format.

ai data data-science data-visualization data-visualizations dataengineering fastapi langchain langgraph python sql streamlit

Last synced: 01 May 2026

https://github.com/jfaccioli/leaflet-earthquake

Geo-mapping earthquakes with Leaflet / JavaScript / GeoJSON

data-visualization geojson javascript json leaflet

Last synced: 01 May 2026

https://github.com/gabrieldiem/iss_locator

A small Python script that plots the location of the ISS (International Space Station) on a world map at a given time.

data-visualization pandas plotly python script

Last synced: 01 May 2026

https://github.com/samia35-2973/world-university-ranking-2023-prediction

This repository builds models for predicting the World University Rankings 2023. The World University Rankings 2023 dataset includes 1,799 universities across 104 countries and regions, making it the largest and most diverse university ranking to date. A clean dataset is generated through data preprocessing.

data-cleaning data-preprocessing data-visualization decision-trees machine-learning machine-learning-algorithms model-training prediction world-university-rankings world-university-rankings-2023

Last synced: 01 May 2026

https://github.com/kushshriv/onlinejobpostings-infographic

The Python Data Cleaning Code and Input Dataset For My Telling Stories With Data Project

data-visualization pandas python

Last synced: 01 May 2026

https://github.com/rbreeze/dashboard

My personal health dashboard, with daily stats on food and sleep. It has undergone several redesigns since 2015.

css dashboard data data-visualization design front-end google-sheets google-sheets-api health html javascript personal-health-record personal-website running static static-site visualization

Last synced: 02 May 2026

https://github.com/scottdj92/nivo-data-viz-poc

a POC for data visualizations using Nivo

data-visualization emotion nivo parcel-bundler typescript

Last synced: 08 Jun 2026

https://github.com/teja-1403/ignosis-tech-ml-assignment

Analysis of transaction data to identify the most profitable products and key customer segments, providing insights for targeted marketing strategies.

customer-segmentation data-analysis data-visualization machine-learning marketing-strategy python

Last synced: 02 May 2026

https://github.com/gerhynes/d3-mobile-subscription-literacy-scatterplot

A D3 scatterplot showing mobile phone subscriptions against literacy rates. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 02 May 2026

https://github.com/bhawnagoyal18/ai-doctor-a-symptom-checker-disease-predictor

AI Doctor is an intelligent healthcare application that utilizes machine learning (ML) and Python to predict potential diseases based on user-input symptoms. The project integrates data from multiple medical datasets and provides an interactive web-based UI for an intuitive user experience.

data-analysis data-engineering data-visualization dataset flask html5 machine-learning python sql stacking statistics

Last synced: 02 May 2026

https://github.com/holy-angel-university/global-cost-index-analysis

This analysis explores the cost of living across various countries, aiming to provide insights into economic disparities and living standards on a global scale. Utilizing a dataset that includes indices for overall cost of living, groceries, restaurant prices, and rent, we investigate the top and least expensive countries worldwide.

data-science data-visualization exploratory-data-analysis jupyter-notebook python3

Last synced: 02 May 2026

https://github.com/debjyotisaha/web-application-projects-streamlit-phase-2

This repository showcases interactive web applications built using the Streamlit framework.

dashboard data-visualization python streamlit

Last synced: 02 May 2026

https://github.com/benzerinsio/breastcancer-eda

📊 Exploratory Data Analysis (EDA) - Breast Cancer | Exploration of clinical features to identify patterns and relationships in breast cancer diagnosis.

analise-de-dados analise-exploratoria analise-exploratoria-de-dados data-analysis data-visualization diagnosis eda exploratory-data-analysis health-care medical-data python seaborn

Last synced: 02 May 2026

https://github.com/s1dewalker/electric-future

Visual Analysis: Future of Automotive Industry

data data-visualization machine-learning python3 regression-analysis tableau

Last synced: 02 May 2026

https://github.com/gerhynes/d3-notes-app

A simple notes app built to practice D3 selection methods. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 03 May 2026

https://github.com/gkar90/gdp-vs-life-expectancy

Statistical analysis on GDP vs Life Expectancy

data-science data-visualization statistical-analysis

Last synced: 09 Jun 2026

https://github.com/nimomach/skateboarding-in-olympics

Skateboarding made its Olympic debut at the 2020 Summer Olympics. This dashboard, "Skateboarding in the Olympics", presents a comprehensive overview of the sport's performance, popularity, and key metrics during the Olympic Games.

data-analysis data-visualization olympics paris skateboarding tokyo

Last synced: 10 Mar 2026

https://github.com/monteirooscar98/tarifas-publicas-sp-dieese

Data extraction via web scraping from the Dieese website and analysis of public tariffs in the Municipality of São Paulo.

data-analysis data-visualization python webscraping

Last synced: 03 May 2026

https://github.com/fatihilhan42/tourist_analysis_in_turkey_with_python

This project analyzes the number of tourists who visited Turkey between 2008 and 2021. The dataset, which you can find in the repository, was first organized using data cleaning algorithms. The cleaned data were then presented graphically using data visualization algorithms.

data-analysis data-cleaning data-science data-visualization jupyter-notebook python

Last synced: 03 May 2026

https://github.com/corey-richardson/microbit-data-logger

In preparation for Work Experience Students coming in, I am using this project to familiarise myself with the BBC micro:bits which we will provide them with. I am also using it as a chance to expand on my data visualisation with Python experience.

data-visualization matplotlib microbit pandas pyplot signal-processing

Last synced: 03 May 2026

https://github.com/baggiponte/ta-business-statistics-2023

🎓 Business Statistics [UniMi • AY 2022/2023]

data-mining data-science data-visualization r tidyverse

Last synced: 03 May 2026

https://github.com/dvmorozov/arxiv

ArxivExpress - arxiv.org client for Android and iOS, ArxivNavigator - interactive arxiv.org metadata visualization. I would appreciate any way of contributing: GitHub issue, email or pull request.

android-application arxiv arxiv-api arxiv-org csharp d3-js d3-visualization d3js data-science data-visualization datascience ios-app javascript pyhton3 xamarin xamarin-forms

Last synced: 03 May 2026

https://github.com/codeslash21/tmdb_data_analysis

We analyzed the TMDB dataset, which contains details of around 11,000 movies, to find some interesting facts about it.

data-analysis data-visualization matplotlib nanodegree-project numpy pandas python tmdb-movie

Last synced: 03 May 2026

https://github.com/akhil-krishnan-r/super_market_analysis

Supermarkets are growing in the most populated cities, and market competition is high. This dataset contains historical sales of a supermarket company, recorded across 3 different branches over 3 months. Predictive data analytics methods are easy to apply to this dataset.

data-visualization matplotlib numpy pandas seaborn

Last synced: 03 May 2026

https://github.com/panastasiadis/k8s-smart-home-frontend-service

A Streamlit-based front-end, developed as part of the thesis "Enhancing Integration Process and Manageability of a Microservices-Based Home Automation Application with Kubernetes". This GUI connects to a Django backend and InfluxDB to enable real-time sensor monitoring, device and room management, and action automation.

dashboard data-visualization gui home-automation python smart-home streamlit user-interface

Last synced: 03 May 2026

https://github.com/gerhynes/d3-birth-chart

An interactive UN birth rate bar chart built to practice manipulating SVGs with D3. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 03 May 2026

https://github.com/allanreda/ga4-session-predictor-flask-app

Flask app that can predict future number of GA4 sessions, using the Prophet library.

data-visualization flask ga4-api matplotlib pandas prophet-library python time-series-forecasting

Last synced: 03 May 2026

https://github.com/prachipatel15/priceprediction

I've built a real estate price prediction project that predicts house prices in Bangalore.

data-science data-visualization machine-learning pandas python

Last synced: 03 May 2026

https://github.com/yugsumeet17/churn-analysis-project--power-bi-sql-machine-learning

Dataset Explained, Project Goals & Metrics Required, SQL Server ETL & Data Cleaning, Power BI Data Load, Transformation, Blueprint & Measures, Power BI Visualization - Summary Page, Building Machine Learning Model - Random Forest, Power BI Visualization - Churn Prediction Page

data data-visualization dataanalytics excel postgresql powerbi python3

Last synced: 03 May 2026

https://github.com/eikospartners/ngcatalyst

Repo for visualizations for Angular

angular d3 data-visualization

Last synced: 03 May 2026

https://github.com/joaopalmeiro/b2-presentation

Materials for the presentation of the paper: "B2: Bridging Code and Interactive Visualization in Computational Notebooks" (Yifan Wu, Joseph M. Hellerstein, Arvind Satyanarayan — UIST 2020).

data-visualization interactive-visualizations jupyter jupyter-notebook jupyter-notebook-extension python r vega-lite xaringan

Last synced: 03 May 2026

https://github.com/minervarose/applysignal

Candidate-side recruitment intelligence dashboard for tracking job applications, scoring process signals, and prioritizing opportunities.

candidate-experience career-tools dashboard data-visualization decision-support job-search portfolio-project python recruitment streamlit

Last synced: 24 Jun 2026

https://github.com/akashparley/ipl-data-analysis

The IPL Data Analysis project focuses on extracting valuable insights from IPL match data using various data analytics techniques. By analyzing historical match outcomes, player performances, team comparisons, and venue statistics, the project visualizes trends and patterns through graphs like bar charts, line graphs, and scatter plots.

analytics data-visualization excel metplotlib numpy pandas python seaborn tableau-dashboards

Last synced: 04 May 2026

https://github.com/pragmaai/yelp-datapipeline

🍽️ Yelp Data Pipeline & Analytics Dashboard End-to-end data engineering pipeline processing Yelp dataset with Rust transforms, Apache Airflow orchestration, and interactive Streamlit analytics. Features business insights, user engagement analysis, and city performance comparisons. 🚀 Docker-ready • 📊 Interactive Dashboard • ⚡ High-performance R

airflow data-engineering data-pipeline data-visualization datafusion docker rust streamlit yelp yelp-dataset

Last synced: 04 May 2026

https://github.com/pejpero/multi-class_classifier

This project demonstrates multiclass classification using Perceptron and Logistic Regression, implemented from scratch without using built-in libraries. It includes techniques like One-Versus-The-Rest and One-Versus-One for Perceptron, and Softmax for Logistic Regression, with a focus on understanding core ML concepts.

data-visualization logistic-regression machine-learning multiclass-classification one-vs-one one-vs-rest perceptron python scratch-implementation softmax

Last synced: 09 Jun 2026

https://github.com/samruddhi3012/screen-time-analysis

Hi! This repo demonstrates a Python project on Screen Time Analysis.

data-analysis data-visualization python

Last synced: 04 May 2026

https://github.com/parmaster/logserver

Collecting telemetry data posted by IoT devices into MQTT, filtering, processing and visualizing results

data-visualization golang-template iot-application mqtt mqtt-client template

Last synced: 04 May 2026

https://github.com/mindlessmuse666/titanic-data-visualization

A project visualizing Titanic passenger data using the Python libraries Matplotlib, Seaborn, and Plotly.

data-analysis data-visualization matplotlib pandas plotly python seaborn titanic

Last synced: 04 May 2026

https://github.com/leosimoes/udacity-communicate-data-findings-titanic

Project 5 of the Udacity Data Analyst Nanodegree Program. Data analysis with views of the Titanic dataset.

data-science data-visualization python

Last synced: 04 May 2026

https://github.com/xiaohan2012/myunisport

Visualize your Unisport annual training records

data-analysis data-visualization pandas pygal sports-stats tikzposter

Last synced: 04 May 2026

https://github.com/sanchittechnogeek/rental-data-visualization_python

Statistics and visualization of rental data with Python

data-analysis data-science data-visualization statistics

Last synced: 04 May 2026

https://github.com/furkalb/global-power-plant-analysis

Global Power Plant Database Analysis is a Streamlit-based interactive web app designed to explore and analyze worldwide energy generation data

analysis data-science data-visualization energy-analysis exploratory-data-analysis gis global-energy machine-learning mapbox plotly predictive-modeling python streamlit streamlit-dashboard

Last synced: 04 May 2026

https://github.com/mr-chang95/sf_data_visualization

In this personal project, I am interested in examining all of the active businesses in the San Francisco Bay Area while performing some simple data visualizations, mainly on categorical variables.

business data-analysis data-visualization jupyter-notebook pandas python san-francisco

Last synced: 04 May 2026