An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/fatihilhan42/the-office-eda

Data analysis study of my favorite sitcom, The Office (US).

data-analysis data-science data-visualization fatihilhan office python sitcom

Last synced: 04 May 2026

https://github.com/bhawnagoyal18/ai-doctor-a-symptom-checker-disease-predictor

AI Doctor is an intelligent healthcare application that utilizes machine learning (ML) and Python to predict potential diseases based on user-input symptoms. The project integrates data from multiple medical datasets and provides an interactive web-based UI for an intuitive user experience.

data-analysis data-engineering data-visualization dataset flask html5 machine-learning python sql stacking statistics

Last synced: 02 May 2026

https://github.com/fatihilhan42/spotify-songs-recommendations-system_with_python

We developed a song recommendation system using data from a Spotify song dataset. The dataset and other applications are given in the description. Have a nice day.

data-analysis data-science data-visualization jupyter-notebook python recommendation-engine recommendation-system

Last synced: 02 May 2026

https://github.com/benzerinsio/breastcancer-eda

📊 Exploratory Data Analysis (EDA) - Breast Cancer | Exploring clinical features to identify patterns and relationships in breast cancer diagnosis.

analise-de-dados analise-exploratoria analise-exploratoria-de-dados data-analysis data-visualization diagnosis eda exploratory-data-analysis health-care medical-data python seaborn

Last synced: 02 May 2026

https://github.com/s1dewalker/electric-future

Visual Analysis: Future of Automotive Industry

data data-visualization machine-learning python3 regression-analysis tableau

Last synced: 02 May 2026

https://github.com/peter-gy/autovistype

Probing vision-language model alignment with human expert visual grouping over a stratified sample of the VIS30K dataset.

data-visualization google-genai langchain llm-benchmarking marimo meta-llama mistral multi-label-classification openai polars qwen uv vis30k vision-language-model visual-stimuli visualization-categorization vlm

Last synced: 02 May 2026

https://github.com/gerhynes/d3-notes-app

A simple notes app built to practice D3 selection methods. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 03 May 2026

https://github.com/gkar90/gdp-vs-life-expectancy

Statistical analysis on GDP vs Life Expectancy

data-science data-visualization statistical-analysis

Last synced: 09 Jun 2026

https://github.com/ronaldkanyepi/southern-africa-health-indicators-analysis

Visualizing Health Trends in Southern Africa: Analyzing World Bank data (2013–2020) to uncover patterns in life expectancy, infant mortality, HIV prevalence, and maternal mortality.

botswana data-visualization dplyr ggplot2 ggtext health healthcare mozambique patchwork r south-africa tidyverse viridis zimbabwe

Last synced: 09 Jun 2026

https://github.com/miteshgupta07/covid-19-report-dashboard-using-streamlit

A Streamlit dashboard for COVID-19 reporting that provides real-time updates, visualizations, and analysis of global and local COVID-19 data to track the pandemic's progress and impact.

data-visualization python streamlit

Last synced: 03 May 2026

https://github.com/lut-ful/credit-card-financial-report-dashboard

Visualizing and unlocking different KPIs of a financial institution based on credit card data using Power BI.

data-analytics data-visualization dax-query postgresql powe-bi python statics

Last synced: 03 May 2026

https://github.com/james-julius/latent-space-explorer

A flythrough 3D map of meaning — type any concept and watch it land near related ideas. In-browser embeddings (no install, no key), pre-seeded scenes, and a multi-model knowledge explorer.

3d-visualization ai claude data-visualization embeddings gemini latent-space llm machine-learning nextjs openai react-three-fiber semantic-search text-embeddings threejs transformers-js typescript umap vector-search webgpu

Last synced: 09 Jun 2026

https://github.com/nix7amcm/fcc-data-viz-cert-projects

These are my projects for the freeCodeCamp Data Visualization certification.

d3 d3-visualization d3js data-visualization data-viz freecodecamp freecodecamp-project html-css-javascript

Last synced: 03 May 2026

https://github.com/baggiponte/ta-business-statistics-2023

🎓 Business Statistics [UniMi • AY 2022/2023]

data-mining data-science data-visualization r tidyverse

Last synced: 03 May 2026

https://github.com/codeslash21/tmdb_data_analysis

We analyzed the TMDB dataset, which contains details of around 11,000 movies, to find some interesting facts about it.

data-analysis data-visualization matplotlib nanodegree-project numpy pandas python tmdb-movie

Last synced: 03 May 2026

https://github.com/gerhynes/d3-birth-chart

An interactive UN birth rate bar chart built to practice manipulating SVGs with D3. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 03 May 2026

https://github.com/leandrocollares/foreign-born-population-in-canada

Responsive bar chart that shows the percentage of foreign-born population in Canada between 1871 and 2011

d3 data-visualization svelte

Last synced: 03 May 2026

https://github.com/shivabajelan/belly-button-biodiversity-dashboard

Belly Button Biodiversity Dashboard is an open-source interactive dashboard that visualises the Belly Button Biodiversity dataset. Built with JavaScript, D3.js, Plotly.js, HTML, and CSS, the dashboard features a dropdown menu, horizontal bar chart, bubble chart, and demographic information display.

css d3 dashboard data-visualization html javascript plotly visualization

Last synced: 03 May 2026

https://github.com/imgios/dependensee

📊 Minimal relaxed functional dependencies set visualization webapp.

angular d3js data-visualization ionic rfd typescript visual-analytics webapp

Last synced: 03 May 2026

https://github.com/ljadhav25/swiggy-restaurant-analysis

This repository contains data and analysis related to restaurants listed on Swiggy, one of India's largest online food ordering and delivery platforms. The objective is to explore restaurant trends, customer reviews, pricing strategies, and delivery metrics to gain insights into the food delivery industry.

data-analysis data-visualization matplotlib-pyplot numpy-library pandas-library python seaborn-plots

Last synced: 03 May 2026

https://github.com/ibrahim1993-2/iraq-primary-enrollment

Data analysis of Iraq’s primary school enrollment (1960–2024). Includes SQL-style exploration, 14 insights with CSV exports, visualizations, and a polished Jupyter Notebook (PDF + HTML).

data-visualization eda education-data iraq jupyter-notebook matplotlib pandas portfolio-project python3 world-bank

Last synced: 03 May 2026

https://github.com/devesh8423/machine_learning

Machine Learning practice projects, Jupyter notebooks, and datasets for learning regression, classification, and data analysis.

classification data-analysis data-science data-visualization jupyter-notebook machine-learning matplotlib ml-project numpy-library pandas python regression sckit-learn seaborn

Last synced: 03 May 2026

https://github.com/donmaruko/flask-data-analysis

Flask API for statistical calculations. Data analysis, cleansing, visualization, and manipulation. Documented by Swagger.

api api-rest data-analysis data-science data-visualization datascience flasgger matplotlib pandas seaborn sqlite wordcloud

Last synced: 03 May 2026

https://github.com/pragmaai/yelp-datapipeline

🍽️ Yelp Data Pipeline & Analytics Dashboard: End-to-end data engineering pipeline processing Yelp dataset with Rust transforms, Apache Airflow orchestration, and interactive Streamlit analytics. Features business insights, user engagement analysis, and city performance comparisons. 🚀 Docker-ready • 📊 Interactive Dashboard • ⚡ High-performance R

airflow data-engineering data-pipeline data-visualization datafusion docker rust streamlit yelp yelp-dataset

Last synced: 04 May 2026

https://github.com/samruddhi3012/screen-time-analysis

Hi! This repo demonstrates a Python project on Screen Time Analysis.

data-analysis data-visualization python

Last synced: 04 May 2026

https://github.com/rumana-amin/sql-data-warehouse-and-analytics-project

Building a data warehouse with SQL Server, including ETL processes, data modeling, and analytics

data-cleaning data-visualization data-warehouse etl etl-pipeline sql-queries sql-server

Last synced: 09 Jun 2026

https://github.com/gerhynes/d3-pie-chart

A D3 pie chart showing UN birth data grouped by continent. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 04 May 2026

https://github.com/joaopalmeiro/altair-playground-app

An application powered by Streamlit to test different configurations on an Altair chart.

altair data-visualization streamlit

Last synced: 04 May 2026

https://github.com/michal-radomski/3d-libraries_data-visualization_webgl

3D libraries, data visualization and WebGL courses from Udemy.com

3d d3js data-visualization threejs webgl

Last synced: 04 May 2026

https://github.com/sanchittechnogeek/rental-data-visualization_python

Statistics and visualization of rental data with Python

data-analysis data-science data-visualization statistics

Last synced: 04 May 2026

https://github.com/nitrocosmstudios/temperature-sonification

Experimental sonification of temperature sensor data revealing structured, speech-like patterns coinciding with geomagnetic disturbances. Includes original dataset processing, synthetic signal attempts, and full write-up.

audio-analysis data-visualization electromagnetic-interference emf environmental-monitoring open-science raspberry-pi signal-processing sonification temperature temperature-monitoring temperature-sensor usb-devices weird-science

Last synced: 04 May 2026

https://github.com/damisparks/become_data_analyst

Are you new to data analysis? Here you will find a simple notebook that will help you on your journey. These are personal projects that I am still working on.

data data-analysis data-visualization matplotlib numpy pandas-tutorial

Last synced: 04 May 2026

https://github.com/fatihilhan42/book-recommendation-system-with-python

In this project, we build a book recommendation system that recommends similar books according to the genres or ratings the user enters, using a large book dataset. The link to the dataset is given below. Happy reading...

books data-analysis data-science data-visualization kaggle python recommendation-engine recommendation-system

Last synced: 04 May 2026

https://github.com/sagarprajapat2004/data-analysis-visualization

Downloaded and analyzed a dataset from Kaggle using NumPy and Pandas, created visualizations with Matplotlib and Seaborn, and developed a Flask web application to showcase data insights and conclusions.

data-analysis data-modeling data-visualization exploratory-data-analysis flask python statical-analysis

Last synced: 04 May 2026

https://github.com/hyperplasma/olympic-visualization-analysis

Multidimensional analysis and visualization of Olympic medals, economy, and happiness index.

data-analysis data-visualization matplotlib numpy pandas python wordcloud

Last synced: 04 May 2026

https://github.com/halyusa16/e-commerce-analysis

This project analyzes a public e-commerce dataset to uncover valuable insights and answer critical business questions. The dataset contains customer, product, order, and transaction details, providing a comprehensive view of the e-commerce platform's operations.

data-analysis data-cleaning data-exploration data-visualization self-project

Last synced: 09 Jun 2026

https://github.com/ljadhav25/logistic-regression-data-science-

Logistic regression estimates the probability of an event occurring, such as voted or didn’t vote, based on a given data set of independent variables.

data-analysis data-science data-visualization logestic-regression machine-learning

Last synced: 04 May 2026

https://github.com/mugilan1309/csv_analyzer

📊 A simple Streamlit-based CSV Analysis & Preprocessing Tool for quick data insights.

csv-processing data-analysis data-visualization machine-learning python streamlit

Last synced: 04 May 2026

https://github.com/bishopce16/pyber_analysis

The purpose of this project was to complete an exploratory analysis and create visualizations of the 2019 ride sharing data from PyBer.

data-analysis data-visualization jupyter-notebook matplotlib pandas python

Last synced: 04 May 2026

https://github.com/flytomarsz/bike-sharing-system-analysis

This analysis project aims to identify bike rental behavior in 2012 in the Capital Bikeshare system, Washington, D.C., USA. This project is part of my Data Analysis study at Dicoding.

data-analysis data-visualization jupyter-notebook python streamlit

Last synced: 04 May 2026

https://github.com/dhruvsrikanth/basic-data-science

A short Data Science Project I took up for fun! This is a data analysis based on a dataset I created to predict the distribution of wealth within an economy as well as several characteristics of each class within society!

analysis data-analysis data-pipeline data-science data-visualization machine-learning matplotlib pandas python seaborn sklearn

Last synced: 05 May 2026

https://github.com/s4rrar/israel-gaza-war

Human losses of the Israeli genocide in Gaza and the West Bank in numbers (October 7th War Victims)

data-engineering data-science data-visualization datascience jupyter jupyter-notebook python

Last synced: 05 May 2026

https://github.com/riyouuyt/investigate-hotel-business-using-data-visualization

Explore hospitality data, visualizing customer behavior in hotel reservations and its impact on cancellations for strategic insights.

businessreporting data-visualization hotel-booking jupyter-notebook matplotlib-pyplot presentation python

Last synced: 05 May 2026

https://github.com/badranalyst/residential-unit-prices-data-analysis-application

Python-based analysis of residential unit prices, focusing on data cleaning, visualization, and exploratory data analysis (EDA). Key features include price distribution and correlation analysis between factors like size, location, and pricing.

data-analysis data-visualization dataset matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/codewithmayank-py/box-office-analysis-with-seaborn-and-python

This repository contains Python code and datasets for analyzing box office data. Explore trends, patterns, and factors influencing movie performance.

analysis box-office-data-analysis data-analysis data-visualization dataset jupyter-notebook matplotlib pandas python3 seaborn

Last synced: 05 May 2026

https://github.com/soupu07/ibm-employee-attrition-prediction

This project analyzes factors driving IBM employee attrition and predicts those likely to leave, helping the organization understand turnover causes and improve retention and performance.

data-science data-science-projects data-visualization machine-learning python python-data-analysis python-programming-language python-project

Last synced: 05 May 2026

https://github.com/sundarmd/digital_twin_for_li-ion_batteries

Digital Twin for Li-ion batteries on AWS built using S3, EC2, SageMaker, Redshift, Terraform, QuickSight

aws-ec2 aws-s3 data-visualization iot python3 pytorch sql terraform

Last synced: 05 May 2026

https://github.com/anushkundu/crime-pattern-analysis

Analyzing Crime Patterns in Montgomery County, USA: An Inclusive Study Based on NIBRS Data (2016-2022)

data-analysis data-visualization descriptive-statistics matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/kevinwood15/python_ml_neuralnetwork_project

I build a neural network to evaluate the CIFAR-10 dataset and achieve above 50% accuracy

cifar10 data-science data-visualization deep-learning neural-network python pytorch

Last synced: 05 May 2026

https://github.com/hms75/movie_rating_analysis

A movie rating analysis which identifies trends amongst a dataset of 5000 movies.

data-analysis data-visualization matplotlib-pyplot numpy pandas python

Last synced: 05 May 2026

https://github.com/femincan/d3-bar-chart

My solution for the Visualize Data with a Bar Chart project on FCC.

css3 d3js data-visualization html5 javascript

Last synced: 06 May 2026

https://github.com/robmcelhinney/roman-emperors

D3.js Bar Charts of Roman Emperors' Age and length of Reign from 63 BCE to 395 CE

d3 d3js data-visualization emperor roman rome

Last synced: 06 May 2026

https://github.com/benjaminrose/data-analysis-book

A Jupyter Book for my Spring 2025 PHY 5381 class on Data Analysis

book data-analysis data-science data-visualization jupyter-book open-book python r statistics-course

Last synced: 06 May 2026

https://github.com/rehanvhora778/bibtex-extraction

📄 Extract BibTeX entries from PDFs automatically, generating a complete bibliography without manual input or reliance on external APIs.

academic-writing analysis automation bibliometric-analysis bibliometrics bibtex data-visualization langchain latex metadata-extraction pdf pyhton pypdf reference-management research-tools

Last synced: 06 May 2026

https://github.com/deaneeth/aqi-monitoring-system

Real-time air quality monitoring system with interactive maps, role-based access control, and automated AQI data simulation. Built with Laravel, MySQL, and Leaflet.js.

air-quality alerts-system data-visualization fullstack geospatial interactive-maps laravel leaflet mysql php phpunit simulation web-application

Last synced: 06 May 2026

https://github.com/sankaran-s2001/us-traffic-accidents-analysis-python-eda

Exploratory data analysis of US traffic accidents from 2016-2023, analyzing patterns by time, location, weather, and severity using Python data science libraries.

data-analysis data-science data-visualization eda matplolib numpy pandas python

Last synced: 06 May 2026

https://github.com/loosenthedark/strikeforce

StrikeForce is an interactive frontend site that processes and presents data on Premier League goalscorers in a meaningful, easy-to-digest form

ajax-request api autocomplete-search bootstrap4 chartjs css3 css3-animations data-visualization emailjs entypo fontawesome frontend html5 interactive javascript jquery tablesorter

Last synced: 06 May 2026

https://github.com/gerhynes/d3-median-age-histogram

A D3 histogram displaying UN data on median ages. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 06 May 2026

https://github.com/adesartika33/proyek-analisis-data-dataset-iris

This project aims to analyze the Iris dataset, one of the classic datasets in machine learning and data science. The dataset consists of 150 Iris flower samples from three species (Setosa, Versicolor, and Virginica).

classification data-science data-visualization eda exploratory-data-analysis iris-dataset machine-learning python random-forest scikit-learn

Last synced: 06 May 2026

https://github.com/harryrlk/data_analysis_showcase

This repository showcases my data analysis and visualization projects using Excel, Python, R, and Tableau. Some projects are under NDA, so key figures and specific numbers are not included, but brief overviews and methodologies are provided. Feel free to explore and contact me for further details.

data-analysis data-science data-visualization excel portfolio python r tableau

Last synced: 06 May 2026

https://github.com/arsh-jafri/econostats

A real-time economic data visualization platform that helps track and analyze key economic indicators through interactive charts, custom datasets, and FRED API integration.

aws data-visualization economic-data economics elasticbeanstalk federal-reserve flask fred-api

Last synced: 06 May 2026

https://github.com/tetchen9/mapa

A map of a trip to Europe. Using d3.js, Eurostat dataset in GeoJSON.

cartography d3 d3-visualization data-visualization eurostat-data geojson maps typescript

Last synced: 06 May 2026

https://github.com/sahilmate/ebm-breast-cancer-classifier

This repository implements an Explainable Boosting Machine (EBM) model for breast cancer classification using scikit-learn and interpret. The project includes data preprocessing, model training, accuracy evaluation, and feature importance visualization.

breast-cancer-classification data-visualization explainable-boosting-machine feature-importance interpret machine-learning scikit-learn

Last synced: 06 May 2026

https://github.com/heyhaiden/mcp-ag-grid

Headless AG Grid server for advanced data visualization, manipulation, and export, seamlessly integrated with Claude Desktop.

ag-grid claude-desktop data-grid data-visualization headless-browser mcp open-source puppeteer

Last synced: 06 May 2026

https://github.com/urbanekda/upwork_dashboard

A data analysis project examining trends and patterns in the data science job market on Upwork. This project analyzes job postings, requirements, and market demands to provide insights into the freelance data science ecosystem.

data-analysis data-science data-science-projects data-visualization freelance jupyter-notebook python streamlit

Last synced: 07 May 2026

https://github.com/karlyndiary/coffee-shop-sales-analysis

Comprehensive analysis of coffee shop sales utilizing Pandas for data cleaning and exploratory data analysis (EDA), complemented by Streamlit for creating interactive data visualization dashboards.

data-analysis data-cleaning data-preprocessing data-visualization eda pandas streamlit streamlit-dashboard

Last synced: 07 May 2026

https://github.com/alekiie/streamlit-dashboard

A dashboard that uses Streamlit charts to create intuitive, easy-to-understand data visualizations.

data-visualization matplotlib numpy pandas python3 streamlit

Last synced: 07 May 2026

https://github.com/sitek94/react-interactive-scatter-plot

An interactive scatter plot made with React and D3

d3 data-visualization react

Last synced: 07 May 2026

https://github.com/warazkhan/airplane-crashes-and-fatalities-since-1908-

This project analyzes airplane crash data (1908–2008) ✈️📊 to uncover trends in aviation accidents, fatalities, and safety improvements. Using exploratory data analysis (EDA) and data visualization, we examine key factors influencing crashes, identify high-risk regions, and explore advancements in aviation safety.

data-analysis data-visualization exploratory-data-analysis

Last synced: 10 Jun 2026

https://github.com/jgohel9902/property-analytics-u.s.-owned-and-leased-properties

This project focuses on analyzing the U.S. Inventory of Owned and Leased Properties using datasets from Data.gov. It includes SQL queries for data cleaning and trend analysis, Excel for manipulation and reporting, Python for automated workflows and exploratory data analysis, and Power BI for creating interactive dashboards to visualize key insights

data-visualization dataanalysis excel jupyter-notebook pandas powerbi python sql

Last synced: 07 May 2026

https://github.com/nicovandenhooff/wids-datathon-2022

This repository contains my solution for the 2022 Women in Data Science Kaggle competition, which obtained a top 10% leaderboard standing.

catboost data-visualization datascience energy-consumption ensemble-learning exploratory-data-analysis kaggle lightgbm machine-learning scikit-learn women-in-data-science xgboost

Last synced: 07 May 2026

https://github.com/citizenlabsgr/openbudgetkw

Promoting a deeper understanding of the Kentwood, MI city budget process and city priorities.

csv data-visualization jade javascript react

Last synced: 07 May 2026

https://github.com/antrikshy/personalmovieanalysis

Finds interesting patterns in an IMDb ratings export; written as a Jupyter notebook, viz using Seaborn

data-visualization imdb jupyter-notebook movie-ratings pandas python seaborn

Last synced: 07 May 2026

https://github.com/ddihora1604/advanced_business_analytics_on_world_bank_global_financial_inclusion_data_2021

Bridging the Gaps in Financial Inclusion: Understanding the Cash-Credit Paradox, Divide between Cash and Digital Payments, and Financial Resilience.

advanced-excel business-analytics data-analysis data-engineering data-mining data-visualization database exploratory-data-analysis machine-learning preprocessing-data python

Last synced: 07 May 2026

https://github.com/athenyx04/arion

Smart animal weighing module for Demeter

data-visualization firebase livestock nextjs

Last synced: 07 May 2026

https://github.com/danyal-faheem/project-logs-analyzer

This repo contains scripts to analyze project logs and display some charts related to the data

data data-visualization matplotlib pandas python streamlit

Last synced: 07 May 2026

https://github.com/amarlearning/exploring-67-years-of-lego

In this project, I explore a database of every LEGO set ever built.

data-manipulation data-visualization importing-and-cleaning-data jupyter-notebook pandas python

Last synced: 07 May 2026

https://github.com/cnoret/hexa-watts

Interactive data visualization and machine learning app for energy consumption analysis and prediction in France, built with Streamlit. (Text in French)

data-visualization electricity-forecasting energy-analysis france machine-learning scikit-learn streamlit

Last synced: 07 May 2026

https://github.com/muthukumar0908/-singapore-resale-flat-prices-predicting

This project develops a machine learning model and deploys it as a user-friendly web application that predicts the resale prices of flats in Singapore.

data-analysis data-visualization mechine-learing plotly python streamlit

Last synced: 07 May 2026