An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/joaopalmeiro/b2-presentation

Materials for the presentation of the paper: "B2: Bridging Code and Interactive Visualization in Computational Notebooks" (Yifan Wu, Joseph M. Hellerstein, Arvind Satyanarayan — UIST 2020).

data-visualization interactive-visualizations jupyter jupyter-notebook jupyter-notebook-extension python r vega-lite xaringan

Last synced: 03 May 2026

https://github.com/akashparley/ipl-data-analysis

The IPL Data Analysis project focuses on extracting valuable insights from IPL match data using various data analytics techniques. By analyzing historical match outcomes, player performances, team comparisons, and venue statistics, the project visualizes trends and patterns through graphs like bar charts, line graphs, and scatter plots.

analytics data-visualization excel metplotlib numpy pandas python seaborn tableau-dashboards

Last synced: 04 May 2026

https://github.com/fnfkkengine/website-performance-data-analysis-project

🌐 Analyze website performance data to gain insights on user engagement, traffic trends, and optimize your digital presence effectively.

analysis analytics dashboard data-analyst data-collection data-preprocessing data-visualization dax funnel-analysis jupyter-notebook matplotlib mysql numpy paid-search power-query project-management python storytelling

Last synced: 04 May 2026

https://github.com/pragmaai/yelp-datapipeline

🍽️ Yelp Data Pipeline & Analytics Dashboard End-to-end data engineering pipeline processing Yelp dataset with Rust transforms, Apache Airflow orchestration, and interactive Streamlit analytics. Features business insights, user engagement analysis, and city performance comparisons. 🚀 Docker-ready • 📊 Interactive Dashboard • ⚡ High-performance R

airflow data-engineering data-pipeline data-visualization datafusion docker rust streamlit yelp yelp-dataset

Last synced: 04 May 2026

https://github.com/samruddhi3012/screen-time-analysis

Hi! This repo demonstrates a python project on Screen Time Analysis.

data-analysis data-visualization python

Last synced: 04 May 2026

https://github.com/rumana-amin/sql-data-warehouse-and-analytics-project

Building a data warehouse using with SQL Server, including ETL processes, data modeling, and analytics

data-cleaning data-visualization data-warehouse etl etl-pipeline sql-queries sql-server

Last synced: 09 Jun 2026

https://github.com/mindlessmuse666/titanic-data-visualization

Проект по визуализации данных о пассажирах Титаника с использованием библиотек Python Matplotlib, Seaborn и Plotly.

data-analysis data-visualization matplotlib pandas plotly python seaborn titanic

Last synced: 04 May 2026

https://github.com/leosimoes/udacity-communicate-data-findings-titanic

Project 5 of the Udacity Data Analyst Nanodegree Program. Data analysis with views of the titanic dataset.

data-science data-visualization python

Last synced: 04 May 2026

https://github.com/michal-radomski/3d-libraries_data-visualization_webgl

3D libraries, data visualization and WebGL courses from Udemy.com

3d d3js data-visualization threejs webgl

Last synced: 04 May 2026

https://github.com/fatihilhan42/the-office-eda

Data analysis study of my favorite sitcom, The Office (US).

data-analysis data-science data-visualization fatihilhan office python sitcom

Last synced: 04 May 2026

https://github.com/nitrocosmstudios/temperature-sonification

Experimental sonification of temperature sensor data revealing structured, speech-like patterns coinciding with geomagnetic disturbances. Includes original dataset processing, synthetic signal attempts, and full write-up.

audio-analysis data-visualization electromagnetic-interference emf environmental-monitoring open-science raspberry-pi signal-processing sonification temperature temperature-monitoring temperature-sensor usb-devices weird-science

Last synced: 04 May 2026

https://github.com/damisparks/become_data_analyst

Are you new to Data Analysis ? Here you will find simple notebook that will help through your journey. These are personal projects I work on and still working.

data data-analysis data-visualization matplotlib numpy pandas-tutorial

Last synced: 04 May 2026

https://github.com/ibrahimm7004/supermarket-sales-analysis

This project focuses on Data Mining techniques to gather inisights about customer behaviour regarding Supermarket Sales. Includes: Association Rule Mining, Temporal Patterns in customer behavior, Sequential Pattern Mining, Classification, Regression, and Outlier Detection.

apriori association-rules data-analysis data-mining data-science data-visualization fpgrowth python sales-analysis supermarket-sales

Last synced: 04 May 2026

https://github.com/sagarprajapat2004/data-analysis-visualization

Downloaded and analyzed a dataset from Kaggle using NumPy and Pandas created visualizations with Matplotlib and Seaborn developed a Flask web application to showcase data insights and conclusions.

data-analysis data-modeling data-visualization exploratory-data-analysis flask python statical-analysis

Last synced: 04 May 2026

https://github.com/jdanielgoh/abuso-escuelas

Codigo fuente del proyecto depredadores en las aulas. Construido con Nuxt.js, three.js y d3.js

d3js data-visualization ddj threejs

Last synced: 09 Jun 2026

https://github.com/halyusa16/e-commerce-analysis

This project analyzes a public e-commerce dataset to uncover valuable insights and answer critical business questions. The dataset contains customer, product, order, and transaction details, providing a comprehensive view of the e-commerce platform's operations.

data-analysis data-cleaning data-exploration data-visualization self-project

Last synced: 09 Jun 2026

https://github.com/ljadhav25/logistic-regression-data-science-

Logistic regression estimates the probability of an event occurring, such as voted or didn’t vote, based on a given data set of independent variables.

data-analysis data-science data-visualization logestic-regression machine-learning

Last synced: 04 May 2026

https://github.com/balajimohan18/sales-forecasting-datascience-project

Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.

data-analytics data-science data-testing data-visualization forecasting forecasting-models machine-learning model-evaluation predictive-modeling python regression-algorithms salesforecast scipy sklearn-library supervised-learning

Last synced: 04 May 2026

https://github.com/bishopce16/pyber_analysis

The purpose of this project was to complete an exploratory analysis and create visualizations of the 2019 ride sharing data from PyBer.

data-analysis data-visualization jupyter-notebook matplotlib pandas python

Last synced: 04 May 2026

https://github.com/flytomarsz/bike-sharing-system-analysis

This analysis project aim to identify bike rental's behavior in 2012 from Capital Bikeshare system, Washington D.C., USA. This project is part of my Data Analysis study at Dicoding.

data-analysis data-visualization jupyter-notebook python streamlit

Last synced: 04 May 2026

https://github.com/thuongtruong1009/charts

📊 Visualize data with common charts

chart chartjs csv d3 data-visualization nextjs react reactjs typescript

Last synced: 05 May 2026

https://github.com/dhruvsrikanth/basic-data-science

A short Data Science Project I took up for fun! This is a data analysis based on a dataset I created to predict the distribution of wealth within an economy as well as several characteristics of each class within society!

analysis data-analysis data-pipeline data-science data-visualization machine-learning matplotlib pandas python seaborn sklearn

Last synced: 05 May 2026

https://github.com/s4rrar/israel-gaza-war

Human losses of the Israeli genocide in Gaza and the West Bank in numbers (October 7th War Victims)

data-engineering data-science data-visualization datascience jupyter jupyter-notebook python

Last synced: 05 May 2026

https://github.com/riyouuyt/investigate-hotel-business-using-data-visualization

Explore hospitality data, visualizing customer behavior in hotel reservations and its impact on cancellations for strategic insights.

businessreporting data-visualization hotel-booking jupyter-notebook matplotlib-pyplot presentation python

Last synced: 05 May 2026

https://github.com/dona-eric/projet-etudiant

Objectif : Étudier et Analyser les facteurs qui influencent l'engagement des étudiants et les niveaux de risque dans un contexte éducatif .

data-science data-visualization fastapi machine-learning sante-etudiant streamlit

Last synced: 05 May 2026

https://github.com/soupu07/ibm-employee-attrition-prediction

The aim of this project analyzes factors driving IBM employee attrition and predicts those likely to leave, helping the organization understand turnover causes and improve retention and performance.

data-science data-science-projects data-visualization machine-learning python python-data-analysis python-programming-language python-project

Last synced: 05 May 2026

https://github.com/sundarmd/digital_twin_for_li-ion_batteries

Digital Twin for Li-ion batteries on AWS built using S3, EC2, SageMaker, Redshift, Terraform, QuickSight

aws-ec2 aws-s3 data-visualization iot python3 pytorch sql terraform

Last synced: 05 May 2026

https://github.com/shruthin4/ipl-cricket-analysis-2007-2024

In-depth IPL Cricket Data Analysis (2007–2024) with visual insights on teams, players, and match outcomes.

analysis cricket data-visualization eda ipl python sports-analytics

Last synced: 05 May 2026

https://github.com/kevinwood15/python_ml_neuralnetwork_project

I build a neural network to evaluate the CIFAR-10 dataset and achieve above 50% accuracy

cifar10 data-science data-visualization deep-learning neural-network python pytorch

Last synced: 05 May 2026

https://github.com/mito-ds/mitosheet_helper_config

The mitosheet_helper_config package used by enterprises to configure the mitosheet package.

data data-analytics data-science data-visualization jupyter pandas python

Last synced: 05 May 2026

https://github.com/femincan/d3-bar-chart

My solution for the Visualize Data with a Bar Chart project on FCC.

css3 d3js data-visualization html5 javascript

Last synced: 06 May 2026

https://github.com/robmcelhinney/roman-emperors

D3.js Bar Charts of Roman Emperors' Age and length of Reign from 63 BCE to 395 CE

d3 d3js data-visualization emperor roman rome

Last synced: 06 May 2026

https://github.com/benjaminrose/data-analysis-book

A Jupyter Book for my Spring 2025 PHY 5381 class on Data Analysis

book data-analysis data-science data-visualization jupyter-book open-book python r statistics-course

Last synced: 06 May 2026

https://github.com/rishisolanke/twitter-sentiment-analysis-using-machine-learning-

A research project that classifies tweets as positive, negative, or neutral using ML algorithms (Logistic Regression, Naïve Bayes, SVM) with NLP preprocessing.

data-science data-visualization logistic-regression machine-learning ml-models naive-bayes natural-language-processing nlp scikit-learn sentiment-analysis svm text-classification twitter-data

Last synced: 06 May 2026

https://github.com/hashkanna/youtube-wrapped-2025

🎬 Spotify Wrapped-style dashboard for YouTube watch history. Built with React + Tessl spec-driven development for EF x Tessl x Anthropic Hackathon.

analytics data-visualization hackathon react recharts tessl vite wrapped youtube

Last synced: 06 May 2026

https://github.com/deaneeth/aqi-monitoring-system

Real-time air quality monitoring system with interactive maps, role-based access control, and automated AQI data simulation. Built with Laravel, MySQL, and Leaflet.js.

air-quality alerts-system data-visualization fullstack geospatial interactive-maps laravel leaflet mysql php phpunit simulation web-application

Last synced: 06 May 2026

https://github.com/sankaran-s2001/us-traffic-accidents-analysis-python-eda

Exploratory data analysis of US traffic accidents from 2016-2023, analyzing patterns by time, location, weather, and severity using Python data science libraries.

data-analysis data-science data-visualization eda matplolib numpy pandas python

Last synced: 06 May 2026

https://github.com/loosenthedark/strikeforce

StrikeForce is an interactive frontend site that processes and presents data on Premier League goalscorers in a meaningful, easy-to-digest form

ajax-request api autocomplete-search bootstrap4 chartjs css3 css3-animations data-visualization emailjs entypo fontawesome frontend html5 interactive javascript jquery tablesorter

Last synced: 06 May 2026

https://github.com/abhinav330/customer-behavior-analysis-linear-regression

This repository explores customer behavior data for an NYC clothing company with both a mobile app and website. They want to understand which platform drives higher sales.

data-analysis data-science data-visualization eda exploratory-data-analysis jupyter jupyter-notebook linear-regression machine-learning machine-learning-algorithms machinelearning-python numpy pandas python regression-analysis

Last synced: 06 May 2026

https://github.com/gerhynes/d3-median-age-histogram

A D3 histogram displaying UN data on median ages. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 06 May 2026

https://github.com/filip-kustura/exchange-rates-visualizer

This project is a currency exchange rate trend visualizer, enabling interactive exploration of historical exchange rate data. Built with JavaScript, jQuery, AJAX and Canvas, it features dynamic data retrieval, interactive graphs and responsive controls.

ajax api canvas css currency-exchange data-visualization exchange-rates html javascript jquery

Last synced: 06 May 2026

https://github.com/adesartika33/proyek-analisis-data-dataset-iris

Proyek ini bertujuan untuk menganalisis dataset Iris, salah satu dataset klasik dalam bidang Machine Learning dan Data Science. Dataset ini terdiri dari 150 sampel bunga Iris dari tiga spesies (Setosa, Versicolor, dan Virginica)

classification data-science data-visualization eda exploratory-data-analysis iris-dataset machine-learning python random-forest scikit-learn

Last synced: 06 May 2026

https://github.com/harryrlk/data_analysis_showcase

This repository showcases my data analysis and visualization projects using Excel, Python, R, and Tableau. Some projects are under NDA, so key figures and specific numbers are not included, but brief overviews and methodologies are provided. Feel free to explore and contact me for further details.

data-analysis data-science data-visualization excel portfolio python r tableau

Last synced: 06 May 2026

https://github.com/arsh-jafri/econostats

A real-time economic data visualization platform that helps track and analyze key economic indicators through interactive charts, custom datasets, and FRED API integration.

aws data-visualization economic-data economics elasticbeanstalk federal-reserve flask fred-api

Last synced: 06 May 2026

https://github.com/sahilmate/ebm-breast-cancer-classifier

This repository implements an Explainable Boosting Machine (EBM) model for breast cancer classification using scikit-learn and interpret. The project includes data preprocessing, model training, accuracy evaluation, and feature importance visualization.

breast-cancer-classification data-visualization explainable-boosting-machine feature-importance interpret machine-learning scikit-learn

Last synced: 06 May 2026

https://github.com/heyhaiden/mcp-ag-grid

Headless AG Grid server for advanced data visualization, manipulation, and export, seamlessly integrated with Claude Desktop.

ag-grid claude-desktop data-grid data-visualization headless-browser mcp open-source puppeteer

Last synced: 06 May 2026

https://github.com/bartosz-ziolkowski/social_data

Data analysis of San Francisco's crime data set from 2003 to 2018

data-science data-visualization jupyter-notebook numpy pandas python

Last synced: 06 May 2026

https://github.com/urbanekda/upwork_dashboard

A data analysis project examining trends and patterns in the data science job market on Upwork. This project analyzes job postings, requirements, and market demands to provide insights into the freelance data science ecosystem.

data-analysis data-science data-science-projects data-visualization freelance jupyter-notebook python streamlit

Last synced: 07 May 2026

https://github.com/karlyndiary/coffee-shop-sales-analysis

Comprehensive analysis of coffee shop sales utilizing Pandas for data cleaning and exploratory data analysis (EDA), complemented by Streamlit for creating interactive data visualization dashboards.

data-analysis data-cleaning data-preprocessing data-visualization eda pandas streamlit streamlit-dashboard

Last synced: 07 May 2026

https://github.com/ganesh774218/eda-book-store

Exploratory data analysis on a book store dataset to uncover sales trends, popular genres, and top publishers.

data-visualization datacleaning datamanipulation eda matplotlib numpy pandas python pythonp pythonproject seaborn

Last synced: 07 May 2026

https://github.com/alekiie/streamlit-dashboard

A dashboard that utilizes the power of streamlit charts to create intuitive and easy to understand charts for data visualization.

data-visualization matplotlib numpy pandas python3 streamlit

Last synced: 07 May 2026

https://github.com/harshsahu23/wabtec3_brakesgui_shared

💻 App to visualize error logs and dump files generated by the braking system of locomotives 🚅

csv data-visualization graphs pandas plotly python streamlit visualization

Last synced: 07 May 2026

https://github.com/warazkhan/airplane-crashes-and-fatalities-since-1908-

This project analyzes airplane crash data (1908 - 2008)✈️📊 to uncover trends in aviation accidents, fatalities, and safety improvements. Using exploratory data analysis (EDA) and data visualization, we examine key factors influencing crashes, identify high-risk regions, and explore advancements in aviation safety.

data-analysis data-visualization exploratory-data-analysis

Last synced: 10 Jun 2026

https://github.com/nicovandenhooff/wids-datathon-2022

This repository contains solution for the 2022 Women in Data Science Kaggle competition that I participated in, which obtained a top 10% leaderboard standing.

catboost data-visualization datascience energy-consumption ensemble-learning exploratory-data-analysis kaggle lightgbm machine-learning scikit-learn women-in-data-science xgboost

Last synced: 07 May 2026

https://github.com/sumit-sinha9/t20wc_2022-best-11

Finalizing best 11 players from 2022 T20 world cup using Python Web scraping, Pandas and Power BI

data-visualization pandas powerbi python webscraping

Last synced: 07 May 2026

https://github.com/antrikshy/personalmovieanalysis

Finds interesting patterns in an IMDb ratings export; written as a Jupyter notebook, viz using Seaborn

data-visualization imdb jupyter-notebook movie-ratings pandas python seaborn

Last synced: 07 May 2026

https://github.com/ddihora1604/advanced_business_analytics_on_world_bank_global_financial_inclusion_data_2021

Bridging the Gaps in Financial Inclusion: Understanding the Cash-Credit Paradox, Divide between Cash and Digital Payments, and Financial Resilience.

advanced-excel business-analytics data-analysis data-engineering data-mining data-visualization database exploratory-data-analysis machine-learning preprocessing-data python

Last synced: 07 May 2026

https://github.com/leandrocollares/employment-insurance-beneficiaries

A responsive line chart that shows regular Employment Insurance beneficiaries in Canada between 2019 and 2021

d3 data-visualization svelte

Last synced: 07 May 2026

https://github.com/danyal-faheem/project-logs-analyzer

This repo contains scripts to analyze project logs and display some charts related to the data

data data-visualization matplotlib pandas python streamlit

Last synced: 07 May 2026

https://github.com/muthukumar0908/-singapore-resale-flat-prices-predicting

This project is to develop a machine learning model and deploy it as a user-friendly web application that predicts the resale prices of flats in Singapore.

data-analysis data-visualization mechine-learing plotly python streamlit

Last synced: 07 May 2026

https://github.com/satyam4229/identify-employee-attrition

This is the model where we predict the attrition of the employees of the company by checking there records and all. In the given dataset, we have the features like salary, environment, age, gender and their experience.

data-analysis data-science data-visualization jupyter-notebook kaggle python

Last synced: 08 May 2026

https://github.com/moustafamohamed01/mall-customer-segmentation-data

Customer segmentation using K-Means clustering based on annual income and spending score.

data-science data-visualization k-means-clustering machine-learning python scikit-learn unsupervised-learning

Last synced: 08 May 2026

https://github.com/nishumehta/sales-analysis-project

This project aims to analyze sales performance using Excel, SQL, Python, Tableau, and Power BI. The goal is to extract insights from sales data, identify trends, and visualize key performance indicators (KPIs).

data-cleaning data-visualization eda excel matplotlib-pyplot pandas python3 tableau-dashboards

Last synced: 08 May 2026

https://github.com/bnvulpe/regression-and-time-series

This work centers on assessing and comparing predictive models for regression and time series prediction using specific datasets, with the goal of selecting the most effective methodology for unseen test data.

colab data-analysis data-analysis-python data-science data-visualization forecasting jupyter-notebook machine-learning model-evaluation predictive-modeling python regression sarima sarimax time-series-analysis time-series-analysis-and-forecasting

Last synced: 08 May 2026

https://github.com/moiri-gamboni/vintedcalculator

📊 Smart pricing calculator for Vinted sellers - optimize your listings based on storage, seasonality, and market dynamics

clothing-resale data-visualization e-commerce inventory-management pricing-calculator react recharts sales-optimization seasonal-pricing shadcn-ui tailwindcss typescript vinted

Last synced: 08 May 2026

https://github.com/dsaikiran01/seismomap

An interactive React + Leaflet web app that visualizes real-time global earthquakes from the USGS API with live map markers, magnitude filtering, dark mode, and responsive UI.

data-visualization earthquakes geoscience material-ui react react-leaflet tailwindcss usgs-api vite

Last synced: 08 May 2026

https://github.com/abhash-rai/regression-car-price-prediction

This repository contains my first complete data science project from web scrapping for data to data preprocessing, cleaning, exploratory data analysis, model training and deployment.

data data-science data-visualization eda exploratory-data-analysis machine-learning neural-network prediction prediction-model regression

Last synced: 08 May 2026

https://github.com/js-konda/naturaldisasterseda

The project repository for the Exploratory Data analysis of natural disasters done as part of ECE143 course at UCSD

data-science data-visualization pandas python visualization

Last synced: 08 May 2026

https://github.com/alz-11/ftoc

Fahrenheit to Celsius conversion analysis, comparing accurate and approximate methods with visualizations and error analysis.

approximation celsius conversion data-visualization error-analysis fahrenheit fahrenheit-to-celsius jupyter jupyter-notebook matplotlib numpy python python-3 python3

Last synced: 08 May 2026

https://github.com/rightfulcode/retail-sales-breakdown

Time Series Analysis of Walmart Retail Sales – Internship project analyzing sales trends, seasonal patterns, and revenue breakdowns using Pandas, Matplotlib, and Seaborn.

data-analytics data-visualization elevvo-internship matplotlib pandas python retail-sales seaborn time-series-analysis

Last synced: 08 May 2026

https://github.com/vinit714/player-retention-analysis

A complete Streamlit + Machine Learning + SHAP + NLP project to analyze, predict, and improve player retention in games. This project simulates a game environment, models churn behavior, and provides insights using SHAP, NLP word clouds, and strategy simulators.

churn-prediction classification data-visualization eda feature-engineering game-analytics game-data-analysis gaming-analytics machine-learning model-interpretability nlp pandas player-retention python retention-analysis sckiit-learn shap streamlit wordcloud

Last synced: 08 May 2026

https://github.com/satvikpraveen/numpymasterpro

A hands-on, production-ready toolkit to master NumPy — from first principles to real-world applications. Includes modular Jupyter notebooks, reusable utility scripts, cheatsheets, and advanced projects like K-Means clustering from scratch.

broadcasting data-analysis data-science data-source data-visualization jupyter-notebook kmeans-clustering linear-algebra machine-learning matrix-algebra numerical-computation numpy numpy-broadcasting numpy-examples numpy-tutorial open-source python scientific-computing standardization vectorization

Last synced: 08 May 2026

https://github.com/samkazan/fraud-detection-ml

Machine learning models for enhanced fraud detection in e-commerce transactions, exploring feature engineering, distance prediction, and clustering analysis.

clustering data-science data-visualization dataanalytics dbscan eda hierarchical-clustering kmeans-clustering knn-imputer matplotlib mlxtend python scikit-learn seaborn xgboost

Last synced: 08 May 2026