An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/albanecoiffe/uber_data_visu_streamlit

Tableau de bord interactif avec Streamlit permettant d'explorer les données des trajets Uber de janvier 2015 à New York.

data-visualization streamlit

Last synced: 02 May 2026

https://github.com/jdfoster11/northwest_territories_collision_factors

Using Python & Tableau to perform a statistical and regression analysis on a NorthWest Territories Vehicle Collision Dataset

clustering-algorithm co-lab data-science data-visualization heatmap-visualization html python3 tableau

Last synced: 15 Mar 2025

https://github.com/constantinoschillebeeckx/meowcow

Quickly visualize multidimensional data.

d3js data-visualization nvd3 viz

Last synced: 28 Jun 2025

https://github.com/sundarsharma332/codeliner

code snippet highlighter with custom line selection

code data-visualization program snapshot visualizer

Last synced: 10 Mar 2026

https://github.com/sayamalt/steel-energy-consumption-prediction-using-pyspark

Successfully established a machine learning model using PySpark which can precisely predict the energy consumption of the steel industry, up to an r2 score of approximately 99.5%.

apache-spark big-data-analytics big-data-processing cross-validation data-visualization exploratory-data-analysis hyperparameter-tuning machine-learning model-training-and-evaluation python regression spark sql

Last synced: 10 Mar 2026

https://github.com/ilke-kas/multivariate-data-analysis

A curated collection of R-based data analysis projects applying regression modeling, clustering, dimensionality reduction, multivariate statistics, and classification. Each project showcases practical data science techniques, interpretability, and domain insights using real-world and academic datasets.

classification data-analysis data-visualization dimensionality-reduction machine-learning multivariate-analysis r regression statistics

Last synced: 05 Oct 2025

https://github.com/seblehner/feldprakt

Collection of plotting routines for a field exercise work using different measurement tools and Hobo weather stations.

data-analysis data-visualization jupyter-notebook python

Last synced: 05 Oct 2025

https://github.com/lukakerr/us-surnames

US Surname data visualisation using R. Displays top 25 US surnames and race/ethnic percentage per name.

data data-visualization r

Last synced: 05 Oct 2025

https://github.com/tolumie/exploratory-data-analytics-projects

Exploratory Data Analytics – A collection of projects covering data exploration, feature engineering, hypothesis testing, and predictive modeling across diverse datasets, including insurance, real estate, laptops, cars, COVID-19, and the Olympics.

data-analysis data-visualization data-wrangling exploratory-data-analysis-eda feature-engineering hypothesis-testing machine-learning matplotlib numpy pandas predictive-modeling python seaborn statistical-analysis

Last synced: 11 Apr 2026

https://github.com/dibsthegreat/titantic-dataset-analytics

DASC4850 Final Project where I did EDA to determine the survivability of Titanic guests depending on Age, Gender, Wealth, etc.

data-science data-visualization matplotlib numpy pandas python random-forest-classifier

Last synced: 13 Apr 2026

https://github.com/sayamalt/customer-churn-prediction

Successfully established a machine learning model which can predict whether any given customer currently utilizing the products and services offered by a company will churn at anytime in the future or not, depending upon a set of unique features/characteristics pertaining to that specific individual, to a great level of accuracy.

classification data-visualization exploratory-data-analysis feature-engineering hyperparameter-tuning model-deployment model-evaluation model-optimization model-training supervised-machine-learning

Last synced: 09 Nov 2025

https://github.com/subhamghimire/dataanavis

Learning Data analysis and visualization

data-analysis data-science data-visualization dataset

Last synced: 06 Oct 2025

https://github.com/leosimoes/datascienceacademy-powerbi-clinicadebi

Atividades do curso Análise de Dados com Microsoft Power BI e Clínica de BI da Data Science Academy.

dashboards data-analysis data-visualization microsoft-power-bi power-bi

Last synced: 05 Jan 2026

https://github.com/minhtungonep/android-traffic-analysis

Android malware detection project analyzing network traffic patterns in a telecommunications context. Uses statistical hypothesis testing and data visualization to evaluate traffic features like DNS query times, TCP packets, and volume bytes for distinguishing between benign and malicious Android applications.

android-library bachelor-project bachelor-thesis cybersecurity cyface data-visualization hypothesis-testing malware-detection matplotlib network-traffic numpy pandas python scipy sdk statistical-analysis telecommunications voip-security

Last synced: 09 May 2026

https://github.com/deliprofesor/kizbasina_odev4_trafik

Bu proje, 2005-2014 yılları arasında İngiltere’de gerçekleşen trafik kazalarına ait kapsamlı veri setlerini kullanarak trafik kazalarının sebeplerini, şiddetini ve zaman içindeki değişimini analiz etmektedir.

data-science data-visualization istatistik matplotlib pandas python statistics

Last synced: 14 Apr 2026

https://github.com/subhadipsinha722133/diamond-price-predction

This project applies 🤖Machine Learning techniques to analyze these features and build a predictive model that estimates the selling price of diamonds

data-visualization machine-learning pandas pkl-model python random-forest sklearn streamlit

Last synced: 11 Apr 2026

https://github.com/carmendev/covid-19-tracker

Data visualization React.js project deployed with Firebase. Daily statistics about current, recovered and closed cases coming from an API.

data-visualization firebase numeral reactjs

Last synced: 11 Apr 2026

https://github.com/pekiiipy/credit-card-fraud-detection

🔍 Detect credit card fraud efficiently using advanced machine learning techniques, achieving high accuracy rates on a large dataset of transactions.

adasyn anomaly-detection class-imbalance credit-card-fraud data-visualization fraud fraud-detection frauddetection kaggle keras logistic-regression plotly-python postgresql random-forest scikit-learn tensorflow tree-model xgboost

Last synced: 11 Apr 2026

https://github.com/spear97/montecarlo-python

This was a project for my Programming Language Concepts Class were we were assigned to create a Monty Carlo Simulation using Python.

data-science data-visualization matplotlib-figures matplotlib-python montecarlo pandas-library pandas-python python python-3

Last synced: 23 Mar 2025

https://github.com/luizassimoes/q5ga-latency-and-throughput

Quick 5G Analyser: PyQT5 software developed to help with simple graphical analysis and chart generating for ping and iperf3 tests.

data-analysis data-visualization pyqt5 python

Last synced: 13 Jun 2026

https://github.com/upes-open/open-cryptocurrency-analysis

A web app to visualise and predict the cryptocurrency’s impact by using Web scraping, data exploration, EDA and Data Visualization.

analysis cryptocurrency data-analysis data-science data-visualization jupyter-notebook streamlite visualization

Last synced: 15 Apr 2025

https://github.com/samjoesilvano/adventureworks_sales_performance_dashboard

Createed an interactive 4-page dashboard for AdventureWorks that visualizes key sales metrics—including revenue, profit, orders, and return rates—across 2020 to 2022. Featuring dynamic geographic analysis and detailed customer insights, this dashboard empowers data-driven decision-making and enhances business performance.

business-intelligence data-analysis-python data-analytics data-driven-decisions data-modeling data-visualization geographic-analysis interactive-dashboards kpi-metrics powerbi sales-performance-analysis

Last synced: 05 Jan 2026

https://github.com/haonamnguyen/costumer-shopping-trends-analysis

This project analyzes a synthetic dataset of customer shopping behavior to see key trends and insights. Using SQL and Tableau, the analysis focuses on customer demographics, purchase patterns, and preferences, including age distribution, payment methods, shipping types, and top product categories.

data-analysis data-visualization sql tableau

Last synced: 05 Jan 2026

https://github.com/shivani0126/resturant_rating_analysis

Restaurant ratings Analysis is a project where real consumers from 2012, including additional information about each restaurant and their cuisines, and each consumer and their preferences are visualised through Power BI dashboard.

dashboard data-visualization dataanalysis datamodeling dataprep dax-functions powerbi

Last synced: 27 Jan 2026

https://github.com/jeffbla/image-and-porosity-analysis-by-dash-plotly

This app uses Dash to explore core data. Compare images in the first row and interact with a line chart in the second row. Hovering reveals details, and clicking updates the images to match the selected data point.

dashboard data-visualization dicom-images interactive-visualizations plotly-dash xct

Last synced: 21 Jul 2025

https://github.com/shaolans/projet_algav_trie

Implementation of the Patricia Trie and the Hybrid Trie in Java

algorithms data-structures data-visualization graphviz-dot hybrid-training java patricia-tree tree trie

Last synced: 11 Jun 2026

https://github.com/ibrahimceyisakar/hotel-finder

Hotel finder system with Python includes data gathering, analyzing, and visualization.

data-analysis data-gathering data-visualization pandas plotly python selenium streamlit

Last synced: 06 May 2026

https://github.com/loosenthedark/strikeforce

StrikeForce is an interactive frontend site that processes and presents data on Premier League goalscorers in a meaningful, easy-to-digest form

ajax-request api autocomplete-search bootstrap4 chartjs css3 css3-animations data-visualization emailjs entypo fontawesome frontend html5 interactive javascript jquery tablesorter

Last synced: 06 May 2026

https://github.com/andersoncrs/arboles_de_decision_calidad_del_vino

Contiene un análisis detallado de la calidad del vino utilizando un modelo de clasificación basado en árboles de decisión. Incluye la exploración de datos, detección y manejo de valores atípicos, análisis Univariado y Bivariado, y la creación y evaluación de un modelo predictivo. El objetivo principal es predecir la calidad del vino.

data-analysis data-science data-visualization machine-learning matplotlib seaborn sklearn tree-decision

Last synced: 20 May 2026

https://github.com/victorowinoke/custmer-segmentation-using-rfm-python-

Customer Segmentation using the Recency, Frequency and Monetary Values

customer-segmentation data data-visualization python3 science time-series-analysis

Last synced: 26 May 2026

https://github.com/jdede1/data-analysis-visualization-assignment-5

INFO 526 — Data Analysis and Visualization, Assignment 5 (Dashboard Reports — Iowa Liquor Sales). Part of the Master’s in MIS/ML program at the University of Arizona. Includes positive and negative dashboards showing key KPIs: top products/vendors driving sales vs bottom products/vendors hindering sales.

dashboard data-visualization matplotlib pandas seaborn

Last synced: 16 Apr 2026

https://github.com/azaz9026/myntra_review_project

Myntra Scraper Project Project Overview: The Myntra Scraper Project is designed to extract product data from the Myntra website. This tool enables users to gather information such as product names, prices, descriptions, ratings, and images for analysis, comparison, or personal use.

data-science data-structures data-visualization filesystem github mogodb mogoose python3 strreamlit web-scraping

Last synced: 10 Apr 2026

https://github.com/hiteshsahu/visual-studio-hybrid-application

Visual Studio and Java Script full duplex communication

data-visualization html5 javascript kendo-ui visual-basic

Last synced: 09 Oct 2025

https://github.com/miteshgupta07/streamlit-machine-learning-app

A Streamlit application for interactive exploratory data analysis (EDA) and data visualization, offering dynamic tools to analyze and visualize machine learning datasets.

data-visualization python streamlit

Last synced: 27 Apr 2026

https://github.com/mr-chang95/webpage_abtest_analysis_udacity

A/B Testing Project for Udacity's Data Analyst Nanodegree Program. Using Python in Jupyter Notebook.

abtesting data-science data-visualization matplotlib pandas python webpage

Last synced: 11 Apr 2026

https://github.com/lixx21/tableau_netflix_movies_tvshows_2021

Visualize netflix movies and tv shows in 2021

data-visualization dataset netflix tableau

Last synced: 19 Jan 2026

https://github.com/jameshulse/are-we-winning

An interactive map representing which countries are winning or losing their battle against Covid-19

coronavirus coronavirus-tracking covid-19 data-visualization vue vuejs

Last synced: 13 Jun 2026

https://github.com/filip-kustura/exchange-rates-visualizer

This project is a currency exchange rate trend visualizer, enabling interactive exploration of historical exchange rate data. Built with JavaScript, jQuery, AJAX and Canvas, it features dynamic data retrieval, interactive graphs and responsive controls.

ajax api canvas css currency-exchange data-visualization exchange-rates html javascript jquery

Last synced: 06 May 2026

https://github.com/alokthedataguy/financial-friend-web-app

Financial Friend is a privacy-first web app that takes a user’s payment statement (PhonePe, GPay, bank CSV/PDF), cleans and understands it, and then talks back like a friend—giving simple, human answers (plus a few tiny visuals) to questions people actually care about.

data-analysis data-science data-visualization fastapi finance-management financial-analysis financial-data insights personal-finance-and-data-anlaysis python react

Last synced: 14 Apr 2026

https://github.com/priyanshubiswas-tech/priyanshubiswas-tech

SWE-Data Engineer @ EDN | Kubeflow-MLOps | Kubernetes | Databricks | AWS EMR-Lambda-Glue, Eventbridge, SQS-SNS | OCI Multi-Cloud Architect Professional | GCP GA4 | Gen AI | IEEE Brand Amb. | Ex-Chair, PES | Ex-Sec, SB

apache-spark aws data-analysis data-engineering data-visualization dbt hadoop kubernetes python3 sql

Last synced: 21 Jan 2026

https://github.com/ianhaggerty/final-capstone

This represents the final capstone project in my HyperionDev Data Science (fundamentals) course. A dataset of Amazon customer reviews is analysed using natural language processing.

amazon data-analytics data-science data-visualization dataset matplotlib nlp nlp-machine-learning numpy pandas plotly reviews seaborn spacy tabulate textblob wordcloud

Last synced: 19 Jan 2026

https://github.com/naomiwolfe/golden-isles-dashboard2

Interactive tourism analytics dashboard for Georgia's Golden Isles

analytics chartjs dashboard data-visualization georgia golden-isles tailwindcss tourism

Last synced: 05 Oct 2025

https://github.com/sabdikay/analysis-of-biodiversity

This project analyzes biodiversity data from the National Parks Service, focusing on species in various park locations. Conducted in Jupyter Notebook, it uses pandas, matplotlib, NumPy, seaborn, and chi2_contingency for analysis and visualization.

data-analysis data-analysis-python data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 14 Apr 2026

https://github.com/rightfulcode/customer-segmentation-rfm

This project performs customer segmentation using Recency, Frequency, and Monetary (RFM) metrics to identify key customer groups and provide actionable marketing insights.

data-analysis-python data-visualization elevvo-internship jupyter-notebook matplotlib pandas python rfm-analysis seaborn

Last synced: 08 May 2026

https://github.com/athulrajhere/spendwise-expensetracker-react-redux

SpendWise is a MERN full-stack expense tracker that helps users monitor and manage their financial activities. It features intuitive dashboards, analytics, account management, and secure user authentication — all designed to provide a smooth and insightful financial tracking experience. Built with React (Vite) and Redux Toolkit on the frontend.

chartjs dashboard data-visualization expense-manager expense-tracker express framer-motion fullstack-app jwt-authentication mern-project mern-stack mern-stack-app mongodb mongoose nodejs personal-finance react redux-toolkit typescript vite

Last synced: 08 Apr 2026

https://github.com/arsh-jafri/econostats

A real-time economic data visualization platform that helps track and analyze key economic indicators through interactive charts, custom datasets, and FRED API integration.

aws data-visualization economic-data economics elasticbeanstalk federal-reserve flask fred-api

Last synced: 06 May 2026

https://github.com/ddeepanshu-997/datascience---youth-tobacco-survey-yts-data

Analysis of Cigarette Smoking Prevalence Among Middle and High School Students (1999-2017)

analysis data-science data-visualization graph matplotlib pandas python survey visualization

Last synced: 06 May 2026

https://github.com/salma-mamdoh/the-android-app-market-on-google-play-project

My project aims to practice Data Analysis and Data Visualization on DataCamp

data-analysis data-visualization datacamp jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/controldata23/automobiles-data-exploration

An Exploratory Data Analysis done on an Automobiles dataset from kaggle

data-exploration data-visualization eda jupyter-notebook matplotlib python-data-analysis

Last synced: 19 Jan 2026

https://github.com/gorlix/sp-grafana-bridge

A lightweight, real-time data bridge to export Super Productivity tasks to InfluxDB v2 for advanced analytics on Grafana. Transform your time-tracking into actionable insights.

data-visualization grafana influxdb-v2 plugin productivity-tool super-productivity superproductivity time-tracking

Last synced: 31 May 2026

https://github.com/heyhaiden/mcp-ag-grid

Headless AG Grid server for advanced data visualization, manipulation, and export, seamlessly integrated with Claude Desktop.

ag-grid claude-desktop data-grid data-visualization headless-browser mcp open-source puppeteer

Last synced: 06 May 2026

https://github.com/bhavinpatel4199/machine-learning-programming

This repository serves as a central hub for various machine learning projects and experiments. It contains multiple sub-repositories, each focusing on different aspects of machine learning, from data preprocessing to advanced deep learning techniques.

data-structures data-visualization machine-learning machine-learning-algorithms pandas-dataframe python3 sklearn

Last synced: 19 Jan 2026

https://github.com/azaz9026/email-spam-detection

Welcome to the Email Spam Detection project! This repository provides a machine learning model for detecting spam emails using a Naive Bayes classifier and a simple web interface built with Streamlit.

data-analysis data-cleaning data-structures data-visualization deep-learning machine-learning python sql streamlit

Last synced: 14 Apr 2026

https://github.com/vinay-jose/territorial-sales-dashboard

EDA was carried out in the sales data of Atliq Technologies and a Dashboard was created in PowerBI to draw insights.

data-analysis data-visualization powerbi-desktop sql

Last synced: 11 Oct 2025

https://github.com/mouradhamzaoui/End-To-End-MLOPS-Airline-Project

This project aims to predict the number of passengers, freight quantity, and mail quantity for American airlines operating between Canadian and U.S. airports using an MLOps approach. It involves automating the data pipeline, from data extraction and preparation to model training and evaluation, leveraging tools like DVC, MLflow, and Docker for vers

data-visualization docker dvc github-actions machine machine-learning-algorithms mlflow

Last synced: 14 Apr 2026

https://github.com/ahsankhizar5/titanic-eda-visualization

Exploratory Data Analysis and Visualization on the Titanic Dataset using Python, Pandas, Matplotlib, and Seaborn to uncover survival patterns.

data-analysis data-science data-visualization eda kaggle machine-learning matplotlib pandas python seaborn titanic-dataset

Last synced: 31 May 2026

https://github.com/shubham200137/cyclistic-case-study

This repository contains a case study for Google's Data Analytics Professional Certificate, focusing on Cyclistic, a fictional bike sharing company in Chicago. The case study aims to drive growth by converting casual riders into members through a marketing strategy.

data-analysis data-visualization numpy-python pandas-python presentation-slides sql tableau

Last synced: 11 Jun 2026

https://github.com/maettuu/project-beatblend

Repository for the Master's Project 2024 on Visualizing and Explaining Sequential Song Recommendations through Data Humanism

audio-features aws ci-cd content-based-recommendation data-visualization discogs-api docker fastapi full-stack jwt-token masters-project postgres python recommendation-system redis rest-api spotify-api visual-data vuejs websocket

Last synced: 11 Apr 2026

https://github.com/teilomillet/mentors

Text JSONL dataset visualiser and modifier.

data-visualization datasets llm-training

Last synced: 10 Feb 2026

https://github.com/timjjting/task-lineage-generator

A simple Task Lineage Diagram Generator

d3 dag data-visualization golang graphviz-dot lineage task

Last synced: 21 Jan 2026

https://github.com/abeltavares/postql

Python library and command-line interface (CLI) tool for interacting with PostgreSQL databases, providing simplified database management, query execution, and result export functionalities.

cli command-line-interface data-analysis data-engineering data-export data-management data-processing data-visualization database database-administration database-tools etl oop postgres postgresql psycopg2 python sql sqlalchemy wrapper

Last synced: 19 Jan 2026

https://github.com/ts-kontakt/interpareto

Python utility for creating interactive Pareto charts from pandas.DataFrame objects

data-visualization html-export pandas plotly

Last synced: 01 Sep 2025

https://github.com/samruddhi3012/insurance-price-forecasting

This is a Machine Learning project where I performed EDA and forecasted the insurance pricing using Linear Regression and XGBoost Regressor.

bayesian-optimization data-visualization exploratory-data-analysis linear-regression machine-learning pipeline statistical-analysis xgboost-regression

Last synced: 31 Aug 2025

https://github.com/leosimoes/alura-streamlit-dashboard

Projeto da demonstração do curso Alura - Streamlit - construindo um dashboard interativo. Aplicativo web com duas páginas para exibição dos dados.

dashboards data-visualization python streamlit

Last synced: 07 May 2026

https://github.com/tzerk/esr

R package 'ESR' for plotting and analysing ESR spectra in dating applications

data-analysis data-visualization electron-spin-resonance geochronology r

Last synced: 13 Mar 2026

https://github.com/karlyndiary/coffee-shop-sales-analysis

Comprehensive analysis of coffee shop sales utilizing Pandas for data cleaning and exploratory data analysis (EDA), complemented by Streamlit for creating interactive data visualization dashboards.

data-analysis data-cleaning data-preprocessing data-visualization eda pandas streamlit streamlit-dashboard

Last synced: 07 May 2026

https://github.com/anderson-andre-p/uber-data-analysis

This repository contains a comprehensive data analysis project focused on Uber rides. The dataset used in this project is a spreadsheet obtained from Uber, containing data related to ride details, such as pick-up and drop-off locations, date and time of the ride, and the fare amount.

data-analysis data-science data-visualization python

Last synced: 15 Jun 2026

https://github.com/jaymax01/dvd-rental-data-analysis

Data analysis of a DVD rental database

data-visualization postgresql sql

Last synced: 22 Jul 2025

https://github.com/sebastian-gregoricchio/rseb

An R-package for daily tasks required to handle biological data as well as avoid re-coding of small functions for quick but necessary data management.

atac-seq bedtools chip-seq cutandtag daily-tasks data-visualisation data-visualization datamining deeptools genomics ngs qpcr qpcr-analysis r rna-seq statistics

Last synced: 31 May 2026

https://github.com/archanakokate/adidas_us_sales_analysis_powerbi

Analysis of Adidas Sales in US for year 2020-2021 using PowerBI

analysis data-visualization modelling powerbi

Last synced: 04 Jan 2026

https://github.com/aqueeqazam/matplotlib-for-data-science-analysis-and-statistics

With a nice mix between customization options and ease of use, Matplotlib is a robust Python library that can be used by both novice and seasoned data scientists and machine learning engineer to create a wide range of representations.

data-science data-visualization machine-learning matplotlib pyplot statistics

Last synced: 07 Oct 2025

https://github.com/louisfernando1204/websocket-benchmark

A comprehensive performance testing and analysis suite designed to evaluate and compare different WebSocket server implementations across various programming languages and libraries.

benchmarking broadcast-test coder-websocket csv data-analysis data-visualization echo-test golang gorilla-websocket nodejs python3 socket-io websocket-client websocket-server ws

Last synced: 09 Apr 2026

https://github.com/satyam4229/omnify-dataanalysis

Our assessment of Omnify focused on data-driven strategies to maximize profitability. We identified "Product X" as the most profitable product and recommended leveraging the "Wellness Solutions" keyword category for optimal keyword strategy.

data-analysis data-science data-visualization excel omnify

Last synced: 04 Jan 2026