An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/happymary16/data-visualization-labs

Solutions for all labs from 'Data Visualization' course

data-visualization jupiter-notebook python

Last synced: 13 Apr 2026

https://github.com/virajbhutada/titanic-survival-prediction

ML project focused on predicting Titanic passenger survival using various algorithms and extensive data analysis techniques. This project includes detailed data visualization and interpretation to uncover key factors affecting survival. By leveraging various ML models the analysis aims to achieve high predictive accuracy.

ada-boost-classifier data-exploration data-science data-visualization decision-tree-classifier hyperparameter-tuning knn-classification logistic-regression machine-learning model-interpretation random-forest-classifier roc-curve titanic-classification

Last synced: 14 Jun 2026

https://github.com/marianamartiyns/rfm-cluster-analysis

Customer behavior and sales analysis, including data cleaning, RFM calculation, churn analysis and customer clustering.

cluster-analysis data-analysis data-cleaning data-visualization pyhton

Last synced: 16 Mar 2025

https://github.com/mkaspulanwar/p6_bigdata_realtime_largescale_visualization

Praktikum Week 6 Big Data: Real-time analytics dan visualisasi data skala besar menggunakan PySpark Structured Streaming, Parquet Data Lake, dan Streamlit untuk monitoring mobilitas dan traffic smart city.

big-data data-visualization pyspark spark-streaming streamlit traffic-analytics

Last synced: 13 Apr 2026

https://github.com/deliprofesor/income-analytics-interpretable-machine-learning-model

This project predicts whether an individual earns more than 50K using the Adult Income dataset. A Random Forest model is trained and evaluated, with explanations provided through DALEX and LIME for feature importance and model transparency.

classification dalex data-preprocessing data-science data-visualization feature-engineering income-prediction lime machine-learning model-explainability predictive-modeling r-programming random-forest

Last synced: 10 Apr 2025

https://github.com/deliprofesor/breast-cancer-detection-using-svm-with-smote-and-model-optimization

This project analyzes health and lifestyle factors influencing heart attack risk using statistical methods and machine learning, with Ridge Regression identified as the best predictive model.

classification data data-preprocessing data-science data-visualization gridsearchcv machine-learning python roc-curve smote svm

Last synced: 10 Apr 2025

https://github.com/estebanrucan/reporte-comunas-tasa-defuncion-alta_2017

El fin de este reporte es indicar cuales son las mayores causas de defunción en las comunas de Chile en el año 2017, el material queda a libre disposición para que se puedan tomar medidas.

chile data-visualization ggplot2 plotly rmarkdown

Last synced: 04 Feb 2026

https://github.com/akansharajput280799/covid19-impact-analysis-usa

Data Analysis and Predictive Modeling to study COVID-19 impact across age groups, regions, and seasons in the USA.

classification-algorithm clustering-algorithm data-preprocessing data-visualization descriptive-statistics exploratory-data-analysis matplotlib numpy pandas seaborn

Last synced: 13 Apr 2026

https://github.com/leandrocollares/nyc-film-permits

NYC film permits: an exploratory data analysis

data-analysis data-visualization pandas plotly

Last synced: 05 Jul 2025

https://github.com/deliprofesor/virtual-reality-in-education-impact-analysis-and-insights

This project examines the impact of Virtual Reality (VR) on education, focusing on its effects on student engagement, learning outcomes, and creativity. It uses data analysis techniques like descriptive statistics, correlation analysis, and clustering to assess VR's effectiveness in enhancing learning.

clustering data data-analysis data-science data-visualization exploratory-data-analysis hypothesis-testing machine-learning python regression-analysis virtual-reality

Last synced: 14 Jun 2025

https://github.com/naveen88112/clustering_customer_invoice_data

Customer Invoice Data Clustering This project uses clustering methods on customer invoice data for segmentation analysis. It preprocesses data, normalizes features, and uses K-Means and DBSCAN to cluster customers according to spending habits and shared locations.

clustering data-preprocessing data-visualization numpy pandas python silhouette-score standardization

Last synced: 13 Apr 2026

https://github.com/umutonder97/project-network-ids

Network-Based Intrusion Detection System - dev/deploy-ment of a Hybrid Intrusion Detection System (HIDS) that integrates Signature-based Network Intrusion Detection Systems (SNIDS)

artificial-neural-networks convolutional-neural-networks covid-19 covid-19-russia covid19-data data-visualization genetic-algorithm ids keras-tensorflow knn microservices-architecture network-behavioral-analysis python time-series-forecasting

Last synced: 05 Jul 2025

https://github.com/saifalibaig/covid-19-infection-rate-analysis-using-python

Analysis of Covid-19 Infection rate and the world happiness report to identify if there is any relationship between infection rate and happiness

data-analysis data-visualization jupyter-notebook numpy pandas python3 sns

Last synced: 18 Apr 2026

https://github.com/wazedkhan/medical-data-analysis

This project visualize and make calculations from medical examination data using matplotlib, seaborn, and pandas.

data-visualization jupyter-notebook matplotlib pandas python3 seaborn

Last synced: 13 Apr 2026

https://github.com/shoebjoarder/superstore

A Dash app to analyze Superstore dataset.

dashboard data-analysis data-visualization python-3

Last synced: 02 Apr 2025

https://github.com/shellynagar27/good-cabs-data-analysis-project

This project is part of CodeBasics Challenge #13, where the goal was to provide actionable insights to the Chief of Operations at Goodcabs, a cab service provider in tier-2 cities of India. The project focused on analyzing key metrics like trip volume, repeat passenger rate, and passenger satisfaction.

critical-thinking data-analysis data-visualization excel exploratory-data-analysis power-bi presentation problem-solving sql storytelling

Last synced: 25 Jan 2026

https://github.com/aditishenoy35/netflix_analysis

An interactive data visualization project exploring Netflix content using Python and Jupyter Notebook.

data-visualization jupyter-notebook python

Last synced: 20 Apr 2026

https://github.com/kivanc57/feature_comparison

This project explores the relationship between features and diagnosis in cancer data. Using methods like boxplots, scatterplots, PCA, k-means clustering, and logistic regression, we analyze and visualize data to understand health indicators.

boxplot clustering correlation data-science data-visualization descriptive-statistics explanatory-data-analysis pearson-correlation r scatter-plot spearman

Last synced: 06 Jun 2026

https://github.com/shellynagar27/business-insights-360-project

A comprehensive Dashboard which provides better understanding of the business's market standing, key focus areas for optimization, underperforming customers, and year-wise financial insights, aiding in better inventory planning and performance tracking. Further it can be used in answering n number of why questions based on the situations.

dashboard data-analysis data-visualization dax-languague dax-studio excel performance-optimization power-bi reporting sql storage-manager

Last synced: 27 Jan 2026

https://github.com/shellynagar27/candy-market-share-analysis

Candy Market Share Analysis explores confectionery sales data using Power BI, Python, and Power Query. It uncovers key market trends, top-selling candies, manufacturer performance, and packaging preferences to support data-driven decision-making for industry researchers.

critical-thinking data-analysis data-visualization exploratory-data-analysis powerbi powerquery problem-solving sales-analysis

Last synced: 03 Feb 2026

https://github.com/parthivnaresh/facilyst

Facilyst is a library that makes using data science and machine learning tools easier.

data-science data-visualization deep-learning machine-learning mock-data neural-network python

Last synced: 18 Mar 2025

https://github.com/cyber-security-tech/top10-movies-web

Feature-rich full-stack Flask web app that lets users search, rate, and review movies via TMDb API, with smart genre filtering, interactive statistics (Chart.js), form validation (Flask-WTF), star-based ratings, and a polished UI/UX designed for real-world deployment.

api-integration bootstrap chartjs crud-app data-visualization flask flask-blueprints flask-wtf form-validation fullstack genre-filtering jinja movie-database python responsive-design sqlalchemy sqlite tmdb-api ui-ux web-app

Last synced: 08 Apr 2026

https://github.com/bretsw/eme6356-ss23-module5

Slide deck for EME6356, Module 5: Data Visualization (Spring 2023)

analytics data-analytics data-visualization slides visualization

Last synced: 08 Jan 2026

https://github.com/ndiplacide7/r-project

Explore diverse data analysis techniques using R programming combined with advanced machine learning algorithms to uncover insights and create powerful predictive models.

data-analysis data-visualization machine-learning-algorithms r

Last synced: 25 Mar 2025

https://github.com/cartervr/taxdatabase-sql-tableau

End-to-end process for building an SQL Azure database, performing data analysis with SQL and Python, and visualizing data with Tableau.

azure data-science data-visualization database-architecture database-deployment database-management databse-design datanalysis erdiagram sql tableau

Last synced: 13 Mar 2026

https://github.com/treyhamilton/ds-project-1

A compilation of various programming concepts written in Python/R covering the topics listed below

covid19-data data-science data-visualization exploratory-data-analysis

Last synced: 06 Jul 2025

https://github.com/brazer27/iris-classification

A Python implementation of Naive Bayes algorithm for Iris flower classification. Features include cross-validation, data preprocessing, and prediction capabilities. Built from scratch without ML libraries, achieving ~95% accuracy on the classic Iris dataset.

cross-validation data-science data-visualization flower-classification iris-dataset machine-learning naive-bayes python

Last synced: 06 Sep 2025

https://github.com/no-tone/pyrowatch

Dashboard for monitoring rural fire statistics and burned area in Portugal using Astro, D3.js, and Leaflet. Data from INE (Statistics Portugal) via dados.gov.pt with interactive map and drag-and-drop KPI layout.

astro d3js data-visualization kpi leaflet

Last synced: 30 May 2026

https://github.com/djeada/data-visualization

This repository is dedicated to the exploration of various data visualization frameworks through bite-sized code snippets, as well as providing insights on effective data visualization techniques and principles.

altair data-visualization matplotlib plotly

Last synced: 08 Jan 2026

https://github.com/nittygritty-zzy/quantlab

🚀 Professional quantitative trading research platform with ML-powered backtesting, multi-source options analysis, portfolio management, and interactive Plotly visualizations. Built on qlib with CLI interface.

algorithmic-trading backtesting cli data-visualization financial-analysis machine-learning options-trading plotly portfolio-management python qlib quantitative-finance

Last synced: 14 Jan 2026

https://github.com/superskyyy/stackoverseer

This is a StackOverflow monitor where you can easily access the most trending and up-to-date questions on a particular set of tags. This project can be modified to support wider range of tags and provide functionalities.

charts data-visualization stackoverflow-api stackoverflow-questions

Last synced: 08 Jan 2026

https://github.com/zulhaditya/netflix-analysis

Netflix data analysis using multiple python libraries.

data-visualization python

Last synced: 19 May 2026

https://github.com/ak-abhilash/insightcat

📊 One-click open-source EDA tool for CSV, Excel, JSON

csv- data-analysis- data-visualization eda- fastapi- open-source- pandas- react-

Last synced: 14 Jun 2025

https://github.com/balajimohan18/milk-production-time-series-forecasting-datascience-project

This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast the milk production. The model is evaluated using various metric.

acf adf data-analysis data-cleaning data-science data-visualization eda exploratory-data-analysis machine-learning pacf seasonality time-series trends

Last synced: 30 May 2026

https://github.com/k8hertweck/tidytuesdaydataviz

data viz for TidyTuesday lunch meetup at the Hutch

data-visualization tidytuesday

Last synced: 30 May 2026

https://github.com/lparham2/los-angeles-expansion-safety-concerns

Los Angeles Expansion Safety Concerns analyzes crime data to assess safety risks for city expansion. Using data analytics and visualization, the project identifies high-risk areas and provides insights to guide urban planning decisions for a safer, well-informed expansion strategy.

business-analytics crime-data data-visualization dataanalysis expansion losangeles powerpoint presentation

Last synced: 15 Jun 2025

https://github.com/srinibas-masanta/electric-vehicle-analysis-dashboard

This repository features an interactive Tableau dashboard that visualizes electric vehicle (EV) adoption trends in the U.S. 🚗⚡ Explore EV growth, top manufacturers, regional distribution, and the impact of incentives—all in one dynamic view. 📊 Use filters to dive deeper into the data and uncover key insights! 🚀

dashboards data-analysis data-visualization tableau

Last synced: 15 Jan 2026

https://github.com/spriggancg/hishiryo

A package to generate a picture representation of any csv file.

csv data-visualization dataset heatmap package pipy python python3

Last synced: 14 Jan 2026

https://github.com/greatwoman23/hotel_reservation_analysis

In this project, we delve into the intricate world of hotel reservations, utilizing a multifaceted analytical approach to uncover valuable insights. Through a combination of SQL queries and Tableau visualizations, we meticulously dissect a rich dataset comprising booking details, customer demographics, and reservation statuses.

data-analysis data-science data-visualization hotel hotel-reservation publications sql sql-query sqlite3 tableau

Last synced: 15 May 2026

https://github.com/rubynixx/spotify_analysis_using_spotipy

Utilising the Spotipy API to visualise personal listening patterns.

data-visualization spotify spotipy spotipy-api spotipy-library

Last synced: 27 Feb 2025

https://github.com/sssshefer/covid-map

Interactive map showing covid data implemented on R language

big-data data-visualization r r-studio

Last synced: 01 Mar 2025

https://github.com/samruddhi3012/public-health-data-analysis

Hi! This repo involves analyzing the Healthcare analytics using Advanced Microsoft Excel.

dashboard data-analysis data-visualization healthcare microsoft-excel pivot-chart pivot-tables vlookup

Last synced: 05 Feb 2026

https://github.com/joaopalmeiro/vtils

A Python package providing utility functions for Data Visualization.

data-visualization python visualization

Last synced: 26 Mar 2025

https://github.com/joaopalmeiro/data-vis-assets

A collection of various assets for Data Visualization assembled from inspiring sources.

catalog data-visualization documentation markdown

Last synced: 26 Mar 2025

https://github.com/samwhaaa/da_portfolio

Showcasing some of my Data Analytics projects

data-analysis data-analytics data-visualization jupyter jupyter-notebook python

Last synced: 01 Mar 2025

https://github.com/aaronjmars/token-race

Data visualization tool for tokens, based on their FDV or MC.

coinmarketcap data-visualization dexscreener web3

Last synced: 14 Sep 2025

https://github.com/pratyusha-r/air-quality-index-visualization

This project visualizes the Air Quality Index (AQI) of various cities across India using Power BI. The aim is to provide insights into the air quality trends and help raise awareness about environmental issues affecting urban areas.

air-quality-data dashboard data-visualization powerbi

Last synced: 03 Jan 2026

https://github.com/xre22zax/airline-analysis

Travel agency and need to know the ins and outs of airline prices for your clients

data-analysis data-visualization python python3 visualization

Last synced: 13 Apr 2026

https://github.com/subratamondal1/heart-attack-prediction

Heart Attack Prediction of patients based on the required data. Data Ingestion - Data Preparation - Exploratory Data Analysis (EDA) - Modelling - Evaluation.

data-analysis data-science data-visualization kaggle-dataset machine-learning matplotlib-pyplot numpy pandas python3 scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/deliprofesor/arrhythmia-classification-and-anomaly-detection

This project classifies arrhythmias and detects anomalies using machine learning and deep learning. It includes preprocessing the "INCART 2-lead Arrhythmia Database," feature engineering, KMeans clustering, Random Forest, IsolationForest, and an LSTM model for classification.

classification clustering data-science data-visualization deep-learning machine-learning

Last synced: 10 Apr 2025

https://github.com/archanakokate/iris_flower_classification

Analyzing and modeling the Iris dataset with the aim of classifying the species of Iris flowers.

analytics data-visualization exploratory-data-analysis machine-learning-algorithms

Last synced: 17 Mar 2025

https://github.com/jansim/ridges

R package for downloading and visualizing topographical elevation data.

data-visualization geospatial r ridgeline

Last synced: 02 Mar 2025

https://github.com/sehgal-vishal/blinkit-sales-project

This dashboard is designed to provide in-depth insights into key performance indicators (KPIs) critical to the operations of Blinkit.

data-visualization dataanalysis powerbi sales

Last synced: 19 Mar 2026

https://github.com/nero103/airbnb-destination

This is and end-to-end project to uncover the ideal destination based on listings and hosts. Strategy included: Data workflow-SQL analysis-Data modeling-Data Visualization-Findings

data-analysis data-modeling data-visualization etl etl-pipeline excel microsoft-sql-server powerpoint sql tableau

Last synced: 27 Mar 2026

https://github.com/ilke-kas/multivariate-data-analysis

A curated collection of R-based data analysis projects applying regression modeling, clustering, dimensionality reduction, multivariate statistics, and classification. Each project showcases practical data science techniques, interpretability, and domain insights using real-world and academic datasets.

classification data-analysis data-visualization dimensionality-reduction machine-learning multivariate-analysis r regression statistics

Last synced: 05 Oct 2025

https://github.com/seblehner/feldprakt

Collection of plotting routines for a field exercise work using different measurement tools and Hobo weather stations.

data-analysis data-visualization jupyter-notebook python

Last synced: 05 Oct 2025

https://github.com/trismald/eurosoccer1023

Data Analyst - European Soccer 2010 2023

data-analysis data-visualization jupyter-notebook pandas powerbi python

Last synced: 06 May 2026

https://github.com/davifeliciano/modern_physics_experiments

Collection of data analysis and visualization scripts developed in Python around some modern physics experiments

data-analysis data-visualization modern-physics physics physics-experiments

Last synced: 18 Jan 2026

https://github.com/minhtungonep/android-traffic-analysis

Android malware detection project analyzing network traffic patterns in a telecommunications context. Uses statistical hypothesis testing and data visualization to evaluate traffic features like DNS query times, TCP packets, and volume bytes for distinguishing between benign and malicious Android applications.

android-library bachelor-project bachelor-thesis cybersecurity cyface data-visualization hypothesis-testing malware-detection matplotlib network-traffic numpy pandas python scipy sdk statistical-analysis telecommunications voip-security

Last synced: 09 May 2026

https://github.com/boss294/credisync

CrediSync is a comprehensive web application designed to manage and track your financial transactions, including credits, debits, creditors, and debtors. With modern UI/UX and advanced features, it provides a seamless experience for managing your financial records.

credit-tracker css data-structures data-visualization dsa html html3 js management-system money-management web-dev website websoftware

Last synced: 17 May 2026

https://github.com/theglobemc/theglobemc.github.io

An interactive HTTP visualization of Minecraft books on the web from GlobeMC.

books data-visualization datamining minecraft

Last synced: 07 Oct 2025

https://github.com/bdice/signac-micde-cnsccs-2018

Slides and demos for the MICDE CNSCCS Symposium, October 15, 2018

data-management data-visualization demo signac workflow-automation

Last synced: 07 Oct 2025

https://github.com/valinsogna/data_visualization_project

Analyzing scores from 17 major international skating events (Oct 2016-Dec 2017). This project delves into judge biases, athlete rankings based on difficult elements, and the significance of elements versus components in final rankings. Built using Python, it offers insights derived from publicly-released International Skating Union Protocols

data-visualization skating

Last synced: 07 Oct 2025

https://github.com/18mahi/tweet-sentiment-analysis

Classify tweets into happy, sad, angry, excited, and neutral with this interactive Python model. Combining TF-IDF text features with engineered numeric features like emoji sentiment, polarity, subjectivity, and punctuation counts, it demonstrates intermediate-level NLP, feature engineering, and visualization skills.

data-science data-visualization jupyter-notebook machine-learning matplotlib nlp numpy pandas sklearn string textblob

Last synced: 30 Apr 2026

https://github.com/ljadhav25/world-population-analysis-1990-2023-

This repository contains data and analysis related to the world population from 1990 to 2023. The objective is to explore population trends, identify patterns, and visualize demographic changes across different countries and continents over the past few decades.

data-analysis-python data-visualization matplotlib numpy-library pandas-library seaborn

Last synced: 08 Oct 2025

https://github.com/sayamalt/mental-health-classification-using-fine-tuned-distilbert

Successfully established a multiclass text classification model by fine-tuning pretrained DistilBERT transformer model to classify several distinct types of mental health statuses such as anxiety, stress, personality disorder, etc. with an accuracy of 77%.

data-visualization deep-learning distilbert-fine-tuning distilbert-model model-evaluation model-inference model-training-and-evaluation multiclass-text-classification natural-language-processing text-classification text-preprocessing text-tokenization

Last synced: 08 Oct 2025

https://github.com/lgibson7/stat_651

Coursework for STAT 651 Data Visualization, Cal State East Bay Fall 2022

data-visualization leaflet shiny-apps tableau

Last synced: 05 Feb 2026

https://github.com/jdede1/data-analysis-visualization-assignment-5

INFO 526 — Data Analysis and Visualization, Assignment 5 (Dashboard Reports — Iowa Liquor Sales). Part of the Master’s in MIS/ML program at the University of Arizona. Includes positive and negative dashboards showing key KPIs: top products/vendors driving sales vs bottom products/vendors hindering sales.

dashboard data-visualization matplotlib pandas seaborn

Last synced: 16 Apr 2026

https://github.com/armahdavi/analytics_statistics_ml_plotting_dust_extraction_hvac_filters_ph2

PhD Technical Paper 1 - Phase 2 - Mahdavi & Siegel (2020) (Aerosol Science & Technology; AS&T) - Sharing all the data pipelines, processing codes, descriptive statistics, statistical modellings, and plotting/visualizations - Project Miestone: 2017 - 2020 - Full-length article is available

data-pipelines data-science data-visualization machine-learning matplotlib-pyplot numpy pandas-dataframe python scipy-stats sklearn statistics

Last synced: 14 Apr 2026

https://github.com/mgckaled/rs_data-analytics

Repositório agregador do conteúdo da formação Data Analytics desenvolvido pelo Rocketseat.

data-analytics data-visualization python sql statistics

Last synced: 09 Oct 2025

https://github.com/damisparks/machine-learning

Machine learning is a method of data analysis that automates analytical model building. It is a branch of artificial intelligence based on the idea that systems can learn from data, identify patterns and make decisions with minimal human intervention

data-science data-visualization deeplearning machine-learning machine-learning-algorithms machinelearning-python

Last synced: 09 Oct 2025

https://github.com/priyanshubiswas-tech/priyanshubiswas-tech

SWE-Data Engineer @ EDN | Kubeflow-MLOps | Kubernetes | Databricks | AWS EMR-Lambda-Glue, Eventbridge, SQS-SNS | OCI Multi-Cloud Architect Professional | GCP GA4 | Gen AI | IEEE Brand Amb. | Ex-Chair, PES | Ex-Sec, SB

apache-spark aws data-analysis data-engineering data-visualization dbt hadoop kubernetes python3 sql

Last synced: 21 Jan 2026

https://github.com/sabdikay/analysis-of-biodiversity

This project analyzes biodiversity data from the National Parks Service, focusing on species in various park locations. Conducted in Jupyter Notebook, it uses pandas, matplotlib, NumPy, seaborn, and chi2_contingency for analysis and visualization.

data-analysis data-analysis-python data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 14 Apr 2026

https://github.com/nidhi-kekatpure/churn-prediction-for-subscription-customers

End-to-end customer churn prediction using AWS SageMaker, XGBoost, and Python - complete ML pipeline from data generation to real-time inference with 79% AUC score.

aws business-intelligence data-science data-visualization jupyter-notebook predictive-analytics

Last synced: 16 May 2026

https://github.com/rohanchawla972/business-analytics-and-visual-storytelling

Business analytics repository integrating multi-source datasets and applying advanced methods — EDA, clustering, panel regression, sentiment analysis, and data storytelling — to extract actionable insights across hospitality, innovation, and financial services.

business-analytics clustering data-visualization jupyter-notebook predictive-modeling python storytelling

Last synced: 18 May 2026

https://github.com/anandu-jpg/coffee-shop-sales-analysis

This project analyzes coffee shop sales data to identify trends, patterns, and insights that can help improve operations, boost revenue, and enhance the customer experience.

business-intelligence data-analysis data-visualization exploratory-data-analysis jupyter-notebook pandas phyton

Last synced: 18 May 2026