An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/virajbhutada/titanic-survival-prediction

ML project focused on predicting Titanic passenger survival using various algorithms and extensive data analysis techniques. This project includes detailed data visualization and interpretation to uncover key factors affecting survival. By leveraging various ML models the analysis aims to achieve high predictive accuracy.

ada-boost-classifier data-exploration data-science data-visualization decision-tree-classifier hyperparameter-tuning knn-classification logistic-regression machine-learning model-interpretation random-forest-classifier roc-curve titanic-classification

Last synced: 14 Jun 2026

https://github.com/deliprofesor/income-analytics-interpretable-machine-learning-model

This project predicts whether an individual earns more than 50K using the Adult Income dataset. A Random Forest model is trained and evaluated, with explanations provided through DALEX and LIME for feature importance and model transparency.

classification dalex data-preprocessing data-science data-visualization feature-engineering income-prediction lime machine-learning model-explainability predictive-modeling r-programming random-forest

Last synced: 10 Apr 2025

https://github.com/estebanrucan/reporte-comunas-tasa-defuncion-alta_2017

El fin de este reporte es indicar cuales son las mayores causas de defunción en las comunas de Chile en el año 2017, el material queda a libre disposición para que se puedan tomar medidas.

chile data-visualization ggplot2 plotly rmarkdown

Last synced: 04 Feb 2026

https://github.com/kivanc57/feature_comparison

This project explores the relationship between features and diagnosis in cancer data. Using methods like boxplots, scatterplots, PCA, k-means clustering, and logistic regression, we analyze and visualize data to understand health indicators.

boxplot clustering correlation data-science data-visualization descriptive-statistics explanatory-data-analysis pearson-correlation r scatter-plot spearman

Last synced: 06 Jun 2026

https://github.com/shellynagar27/candy-market-share-analysis

Candy Market Share Analysis explores confectionery sales data using Power BI, Python, and Power Query. It uncovers key market trends, top-selling candies, manufacturer performance, and packaging preferences to support data-driven decision-making for industry researchers.

critical-thinking data-analysis data-visualization exploratory-data-analysis powerbi powerquery problem-solving sales-analysis

Last synced: 03 Feb 2026

https://github.com/fazzaan/gitbook-sciencing

GitBook sync for Sciencing publishing & training projects

data-presentation data-visualization ebook gitbook science science-communication science-research

Last synced: 08 Jan 2026

https://github.com/treyhamilton/ds-project-1

A compilation of various programming concepts written in Python/R covering the topics listed below

covid19-data data-science data-visualization exploratory-data-analysis

Last synced: 06 Jul 2025

https://github.com/mituskillologies/data-science-mar25

Programs of Data Science batch @ MITU Skillologies, March 2025

data-analytics data-science data-visualization machine-learning python

Last synced: 16 Mar 2025

https://github.com/vatshayan/pokemon-analysis

Visualization, Analysis & Predicting the accuracy of finding Pokemon power, attack & speed through Machine Learning

artificial-intelligence data data-analysis data-science data-visualization dataset machine-learning machine-learning-algorithms pokemon scikit-learn

Last synced: 30 May 2026

https://github.com/bertiewooster/ipywidgets

Interactive data visualizations in a Jupyter Notebook per tutorial https://python.plainenglish.io/interactive-visualizations-with-pandas-seaborn-and-ipywidgets-173e5d7d6a5e

data-analysis data-science data-visualization ipython-notebook ipywidgets juypter-notebook python

Last synced: 06 Mar 2026

https://github.com/zulhaditya/netflix-analysis

Netflix data analysis using multiple python libraries.

data-visualization python

Last synced: 19 May 2026

https://github.com/ak-abhilash/insightcat

📊 One-click open-source EDA tool for CSV, Excel, JSON

csv- data-analysis- data-visualization eda- fastapi- open-source- pandas- react-

Last synced: 14 Jun 2025

https://github.com/karishmagupta05/udemy-course-analysis

This project analyzes Udemy courses using Exploratory Data Analysis (EDA) techniques to uncover insights about course trends, pricing, subscriber counts, and popularity. By leveraging Python, Pandas, and data visualization libraries, we extract meaningful information from the dataset.

data-analysis data-visualization eda jupiter-notebook pandas python

Last synced: 13 Apr 2026

https://github.com/srinibas-masanta/olympics-data-analysis

The Olympics Analysis project explores Olympic data to uncover trends in athlete performance, medal distribution, and participation across countries and demographics. By leveraging detailed datasets, it provides insights into the evolution of the Games, highlighting key patterns and disparities over time.

data-analysis data-science data-visualization olympics olympics-visualization

Last synced: 02 Apr 2025

https://github.com/rubynixx/spotify_analysis_using_spotipy

Utilising the Spotipy API to visualise personal listening patterns.

data-visualization spotify spotipy spotipy-api spotipy-library

Last synced: 27 Feb 2025

https://github.com/sssshefer/covid-map

Interactive map showing covid data implemented on R language

big-data data-visualization r r-studio

Last synced: 01 Mar 2025

https://github.com/joaopalmeiro/vtils

A Python package providing utility functions for Data Visualization.

data-visualization python visualization

Last synced: 26 Mar 2025

https://github.com/nmelgar/healthy_child_dataviz

Data visualization project to analyze what a healthy child is.

analysis data data-analysis data-science data-visualization dataviz research tableau visualization

Last synced: 23 Feb 2026

https://github.com/pratik-khose/realtime-sales-simulation

Power BI: Realtime Sales Simulation using SQL Server and Direct Query

data-analysis data-analytics data-visualization dax-query powerbi sql sql-server sqlserver

Last synced: 10 Jun 2026

https://github.com/beatussum/pmsexp

A software for recovering the position of an object in a video

cpp cpp17 data-visualization physics qt qt5 science utility video

Last synced: 17 May 2026

https://github.com/deliprofesor/health-score-prediction-model-the-impact-of-lifestyle-and-demographic-factors

A machine learning project predicting health scores based on lifestyle and demographic factors like age, BMI, diet, and exercise. Techniques include Random Forest, Polynomial Regression, and Linear Regression, with a focus on model performance and actionable health insights.

cross-validation data data-science data-visualization feature-engineering linear-regression machine-learning polynomial-regression random-forest

Last synced: 10 Apr 2025

https://github.com/lvsvendsen/shime-monitor-r

R script for visualizing pH and pump activity in SHIME gut microbiome experiments.

data-visualization microbiome r research-tool shime

Last synced: 13 Sep 2025

https://github.com/mrkirushko/oxywitleaf2csv

Witleaf pulse oxymeter BIN format data decoder

d data-visualization oximeter oxymeter pulseoximeter pulseoxymeter r witleaf

Last synced: 02 Apr 2025

https://github.com/freya135/personal-finance-manager

This project is a web-based personal finance manager dashboard built using Next.js and Vercel PostgreSQL. The dashboard aggregates essential financial data to help users track metrics like profits, sales, and customer activity, and it provides easy-to-read visualizations to support data-driven decision-making.

data-visualization nextjs personal-finance-manager postgresql vercel webdashboard

Last synced: 13 Apr 2026

https://github.com/ilke-kas/multivariate-data-analysis

A curated collection of R-based data analysis projects applying regression modeling, clustering, dimensionality reduction, multivariate statistics, and classification. Each project showcases practical data science techniques, interpretability, and domain insights using real-world and academic datasets.

classification data-analysis data-visualization dimensionality-reduction machine-learning multivariate-analysis r regression statistics

Last synced: 05 Oct 2025

https://github.com/jianxi-erin/bigdata-machinelearning-lab

本项目是一个综合性的大数据与机器学习实验平台,包含两个主要任务,每个任务涵盖三个关键技术模块:大数据处理、数据分析和机器学习。项目基于真实的竞赛设计,提供完整的数据处理模拟和建模实践。

data-analysis data-visualization hadoop machine-learning python spark sql

Last synced: 03 May 2026

https://github.com/trismald/eurosoccer1023

Data Analyst - European Soccer 2010 2023

data-analysis data-visualization jupyter-notebook pandas powerbi python

Last synced: 06 May 2026

https://github.com/dibsthegreat/titantic-dataset-analytics

DASC4850 Final Project where I did EDA to determine the survivability of Titanic guests depending on Age, Gender, Wealth, etc.

data-science data-visualization matplotlib numpy pandas python random-forest-classifier

Last synced: 13 Apr 2026

https://github.com/davifeliciano/modern_physics_experiments

Collection of data analysis and visualization scripts developed in Python around some modern physics experiments

data-analysis data-visualization modern-physics physics physics-experiments

Last synced: 18 Jan 2026

https://github.com/minhtungonep/android-traffic-analysis

Android malware detection project analyzing network traffic patterns in a telecommunications context. Uses statistical hypothesis testing and data visualization to evaluate traffic features like DNS query times, TCP packets, and volume bytes for distinguishing between benign and malicious Android applications.

android-library bachelor-project bachelor-thesis cybersecurity cyface data-visualization hypothesis-testing malware-detection matplotlib network-traffic numpy pandas python scipy sdk statistical-analysis telecommunications voip-security

Last synced: 09 May 2026

https://github.com/swatisinghit/treadmill-customer-profiling-for-aerofit

Create comprehensive customer profiles for each AeroFit treadmill product through descriptive analytics. Develop two-way contingency tables and analyze conditional and marginal probabilities to discern customer characteristics, facilitating improved product recommendations and informed business decisions.

analytics conditional-probability data-analysis data-science data-visualization eda numpy pandas probability statistics

Last synced: 08 May 2026

https://github.com/deliprofesor/kizbasina_odev4_trafik

Bu proje, 2005-2014 yılları arasında İngiltere’de gerçekleşen trafik kazalarına ait kapsamlı veri setlerini kullanarak trafik kazalarının sebeplerini, şiddetini ve zaman içindeki değişimini analiz etmektedir.

data-science data-visualization istatistik matplotlib pandas python statistics

Last synced: 14 Apr 2026

https://github.com/valinsogna/data_visualization_project

Analyzing scores from 17 major international skating events (Oct 2016-Dec 2017). This project delves into judge biases, athlete rankings based on difficult elements, and the significance of elements versus components in final rankings. Built using Python, it offers insights derived from publicly-released International Skating Union Protocols

data-visualization skating

Last synced: 07 Oct 2025

https://github.com/iankitnegi/tableautales

"Discover my Tableau journey! Dive into data-driven stories, visualizations, and projects as I explore the power of data visualization."

data data-visualization tableau

Last synced: 21 Jan 2026

https://github.com/meddhiachaouachi/browsetrack-datacollection

This full-stack web application streamlines data interaction with a simple and intuitive design. Built with Django and React, it offers secure and efficient tools for analysts to share insights, users to manage cookies, and clients to visualize real-time data effortlessly. The platform also tracks user browsing activity, providing valuable insights

cookies data-visualization datatracker social-network-analysis useractivity

Last synced: 07 Oct 2025

https://github.com/ljadhav25/world-population-analysis-1990-2023-

This repository contains data and analysis related to the world population from 1990 to 2023. The objective is to explore population trends, identify patterns, and visualize demographic changes across different countries and continents over the past few decades.

data-analysis-python data-visualization matplotlib numpy-library pandas-library seaborn

Last synced: 08 Oct 2025

https://github.com/amish5ingh/cricket-data-analytics-ipl

Data analysis and visualization of IPL 2022 matches using Python, Pandas, Matplotlib, and Seaborn. Includes insights on match outcomes, player performances, toss trends, and venue stats with 12+ charts.

data-analysis data-visualization ipl-data-analysis ipl-data-visualization jupiter-notebook matplotlib-pyplot numpy pandas python seaborn

Last synced: 09 May 2026

https://github.com/gauravsy704/sct_ds_2

Performed data cleaning and exploratory data analysis (EDA) on the Titanic dataset from Kaggle. Investigated the relationships between variables and identified key patterns and trends in the data using Python, with a focus on survival rates, passenger demographics, and embarkation details.

data-science data-visualization jupyter-notebook pandas python seaborn

Last synced: 06 May 2026

https://github.com/hiteshsahu/visual-studio-hybrid-application

Visual Studio and Java Script full duplex communication

data-visualization html5 javascript kendo-ui visual-basic

Last synced: 09 Oct 2025

https://github.com/do-me/excel-column-analyzer

A free online tool to analyze Excel column data. Instantly count unique values, calculate frequencies, and visualize results in charts.

chartjs data-science data-visualization tailwind

Last synced: 09 Oct 2025

https://github.com/priyanshubiswas-tech/priyanshubiswas-tech

SWE-Data Engineer @ EDN | Kubeflow-MLOps | Kubernetes | Databricks | AWS EMR-Lambda-Glue, Eventbridge, SQS-SNS | OCI Multi-Cloud Architect Professional | GCP GA4 | Gen AI | IEEE Brand Amb. | Ex-Chair, PES | Ex-Sec, SB

apache-spark aws data-analysis data-engineering data-visualization dbt hadoop kubernetes python3 sql

Last synced: 21 Jan 2026

https://github.com/sabdikay/analysis-of-biodiversity

This project analyzes biodiversity data from the National Parks Service, focusing on species in various park locations. Conducted in Jupyter Notebook, it uses pandas, matplotlib, NumPy, seaborn, and chi2_contingency for analysis and visualization.

data-analysis data-analysis-python data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 14 Apr 2026

https://github.com/nidhi-kekatpure/churn-prediction-for-subscription-customers

End-to-end customer churn prediction using AWS SageMaker, XGBoost, and Python - complete ML pipeline from data generation to real-time inference with 79% AUC score.

aws business-intelligence data-science data-visualization jupyter-notebook predictive-analytics

Last synced: 16 May 2026

https://github.com/rohanchawla972/business-analytics-and-visual-storytelling

Business analytics repository integrating multi-source datasets and applying advanced methods — EDA, clustering, panel regression, sentiment analysis, and data storytelling — to extract actionable insights across hospitality, innovation, and financial services.

business-analytics clustering data-visualization jupyter-notebook predictive-modeling python storytelling

Last synced: 18 May 2026

https://github.com/anandu-jpg/coffee-shop-sales-analysis

This project analyzes coffee shop sales data to identify trends, patterns, and insights that can help improve operations, boost revenue, and enhance the customer experience.

business-intelligence data-analysis data-visualization exploratory-data-analysis jupyter-notebook pandas phyton

Last synced: 18 May 2026

https://github.com/ratna-babu/generating-graphs

Generate, color, and visualize random graphs using Python's NetworkX and Matplotlib. Includes compression and storage of graph data with .gz and pickle. Ideal for exploring graph coloring and greedy algorithms in graph theory.

data-visualization erdos-renyi graph-coloring graph-theory greedy-algorithm matplotlib networkx python random-graph random-graph-generation

Last synced: 10 Oct 2025

https://github.com/skhosla8/analytics-webpage

A webpage that uses JSON data to render product details, a line chart and table.

d3 data-visualization react redux

Last synced: 14 Apr 2026

https://github.com/salma-mamdoh/the-android-app-market-on-google-play-project

My project aims to practice Data Analysis and Data Visualization on DataCamp

data-analysis data-visualization datacamp jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/allanreda/automated-k-means-clustering-engine

An interactive K-Means clustering tool built with Flask and Scikit-Learn, supporting Excel file uploads, cluster analysis, and data export, deployed on Google Cloud Run via Docker with CI/CD integration.

cicd css data-visualization deployment docker flask google-cloud-run html javascript k-means-clustering machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 19 Jan 2026

https://github.com/badranalyst/data-professional-survey-breakdown-power-bi-dashboard

This project presents an interactive Power BI dashboard analyzing data professionals' insights. Key focus areas include job satisfaction, challenges in entering the data field, career priorities, demographics, and more. The visualization helps uncover trends and factors impacting data professionals globally.

charts dashboard dashboards data data-cleaning data-visualization dataset dax power-bi powerbi

Last synced: 23 Feb 2026

https://github.com/luthfirrahmanb/tips-visualization

visualization tips data set with dash plotly

dash data-science data-visualization plotly python3

Last synced: 11 Oct 2025

https://github.com/priyanshubiswas-tech/deloitte-daikibo-telemetry-analysis-task-1

Tableau dashboard analyzing Daikibo telemetry data. Tracks downtime by factory/device with interactive filters. Deloitte task solution with JSON processing.

data-analysis data-visualization deloitte json tableau tableau-public

Last synced: 11 Oct 2025

https://github.com/benst099/circlesplot

Visualize proportions with circles in a plot

cran cran-r data-science data-visualization proportions r visualization

Last synced: 11 Oct 2025

https://github.com/mr-chang95/udacity-starbucks-challenge

Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.

data data-science data-visualization numpy pandas sklearn

Last synced: 14 Apr 2026

https://github.com/dzakwanalifi/stadata-x

Terminal UI untuk menjelajahi dan mengunduh data BPS Indonesia secara interaktif

bps-api cli-app data-analysis data-visualization indonesia-statistics indonesian-data open-data python statistics terminal-ui textual tui

Last synced: 20 Jan 2026

https://github.com/abeltavares/postql

Python library and command-line interface (CLI) tool for interacting with PostgreSQL databases, providing simplified database management, query execution, and result export functionalities.

cli command-line-interface data-analysis data-engineering data-export data-management data-processing data-visualization database database-administration database-tools etl oop postgres postgresql psycopg2 python sql sqlalchemy wrapper

Last synced: 19 Jan 2026

https://github.com/aszenz/data-viz

visualize data from your browser

csv-converter data-analytics data-visualization

Last synced: 20 Jan 2026

https://github.com/agarwalrachit399/fifa-19-analysis

Analysis of one of the most popular games ,FIFA-19 on tableau and a detailed report for the same

dashboard data-visualization fifa19 tableau

Last synced: 19 Jan 2026

https://github.com/sebastian-gregoricchio/rseb

An R-package for daily tasks required to handle biological data as well as avoid re-coding of small functions for quick but necessary data management.

atac-seq bedtools chip-seq cutandtag daily-tasks data-visualisation data-visualization datamining deeptools genomics ngs qpcr qpcr-analysis r rna-seq statistics

Last synced: 31 May 2026

https://github.com/moustafamohamed01/san-francisco-salaries-data-analysis

Salary Data Analysis 💰📊 An analysis of salary data to uncover trends, distributions, and anomalies in employee compensation. This project involves data cleaning, visualization, and statistical analysis to explore salary trends, job roles, and correlations. Insights are presented using Python and Jupyter Notebook. 🚀

data-cleaning data-visualization jupyter-notebook matplotlib pandas python seaborn

Last synced: 08 May 2026

https://github.com/jackiboi307/simpleplot

Simple plotting tool made with pygame

data-visualization pygame python

Last synced: 13 Oct 2025

https://github.com/adadalshabab/data-engineering-gcp-project

An end-to-end modern data engineering project, including deployment of ETL pipeline on Google Cloud Platform, using BigQuery for data analysis and leveraging Looker to generate an insight dashboard.

bigquery data data-science data-visualization databases dataengineering-a engineering etl-pipeline looker-studio powerbi

Last synced: 19 Jan 2026

https://github.com/achronus/data-graphing-tool

A tool for finding the perfect graph that fits your CSV data.

data-visualization matplotlib numpy pandas python3

Last synced: 13 Oct 2025

https://github.com/arunabhagit/data-driven-marketing-optimization-enhancing-engagement-conversions-and-customer-satisfaction

Analyzed ShopEasy’s marketing data using SQL, Python, and Power BI to identify low engagement and conversion issues, performed sentiment analysis, and delivered data-driven strategies for measurable performance improvement.

business-intelligence data-visualization dataanalytics marketingstrategy powerbi python sql

Last synced: 13 Oct 2025

https://github.com/petitatelier/data-generators

A collection of data generators, to play with in visualization experiments

data-generator data-visualization

Last synced: 13 Oct 2025

https://github.com/ashioyajotham/data_centers_are_eating_the_world

This idea is mostly how supercomputers (AI data centers) are coming up fast so it is an attempt to map them like Semi-Analysis does

data-center data-visualization supercomputers

Last synced: 14 Oct 2025

https://github.com/jasonleelunn/regex-visualiser

RegEx engine implementation, with an interactive frontend :eight_spoked_asterisk:

0de5 data-visualization regex

Last synced: 14 Oct 2025

https://github.com/markusbegerow/powerbi-navigation-menu

Interactive navigation menu visual for Power BI with slide-out filtering and hierarchical data support

business-intelligence d3js data-visualization filter hamburger-menu navigation powerbi powerbi-custom-visuals powerbi-visuals typescript

Last synced: 14 Oct 2025

https://github.com/coderjolly/utilisation-analysis

This provides a small glimpse of the IISc's, Supercomputer Education Research Centre (SERC) resource data, and how it was ingested, extracted to produced relevant results for data analysis between actual resource utilisation and simulated resource utilisation.

csv-parser-python data-transformation data-visualization flow plotly-dash plotly-python

Last synced: 14 Oct 2025

https://github.com/mahambilalandahaan/week8

K-Means Deep Dive: Clustering analysis with Elbow and Silhouette methods in Python

clustering data-visualization jupyter-notebook k-means machine-learning python scikit-learn unsupervised-learning

Last synced: 20 Apr 2026

https://github.com/saisurajmatta/healthcare-data-analytics

Power BI project analyzing Emergency Department data, demonstrating skills in data transformation, DAX, and visualization. It focuses on patient flow, wait times, demographics, and satisfaction, providing actionable insights for healthcare improvement. Includes documentation, data dictionary, and code samples.

data-analysis data-modeling data-visualization dax power-bi powerbi-visuals powerquery

Last synced: 22 Jan 2026

https://github.com/a26nine/msc-dissertation-bitcoin-dashboard

An interactive data visualisation dashboard built using Tableau Desktop to research and analyse the relationship between the price volatility and adoptability of bitcoin.

data-analysis data-science data-visualization tableau tableau-desktop tableau-prep

Last synced: 17 Feb 2026

https://github.com/5hraddha/ice-video-games-sales-analysis

Analysis of video game sales data of an online store Ice, which sells video games all over the world & identify patterns that determine whether a game succeeds or not.

data-visualization matplotlib numpy pandas requests-module scipy seaborn

Last synced: 14 Apr 2026

https://github.com/virajbhutada/hollywood-insights-tableau

Strategic cinematic insights through Hollywood's data landscape. Tableau-driven analytics for genre, studio profitability, and audience dynamics. Uncover trends, assess audience reception, and navigate through years of film data, elevating your understanding of the cinematic world.

analystics business-intelligence dashboard data-analysis data-visualization entertainment hollywood storytelling tableau tableau-desktop visualization

Last synced: 05 Feb 2026

https://github.com/harshindcoder/sleeping_time_survey_analysis

This survey analysis aims to identify lifestyle factors—such as screen time, exercise, alcohol consumption, beverages, and sleep direction—that affect sleep quality and duration. By analyzing these factors, we seek to predict sleep patterns and provide actionable insights to improve sleep health and overall well-being.

data-cleaning data-collection data-visualization exploratory-data-analysis regression-models survey-analysis

Last synced: 15 Oct 2025

https://github.com/mindlessmuse666/iris-ml-based-on-decision-trees

Проект демонстрирует применение моделей машинного обучения на основе деревьев решений и случайного леса для классификации набора данных Iris. Включает в себя загрузку данных, обучение моделей, оценку производительности и визуализацию результатов. Предназначен для изучения основ машинного обучения и анализа данных.

classification data-analysis data-visualization decision-trees iris-dataset machine-learning model-evaluation python random-forest scikit-learn

Last synced: 17 Oct 2025

https://github.com/tapas-gope/adventure-works

This project analyzes sales data for AdventureWorks, focusing on revenue, customer segments, and product performance. The dashboard provides insights into top-selling products, sales by region, and customer trends across multiple years. It helps in identifying sales opportunities and optimizing marketing strategies.

adventureworks business-intelligence data-cleaning data-transformation data-visualization dax mssql powerbi sales-analysis sql-queries

Last synced: 17 Oct 2025

https://github.com/nanxstats/ggsci-iterm

Microsite for previewing all iterm color palettes in ggsci

ansi-colors color-palettes data-visualization ggplot2 ggsci iterm iterm2 visualization

Last synced: 22 Jan 2026