An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/sahilmaurya28/youtube-data-analysis

YouTube Data Analysis using Python — uncovering trends, engagement patterns, and correlations between likes, comments, views, and categories to understand what drives content success.

analysis data-analysis data-visualization matplotlib-pyplot numpy pandas portfolio-project python seaborn youtube

Last synced: 13 Apr 2026

https://github.com/nishumehta/british-airways-reviews-analysis

This project analyzes British Airways reviews using Tableau to create an interactive dashboard. The dashboard visualizes average ratings across multiple metrics and trends over time.

dashboard data-analysis data-visualization tableau tableau-public

Last synced: 12 Jan 2026

https://github.com/rudra-g-23/power-bi-custom-visual

A custom Power BI visual that displays customizable, interactive charts with advanced capabilities.

custom-visuals data-analysis data-visualization dax powerbi powerbi-custom-visuals svg visualization

Last synced: 02 Jan 2026

https://github.com/bhavinpatel4199/machine-learning-framework

This repository showcases various projects that explore key concepts in both supervised and unsupervised learning, with a focus on real-world applications. The projects use a range of machine learning techniques, including data preprocessing, feature selection, exploratory data analysis (EDA), and model optimization.

classification clustering data-science data-structures data-visualization exploratory-data-analysis machine-learning machine-learning-algorithms machine-learning-models pandas-dataframe predictive-modeling preprocessing-data sklearn supervised-learning unsupervised-learning

Last synced: 20 Jan 2026

https://github.com/tushar2704/employee-distribution

This repository contains valuable insights and visualizations derived from an extensive HR dataset spanning from 2000 to 2020, with over 22,000 rows.

data-analysis data-visualization excel postgresql powerbi sql tushar2704

Last synced: 04 Nov 2025

https://github.com/omari-kd/environmental-impact-on-food-production

The goal of this project is to assess the environmental impact of food production at both macro and micro levels and propose data-driven insights to mitigate the negative effects of food production on the environment.

data data-analysis data-science data-visualization environmental-impact-analysis r

Last synced: 30 Mar 2025

https://github.com/lut-ful/e-commerce-sales-report

This dashboard provides a visual analysis of e-commerce sales data

data data-analytics data-science data-visualization power-bi statics

Last synced: 28 Jun 2025

https://github.com/omari-kd/transborder-freight-data-analysis

This project analyses transportation data from the Bureau of Transportation Statistics (BTS) to uncover insights into cross-border freight's efficiency, safety and environmental impacts across road, rail, air and water modes.

data-analysis data-analysis-in-r data-cleaning-and-preprocessing data-science data-visualization powerbi

Last synced: 30 Mar 2025

https://github.com/anuuragg/human-microbiome---eda

Fundamentals of Data Science - End Semester Project 1

data-science data-visualization eda fds microbiome

Last synced: 14 Mar 2025

https://github.com/timjjting/escaping-flatland-slides

Slides for techniques behind escaping flatland

data-visualization glsl lod octree threejs

Last synced: 14 May 2025

https://github.com/gabe-zhang/cf-dataviz

A visual data exploration of campaign finance data

data-visualization ggplot2 r

Last synced: 06 Apr 2025

https://github.com/sanjana-bongale/cancer_survival_data_analysis_and_prediction_using_logistic_regression

This project performs data analysis using Python to predict cancer patient survival outcomes. It involves data cleaning, exploratory analysis, and visualizations to explore factors like cancer type, stage, and treatments. A logistic regression model is built to predict patient survival based on demographic and medical data.

data-analysis data-cleaning data-science data-visualization eda jupyter-notebook kaggle logistic-regression machine-learning matplotlib numpy pandas predictive-modeling python scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/aliasgarsogiawala/dashboards

Power BI dashboards; each folder contains a .pbix file and a PDF file explaining the dashboard.

analysis dashboards data data-visualization powerbi

Last synced: 12 Feb 2026

https://github.com/parnika798/psychic-palm-tree

Streamlit dashboard exploring correlations between ad spending and voter turnout in the 2024 Indian General Elections. Includes dynamic filters, intuitive charts, and election campaign impact exploration.

2024-indian-general-elections data-visualization election-data-analysis interactive-dashboard streamlit

Last synced: 14 Mar 2025

https://github.com/sweta-kaundilya/adventureworks-cycles-powerbi-project

This project was completed to simulate real-world tasks that data professionals encounter every day on the job.

dashboarddesign data-visualization datamodeling dataprep dax exploratory-data-analysis powerbi powerquery

Last synced: 08 Mar 2026

https://github.com/bala-1409/loan-clustering-datascience-projects

This project uses machine learning to cluster loans based on their similarities. It uses a dataset of loan applications that includes information about the loan amount and balance, then applies a clustering algorithm to group similar loans together.

clustering clustering-algorithm data-analysis data-science data-visualization kmeans-clustering machine-learning machine-learning-algorithms sql unsupervised-learning unsupervised-machine-learning

Last synced: 22 Mar 2025

https://github.com/beolawork-art/novabank-churn-analysis

NovaBank has noticed that customers are closing accounts or going inactive, and they want to understand why.

data-analysis data-science-projects data-visualization eda machine-learning numpy pandas python scikit-learn sql

Last synced: 08 Apr 2026

https://github.com/jocelynvelarde/embraceplus-visualizer

Visualize your raw data from .avro files for the EmbracePlus device from Empatica

avro-schema csv-files data-visualization empatica health-monitoring monitoring-tool python streamlit

Last synced: 14 May 2026

https://github.com/dimits-ts/visualization-assignments

Visualizing and analyzing results from the PISA-2018 competitions with regard to Greek performance and the gender gap.

data-analysis data-visualization interactive-graphs presentation-slides r-language tableau

Last synced: 06 Nov 2025

https://github.com/dimits-ts/visualization-team-project

Team project visualizing various views for an established bike-sharing company. Includes a written report, presentation, R-code and Tableau files

data-visualization presentation-slides r-language tableau

Last synced: 06 Nov 2025

https://github.com/oelin/textgram

A simple text-based data visualisation library.

ascii-art data-visualization diagram python

Last synced: 23 May 2026

https://github.com/giatraskon/hyperspectral-image-clustering

Analysis of the Salinas hyperspectral image dataset using advanced clustering algorithms, focusing on identifying homogeneous regions in the image. Implementations of cost-function optimization and hierarchical clustering techniques, along with evaluations and visualizations in reduced-dimensional spaces.

adjusted-rand-index calinski-harabasz-index clustering data-visualization dimensionality-reduction fuzzy-cmeans-clustering hierarchical-clustering hyperspectral-imaging image-processing k-means-clustering machine-learning matlab pca possibilistic-clustering-algorithms probabilistic-clustering remote-sensing salinas-dataset silhouette-score spectral-bands unsupervised-learning

Last synced: 14 Mar 2025

https://github.com/sivkri/shiny-scatter-plot-app

This repository contains a Shiny app that allows users to create interactive scatter plots by selecting the X and Y axes and customizing the point color. The app utilizes the shiny package in R to provide a user-friendly interface and the ggplot2 package for creating visually appealing plots.

data-analysis data-visualization ggplot2 interactive-web-application r rprogramming scatter-plot shiny

Last synced: 22 Mar 2025

https://github.com/nimomach/amazon-sales-data

This is a small dataset containing Amazon sales data analysis for a few regions.

dashboards data data-analysis data-visualization

Last synced: 08 Mar 2026

https://github.com/dbolotov/ts_smoothing_visualizer

Streamlit app for visualizing and comparing time series smoothing methods on real and synthetic datasets.

data-science data-visualization streamlit time-series

Last synced: 24 Jul 2025

https://github.com/leandrocollares/street-cherry-trees-in-vancouver

Street cherry trees in Vancouver: an exploratory data analysis

data-analysis data-visualization folium pandas plotly-express

Last synced: 17 Sep 2025

https://github.com/matte34/auto-insurance-analysis

Conducted a comprehensive exploratory data analysis (EDA) on an auto insurance dataset found on Kaggle. I performed a permutation test and generated data visualizations.

data-analysis data-visualization permutation-test python3 scipy seaborn

Last synced: 06 May 2026

https://github.com/swethajoseph/netflix-powerbi-interactive-dashboard

Created an interactive Netflix Power BI dashboard to analyze and visualize Netflix's content library, uncovering trends in content type, genre distribution, and global reach

data-analysis data-visualization interactive-visualizations powerbi powerbi-dashboards powerbi-report

Last synced: 03 Jan 2026

https://github.com/jaguzmana/colombia-covid-analysis

A project proposed to enhance SQL proficiency and develop skills in data visualization using Tableau.

data-visualization mssql-database tableau

Last synced: 08 Mar 2026

https://github.com/andersoncrs/clasificacion-propina-restaurante

This report presents, in a clear and practical way, a complete analysis of the well-known tips dataset, showing step by step how to turn raw information into useful predictive models.

clasification data-analysis data-visualization tips

Last synced: 26 Jul 2025

https://github.com/prachi005748/website-performance-data-analysis-project

Briefly describe the objective of the project—e.g., analyzing website performance metrics over time, uncovering trends in user engagement, or evaluating channel-wise traffic quality.

data-analyst data-cleaning data-preprocessing data-visualization data-visualization-python exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn storytelling

Last synced: 01 May 2026

https://github.com/samiksha29-patil/e-commerce-sales-insights-dashboard

This project focuses on analyzing e-commerce sales data through data visualization. It highlights customer behavior, popular sales channels, product category trends, and city-wise performance to provide actionable business insights.

analytics customer-insights data-visualization ecommerce matplotlib numpy pandas python sales-analysis seaborn

Last synced: 03 May 2026

https://github.com/vitor-ace/sunspots-data-analysis

A Jupyter Notebook that applies data analysis logic and libraries in Python.

data-analysis data-visualization debbuging error-handling file-handling matplotlib-pyplot numpy pandas python

Last synced: 06 May 2026

https://github.com/ljadhav25/healthcare-data-collection-and-analysis

This repository contains a project focused on collecting healthcare data from the web, storing it in a structured format, and performing comprehensive analysis. The objective is to gather valuable health-related information, process and clean the data, and derive insights to support healthcare research and decision-making.

data-analysis data-visualization flask-application flask-backend html-css-javascript pycharm-ide python

Last synced: 09 Apr 2026

https://github.com/jain1shh/solar-flare-prediction

This repository contains code and data for predicting solar flare energy ranges using machine learning, based on NASA's RHESSI mission data. It includes preprocessing of FITS files into a unified CSV dataset and implements models like Gradient Boosting, Random Forest, and Decision Tree classifiers, achieving accuracies up to 87%.

data-visualization machine-learning numpy pandas python scikit-learn solar-flare-prediction

Last synced: 09 Apr 2026

https://github.com/jakebrehm/geophotos

🗺 📍 A Python package to pull, analyze, and plot coordinates from various sources.

data-visualization gdal geopandas heatmap osgeo photos plot plotting python python-3

Last synced: 09 Jun 2026

https://github.com/easonlai/covid19_hk_analysis

This is a code sample of data analysis (with visualization) for COVID-19 cases in Hong Kong. Data is obtained from the official data.gov.hk site.

covid-19 data-analytics data-science data-visualization matplotlib pandas python seaborn seaborn-plots

Last synced: 12 Apr 2026

https://github.com/lucas-mazzolim/superstore-bi

Project where I prepared two data sources for querying and created a BI visualization in Data Studio. Tools used include MySQL, Looker Studio, Google Spreadsheet, and Python.

business-intelligence data-analysis data-visualization google-looker-studio mysql spreadsheet

Last synced: 27 Jul 2025

https://github.com/erictleung/tidytuesdays

📈 My attempts at #tidytuesday

data data-science data-visualization r rstats tables tidytuesday tidyverse

Last synced: 19 Sep 2025

https://github.com/leandrocollares/population-in-dutch-provinces

A responsive bar chart showing the population of Dutch provinces

d3 data-visualization svelte

Last synced: 16 Apr 2026

https://github.com/danielrosehill/value-factors-data-vis

Streamlit app containing visualisations of the Global Value Factors Database (GVFD) released by the IFVI in 2024

data data-visualization sustainability sustainability-data

Last synced: 29 Jul 2025

https://github.com/hasinii12/-chocolate-analysis-dashboard

This Power BI report provides a comprehensive analysis of chocolate ratings and related attributes.

data-analysis data-visualization powerbi

Last synced: 09 Feb 2026

https://github.com/malakasupun/crime-data-analysis-of-lapd

This project aims to explore and analyse crime patterns in Los Angeles using a dataset spanning from 2020 to the present. The primary focus is to extract meaningful insights by integrating structured data analysis and advanced techniques in SQL and Natural Language Processing (NLP).

data-analysis data-visualization llm nlp sql

Last synced: 29 Jul 2025

https://github.com/hauntedhost/modern-drive

ModernDive: An Introduction to Statistical and Data Sciences via R at http://www.moderndive.com

data-science data-visualization r statistics

Last synced: 29 Jul 2025

https://github.com/sejalkoli/powerbi-dashboard

Exploring insights and boosting business success with my Superstore Sales Dashboard project using Power BI.

dashboard data-analytics data-visualization powerbi

Last synced: 07 Nov 2025

https://github.com/zborovskaanna/e-commerce-web-events-analysis

SQL project based on the BigQuery public database 'The Look e-Commerce', with a dashboard in Looker Studio

analysis bigquery dashboard data-visualization looker-studio sql

Last synced: 03 Jan 2026

https://github.com/farseenmanekhan1232/analyse-economic-cycle

A Python-based CLI tool for analyzing economic cycles and making data-driven investment decisions in the Indian stock market using Kite Connect API.

data-visualization investment matplotlib portfolio-optimization python stock-market

Last synced: 30 Jul 2025

https://github.com/sinsunsan/earth-survival-kit

Global warming data visualisation app to help everyone understand global warming and take actions that matter

angular angular7 d3 data-analysis data-visualization ecology global-warning ngx-charts

Last synced: 05 May 2026

https://github.com/nathadriele/transaction_fraud_prevention_pipeline

A solution for detecting and preventing fraud in financial transactions, combining machine learning, business rules, and advanced statistical analysis. The system provides an interactive dashboard for real-time monitoring, data analysis, and fraud alert management.

data-analysis data-visualization docker fraud-prevention machine-learning matplotlib numpy pandas pipeline pytest python scikit-learn scipy seaborn streamlit tensorflow transaction xgboost

Last synced: 10 Apr 2026

https://github.com/shaheerazam-dev/cyclistic-case-study-google-data-analytics-certificate

This case study simulates the real-world experience of a junior data analyst at Cyclistic, a fictional company. We will leverage the data analysis process framework (Ask, Prepare, Process, Analyze, Share, Act) to address critical business questions and provide data-driven insights to guide strategic decision-making.

bigquery data-science data-visualization spreadsheet sql tableau

Last synced: 06 Feb 2026

https://github.com/jakobtroidl/barrio

A visual tool to compare and analyze nanoscale brain structures.

comparison data-visualization neuroscience scientific-visualization

Last synced: 09 Apr 2026

https://github.com/j4rviscmd/streamlit-advanced-dataframe

🚀 A powerful Streamlit custom component that extends st.dataframe with advanced features: filtering, sorting, row/cell selection, column resizing, virtual scrolling (60fps with 100K rows), and more. Built with React + TanStack Table v8.

data-table data-visualization dataframe pandas python react streamlit streamlit-component streamlit-custom-component tanstack-table typescript

Last synced: 09 Mar 2026

https://github.com/kartikey2807/bike-classification-1rt700

Binary classification problem involving Logistic regression, SMOTE and feature expansion.

data-analysis data-engineering data-visualization logistic-regression

Last synced: 30 Jul 2025

https://github.com/sanveed-adnan/supermarket-sales-sql-project

SQL-based data analysis project on supermarket sales performance using SQLite and Power BI.

business-intelligence data-analysis data-science data-science-projects data-visualization power-bi sales-data sql sqlite

Last synced: 08 Nov 2025

https://github.com/teamtigers/echartify

A web application built with .NET Core 2.2 that reads Bangladesh's national election dataset as quickly as possible and presents it with various statistical charts.

bangladesh chartjs code-first-migration cross-platform data-analysis data-structures data-visualization dotnet-core election-analysis election-data entity-framework-core materializecss mvc npoi razor-pages

Last synced: 16 Apr 2026

https://github.com/alrza2003/google-data-analysis-case-study-cyclistic

This project analyzes Cyclistic’s trip data to identify patterns in bike usage between casual riders and annual members. The findings help optimize marketing strategies and membership conversions.

business-task cyclistic-bike-share-analysis-case-study data-analysis data-science data-visualization google-data-analytics google-data-analytics-capstone-project google-data-analytics-professional jupyter-notebook python rmarkdown tableau

Last synced: 09 May 2026

https://github.com/robwiederstein/kytc_loc

Plot Kentucky licensing locations

data-visualization ggmap leaflet r xml2

Last synced: 31 Jul 2025

https://github.com/farrelfaricaf/exploratorydataanalyst---titanic

This project analyzes the Titanic dataset using exploratory data analysis (EDA) and visualization techniques to identify survival patterns. The goal is to understand how demographic factors like gender and age influenced survival rates during the 1912 disaster.

data data-analysis data-science data-visualization eda python titanic-dataset

Last synced: 31 Jul 2025

https://github.com/sakshithbillava/expense-manager

A web-based expense tracking app built with Python and Streamlit, featuring real-time updates, data visualization, user authentication, and MongoDB integration.

authentication data-visualization expense-manager matplotlib mongodb numpy pandas personal-finance python streamlit webapp

Last synced: 09 Apr 2026

https://github.com/palwisha-18/time_series_analysis_lex_vs_gdp

Analyzes how a country’s GDP per capita correlates with the life expectancy of its citizens over a period of more than 100 years

data-analysis data-visualization pandas plotl time

Last synced: 19 May 2026

https://github.com/rodolfo-brandao/pos-graduacao

[pt-BR] Repository for storing some materials and projects from each module of my specialization in Data Science (2025–2027)

artificial-intelligence data-analysis data-science data-visualization databases deep-learning jupyter linear-algebra machine-learning python r statistics

Last synced: 09 Apr 2026

https://github.com/analyst-lochan/flight-delay-and-cancellation-dataset-2019-2023-

This project demonstrates a complete data analytics pipeline starting from raw real-world flight data to professional visual dashboards using SQL Server and Power BI. It showcases data import, cleaning, optimization, transformation, and dynamic DAX-based visual reporting.

airline-performance business-intelligence data-analysis data-cleaning data-modeling data-visualization dax etl flight-data kaggle-dataset portfolio-project powerbi powerbi-dashboard sql sql-server

Last synced: 09 Sep 2025

https://github.com/ashwin331133/powerbi-data_professional_survey_breakdown

This project analyzes survey data from individuals interested in transitioning to the data field. The survey aims to understand their backgrounds, motivations, and the challenges they face. Using Power BI for data visualization, the project provides insights into the demographics and preferences of these aspirants.

data-analysis data-visualization powerbi

Last synced: 03 Jan 2026

https://github.com/abelarduu/global-currency-viewer

Global currency viewer that monitors, in real time, the exchange rates of currencies such as the Dollar, Euro, Pound, and Yen against the Brazilian real, using web scraping and interactive charts. Built in Python with Requests, BeautifulSoup, Pandas, and Matplotlib.

beautifulsoup data-visualization finance grafico matplotlib matplotlib-python pandas-python python requests-python web-scraping-python web-scrapping

Last synced: 19 Apr 2026

https://github.com/apsinghanalytics/wikiviewscryptopricetrendanalysis

Crypto Sentiment Analysis via Wikipedia Page View Trends and Bitcoin Price and Volume Trends

correlation-analysis crypto data-visualization exploratory-data-analysis seaborn time-series

Last synced: 10 Oct 2025

https://github.com/cecoeco/networks-r-project

Visualizing static networks with R (Coursera)

data-visualization igraph network-analysis r

Last synced: 04 Aug 2025

https://github.com/hari00887/analysis-of-global-terrorism

Analysis of Global Terrorism Using AHP: a quantitative study of GTD data to assess attack severity and evolution across time and space.

data-analysis data-visualization powerbi

Last synced: 02 Mar 2026

https://github.com/sshehrozali/top-repo-visualizer

Program that generates a visual graph of the most-starred GitHub repos using PyGal and the GitHub API.

api data-visualisation data-visualization github-api graph pygal python

Last synced: 05 Aug 2025

https://github.com/asuquoaa/understanding_distribution_through_sampling

This project demonstrates the concept of distribution through sampling using animations in Python.

animation data-visualization

Last synced: 05 Aug 2025

https://github.com/papposilene/mappingthepompidou

Visualizing the Centre Pompidou's (Centre national d'art moderne, aka CNAM) collection data.

data-visualization laravel7 museum museum-collections nuxtjs

Last synced: 02 Oct 2025

https://github.com/cagandemirmr/flo_sql_server_to_power_bi

In this project, I connect SQL Server to Power BI to visualize my project

data-visualization dataanalysis dataanalyst directquery powerbi queries sqlserver

Last synced: 08 Aug 2025

https://github.com/yash22222/data-analysis-on-real-time-social-media-comments

EngageInsight analyzes user interactions in comment data. It provides insights through visualizations created using Python libraries like Pandas and Matplotlib. The project aims to uncover patterns and trends in user engagement. The visualizations provide an overview of comment lengths and the frequency of different types of replies.

data-analysis data-cleaning-and-preprocessing data-visualization matplotlib pandas pattern-recognition real-time-social-media-data seaborn trend-analysis

Last synced: 14 May 2026

https://github.com/saikiran76/titanicdata-analysis-eda

In this notebook, we're going to analyse the famous Titanic dataset from Kaggle. The dataset is meant for supervised machine learning, but we're only going to do some exploratory analysis at this stage. We'll try to answer some questions using metrics and EDA.

analysis data-science data-visualization eda python

Last synced: 19 May 2026

https://github.com/sayamalt/superstore-sales-prediction

Successfully established a machine learning model that can accurately predict the sales of a superstore based on various features such as quantity, profit, discount, postal code, etc. The features are mainly associated with order details and customer demographics.

azure-machine-learning azure-web-app-service cicd-deployment cross-validation data-cleaning-and-preprocessing data-visualization exploratory-data-analysis feature-engineering github-actions-ci-cd hyperparameter-tuning machine-learning model-deployment model-retraining model-testing model-training-and-evaluation regression-models

Last synced: 09 Nov 2025

https://github.com/hemangsharma/hotel-revenue-booking-analysis

This project provides a comprehensive revenue and reservation analysis for Highfield Hotel using historical data exported from booking systems and internal revenue reports. The goal is to derive actionable insights to improve room profitability, understand booking patterns, and support data-driven decision-making.

analysis data-analysis data-visualization hotel

Last synced: 10 Aug 2025

https://github.com/1ayanabil1/100-days-of-python-bootcamp

Join me on my journey to code in Python every day for 100 days! 🐍 This challenge is designed to sharpen my programming skills, explore Python libraries, and build cool projects along the way.

data-structures data-structures-and-algorithms data-visualization django flask machine-learning matplotlib numpy pandas python seaborn web-development

Last synced: 09 Apr 2026

https://github.com/0xhericles/ufcg-geojson

GeoJSON file containing the blocks and buildings of the Federal University of Campina Grande.

data data-visualization geojson map open-source ufcg university

Last synced: 09 Feb 2026