An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/andersoncrs/clasificacion_diabetes_analisis_exploratorio_de_datos

Este proyecto aplica técnicas de análisis exploratorio de datos y algoritmos de clasificación para predecir la presencia de diabetes a partir de información médica.

data-visualization logistic-regression machine-learning medical-data notebook python

Last synced: 15 May 2026

https://github.com/sevilaymuni/project-no.2-pandas-tableau-student-mobility

Pandas assisted Feature Engineering on Study Mobility: Tableau Dashboards on Students' Preferences

data-analysis data-extraction data-visualization feature-engineering pandas python tableau-dashboards tableau-desktop tableau-public

Last synced: 03 May 2026

https://github.com/zeh237/superstore-data-analytics

This is a Flask based data analytics project based on the superstore dataset using flask, pandas, sql and python

analytics data data-analysis data-science data-visualization flask python superstore

Last synced: 04 May 2025

https://github.com/noor188/preswald-data-app

A data app to visualize and manipulate the graduate admission dataset

data-analysis data-visualization open-source

Last synced: 04 Jul 2025

https://github.com/deliprofesor/customerseg-customer-segmentation-and-shopping-analysis

This project performs data exploration, segmentation, and modeling of wholesale customer data using clustering algorithms, PCA, and decision trees to analyze purchasing behavior and predict customer channel preferences.

clustering customer-segmentation data-analysis data-visualization dbscan decision-tree gmm kmeans machine-learning pca

Last synced: 24 Jun 2025

https://github.com/nmatthews2203-del/rent-affordability-explorer

Interactive housing analytics dashboard using Zillow rent data and Census income data to analyze affordability, rent trends, and geographic housing differences across U.S. counties.

altair data-analytics data-visualization housing-data interactive-dashboard pandas plotly python real-estate sql sqlite streamlit

Last synced: 03 May 2026

https://github.com/habiburrahman-mu/exploratory-data-analysis

Methods to see if certain characteristics or features can be used to predict.

data-analysis data-mining data-science data-visualization

Last synced: 20 Jan 2026

https://github.com/nimomach/amazon-sales-data

This is a small dataset containing Amazon sales data analysis for few regions.

dashboards data data-analysis data-visualization

Last synced: 08 Mar 2026

https://github.com/tolumie/web-scraping-rest-api-stock-data-operations

Web Scraping, REST API & Stock Data Operations is a data-driven project that explores the power of web scraping, API interactions, and stock market analysis using Python. From extracting stock data and public records to analyzing real-world financial trends, this repository is a one-stop resource for data enthusiasts, traders, and analysts.

api-integration data-analysis data-cleaning data-visualization financial-data python rest-api sql-databases stock-data web-scraping

Last synced: 19 May 2026

https://github.com/jofaval/iris-flowers

Multilabel Classification of the famous Iris Flowers Dataset from Ronald Aylmer Fisher in 1936

classification data-analysis data-science data-visualization google-colab iris-flowers kaggle machine-learning python scikit-learn xgboost

Last synced: 05 Apr 2026

https://github.com/bris0yzbekaye/json-to-excel-converter

This repository provides a tool to convert JSON data to Excel format (.xlsx). It allows you to easily transform structured JSON data into a well-organized spreadsheet for better analysis and visualization.

automation-script automation-tools data-analysis data-converter data-export data-formatting data-tools data-visualization excel excel-automation excel-converter excel-tools json json-exporter json-parser json-processing json-to-csv json-to-excel programming-tools spreadsheet-tools

Last synced: 25 Jul 2025

https://github.com/syncfusionexamples/creating-the-wpf-stackedarea-chart-to-visualize-wealth-distribution-in-america-from-1990-to-2023

This sample demonstrates how to Create the Syncfusion WPF Stacked Area Chart to visualize wealth distribution in America based on income groups from 1990 to 2023.

chart-annotations chart-appearance chart-customization charting-library charts data-visualization stacked-area-chart text-annotation wpf-chart wpf-sfcharts

Last synced: 20 Aug 2025

https://github.com/netesf13d/expt-sequence-analysis

Data processing, analysis and visualization package for atomic physics experiments in the single-atom regime.

cold-atoms data-analysis data-visualization optical-tweezers

Last synced: 24 Jul 2025

https://github.com/dmarks84/ind_project_superstore-sales-time-series-analysis--kaggle

Independent Project - Kaggle Dataset-- I worked on the Superstore Sales Dataset, performing (as Part 1) data cleaning and preparation and exploratory data analysis. The main task was to make predictions for future sales based on time-series analysis, which is found in Part 2.

chloropleth data-modeling data-visualization eda linear-regression matplotlib numpy pandas python seaborn sklearn statistics statsmodels supervised-ml time-series-analysis

Last synced: 09 Apr 2026

https://github.com/vetrivel07/tableau-projects

This is my Tableau project repository to showcase my Data Visualization Skills.

dashboard data-cleaning data-schema data-visualization tableau

Last synced: 15 Jun 2025

https://github.com/armahdavi/analytics_statistics_ML_plotting_dust_extraction_hvac_filters_ph2

PhD Technical Paper 1 - Phase 2 - Mahdavi & Siegel (2020) (Aerosol Science & Technology; AS&T) - Sharing all the data pipelines, processing codes, descriptive statistics, statistical modellings, and plotting/visualizations - Project Miestone: 2017 - 2020 - Full-length article is available

data-pipelines data-science data-visualization machine-learning matplotlib-pyplot numpy pandas-dataframe python scipy-stats sklearn statistics

Last synced: 17 Sep 2025

https://github.com/swethajoseph/netflix-powerbi-interactive-dashboard

Created an interactive Netflix Power BI dashboard to analyze and visualize Netflix's content library, uncovering trends in content type, genre distribution, and global reach

data-analysis data-visualization interactive-visualizations powerbi powerbi-dashboards powerbi-report

Last synced: 03 Jan 2026

https://github.com/vetrivel07/flight-price-prediction

Developed a flight price prediction model using Python, analyzing historical data to forecast airfare prices and help travelers make informed booking decisions

data-analysis data-visualization jupyter-notebook numpy pandas python

Last synced: 15 Jun 2025

https://github.com/cfelde/raspberry-pi-temperature

Raspberry Pi temperature vs room temperature

data-visualization kotlin raspberry-pi sigbla temperature-sensor

Last synced: 18 May 2026

https://github.com/syncfusionexamples/how-to-add-images-as-category-in-.net-maui-cartesian-chart-axis-labels

This article in the Syncfusion Knowledge Base explains how to add an image in category axis labels in a .NET MAUI Cartesian chart.

axis-customization category-axis charts column-chart data-visualization dotnet-maui maui-chart sfcartesianchart

Last synced: 03 Apr 2025

https://github.com/syncfusionexamples/how-to-add-multiple-trackballs-in-a-wpf-sfchart

Learn how to add multiple trackballs to a single WPF SfChart and drag them independently to view the information of different data points at the same time.

chart-trackball charting-library charting-tools custom-trackball data-visualization interaction interactive-chart line-chart multiple-trackball sfchart trackballs wpf-sfchart

Last synced: 03 Apr 2025

https://github.com/prachi005748/website-performance-data-analysis-project

Briefly describe the objective of the project—e.g., analyzing website performance metrics over time, uncovering trends in user engagement, or evaluating channel-wise traffic quality.

data-analyst data-cleaning data-preprocessing data-visualization data-visualization-python exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn storytelling

Last synced: 01 May 2026

https://github.com/samiksha29-patil/e-commerce-sales-insights-dashboard

This project focuses on analyzing e-commerce sales data through data visualization. It highlights customer behavior, popular sales channels, product category trends, and city-wise performance to provide actionable business insights.

analytics customer-insights data-visualization ecommerce matplotlib numpy pandas python sales-analysis seaborn

Last synced: 03 May 2026

https://github.com/vitor-ace/sunspots-data-analysis

This is a Jupyter Notebook which works with Data Analysis logic and libraries implementation with Python.

data-analysis data-visualization debbuging error-handling file-handling matplotlib-pyplot numpy pandas python

Last synced: 06 May 2026

https://github.com/ptdewey/spotipy-wrapped

Make sense out of Spotify personal data

data-visualization jupyter-notebook python spotify

Last synced: 01 Aug 2025

https://github.com/ornl/covid19vis

Visualizations of COVID-19 case data

data-visualization scientific-visualization

Last synced: 03 Jan 2026

https://github.com/mikeludemann/python-data-visualization

Some data visualization methods

data-visualization python

Last synced: 28 Mar 2025

https://github.com/diriho/pi-approximation

Pi approximulation using the Monte Carlo Simulation🥧🥧 This python program approximate pi - 𝛑 depending on the number of dots entered. The more dots entered, the better the approximation is.

data-visualization loops-and-iterations monte-carlo-simulation python3 random-distributions vizualisation

Last synced: 15 Jun 2025

https://github.com/jonprice99/regional-election-analysis

An analysis of election results in Allegheny County using Pandas and other Python libraries to better understand the voting habits, practices, and preferences of regional voters.

data data-visualization election-analysis election-data pandas python

Last synced: 05 May 2026

https://github.com/pablobernabeu/depictr-py

A unified, colourblind-safe toolkit for publication-ready statistical visualisation in Python (plotnine). The sibling of the depictr R package.

accessibility colorblind data-visualization ggplot plotnine plotting python scientific-visualization statistics visualization

Last synced: 30 Jun 2026

https://github.com/outadoc/repo-lang-history

Generate language history data across the lifetime of your repository

data-visualization git graph

Last synced: 15 May 2026

https://github.com/zaydabash/envirowatch

Real time environmental dashboard for live air quality monitoring with anomaly detection, interactive maps, and natural language commands. Built with Next.js, TypeScript, and OpenAQ.

air-quality data-visualization environmental-data environmental-monitoring maplibre nextjs openaq realtime-dashboard recharts shadcn-ui tailwindcss typescript vercel zustand

Last synced: 09 Apr 2026

https://github.com/jofaval/ionosphere

Binary Classification of Ionosphere signals at Goose Bay, Labrador in 1988

data-analysis data-science data-visualization deep-learning google-colab keras machine-learning python scikit-learn tensorflow uci xgboost

Last synced: 09 Apr 2026

https://github.com/jain1shh/solar-flare-prediction

This repository contains code and data for predicting solar flare energy ranges using machine learning, based on NASA's RHESSI mission data. It includes preprocessing of FITS files into a unified CSV dataset and implements models like Gradient Boosting, Random Forest, and Decision Tree classifiers, achieving accuracies up to 87%.

data-visualization machine-learning numpy pandas python scikit-learn solar-flare-prediction

Last synced: 09 Apr 2026

https://github.com/m-dadej/excess_deaths_poland

Estimation of excess deaths during COVID-19 pandemic in Poland

covid-19 data-science data-visualization rstats time-series

Last synced: 14 May 2026

https://github.com/ryancoll/hackduke-2021

Using maps created in Google Data Studio and charts from react-chartjs-2, EnviroView is an online environmental database tool designed for efficient and easy access to insightful information regarding the United States’ impact on the environment through four key categories: Nature, Energy, Transportation, and Household.

data-visualization maps-data react react-chartjs-2

Last synced: 15 May 2026

https://github.com/12danielll/neurogenomics_project

This project focuses on analyzing sequencing data to understand molecular mechanisms of neurological diseases and predict the effectiveness of immunotherapy in breast cancer patients. It integrates Python and R scripts for data processing, statistical analysis, and visualization, alongside a comprehensive report detailing methods and findings.

bioinformatics biostatistics clustering clustering-algorithms data-analysis data-visualization deseq2 differential-gene-expression functional-analysis immune-therapy machine-learning neurological-disease neuroscience pca-analysis python r seurat single-cell-analysis

Last synced: 06 Apr 2026

https://github.com/lucas-mazzolim/superstore-bi

Project where I prepared two data sources for querying and created a BI visualization in Data Studio. Used tools as Mysql, Looker Studio, Google Spreadsheet and Python.

business-intelligence data-analysis data-visualization google-looker-studio mysql spreadsheet

Last synced: 27 Jul 2025

https://github.com/pentalpha/bti-performance-study

A series of analysis on a large amount of data about the grades of students in the Technology Information course at UFRN

analysis big-data clustering data-analysis data-science data-visualization ipynb ipython jupyter-notebook performance-analysis plot python python3

Last synced: 15 May 2026

https://github.com/akhi07rx/petals-using-r

This R code generates a plot of a flower. It uses polar coordinates and the sine function to create the petal shapes and then plots them.

data-visualization graphics opensource plot r trignometry

Last synced: 23 May 2026

https://github.com/danielrosehill/impactweightedaccounts

A repository for hosting data and data visualisations related to the work of the International Foundation for Valuing Impacts and other organizations pioneering the development and adoption of these methods (disclaimer: this is an independent project not affiliated or associated with any particular entity).

accounting data-visualization impact-transparency

Last synced: 03 May 2025

https://github.com/christos-pelekis/harsourcerer

An inclusive MERN stack-based platform for comprehensive analysis and exploration of HTTP traffic data extracted from HAR (HTTP Archive) files.

data-visualization har-files http-traffic mern-stack

Last synced: 29 Jul 2025

https://github.com/shreedata/covid-da-dasboard-using-powerbi

This repository showcases a PowerBI dashboard focused on visually representing COVID-19 data for Indian states and Union Territories in an easily understandable way. The dataset is sourced from Kaggle.

data-cleaning data-visualization datanalaysis microsoft microsoft-powerbi powerbi-report powerbi-visuals powerbidashboard

Last synced: 19 Feb 2026

https://github.com/archanakokate/eda_amazon_products_and_discounts_2023

Exploratory Data Analysis (EDA) on Amazon's 2023 Products and Discounts data

data-analysis data-mining data-visualization exploratory-data-analysis

Last synced: 03 Jan 2026

https://github.com/swethajoseph/statistical-stock-performance-analysis

Conducted a statistical analysis of Microsoft, Tesla, and Apple stock performance compared to the S&P 500, examining price trends, volatility, and correlations to derive investment insights.

advancedexcel comparative-analysis data-analysis data-visualization datapreparation descriptive-statistics moving-average msexcel performance-analysis performance-metrics regression-analysis statistical-analysis

Last synced: 03 Jan 2026

https://github.com/asuquoaa/predicting_viewer_engagement_with_educational_videos

This project uses machine learning to predict video engagement based on features such as transcript complexity, speaker speed, and silence periods. By understanding the factors influencing engagement, we can improve content recommendations and educational experiences.

data-visualization exploratory-data-analysis machine-learning scikit-learn

Last synced: 15 May 2026

https://github.com/malakasupun/crime-data-analysis-of-lapd

This project aims to explore and analyse crime patterns in Los Angeles using a dataset spanning from 2020 to the present. The primary focus is to extract meaningful insights by integrating structured data analysis and advanced techniques in SQL and Natural Language Processing (NLP).

data-analysis data-visualization llm nlp sql

Last synced: 29 Jul 2025

https://github.com/yash22222/literacy-exploration-analysis

Delve into India's literacy landscape through data analysis. Uncover regional disparities, high/low literacy states & gender imbalances.

csv data-analysis data-visualization government-data india literacy literacy-analysis states

Last synced: 29 Jul 2025

https://github.com/hauntedhost/modern-drive

ModernDive: An Introduction to Statistical and Data Sciences via R at http://www.moderndive.com

data-science data-visualization r statistics

Last synced: 29 Jul 2025

https://github.com/cyprianfusi/data-scientist-technical-exercise-10ds

With recommendations to UK Department for Education of 10 Local Authorities where National Tutoring Programme (NTP) should be intensified and a response to UK Secretary of Health regarding a 76% Accident and Emergency (A&E) performance target which seems far-fetched.

data-analysis data-cleaning data-visualization hypothesis-testing pandas-python policy statistics

Last synced: 21 Sep 2025

https://github.com/jabulente/tukey-s-hsd-for-pairwise-group-comparisons

This repository contains a Python project dedicated to performing Tukey’s Honest Significant Difference (HSD) test for pairwise group comparisons.

ai anova-analysis anova-test data-science data-visualization machine-learning math matplotlib-pyplot post-hoc post-hoc-analysis re real-world-problem-solving scipy-stats seaborn-plots statistics statsmodels string turkey-hsd

Last synced: 29 Jul 2025

https://github.com/alvinluo-tech/intimacy-tracker

Encounter — Map your shared journey. A secure digital sanctuary for couples to visualize their footprints, track shared patterns, and celebrate intimacy through data-driven storytelling.

couple-app data-visualization intimacy-tracker personal-growth tailwind-css

Last synced: 15 May 2026

https://github.com/dmarks84/ind_project_data-science-london-scikit-learn--kaggle

Independent Project - Kaggle Competition -- I worked on the Data Science London data set for the Data Science London + Scikit-learn competition.

classification cross-validation data-modeling data-reporting data-visualization dataframes eda grid-search matplotlib numpy pandas python sklearn statistics supervised-ml

Last synced: 06 Apr 2026

https://github.com/syncfusionexamples/creating-.net-maui-bubble-chart-to-visualize-gender-distribution-in-industrial-employment-in-2019

This article in the Syncfusion Blog explains the Gender Distribution in Industrial Employment in 2019 using .NET MAUI Bubble Chart

bubble-chart chart-of-the-week data-visualization dotnet dotnet-maui-sfcaretsianchart maui scatter-plot

Last synced: 03 Apr 2025

https://github.com/shawonsimon/azure-data-engineering

An end-to-end data engineering solution on Azure, transforming SQL Server data into Power BI reports using Data Lake, Data Factory, Databricks, Synapse, and Key Vault for security.

azure-keyvault data-engineering data-visualization databricks powerbi sqlserver synapse

Last synced: 15 May 2026

https://github.com/nathadriele/transaction_fraud_prevention_pipeline

Uma solução de detecção e prevenção de fraudes em transações financeiras, combinando Machine Learning, regras de negócio e análises estatísticas avançadas. O sistema oferece um dashboard interativo para monitoramento em tempo real, análise de dados e gestão de alertas de fraude.

data-analysis data-visualization docker fraud-prevention machine-learning matplotlib numpy pandas pipeline pytest python scikit-learn scipy seaborn streamlit tensorflow transaction xgboost

Last synced: 10 Apr 2026

https://github.com/jakobtroidl/barrio

A visual tool to compare and analyze nanoscale brain structures.

comparison data-visualization neuroscience scientific-visualization

Last synced: 09 Apr 2026

https://github.com/j4rviscmd/streamlit-advanced-dataframe

🚀 A powerful Streamlit custom component that extends st.dataframe with advanced features: filtering, sorting, row/cell selection, column resizing, virtual scrolling (60fps with 100K rows), and more. Built with React + TanStack Table v8.

data-table data-visualization dataframe pandas python react streamlit streamlit-component streamlit-custom-component tanstack-table typescript

Last synced: 09 Mar 2026

https://github.com/araltos/weather-forecast-app

A modern weather forecast application built with JavaScript, HTML, and CSS. It uses the OpenWeatherMap API to display current weather conditions and a 5-day temperature forecast, with Chart.js for data visualization.

api chartjs css3 data-visualization fetch-api geolocation-api git html5 javascript json macos

Last synced: 06 Apr 2026

https://github.com/rihib/querychat

LLM integrated BI tools for data-democratization

bi-tool data-visualization golang llm wip

Last synced: 09 Apr 2025

https://github.com/teamtigers/echartify

A web application built with .net core 2.2 that has come with the idea of reading the National Election's Data-set of Bangladesh in a fastest possible time and then representing the data-set with different statistical charts.

bangladesh chartjs code-first-migration cross-platform data-analysis data-structures data-visualization dotnet-core election-analysis election-data entity-framework-core materializecss mvc npoi razor-pages

Last synced: 16 Apr 2026

https://github.com/alrza2003/google-data-analysis-case-study-cyclistic

This project analyzes Cyclistic’s trip data to identify patterns in bike usage between casual riders and annual members. The findings help optimize marketing strategies and membership conversions.

business-task cyclistic-bike-share-analysis-case-study data-analysis data-science data-visualization google-data-analytics google-data-analytics-capstone-project google-data-analytics-professional jupyter-notebook python rmarkdown tableau

Last synced: 09 May 2026

https://github.com/robwiederstein/kytc_loc

Plot Kentucky licensing locations

data-visualization ggmap leaflet r xml2

Last synced: 31 Jul 2025

https://github.com/cunfuu/network-bubbles

For Easier to manage organizations and keeping notes about them to organize events and easy access their needs

data data-visualization organizations organizations-volunteer

Last synced: 31 Jul 2025

https://github.com/darkdk123/handwashing-discovery-analysis

A Guided Project in a Boot camp to Analyse the Original Data used in the Discovery of Viruses & Hand Washing By Dr. Ignaz Semmelweis in Vienna General Hospital in the 1840s.

data-analysis data-science data-visualization matplotlib-pyplot numpy pandas plotly-python python seaborn-plots

Last synced: 09 Apr 2026

https://github.com/sakshithbillava/expense-manager

A web-based expense tracking app built with Python and Streamlit, featuring real-time updates, data visualization, user authentication, and MongoDB integration.

authentication data-visualization expense-manager matplotlib mongodb numpy pandas personal-finance python streamlit webapp

Last synced: 09 Apr 2026

https://github.com/palwisha-18/time_series_analysis_lex_vs_gdp

Analyzes how a country’s GDP per capita correlates with the life expectancy of its citizens over a period of about 100+ years

data-analysis data-visualization pandas plotl time

Last synced: 19 May 2026

https://github.com/samuelkordik/chartcompletionviz

Visualizes performance across time for chart completion timeliness

data-visualization quality-improvement r

Last synced: 08 Sep 2025

https://github.com/syarwinaaa09/exploring-airbnb-market-trends

a data analysis project exploring NYC Airbnb listings, using data visualization and pandas for price trends, room types, and reviews.

airbnb data-analysis data-science data-visualization jupyter-notebook new-york-city nyc pandas price-analysis reviews room-types

Last synced: 30 Apr 2026

https://github.com/analyst-lochan/flight-delay-and-cancellation-dataset-2019-2023-

This project demonstrates a complete data analytics pipeline starting from raw real-world flight data to professional visual dashboards using SQL Server and Power BI. It showcases data import, cleaning, optimization, transformation, and dynamic DAX-based visual reporting.

airline-performance business-intelligence data-analysis data-cleaning data-modeling data-visualization dax etl flight-data kaggle-dataset portfolio-project powerbi powerbi-dashboard sql sql-server

Last synced: 09 Sep 2025

https://github.com/vishal-bhandary/sql-data-analytics

This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis.

analytics business-intelligence customer-segmentation dashboarding data-analysis data-reporting data-visualization data-warehouse etl kpi product-analysis sql sql-server star-schema t-sql

Last synced: 30 Jun 2026

https://github.com/ashwin331133/powerbi-data_professional_survey_breakdown

This project analyzes survey data from individuals interested in transitioning to the data field. The survey aims to understand their backgrounds, motivations, and the challenges they face. Using Power BI for data visualization, the project provides insights into the demographics and preferences of these aspirants.

data-analysis data-visualization powerbi

Last synced: 03 Jan 2026

https://github.com/jigyasag18/ai-ml-salaries-and-ai-tools-usage-trends

This repository presents an in-depth Power BI analytics report on the AI job market trends and student AI tool usage from 2020 to 2025. It combines structured datasets (job postings, salaries, surveys) with custom DAX measures to uncover key patterns in salaries, remote work, industry demand, and student engagement. 5 interaractive dashboards made.

analysis data data-analysis data-visualization dataanalysis dataanalytics dataset datavisualization power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-visuals powerbidashboard visualization

Last synced: 16 Feb 2026

https://github.com/jigyasag18/global-terrorism-1970-2017-analysis-using-big-data

This repository explores over 180,000 terrorist incidents across 205 countries using Hadoop and Power BI. The project identifies global and regional patterns in terrorism, analyzes the impact on civilians, and highlights high-risk areas. Key insights include attack trends,weapon usage,top terror groups,& country-specific risks like those in India.

big-data big-data-analytics data data-analysis data-visualization dataanalytics dataset hadoop hive hive-database hive-db hivedb power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-report-validation powerbi-visuals powerbidashboard

Last synced: 19 Feb 2026

https://github.com/jigyasag18/airline-performance-and-passenger-satisfaction-project-using-big-data-analytics

This project analyzes 10 years of U.S. domestic airline data (~3GB) using Hadoop (Cloudera) and Hive for data processing. Power BI dashboards visualize key metrics like delays, on-time rates, air time, and diversions. The solution includes Hive queries, DAX measures, HDFS ingestion scripts, and year-wise insights with recommendations.

big-data big-data-analytics bigdata cloudera cloudera-hadoop cloudera-hadoop-framework data data-analysis data-visualization database hadoop hive power-bi powerbi powerbi-dashboard powerbi-dashboards powerbi-report powerbi-visuals powerbi-visuals-tools powerbidashboard

Last synced: 01 Aug 2025