An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/chandkund/housing-price-prediction

Predict housing prices using the Boston Housing Dataset. Covers data loading, cleaning, preprocessing, EDA, normalization, standardization, and regression models (Linear Regression, Decision Tree, Random Forest, Extra Trees). Evaluated with Mean Squared Error (MSE). Tech: Python, Pandas, NumPy, Scikit-learn, Seaborn, Matplotlib.

data-science data-visualization matplotlib numpy pandas pyhton sklearn sklearn-library sklearn-metrics

Last synced: 21 Jan 2026

https://github.com/salma-mamdoh/the-android-app-market-on-google-play-project

My project aims to practice Data Analysis and Data Visualization on DataCamp

data-analysis data-visualization datacamp jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/allanreda/automated-k-means-clustering-engine

An interactive K-Means clustering tool built with Flask and Scikit-Learn, supporting Excel file uploads, cluster analysis, and data export, deployed on Google Cloud Run via Docker with CI/CD integration.

cicd css data-visualization deployment docker flask google-cloud-run html javascript k-means-clustering machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 19 Jan 2026

https://github.com/controldata23/automobiles-data-exploration

An Exploratory Data Analysis done on an Automobiles dataset from kaggle

data-exploration data-visualization eda jupyter-notebook matplotlib python-data-analysis

Last synced: 19 Jan 2026

https://github.com/gorlix/sp-grafana-bridge

A lightweight, real-time data bridge to export Super Productivity tasks to InfluxDB v2 for advanced analytics on Grafana. Transform your time-tracking into actionable insights.

data-visualization grafana influxdb-v2 plugin productivity-tool super-productivity superproductivity time-tracking

Last synced: 31 May 2026

https://github.com/itskshitija/hr-data-analysis

The HR Data Analytics Dashboard project uses Power BI to analyze employee data, visualizing key HR metrics and KPIs to support data-driven decisions for improving workforce management, employee satisfaction, and organizational growth.

analytics data-science data-visualization dataanalysis dataanalytics hrdataanalysis powerbi-desktop powerbidashboard

Last synced: 21 Jan 2026

https://github.com/alexmcvay/uber-data

UBER sql clone

data data-visualization sql

Last synced: 19 Jan 2026

https://github.com/azaz9026/email-spam-detection

Welcome to the Email Spam Detection project! This repository provides a machine learning model for detecting spam emails using a Naive Bayes classifier and a simple web interface built with Streamlit.

data-analysis data-cleaning data-structures data-visualization deep-learning machine-learning python sql streamlit

Last synced: 14 Apr 2026

https://github.com/vinay-jose/territorial-sales-dashboard

EDA was carried out in the sales data of Atliq Technologies and a Dashboard was created in PowerBI to draw insights.

data-analysis data-visualization powerbi-desktop sql

Last synced: 11 Oct 2025

https://github.com/mitgar14/etl-workshop-2

Workshop #2 (ETL process using Airflow) for the ETL course using Apache Airflow to build a data pipeline.

airflow data-engineer data-engineering data-visualization etl pandas postgresql powerbi python sqlalchemy

Last synced: 14 Apr 2026

https://github.com/timjjting/task-lineage-generator

A simple Task Lineage Diagram Generator

d3 dag data-visualization golang graphviz-dot lineage task

Last synced: 21 Jan 2026

https://github.com/agarwalrachit399/fifa-19-analysis

Analysis of one of the most popular games ,FIFA-19 on tableau and a detailed report for the same

dashboard data-visualization fifa19 tableau

Last synced: 19 Jan 2026

https://github.com/louisfernando1204/websocket-benchmark

A comprehensive performance testing and analysis suite designed to evaluate and compare different WebSocket server implementations across various programming languages and libraries.

benchmarking broadcast-test coder-websocket csv data-analysis data-visualization echo-test golang gorilla-websocket nodejs python3 socket-io websocket-client websocket-server ws

Last synced: 09 Apr 2026

https://github.com/petitatelier/data-generators

A collection of data generators, to play with in visualization experiments

data-generator data-visualization

Last synced: 13 Oct 2025

https://github.com/ashioyajotham/data_centers_are_eating_the_world

This idea is mostly how supercomputers (AI data centers) are coming up fast so it is an attempt to map them like Semi-Analysis does

data-center data-visualization supercomputers

Last synced: 14 Oct 2025

https://github.com/ayorick23/python-data-science-cheat-sheet

Guía rápida y práctica de sintaxis, comandos y funciones esenciales de Python para Ciencia de Datos. Perfecta para recordar cómo usar las librerías más comunes como NumPy, Pandas, Matplotlib y Scikit-learn en tus análisis diarios.

cheat-sheet data-analysis data-science data-visualization deep-learning jupyter-notebook machine-learning matplotlib ml numpy pandas python scikit-learn scipy seaborn statistics sympy tensorflow

Last synced: 07 Apr 2026

https://github.com/mahambilalandahaan/week8

K-Means Deep Dive: Clustering analysis with Elbow and Silhouette methods in Python

clustering data-visualization jupyter-notebook k-means machine-learning python scikit-learn unsupervised-learning

Last synced: 20 Apr 2026

https://github.com/praveendecode/phonepe_pulse

Phonepe Pulse Data Visualization and Exploration: A User-Friendly Tool Using Streamlit and Plotly

data-visualization dataanalysis financial-analysis mongodb postgres python sql streamlit-dashboard

Last synced: 14 Apr 2026

https://github.com/5hraddha/ice-video-games-sales-analysis

Analysis of video game sales data of an online store Ice, which sells video games all over the world & identify patterns that determine whether a game succeeds or not.

data-visualization matplotlib numpy pandas requests-module scipy seaborn

Last synced: 14 Apr 2026

https://github.com/harshindcoder/sleeping_time_survey_analysis

This survey analysis aims to identify lifestyle factors—such as screen time, exercise, alcohol consumption, beverages, and sleep direction—that affect sleep quality and duration. By analyzing these factors, we seek to predict sleep patterns and provide actionable insights to improve sleep health and overall well-being.

data-cleaning data-collection data-visualization exploratory-data-analysis regression-models survey-analysis

Last synced: 15 Oct 2025

https://github.com/madhursinghbhadoriya/cutomer_data_analysis1

Customer_Data Analysis - Tableau

chart data-visualization tableau

Last synced: 22 Jan 2026

https://github.com/mindlessmuse666/iris-ml-based-on-decision-trees

Проект демонстрирует применение моделей машинного обучения на основе деревьев решений и случайного леса для классификации набора данных Iris. Включает в себя загрузку данных, обучение моделей, оценку производительности и визуализацию результатов. Предназначен для изучения основ машинного обучения и анализа данных.

classification data-analysis data-visualization decision-trees iris-dataset machine-learning model-evaluation python random-forest scikit-learn

Last synced: 17 Oct 2025

https://github.com/yulia-momotyuk/dla-data-analysis-practice

This repository contains my homework assignments completed during the "Data Analyst in IT" course at Data Loves Academy.

analytics data-analysis data-visualization excel mysql numpy pandas postgres powerbi python seaborn sql tableau

Last synced: 14 Apr 2026

https://github.com/tejaswirupa/united-airlines-flight-gain-analysis

Analyzed 30K+ United Airlines flights to evaluate time gained or lost during flight. Used hypothesis testing to compare on-time vs. late departures, identifying routes with the highest average time gain.

data-science data-visualization hypothesis-testing r statistical-analysis

Last synced: 27 Jan 2026

https://github.com/umutdinceryananer/ctis471-supervised-ml

This project, developed in Python as part of CTIS471, integrates data analysis, machine learning models, and visualization techniques.

data-visualization machine-learning supervised-learning

Last synced: 20 Oct 2025

https://github.com/emirhansilsupur/hotel-booking-analytics-dashboard

Interactive Power BI dashboard visualizing hotel booking metrics for two Portuguese properties (Algarve resort & Lisbon city).

dashboard data-visualization power-bi

Last synced: 27 Jan 2026

https://github.com/kamiviolet/d3_collections

As decribed, all kinds of chart and data visualisation with D3

charts d3 data-visualization

Last synced: 29 Apr 2026

https://github.com/dianaow/ploty-dash-fifa

Dash/Plotly (Python): Visualizing high-level score attributes and connections of players in the FIFA 22 game

data-visualization fifa fifa-2022 plotly plotly-dash python soccer sports-analytics sports-visualisation

Last synced: 23 Oct 2025

https://github.com/jofaval/sonar

Binary Classification of Sonar Signals of Rocks and Metal cylinders in 1987

data-analysis data-science data-visualization machine-learning python scikit-learn sonar uci

Last synced: 09 Apr 2026

https://github.com/pocper1/visual-volume

Visual Volume visualizes volumetric data using Three.js and WebGL, rendering 3D data from sources like CT scans.

3d-graphics ct-scans data-visualization three-js volume-rendering webgl

Last synced: 28 Jan 2026

https://github.com/maryamteimouri/dataanalysis-and-knowledgediscovery

This project aims to practice the steps of Crisp Data Mining ( CRISP-DM ). The repository includes 3 phases, data understanding, supervised learning, and unsupervised learning.

agglomerative-clustering classification clustering crisp-dm cross-validation data-preprocessing data-visualization dendogram k-means-clustering one-hot-encode pca principal-component-analysis z-score

Last synced: 23 Jan 2026

https://github.com/tommcn/ontario-hs-data

A simple visualization of Ontario High School data

data-visualization r

Last synced: 25 Oct 2025

https://github.com/simongoring/giphyr

A wrapper for getting giphy GIFs in R

data-visualization gif gifs giphy package r

Last synced: 26 Oct 2025

https://github.com/srinibas-masanta/infosys-springboard-internship

An interactive Power BI dashboard developed during my Infosys Springboard Internship to visualize Indian election trends. It integrates historical and live API data to analyze vote shares, turnout patterns, and demographic insights across constituencies, helping news agencies report results in real time.

dashboard data-analysis data-cleaning data-collection data-visualization dax-functions powerbi

Last synced: 25 Feb 2026

https://github.com/ledsouza/curso_de_estatistica_parte_2

Projeto de estatística voltado para conceitos como probabilidade e amostragem

data-science data-visualization pandas scipy seaborn vitrinedev

Last synced: 07 May 2026

https://github.com/eduardo-j-morales/erp-fusion-dashboard

A modern enterprise resource planning dashboard demonstrating user role management, real-time data visualization, and complex business operations handling. Built with Vue 3 and Vuetify 3 for enterprise-grade applications.

business-intelligence dashboard data-visualization enterprise-resource-planning erp-system pinia role-based-access-control vue3 vuetify

Last synced: 24 Feb 2026

https://github.com/aaisha-nexus/sql_excel_project

A data analysis project using MS SQL Server and Excel, focusing on database setup, querying, data cleaning, and visualization with PivotTables & Charts. Explored real-world sales data, financial KPIs, and interactive dashboards.

charts dashboard data-visualization dataanalysis dataanalytics databases excel kpi mssqlserver pivot-tables queries sql

Last synced: 15 May 2026

https://github.com/annnieglez/fraud-detection-eda

Fraud Detection - Exploratory Data Analysis (EDA). Analyzing financial transactions to detect fraud patterns using Python and Tableau. Libraries: Pandas, Seaborn and Matplotlib. Key Focus: Data cleaning, fraud trends, high-risk transactions, time-based patterns

data-analysis data-science data-visualization eda fraud-detection fraud-prevention matplotlib seaborn

Last synced: 28 Jan 2026

https://github.com/svdexe/powerbi-mysql-hr_dashboard

Comprehensive HR analytics dashboard combining SQL data processing and Power BI visualization. Features employee demographics, attrition analysis, hiring trends, and geographical distribution with interactive filters and insights.

dashboard data-visualization hr-analytics mysql powerbi

Last synced: 28 Jan 2026

https://github.com/angchekar28/sales-report-power-bi

A Power BI sales report analyzing country-wise and product-wise sales trends. Includes dashboards, decomposition trees, and key influencers analysis for business insights.

dashboard data-analysis data-cleaning data-visualization powerbi sales-report

Last synced: 16 Mar 2026

https://github.com/vbs360/sql_mysql_for_data_analytics_and_business_intelligence

This repository contains SQL exercises and projects based on the "SQL - MySQL for Data Analytics and Business Intelligence" course.

data-engineering data-management data-manipulation data-science data-visualization

Last synced: 29 Jan 2026

https://github.com/samjoesilvano/airline_ticket_fare_prediction

Airline Fare Prediction using Machine Learning focuses on developing a Random Forest model to predict flight prices, achieving an R² score of 0.804. The project includes hyperparameter tuning using RandomizedSearchCV, alongside extensive data preprocessing and feature engineering to ensure robust model performance.

airline-fare-prediction data-preprocessing data-visualization feature-engineering feature-selection hyperparameter-tuning machine-learning pandas python random-forest randomizedsearchcv regression-analysis scikit-learn

Last synced: 15 Apr 2026

https://github.com/agarwalrachit399/excel-dashboards

Interactive and simple dashboards build in excel.

dashboard data-visualization excel

Last synced: 30 Jan 2026

https://github.com/naninsv/zomato_dashboard

An interactive Power BI dashboard analyzing Zomato’s sales performance, user trends, and city-wise insights. Features include sales trends, user engagement, top-performing cities, and food category distribution to help drive data-driven business decisions

business-intelligence data-visualization dataanalysis excel powerbi sql

Last synced: 30 Jan 2026

https://github.com/jofaval/titanic-disaster

Data Analysis of the famous Titanic Disaster in 1912 with Machine Learning

classification data-analysis data-science data-visualization google-colab kaggle machine-learning python scikit-learn

Last synced: 15 Apr 2026

https://github.com/l-gre/data_analytics_for_finance

Comprehensive course materials for the Data Analytics for Finance - Master Programme, covering data manipulation, statistical analysis, visualisation, automation, and real-world case studies using industry-standard tools.

automation data-cleaning data-manipulation data-visualization excel hypothesis-testing industry-applications matplotlib numpy pandas python real-world-case-studies regression-analysis seaborn sql statistical-analysis tableau workflow-automation

Last synced: 15 Apr 2026

https://github.com/shafaq-aslam/pandas-lab

A comprehensive collection of Jupyter notebooks exploring Pandas, from Series and DataFrames to data cleaning, aggregation, merging, and visualization. A complete hands-on guide for mastering data manipulation and analysis with Python.

analytics data-analysis data-cleaning data-science data-visualization dataframe jupyter-notebook machine-learning pandas pandas-dataframe pandas-library pandas-series python python3 series

Last synced: 15 Apr 2026

https://github.com/steviecurran/gbt-scripts

IDL scripts for the reduction of Green Bank Telescope data

data-analysis data-compression data-visualization radio-astronomy spectroscopy

Last synced: 31 Jan 2026

https://github.com/zainulabdeenofficial/data-visualization

This project showcases the use of Pandas for data visualization and analysis. It involves working with a movie dataset from TMDB (The Movie Database), and the goal is to analyze and present insights from the dataset through various visualizations.

data-visualization

Last synced: 01 Jun 2026

https://github.com/amotivv-inc/snowflake-memory-box

Snowflake Memory Box: AI analytics with persistent memory using Cortex, Claude AI and Memory Box

ai analytics data-visualization memory snowflake

Last synced: 01 Feb 2026

https://github.com/rohitdusane/healthcare-analytics

𝐏𝐨𝐰𝐞𝐫 𝐁𝐈 𝐃𝐚𝐬𝐡𝐛𝐨𝐚𝐫𝐝 is designed to provide valuable insights into patient waiting times across outpatient and inpatient healthcare services. It offers a comprehensive analysis of key factors influencing wait lists, including Age Profile, Specialty, Time Bands, and Patient Case Types.

data-analysis data-visualization dax dax-query healthcare-analysis powerbi-report

Last synced: 01 Feb 2026

https://github.com/vishnu-vamshii/data-science-jobs-salaries

Created an interactive dashboard to analyze data science jobs salaries in different regions of the world, experience levels, average salaries in USD and type of employment along with a geographical visual.

data-analysis data-science data-visualization tableau tableau-dashboard

Last synced: 01 Feb 2026

https://github.com/amoghkori/data-analysis-using-algebraic-and-geometric-methods-in-statistics

This project utilizes algebraic and geometric methods in statistics to analyze real-life data using R. It involves conducting various statistical tests, creating Bayesian network graphs, employing linear regression models, and evaluating model fit.

data-interpretation data-visualization graphical-models linear-regression model-evaluation rprogramming statistical-analysis

Last synced: 02 Feb 2026

https://github.com/sroman0/data-analytics

Data Analytics Exercises is a collection of comprehensive university-level exercises aimed at enhancing skills in data analytics. The repository includes practical notebooks covering data manipulation, exploratory data analysis (EDA), statistical analysis, data visualization, and machine learning fundamentals.

data-analysis data-analytics data-science data-visualization education exercises exploratory-data-analysis hands-on-practice jupyter-notebook machine-learning python statistics

Last synced: 15 Apr 2026

https://github.com/mayurasandakalum/ipl-data-engineering-spark-databricks

Comprehensive data engineering and analytics project using IPL dataset with Amazon S3, Apache Spark, Databricks, and SQL. Includes data storage, transformation, analysis, and visualization.

amazon-s3 apache-spark aws big-data cricket-analytics data-analytics data-engineering data-visualization databricks etl-pipeline ipl-dataset machine-learning python sql

Last synced: 09 Feb 2026

https://github.com/franckalbinet/teuvo

Self-Organising Map implemented as Literate Programming

data-visualization dimensionality-reduction neural-network

Last synced: 09 Feb 2026

https://github.com/pedromarquetti/spotify-analyser

Data analyser for Spotify account usage

data-visualization python spotify

Last synced: 18 Mar 2026

https://github.com/pejpero/neural_network_regression_and_classification

This repository contains neural network regression models built from scratch and using Keras for comparison. It visualizes training and testing performance, analyzing MSE, R², and decision boundaries. The project demonstrates learning techniques and optimization for regression tasks.

data-visualization keras machine-learning neural-network regression

Last synced: 09 Feb 2026

https://github.com/ludreinsalvador/global-covid-19-data-analysis

Contains Power BI dashboards that visualizes and analyzes global COVID-19 cases, deaths, and vaccination trends using data from the World Health Organization (WHO). The project aims to provide insights into the pandemic’s impact and vaccination progress worldwide through dynamic reports and advanced analytics.

analytics covid-19 covid19-data data data-analysis data-collection data-transformation data-visualization

Last synced: 26 Feb 2026

https://github.com/shruti23-ui/blinkit-powerbi-dashboard

A comprehensive Power BI dashboard analyzing Blinkit's sales performance, outlet metrics, and multi-tier market analytics with interactive visualizations and business intelligence insights.

data-analysis data-visualization microsoft-excel microsoft-power-bi powerbi sales-analysis sql

Last synced: 09 Feb 2026

https://github.com/sreekar0101/bank-financial-loan-performance-trend-analysis

About This project analyzes the performance trends of financial loans using SQL for data extraction and Tableau for visualization. The goal was to perform exploratory data analysis (EDA) to understand key metrics like loan applications, funded amounts, interest rates, and debt-to-income ratios using sql and tableau for visualization

data-analysis data-visualization sql tableau

Last synced: 27 Feb 2026

https://github.com/chinmayee4/sales-analysis-for-ferns-n-petals

Analyzed Data By Creating Interactive Dashboard Using MS Excel

data-analysis data-cleaning data-visualization excel pivot-tables powerquery

Last synced: 11 Feb 2026

https://github.com/haonamnguyen/data-science-job-analysis

Evaluate the factors influencing salary trends in the data science industry, including experience levels, job titles, employment types, company sizes, and remote work arrangements, to help HR teams and hiring managers make data-driven decisions regarding compensation packages and recruitment strategies.

data-analysis data-science data-visualization jupyter-notebook python

Last synced: 16 Apr 2026

https://github.com/bhavinpatel4199/artificial-intelligence--algorithm-and-mathematics

This repository focuses on AI with an emphasis on algorithms and mathematical foundations. It includes projects on data processing, fundamental AI algorithms, and mathematical concepts like linear algebra and optimization. Hands-on work with various frameworks provides practical model-building experience.

algorithms-and-data-structures data-structures data-visualization mathematic probability problem-solving python3 sklearn

Last synced: 11 Feb 2026

https://github.com/rodrigojunqueiradev/python-exercises

Repositório para armazenar exercícios realizados na linguagem Python / Repository to organize exercises with Python language

data-analysis data-science data-structures data-visualization database math pandas pandas-python python python-3 python3 sql statistics

Last synced: 16 Apr 2026

https://github.com/kaustubh-indulkar/ola-data-analytics-project

End-to-End Ola Data Analytics project using SQL, Excel, and Power BI. Analyzes a dataset of 20000+ rides to gain insights and create visualizations.

data-analytics data-visualization excel powerbi sql

Last synced: 27 Feb 2026

https://github.com/arshc0der/n.o.v.a-geospatial-ozone-predictor

An AI-powered geospatial intelligence dashboard for predicting atmospheric ozone levels using 27 years of NASA data. Features 3D climate mapping and live satellite tracking.

atmospheric-science climate-tech dashboard-ui data-visualization desktop-app geospatial-analysis gis machine-learning matplotlib ozone-prediction pandas python random-forest-regressor satellite-tracking scikit-learn tkinter windows-executable

Last synced: 01 Mar 2026

https://github.com/thlindustries/mortalidade_neonatal_python_react

Uma plataforma de visualização de dados montada utilizando Python e React com a library de visualização do Plotly

data-analysis data-visualization plotly python python3 react reactjs

Last synced: 16 Apr 2026

https://github.com/m-biriulova/m-biriulova

Hi! I'm Mariia — a Python Developer and Data Analyst passionate about automation, web development, and smart data solutions. Open to collaborations and freelance projects

automation data-analyst data-visualization freelancer github-profile open-to-work portfolio python-developer telegram-bots web-scraping

Last synced: 12 Feb 2026

https://github.com/rohitblaze10/-excel-_seller_store_analysis

A collection of data analysis projects showcasing data cleaning, exploration, visualization, and machine learning. Using "Excel" and more to uncover insights and drive data-driven decision-making. Feel free to explore, contribute, or collaborate!

data-analysis data-visualization excel excel-export

Last synced: 12 Feb 2026

https://github.com/nabilshadman/power-bi-essential-training

Exercise files for Power BI Essential Training (2024): datasets and dashboards for hands-on learning

dashboard data-analysis data-science data-visualization power-bi power-bi-dashboard

Last synced: 12 Feb 2026

https://github.com/yalai92/alfalfa_imp_exp_analysis

This repository covers data cleaning, analysis, and visualization of global alfalfa and pellet imports, focusing on trends from 2003 to 2023. It also includes a predictive analysis of global alfalfa demand for 2024-2029, using data science techniques to provide insights for stakeholders in the alfalfa industry.

data-analysis data-cleaning data-visualization matplotlib numpy pandas python sckiit-learn tableau

Last synced: 12 Feb 2026

https://github.com/kaushik-puttaswamy/blinkit-sales-performance-analysis-powerbi

An interactive Power BI dashboard analyzing BlinkIT’s sales performance, outlet metrics, and customer insights.

data-visualization dax-expression powerbi

Last synced: 03 Mar 2026

https://github.com/karlyndiary/mavens-pizza-sales-insight

Analyzing Maven Pizza's sales performance and business insights by exploring key metrics, product trends, customer behavior, and peak sales periods, utilizing SQL for querying and Excel for dashboard visualizations.

analysis data-exploratory data-pipeline data-visualization eda etl excel-dashboard pizza-dataset sql

Last synced: 19 Mar 2026

https://github.com/bhavyabhargava/2022_aus_fed_election_analysis

R based in-depth analysis of voter behavior during the **2022 Australian Federal Election**

behavioral-patterns data-visualization election-analysis geospatial-data rprogramming sf

Last synced: 28 Feb 2026

https://github.com/saba-gul/business-intelligence-case-study-flyingwhale-airline

FlyingWhale Airline BI Case Study: Analyzing customer flight activity and loyalty history to optimize operations and enhance customer experience.

airline-data-analysis airline-datasets business-analytics business-intelligence dashboards data-visualization dataanalytics eda flyingwhale presentation

Last synced: 13 Feb 2026

https://github.com/marialuizaleitao/dsa-powerbi

Este repositório contém uma série de análises e projetos construídos durante o curso de Power BI para Business Intelligence e Data Science da Data Science Academy.

analytics business-intelligence data-visualization dax-expression machine-learning powerbi

Last synced: 14 Feb 2026