An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/ujjwalll/get-that-flair

It is a repository for project detecting the flair of reddit post through their links. You can find the working model of it at - https://get-that-flair.herokuapp.com/

data-analysis data-visualization django-application herokuapp machine-learning naive-bayes-classifier praw-reddit python3 random-forest reddit-api sentiment-analysis topic-modeling

Last synced: 01 May 2026

https://github.com/devoloper-1/k-means-visualizer

Interactive K-Means clustering brought to life with animations, dynamic centroid splitting, and vibrant visuals. Learn and explore clustering like never before!

amcharts bootstrap5 css data-visualization html5 javascipt javascript k-means-clustering

Last synced: 16 Apr 2026

https://github.com/ehvenga/mis573

Data Science Master's Coursework: This repository contains projects, assignments, and notes from my Master's program in Data Science, showcasing skills in machine learning, analytics, and more.

business-analytics data-visualization tableau

Last synced: 19 Mar 2026

https://github.com/acdh-oeaw/visartist

Visual Artwork Analysis and Collection Tool

color-clustering color-space data-visualization visual-analysis

Last synced: 13 Jul 2025

https://github.com/atheeralzhrani/data-science-projects

This repository contains my data science projects, where I utilized tools and libraries such as Spark, Python, Pandas, NumPy, SQLite, Matplotlib, Seaborn, and performed Exploratory Data Analysis .

data-engineering data-preprocessing data-science data-visualization exploratory-data-analysis matplotlib pandas python python-lambda seaborn spark

Last synced: 09 Mar 2026

https://github.com/hanannazri/predictive-customer-churn-modelling-with-price-sensitivity-insights-

As a part of BCG Data Science Project, I developed a predictive churn-risk model for XYZ energy utility by engineering price, consumption and contract features and training XGBoost and Random Forest models to identify customers most likely to churn at current prices.

bcgx data-visualization exploratory-data-analysis feature-engineering jupyter jupyter-notebook model-training-and-evaluation numpy pandas random-forest scikit-learn xgboost

Last synced: 19 Aug 2025

https://github.com/tejas-130704/dataanalysis-hr-manager

Presence Insights of Employees This project provides insightful data analysis on employee attendance and presence, including work-from-home (WFH) data, sick leave records, and presence excluding holidays. The analysis spans a three-month period and is visualized using Power BI to help HR managers understand trends and optimize workflow.

dashboard data-analysis data-visualization hr-manager power-bi

Last synced: 01 Mar 2026

https://github.com/mehnaz2004/data-cleaning-casestudy

This repository demonstrates data cleaning with a layoffs dataset. It covers handling missing values, detecting outliers, and encoding categorical data, using visualizations like boxplots and distplots to enhance data quality. Check out the code to see these techniques in action.

categorical-data-encoding data-cleaning data-integrity data-visualization missing-value-handling outlier-detection-and-removal pandas seaborn sklearn

Last synced: 01 Mar 2026

https://github.com/badranalyst/airbnb-listings-data-visualization-insights-into-pricing-and-popularity

This project visualizes Airbnb listings data, focusing on price by zip code, average price per bedroom, annual revenue, and distinct bedroom listings. Interactive charts reveal trends and patterns, providing valuable insights for hosts and travelers to understand market dynamics.

dashboard data-visualization dataset tableau tableau-dashboards tableau-desktop tableau-public

Last synced: 01 Mar 2026

https://github.com/larsgw/nederlands-kruidkundig-archief

Biodiversity data of the Nederlands Kruidkundig Archief

botany data-visualization vegetation

Last synced: 19 Mar 2026

https://github.com/lgibson7/unemployment-in-stem

Final Project for STAT 650 Advanced R, Cal State East Bay Fall 2022

data-visualization exploratory-data-analysis statistics tidyverse

Last synced: 20 Mar 2026

https://github.com/cuadernin/dispositivos_analisis

Breve análisis de un conjunto de datos sobre dispositivos móviles

data-analysis data-science data-visualization descriptive-statistics jupyter-notebook python-3 seaborn

Last synced: 18 Apr 2026

https://github.com/eubrunoo/beer-consumption-predictor

An R project analyzing the impact of environmental factors on beer consumption in São Paulo, with a predictive linear regression model.

data-analysis data-science data-visualization machine-learning r statistical-analysis statistics

Last synced: 02 Apr 2025

https://github.com/virajbhutada/excel-reports

A curated showcase of interactive dashboards created using MS Excel, designed to transform raw data into clear, actionable insights. This collection demonstrates advanced Excel visualization techniques tailored for data-driven storytelling.

analytics business-intelligence dashboards data-cleaning data-science data-visualization ecommerce msexcel retail

Last synced: 01 Mar 2026

https://github.com/johannaschmidle/road-collisions-project

Analyzed road accident data in the UK from 2019 to 2022 to identify patterns and trends in road accidents, for Effective Road Management [Excel]

data-analysis data-visualization excel pivot-tables traffic-analysis

Last synced: 01 Mar 2026

https://github.com/living-with-machines/accidents-interactive

This is the “accidents interactive” for the Living with Machines exhibit at Leeds City Museum 2022–23.

accidents-analysis data-visualization industrial-revolution museum museum-experience museum-installation

Last synced: 20 Mar 2026

https://github.com/vaxdata22/redfin-analytics-etl-using-amazon-emr-by-airflow-on-ec2

This is an end-to-end AWS Cloud ETL project. This data pipeline uses an Amazon EMR cluster managed by Apache Airflow that is running on an AWS EC2 instance. It demonstrates how to build orchestration that would perform data transformation using Amazon EMR as well as automatic data ingestion into a Snowflake via Snowpipe. It also features Power BI.

amazon-emr-cluster apache-airflow apache-spark aws-ec2 aws-s3 business-intelligence dags data-visualization etl-pipeline google-colab-notebook orchestration power-bi pyspark redfin snowflake snowpipe sqs-queue

Last synced: 02 May 2026

https://github.com/hermann-web/projet-rythmes-urbains-heetch

Heetch's "Rythmes Urbains" project analyzes driver data to understand traffic patterns in Casablanca. It identifies peak hours, main driver-concentration areas, and daily traffic variations. The project involves geospatial mapping, time series analysis, and R programming

casablanca data-mapping data-visualization geospatial-analysis heetch hur open-data r-programming time-series-analysis traffic-analysis transportation urban-mobility

Last synced: 19 Mar 2026

https://github.com/qc20/emojiadventure

EmojiAdventure: An interactive digital art experience that generates a dynamic, ever-changing landscape of emojis using Perlin noise algorithms. Explore a vibrant world where emojis come to life through mouse interaction and creative coding techniques.

algorithmic-art creative-coding data-visualization digital-art emoji font-art generative-art interaction-design interactive-art interactive-graphics interactive-visualizations javascript mouse-interaction p5js perlin-noise ux-design web-animation web-art web-art-is-great-art web-fonts

Last synced: 21 Mar 2026

https://github.com/chdl17/ipl_analysis_r

This GitHub repository contains R code for analyzing Indian Premier League (IPL) data. The repository also includes a detailed report explaining the results and insights gained from the analysis. It is a great resource for anyone interested in understanding the performance of teams and players in the IPL using data science tools.

analysis data-analysis-r data-visualization flexdashboard

Last synced: 02 Mar 2026

https://github.com/mayankyadav23/amazon-sales-data-analysis

Diving into Amazon sales data to uncover hidden gems! 📈 Analyzing iNeuron's dataset to optimize sales strategies and boost performance 💡 Driving business growth with data-driven decisions! 💻

amazon data-analysis data-visualization ineuron-ai internship-project

Last synced: 02 Mar 2026

https://github.com/controldata23/foresight-institution

This analysis is an EDA done on a certain Educational Institution Dataset

data-visualization exploratory-data-analysis spreadsheets sql tableau

Last synced: 02 Mar 2026

https://github.com/itskshitija/blinkit-customer-and-order-analysis

This project aims to uncover meaningful insights and create a comprehensive visualization of Blinkit's operational data. Analyzing key aspects of Blinkit's customer base, order patterns, and product performance. Using three datasets—Customers, Orders, and Order Details.

data-visualization dataanalysis dataanalysisusingsql dataanalyst dataanalystportfolio mysql powerbidashboard sql

Last synced: 31 Mar 2025

https://github.com/sawaira-iqbal/data-visualization-project-on-car-sales-data

Explore Sales Data Visualization with Interactive Charts & Insights! 📊 Uncover trends and patterns to drive smarter automotive decisions.

bivariate-analysis data-science data-visualization interactive-visualizations matplotlib multivariate-analysis numpy plotly python seaborn univariate-analysis

Last synced: 12 Apr 2026

https://github.com/sanjurajveer/songs_dashboard

This is a movie recommendation project on the basis of content.

data-visualization file-handling matplotlib pandas tkinter-gui

Last synced: 18 May 2026

https://github.com/ashwin331133/liver_disease_detection

This dataset consists of 416 liver patient records and 167 non-liver patient records collected from North East of Andhra Pradesh, India. And The main objective of this project is to use classification algorithms to detect liver patients from healthy individuals.

data-visualization machine-learning numpy pandas python

Last synced: 16 Apr 2026

https://github.com/naninsv/marketing-analytics

The Marketing Analytics Project focuses on analyzing customer engagement, conversion rates, and sentiment trends to optimize marketing strategies for ShopEasy. Using SQL, Python (NLTK), and Power BI, this project extracts insights from customer reviews, transactions, and social media interactions to identify key areas for improvement.

data-visualization datanalysis excel marketing powerbi powerpoint python sql

Last synced: 23 Jun 2026

https://github.com/mostafa-ghorab/global-happiness-analysis

An analysis of global happiness rankings based on various factors like GDP, family support, health, and freedom from the World Happiness Report (2015-2017). This project provides data visualizations and statistical insights into how these factors influence happiness scores in different regions.

business-analysis data-analysis data-visualization matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/harshindcoder/salifort_motors_project

This people analytics project analyzes factors influencing employee turnover and predicts whether an employee is likely to leave. It aims to uncover patterns behind departures, helping Salifort improve retention, workplace culture, and professional growth strategies.

data-analysis data-science data-visualization hr-analytics machine-learning tree-models

Last synced: 02 May 2026

https://github.com/imtiaz-emu/mediandata

Connect remote database or upload CSV files to get proper insight from your data using charts and graphs.

data-visualization django django-admin jquery postgresql

Last synced: 16 Apr 2026

https://github.com/silent0wings/steganopixelcrypt

SteganoPixelCrypt is a C++ tool that encodes text into images by mapping characters to pixel colors. It provides a simple form of visual encryption, turning data into colored patterns that can be decoded back to text. Lightweight, fast, and ideal for basic steganography.

cli-tool cplusplus data-visualization encryption image-processing pixel-art steganography text-to-image utf-32 visualization-tool

Last synced: 26 Aug 2025

https://github.com/rosanafss/r-ladies-bh-workshop-metricas

Como Plotar Métricas e Entregar Valor para Times Ágeis

data-analysis data-visualization r

Last synced: 25 Aug 2025

https://github.com/benedart/interactive-information-visualization

Interactive Information Visualization Project

d3 data-visualization information-visualization

Last synced: 05 May 2026

https://github.com/elena563/wordviz

WordViz is a Python visualization library designed for exploring and visualizing word embeddings.

bert clustering contextual-embeddings cosine-similarity data-visualization nlp python3 scatterplot word-embeddings

Last synced: 27 Mar 2026

https://github.com/berkeley-gif/caladapt-website-2021

Redesign and rewrite of the website for Cal-Adapt.org

cal-adapt california climate-change climate-models data-visualization svelte

Last synced: 26 Jan 2026

https://github.com/quocduyenanhnguyen/twitter-despicable-me-4-hashtag-engagement-analysis

In this project, I explored Despicable Me 4 hashtag on Twitter to gather engagement metrics for data analysis over a one week period.

csv-files data-analytics data-cleaning data-visualization mysql python3 tableau tableau-dashboards tableau-public twitter twitter-hashtag

Last synced: 16 May 2026

https://github.com/faithererer/haokanvideo_spider

好看视频爬取与数据分析

data-analysis data-visualization python spider

Last synced: 02 May 2026

https://github.com/rbreeze/dashboard

My personal health dashboard, with daily stats on food and sleep. Undergone several redesigns since 2015.

css dashboard data data-visualization design front-end google-sheets google-sheets-api health html javascript personal-health-record personal-website running static static-site visualization

Last synced: 02 May 2026

https://github.com/anuragagarwal96/hospital-mortality-rate-sql-analysis

In this project, I have taken a hospital dataset from Kaggle, analysed it and predicted the mortality rate of patients who have been admitted in hospitals. I have utilised a combination of SQL, Tableau and Microsoft Excel for this project.

data data-visualization dataanalysis dataanalysisusingsql excel msexcel mssqlserver sql tableau tableau-public

Last synced: 09 Mar 2026

https://github.com/mahmoudnamnam/superstore-analysis

This project explores the SuperStore dataset to uncover insights into sales, profit, and customer behavior. It identifies key trends, regional variations, and product performance, using data analysis and machine learning techniques to guide business strategy and optimize performance.

clustering data-analysis data-science data-visualization geopandas jupyter-notebook machine-learning numpy pandas plotly regression seaborn sklearn

Last synced: 12 Apr 2026

https://github.com/jofaval/melbourne-housing

Data Analysis of the Housing Market in Melbourne, Australia in 2016-2017

data-analysis data-science data-visualization deep-learning google-colab kaggle machine-learning melbourne python xgboost

Last synced: 16 Apr 2026

https://github.com/rauhanahmed/auto-data-analyzer

AutoDataAnalyzer: Automate data ingestion, analysis, and visualization with AI/ML-powered pipelines. Features natural language query processing, interactive Plotly visualizations, and seamless deployment via Docker.

ai-powered-analysis automated-pipeline cicd data-analysis data-visualization docker end-to-end-project flask generative-ai langchain llama3-1 machine-learning natural-language-processing plotly python3 pywebio

Last synced: 12 Apr 2026

https://github.com/dpb44/superstore-business-performance-analysis

Interactive Dashboard & Performance Report Highlighting Key Insights and Strategic Recommendations

business-analytics dashboard data-visualization dataanalytics excel superstore-data-analysis

Last synced: 04 Mar 2026

https://github.com/shridhar1504/milk-production-time-series-forecasting-datascience-project

This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast the milk production. The model is evaluated using various metric.

adf arima-model augmented-dickey-fuller-test data-analysis data-analytics data-science data-visualization eda exploratory-data-analysis machine-learning machine-learning-algorithms python python3 residuals sarimax seasonality time-series time-series-forecasting trends

Last synced: 02 May 2026

https://github.com/samuelson777/titanic-dataset-analysis

Exploratory data analysis of the Titanic dataset, uncovering insights on passenger survival rates based on gender, age, and class. Includes data cleaning, visualization, and findings.

data-analysis data-visualization exploratory-data-analysis kaggle machine-learning matplotlib pandas python seaborn titanic-dataset

Last synced: 16 Apr 2026

https://github.com/pdiegel/nc-parcel-search

A web application enhancing access to North Carolina's parcel data with advanced search capabilities and an interactive map interface.

arcgis data-visualization geospatial gis map north-carolina parcel-data react typescript vercel

Last synced: 16 Apr 2026

https://github.com/themihirmathur/qlik-intern-project

Qlik Analysis of Road Safety & Accident Patterns in India 📈 Analyzed & visualized road safety data for 20.85k+ accident cases with 9+ accident data patterns in India using Qlik 📉 Reduced inefficiencies by 25% by developing design of an avant-garde data tracking dashboard that monitored injuries.

data-analysis data-visualization presentation qlik qlik-cloud qlik-sense qlikview

Last synced: 04 Mar 2026

https://github.com/marben06/rent-in-germany

Interactive visualizations and maps depicting topics around rent prices and income in Germany built with Svelte.

charts d3 d3-visualization d3js data-analysis data-visualization gis gis-data infographic infographics map mapbox mapbox-gl mapbox-gl-js mapboxgl svelte

Last synced: 27 Apr 2026

https://github.com/prince-pastakiya/human-resources-tableau-project

👥 Interactive Tableau dashboard for HR analytics — includes workforce overview, demographics, income analysis, and detailed employee records with full filtering.

chatgpt data-analysis data-visualization human-resources numpy python python-faker tableau-dashboards tableau-public

Last synced: 18 Apr 2026

https://github.com/shellynagar27/transportation-and-logistics-challenge

Analyzing logistics data to optimize shipment efficiency, reduce delays, and enhance supply chain visibility using Power BI. Insights include top routes, delays, supplier trends, and peak shipments.

cleaning-data critical-thinking data-analysis data-visualization exploratory-data-analysis feature-engineering powerbi preprocessing-data problem-solving python

Last synced: 16 May 2026

https://github.com/teja-1403/ignosis-tech-ml-assignment

Analysis of transaction data to identify the most profitable products and key customer segments, providing insights for targeted marketing strategies.

customer-segmentation data-analysis data-visualization machine-learning marketing-strategy python

Last synced: 02 May 2026

https://github.com/toodef/light-engine

Lightweight and fast 3D visualisation engine

cpp data-visualization linux python visualization windows

Last synced: 11 Feb 2026

https://github.com/jigyasag18/power-bi-dashboard-project

The Ecommerce Sales Analysis Dashboard project utilizes Power BI to provide detailed insights into ecommerce sales data, enabling stakeholders to track key performance metrics and uncover trends. This interactive dashboard allows users to explore the data in real-time, offering features such as drill-down capabilities, customizable filters.

dashboard data data-visualization datacleaning datanalysis datanalytics datapreprocessing powerbi visulaization

Last synced: 04 Mar 2026

https://github.com/arthurdanjou/studies

💼 This is the repository containing all my projects done during my studies in Python and R.

ai data data-science data-visualization jupyter jupyter-notebook ml python r

Last synced: 08 Apr 2025

https://github.com/johannaschmidle/bookauthors

Explored a book sales database. Cleaned data using Excel and created an interactive dashboard to analyze author popularity, ratings, and sales trends. The project highlighted key insights such as sales performance and rating distributions [Excel]

author-sales book-sales books data-analysis data-visualization excel

Last synced: 04 Feb 2026

https://github.com/donmaruko/flask-data-analysis

Flask API for statistical calculations. Data analysis, cleansing, visualization, and manipulation. Documented by Swagger.

api api-rest data-analysis data-science data-visualization datascience flasgger matplotlib pandas seaborn sqlite wordcloud

Last synced: 03 May 2026

https://github.com/digitaboss/d3js-heatmap

Global Heatmap presentation with D3js and Reactjs

d3js data-science data-visualization heatmap javascript reactjs

Last synced: 22 Aug 2025

https://github.com/johannaschmidle/netflix-subscription-analysis

Examined Netflix subscription data to understand market behaviour, predict future trends, and identify consumer preferences. [SQL, Tableau]

data-analysis data-cleaning data-trend data-visualization netflix

Last synced: 05 Mar 2026

https://github.com/vaxdata22/salifort-motors-and-waze-churn

Employee retention predictive model development for Salifort Motors and Waze. This is a terminal project I did to earn the Google Advanced Data Analytics Professional Certificate.

data-analytics data-visualization model-development predictive-analytics python statistical-analysis

Last synced: 16 Apr 2026

https://github.com/digitaboss/d3js-treemap

Data visualization with D3js Treemap

d3js data-science data-visualisation data-visualization reactjs

Last synced: 22 Aug 2025

https://github.com/anjalikumari021/ecommerce_sales_dashboard_using_powerbi

Developed an insightful Ecommerce Sales dashboard using Power BI to provide insights.

charts dashboard data-cleaning data-visualization powerbi

Last synced: 05 Mar 2026

https://github.com/gaurav-0211/seaborn-for-data-visualization

This Project aims to different plotting methods using seaborn for data Visualization.

data-visualization jupyter-notebook matplotlib numpy pandas-dataframe seaborn

Last synced: 20 Apr 2026

https://github.com/kristishqau/ApartmentRegressionAnalysis

This data science project aims to predict apartment prices through regression analysis. The dataset used contains information about apartments, and the project involves various steps such as data preprocessing, exploratory data analysis, feature engineering, and building a decision tree regression model.

apartment-prices data-preprocessing data-science data-visualization decision-tree-regression jupyter-notebook python3

Last synced: 22 Aug 2025

https://github.com/satyacoder29/e-commerce-sales-analysis

Performed E-commerce Sales Analysis to identify trends, optimize sales, and improve decision-making. Analyzed customer patterns, seasonal trends, and product performance using Python, SQL, and Power BI. Delivered actionable insights to enhance revenue, streamline inventory management, and boost customer engagement.

data-analysis data-visualization datacleaning msexcel pivottables powerquerym visualisation vlookups

Last synced: 05 Mar 2026

https://github.com/jigyasag18/amazon-prime-power-bi-dashboard

The Amazon Prime Power BI Project is a centralized data storage system containing detailed information on movies and TV shows available on Amazon Prime Video, including metadata and analytics insights. It supports data-driven decision-making for content acquisition and viewer engagement strategies. This repo is optimized for querying & analysis.

dashboard data data-visualization dataanalysis dataanalytics datacleaning dataset powerbi powerbi-dashboards powerbi-report powerbi-visuals powerbidashboard

Last synced: 05 Mar 2026

https://github.com/vasugi2003/customer_churn_analysis_using_tableau

Customer Churn Analysis - To identify various reasons for customer to discontinue a company services.

business-analytics business-intelligence charts csv data-science data-visualization dataanalytics predictive-modeling preprocessing tableau

Last synced: 05 Mar 2026

https://github.com/pizofreude/divvybikes-share-success

Developing data-driven marketing campaign for Divvy to convert casual riders into annual members. Divvy is a bike-share program of the Chicago Department of Transportation (CDOT).

airflow bi-analytics data-analysis data-engineering data-visualization database dbt docker etl jupyterlab python r redshift s3

Last synced: 17 Apr 2026

https://github.com/gerhynes/d3-mobile-subscription-literacy-scatterplot

A D3 scatterplot showing mobile phone subscriptions against literacy rates. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 02 May 2026

https://github.com/beyzabasarir/northwind-traders-analysis

Northwind dataset analysis using PostgreSQL, Python, and Power BI. Focused on sales, customers, shipping, and performance insights.

dashboard data-analysis data-visualization jupyter-notebook matplotlib numpy pandas postgresql powerbi python seaborn

Last synced: 10 Apr 2026

https://github.com/hari7261/data-visualization

Python-based application built using CustomTkinter for the graphical user interface (GUI) and Matplotlib for data visualization. It allows users to import datasets, perform real-time data visualization, and analyze data using various chart types and machine learning techniques.

data-analysis data-visualization export hari7261 import python realtime-visualization

Last synced: 17 Jun 2025

https://github.com/hafs96/prediction_consommation-de-carburant

Dans ce projet, l'objectif est de développer un modèle permettant de prédire si une voiture a une consommation de carburant élevée ou faible en fonction de ses caractéristiques techniques.

analysis data data-visualization machine-learning testing training

Last synced: 09 Jun 2026

https://github.com/nathadriele/ifood-data-governance-pipeline

Este projeto demonstra uma solução completa de Data Governance com foco em qualidade, rastreabilidade, segurança e conformidade com LGPD. Utiliza tecnologias modernas como Streamlit, Airflow, dbt e Pydantic para implementar um ecossistema funcional e interativo com dashboard de governança de dados.

airflow dashboard data-analysis data-catalog data-engineering data-governance data-quality data-visualization dbt ifood lgpd matplotlib numpy observability-data pandas pipeline pyspark redis seaborn streamlit

Last synced: 02 Apr 2026

https://github.com/ruajean/netflixmoviescraper

🎬 A powerful tool for gathering movie data and user reviews from FilmAffinity's Netflix category. This script scrapes movie details and iterates through user reviews, saving structured information to a CSV file for analysis. Ideal for insights into user sentiments and movie popularity on FilmAffinity.

data-analysis data-visualization dataset jupyter-notebook python scraping

Last synced: 17 Apr 2026

https://github.com/peterjakubowski/nyc-tree-census

Exploratory data analysis (EDA) with python of the 2015 New York City Street Tree Census dataset.

data-visualization exploratory-data-analysis geopandas matplotlib pandas python seaborn

Last synced: 17 Apr 2026