An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/urvee1810/market_basket_analysis

A data mining project analyzing Instacart's 3 million grocery orders to uncover customer shopping patterns and product associations. Using market basket analysis and the Apriori algorithm, the project reveals key insights about shopping behavior, product combinations, and temporal patterns, providing valuable recommendations for retail strategy.

apriori-algorithm data-mining data-visualization machine-learning market-basket-analysis matplotlib mlxtend numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/bryanfks-dev/klempoken-analysis

Analysis and forecasting model for Klempoken MSMEs

big-data-analytics data-analysis data-forecast data-visualization

Last synced: 01 Apr 2025

https://github.com/davidchocholaty/bithack_hackathon_2024

This repository contains my personal code tasks for the BIT_Hack hackathon, created in 2024.

data-mining data-science data-visualization exploratory-data-analysis hackaton hackaton-project machine-learning

Last synced: 06 May 2026

https://github.com/magnusrodseth/celeritas

A website for categorizing and visualizing data structures and algorithms.

algorithms data-structures data-visualization nextjs react tailwindcss typescript

Last synced: 12 Apr 2026

https://github.com/ashwin331133/gorkha_earthquake_damage_prediction

The main objective is to predict the level of damage to buildings caused by the 2015 Gorkha earthquake in Nepal.

data-analysis data-visualization machine-learning python

Last synced: 29 Apr 2026

https://github.com/zulfachafidz/telco_churn_insight_customer_loss_prediction_with_random_forest_and_decision_tree-algorithms

The main problem in the business world is customer churn, or losing customers, especially in the telecommunications industry, which experiences very tight competition. To overcome this problem, an analysis was carried out to help the company understand how many customers have the potential to switch providers.

data data-science data-visualization dataanalysis dataanalyst dataanalytics datadrivenwithdataprovider decision-tree decision-tree-classifier decision-trees random-forest random-forest-classifier

Last synced: 01 May 2026

https://github.com/toodef/light-engine

Lightweight and fast 3D visualisation engine

cpp data-visualization linux python visualization windows

Last synced: 11 Feb 2026

https://github.com/naveen88112/healthcare

HealthCare Data Analysis and Forecasting. This project examines healthcare data by handling missing values with KNN imputation, preprocessing features, and training classification models (Logistic Regression and Random Forest). The output includes performance metrics such as accuracy, confusion matrix, precision, recall, and ROC analysis.

data-visualization feature-engineering machine-learning model-evaluation numpy pandas python scikitlearn-machine-learning

Last synced: 12 Apr 2026

https://github.com/lucasfranklinsilva/rnn-lstm

Failure prevention model for simulated turbines using recurrent neural networks

data-visualization deep-learning jupyter-notebook keras machine-learning neural-networks python scikit-learn

Last synced: 12 Apr 2026

https://github.com/shellynagar27/transportation-and-logistics-challenge

Analyzing logistics data to optimize shipment efficiency, reduce delays, and enhance supply chain visibility using Power BI. Insights include top routes, delays, supplier trends, and peak shipments.

cleaning-data critical-thinking data-analysis data-visualization exploratory-data-analysis feature-engineering powerbi preprocessing-data problem-solving python

Last synced: 16 May 2026

https://github.com/anshajk/covid-vaccinations

A repository to track the rate of covid vaccinations in India

covid-19 data-visualization streamlit

Last synced: 17 May 2026

https://github.com/mahmoudnamnam/superstore-analysis

This project explores the SuperStore dataset to uncover insights into sales, profit, and customer behavior. It identifies key trends, regional variations, and product performance, using data analysis and machine learning techniques to guide business strategy and optimize performance.

clustering data-analysis data-science data-visualization geopandas jupyter-notebook machine-learning numpy pandas plotly regression seaborn sklearn

Last synced: 12 Apr 2026

https://github.com/deaneeth/smart-beehive-dashboard

Real-time web dashboard for visualizing beehive metrics including temperature, humidity, weight, bee activity, and alerts. Built with React and Firebase, designed to work with ESP32-based Smart Beehive IoT monitoring hardware.

agriculture-tech beekeeping data-visualization environmental-monitoring firebase iot iot-dashboard nextjs react real-time-monitoring smart-farming typescript

Last synced: 12 Apr 2026

https://github.com/crazy-dot/bank-laon-case-study

Used Exploratory Data Analysis (EDA) to analyse patterns in the bank dataset. The main aim was to analyse the list of potential defaulters and identify the causes of payment default. Tried to understand one of the risk assessment methods used by banks and replicated it for this project.

advanced-excel bank-loan-analysis data-analytics data-visualization exploratory-data-analysis statistical-analysis

Last synced: 06 Jan 2026

https://github.com/mostafa-ghorab/global-happiness-analysis

An analysis of global happiness rankings based on various factors like GDP, family support, health, and freedom from the World Happiness Report (2015-2017). This project provides data visualizations and statistical insights into how these factors influence happiness scores in different regions.

business-analysis data-analysis data-visualization matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/soumya-thoutam/covid-19-impact-on-u.s.-states-and-colleges

Covid-19 analysis and impact on United States Colleges and States using SQL and Tableau.

covid-19 dashboard data-analysis data-visualization dataset sql sql-server tableau

Last synced: 04 Sep 2025

https://github.com/acdh-oeaw/visartist

Visual Artwork Analysis and Collection Tool

color-clustering color-space data-visualization visual-analysis

Last synced: 13 Jul 2025

https://github.com/mackenly/suicide-rate-map-explorer

An interactive map that plots U.S. CDC suicide rate and population data on a county level built with React and Python.

cdc data-visualization react suicide-data suicide-prevention

Last synced: 01 Apr 2025

https://github.com/satvikpraveen/seabornmasterpro

🎨 SeabornMasterPro is a comprehensive, modular project to master Seaborn for data visualization. Includes themed utilities, advanced plotting notebooks, dashboards, time series, Streamlit app, and Docker support — perfect for learners, analysts, and open-source enthusiasts.

categorical-plots correlation-heatmap custom-theme data-visualization docker interactive-dashboard jupyter-notebook matplotlib modular-code multi-panel-layouts open-source-project pandas plot-utils project-structure python reproducible-research seaborn streamlit time-series-visualization utility-functions

Last synced: 12 Apr 2026

https://github.com/ddeepanshu-997/random-forest-classification

In this repository I perform Random Forest classification on the dataset. I first applied some data preprocessing techniques to filter out flaws in the data, then built the Random Forest classification model.

classification classification-model data-science data-visualization random-forest random-forest-classifier

Last synced: 28 Feb 2025

https://github.com/yash-3-bit/online-sales-analysis

Project: merging the datasets for different months and performing data cleaning, analysis, and visualization

data-analysis data-visualization pandas-library

Last synced: 27 Mar 2025

https://github.com/filip-kustura/python-covid-19-behaviors-analysis

Using Jupyter Notebook, this university project analyzes attitudes and behaviors related to the COVID-19 pandemic using a two-year survey from Imperial College London and YouGov research company. Utilizing Pandas, NumPy and Matplotlib, the data analysis focuses on three countries, exploring trends and insights throughout the pandemic.

covid-19 data-analysis data-visualization jupyter-notebook matplotlib numpy pandas python university-project

Last synced: 12 Apr 2026

https://github.com/quocduyenanhnguyen/roi-modeling-and-analysis-of-sports-dataset

In this project, you will find my ROI model for retirement savings and a PowerPoint presentation of it, as well as my data analysis and visualization of a Sports Ticket Sales dataset, which concluded with a group-written PDF report.

data-analysis data-visualization microsoft-excel rate-of-return-modeling sports-ticket-sales-dataset

Last synced: 08 Feb 2026

https://github.com/gustavo-victor/scatter-plot

Scatter Plot Graph in JS and D3

css d3 data-visualization html js scatter-plot

Last synced: 12 Apr 2026

https://github.com/albertofaraujo/pbi_dashboard_anp

Analysis of the average price of automotive fuel in Brazil over the course of 2022

data-visualization dax-studio power-query powerbi

Last synced: 06 Jan 2026

https://github.com/ddeepanshu-997/datascience_marketing_campaign

In this repository I apply data preprocessing techniques and try to find useful insights in the dataset, using various data science libraries together with a data visualization library to get precise outputs.

data-insights data-science data-visualization data-visualization-project datacleaning insights libraries matplotlib numpy-arrays output pandas-dataframe prepr techniques visualization visualization-library

Last synced: 09 Sep 2025

https://github.com/tejaswirupa/impact-of-workplace-stress-on-mental-health-conditions-of-employees

Studied how remote, hybrid, and onsite work affects employee stress and wellness. Engineered metrics to quantify fatigue and work-life balance, uncovering mental health trends across industries and roles.

data-visualization datascience exploratory-data-analysis feature-engineering

Last synced: 24 Jan 2026

https://github.com/saro0307/voronoi-diagram-for-classification

Uses a Voronoi diagram to subdivide a plane containing n randomly scattered points into exactly n cells, each enclosing the portion of the plane that is closest to one of the points.

artificial-intelligence data-visualization dataanalytics graph machine-learning matplotlib plot plotting pyplot python python3 voronoi voronoi-diagram

Last synced: 08 Jun 2026

https://github.com/tashi-2004/apache-spark-geospatial-air-quality-analysis

This project analyzes air quality data across regions to identify improvement areas, track trends, and classify similar regions using clustering. Leveraging PySpark, it processes sensor data, calculates Air Quality Index (AQI), and visualizes results with histograms and geographic maps to highlight areas with good air quality.

aqi aqi-prediction clustering data-science data-visualization geospatial-visualization kmeans-clustering predictive-modeling sensor-data time-series-analysis

Last synced: 25 Mar 2025

https://github.com/tanaybhadula/twitter-trends-dashboard

An interactive dashboard that visualizes data on current Twitter trends by country and globally. It collects data for over 60 countries using the Python Tweepy library, processes it, and visualizes it as bar charts and pie charts using the Plotly Dash framework.

dash dashboard data-analysis data-visualization plotly python trends twitter

Last synced: 31 May 2026

https://github.com/tanaybhadula/avacado-analytics

A dashboard built using Dash. It shows graphs of a dataset on sales and prices of avocados in the United States between 2015 and 2018.

css dash data-analytics data-visualization python

Last synced: 17 May 2026

https://github.com/audreyadora/r_data_analytics

RStudio Data Analytics Learning Journal

data-science data-visualization r-studio

Last synced: 04 Feb 2026

https://github.com/aninditaws/investly

Investly: A personal finance platform for young investors, offering tailored portfolio recommendations by integrating user risk profiles, real-time market data, and optimization algorithms.

api-integration data-visualization goal-based-allocation react-frontend supabase-backend

Last synced: 01 Apr 2025

https://github.com/armahdavi/data_pipeline_analytics_statistics_ml_pm_psd_residential_qff

Sharing all the data pipelines and processing code, statistical modelling, descriptive statistics, plot visualizations, and machine learning from Mahdavi & Siegel (2021) (Indoor Air). Project milestone: 2017 - 2020. Full-length article: https://onlinelibrary.wiley.com/doi/abs/10.1111/ina.12782

data-science data-visualization dust hvac indoor-air-quality jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats spyder spyder-python-ide statistics

Last synced: 11 Apr 2026

https://github.com/quantumudit/groceries-basket-analysis

This project performs market basket analysis using Power BI and Python to reveal associations between grocery items. It involves transforming raw transaction data into a processed dataset, creating interactive Power BI reports, and generating key insights through Python, enabling data-driven decision-making.

data-analysis data-visualization pandas powerbi python

Last synced: 12 Apr 2026

https://github.com/blarc/fri-staff-visualization

A simple visualization of the lectures and labs of the Faculty of Computer and Information Science in Ljubljana.

data-visualization p5js p5js-animation visualization

Last synced: 25 Mar 2025

https://github.com/abdelrahmanbayoumi/titanic-machine-learning-from-disasters

Given a training set of samples listing passengers who survived or did not survive the Titanic disaster, can our model determine whether the passengers in a test dataset, which does not contain the survival information, survived or not?

data-analysis data-science data-visualization machine-learning pandas

Last synced: 09 Apr 2025

https://github.com/borjamome/soho_cholera

Cholera deaths in the Soho District (London)

data-analysis data-visualization london r

Last synced: 04 Sep 2025

https://github.com/charlescro/reddit-classification-nlp

Analyzing subreddit language via Reddit API and NLP techniques.

data-analysis data-science data-visualization nlp-machine-learning reddit-api scikit-learn

Last synced: 03 Apr 2025

https://github.com/sreekar0101/electric-vehicle-market-growth-and-incentive-impact-analysis-dashboard

This project involves the development of a comprehensive Tableau dashboard to analyze the growth and market dynamics of electric vehicles (EVs). The dashboard reveals key insights, including a 20% increase in EV adoption over five years, the dominance of Battery Electric Vehicles (BEVs) which make up 60% of the market

data-analysis data-visualization tableau-desktop

Last synced: 07 Jan 2026

https://github.com/gregoritsch3/exercise_pandas_1

A Pandas exercise demonstrating the loading, cleaning and reorganization of data in .xlsx or .csv files, descriptive statistics, data visualization, statistical approximation of data with the normal distribution, etc. Libraries include Pandas, NumPy, SciPy, SymPy, Matplotlib,

data-cleaning data-visualization descriptive-statistics matplotlib numpy pandas scipy sympy

Last synced: 12 Apr 2026

https://github.com/omarkawach/info-viz

Project built with Nivo, React, and Leaflet for an Information Visualization course at the University of Victoria.

d3js data-visualization leaflet map react reactjs visualization

Last synced: 12 Apr 2026

https://github.com/devanshsahu47/hr-dashboard-mysql-powerbi

A comprehensive HR dashboard that visualizes key workforce metrics such as employee demographics, attrition rates, and performance trends. Built using Power BI/Excel, it enables data-driven HR decision-making with interactive charts and KPIs.

data-analytics data-visualization excel power-bi

Last synced: 04 Feb 2026

https://github.com/danielrosehill/eco-ninja-3

Configuration for an LLM assistant that performs analysis on sustainability data

data-visualization prompt-engineering prompting sustainability

Last synced: 22 Feb 2026

https://github.com/smpotts/sp500_index_analysis

Uses the Plotly Dash framework to visualize publicly available data for companies listed on the S&P 500 index

dash-plotly data-visualization financial-analysis pandas-dataframe python

Last synced: 01 Apr 2025

https://github.com/drtfloyd/psa-network-analyzer

In a complex professional world, understanding the true strength and relevance of your network is more critical than ever. The PSA (Presence Signaling Architecture) Network Analyzer is a sophisticated yet easy-to-use local tool designed to bring clarity, strategy, and ethical visibility to your professional relationships.

career-development csv-analysis data-anlaysis data-visualization human-to-human job-seeker linkedin network-analysis privacy-first professional-networking python realationship-management responsible-ai streamlit

Last synced: 29 Apr 2026

https://github.com/chinmayee4/vrinda_store_data_analysis

Analyzed data by creating an interactive dashboard using MS Excel

data-analysis data-cleaning data-visualization excel-dashboard pivot-tables power-query

Last synced: 07 Jan 2026

https://github.com/shrutiijoshi/e-commerce

The dataset contains various attributes related to orders, customers, and products, providing a comprehensive view of the sales process.

analysis data-visualization tableau-public visualization

Last synced: 07 Jan 2026

https://github.com/zeroxjackson/trendviz

A data visualization tool for Twitter trends in the United States.

data-visualization twitter

Last synced: 01 Apr 2025

https://github.com/getconversio/dig-the-data

Data visualizations for the Conversio blog

d3 data data-visualization

Last synced: 12 Apr 2026

https://github.com/saba-gul/google_data_analystics_belabeat_fitness_capstone_project

This project focuses on leveraging Fitbit user data to derive valuable insights and facilitate data-driven decision-making for Bellabeat, a leading wellness company. The objective is to harness the wealth of information captured by Fitbit devices to enhance the wellness offerings provided by Bellabeat.

bellabeat-case-study bellabeat-eda data-analytics data-visualization fitbit google-casestudy

Last synced: 08 Jun 2026

https://github.com/keshavg125/whatsapp-chat-analyzer

WhatsApp Chat Analyzer extracts insights from chat data, visualizing activity trends, emoji usage, and sentiment analysis using "ganeshkharad/gk-hinglish-sentiment". Built with Streamlit, Pandas, and Matplotlib for interactive analysis. 🚀

data-visualization emoji-analysis huggingface matplotlib nlp pandas python seaborn streamlit whatsapp-chat-analysis wordcloud

Last synced: 07 May 2026

https://github.com/supriya811106/whatsapp-chat-analyzer-app

Analyze WhatsApp chats with Python, Streamlit, and data visualization. Explore messaging patterns, content trends, and emoji usage to uncover insights from your conversations.

analyzer-web-app chat-analytics chat-analyzer data-preprocessing data-visualization emojis machine-learning matplotlib natural-language-processing nltk numpy pandas plotly python3 seaborn sentiment-analysis streamlit-webapp text-analysis user-engagement

Last synced: 30 Dec 2025

https://github.com/bkataru/physics-e.e

Project repository for IB physics extended essay. Topic: Predictive data modeling of a variable binary star’s brightness over a period of time using astrostatistics.

astrometry astronomical-algorithms astronomical-images astronomy astrophotography astrostatistics data-analysis data-science data-visualization modeling physics polynomial-regression regression-analysis

Last synced: 09 Apr 2025

https://github.com/sandravizz/analytical-system-design

Teaching material for bachelor course at Arcada

d3-js data-structures data-visualization system-design

Last synced: 24 Jan 2026

https://github.com/saroshfarhan/dublin_pedestrian_data_analysis

Pedestrian footfall data analysis for the city of Dublin

data-analysis data-visualization r-programming

Last synced: 07 Jan 2026

https://github.com/orbulant/hourlyweatherdata

Using R, I have created a comprehensive R tool (with Shiny, etc.) to analyse hourly weather data from two airport stations.

analysis data-visualization r rmarkdown rmarkdown-document weather

Last synced: 04 Apr 2025

https://github.com/yanny-alt/banking-customer-retention-analysis

The objective of this analysis is to identify factors contributing to the increased customer churn rate at the bank. The insights gained from this analysis will help business users make informed decisions and develop strategies to improve customer retention and reduce churn.

data-visualization power-bi powerbi-customer-churn-analysis

Last synced: 07 Jan 2026

https://github.com/chrisvilches/human-profiling

Monitors and analyzes the programs the user uses.

csharp data-visualization human-behavior winapi

Last synced: 16 Mar 2025

https://github.com/kirby-b/assorted-r-files

Mainly files from learning to use datasets and do data analysis with R

barchart data-visualization r-language r-programming

Last synced: 25 Mar 2025

https://github.com/erabossid/d3js-treemap

Data visualization with D3js Treemap

d3js data-science data-visualisation data-visualization reactjs

Last synced: 10 Mar 2025

https://github.com/ricardo-melo-martins/docker

⚡ RMM ⚡:: 🐳 docker with database for fun development

data-visualization database datascience docker mysql postgres sakila sakila-database sqlite

Last synced: 12 Apr 2026

https://github.com/bhaskarbharati/ibm-datascience-hands-on-lab

This is a basic hands-on exercise using Jupyter Notebook. The lab was completed while taking the IBM course Tools for Data Science.

data-analysis data-science data-visualization datawrangling eda machine-learning

Last synced: 23 Apr 2025

https://github.com/ginga1402/data_visualization_on_honey_production_dataset

Data Visualization using Matplotlib & Seaborn Libraries

college-project data data-visualization

Last synced: 25 Aug 2025

https://github.com/giog97/find_similar_tables_on_pubtables-1m

Find similar tables on the PubTables-1M dataset

data-analysis data-visualization datamining dm tables

Last synced: 09 Apr 2025

https://github.com/victorlcastro-dsa/pbl-datacamp

This repository features projects from DataCamp's Project-Based Learning (PBL) courses, showcasing practical applications of data analysis, machine learning, and visualization. Explore real-world datasets and interactive results that highlight the skills gained through hands-on learning.

data-analysis data-science data-visualization datacamp-projects hypothesis-testing machine-learning project-based-learning

Last synced: 29 Nov 2025

https://github.com/kathyreid/geelong-council-elections-2017

Chord diagram of distributed preferences based on Victorian Electoral Commission data

chord-diagram d3js data-visualization

Last synced: 13 Mar 2025

https://github.com/archanakokate/exploratory_data_analysis_global-terrorism_using_tableau

Using Tableau, conducted an in-depth analysis of terrorism incidents around the world.

analysis data-visualization tableau

Last synced: 04 Feb 2026

https://github.com/tclzcja/spark-comparison-visualization

A data visualization project I made for Blue Telescope/HP.

client-project data-visualization

Last synced: 15 Jun 2025

https://github.com/teja-1403/game-of-thrones-analysis

Demonstrates exploratory data analysis on the GOT dataset using plots, graphs, and information extracted from text.

analysis data-visualization datascience machine-learning python

Last synced: 12 Apr 2026

https://github.com/fbarffmann/citibike-covid-analysis

Analyzed NYC CitiBike usage during March 2020 to assess the impact of COVID-19 using Python and Tableau. Includes ridership breakdowns, user type trends, and an interactive dashboard.

citibike covid19 data-analysis data-visualization exploratory-data-analysis pandas python tableau transportation

Last synced: 12 Apr 2026

https://github.com/tuni56/support_ticket_streamlit

Interactive support ticket management system

data-science data-visualization python streamlit-webapp

Last synced: 29 May 2026

https://github.com/abhash-rai/analyzing-credit-card-eligibility

This work was performed as part of a BCU undergraduate course.

data-analysis data-visualization ggplot ggplot2 latex r

Last synced: 20 Jan 2026

https://github.com/faysalalmahmud/bd-med-professional-analysis

Analysis of healthcare professionals in Bangladesh through web scraping, data processing, and interactive visualization.

data-analysis data-visualization jupyter-notebook python scraper selenium selenium-webdriver tableau

Last synced: 04 Sep 2025

https://github.com/master-helix/11-7-commercial-store

A Data Analytics project for analyzing a Commercial Store dataset and building an interactive Excel dashboard for insights.

data-analytics data-visualization excel

Last synced: 04 Feb 2026

https://github.com/soumyajiitdas/My-GenAICapstoneProject

A Generative AI-powered journaling assistant that analyzes daily entries to extract emotions, stress levels, and mood trends — built using Google Gemini API for mental wellness insights.

ai-assistant data-visualization generative-ai machine-learning mental-health prompt-engineering python

Last synced: 04 Jul 2025

https://github.com/rupeshrb/data_visualization

Data visualization is an important concept that is applied to datasets

data-analytics data-visualization dataset python

Last synced: 17 May 2026

https://github.com/sanand0/storynetwork

Visualize where people are mentioned in stories and their inter-relationships

data-visualization

Last synced: 04 Sep 2025

https://github.com/soajala/shopify-sales-analysis-powerbi

End-to-end Power BI dashboard project analyzing Shopify sales data with real-time metrics, DAX, and business insights.

business-intelligence data-analysis data-visualization dax interactive-dashboard powerbi sales-analysis shopify

Last synced: 05 Sep 2025

https://github.com/govind-prakash/r

A collection of R scripts designed for basic bioinformatics and biostatistics projects. This repository includes scripts for data analysis, visualization, and statistical modeling, catering to researchers and students in life sciences.

data-science data-visualization r r-base rstudio statistics

Last synced: 05 Sep 2025

https://github.com/muhammed-fazal/student-success-and-early-intervention-analytics-system

Consolidates scattered student performance records into a unified data warehouse in SQL Server, and engineers interactive Power BI dashboards that visualize academic trends, identify student performance, and implement predictive analytics.

analysis analytics dashboard data data-analysis data-engineering data-science data-visualization database etl etl-pipeline power-bi powerbi python sql sql-server

Last synced: 29 May 2026

https://github.com/shridhar1504/loan-classification-datascience-project

This project uses machine learning algorithms to predict the classification of loan status. The dataset is loaded and transformed using SQL to obtain a proper dataset with valid information.

classification data-analysis data-cleaning data-science data-visualization eda loan-prediction loan-status machine-learning predictive-modeling sql supervised-learning

Last synced: 09 Apr 2025