An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/tolumie/exploratory-data-analytics-projects

Exploratory Data Analytics – A collection of projects covering data exploration, feature engineering, hypothesis testing, and predictive modeling across diverse datasets, including insurance, real estate, laptops, cars, COVID-19, and the Olympics.

data-analysis data-visualization data-wrangling exploratory-data-analysis-eda feature-engineering hypothesis-testing machine-learning matplotlib numpy pandas predictive-modeling python seaborn statistical-analysis

Last synced: 11 Apr 2026

https://github.com/sayamalt/steel-energy-consumption-prediction-using-pyspark

Built a machine learning model using PySpark that predicts the energy consumption of the steel industry, reaching an R² score of approximately 99.5%.

apache-spark big-data-analytics big-data-processing cross-validation data-visualization exploratory-data-analysis hyperparameter-tuning machine-learning model-training-and-evaluation python regression spark sql

Last synced: 10 Mar 2026
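
For readers unfamiliar with the R² score quoted in the entry above, the following is a minimal plain-Python sketch of how the metric is usually computed. The function name and the sample numbers are illustrative assumptions and are not taken from the repository, which uses PySpark.

```python
def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - (residual sum of squares / total sum of squares)."""
    mean_y = sum(y_true) / len(y_true)
    # Squared error between each observation and its prediction
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    # Squared deviation of each observation from the mean
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Hypothetical energy readings and predictions, for illustration only
actual = [10.0, 12.0, 14.0, 16.0]
predicted = [10.5, 11.5, 14.5, 15.5]
print(r2_score(actual, predicted))
```

A score of 1.0 means the predictions match the observations exactly, and a score of 0.0 means the model does no better than always predicting the mean.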

https://github.com/nel-zi/nuga_bank

Developed an automated data exploration and cleaning pipeline for Nuga Bank to streamline data preparation, ensure consistent data quality, and normalize datasets into structured databases for efficient analysis and reporting.

data data-automation data-visualization datacleaning datatransformation etl-automation etl-pipeline

Last synced: 16 May 2025

https://github.com/sehaj003/telco-churn-analysis

This repository contains files (dataset and Jupyter codebooks) for a project that aims to build machine learning models to predict customer churn based on given parameters.

data-science data-visualization exploratory-data-analysis machine-learning machine-learning-models predictive-modeling principal-component-analysis python

Last synced: 20 May 2026

https://github.com/salma-mamdoh/investigating-netflix-movies-and-guest-stars-in-the-office

My project to learn the basics of analysis and visualization on DataCamp

data-analysis data-visualization datacamp matplotlib pandas python

Last synced: 11 Apr 2026

https://github.com/gustapinto/fatec_dsm_pi_terceiro_semestre

A visualizer for anime trends

anime chartjs data-visualization django etl

Last synced: 05 Apr 2025

https://github.com/shafaq-aslam/data-analytics-dairy

A comprehensive repository for Data Analytics learning and projects. It includes MySQL, Python, Power BI, Tableau, and Excel. The goal is to analyze data, generate insights, and create compelling visualizations for real-world datasets.

data-analysis data-visualization excel excel-based-data-analysis powerbi python-scripts sql sql-queries sql-queries-for-data-manipulation sql-query-for-data-visualization tableau

Last synced: 20 Jan 2026

https://github.com/fbarffmann/tornado-damage-dashboard

Built a Flask dashboard visualizing 1,000+ US tornadoes from 2023 using Leaflet.js and MongoDB. Interactive maps show tornado magnitude, damage, and frequency.

api data-visualization flask geospatial leaflet mongodb pandas python tornado-dashboard

Last synced: 11 Apr 2026

https://github.com/trim0500/fe-stats-classifier

An experiment to create a machine learning model via PyTorch to classify select Fire Emblem unit base stat distributions.

creational-patterns data-analysis data-science data-visualization design-patterns excel jupyter jupyter-notebook matplotlib-pyplot numpy pandas python python-modules python3 pytorch singleton

Last synced: 11 Apr 2026

https://github.com/hemangsharma/dataanalysis

This repo contains analyses of NASDAQ data, including a dashboard and a time series forecast

analysis data data-analysis data-visualization python

Last synced: 10 Mar 2026

https://github.com/82luli02/sakila_dvd_rental_database_analysis

Analysis of the Sakila DVD Rental database using SQL

data data-analysis data-science data-visualization sql

Last synced: 10 Mar 2026

https://github.com/vishwas-r/canvasjs-data-parser

CanvasJS Data Parser - Parse Data to CanvasJS accepted Format

canvasjs charts data-parser data-visualization javascript

Last synced: 31 Mar 2025

https://github.com/kernelshreyak/kaggle-notebooks

Collection of my Kaggle notebooks for data analysis and machine learning on a variety of datasets

data-analysis data-science data-visualization kaggle kaggle-competition machine-learning

Last synced: 27 Apr 2026

https://github.com/rohitdusane/power-bi-sentiment-analysis-insurance-coverage-dashboard

This project analyzes insurance data in Power BI, integrating sentiment analysis to evaluate customer feedback. Visualizations include Word Cloud, Decomposition Tree, and sentiment scoring to assess claims trends, policy performance, and demographic impacts, helping stakeholders optimize strategies and improve customer engagement. 📊

customer-feedback-analysis data-visualization decomposition-tree insurance-analysis powerbi sentiment-analysis wordcloud-visualization

Last synced: 27 Jan 2026

https://github.com/ezrahsieh/academicdatabasedashboard

The Academic Faculty and Research Insight Dashboard utilizes SQL and NoSQL databases and is designed to support academic institutions, research departments, and individual students by providing comprehensive insights into faculty members and their research activities.

dashboard data-visualization database-management mongodb mysql neo4j sql

Last synced: 21 Feb 2026

https://github.com/SebastianUrdaneguiBisalaya/datathon-expresate-peru-con-datos

Contains information on the 2 data analysis and data science projects presented at the Datathon of Peru's Secretariat of Government and Digital Transformation (Secretaría de Gobierno y Transformación Digital).

data-visualization jupyter-notebook machine-learning npl-data python

Last synced: 18 Jan 2026

https://github.com/adilshamim8/eda-on-health-and-sleep-data

Exploratory Data Analysis (EDA) on health and sleep data, uncovering patterns and insights using Python and visualization tools.

data-analysis data-visualization eda health healthcare sleep sleep-analysis

Last synced: 15 Mar 2025

https://github.com/gilberto-xyz/dataviz

Data visualization with Altair

altair data-visualization notebook visualization

Last synced: 10 Jun 2025

https://github.com/ginalamp/covid_dashboard_twitternews

Corona Dashboard & report based on Twitter media outlet news.

dashboard data-analysis data-visualization twitter

Last synced: 28 Jan 2026

https://github.com/vassilevsky/4sq73

A demonstration of the inaccuracy of Foursquare venue coordinates, using Ulyanovsk as an example

data-visualization foursquare-api geo yandex-maps

Last synced: 15 Mar 2025

https://github.com/thbaylson/datascience

All of my past data science assignments put into one singular notebook. Most of this comes from my Machine Learning course.

data-analysis data-science data-visualization decision-tree jupyter-notebook k-nearest-neighbors linear-regression machine-learning neural-network pandas-library python3 scikit-learn

Last synced: 09 May 2026

https://github.com/hyoaru/rph-retraction-relationship-visualization

A task output in GEC 3 RPH, retraction relationship visualization

data-visualization graph

Last synced: 31 Mar 2025

https://github.com/solygambas/d3-firebase

5 small projects to understand D3.js basics using Firebase and Materialize.

d3 d3js data-visualization firebase firestore javascript materialize materializecss

Last synced: 11 Apr 2026

https://github.com/srking501/futurelearn_mooc

A summative coursework for CSC8631 Data Management and Exploratory Data Analysis

crisp-dm data-mining data-preprocessing data-science data-visualization deployment eda exploratory-data-analysis

Last synced: 23 Mar 2025

https://github.com/atharva309/car-sales-dashboard-powerbi

An advanced and interactive dashboard for car sales and more, built with Power BI

dashboard data-visualization dax-query interactive-visualizations powerbi

Last synced: 11 Jan 2026

https://github.com/darkdk123/simple-heart-disease-classification

This experiment provides a comprehensive approach to forecasting heart disease risk through detailed data analysis, predictive modeling, and hyperparameter tuning. The result is a `LinearSVC` model with 90% accuracy.

classification-algorithm data-science data-visualization exploratory-data-analysis heart-disease-prediction machine-learning

Last synced: 17 Nov 2025

https://github.com/rahult18/atmo-flow

AtmoFlow is a robust data engineering pipeline built on Google Cloud Platform (GCP) that processes and analyzes weather and air quality data in both batch and streaming modes

airflow data data-modeling data-science data-visualization dataengineering gcp-bigquery gcp-cloud-composer gcp-cloud-functions pyspark

Last synced: 23 Jun 2026

https://github.com/danielrosehill/data-projects-index

Data apps and datasets deployed to Streamlit Community Cloud, Hugging Face, and elsewhere.

data-analysis data-science data-visualization

Last synced: 16 Mar 2026

https://github.com/abdul-aa/drug-sentiment-analysis

Extracting Themes and Sentiments in Birth Control Drug Reviews with DistilBert and LDA Topic Modeling

data-visualization distilbert lda-topic-modeling natural-language-processing sentiment-analysis tableau

Last synced: 22 Apr 2026

https://github.com/jigyasag18/aircraft-data-management

This repository offers a comprehensive simulation of global military air deployments involving 10 countries, aircraft models, mission types, and strategic zones. It analyzes air power distribution, mission intent (offensive, defensive, support), and geopolitical positioning. The project provides structured insights into regional & zone level threat

aircraft-data aircraft-performance data data-analysis data-visualization database database-management dataset datavisualisation mysql powerbi powerbi-report powerbi-visuals sql

Last synced: 04 Feb 2026

https://github.com/harshmule1/product-sales-anlysis

Product Sales Analysis Using Power BI

analysis data-visualization powerbi

Last synced: 04 Feb 2026

https://github.com/samaalharbi2/project-data-science-blog-post

A data science project from Udacity’s Nanodegree — exploring what drives developer success

crisp-dm data-analysis data-science data-visualization nanodegree udacity

Last synced: 26 Jan 2026

https://github.com/0xHericles/ufcg-geojson

GeoJSON file containing the blocks and buildings of the Federal University of Campina Grande.

data data-visualization geojson map open-source ufcg university

Last synced: 24 Mar 2025

https://github.com/petitatelier/data-sets

A collection of data sets, to play with in visualization experiments

data-visualization dataset

Last synced: 02 Jul 2025

https://github.com/petzi53/learning-plotly

Personal notes and trials during reading "Interactive web-based data visualization with R, plotly, and shiny" by Carson Sievert

data-visualization plotly visualization

Last synced: 16 Mar 2025

https://github.com/katrinleinweber/leaving-the-bar

A less-code variant of Joachim Goedhart's "Leaving the bar in five steps"

barchart boxplot boxplots data-visualisation data-visualization ggplot

Last synced: 20 Aug 2025

https://github.com/josephbarbierdarnal/matoolkit

matoolkit is a python package containing a toolbox for creating visually appealing graphs/annotations in matplotlib

data-analysis data-visualization matplotlib

Last synced: 31 Mar 2025

https://github.com/yuchenq/comp90055-project

This is the latest version of my project for COMP90055.

couchdb crawler data-visualization python3 textblob tweepy

Last synced: 16 Jul 2025

https://github.com/mattbixley/tidy_tuesday

A home for some #tidytuesday code, plots, tables, and general upskilling.

data-science data-visualization ggplot r4ds tidytuesday tidyverse

Last synced: 15 Feb 2026

https://github.com/datastalker/survival-cox

This repository contains an R script for performing survival analysis on breast cancer surgery data from the University of Chicago's Billings Hospital. The analysis includes Kaplan-Meier estimation and Cox Proportional Hazards modeling to assess patient survival.

breast-cancer-prediction cox-model data-analysis data-science data-visualization epidemiology kaplan-meier r survival-analysis

Last synced: 02 Apr 2025

https://github.com/living-with-machines/machines-interactive

This is the “machines interactive” for the Living with Machines exhibit at Leeds City Museum 2022–23.

data-visualization history-of-technology industrial-revolution machines museum museum-experience museum-installation

Last synced: 20 Jan 2026

https://github.com/githubsolver123/bus-tracker

Real-time bus tracking simulation built with R Shiny and Google Maps API. Visualizes bus movement along Broadway in NYC with 2-second position updates.

data-visualization geospatial gis google-maps-api r r-shiny real-time shiny simulation transportation web-application

Last synced: 01 Apr 2025

https://github.com/isaactrer/trading-gpt

TradeGPT is an intelligent trading bot built with ChatGPT and AI to automate and optimize trading strategies. It analyzes market data, predicts trends, and executes trades in real-time, providing traders with tools to enhance efficiency and profitability.

ai-trading algorithmic-trading automated-trading backtesting chatgpt crypto-trading data-visualization financial-analysis machine-learning market-data openai portfolio-management real-time-data risk-management sentiment-analysis stock-market technical-indicators trade-execution trading-strategies

Last synced: 25 Mar 2025

https://github.com/jabonsote/financial-anomaly-detection-with-deepseek-and-isolation-forest

🚀 Financial Anomaly Detection with DeepSeek and Isolation Forest – A powerful, locally-run tool for detecting financial anomalies using Isolation Forest and DeepSeek LLM. Features AI-powered insights, interactive time-series visualization, and automated PDF audit reports. 🔍📊

anomaly-detection chatbot data-visualization deepseek financial-analysis financial-data isolation-forest llm machienlearning ollama report-generator streamlit

Last synced: 12 Apr 2026

https://github.com/ekenes/elections-timeline

Data visualization showing the results of the previous 5 U.S. presidential elections in a single map.

arcgis-js-api data-visualization elections gis mapping

Last synced: 24 Mar 2025

https://github.com/albertofaraujo/excel_dashboard_prev_fraudes

The goal of the analysis is to extract information on the individual performance of employees at a fictitious company to support decision-making. (Dashboard in Excel)

analise-de-dados dashboard data-visualization excel

Last synced: 06 Jan 2026

https://github.com/urvee1810/market_basket_analysis

A data mining project analyzing Instacart's 3 million grocery orders to uncover customer shopping patterns and product associations. Using market basket analysis and the Apriori algorithm, the project reveals key insights about shopping behavior, product combinations, and temporal patterns, providing valuable recommendations for retail strategy

apriori-algorithm data-mining data-visualization machine-learning market-basket-analysis matplotlib mlxtend numpy pandas python seaborn

Last synced: 12 Apr 2026
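
The entry above mentions market basket analysis with the Apriori algorithm. As a rough illustration of the two measures that such an analysis is built on, support and confidence, here is a small plain-Python sketch. It is not the Apriori algorithm itself and not the repository's code (which uses mlxtend); the function names and the sample baskets are hypothetical.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    hits = sum(1 for basket in transactions if itemset <= set(basket))
    return hits / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Estimated probability of seeing `consequent` in a basket that contains `antecedent`."""
    both = set(antecedent) | set(consequent)
    return support(transactions, both) / support(transactions, antecedent)


# Hypothetical grocery baskets, for illustration only
baskets = [
    {"milk", "bread"},
    {"milk"},
    {"bread", "eggs"},
    {"milk", "bread", "eggs"},
]

print(support(baskets, {"milk", "bread"}))
print(confidence(baskets, {"milk"}, {"bread"}))
```

Apriori avoids scoring every possible item combination by only extending itemsets whose support already meets a chosen minimum threshold.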

https://github.com/naveen88112/healthcare

Healthcare Data Analysis and Forecasting. This project examines healthcare data by handling missing values with KNN imputation, preprocessing features, and training classification models (Logistic Regression and Random Forest). The output includes performance metrics such as accuracy, confusion matrix, precision, recall, and ROC analysis.

data-visualization feature-engineering machine-learning model-evaluation numpy pandas python scikitlearn-machine-learning

Last synced: 12 Apr 2026

https://github.com/shellynagar27/transportation-and-logistics-challenge

Analyzing logistics data to optimize shipment efficiency, reduce delays, and enhance supply chain visibility using Power BI. Insights include top routes, delays, supplier trends, and peak shipments.

cleaning-data critical-thinking data-analysis data-visualization exploratory-data-analysis feature-engineering powerbi preprocessing-data problem-solving python

Last synced: 16 May 2026

https://github.com/deaneeth/smart-beehive-dashboard

Real-time web dashboard for visualizing beehive metrics including temperature, humidity, weight, bee activity, and alerts. Built with React and Firebase, designed to work with ESP32-based Smart Beehive IoT monitoring hardware.

agriculture-tech beekeeping data-visualization environmental-monitoring firebase iot iot-dashboard nextjs react real-time-monitoring smart-farming typescript

Last synced: 12 Apr 2026

https://github.com/soumya-thoutam/covid-19-impact-on-u.s.-states-and-colleges

Covid-19 analysis and impact on United States Colleges and States using SQL and Tableau.

covid-19 dashboard data-analysis data-visualization dataset sql sql-server tableau

Last synced: 04 Sep 2025

https://github.com/yash-3-bit/online-sales-analysis

A project that merges datasets from different months and performs data cleaning, analysis, and visualization

data-analysis data-visualization pandas-library

Last synced: 27 Mar 2025

https://github.com/tejaswirupa/impact-of-workplace-stress-on-mental-health-conditions-of-employees

Studied how remote, hybrid, and onsite work affects employee stress and wellness. Engineered metrics to quantify fatigue and work-life balance, uncovering mental health trends across industries and roles.

data-visualization datascience exploratory-data-analysis feature-engineering

Last synced: 24 Jan 2026

https://github.com/tanaybhadula/twitter-trends-dashboard

An interactive dashboard that visualizes current Twitter trends by country and globally. It collects data for over 60 countries using the Python Tweepy library, processes it, and visualizes it as bar and pie charts using the Plotly Dash framework.

dash dashboard data-analysis data-visualization plotly python trends twitter

Last synced: 31 May 2026

https://github.com/aninditaws/investly

Investly: A personal finance platform for young investors, offering tailored portfolio recommendations by integrating user risk profiles, real-time market data, and optimization algorithms.

api-integration data-visualization goal-based-allocation react-frontend supabase-backend

Last synced: 01 Apr 2025

https://github.com/armahdavi/data_pipeline_analytics_statistics_ml_pm_psd_residential_qff

Shares all the data pipelines, processing code, statistical models, descriptive statistics, plot visualizations, and machine learning from Mahdavi & Siegel (2021) (Indoor Air). Project milestone: 2017-2020. Full-length article: https://onlinelibrary.wiley.com/doi/abs/10.1111/ina.12782

data-science data-visualization dust hvac indoor-air-quality jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats spyder spyder-python-ide statistics

Last synced: 11 Apr 2026

https://github.com/quantumudit/groceries-basket-analysis

This project performs market basket analysis using Power BI and Python to reveal associations between grocery items. It involves transforming raw transaction data into a processed dataset, creating interactive Power BI reports, and generating key insights through Python, enabling data-driven decision-making.

data-analysis data-visualization pandas powerbi python

Last synced: 12 Apr 2026

https://github.com/blarc/fri-staff-visualization

A simple visualization of the lectures and labs of the Faculty of Computer and Information Science in Ljubljana.

data-visualization p5js p5js-animation visualization

Last synced: 25 Mar 2025

https://github.com/sreekar0101/electric-vehicle-market-growth-and-incentive-impact-analysis-dashboard

This project involves the development of a comprehensive Tableau dashboard to analyze the growth and market dynamics of electric vehicles (EVs). The dashboard reveals key insights, including a 20% increase in EV adoption over five years, the dominance of Battery Electric Vehicles (BEVs) which make up 60% of the market

data-analysis data-visualization tableau-desktop

Last synced: 07 Jan 2026

https://github.com/devanshsahu47/hr-dashboard-mysql-powerbi

A comprehensive HR dashboard that visualizes key workforce metrics such as employee demographics, attrition rates, and performance trends. Built using Power BI/Excel, it enables data-driven HR decision-making with interactive charts and KPIs.

data-analytics data-visualization excel power-bi

Last synced: 04 Feb 2026

https://github.com/drtfloyd/psa-network-analyzer

In a complex professional world, understanding the true strength and relevance of your network is more critical than ever. The PSA (Presence Signaling Architecture) Network Analyzer is a sophisticated yet easy-to-use local tool designed to bring clarity, strategy, and ethical visibility to your professional relationships.

career-development csv-analysis data-anlaysis data-visualization human-to-human job-seeker linkedin network-analysis privacy-first professional-networking python realationship-management responsible-ai streamlit

Last synced: 29 Apr 2026

https://github.com/dianaow/fifa22-svelte

Svelte/D3.js: Visualizing club attributes in FIFA games from 2015 to 2022

d3-visualization d3js data-visualization fifa sports-analytics sports-visualisation svelte

Last synced: 29 May 2026

https://github.com/saba-gul/google_data_analystics_belabeat_fitness_capstone_project

This project focuses on leveraging Fitbit user data to derive valuable insights and facilitate data-driven decision-making for Bellabeat, a leading wellness company. The objective is to harness the wealth of information captured by Fitbit devices to enhance the wellness offerings provided by Bellabeat.

bellabeat-case-study bellabeat-eda data-analytics data-visualization fitbit google-casestudy

Last synced: 08 Jun 2026

https://github.com/erabossid/d3js-treemap

Data visualization with D3js Treemap

d3js data-science data-visualisation data-visualization reactjs

Last synced: 10 Mar 2025

https://github.com/ginga1402/data_visualization_on_honey_production_dataset

Data Visualization using Matplotlib & Seaborn Libraries

college-project data data-visualization

Last synced: 25 Aug 2025

https://github.com/tclzcja/spark-comparison-visualization

A data visualization project I made for Blue Telescope/HP.

client-project data-visualization

Last synced: 15 Jun 2025

https://github.com/rupeshrb/data_visualization

Data visualization is an important concept that applies to datasets

data-analytics data-visualization dataset python

Last synced: 17 May 2026

https://github.com/syncfusionexamples/ej2-angular-7-heatmap

A quick start project that helps you to create an Angular 7 Heatmap with minimal code configuration.

angular-heatmap angular7 data-visualization ej2-heatmap

Last synced: 03 Apr 2025

https://github.com/zahramh99/dynamic-pricing-strategy

Dynamic Pricing is an application of data science that involves adjusting the prices of a product or service based on various factors in real time. It is used by companies to optimize revenue by setting flexible prices that respond to market demand, demographics, customer behaviour and competitor prices.

business-intelligence data-science data-visualization demand-prediction dynamic-pricing machine-learning predictive-modeling price-prediction price-prediction-model pricing-strategy revenue-optimization ride-sharing

Last synced: 27 Jun 2025

https://github.com/angchekar28/valorant-gameplay-analysis

This project analyzes Valorant gameplay data to understand key factors affecting match outcomes. It compares various machine learning models to predict player performance, rank classification, and match success.

data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook machine-learning python

Last synced: 12 Apr 2026

https://github.com/dmdlgg/spotify-analysis

An interactive data analysis app built with Python, Pandas, Plotly, and Streamlit, showcasing insights about the top 1000 most played songs on Spotify. Dataset sourced from Kaggle. Users can explore the frequency, popularity, and most played songs by artist in a clean and intuitive interface.

data-analysis data-visualization pandas plotly python streamlit

Last synced: 11 May 2026

https://github.com/sravyatogarla/movie-recommendation-system

A complete Movie Recommendation System project implementing Popularity-Based, Content-Based, and Collaborative Filtering models using the MovieLens dataset. Built with Python, Pandas, and Plotly, featuring interactive inputs and visualizations.

capstone-project collaborative-filtering content-based-filtering data-science data-visualization edureka jupyter-notebook machine-learning movie-recomendation-system movielens pandas popularity-based-filtering python recommender-system scikit-learn sql

Last synced: 13 Apr 2026

https://github.com/nature40/casestudies

Case studies for testing the functionality of database systems, sensors, etc

casestudies data-analysis data-visualization database

Last synced: 02 May 2026