An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/saro0307/voronoi-diagram-for-classification

Uses a Voronoi diagram to subdivide a plane containing n randomly scattered points into exactly n cells, each enclosing the portion of the plane that is closest to one point.

artificial-intelligence data-visualization dataanalytics graph machine-learning matplotlib plot plotting pyplot python python3 voronoi voronoi-diagram

Last synced: 08 Jun 2026
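
The Voronoi rule described in the entry above (every location belongs to the cell of its closest point) can be sketched as a nearest-site lookup. This is a minimal illustration and not code from the listed repository; the function name `nearest_site`, the seed, and the random sites are all assumptions made for the example.

```python
import math
import random


def nearest_site(point, sites):
    """Return the index of the site closest to `point`, i.e. its Voronoi cell."""
    return min(range(len(sites)), key=lambda i: math.dist(point, sites[i]))


# Hypothetical data: n random sites scattered on the unit square.
random.seed(0)
n = 5
sites = [(random.random(), random.random()) for _ in range(n)]

# Every site is at distance 0 from itself, so each one falls in its own cell
# and the plane is split into exactly n cells.
cells = [nearest_site(s, sites) for s in sites]
print(cells)

# Classify an arbitrary query point by the cell it lands in.
print(nearest_site((0.5, 0.5), sites))
```

A brute-force lookup like this costs one distance computation per site for each query, which is fine for small n; a library routine that builds the diagram explicitly would be the usual choice for plotting cell boundaries.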

https://github.com/tashi-2004/apache-spark-geospatial-air-quality-analysis

This project analyzes air quality data across regions to identify improvement areas, track trends, and classify similar regions using clustering. Leveraging PySpark, it processes sensor data, calculates Air Quality Index (AQI), and visualizes results with histograms and geographic maps to highlight areas with good air quality.

aqi aqi-prediction clustering data-science data-visualization geospatial-visualization kmeans-clustering predictive-modeling sensor-data time-series-analysis

Last synced: 25 Mar 2025

https://github.com/tanaybhadula/avacado-analytics

A dashboard built with Dash that graphs a dataset on avocado sales and prices in the United States between 2015 and 2018.

css dash data-analytics data-visualization python

Last synced: 17 May 2026

https://github.com/audreyadora/r_data_analytics

RStudio Data Analytics Learning Journal

data-science data-visualization r-studio

Last synced: 04 Feb 2026

https://github.com/aninditaws/investly

Investly: A personal finance platform for young investors, offering tailored portfolio recommendations by integrating user risk profiles, real-time market data, and optimization algorithms.

api-integration data-visualization goal-based-allocation react-frontend supabase-backend

Last synced: 01 Apr 2025

https://github.com/armahdavi/data_pipeline_analytics_statistics_ml_pm_psd_residential_qff

Shares all the data pipelines and processing code, statistical modelling, descriptive statistics, plot visualizations, and machine learning from Mahdavi & Siegel (2021) (Indoor Air). Project milestone: 2017 - 2020. Full-length article: https://onlinelibrary.wiley.com/doi/abs/10.1111/ina.12782

data-science data-visualization dust hvac indoor-air-quality jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats spyder spyder-python-ide statistics

Last synced: 11 Apr 2026

https://github.com/quantumudit/groceries-basket-analysis

This project performs market basket analysis using Power BI and Python to reveal associations between grocery items. It involves transforming raw transaction data into a processed dataset, creating interactive Power BI reports, and generating key insights through Python, enabling data-driven decision-making.

data-analysis data-visualization pandas powerbi python

Last synced: 12 Apr 2026

https://github.com/abdelrahmanbayoumi/titanic-machine-learning-from-disasters

Given a training set of samples listing passengers who survived or did not survive the Titanic disaster, can our model determine, from a test dataset that does not contain the survival information, whether the passengers in that test dataset survived?

data-analysis data-science data-visualization machine-learning pandas

Last synced: 09 Apr 2025

https://github.com/charlescro/reddit-classification-nlp

Analyzing subreddit language via Reddit API and NLP techniques.

data-analysis data-science data-visualization nlp-machine-learning reddit-api scikit-learn

Last synced: 03 Apr 2025

https://github.com/gregoritsch3/exercise_pandas_1

A Pandas exercise demonstrating the loading, cleaning, and reorganization of data in .xlsx or .csv files, descriptive statistics, data visualization, statistical approximation of data with the normal distribution, etc. Libraries include Pandas, NumPy, SciPy, SymPy, Matplotlib,

data-cleaning data-visualization descriptive-statistics matplotlib numpy pandas scipy sympy

Last synced: 12 Apr 2026

https://github.com/devanshsahu47/hr-dashboard-mysql-powerbi

A comprehensive HR dashboard that visualizes key workforce metrics such as employee demographics, attrition rates, and performance trends. Built using Power BI/Excel, it enables data-driven HR decision-making with interactive charts and KPIs.

data-analytics data-visualization excel power-bi

Last synced: 04 Feb 2026

https://github.com/danielrosehill/eco-ninja-3

Configuration for an LLM assistant that performs analysis on sustainability data

data-visualization prompt-engineering prompting sustainability

Last synced: 22 Feb 2026

https://github.com/drtfloyd/psa-network-analyzer

In a complex professional world, understanding the true strength and relevance of your network is more critical than ever. The PSA (Presence Signaling Architecture) Network Analyzer is a sophisticated yet easy-to-use local tool designed to bring clarity, strategy, and ethical visibility to your professional relationships.

career-development csv-analysis data-anlaysis data-visualization human-to-human job-seeker linkedin network-analysis privacy-first professional-networking python realationship-management responsible-ai streamlit

Last synced: 29 Apr 2026

https://github.com/shrutiijoshi/e-commerce

The dataset contains various attributes related to orders, customers, and products, providing a comprehensive view of the sales process.

analysis data-visualization tableau-public visualization

Last synced: 07 Jan 2026

https://github.com/jansim/nicknames

Specify human-readable names for the columns in your data once and then reuse them across your project to rename plot axes, dataframe columns, tables, and anything else.

data-cleaning data-visualization r r-package

Last synced: 04 Sep 2025

https://github.com/zeroxjackson/trendviz

A data visualization tool for Twitter trends in the United States.

data-visualization twitter

Last synced: 01 Apr 2025

https://github.com/getconversio/dig-the-data

Data visualizations for the Conversio blog

d3 data data-visualization

Last synced: 12 Apr 2026

https://github.com/wisdom-osborn/data-analytics-course-online-

🔍 Data Analytics with Python — Hands-on Course Materials Jupyter notebooks, projects, and datasets based on the freeCodeCamp Data Analysis with Python certification. Learn NumPy, Pandas, data cleaning, and visualization through real-world examples

data data-analysis data-science data-visualization freecodecamp numpy pandas pandas-dataframe project python

Last synced: 19 Apr 2026

https://github.com/saba-gul/google_data_analystics_belabeat_fitness_capstone_project

This project focuses on leveraging Fitbit user data to derive valuable insights and facilitate data-driven decision-making for Bellabeat, a leading wellness company. The objective is to harness the wealth of information captured by Fitbit devices to enhance the wellness offerings provided by Bellabeat.

bellabeat-case-study bellabeat-eda data-analytics data-visualization fitbit google-casestudy

Last synced: 08 Jun 2026

https://github.com/keshavg125/whatsapp-chat-analyzer

WhatsApp Chat Analyzer extracts insights from chat data, visualizing activity trends, emoji usage, and sentiment analysis using "ganeshkharad/gk-hinglish-sentiment". Built with Streamlit, Pandas, and Matplotlib for interactive analysis. 🚀

data-visualization emoji-analysis huggingface matplotlib nlp pandas python seaborn streamlit whatsapp-chat-analysis wordcloud

Last synced: 07 May 2026

https://github.com/supriya811106/whatsapp-chat-analyzer-app

Analyze WhatsApp chats with Python, Streamlit, and data visualization. Explore messaging patterns, content trends, and emoji usage to uncover insights from your conversations.

analyzer-web-app chat-analytics chat-analyzer data-preprocessing data-visualization emojis machine-learning matplotlib natural-language-processing nltk numpy pandas plotly python3 seaborn sentiment-analysis streamlit-webapp text-analysis user-engagement

Last synced: 30 Dec 2025

https://github.com/sandravizz/analytical-system-design

Teaching material for bachelor course at Arcada

d3-js data-structures data-visualization system-design

Last synced: 24 Jan 2026

https://github.com/saroshfarhan/dublin_pedestrian_data_analysis

Pedestrian footfall data analysis for the city of Dublin

data-analysis data-visualization r-programming

Last synced: 07 Jan 2026

https://github.com/yanny-alt/banking-customer-retention-analysis

The objective of this analysis is to identify factors contributing to the increased customer churn rate at the bank. The insights gained from this analysis will help business users make informed decisions and develop strategies to improve customer retention and reduce churn.

data-visualization power-bi powerbi-customer-churn-analysis

Last synced: 07 Jan 2026

https://github.com/chrisvilches/human-profiling

Monitors and analyzes the programs the user runs.

csharp data-visualization human-behavior winapi

Last synced: 16 Mar 2025

https://github.com/kirby-b/assorted-r-files

Mainly files from learning to use datasets and do data analysis with R

barchart data-visualization r-language r-programming

Last synced: 25 Mar 2025

https://github.com/erabossid/d3js-treemap

Data visualization with D3js Treemap

d3js data-science data-visualisation data-visualization reactjs

Last synced: 10 Mar 2025

https://github.com/ricardo-melo-martins/docker

⚡ RMM ⚡:: 🐳 docker with database for fun development

data-visualization database datascience docker mysql postgres sakila sakila-database sqlite

Last synced: 12 Apr 2026

https://github.com/ginga1402/data_visualization_on_honey_production_dataset

Data Visualization using Matplotlib & Seaborn Libraries

college-project data data-visualization

Last synced: 25 Aug 2025

https://github.com/victorlcastro-dsa/pbl-datacamp

This repository features projects from DataCamp's Project-Based Learning (PBL) courses, showcasing practical applications of data analysis, machine learning, and visualization. Explore real-world datasets and interactive results that highlight the skills gained through hands-on learning.

data-analysis data-science data-visualization datacamp-projects hypothesis-testing machine-learning project-based-learning

Last synced: 30 Jun 2026

https://github.com/archanakokate/exploratory_data_analysis_global-terrorism_using_tableau

Using Tableau, conducted an in-depth analysis of terrorism incidents around the world.

analysis data-visualization tableau

Last synced: 04 Feb 2026

https://github.com/tclzcja/spark-comparison-visualization

A data visualization project I made for Blue Telescope/HP.

client-project data-visualization

Last synced: 15 Jun 2025

https://github.com/teja-1403/game-of-thrones-analysis

Demonstrates exploratory data analysis on the GOT dataset using plots, graphs, and information extracted from text.

analysis data-visualization datascience machine-learning python

Last synced: 12 Apr 2026

https://github.com/abhash-rai/analyzing-credit-card-eligibility

This work was performed as part of BCU undergraduate course.

data-analysis data-visualization ggplot ggplot2 latex r

Last synced: 20 Jan 2026

https://github.com/master-helix/11-7-commercial-store

A Data Analytics project for analyzing a Commercial Store dataset and building an interactive Excel dashboard for insights.

data-analytics data-visualization excel

Last synced: 04 Feb 2026

https://github.com/rupeshrb/data_visualization

Data visualization is an important concept that applies to datasets.

data-analytics data-visualization dataset python

Last synced: 17 May 2026

https://github.com/govind-prakash/r

A collection of R scripts designed for basic bioinformatics and biostatistics projects. This repository includes scripts for data analysis, visualization, and statistical modeling, catering to researchers and students in life sciences

data-science data-visualization r r-base rstudio statistics

Last synced: 05 Sep 2025

https://github.com/muhammed-fazal/student-success-and-early-intervention-analytics-system

Consolidates scattered student performance records into a unified data warehouse in SQL Server, builds interactive Power BI dashboards that visualize academic trends and student performance, and implements predictive analytics.

analysis analytics dashboard data data-analysis data-engineering data-science data-visualization database etl etl-pipeline power-bi powerbi python sql sql-server

Last synced: 29 May 2026

https://github.com/asuquoaa/ann_arbor_weather_analysis_2005-2015

This project analyzes historical weather data from Ann Arbor, Michigan, collected by the National Centers for Environmental Information (NCEI) Global Historical Climatology Network daily (GHCNd).

data-cleaning-and-preprocessing data-visualization

Last synced: 03 Apr 2025

https://github.com/parthasarathy27/barchart-visualization-using-amcharts

This project visualizes monthly fuel usage data (petrol and diesel) using a responsive bar chart built with the amCharts library. The chart displays fuel consumption across 12 months, with separate bars for petrol and diesel for each month.

amcharts amcharts-js-charts data-visualization html-css-javascript json visualization

Last synced: 12 Apr 2026

https://github.com/angchekar28/valorant-gameplay-analysis

This project analyzes Valorant gameplay data to understand key factors affecting match outcomes. It compares various machine learning models to predict player performance, rank classification, and match success.

data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook machine-learning python

Last synced: 12 Apr 2026

https://github.com/theanujsinha01/mcdonalds-customer-analysis

This project analyzes customer feedback data to understand what drives people to like or dislike McDonald’s. Using Python and data visualization tools in a Jupyter Notebook, we explore how different factors—such as taste, price, health, and visit frequency—affect customer satisfaction.

case-study data data-visualization dataanalysis

Last synced: 05 Sep 2025

https://github.com/camara94/data_analyse_series_temporelles

In this tutorial, we answer the following questions: 1. Read the Microsoft data using the **Pandas Data reader** package 2. Get the **maximum price** of the stock from **2017 to 2022** 3. What is the **date of the highest price** of the stock? 4. What is the **date of the lowest price** of the stock?

data-analysis data-analysis-python data-science data-structures-and-algorithms data-visualization serie series-forecasting

Last synced: 09 Apr 2025

https://github.com/rmitsch/paella

Web application for visual parameter space analysis of topic models utilizing word embeddings.

data-visualization latent-dirichlet-allocation natural-language-processing topic-modeling word2vec

Last synced: 07 Jan 2026

https://github.com/dmdlgg/spotify-analysis

An interactive data analysis app built with Python, Pandas, Plotly, and Streamlit, showcasing insights about the top 1000 most played songs on Spotify. Dataset sourced from Kaggle. Users can explore the frequency, popularity, and most played songs by artist in a clean and intuitive interface.

data-analysis data-visualization pandas plotly python streamlit

Last synced: 11 May 2026

https://github.com/rell/aeronet_aq

NASA - Air Quality Forecast

aeronet air-quality aqi data-visualization nasa

Last synced: 02 Apr 2025

https://github.com/itskshitija/analyzing-the-nyc-airbnb-market

The aim of this project is to use Python to understand the factors that influence Airbnb prices in New York City and to identify patterns across all variables. Our analysis provides useful information for travelers and hosts in the city and some of the best insights for the Airbnb business.

data-science data-visualization dataanalysis dataanalysisusingpython

Last synced: 22 Jul 2025

https://github.com/busesimsek/dataanalysisportfolio

A compilation of my data analysis projects using SQL, Python, and Tableau.

data-analysis data-visualization python sql tableau

Last synced: 12 Jun 2025

https://github.com/sravyatogarla/movie-recommendation-system

A complete Movie Recommendation System project implementing Popularity-Based, Content-Based, and Collaborative Filtering models using the MovieLens dataset. Built with Python, Pandas, and Plotly, featuring interactive inputs and visualizations.

capstone-project collaborative-filtering content-based-filtering data-science data-visualization edureka jupyter-notebook machine-learning movie-recomendation-system movielens pandas popularity-based-filtering python recommender-system scikit-learn sql

Last synced: 13 Apr 2026

https://github.com/carcesar/salariogovernadores2023

Visualization of governors' salaries in 2023

data-science data-visualization politics

Last synced: 24 Apr 2025

https://github.com/sanjana-bongale/cta_ridership_data_visualization_using_tableau

Tableau-based analysis of Chicago Transit Authority (CTA) ridership trends (2015-2024). It includes interactive dashboards, heatmaps, and comparative visualizations to explore bus and rail boarding data, COVID-19 impact, and long-term trends.

customer-analysis dashbaord data-visualization tableau

Last synced: 16 Feb 2026

https://github.com/hyoaru/prime-number-forest-3d

A data visualization of what the prime numbers from 1 to 200 would look like if they were a forest

data-visualization plotly python

Last synced: 31 Mar 2025

https://github.com/archanakokate/ml_cardiovascular-disease-prediction-

EDA and Model building to predict the risk of a heart attack using a Logistic Regression and Random Forest Classifier

data-engineering data-visualization exploratory-data-analysis machine-learning-algorithms

Last synced: 17 Mar 2025

https://github.com/gabrieladados/tableau_dashboards

Dashboards developed in Tableau

dashboards data-visualization figma tableau

Last synced: 09 Apr 2025

https://github.com/itskshitija/lego-set-explorer

As a part of the Maven Analytics Lego challenge, I developed an interactive Power BI dashboard exploring the evolution of LEGO sets from 1970 to 2022.

data-analysis data-science data-visualization dataanalysis dataset powerbi powerbi-desktop powerbi-report

Last synced: 12 Jun 2025

https://github.com/sanjiban08/coffee-sales-dashboard

Explore your coffee sales like never before with our Interactive Excel Dashboard—unlock insights, track trends, and enhance decision-making for a robust and caffeinated business strategy. ☕📈

data-cleaning data-visualization excel pivot-tables

Last synced: 26 Jan 2026

https://github.com/fbarffmann/sqlalchemy-challenge

Built a Flask API with SQLAlchemy to analyze and visualize Hawaii climate data. Automated data extraction and developed database queries for temperature and precipitation insights.

api climate-data data-analysis data-visualization flask orm python sql sqlalchemy sqlite

Last synced: 13 Apr 2026

https://github.com/rajan-bhateja/tableau-power-bi-dashboards

Dashboards created using Tableau/Power BI

dashboards data-visualization powerbi tableau

Last synced: 04 Feb 2026

https://github.com/petarran/gun-violence-usa

Data Science project comparing USA gun violence cases to its causes.

data-science data-visualization r

Last synced: 05 Sep 2025

https://github.com/nullthefirst/py-notebooks

Jupyter Notebooks holding Data Science projects

data-analysis data-science data-visualization datasets jupyter-notebooks python

Last synced: 26 Apr 2026

https://github.com/nafiealhilaly/first-dash-app

A simple Plotly Dash app to explore and analyze an imaginary student assessment dataset

data-analysis data-analytics data-visualization eda plotly-dash python

Last synced: 02 Apr 2025

https://github.com/darrenjolson/pba-analysis-app

Data analysis and visualization tool for professional bowling tournaments, predicting performance across different oil patterns and venues.

bowling data-analysis data-visualization flask pba predictive-analytics python reactjs sports-analytics

Last synced: 13 Apr 2026

https://github.com/nurulashraf/polynomial-regression-manufacturing

A Python project implementing polynomial regression to analyse and predict manufacturing-related data. Features include data preprocessing, model training, and visualisation of results. Ideal for exploring machine learning applications in manufacturing process optimisation.

data-analysis data-visualization machine-learning manufacturing polynomial-regression predictive-modeling process-optimization python regression-models scikit-learn

Last synced: 16 Apr 2026

https://github.com/amg-ai-labs/petrol_station_finder

A Python script to find nearby petrol stations and fuel prices using UK government data.

api data-visualization fuel geo python uk

Last synced: 13 Jun 2025

https://github.com/eduardorodriguesf/youtube-trending-scraper

Scraper program that searches YouTube trending video categories

data-visualization matplotlib pandas seaborn selenium

Last synced: 05 May 2026

https://github.com/wilkerhop/vanguard-anime-critique

Neo-Brutalist web application demonstrating the Vanguard Analytical Framework for anime critique with interactive data visualizations and comparative analysis.

anime article chartjs critical-analysis css data-visualization github-pages neo-brutalism web-design

Last synced: 29 May 2026

https://github.com/wilkerhop/linestream

A dynamic line visualization using HTML, JavaScript, and SVG. Each point has a vertical position based on its currentPosition, and all points are connected. New points can be added dynamically, updating the visual representation in real time. This project explores JavaScript, DOM manipulation, and SVG rendering.

data-visualization dynamic-graphics frontend html interactive-ui javascript proof-of-concept svg web-development

Last synced: 29 May 2026

https://github.com/amoghkori/working-with-apache-spark-mllib

Implemented Apache Spark MLLib to analyze a large car dataset, predict car selling prices, and gain insights into the car market.

amazon-web-services data-analysis data-visualization exploratory-data-analysis linear-regression machine-learning model-selection pyspark python random-forest sagemaker spark

Last synced: 13 Apr 2026

https://github.com/nature40/casestudies

Case studies for testing the functionality of database systems, sensors, etc

casestudies data-analysis data-visualization database

Last synced: 02 May 2026

https://github.com/ireneflorez/exploringweathertrends

Exploring Weather Trends using SQL, moving averages, and data visualization

data-visualization excel sql

Last synced: 10 Feb 2026

https://github.com/hassanislam463/data-cleaning-and-modelling-top-5-categories-analysis-forage

This project involves cleaning, merging, and analyzing datasets to identify the top 5 performing categories based on aggregate popularity scores. It includes cleaned datasets, a final merged dataset, visualizations, and a presentation summarizing the tasks and results. Tools used: Microsoft Excel, Python, and PowerPoint.

data-analysis data-visualization microsoft-excel

Last synced: 07 Jan 2026

https://github.com/crazy-dot/hiring-process-analytics

Analyse the company's hiring process data and draw meaningful insights from it

data-analytics data-visualization hiring-process ms-excel-data-analytics statistical-analysis trainity

Last synced: 07 Jan 2026

https://github.com/zen204/renewable-energy-usage-v-electricity-access

Interactive data visualization project created for COSI 116A: Introduction to Information Visualization at Brandeis University (Fall 2024). The project showcases data-driven insights using advanced visualization techniques and user interactivity. Hosted on GitHub Pages.

d3js data-analysis data-visualization electricity github-pages html-css-javascript information-visualization interactive python renewable-energy tableau web-development

Last synced: 08 Feb 2026

https://github.com/ledsouza/ml-musicas

A pipeline for music recommendation built on an sklearn clustering model

data-science data-visualization kmeans-clustering machine-learning matplotlib pandas plotly python sklearn spotify-api spotipy

Last synced: 13 Apr 2026

https://github.com/armahdavi/data_analytics_statistics_plotting_pm_airborne_sampling

All code for the data pipeline processing, statistical modelling, descriptive statistics, and plot visualizations from the airborne phase of Mahdavi et al. (2021) (Environmental Pollution). Project milestone: 2018 - 2021

data-science data-visualization machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats statistics

Last synced: 13 Apr 2026

https://github.com/athari22/investigating-netflix-movies-and-guest-stars-in-the-office

Applies basic Python skills from Introduction to Python and Intermediate Python by processing and visualizing film and television data.

data-analysis data-science data-visualization loop loops matplotlib matplotlib-pyplot netflix numpy office pandas python

Last synced: 11 Apr 2026

https://github.com/samruddhi3012/tata-data-visualization

Hi! This repo contains the dashboard I created using Tableau for TATA Data Visualization Training!

data-analysis data-visualization tableau tata

Last synced: 07 Jan 2026

https://github.com/dmarks84/coursework_project_text-mining-topic-modeling

Project for University of Michigan Applied Data Science Specialization -- Developed functions to score similarity between text passages.

data-modeling data-reporting data-visualization databases eda nlp numpy pandas python statistics text-mining

Last synced: 12 Apr 2026