An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/sandravizz/analytical-system-design

Teaching material for bachelor course at Arcada

d3-js data-structures data-visualization system-design

Last synced: 24 Jan 2026

https://github.com/yanny-alt/banking-customer-retention-analysis

The objective of this analysis is to identify factors contributing to the increased customer churn rate at the bank. The insights gained from this analysis will help business users make informed decisions and develop strategies to improve customer retention and reduce churn.

data-visualization power-bi powerbi-customer-churn-analysis

Last synced: 07 Jan 2026

https://github.com/matheusafonseca/deploy-ml-models-with-streamlit-udemy

This repository is dedicated to storing the code developed during the "Machine Learning Model Deployment with Streamlit" course on Udemy. The course covers basic to advanced techniques for deploying machine learning models using Streamlit.

data data-science data-visualization interface joblib layout machine-learning optimization-algorithms python python3 sklearn sklearn-datasets sklearn-library sklearn-pipeline streamlit

Last synced: 19 Apr 2026

https://github.com/kirby-b/assorted-r-files

Mainly files from learning to use datasets and do data analysis with R

barchart data-visualization r-language r-programming

Last synced: 25 Mar 2025

https://github.com/giog97/find_similar_tables_on_pubtables-1m

Find similar tables on the PubTables-1M dataset

data-analysis data-visualization datamining dm tables

Last synced: 09 Apr 2025

https://github.com/kathyreid/geelong-council-elections-2017

Chord diagram of distributed preferences based on Victorian Electoral Commission data

chord-diagram d3js data-visualization

Last synced: 13 Mar 2025

https://github.com/tclzcja/spark-comparison-visualization

A data visualization project I made for Blue Telescope/HP.

client-project data-visualization

Last synced: 15 Jun 2025

https://github.com/fbarffmann/citibike-covid-analysis

Analyzed NYC CitiBike usage during March 2020 to assess the impact of COVID-19 using Python and Tableau. Includes ridership breakdowns, user type trends, and interactive dashboard.

citibike covid19 data-analysis data-visualization exploratory-data-analysis pandas python tableau transportation

Last synced: 12 Apr 2026

https://github.com/faysalalmahmud/bd-med-professional-analysis

Analysis of healthcare professionals in Bangladesh through web scraping, data processing, and interactive visualization.

data-analysis data-visualization jupyter-notebook python scraper selenium selenium-webdriver tableau

Last synced: 04 Sep 2025

https://github.com/master-helix/11-7-commercial-store

A Data Analytics project for analyzing a Commercial Store dataset and building an interactive Excel dashboard for insights.

data-analytics data-visualization excel

Last synced: 04 Feb 2026

https://github.com/rupeshrb/data_visualization

Data visualization is important concept which apply on datasets

data-analytics data-visualization dataset python

Last synced: 17 May 2026

https://github.com/sanand0/storynetwork

Visualize where people are mentioned in stories and their inter-relationships

data-visualization

Last synced: 04 Sep 2025

https://github.com/govind-prakash/r

A collection of R scripts designed for basic bioinformatics and biostatistics projects. This repository includes scripts for data analysis, visualization, and statistical modeling, catering to researchers and students in life sciences

data-science data-visualization r r-base rstudio statistics

Last synced: 05 Sep 2025

https://github.com/muhammed-fazal/student-success-and-early-intervention-analytics-system

To consolidate scattered student performance records into a unified Data Warehouse in SQL Server. Engineer an Interactive Power BI dashboards that visualize academic trends, identifying student performance and implement predictive analytics.

analysis analytics dashboard data data-analysis data-engineering data-science data-visualization database etl etl-pipeline power-bi powerbi python sql sql-server

Last synced: 29 May 2026

https://github.com/shridhar1504/loan-classification-datascience-project

This project uses machine learning algorithms to predict the classification of loan status. The dataset is loaded and some transformation is done using SQL for getting a proper dataset with some valid informations.

classification data-analysis data-cleaning data-science data-visualization eda loan-prediction loan-status machine-learning predictive-modeling sql supervised-learning

Last synced: 09 Apr 2025

https://github.com/syncfusionexamples/ej2-angular-7-heatmap

A quick start project that helps you to create an Angular 7 Heatmap with minimal code configuration.

angular-heatmap angular7 data-visualization ej2-heatmap

Last synced: 03 Apr 2025

https://github.com/zahramh99/dynamic-pricing-strategy

Dynamic Pricing is an application of data science that involves adjusting the prices of a product or service based on various factors in real time. It is used by companies to optimize revenue by setting flexible prices that respond to market demand, demographics, customer behaviour and competitor prices.

business-intelligence data-science data-visualization demand-prediction dynamic-pricing machine-learning predictive-modeling price-prediction price-prediction-model pricing-strategy revenue-optimization ride-sharing

Last synced: 27 Jun 2025

https://github.com/asuquoaa/ann_arbor_weather_analysis_2005-2015

This project analyzes historical weather data from Ann Arbor, Michigan, collected by the National Centers for Environmental Information (NCEI) Global Historical Climatology Network daily (GHCNd).

data-cleaning-and-preprocessing data-visualization

Last synced: 03 Apr 2025

https://github.com/shrinidhi857/simpledataanalysisonstartups

The Indian startup ecosystem has experienced remarkable growth over the past decade, becoming a hotbed of innovation and entrepreneurship. In this data analysis we are segregating fields ,finding new insights.

data-analysis data-science data-visualization indian-startups

Last synced: 17 Sep 2025

https://github.com/itskshitija/analyzing-the-nyc-airbnb-market

The aim of this project is to utilize Python to understand the factors that influence Airbnb prices in New York City, or identifying patterns of all variables. Our analysis provides useful information for travelers and hosts in the city and some of the best insights for the Airbnb business.

data-science data-visualization dataanalysis dataanalysisusingpython

Last synced: 22 Jul 2025

https://github.com/carcesar/salariogovernadores2023

Visualização dos salários dos governadores em 2023

data-science data-visualization politics

Last synced: 24 Apr 2025

https://github.com/carcesar/mg2020

Visualização do mapa com os partidos dos prefeitos eleitos em 2020 - Minas Gerais

altair data-visualization infoviz minas-gerais politics python

Last synced: 24 Apr 2025

https://github.com/sanjana-bongale/cta_ridership_data_visualization_using_tableau

Tableau-based analysis of Chicago Transit Authority (CTA) ridership trends (2015-2024). It includes interactive dashboards, heatmaps, and comparative visualizations to explore bus and rail boarding data, COVID-19 impact, and long-term trends.

customer-analysis dashbaord data-visualization tableau

Last synced: 16 Feb 2026

https://github.com/archanakokate/ml_cardiovascular-disease-prediction-

EDA and Model building to predict the risk of a heart attack using a Logistic Regression and Random Forest Classifier

data-engineering data-visualization exploratory-data-analysis machine-learning-algorithms

Last synced: 17 Mar 2025

https://github.com/itskshitija/lego-set-explorer

As a part of the Maven Analytics Lego challenge, I developed an interactive Power BI dashboard exploring the evolution of LEGO sets from 1970 to 2022.

data-analysis data-science data-visualization dataanalysis dataset powerbi powerbi-desktop powerbi-report

Last synced: 12 Jun 2025

https://github.com/sanjiban08/coffee-sales-dashboard

Explore your coffee sales like never before with our Interactive Excel Dashboard—unlock insights, track trends, and enhance decision-making for a robust and caffeinated business strategy. ☕📈

data-cleaning data-visualization excel pivot-tables

Last synced: 26 Jan 2026

https://github.com/fbarffmann/sqlalchemy-challenge

Built a Flask API with SQLAlchemy to analyze and visualize Hawaii climate data. Automated data extraction and developed database queries for temperature and precipitation insights.

api climate-data data-analysis data-visualization flask orm python sql sqlalchemy sqlite

Last synced: 13 Apr 2026

https://github.com/rajan-bhateja/tableau-power-bi-dashboards

Dashboards created using Tableau/Power BI

dashboards data-visualization powerbi tableau

Last synced: 04 Feb 2026

https://github.com/petarran/gun-violence-usa

Data Science project comparing USA gun violence cases to its causes.

data-science data-visualization r

Last synced: 05 Sep 2025

https://github.com/darrenjolson/pba-analysis-app

Data analysis and visualization tool for professional bowling tournaments, predicting performance across different oil patterns and venues.

bowling data-analysis data-visualization flask pba predictive-analytics python reactjs sports-analytics

Last synced: 13 Apr 2026

https://github.com/amg-ai-labs/petrol_station_finder

A Python script to find nearby petrol stations and fuel prices using UK government data.

api data-visualization fuel geo python uk

Last synced: 13 Jun 2025

https://github.com/eduardorodriguesf/youtube-trending-scraper

Scraper program that searches youtube trending videos categories

data-visualization matplotlib pandas seaborn selenium

Last synced: 05 May 2026

https://github.com/analysisbyvivek/Road-Accident

Analyzes road accident patterns, exploring factors like lighting, weather, speed limits, time of day, and road conditions to uncover trends in severity and frequency.

data-analysis data-visualization eda jupyter-notebook kaggle tableau-public

Last synced: 29 Jan 2026

https://github.com/wilkerhop/linestream

A dynamic line visualization using HTML, JavaScript, and SVG. Each point has a vertical position based on its currentPosition, and all points are connected. New points can be added dynamically, updating the visual representation in real time. This project explores JavaScript, DOM manipulation, and SVG rendering.

data-visualization dynamic-graphics frontend html interactive-ui javascript proof-of-concept svg web-development

Last synced: 29 May 2026

https://github.com/amoghkori/working-with-apache-spark-mllib

Implemented Apache Spark MLLib to analyze a large car dataset, predict car selling prices, and gain insights into the car market.

amazon-web-services data-analysis data-visualization exploratory-data-analysis linear-regression machine-learning model-selection pyspark python random-forest sagemaker spark

Last synced: 13 Apr 2026

https://github.com/nature40/casestudies

Case studies for testing the functionality of database systems, sensors, etc

casestudies data-analysis data-visualization database

Last synced: 02 May 2026

https://github.com/ireneflorez/exploringweathertrends

Exploring Weather Trends using SQL, moving averages, and data visualization

data-visualization excel sql

Last synced: 10 Feb 2026

https://github.com/crazy-dot/hiring-process-analytics

Analyse the company's hiring process data and draw meaningful insights from it

data-analytics data-visualization hiring-process ms-excel-data-analytics statistical-analysis trainity

Last synced: 07 Jan 2026

https://github.com/zen204/renewable-energy-usage-v-electricity-access

Interactive data visualization project created for COSI 116A: Introduction to Information Visualization at Brandeis University (Fall 2024). The project showcases data-driven insights using advanced visualization techniques and user interactivity. Hosted on GitHub Pages.

d3js data-analysis data-visualization electricity github-pages html-css-javascript information-visualization interactive python renewable-energy tableau web-development

Last synced: 08 Feb 2026

https://github.com/andrewobwocha/titanicsurvival

🚢 End-to-end Python pipeline for Titanic survival classification. Demonstrates EDA, preprocessing, feature engineering, and Logistic Regression evaluation using Scikit-learn.

classification data-preprocessing data-visualization exploratory-data-analysis feature-engineering machine-learning pandas python scikit-learn titanic

Last synced: 13 Jun 2025

https://github.com/ledsouza/ml-musicas

Desenvolvida uma pipeline para recomendação de músicas a partir de um modelo clusterização do sklearn

data-science data-visualization kmeans-clustering machine-learning matplotlib pandas plotly python sklearn spotify-api spotipy

Last synced: 13 Apr 2026

https://github.com/quangandrei1003/france_air_pollution_pipeline

End-to-end air pollution data pipeline for French metropolitan cities using Airflow, Python, dbt, BigQuery.

airflow bigquery data data-analytics data-engineering data-modeling data-visualization dbt docker etl pandas python terraform

Last synced: 13 Apr 2026

https://github.com/athari22/investigating-netflix-movies-and-guest-stars-in-the-office

Apply basic Python skills in Introduction to Python and Intermediate Python by processing and visualizing film and television data.

data-analysis data-science data-visualization loop loops matplotlib matplotlib-pyplot netflix numpy office pandas python

Last synced: 11 Apr 2026

https://github.com/nandahkrishna/mpas

Movie Data Management and Analysis System developed using Java and Python

analysis data-visualization flask java java-application python

Last synced: 20 Apr 2026

https://github.com/shubham200137/expense-tracker-dashboard

The task is to create a Power BI dashboard from expense data (October–December) stored on SharePoint/OneDrive. It should include dropdowns for file and sheet selection with auto-refresh.

dashboard data-visualization powerautomate powerbi

Last synced: 04 Feb 2026

https://github.com/dmarks84/coursework_project_text-mining-topic-modeling

Project for University of Michigan Applied Data Science Specialization -- Developed functions to score similarity between text passages.

data-modeling data-reporting data-visualization databases eda nlp numpy pandas python statistics text-mining

Last synced: 12 Apr 2026

https://github.com/femincan/d3-choropleth-map

My solution for the Visualize Data with a Choropleth Map project on FCC.

css3 d3js data-visualization html5 javascript

Last synced: 13 Apr 2026

https://github.com/danaelshrbiny10/gold-prices

The Egypt Gold Prices project is a data analysis and visualization initiative that focuses on tracking and understanding the daily gold prices in Egyptian pounds per gram.

data-visualization docker docker-compose matplotlib mongodb numpy pandas powerbi python3 webscraping

Last synced: 13 Apr 2026

https://github.com/bala-1409/tableau-visualization-viz.-project

This repository contains Visualization Projects which is visualized through Tableau Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and also it provides social values in some cases to calculate damages and intensity by calamities.

dashboard data-analysis data-science data-visualization exploratory-data-analysis tableau tableau-dashboards tableau-public visualization

Last synced: 04 Feb 2026

https://github.com/badranalyst/tips-dataset-analysis-dashboard-with-streamlit-and-plotly

Interactive Streamlit dashboard analyzing the Seaborn 'tips' dataset, which records information on restaurant bills, including total bill amounts, tips, customer demographics (e.g., gender, smoking status), and dining details (e.g., day, time). Visualized with Plotly for insights into tipping patterns.

data-analysis data-analytics data-visualization dataset eda exploratory-data-analysis matplotlib matplotlib-pyplot numpy pandas plotly python seaborn streamlit

Last synced: 13 Apr 2026

https://github.com/farhashaad/farhashaad98

This is a repository to showcase my skills, share projects and track my progress in Data Science related projects.

data data-visualization dataanalysis matplotlib pandas python seaborn sql tableau

Last synced: 24 Apr 2026

https://github.com/brunomontezano/simple-thrive-user-growth-plot

📱 The repo contains a simple line plot for a presentation about the "Thrive: combata a depressão" app showing the user growth from April to October in 2022.

data-visualization data-viz datavisualization dataviz depression ggplot2 plots presentation-materials r-programming thrive-app user-growth

Last synced: 10 Jun 2026

https://github.com/syedzaheerabbas/aerofit-descriptive_analysis

Analyzed customer profiles for Aerofit treadmills to enhance product recommendations. The project includes visualizations and probability calculations to understand how customer demographics impact treadmill purchases.

data-visualization descriptive-statistics eda insights probability-analysis python

Last synced: 28 Apr 2026

https://github.com/musamairshad/matplotlib-learning

This repository contains material related to the Matplotlib Learning.

data-science data-visualization matplotlib plotting python

Last synced: 09 Oct 2025

https://github.com/saisurajmatta/airbnb-data-visualisation-project

Explored and visualized Seattle Airbnb data to gain insights into pricing, geographic trends, and optimal listing strategies for hosts.

data-analytics data-visualization excel tableau tableau-dashboards tableau-public tableu-workbook

Last synced: 05 Feb 2026

https://github.com/marianamartiyns/rfm-cluster-analysis

Customer behavior and sales analysis, including data cleaning, RFM calculation, churn analysis and customer clustering.

cluster-analysis data-analysis data-cleaning data-visualization pyhton

Last synced: 16 Mar 2025

https://github.com/mkaspulanwar/p6_bigdata_realtime_largescale_visualization

Praktikum Week 6 Big Data: Real-time analytics dan visualisasi data skala besar menggunakan PySpark Structured Streaming, Parquet Data Lake, dan Streamlit untuk monitoring mobilitas dan traffic smart city.

big-data data-visualization pyspark spark-streaming streamlit traffic-analytics

Last synced: 13 Apr 2026

https://github.com/deliprofesor/income-analytics-interpretable-machine-learning-model

This project predicts whether an individual earns more than 50K using the Adult Income dataset. A Random Forest model is trained and evaluated, with explanations provided through DALEX and LIME for feature importance and model transparency.

classification dalex data-preprocessing data-science data-visualization feature-engineering income-prediction lime machine-learning model-explainability predictive-modeling r-programming random-forest

Last synced: 10 Apr 2025

https://github.com/akansharajput280799/covid19-impact-analysis-usa

Data Analysis and Predictive Modeling to study COVID-19 impact across age groups, regions, and seasons in the USA.

classification-algorithm clustering-algorithm data-preprocessing data-visualization descriptive-statistics exploratory-data-analysis matplotlib numpy pandas seaborn

Last synced: 13 Apr 2026

https://github.com/deliprofesor/virtual-reality-in-education-impact-analysis-and-insights

This project examines the impact of Virtual Reality (VR) on education, focusing on its effects on student engagement, learning outcomes, and creativity. It uses data analysis techniques like descriptive statistics, correlation analysis, and clustering to assess VR's effectiveness in enhancing learning.

clustering data data-analysis data-science data-visualization exploratory-data-analysis hypothesis-testing machine-learning python regression-analysis virtual-reality

Last synced: 14 Jun 2025

https://github.com/naveen88112/clustering_customer_invoice_data

Customer Invoice Data Clustering This project uses clustering methods on customer invoice data for segmentation analysis. It preprocesses data, normalizes features, and uses K-Means and DBSCAN to cluster customers according to spending habits and shared locations.

clustering data-preprocessing data-visualization numpy pandas python silhouette-score standardization

Last synced: 13 Apr 2026

https://github.com/umutonder97/project-network-ids

Network-Based Intrusion Detection System - dev/deploy-ment of a Hybrid Intrusion Detection System (HIDS) that integrates Signature-based Network Intrusion Detection Systems (SNIDS)

artificial-neural-networks convolutional-neural-networks covid-19 covid-19-russia covid19-data data-visualization genetic-algorithm ids keras-tensorflow knn microservices-architecture network-behavioral-analysis python time-series-forecasting

Last synced: 05 Jul 2025

https://github.com/saifalibaig/covid-19-infection-rate-analysis-using-python

Analysis of Covid-19 Infection rate and the world happiness report to identify if there is any relationship between infection rate and happiness

data-analysis data-visualization jupyter-notebook numpy pandas python3 sns

Last synced: 18 Apr 2026

https://github.com/wazedkhan/medical-data-analysis

This project visualize and make calculations from medical examination data using matplotlib, seaborn, and pandas.

data-visualization jupyter-notebook matplotlib pandas python3 seaborn

Last synced: 13 Apr 2026

https://github.com/shoebjoarder/superstore

A Dash app to analyze Superstore dataset.

dashboard data-analysis data-visualization python-3

Last synced: 02 Apr 2025

https://github.com/shellynagar27/good-cabs-data-analysis-project

This project is part of CodeBasics Challenge #13, where the goal was to provide actionable insights to the Chief of Operations at Goodcabs, a cab service provider in tier-2 cities of India. The project focused on analyzing key metrics like trip volume, repeat passenger rate, and passenger satisfaction.

critical-thinking data-analysis data-visualization excel exploratory-data-analysis power-bi presentation problem-solving sql storytelling

Last synced: 25 Jan 2026

https://github.com/kivanc57/feature_comparison

This project explores the relationship between features and diagnosis in cancer data. Using methods like boxplots, scatterplots, PCA, k-means clustering, and logistic regression, we analyze and visualize data to understand health indicators.

boxplot clustering correlation data-science data-visualization descriptive-statistics explanatory-data-analysis pearson-correlation r scatter-plot spearman

Last synced: 06 Jun 2026

https://github.com/shellynagar27/candy-market-share-analysis

Candy Market Share Analysis explores confectionery sales data using Power BI, Python, and Power Query. It uncovers key market trends, top-selling candies, manufacturer performance, and packaging preferences to support data-driven decision-making for industry researchers.

critical-thinking data-analysis data-visualization exploratory-data-analysis powerbi powerquery problem-solving sales-analysis

Last synced: 03 Feb 2026

https://github.com/parthivnaresh/facilyst

Facilyst is a library that makes using data science and machine learning tools easier.

data-science data-visualization deep-learning machine-learning mock-data neural-network python

Last synced: 18 Mar 2025

https://github.com/cyber-security-tech/top10-movies-web

Feature-rich full-stack Flask web app that lets users search, rate, and review movies via TMDb API, with smart genre filtering, interactive statistics (Chart.js), form validation (Flask-WTF), star-based ratings, and a polished UI/UX designed for real-world deployment.

api-integration bootstrap chartjs crud-app data-visualization flask flask-blueprints flask-wtf form-validation fullstack genre-filtering jinja movie-database python responsive-design sqlalchemy sqlite tmdb-api ui-ux web-app

Last synced: 08 Apr 2026

https://github.com/bretsw/eme6356-ss23-module5

Slide deck for EME6356, Module 5: Data Visualization (Spring 2023)

analytics data-analytics data-visualization slides visualization

Last synced: 08 Jan 2026

https://github.com/ndiplacide7/r-project

Explore diverse data analysis techniques using R programming combined with advanced machine learning algorithms to uncover insights and create powerful predictive models.

data-analysis data-visualization machine-learning-algorithms r

Last synced: 25 Mar 2025

https://github.com/cartervr/taxdatabase-sql-tableau

End-to-end process for building an SQL Azure database, performing data analysis with SQL and Python, and visualizing data with Tableau.

azure data-science data-visualization database-architecture database-deployment database-management databse-design datanalysis erdiagram sql tableau

Last synced: 13 Mar 2026

https://github.com/cronware/predictive-maintenance

The Predictive Maintenance System is a C# WinForms application designed to monitor and analyze sensor data from industrial equipment in real time. It integrates machine learning (ML.NET) and MongoDB to detect anomalies, predict failures, and optimize maintenance schedules before equipment breakdown occurs.

csharp data-visualization dotnet machine-learning mlnet mongodb predictive-maintenance winforms

Last synced: 13 Apr 2026

https://github.com/rafaelmoura23/capella-info-ai

CapellaInfo is a Laravel-based application designed for automation, data, and AI projects. Its primary goal is to store and manage personal projects efficiently, providing a centralized platform for innovation and development.

artificial-intelligence automation data-science data-visualization laravel neural-network

Last synced: 28 Apr 2026

https://github.com/treyhamilton/ds-project-1

A compilation of various programming concepts written in Python/R covering the topics listed below

covid19-data data-science data-visualization exploratory-data-analysis

Last synced: 06 Jul 2025

https://github.com/brazer27/iris-classification

A Python implementation of Naive Bayes algorithm for Iris flower classification. Features include cross-validation, data preprocessing, and prediction capabilities. Built from scratch without ML libraries, achieving ~95% accuracy on the classic Iris dataset.

cross-validation data-science data-visualization flower-classification iris-dataset machine-learning naive-bayes python

Last synced: 06 Sep 2025