An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/priyanshubiswas-tech/deloitte-daikibo-telemetry-analysis-task-1

Tableau dashboard analyzing Daikibo telemetry data. Tracks downtime by factory/device with interactive filters. Deloitte task solution with JSON processing.

data-analysis data-visualization deloitte json tableau tableau-public

Last synced: 11 Oct 2025

https://github.com/sahilmate/ebm-breast-cancer-classifier

This repository implements an Explainable Boosting Machine (EBM) model for breast cancer classification using scikit-learn and interpret. The project includes data preprocessing, model training, accuracy evaluation, and feature importance visualization.

breast-cancer-classification data-visualization explainable-boosting-machine feature-importance interpret machine-learning scikit-learn

Last synced: 06 May 2026

https://github.com/vinay-jose/territorial-sales-dashboard

EDA was carried out in the sales data of Atliq Technologies and a Dashboard was created in PowerBI to draw insights.

data-analysis data-visualization powerbi-desktop sql

Last synced: 11 Oct 2025

https://github.com/joonarafael/ids-exercises

Repository to store the exercise submissions for the Introduction to Data Science course (University of Helsinki).

course-work data-science data-visualization jupyter-notebook university-assignment

Last synced: 16 Jun 2026

https://github.com/mitgar14/etl-workshop-2

Workshop #2 (ETL process using Airflow) for the ETL course using Apache Airflow to build a data pipeline.

airflow data-engineer data-engineering data-visualization etl pandas postgresql powerbi python sqlalchemy

Last synced: 14 Apr 2026

https://github.com/ahsankhizar5/titanic-eda-visualization

Exploratory Data Analysis and Visualization on the Titanic Dataset using Python, Pandas, Matplotlib, and Seaborn to uncover survival patterns.

data-analysis data-science data-visualization eda kaggle machine-learning matplotlib pandas python seaborn titanic-dataset

Last synced: 31 May 2026

https://github.com/ndiplacide7/r-project

Explore diverse data analysis techniques using R programming combined with advanced machine learning algorithms to uncover insights and create powerful predictive models.

data-analysis data-visualization machine-learning-algorithms r

Last synced: 25 Mar 2025

https://github.com/archanakokate/kkbox_music_recommendations

Predicting the chances of a user listening to a song repetitively after the first observable listening event.

data-visualization exploratory-data-analysis machine-learning statistical-analysis

Last synced: 11 Oct 2025

https://github.com/bretsw/eme6356-ss23-module5

Slide deck for EME6356, Module 5: Data Visualization (Spring 2023)

analytics data-analytics data-visualization slides visualization

Last synced: 08 Jan 2026

https://github.com/dzakwanalifi/stadata-x

Terminal UI untuk menjelajahi dan mengunduh data BPS Indonesia secara interaktif

bps-api cli-app data-analysis data-visualization indonesia-statistics indonesian-data open-data python statistics terminal-ui textual tui

Last synced: 20 Jan 2026

https://github.com/shellynagar27/candy-market-share-analysis

Candy Market Share Analysis explores confectionery sales data using Power BI, Python, and Power Query. It uncovers key market trends, top-selling candies, manufacturer performance, and packaging preferences to support data-driven decision-making for industry researchers.

critical-thinking data-analysis data-visualization exploratory-data-analysis powerbi powerquery problem-solving sales-analysis

Last synced: 03 Feb 2026

https://github.com/shellynagar27/good-cabs-data-analysis-project

This project is part of CodeBasics Challenge #13, where the goal was to provide actionable insights to the Chief of Operations at Goodcabs, a cab service provider in tier-2 cities of India. The project focused on analyzing key metrics like trip volume, repeat passenger rate, and passenger satisfaction.

critical-thinking data-analysis data-visualization excel exploratory-data-analysis power-bi presentation problem-solving sql storytelling

Last synced: 25 Jan 2026

https://github.com/saifalibaig/covid-19-infection-rate-analysis-using-python

Analysis of Covid-19 Infection rate and the world happiness report to identify if there is any relationship between infection rate and happiness

data-analysis data-visualization jupyter-notebook numpy pandas python3 sns

Last synced: 18 Apr 2026

https://github.com/saagpatel/sovereign

Browser-based geopolitical simulator — apply policy levers to 18 countries and watch cascading effects over 60 months

d3 data-visualization geopolitics monte-carlo nextjs simulation typescript web-worker

Last synced: 28 Jun 2026

https://github.com/aszenz/data-viz

visualize data from your browser

csv-converter data-analytics data-visualization

Last synced: 20 Jan 2026

https://github.com/alexondata/daan_eda-exploratory-data-analysis_ecommerce

This project presents an Exploratory Data Analysis (EDA) pipeline for an eCommerce dataset, integrating Python, SQL Server, and Power BI to transform raw transactional data into meaningful business insights. The project was developed as part of an academic assignment at Transilvania University of Brașov, Faculty of Mathematics and Computer Science.

data-analysis data-visualization ecommerce microsoft-sql-server powerbi python

Last synced: 18 May 2026

https://github.com/37743/ml-starterkit

This project is designed to streamline the data science workflow by allowing users to input a file path and receive a comprehensive analysis tailored to their needs. The tool automates key stages in the data science pipeline, including Exploratory Data Analysis (EDA), data preprocessing, and model training.

data-preprocessing data-visualization exploratory-data-analysis machine-learning python

Last synced: 07 Apr 2025

https://github.com/umutonder97/project-network-ids

Network-Based Intrusion Detection System - dev/deploy-ment of a Hybrid Intrusion Detection System (HIDS) that integrates Signature-based Network Intrusion Detection Systems (SNIDS)

artificial-neural-networks convolutional-neural-networks covid-19 covid-19-russia covid19-data data-visualization genetic-algorithm ids keras-tensorflow knn microservices-architecture network-behavioral-analysis python time-series-forecasting

Last synced: 05 Jul 2025

https://github.com/sebastian-gregoricchio/rseb

An R-package for daily tasks required to handle biological data as well as avoid re-coding of small functions for quick but necessary data management.

atac-seq bedtools chip-seq cutandtag daily-tasks data-visualisation data-visualization datamining deeptools genomics ngs qpcr qpcr-analysis r rna-seq statistics

Last synced: 31 May 2026

https://github.com/leandrocollares/nyc-film-permits

NYC film permits: an exploratory data analysis

data-analysis data-visualization pandas plotly

Last synced: 05 Jul 2025

https://github.com/akansharajput280799/covid19-impact-analysis-usa

Data Analysis and Predictive Modeling to study COVID-19 impact across age groups, regions, and seasons in the USA.

classification-algorithm clustering-algorithm data-preprocessing data-visualization descriptive-statistics exploratory-data-analysis matplotlib numpy pandas seaborn

Last synced: 13 Apr 2026

https://github.com/deliprofesor/breast-cancer-detection-using-svm-with-smote-and-model-optimization

This project analyzes health and lifestyle factors influencing heart attack risk using statistical methods and machine learning, with Ridge Regression identified as the best predictive model.

classification data data-preprocessing data-science data-visualization gridsearchcv machine-learning python roc-curve smote svm

Last synced: 10 Apr 2025

https://github.com/marianamartiyns/rfm-cluster-analysis

Customer behavior and sales analysis, including data cleaning, RFM calculation, churn analysis and customer clustering.

cluster-analysis data-analysis data-cleaning data-visualization pyhton

Last synced: 16 Mar 2025

https://github.com/mishaa931/amazon-sales-dashboard-power-bi

This project features a dynamic Power BI dashboard built on dummy Amazon sales data. It visualizes key business metrics such as revenue trends, top-selling categories, discount impact, and geographic performance. The dashboard is designed to help stakeholders make data-driven decisions through clear, interactive visuals.

data-analysis data-quality data-visualization microsoftpowerbi

Last synced: 05 Feb 2026

https://github.com/saisurajmatta/airbnb-data-visualisation-project

Explored and visualized Seattle Airbnb data to gain insights into pricing, geographic trends, and optimal listing strategies for hosts.

data-analytics data-visualization excel tableau tableau-dashboards tableau-public tableu-workbook

Last synced: 05 Feb 2026

https://github.com/nazir20/scraping-tweets-using-python-and-preprocessing-tweets-for-sentiment-analysis

This is repo is about how to scrape tweets from Twitter using Python and also proprocessing tweets for sentiment analysis

data-cleaning data-visualization jupyter-notebook python twitter-sentiment-analysis

Last synced: 13 Apr 2026

https://github.com/musamairshad/matplotlib-learning

This repository contains material related to the Matplotlib Learning.

data-science data-visualization matplotlib plotting python

Last synced: 09 Oct 2025

https://github.com/syedzaheerabbas/aerofit-descriptive_analysis

Analyzed customer profiles for Aerofit treadmills to enhance product recommendations. The project includes visualizations and probability calculations to understand how customer demographics impact treadmill purchases.

data-visualization descriptive-statistics eda insights probability-analysis python

Last synced: 28 Apr 2026

https://github.com/farhashaad/farhashaad98

This is a repository to showcase my skills, share projects and track my progress in Data Science related projects.

data data-visualization dataanalysis matplotlib pandas python seaborn sql tableau

Last synced: 24 Apr 2026

https://github.com/nmelgar/lego_my_data

Data visualization project to sell LEGO bulks.

csv data-analysis data-visualization data-viz google-sheets tableau

Last synced: 08 Jan 2026

https://github.com/leftcoastnerdgirl/webscraping_and_beautifulsoup

This project uses Beautiful Soup to create scrap data from a news website.

beautifulsoup data-visualization jupyter-notebook splinter webscraping

Last synced: 17 Jun 2026

https://github.com/petitatelier/data-generators

A collection of data generators, to play with in visualization experiments

data-generator data-visualization

Last synced: 13 Oct 2025

https://github.com/badranalyst/tips-dataset-analysis-dashboard-with-streamlit-and-plotly

Interactive Streamlit dashboard analyzing the Seaborn 'tips' dataset, which records information on restaurant bills, including total bill amounts, tips, customer demographics (e.g., gender, smoking status), and dining details (e.g., day, time). Visualized with Plotly for insights into tipping patterns.

data-analysis data-analytics data-visualization dataset eda exploratory-data-analysis matplotlib matplotlib-pyplot numpy pandas plotly python seaborn streamlit

Last synced: 13 Apr 2026

https://github.com/urbanekda/upwork_dashboard

A data analysis project examining trends and patterns in the data science job market on Upwork. This project analyzes job postings, requirements, and market demands to provide insights into the freelance data science ecosystem.

data-analysis data-science data-science-projects data-visualization freelance jupyter-notebook python streamlit

Last synced: 07 May 2026

https://github.com/flowsta/ods-educacion-aporta

ODS para educación, iniciativa APORTA 2021

data data-visualization ods sdg

Last synced: 27 Jan 2026

https://github.com/derrmru/whats-in-the-news

Data Visualisation of News Content

data-visualization nlp react scraped-data

Last synced: 17 May 2026

https://github.com/parnika-singh/oncovision

An intelligent machine learning model for classifying breast cancer cells as benign or malignant using the UCI Breast Cancer Wisconsin dataset.

breast-cancer-prediction cancer-detection classification data-visualization decision-tree healthcare knn logistic-regression machine-learning medical-ai-project python3 sklearn svm-model xgboost

Last synced: 07 May 2026

https://github.com/evan-dg31/data-science

Exploratory Data Analysis (EDA), Predictive Modeling (Supervised and Unsupervised), Regression, Classification, Clustering

classification clustering data-analysis data-science data-visualization machine-learning matplotlib numpy pandas python regression-analysis seaborn

Last synced: 13 Apr 2026

https://github.com/oenm176/hmeq-loan-analysis

Menggali wawasan dari dataset Home Equity (HMEQ). Proyek ini membangun model klasifikasi untuk mendeteksi kredit macet, yang menampilkan pra-pemrosesan data lengkap, normalisasi, dan visualisasi pohon menggunakan Python.

classification-model credit-risk-analysis data-mining data-science data-visualization decision-tree hmeq-dataset machine-learning python scikit-learn student-project

Last synced: 13 Apr 2026

https://github.com/ayorick23/python-data-science-cheat-sheet

Guía rápida y práctica de sintaxis, comandos y funciones esenciales de Python para Ciencia de Datos. Perfecta para recordar cómo usar las librerías más comunes como NumPy, Pandas, Matplotlib y Scikit-learn en tus análisis diarios.

cheat-sheet data-analysis data-science data-visualization deep-learning jupyter-notebook machine-learning matplotlib ml numpy pandas python scikit-learn scipy seaborn statistics sympy tensorflow

Last synced: 07 Apr 2026

https://github.com/markusbegerow/powerbi-navigation-menu

Interactive navigation menu visual for Power BI with slide-out filtering and hierarchical data support

business-intelligence d3js data-visualization filter hamburger-menu navigation powerbi powerbi-custom-visuals powerbi-visuals typescript

Last synced: 14 Oct 2025

https://github.com/mr-chang95/udacity_movie_project

Movie Data Analysis and Visualization Project for Udacity's Data Analyst Program. Using Python in Jupyter Notebook.

data-analysis data-visualization jupyter-notebook movie python

Last synced: 13 Apr 2026

https://github.com/karlyndiary/coffee-shop-sales-analysis

Comprehensive analysis of coffee shop sales utilizing Pandas for data cleaning and exploratory data analysis (EDA), complemented by Streamlit for creating interactive data visualization dashboards.

data-analysis data-cleaning data-preprocessing data-visualization eda pandas streamlit streamlit-dashboard

Last synced: 07 May 2026

https://github.com/athari22/investigating-netflix-movies-and-guest-stars-in-the-office

Apply basic Python skills in Introduction to Python and Intermediate Python by processing and visualizing film and television data.

data-analysis data-science data-visualization loop loops matplotlib matplotlib-pyplot netflix numpy office pandas python

Last synced: 11 Apr 2026

https://github.com/badr-moufad/dashboard-agri-edge-frontend

Dashboard of Moroccan weather data adapted to the wheat calendar. This is part of my research internship.

clustering dashboard data-visualization morocco-regions plotlyjs reactjs redux tailwindcss weather

Last synced: 07 May 2026

https://github.com/miserman/splot

An R package to ease data visualization

data-visualization r

Last synced: 22 Jan 2026

https://github.com/teja-1403/forage-bcg-x-data-science

About This repository contains solutions to the 4 different tasks that must be performed during the Data Science virtual internship provided by BCG X via Forage.

business-understanding client-communication data-evaluation data-science data-visualization exploratory-data-analysis hypothesis-framing model-interpretation

Last synced: 27 Jan 2026

https://github.com/saisurajmatta/healthcare-data-analytics

Power BI project analyzing Emergency Department data, demonstrating skills in data transformation, DAX, and visualization. It focuses on patient flow, wait times, demographics, and satisfaction, providing actionable insights for healthcare improvement. Includes documentation, data dictionary, and code samples.

data-analysis data-modeling data-visualization dax power-bi powerbi-visuals powerquery

Last synced: 22 Jan 2026

https://github.com/armahdavi/data_analytics_statistics_plotting_pm_airborne_sampling

All codes for the data pipelines processing, statistical modellings, descriptive statistics and plot visualizations from airborne phase of Mahdavi et al. (2021) (Environmental Pollution) Project Miestone: 2018 - 2021

data-science data-visualization machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats statistics

Last synced: 13 Apr 2026

https://github.com/chahelgupta/dep-videogames-dataset

The data extraction and processing involved thorough exploration, preprocessing, and visualization of the "Video Game Sales with Ratings" dataset.

data-analysis data-exploration data-extraction data-preparation data-preprocessing data-processing data-science data-visualization

Last synced: 15 Oct 2025

https://github.com/saisurajmatta/e-commerce-sales-advanced-data-analysis

Excel-based e-commerce analytics for FNP, a gift company. It covers data extraction, modeling, and visualization, providing actionable insights on revenue, customer behavior, and operations. Key skills include Excel, Power Query, Power Pivot, and DAX. The analysis culminates in data-driven business recommendations.

data-analysis data-visualization dax excel power-pivot power-query

Last synced: 22 Jan 2026

https://github.com/pngo1997/chicago-airbnb-cta

Interactive Chicago CTA train stations geospatial map.

data-visualization geospatial html python visualization

Last synced: 15 Oct 2025

https://github.com/5hraddha/ice-video-games-sales-analysis

Analysis of video game sales data of an online store Ice, which sells video games all over the world & identify patterns that determine whether a game succeeds or not.

data-visualization matplotlib numpy pandas requests-module scipy seaborn

Last synced: 14 Apr 2026

https://github.com/syre/strava-stats

Strava Stats is a simple Python app for providing insights into your Strava riding metrics.

data-visualization metrics plotly-dash python strava tailwindcss

Last synced: 22 Jan 2026

https://github.com/sunnyrao07/education-wage-trends-usa-1973-2022

A Tableau-based data visualization project analyzing wage trends by education, gender, and race in the USA (1973–2022).

dashboard data-visualization tableau

Last synced: 05 Feb 2026

https://github.com/hamburgj/survivor-stats

Interactive visualization of Survivor US contestant statistics and season data, as well as connection path finding.

data-visualization graph interactive-visualizations react reactjs statistics survivor

Last synced: 16 Apr 2026

https://github.com/kunalpisolkar24/winequalityprediction

Predicting wine quality using machine learning with matplotlib, numpy, pandas, and seaborn for insightful data analysis. 🍇🤖📊

data-analysis data-science data-visualization machine-learning prediction-model

Last synced: 16 Oct 2025

https://github.com/grascya/heart-disease

The objective is to ascertain the probability of an individual being susceptible to a severe heart problem based on some features.

data-visualization explainable-machine-learning exploratory-data-analysis heart-disease svm-classifier

Last synced: 16 Oct 2025

https://github.com/hase3b/flask-dash-interactive-dashboard

An interactive data visualization dashboard created using Flask and Dash. This project includes comprehensive data preparation, exploratory data analysis (EDA), and dynamic visualizations with Seaborn and Plotly. Explore the multi-page Dash app with features like dropdowns and callbacks for updated plots.

callbacks dash dashboard data-analysis data-visualization dropdown eda flask interactive plotly seaborn web-app

Last synced: 19 May 2026

https://github.com/aishanipach/data-visualizer

Visualize data according to month and year build using reactjs.

data-visualization frontend react reactjs

Last synced: 14 Apr 2026

https://github.com/zen204/renewable-energy-usage-v-electricity-access

Interactive data visualization project created for COSI 116A: Introduction to Information Visualization at Brandeis University (Fall 2024). The project showcases data-driven insights using advanced visualization techniques and user interactivity. Hosted on GitHub Pages.

d3js data-analysis data-visualization electricity github-pages html-css-javascript information-visualization interactive python renewable-energy tableau web-development

Last synced: 08 Feb 2026

https://github.com/crazy-dot/hiring-process-analytics

Analyse the company's hiring process data and draw meaningful insights from it

data-analytics data-visualization hiring-process ms-excel-data-analytics statistical-analysis trainity

Last synced: 07 Jan 2026

https://github.com/ireneflorez/exploringweathertrends

Exploring Weather Trends using SQL, moving averages, and data visualization

data-visualization excel sql

Last synced: 10 Feb 2026

https://github.com/mindlessmuse666/iris-ml-based-on-decision-trees

Проект демонстрирует применение моделей машинного обучения на основе деревьев решений и случайного леса для классификации набора данных Iris. Включает в себя загрузку данных, обучение моделей, оценку производительности и визуализацию результатов. Предназначен для изучения основ машинного обучения и анализа данных.

classification data-analysis data-visualization decision-trees iris-dataset machine-learning model-evaluation python random-forest scikit-learn

Last synced: 17 Oct 2025

https://github.com/ganesh774218/eda-book-store

Exploratory data analysis on a book store dataset to uncover sales trends, popular genres, and top publishers.

data-visualization datacleaning datamanipulation eda matplotlib numpy pandas python pythonp pythonproject seaborn

Last synced: 07 May 2026

https://github.com/alekiie/streamlit-dashboard

A dashboard that utilizes the power of streamlit charts to create intuitive and easy to understand charts for data visualization.

data-visualization matplotlib numpy pandas python3 streamlit

Last synced: 07 May 2026

https://github.com/sgb31/covid-19-data-analysis

"In this project, I analyzed COVID-19 data to explore trends, case growth, and key patterns. I worked on cleaning the data, performing exploratory analysis, and visualizing infection rates, recoveries, and fatalities. The goal was to gain insights into how the pandemic evolved and its overall impact.

data-analysis data-visualization matplotlib pandas python seaborn

Last synced: 13 May 2026

https://github.com/kruthiktr/titanic-survival-prediction

Titanic Survival Prediction using Machine Learning predicts whether a passenger survived based on features like age, gender, and class. A Random Forest Classifier achieved 82.68% accuracy after data preprocessing. Explore this project to see how machine learning handles this classic problem.

classification classification-model data-visualization machine-learning predictive-modeling python titanic-dataset titanic-survival-prediction

Last synced: 10 Apr 2025

https://github.com/snototter/viren2d

Visualization Toolbox for Computer Vision

computer-vision-tools cpp data-visualization python

Last synced: 15 May 2026

https://github.com/asuquoaa/energy_consumption_dashboard_for_african_subregions_1980-2019

Interactive dashboard that allows users to explore the major energy sources across various subregions in Africa and analyze them

data-visualization interactive-visualizations

Last synced: 12 Jul 2025