An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/zulhaditya/netflix-analysis

Netflix data analysis using multiple Python libraries.

data-visualization python

Last synced: 19 May 2026

https://github.com/balajimohan18/milk-production-time-series-forecasting-datascience-project

This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast milk production. The model is evaluated using various metrics.

acf adf data-analysis data-cleaning data-science data-visualization eda exploratory-data-analysis machine-learning pacf seasonality time-series trends

Last synced: 30 May 2026

https://github.com/k8hertweck/tidytuesdaydataviz

data viz for TidyTuesday lunch meetup at the Hutch

data-visualization tidytuesday

Last synced: 30 May 2026

https://github.com/karishmagupta05/udemy-course-analysis

This project analyzes Udemy courses using Exploratory Data Analysis (EDA) techniques to uncover insights about course trends, pricing, subscriber counts, and popularity. By leveraging Python, Pandas, and data visualization libraries, we extract meaningful information from the dataset.

data-analysis data-visualization eda jupiter-notebook pandas python

Last synced: 13 Apr 2026

https://github.com/srinibas-masanta/zomato-customer-and-restaurant-analysis

This repository contains a comprehensive analysis of Zomato's platform, focusing on various aspects of customer behavior, restaurant performance, and market trends. The analysis leverages data-driven insights to answer key questions that can guide business strategies, enhance customer satisfaction, and optimize operational efficiency.

business-analytics data-analysis data-science data-visualization

Last synced: 02 Apr 2025

https://github.com/suresh-chelani/crop-data-visualization

This project implements data visualization tasks using TypeScript, Vite, Apache ECharts, and Mantine v7. The goal is to process agricultural data, handle missing values, and render a table and a bar chart based on the dataset.

apache-echarts data-visualization mantine-v7 typescript vite

Last synced: 01 Mar 2025

https://github.com/spriggancg/hishiryo

A package to generate a picture representation of any csv file.

csv data-visualization dataset heatmap package pipy python python3

Last synced: 14 Jan 2026

https://github.com/domingosdeeulariadumba/ablisk

A Python module for design, analysis and decision-making of A/B tests.

ab-testing data-visualization statistics

Last synced: 14 Jan 2026

https://github.com/beyzabasarir/brazilian-e-commerce-analysis

Brazilian E-Commerce Dataset By Olist PostgreSQL Analysis

data-analysis data-visualization sql

Last synced: 08 Jan 2026

https://github.com/izadoraluz/uber-twitter-feedback-analysis

An exploratory study analyzing positive feedback about Uber on Twitter (X) using data visualization, with the goal of creating a practical project using NLP and an intuitive dashboard.

dashboard data-visualization pln

Last synced: 05 Feb 2026

https://github.com/sssshefer/covid-map

Interactive map showing COVID data, implemented in R.

big-data data-visualization r r-studio

Last synced: 01 Mar 2025

https://github.com/thomas-basham/ps-creel

This web application fetches fishing report data from the Washington Department of Fish and Wildlife (WDFW) Creel Reports page and displays it on an interactive map.

creel creel-survey data-science data-visualization database fish fishing nextjs postgresql puget-sound-data pugetsound react sql website

Last synced: 13 Apr 2026

https://github.com/sakeenanavavi/larana_diamonds

A diamond price predictor that uses multiple machine learning algorithms to predict the price of a diamond based on its attributes.

data-visualization knn machine-learning-algorithms random-forest regression support-vector-regression xgboost

Last synced: 10 Jan 2026

https://github.com/samruddhi3012/public-health-data-analysis

Hi! This repo analyzes healthcare data using advanced Microsoft Excel.

dashboard data-analysis data-visualization healthcare microsoft-excel pivot-chart pivot-tables vlookup

Last synced: 05 Feb 2026

https://github.com/aykutsp/world-infrastructure-data-hub

Interactive world map of fuel prices, electricity, EV charging costs and CO2 emissions — daily-refreshed open data pipeline.

choropleth climate-data co2-emissions data-visualization electricity-prices energy-data ev-charging fuel-prices github-actions leaflet open-data react sustainability typescript vite world-map

Last synced: 05 Apr 2026

https://github.com/joaopalmeiro/vscode-altair-snippets

A VS Code extension for scaffolding Altair charts for data visualization.

altair data-visualization python vscode vscode-extension vscode-snippets

Last synced: 09 May 2026

https://github.com/samwhaaa/da_portfolio

Showcasing some of my Data Analytics projects

data-analysis data-analytics data-visualization jupyter jupyter-notebook python

Last synced: 01 Mar 2025

https://github.com/nurulashraf/telco-customer-churn-prediction-model

This repository contains a Telco Customer Churn Prediction project using machine learning. It includes data preprocessing, exploratory data analysis, feature engineering, and model development to predict customer churn. Key tools used are Python, Pandas, NumPy, Matplotlib, Seaborn, and scikit-learn.

churn-prediction classification-model customer-churn data-visualization exploratory-data-analysis machine-learning predictive-analytics python scikit-learn

Last synced: 16 Mar 2025

https://github.com/aaronjmars/token-race

Data visualization tool for tokens, based on their FDV or MC.

coinmarketcap data-visualization dexscreener web3

Last synced: 14 Sep 2025

https://github.com/rohanag03/whatsapp-chat-analyzer

This project is a comprehensive tool for analyzing WhatsApp chat histories. Using Python and Streamlit, it visualizes chat data to provide insights into messaging patterns and media sharing habits.

chat-analysis data-visualization python streamlit

Last synced: 12 May 2026

https://github.com/auliannee/customer-analysis-with-tableau

This repository contains the data source and the tableau workbook.

data-analysis data-visualization tableau

Last synced: 12 Mar 2026

https://github.com/pratyusha-r/air-quality-index-visualization

This project visualizes the Air Quality Index (AQI) of various cities across India using Power BI. The aim is to provide insights into the air quality trends and help raise awareness about environmental issues affecting urban areas.

air-quality-data dashboard data-visualization powerbi

Last synced: 03 Jan 2026

https://github.com/xre22zax/airline-analysis

You run a travel agency and need to know the ins and outs of airline prices for your clients.

data-analysis data-visualization python python3 visualization

Last synced: 13 Apr 2026

https://github.com/madhuresh2011/hr-analytics-using-power-bi

HR Analytics Dashboard, leveraging the power of Power BI to transform data into actionable insights.

analysis dashboard data-analytics data-visualization excel-dataset insights power-query powerbi

Last synced: 07 Jan 2026

https://github.com/ibttf/isolation-index

Independent research project tracking risk factors for social isolation in different US counties.

data-science data-visualization gis mapbox psychology react

Last synced: 13 Apr 2026

https://github.com/fbarffmann/mycitibike

Built an interactive Leaflet.js map visualizing over 750 Citi Bike station locations in NYC. Analyzed usage patterns, station density, and user navigation across the network.

citibike data-analysis data-visualization geojson geospatial interactive-map javascript leaflet nyc web-mapping

Last synced: 07 Jul 2025

https://github.com/samruddhi3012/rfm-sales-analysis

Hi there! In this project I have performed Sales Analysis (RFM Analysis) using SQL and Tableau.

data-analysis data-visualization mssqlserver rfm-analysis segmentation tableau

Last synced: 12 Mar 2025

https://github.com/subratamondal1/heart-attack-prediction

Heart Attack Prediction of patients based on the required data. Data Ingestion - Data Preparation - Exploratory Data Analysis (EDA) - Modelling - Evaluation.

data-analysis data-science data-visualization kaggle-dataset machine-learning matplotlib-pyplot numpy pandas python3 scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/deliprofesor/arrhythmia-classification-and-anomaly-detection

This project classifies arrhythmias and detects anomalies using machine learning and deep learning. It includes preprocessing the "INCART 2-lead Arrhythmia Database," feature engineering, KMeans clustering, Random Forest, IsolationForest, and an LSTM model for classification.

classification clustering data-science data-visualization deep-learning machine-learning

Last synced: 10 Apr 2025

https://github.com/archanakokate/iris_flower_classification

Analyzing and modeling the Iris dataset with the aim of classifying the species of Iris flowers.

analytics data-visualization exploratory-data-analysis machine-learning-algorithms

Last synced: 17 Mar 2025

https://github.com/jansim/ridges

R package for downloading and visualizing topographical elevation data.

data-visualization geospatial r ridgeline

Last synced: 02 Mar 2025

https://github.com/sehgal-vishal/blinkit-sales-project

This dashboard is designed to provide in-depth insights into key performance indicators (KPIs) critical to the operations of Blinkit.

data-visualization dataanalysis powerbi sales

Last synced: 19 Mar 2026

https://github.com/albanecoiffe/uber_data_visu_streamlit

Interactive dashboard built with Streamlit for exploring Uber trip data from January 2015 in New York.

data-visualization streamlit

Last synced: 02 May 2026

https://github.com/01110011011101010110010001101111/tigergraph_cosmos_template

Template for TigerGraph and Cosmograph projects with pyTigerGraph, FastAPI, and Cosmos

cosmograph data-visualization tigergraph

Last synced: 26 Mar 2025

https://github.com/nero103/airbnb-destination

This is an end-to-end project to uncover the ideal destination based on listings and hosts. The strategy included: data workflow, SQL analysis, data modeling, data visualization, and findings.

data-analysis data-modeling data-visualization etl etl-pipeline excel microsoft-sql-server powerpoint sql tableau

Last synced: 27 Mar 2026

https://github.com/seblehner/feldprakt

Collection of plotting routines for a field exercise work using different measurement tools and Hobo weather stations.

data-analysis data-visualization jupyter-notebook python

Last synced: 05 Oct 2025

https://github.com/jianxi-erin/bigdata-machinelearning-lab

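This project is a comprehensive big data and machine learning lab platform comprising two main tasks, each covering three key technical modules: big data processing, data analysis, and machine learning. The project is based on real competition designs and provides complete data processing simulations and modeling practice.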

data-analysis data-visualization hadoop machine-learning python spark sql

Last synced: 03 May 2026

https://github.com/trismald/eurosoccer1023

Data Analyst - European Soccer 2010 2023

data-analysis data-visualization jupyter-notebook pandas powerbi python

Last synced: 06 May 2026

https://github.com/dibsthegreat/titantic-dataset-analytics

DASC4850 Final Project where I did EDA to determine the survivability of Titanic guests depending on Age, Gender, Wealth, etc.

data-science data-visualization matplotlib numpy pandas python random-forest-classifier

Last synced: 13 Apr 2026

https://github.com/davifeliciano/modern_physics_experiments

Collection of data analysis and visualization scripts developed in Python around some modern physics experiments

data-analysis data-visualization modern-physics physics physics-experiments

Last synced: 18 Jan 2026

https://github.com/minhtungonep/android-traffic-analysis

Android malware detection project analyzing network traffic patterns in a telecommunications context. Uses statistical hypothesis testing and data visualization to evaluate traffic features like DNS query times, TCP packets, and volume bytes for distinguishing between benign and malicious Android applications.

android-library bachelor-project bachelor-thesis cybersecurity cyface data-visualization hypothesis-testing malware-detection matplotlib network-traffic numpy pandas python scipy sdk statistical-analysis telecommunications voip-security

Last synced: 09 May 2026

https://github.com/tsbarr/belly-button-challenge

Using front-end development tools (JavaScript, HTML, and CSS), I built an interactive dashboard to explore the Belly Button Biodiversity dataset, which catalogs the microbes that colonize human navels.

data data-visualization javascript

Last synced: 04 Mar 2026

https://github.com/deliprofesor/kizbasina_odev4_trafik

This project analyzes the causes, severity, and change over time of traffic accidents, using comprehensive datasets on traffic accidents that occurred in England between 2005 and 2014.

data-science data-visualization istatistik matplotlib pandas python statistics

Last synced: 14 Apr 2026

https://github.com/bdice/signac-micde-cnsccs-2018

Slides and demos for the MICDE CNSCCS Symposium, October 15, 2018

data-management data-visualization demo signac workflow-automation

Last synced: 07 Oct 2025

https://github.com/valinsogna/data_visualization_project

Analyzing scores from 17 major international skating events (Oct 2016-Dec 2017). This project delves into judge biases, athlete rankings based on difficult elements, and the significance of elements versus components in final rankings. Built using Python, it offers insights derived from publicly-released International Skating Union Protocols

data-visualization skating

Last synced: 07 Oct 2025

https://github.com/subhadipsinha722133/diamond-price-predction

This project applies 🤖Machine Learning techniques to analyze these features and build a predictive model that estimates the selling price of diamonds

data-visualization machine-learning pandas pkl-model python random-forest sklearn streamlit

Last synced: 11 Apr 2026

https://github.com/omarsolieman/socialgiveawaydataanalysis

This project involved cleaning, analyzing, and processing data from an Instagram giveaway to ensure a fair and data-driven winner selection process. The primary goal was to automate the process of identifying valid entries, weighting them based on engagement (likes and multiple entries), and performing a post-giveaway analysis

data-analysis data-science data-visualization instagram scraping threejs

Last synced: 14 May 2026

https://github.com/alexquilis1/spanish-fuel-stations-analysis

Real-time analysis of Spanish fuel prices using government API data with interactive maps and regional comparisons

data-analysis data-visualization fuel-prices geospatial-analysis ggplot2 government-data leaflet open-data r shiny spain tidyverse

Last synced: 08 Oct 2025

https://github.com/abinjohn8138-commits/churn-analysis

This project focuses on analyzing customer churn behavior within a telecommunication company using visual insights. The goal is to understand what factors lead to customer attrition and help the business take proactive steps to retain customers.

colab-notebook data-visualization excel insights jupyter-notebook pandas python

Last synced: 05 May 2026

https://github.com/jdede1/data-analysis-visualization-assignment-5

INFO 526 — Data Analysis and Visualization, Assignment 5 (Dashboard Reports — Iowa Liquor Sales). Part of the Master’s in MIS/ML program at the University of Arizona. Includes positive and negative dashboards showing key KPIs: top products/vendors driving sales vs bottom products/vendors hindering sales.

dashboard data-visualization matplotlib pandas seaborn

Last synced: 16 Apr 2026

https://github.com/gauravsy704/sct_ds_2

Performed data cleaning and exploratory data analysis (EDA) on the Titanic dataset from Kaggle. Investigated the relationships between variables and identified key patterns and trends in the data using Python, with a focus on survival rates, passenger demographics, and embarkation details.

data-science data-visualization jupyter-notebook pandas python seaborn

Last synced: 06 May 2026

https://github.com/lixx21/tableau_netflix_movies_tvshows_2021

Visualize Netflix movies and TV shows in 2021

data-visualization dataset netflix tableau

Last synced: 19 Jan 2026

https://github.com/damisparks/machine-learning

Machine learning is a method of data analysis that automates analytical model building. It is a branch of artificial intelligence based on the idea that systems can learn from data, identify patterns and make decisions with minimal human intervention

data-science data-visualization deeplearning machine-learning machine-learning-algorithms machinelearning-python

Last synced: 09 Oct 2025

https://github.com/priyanshubiswas-tech/priyanshubiswas-tech

SWE-Data Engineer @ EDN | Kubeflow-MLOps | Kubernetes | Databricks | AWS EMR-Lambda-Glue, Eventbridge, SQS-SNS | OCI Multi-Cloud Architect Professional | GCP GA4 | Gen AI | IEEE Brand Amb. | Ex-Chair, PES | Ex-Sec, SB

apache-spark aws data-analysis data-engineering data-visualization dbt hadoop kubernetes python3 sql

Last synced: 21 Jan 2026

https://github.com/ianhaggerty/final-capstone

This represents the final capstone project in my HyperionDev Data Science (fundamentals) course. A dataset of Amazon customer reviews is analysed using natural language processing.

amazon data-analytics data-science data-visualization dataset matplotlib nlp nlp-machine-learning numpy pandas plotly reviews seaborn spacy tabulate textblob wordcloud

Last synced: 19 Jan 2026

https://github.com/hirkojoba/fintrack

Full-stack financial tracking app with ML forecasting and AI insights. Built with Rails, PostgreSQL, Python/scikit-learn, and OpenAI API.

artificial-intelligence data-visualization fintech full-stack machine-learning openai postgresql python ruby-on-rails scikit-learn

Last synced: 14 Apr 2026

https://github.com/ratna-babu/generating-graphs

Generate, color, and visualize random graphs using Python's NetworkX and Matplotlib. Includes compression and storage of graph data with .gz and pickle. Ideal for exploring graph coloring and greedy algorithms in graph theory.

data-visualization erdos-renyi graph-coloring graph-theory greedy-algorithm matplotlib networkx python random-graph random-graph-generation

Last synced: 10 Oct 2025

https://github.com/salma-mamdoh/the-android-app-market-on-google-play-project

My project aims to practice Data Analysis and Data Visualization on DataCamp

data-analysis data-visualization datacamp jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/frankelavsky/security-dash-challenge

I had two 8-hour days to create a visualization dashboard for three datasets. Tab one: Voronoi overlay on a line graph. Tab two: a data partitioning method keeps in-memory usage low. Tab three: deals with "Failed" vs "Successful" attempts as positive/negative bar charts over time. I used d3.js, require, the MVC pattern, and vanilla JS.

client-side complexity css3 d3 d3js dashboard data-analysis data-structures-algorithms data-visualization frontend-app html5 interactive-visualizations javascript modular network-analysis network-monitoring network-security security single-page-app visualization

Last synced: 14 Apr 2026

https://github.com/allanreda/automated-k-means-clustering-engine

An interactive K-Means clustering tool built with Flask and Scikit-Learn, supporting Excel file uploads, cluster analysis, and data export, deployed on Google Cloud Run via Docker with CI/CD integration.

cicd css data-visualization deployment docker flask google-cloud-run html javascript k-means-clustering machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 19 Jan 2026

https://github.com/controldata23/automobiles-data-exploration

An exploratory data analysis of an automobiles dataset from Kaggle

data-exploration data-visualization eda jupyter-notebook matplotlib python-data-analysis

Last synced: 19 Jan 2026

https://github.com/alexmcvay/uber-data

UBER sql clone

data data-visualization sql

Last synced: 19 Jan 2026

https://github.com/bhavinpatel4199/machine-learning-programming

This repository serves as a central hub for various machine learning projects and experiments. It contains multiple sub-repositories, each focusing on different aspects of machine learning, from data preprocessing to advanced deep learning techniques.

data-structures data-visualization machine-learning machine-learning-algorithms pandas-dataframe python3 sklearn

Last synced: 19 Jan 2026

https://github.com/azaz9026/email-spam-detection

Welcome to the Email Spam Detection project! This repository provides a machine learning model for detecting spam emails using a Naive Bayes classifier and a simple web interface built with Streamlit.

data-analysis data-cleaning data-structures data-visualization deep-learning machine-learning python sql streamlit

Last synced: 14 Apr 2026

https://github.com/mouradhamzaoui/End-To-End-MLOPS-Airline-Project

This project aims to predict the number of passengers, freight quantity, and mail quantity for American airlines operating between Canadian and U.S. airports using an MLOps approach. It involves automating the data pipeline, from data extraction and preparation to model training and evaluation, leveraging tools like DVC, MLflow, and Docker for vers

data-visualization docker dvc github-actions machine machine-learning-algorithms mlflow

Last synced: 14 Apr 2026

https://github.com/ahsankhizar5/retail-sales-analysis-python-powerbi

A complete retail sales analytics project using Python for data cleaning and EDA, and Power BI for dashboard visualization. Built as a capstone for the Business Analytics Bootcamp by CourseMea.

business-analytics capstone-project coursemea dashboard data-visualization eda exploratory-data-analysis powerbi python python3 retail-data

Last synced: 31 May 2026

https://github.com/archanakokate/kkbox_music_recommendations

Predicting the chances of a user listening to a song repetitively after the first observable listening event.

data-visualization exploratory-data-analysis machine-learning statistical-analysis

Last synced: 11 Oct 2025

https://github.com/dzakwanalifi/stadata-x

Terminal UI for interactively exploring and downloading BPS Indonesia data

bps-api cli-app data-analysis data-visualization indonesia-statistics indonesian-data open-data python statistics terminal-ui textual tui

Last synced: 20 Jan 2026

https://github.com/timjjting/task-lineage-generator

A simple Task Lineage Diagram Generator

d3 dag data-visualization golang graphviz-dot lineage task

Last synced: 21 Jan 2026

https://github.com/abeltavares/postql

Python library and command-line interface (CLI) tool for interacting with PostgreSQL databases, providing simplified database management, query execution, and result export functionalities.

cli command-line-interface data-analysis data-engineering data-export data-management data-processing data-visualization database database-administration database-tools etl oop postgres postgresql psycopg2 python sql sqlalchemy wrapper

Last synced: 19 Jan 2026

https://github.com/tzerk/esr

R package 'ESR' for plotting and analysing ESR spectra in dating applications

data-analysis data-visualization electron-spin-resonance geochronology r

Last synced: 13 Mar 2026

https://github.com/alexondata/daan_eda-exploratory-data-analysis_ecommerce

This project presents an Exploratory Data Analysis (EDA) pipeline for an eCommerce dataset, integrating Python, SQL Server, and Power BI to transform raw transactional data into meaningful business insights. The project was developed as part of an academic assignment at Transilvania University of Brașov, Faculty of Mathematics and Computer Science.

data-analysis data-visualization ecommerce microsoft-sql-server powerbi python

Last synced: 18 May 2026

https://github.com/sebastian-gregoricchio/rseb

An R-package for daily tasks required to handle biological data as well as avoid re-coding of small functions for quick but necessary data management.

atac-seq bedtools chip-seq cutandtag daily-tasks data-visualisation data-visualization datamining deeptools genomics ngs qpcr qpcr-analysis r rna-seq statistics

Last synced: 31 May 2026

https://github.com/achronus/data-graphing-tool

A tool for finding the perfect graph that fits your CSV data.

data-visualization matplotlib numpy pandas python3

Last synced: 13 Oct 2025

https://github.com/arunabhagit/data-driven-marketing-optimization-enhancing-engagement-conversions-and-customer-satisfaction

Analyzed ShopEasy’s marketing data using SQL, Python, and Power BI to identify low engagement and conversion issues, performed sentiment analysis, and delivered data-driven strategies for measurable performance improvement.

business-intelligence data-visualization dataanalytics marketingstrategy powerbi python sql

Last synced: 13 Oct 2025

https://github.com/flowsta/ods-educacion-aporta

SDGs (ODS) for education, APORTA 2021 initiative

data data-visualization ods sdg

Last synced: 27 Jan 2026