An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/sanand0/datastories

Small data visualizations and stories, mostly vibe-coded

data-visualization

Last synced: 03 Apr 2026

https://github.com/albertofaraujo/pbi_dashboard_prouni

Analyzes data on the quantitative breakdown of PROUNI scholarships granted in 2021.

data-visualization dax-studio power-query powerbi

Last synced: 03 Feb 2026

https://github.com/quangandrei1003/france_air_pollution_pipeline

End-to-end air pollution data pipeline for French metropolitan cities using Airflow, Python, dbt, BigQuery.

airflow bigquery data data-analytics data-engineering data-modeling data-visualization dbt docker etl pandas python terraform

Last synced: 13 Apr 2026

https://github.com/ianjure/martial-law-in-data

A data visualization of how martial law shaped the Philippine economy.

data-visualization

Last synced: 05 Jan 2026

https://github.com/ayaankhan98/covid-19-analysis

Covid-19 Analysis. This repository is part of AMURoboHack 1.0. Here we tried to visualize worldwide Covid-19 data. Data visualization gives an easy way to understand large amounts of data. We plotted the data on a world map so that users can easily get the stats for a country by hovering the mouse pointer over it; we also added zooming on the world map to make the interface more attractive and user-friendly.

covid-19 d3js data-visualization topojson

Last synced: 30 Mar 2025

https://github.com/shudhanshurp/adidas-us-data-analysis

This Power BI project analyzes Adidas sales data across different regions, retailers, and product categories in the U.S. The dashboards provide insights into sales performance, operational metrics, and future forecasts to support data-driven decision-making.

data-analysis data-transformation data-visualization forecasting powerbi python retail-analytics

Last synced: 24 Apr 2026

https://github.com/nmatthews2203-del/rent-affordability-explorer

Interactive housing analytics dashboard using Zillow rent data and Census income data to analyze affordability, rent trends, and geographic housing differences across U.S. counties.

altair data-analytics data-visualization housing-data interactive-dashboard pandas plotly python real-estate sql sqlite streamlit

Last synced: 03 May 2026

https://github.com/lucertgvby/phat

Graphical PowerShell application designed to help investigators, security analysts, and IT professionals examine email headers for signs of phishing or spoofing. The tool parses headers from .eml and .msg files, highlights important fields, and provides insights into SPF, DKIM, and DMARC results.

data-visualization dimensionality-reduction distributed-computing hashcracking led-matrix-displays mqtt off-chain-compute phala phat raspberry-pi-library single-cell srp-phat unsupervised-learning visualization

Last synced: 21 May 2026

https://github.com/shubham200137/expense-tracker-dashboard

The task is to create a Power BI dashboard from expense data (October–December) stored on SharePoint/OneDrive. It should include dropdowns for file and sheet selection with auto-refresh.

dashboard data-visualization powerautomate powerbi

Last synced: 04 Feb 2026

https://github.com/dmarks84/coursework_project_text-mining-topic-modeling

Project for University of Michigan Applied Data Science Specialization -- Developed functions to score similarity between text passages.

data-modeling data-reporting data-visualization databases eda nlp numpy pandas python statistics text-mining

Last synced: 12 Apr 2026

https://github.com/oenm176/hmeq-loan-analysis

Extracting insights from the Home Equity (HMEQ) dataset. This project builds a classification model to detect bad loans, featuring complete data preprocessing, normalization, and tree visualization using Python.

classification-model credit-risk-analysis data-mining data-science data-visualization decision-tree hmeq-dataset machine-learning python scikit-learn student-project

Last synced: 13 Apr 2026

https://github.com/femincan/d3-choropleth-map

My solution for the Visualize Data with a Choropleth Map project on FCC.

css3 d3js data-visualization html5 javascript

Last synced: 13 Apr 2026

https://github.com/andersoncrs/arboles_de_decision_calidad_del_vino

Contains a detailed analysis of wine quality using a classification model based on decision trees. Includes data exploration, outlier detection and handling, univariate and bivariate analysis, and the creation and evaluation of a predictive model. The main goal is to predict wine quality.

data-analysis data-science data-visualization machine-learning matplotlib seaborn sklearn tree-decision

Last synced: 20 May 2026

https://github.com/ppatrzyk/heatmap

Display CSV as a heatmap in terminal

csv data data-visualization terminal

Last synced: 24 Apr 2026

https://github.com/evan-dg31/data-science

Exploratory Data Analysis (EDA), Predictive Modeling (Supervised and Unsupervised), Regression, Classification, Clustering

classification clustering data-analysis data-science data-visualization machine-learning matplotlib numpy pandas python regression-analysis seaborn

Last synced: 13 Apr 2026

https://github.com/danaelshrbiny10/gold-prices

The Egypt Gold Prices project is a data analysis and visualization initiative that focuses on tracking and understanding the daily gold prices in Egyptian pounds per gram.

data-visualization docker docker-compose matplotlib mongodb numpy pandas powerbi python3 webscraping

Last synced: 13 Apr 2026

https://github.com/udhaya2823/dataspark-illuminating-insights-for-global-electronics

✨DataSpark✨ is a powerful analytics project transforming raw retail data into actionable insights for Global Electronics. By leveraging Python, SQL, and interactive visualizations, it uncovers trends in customer behavior, sales performance, and product popularity, driving smarter business decisions and boosting growth.

data-science data-visualization database-management datacleaning exploratory-data-analysis matplotlib numpy pandas powerbi python seaborn sql version-control

Last synced: 11 Apr 2026

https://github.com/eliasdehondt/learn-r

Welcome to the Learn-R repository! This is your go-to resource for learning the R programming language, whether you're a beginner or looking to enhance your skills.

data-analysis data-visualization education machine-learning programming r statistics tutorials

Last synced: 03 Apr 2026

https://github.com/27ahmad/heart-disease-diagnostic-eda

This project conducts Exploratory Data Analysis on a heart disease diagnostic dataset, aiming to derive valuable insights from the analysis.

data-analysis data-visualization pandas python

Last synced: 06 May 2026

https://github.com/badranalyst/tips-dataset-analysis-dashboard-with-streamlit-and-plotly

Interactive Streamlit dashboard analyzing the Seaborn 'tips' dataset, which records information on restaurant bills, including total bill amounts, tips, customer demographics (e.g., gender, smoking status), and dining details (e.g., day, time). Visualized with Plotly for insights into tipping patterns.

data-analysis data-analytics data-visualization dataset eda exploratory-data-analysis matplotlib matplotlib-pyplot numpy pandas plotly python seaborn streamlit

Last synced: 13 Apr 2026

https://github.com/allanreda/telco-customer-churn-predictor-app

A web-based machine learning application that predicts customer churn using a logistic regression model. Built with Scikit-Learn for model training, Gradio for the user interface, and deployed on Google Cloud App Engine. The app allows users to input customer data and receive predictions on churn risk to support business decision-making.

app-engine data-visualization deployment google-cloud gradio hyperparameter-tuning logistic-regression machine-learning numpy pandas scikit-learn

Last synced: 16 Apr 2026

https://github.com/ecastillo081/marketing-finance-dashboard

An analytics project that combines Google Ads + Meta Ads data and translates it into finance metrics.

business-intelligence cac cohort-analysis data-pipeline data-visualization ltv marketing-analytics mode-analytics postgres roas sql strategic-finance supabase

Last synced: 17 Apr 2026

https://github.com/nmelgar/lego_my_data

Data visualization project to sell LEGO bulks.

csv data-analysis data-visualization data-viz google-sheets tableau

Last synced: 08 Jan 2026

https://github.com/badranalyst/student-tests-data-analysis-application

Python-based analysis of student test scores in math, reading, and writing, examining correlations with parental education, lunch type, and test preparation. Includes data cleaning, visualization, and statistical insights into factors influencing academic performance.

data-analysis data-visualization dataset matplotlib numpy pandas python sklearn

Last synced: 05 May 2026

https://github.com/anhmiuhv/android_realtime_graph_view

A small Android module to display a data stream in real time

android data-stream data-visualization graph realtime

Last synced: 02 May 2026

https://github.com/brunomontezano/simple-thrive-user-growth-plot

📱 The repo contains a simple line plot for a presentation about the "Thrive: combata a depressão" app showing the user growth from April to October in 2022.

data-visualization data-viz datavisualization dataviz depression ggplot2 plots presentation-materials r-programming thrive-app user-growth

Last synced: 10 Jun 2026

https://github.com/victorowinoke/custmer-segmentation-using-rfm-python-

Customer Segmentation using the Recency, Frequency and Monetary Values

customer-segmentation data data-visualization python3 science time-series-analysis

Last synced: 26 May 2026

https://github.com/azaz9026/myntra_review_project

Myntra Scraper Project. Project overview: The Myntra Scraper Project is designed to extract product data from the Myntra website. This tool enables users to gather information such as product names, prices, descriptions, ratings, and images for analysis, comparison, or personal use.

data-science data-structures data-visualization filesystem github mogodb mogoose python3 strreamlit web-scraping

Last synced: 10 Apr 2026

https://github.com/miteshgupta07/streamlit-machine-learning-app

A Streamlit application for interactive exploratory data analysis (EDA) and data visualization, offering dynamic tools to analyze and visualize machine learning datasets.

data-visualization python streamlit

Last synced: 27 Apr 2026

https://github.com/pkjjoshi/behind-the-menu-uncovering-insights-from-restaurant-data

Discover hidden patterns in dining data — from popular cuisine pairings to geographic restaurant clusters

data-analysis data-visualization insights jupyter-notebook pandas python restaurant-data

Last synced: 05 Jul 2025

https://github.com/melih0132/all-my-projects

This repository showcases projects from my computer science journey, covering technologies like web development and interactive applications.

csharp data-visualization database game-development html-css ia javascript kotlin-android python software-development swift unity web-development

Last synced: 05 Apr 2026

https://github.com/nazir20/scraping-tweets-using-python-and-preprocessing-tweets-for-sentiment-analysis

This repo is about how to scrape tweets from Twitter using Python and preprocess them for sentiment analysis

data-cleaning data-visualization jupyter-notebook python twitter-sentiment-analysis

Last synced: 13 Apr 2026

https://github.com/sharoonjoseph321/liver_cirrhosis

This project aims to understand the factors contributing to liver cirrhosis, analyze its impact, and possibly predict disease outcomes using machine learning. It might also explore survival analysis or risk stratification for liver cirrhosis patients.

analytics data-science data-visualization dataanalysis machine-learning machine-learning-algorithms predictive-analytics predictive-modeling python random-forest-classifier visualization

Last synced: 15 Mar 2025

https://github.com/virajbhutada/titanic-survival-prediction

ML project focused on predicting Titanic passenger survival using various algorithms and extensive data analysis techniques. This project includes detailed data visualization and interpretation to uncover key factors affecting survival. By leveraging various ML models the analysis aims to achieve high predictive accuracy.

ada-boost-classifier data-exploration data-science data-visualization decision-tree-classifier hyperparameter-tuning knn-classification logistic-regression machine-learning model-interpretation random-forest-classifier roc-curve titanic-classification

Last synced: 14 Jun 2026

https://github.com/mkaspulanwar/p6_bigdata_realtime_largescale_visualization

Week 6 Big Data practicum: real-time analytics and large-scale data visualization using PySpark Structured Streaming, a Parquet data lake, and Streamlit for monitoring smart-city mobility and traffic.

big-data data-visualization pyspark spark-streaming streamlit traffic-analytics

Last synced: 13 Apr 2026

https://github.com/snacks02/wobbling-statistics

Audio equipment statistics using Squiglink data

audio data data-visualization headphones iems speakers squiglink statistics

Last synced: 17 Apr 2026

https://github.com/deliprofesor/income-analytics-interpretable-machine-learning-model

This project predicts whether an individual earns more than 50K using the Adult Income dataset. A Random Forest model is trained and evaluated, with explanations provided through DALEX and LIME for feature importance and model transparency.

classification dalex data-preprocessing data-science data-visualization feature-engineering income-prediction lime machine-learning model-explainability predictive-modeling r-programming random-forest

Last synced: 10 Apr 2025

https://github.com/estebanrucan/reporte-comunas-tasa-defuncion-alta_2017

The purpose of this report is to identify the leading causes of death in Chile's communes in 2017. The material is freely available so that action can be taken.

chile data-visualization ggplot2 plotly rmarkdown

Last synced: 04 Feb 2026

https://github.com/akansharajput280799/covid19-impact-analysis-usa

Data Analysis and Predictive Modeling to study COVID-19 impact across age groups, regions, and seasons in the USA.

classification-algorithm clustering-algorithm data-preprocessing data-visualization descriptive-statistics exploratory-data-analysis matplotlib numpy pandas seaborn

Last synced: 13 Apr 2026

https://github.com/emcramer/clockplot

Plotting utility for a "clockplot" that puts groups into a time-ordered heterogeneity visualization

biology data-analysis data-visualization heterogeneity pseudotemporal-ordering

Last synced: 10 Mar 2026

https://github.com/samanhur/data_visualization_pcc

First experiences in data visualization with python

data-analysis data-science data-visualization python3

Last synced: 23 Mar 2025

https://github.com/leandrocollares/nyc-film-permits

NYC film permits: an exploratory data analysis

data-analysis data-visualization pandas plotly

Last synced: 05 Jul 2025

https://github.com/bala-1409/power-bi-visualization-project

This repository contains visualization projects built with Power BI. The visualizations yield insights and strategies that help a business grow its profit margins and reduce damage from accidents and calamities.

dashboard data-analysis data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint power-bi powerbi powerbi-reports powerbi-visuals visualization

Last synced: 04 Jan 2026

https://github.com/oelin/textgram

A simple text-based data visualisation library.

ascii-art data-visualization diagram python

Last synced: 23 May 2026

https://github.com/naveen88112/clustering_customer_invoice_data

Customer Invoice Data Clustering. This project uses clustering methods on customer invoice data for segmentation analysis. It preprocesses data, normalizes features, and uses K-Means and DBSCAN to cluster customers according to spending habits and shared locations.

clustering data-preprocessing data-visualization numpy pandas python silhouette-score standardization

Last synced: 13 Apr 2026

https://github.com/vidushibhadana/eda-on-nyc-taxi-data

Conducting an Exploratory Data Analysis (EDA) on New York City taxi data and visualizing it through countplots, distribution plots (displot), and histograms using Python and its libraries.

data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/arslanr369/eda-journey

Exploratory data analysis (EDA) and visualization projects focusing on diverse datasets, including Bitcoin price trends and Indian restaurant reviews. Each notebook aims to provide insights and showcase data storytelling through visual exploration.

bitcoin data-science data-visualization eda

Last synced: 14 Mar 2025

https://github.com/ultra-bugs/pyside6-datatable-widget

A PySide6 DataTable widget with jQuery DataTable-like functionality

data-visualization desktop-app desktop-application gui pyside6 qt qt6 table

Last synced: 30 Jun 2025

https://github.com/aditishenoy35/netflix_analysis

An interactive data visualization project exploring Netflix content using Python and Jupyter Notebook.

data-visualization jupyter-notebook python

Last synced: 20 Apr 2026

https://github.com/kivanc57/feature_comparison

This project explores the relationship between features and diagnosis in cancer data. Using methods like boxplots, scatterplots, PCA, k-means clustering, and logistic regression, we analyze and visualize data to understand health indicators.

boxplot clustering correlation data-science data-visualization descriptive-statistics explanatory-data-analysis pearson-correlation r scatter-plot spearman

Last synced: 06 Jun 2026

https://github.com/weybsonalves/prevendo-o-atrito-de-clientes

A project in which I walk through the stages of the data science life cycle in order to predict customer churn for a bank's credit card service.

data-analysis data-science data-visualization machine-learning python

Last synced: 06 May 2026

https://github.com/dimits-ts/visualization-team-project

Team project visualizing various views for an established bike-sharing company. Includes a written report, presentation, R-code and Tableau files

data-visualization presentation-slides r-language tableau

Last synced: 06 Nov 2025

https://github.com/msikorski93/meteorite-landings

Basic data analysis focused mainly on visualizing geospatial data worldwide with cartopy.

cartopy data-visualization geopandas gis mapping meteorite-landing-sites shapefile

Last synced: 16 May 2026

https://github.com/akhdandann/squadevaluationdashboard-powerbi

A Power BI dashboard that visualizes squad evaluation metrics including happiness, contribution, commitment, delivery, and agile behavior across tribes at PT. XL Axiata Tbk. (with dummy data)

business-intelligence dashboard data-visualization power-bi reporting

Last synced: 26 Jan 2026

https://github.com/muichi-mon/fxplot

A simple JavaFX-based plotting library for quick and easy data-visualization.

data-visualization javafx plot series-data

Last synced: 16 May 2026

https://github.com/lotfiferaga/google-play-store-sentiment-analysis

Perform sentiment analysis on Google Play Store reviews using Python. Analyze user feedback to determine the overall sentiment (positive, negative, or neutral) towards various apps. Gain insights to aid developers and businesses in understanding user satisfaction levels and improving their products.

data-analysis data-visualization googleplayservices python reviewsanalysis-nlp

Last synced: 26 Feb 2025

https://github.com/shubham200137/icc-women-s-t20-world-cup-data-analytics

Created a Power BI report to identify top 11 players for a T20 cricket team by scraping data from espncricinfo with Python, cleaning and transforming the data with pandas, and evaluating various player performance metrics.

beautifulsoup4 data-analysis data-visualization numpy-python pandas-python powerbi web-scraping

Last synced: 25 Feb 2025

https://github.com/mjanez/spain-cultural-pulse

Interactive web app to explore contemporary Spanish culture, values, politics & social norms with beautiful data visualizations (Next.js + Leaflet + Recharts + D3). Based on 2024 nationwide survey (3k respondents).

csic culture d3js data-visualization i18n nextjs norpol open-data politics social-norms sociology spain spain-culture spain-politics survey-data tailwindcss

Last synced: 13 Jan 2026

https://github.com/maettuu/project-beatblend

Repository for the Master's Project 2024 on Visualizing and Explaining Sequential Song Recommendations through Data Humanism

audio-features aws ci-cd content-based-recommendation data-visualization discogs-api docker fastapi full-stack jwt-token masters-project postgres python recommendation-system redis rest-api spotify-api visual-data vuejs websocket

Last synced: 11 Apr 2026

https://github.com/ndiplacide7/r-project

Explore diverse data analysis techniques using R programming combined with advanced machine learning algorithms to uncover insights and create powerful predictive models.

data-analysis data-visualization machine-learning-algorithms r

Last synced: 25 Mar 2025

https://github.com/usman619/data-science

Data Science basic practice code, assignments etc.

data-science data-visualization elt

Last synced: 06 Jun 2026

https://github.com/yash-rewalia/airbnb_eda_pandas

The goal of the project is to gather and analyze detailed information about the different entries in order to provide insights about the host and price of a property in a particular area, according to your preferences, room type, and number of reviews.

data data-cleaning data-insights data-preprocessing data-visualization matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/noor188/preswald-data-app

A data app to visualize and manipulate the graduate admission dataset

data-analysis data-visualization open-source

Last synced: 04 Jul 2025

https://github.com/guyabel/chord-afcon

Visualizing bilateral links between AFCON squads and players' clubs

afcon africa chord-diagram data-visualization data-viz dataviz football rstats visualization

Last synced: 10 Jun 2026

https://github.com/brianyu28/old-sheets-flying

Data analysis and graphics tool for The Harvard Crimson's Data and Design Teams

data-visualization harvard-university journalism

Last synced: 15 May 2025

https://github.com/sanand0/booksviz

LLM-generated visual insights from the GoodReads 100K dataset

data-visualization llm

Last synced: 20 Jan 2026

https://github.com/azaz9026/data_cleaning

Welcome to the Data Cleaning repository! This collection is dedicated to showcasing techniques and methods for cleaning and preparing datasets for analysis.

data-analysis data-engineering data-structures data-visualization eda feature-engineering machine-learning numpy outliers pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/jaymax01/dvd-rental-data-analysis

Data analysis of a DVD rental database

data-visualization postgresql sql

Last synced: 22 Jul 2025

https://github.com/danielpeter/wiggles

iOS App for visualizing ground motion of nearby seismic stations

data-visualization fdsn seismics

Last synced: 12 Mar 2026

https://github.com/shivamkumar818/mbti-personality-prediction-from-text-data

MBTI Personality Prediction from Text Data. This project leverages machine learning to predict Myers-Briggs Type Indicator (MBTI) personality types based on textual data, specifically from social media posts.

confusion-matrix correlation-matrix data-visualization dataset knn-classification linear-regression logistic-regression modeltraining navebayes numpy pandas python

Last synced: 13 Apr 2026

https://github.com/mahmoudwal27/manufacturing_downtime

This project focuses on improving manufacturing efficiency by analyzing production data. Using Python, SQL, and Power BI, we built interactive dashboards to uncover patterns, minimize downtime, and optimize operations. The goal is to help stakeholders make data-driven decisions for enhanced productivity.

data-analysis data-analysis-python data-visualization google-colab powerbi python sql

Last synced: 17 Apr 2026

https://github.com/swethajoseph/urological-cancer-referral-forecast

Analysing and forecasting urological cancer referral patterns for NHS Scotland, aiming to improve management and operational efficiency.

data-visualization datacleaning excel forcasting statistical-analysis tableau time-series-analysis

Last synced: 04 Jan 2026

https://github.com/an4pdm/data_analysis_escolar

Database project using data provided by the "Portal de Dados Abertos", built to practice my SQL skills.

analise-de-dados data-visualization database mysql project-repository sql study

Last synced: 10 Apr 2026

https://github.com/archanakokate/adidas_us_sales_analysis_powerbi

Analysis of Adidas sales in the US for 2020-2021 using Power BI

analysis data-visualization modelling powerbi

Last synced: 04 Jan 2026

https://github.com/shwetapardhi/assignment-04-simple-linear-regression-2

Q2) Salary_hike -> Build a prediction model for Salary_hike. Build a simple linear regression model by performing EDA, applying the necessary transformations, and selecting the best model using R or Python. EDA and Data Visualization. Correlation Analysis. Model Building. Model Testing. Model Predictions.

correlation-analysis data-visualization distplot eda feature-engineering model-building model-predictions model-template numpy ols-regression p-value pandas python r-square-values regression-plot seaborn simple-linear-regression smf statsmodels t-score

Last synced: 08 May 2026

https://github.com/niteshchawla/logistics-nn-regression

The case study is about India's largest marketplace for intra-city logistics. The dataset has the data required to train a regression model that estimates delivery time from its features.

adam-optimizer data-visualization encoding exploratory-data-analysis feature-engineering hidden-layers hyperparameter-tuning keras-tensorflow kerastuner metrics neural-network numpy pandas regression relu scaling sequential-models

Last synced: 10 Apr 2026

https://github.com/gowhale/daily-spend-analysis

Python script to analyse spending habits.

data-visualization pandas python

Last synced: 24 Apr 2026

https://github.com/aqueeqazam/matplotlib-for-data-science-analysis-and-statistics

With a nice mix of customization options and ease of use, Matplotlib is a robust Python library that can be used by both novice and seasoned data scientists and machine learning engineers to create a wide range of representations.

data-science data-visualization machine-learning matplotlib pyplot statistics

Last synced: 07 Oct 2025