An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/dsrodrigovieira/houserocketsales

Este repositório contém um projeto desenvolvido para praticar habilidades de análise de dados utilizando Python

data-analysis data-visualization heroku kaggle-dataset python

Last synced: 29 Apr 2026

https://github.com/pxaris/expenditure-analyzer

Application for analyzing expenditure data over time

data-analysis data-visualization docker python statistics

Last synced: 29 Apr 2026

https://github.com/manwithacap/by-the-metric-match

🎲🃏 A game data tracker for your board/card/video games!

data-analysis data-visualization games jupyter-notebook python utility

Last synced: 29 Apr 2026

https://github.com/datalopes1/ds_salaries2024_eda

Neste projeto será realizado o processo de EDA (Exploratory Data Analysis) a partir do dataset Data Science Salaries 2024, que pode ser encontrado no Kaggle, com licensa Database: Open Database e enviado por Sazidul Islam.

data-analysis data-visualization eda exploratory-data-analysis jupyter-notebook python

Last synced: 29 Apr 2026

https://github.com/arif-miad/global-plastic-waste-analysis

Global plastic waste is a pressing environmental issue, with massive production, limited recycling, and high risks to ecosystems and human health

catboost-classifier data-science data-visualization geopandas machine-learning matplotlib python random-forest-classifier seaborn

Last synced: 29 Apr 2026

https://github.com/chrnthnkmutt/theartofstatistic_python

This repository is implemented from David Spiegelhalter's The Art of Statistics Book, for making Python Visualization

data data-science data-visualization machine-learning statistics

Last synced: 08 Jun 2026

https://github.com/trigeminal/hospital-respiratory-forensics

(DS) A comprehensive repository dedicated to the analysis of weekly hospital respiratory data and metrics reported to the Centers for Disease Control and Prevention’s (CDC) National Health Safety Network (NHSN) from August 2020 through October 2024.

data-visualization jupyter-notebooks python3

Last synced: 30 Apr 2026

https://github.com/alrza2003/alrza2003.github.io

This repository contains the source files for my personal portfolio website. It highlights my background as a data analyst and radiology student, and showcases real-world projects, tools I use, and ways to connect with me. The site is based on a pre-built template that I customized to reflect my profile and experience.

data data-analysis data-visualization portfolio portfolio-website python

Last synced: 30 Apr 2026

https://github.com/mkk-1817/hr-attrition

This project, conducted during my internship at MeriSKILL, focuses on HR Attrition Prediction using advanced Machine Learning models. The initiative includes the development of a dynamic Dashboard and in-depth Analysis to offer actionable insights for proactive human resource strategies.

data-analysis data-science data-visualization jupyter-notebook machine-learning-algorithms powerbi python

Last synced: 03 May 2026

https://github.com/anarya22/heart-disease-classification

Predicting heart disease using machine learning. This notebook looks into various python base ML and DS libraries in an attempt to build a machine learning model capable of predicting whether or not someone has heart disease based on their medical attributes.

data-cleaning data-visualization machine-learning matplotlib numpy pandas scikit-learn

Last synced: 01 May 2026

https://github.com/alejo1630/ibm_capstone_project

This project aims to leverage predictive analytics to forecast the outcomes of rocket launches for Space Y, a new player in the commercial space industry.

data-collection data-science data-visualization data-wrangling exploratory-data-analysis machine-learning predictive-modeling python spacex

Last synced: 01 May 2026

https://github.com/com-480-data-visualization/project-2023-the-vizards

Lausanne Transportation : a data visualization of the Lausanne Transportation network. Developed by the Vizards team as part of the EPFL Data Visualization course project (COM-480).

buses data-analysis data-science data-visualization epfl lausanne map metro public-transport public-transportation switzerland webgl

Last synced: 01 May 2026

https://github.com/athari22/house_sales_in_king_count_usa

The idea of the project is to do a Data analysis in a Real Estate Investment Trust. The Trust would like to start investing in Residential real estate.

analysis data data-science data-visualization ibm ibm-watson linearregression machine-learning matplotlib numpy pandas sklearn-library

Last synced: 01 May 2026

https://github.com/archie-cm/credit_risk_model_vix_id-x_partners

The objective project is to decrease the company's losses by up to 30% through bad loans by creating a machine learning system to assist in automating loan assessments

credit-risk data-analysis data-visualization machine-learning scorecard

Last synced: 01 May 2026

https://github.com/dangerousfish/uk-climate-trends-dashboard-metoffice

A data pipeline and Streamlit dashboard that aggregates, cleans and visualises historical UK Met Office station data - interactive charts, heatmaps and maps for temperature, rainfall and sunshine.

climate climate-analysis climate-change climate-data climate-science data-analysis data-visualization metoffice metofficeweather streamlit temperature weather

Last synced: 02 May 2026

https://github.com/md-emon-hasan/2-simple-bioinformatics-dna-ml-app

A simple bioinformatics application for DNA sequence analysis using machine learning techniques, implemented in Python.

bioinformatics data-science data-visualization deployment dna-sequences dna-sequencing supervised-machine-learning

Last synced: 09 Jun 2026

https://github.com/msikorski93/visualizing-lastest-usgs-earthquakes

This notebook contains an introduction to the use of Python and cartopy to visualize data concerning earthquakes. We will first read a file with earthquake locations (latitudes, and longitudes), magnitudes in Richter scale, and depths, and other descriptors and then overlay it on a worldwide map.

cartopy data-visualization folium map

Last synced: 09 Jun 2026

https://github.com/gerhynes/d3-data-dashboard

A D3 data dashboard showing CO2 emissions per country. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization jaavscript

Last synced: 02 May 2026

https://github.com/msthamizh/phonepe-pulse-data-visualization-and-exploration

Developing a Streamlit application that allows users to explore and analyze transaction data from the PhonePe Pulse dataset. The project aims to provide insights into digital payment trends across India.

data-analysis data-visualization dataframe mysql pandas plotly python streamlit

Last synced: 02 May 2026

https://github.com/fybex/chatgpt-conversations-analysis

Analysis of 89,000 ChatGPT conversations to understand interaction patterns and response behaviors.

chatgpt conversation-analysis data-analysis data-visualization language-analysis prompt-patterns sentiment-analysis

Last synced: 02 May 2026

https://github.com/kplanisphere/plotted-3d-environment

Plotted 3D Environment is a graphical project inspired by Minecraft, designed to demonstrate 3D object creation, animation, and interaction using OpenGL. It features first-person navigation, texture mapping, and collision detection within a dynamic 3D environment filled with obstacles and enemies - Final project for the Graphing course.

3d-graphics animation camera-movement collision-detection computer-graphics cpp data-visualization educational-project opengl texture-mapping

Last synced: 03 May 2026

https://github.com/zoliqua/venn-diagram-lab

🟢 🟤 Interactive Venn diagram viewer & editor — 44 SVG models (2–9 sets), pre-computed region paths, React + TypeScript + Vite

adelaide bioinformatics data-visualization edwards-venn euler hamilton interactive manawatu massey palmerton-north python-package react set-theory svg typescript upsetplot venn venn-diagram venndiagram victoria

Last synced: 05 May 2026

https://github.com/rohithay/customer-segmentation

Clean and Apply RFM technique to rank and group clusters to identify the best customers and perform targeted marketing campaigns, using real online transaction data

data-cleaning data-science data-visualization datetime marketing-analytics pandas python3 user-segmentation

Last synced: 03 May 2026

https://github.com/tushard48/product-cluster-analysis

This project performs clustering analysis on a product dataset to identify and group similar products. The analysis includes data preprocessing, application of various clustering algorithms, and visualization of results to gain insights into product patterns. Key techniques used are K-Means, Mini Batch K-Means, evaluated using metr

data-visualization excel machine-learning powerbi streamlit unsupervised-learning

Last synced: 03 May 2026

https://github.com/zeynepcol/data-analysis-visualization

Data visualization and interactive analytics - Olympics Dataset

data-analysis data-science data-visualization matplotlib pandas plotly python scipy seaborn streamlit

Last synced: 03 May 2026

https://github.com/rayyan9477/house-price-prediction-model

This project aims to predict house prices using a machine learning model. The project involves data cleaning, feature engineering, model selection, training, and evaluation. The dataset is uploaded by the user, and the model is trained to predict house prices based on various features.

data-science data-visualization gridsearchcv machine-learning machine-learning-algorithms notebook python random-forest

Last synced: 05 May 2026

https://github.com/nicholas-miklaucic/rho_plus

The Python data viz nitro canister you didn't know you needed

aesthetics bokeh colormap data-visualization matplotlib plotly python

Last synced: 05 May 2026

https://github.com/flazefy2/ds-global_cybersecurity_threats

https://www.kaggle.com/datasets/atharvasoundankar/global-cybersecurity-threats-2015-2024

data-science data-visualization numpy python squarify statistics

Last synced: 05 May 2026

https://github.com/vara-co/python-api-challenge

Weather and Perfect Vacationing Spots Worldwide, by using APIs

api apis data-analysis data-visualization hvplot jupyter-notebook matplotlib pandas vacation weather

Last synced: 05 May 2026

https://github.com/shuddha2021/stellar-candidate-selector

A sophisticated candidate selection algorithm leveraging multi-criteria analysis and machine learning to identify top software engineering candidates. This tool features flexible filtering, score adjustment, and detailed visualizations to streamline the recruitment process.

candidate-selection data-analysis data-visualization machine-learning pandas plotting-in-python python python-data-analysis recruitment scikit-learn

Last synced: 05 May 2026

https://github.com/flazefy2/ds-customer_shopping

https://www.kaggle.com/datasets/bhadramohit/customer-shopping-latest-trends-dataset

data-science data-visualization jupiter-notebook numpy python sales shopping squarify statistics

Last synced: 05 May 2026

https://github.com/md-emon-hasan/ml-projects-telcom-customer-churn-prediction

📱 Customers are likely to leave a telecom service, enabling companies to take measures for retention and create accurate churn prediction models.

boostrap5 customer-churn customer-segmentation data-engineering data-science data-visualization logestic-regression machine-learning telco-customer-churn-prediction telcom-churn

Last synced: 05 May 2026

https://github.com/msohaill/wrappedify

Full stack implementation of Wrappedify

data-visualization express music socket-io spotify sveltekit

Last synced: 05 May 2026

https://github.com/romerorodriguezd/housing-market-tracker

Jupyter Notebook to store and evaluate long-run evolution of house sales, based on Idealista website.

data-visualization matplotlib pandas python scraping scraping-python sqlite3

Last synced: 06 May 2026

https://github.com/shazeus/vizflow-cli

Data visualization pipeline tool for schema inspection, charts, dashboards, and export

charts cli dashboard data-visualization flask pandas plotly python

Last synced: 09 Jun 2026

https://github.com/namratha2301/best-selling-books

Comprehensive examination of best-selling books, focusing on understanding sales patterns, genre distributions, and the impact of various features on book performance.This project aims to predict book sales and classify genres, providing valuable insights for authors, publishers, and readers.

data-analysis data-visualization matplotlib pandas sckiit-learn seaborn

Last synced: 06 May 2026

https://github.com/mrankitgupta/titanic-survival-prediction-93-xgboost

Titanic Survival Prediction Project (93% Accuracy)🛳️ In this notebook, The goal is to correctly predict if someone survived the Titanic shipwreck using different Machine Learning Model & Hyperparameter tunning.

classification data-analysis data-science data-visualization gradient-boosting kaggle-competition linear-regression logistic-regression machine-learning machine-learning-algorithms ml ml-models nlp prediction predictive-modeling random-forest titanic titanic-kaggle titanic-survival-prediction xgboost

Last synced: 06 May 2026

https://github.com/basedrhys/global-stock-vis

Visualisation of financial stock data using CesiumJS

cesiumjs data-visualization nodejs python

Last synced: 06 May 2026

https://github.com/scarblase/portfolioprojects

A collection of data analysis and business intelligence projects using SQL, Python, and visualization tools to uncover insights from real-world datasets. 🚀📊

csv data-analysis data-engineering data-mining data-science data-visualization matplotlib matplotlib-pyplot pandas python python3 seaborn sql

Last synced: 06 May 2026

https://github.com/himanchalchandra/science-canvas

Repo containing projects I did during a four months bootcamp on Data Science and Machine Learning organized by Science Canvas India.

data-mining data-science data-visualization machine-learning-algorithms mysql nlp-machine-learning

Last synced: 06 May 2026

https://github.com/mxagar/statistics_with_python_coursera

My personal notes done while following the Coursera Specialization "Statistics with Python", from the University of Michingan, hosted by Dr. Brenda Gunderson.

data-modeling data-science data-visualization hypothesis-testing machine-learning pandas python statistics

Last synced: 06 May 2026

https://github.com/vietdoo/real-estate-marketplace

a webapp that can visualize homes for sale on a cluster map. Data is continuously fetched from MongoDB, build filtering functions and APIs to find homes.

api bigdata data-visualization flask maps mongodb reactjs webscraping

Last synced: 07 May 2026

https://github.com/ssreeramj/binod-detector

Scrapes comments of a youtube video and shows distribution of comments having 'binod' in it

data-visualization heroku python streamlit youtube-api

Last synced: 16 May 2026

https://github.com/amirhosseinhonardoust/ai-personal-study-tracker

An AI-driven productivity tracking app built with Python, Streamlit, SQLite, and Machine Learning. It logs and analyzes study sessions, predicts productivity using Random Forest models, and visualizes key insights to help learners improve focus, habits, and overall academic efficiency.

ai data-analytics data-visualization education learning-analytics machine-learning productivity python random-forest self-improvement sqlite streamlit student-success study-tracker time-management

Last synced: 07 May 2026

https://github.com/sivas-2/coffee-sales-visualization

This repository contains data visualization scripts and notebooks analyzing coffee sales data from a vending machine, sourced from Kaggle. The visualizations explore sales trends, customer preferences, and product popularity over time.

data data-analysis data-science data-visualization python visualization

Last synced: 07 May 2026

https://github.com/rembertdesigns/smart-vinyl-catalog

AI-powered vinyl cataloging and music discovery platform leveraging BigQuery’s generative AI. Processes mixed-format data to deliver personalized recommendations, collection analytics, and intelligent search. Created for the Kaggle BigQuery AI Challenge to showcase real-world, scalable AI solutions for music lovers.

ai bigquery data-science data-visualization generative-ai hackathon kaggle kaggle-competition machine-learning music-analytics music-recommendation-algorithm python recommender-system vinyl

Last synced: 07 May 2026

https://github.com/willjw3/virus-spy

A simple, easy-to-follow Covid-19 tracker Jamstack app.

data-visualization jamstack javascript react

Last synced: 07 May 2026

https://github.com/backdoorali/insider-threat-detection-project

Personal data analysis project combining insider threat detection, cybersecurity, and exploratory data analytics. Built for portfolio showcase and practical skills demonstration.

cybersecurity data-analysis data-analysis-excel data-analysis-project data-analyst data-analytics data-visualization eda excel insider-threat jupyter-lab jupyter-notebook matplotlib numbers pandas portfolio-project python python3 threat-detection threat-intelligence

Last synced: 07 May 2026

https://github.com/bala-1409/foreign-exchange-rate-time-series-data-science-project

This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA to model the data and forecast the exchange rate.

data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms model modelfitting predictive-modeling python3 scikit-learn statsmodels time-series time-series-analysis

Last synced: 07 May 2026

https://github.com/vishvamporwal/pharmassist

A progressive web app made with Flask for Industrial pharmaceutical management and Analysis, to improve efficiency and make management easier.

data-visualization flask html-css-javascript python

Last synced: 07 May 2026

https://github.com/y-india/retail-sales-analysis-project

Analysis and preprocessing of retail store sales data. Includes data loading, merging, and initial inspection. 📌 Recommended: See README.md for detailed project progress and dataset information.

ai dashboard data-analysis data-science data-visualization jupiter-notebook machine-learning matplotlib python real-world-problem-solving real-world-project retail-analytics sales-analysis seaborn sklearn-library streamlit

Last synced: 07 May 2026

https://github.com/1ayanabil1/iris-visualization

This repository focuses on visualizing the Iris dataset using various data visualization techniques. It includes histograms, scatter plots, box plots, pie charts, bubble charts, and KDE plots to provide insights into the dataset’s structure. The project utilizes Matplotlib, Seaborn, Plotly, and Scikit-learn to generate insightful visualizations.

analytics clustering data-analysis data-science data-visualization datavisualization-project datavisualizations eda exploratory-data-analysis machine-learning machinelearning-python python

Last synced: 07 May 2026

https://github.com/pm25/youbike-station-finder

🚲 Display and visualize real-time information for Taipei YouBikes.

css3 d3js data-visualization html5 javascript visualization website youbike

Last synced: 08 May 2026

https://github.com/geo-y20/loan-approval-automation-using-mongodb-and-pymongo

This project demonstrates the implementation of a loan approval system that utilizes MongoDB for distributed data storage and management, and PyMongo for database operations. The project aims to automate the assessment of loan eligibility using customer details from online applications.

crud-application data data-analysis data-science data-visualization deployment jupyter-notebook loan-default-prediction loan-prediction-analysis machine-learning machine-learning-algorithms matplotlib mongodb pymongo streamlit web

Last synced: 08 May 2026

https://github.com/manhtdxxx/batch-and-stream-pipeline-via-lakehouse

This project demonstrates a modern Lakehouse architecture supporting both streaming and batch data pipelines, built on Apache Iceberg tables.

airflow batch-processing data-engineering data-visualization docker elt-pipeline hive-metastore iceberg kafka lakehouse medallion-architecture spark stream-processing superset trino

Last synced: 08 May 2026

https://github.com/darkmenacer/google-geolocation-bias

Tool to scrape web links for different queries from different cities across the world, all from your home

data-visualization postgresql python3 selenium webscraping

Last synced: 08 May 2026

https://github.com/hyoaru/philippine-poverty-area-estimates-choropleth

A web application providing a visual representation using a choropleth map of the estimated magnitude of poor families in the Philippines from the years 2006, 2009, 2012, and 2015.

data-visualization plotly python streamlit web-application

Last synced: 08 May 2026

https://github.com/iguptashubham/ott-churn-eda-ml

Understanding why customers discontinue their subscriptions will be crucial in optimizing the user experience, reducing churn, and maximizing customer lifetime value. By using Machine learning model to predict the Customer Churn.

data-analysis data-analysis-project data-science data-science-portfolio data-science-projects data-visualization machine-learning python

Last synced: 08 May 2026

https://github.com/jethronap/jstat-gui

Web-based GUI application for data analysis

data-analysis data-visualization java jstat mongodb

Last synced: 08 May 2026

https://github.com/themuhd/world-cup-analysis

Analysis of The FIFA World cup from its inception to the recently completed tournament in 2023

data data-science data-visualization dataanalysis matplotlib matplotlib-pyplot notebook python

Last synced: 08 May 2026

https://github.com/md-emon-hasan/4-eda-football-ml-app

A ML application focused on exploratory data analysis and football analytics, featuring data visualization and insights using Python and relevant libraries.

data-science-projects data-visualization eda exploratory-data-analysis football-analytics sports-analytics webapp

Last synced: 08 May 2026

https://github.com/flazefy/customanalytic

created using next js, mysql, laravel api

apexcharts api data-visualization nextjs vercel

Last synced: 09 May 2026

https://github.com/avijit-jana/redbus-data-scraper-dashboard

A Streamlit-based application leveraging Selenium to automate data scraping from Redbus, enabling efficient collection, analysis, and visualization of bus travel data for improved operational efficiency and strategic planning in the transportation industry.

automation dashboard data-analysis data-visualisation data-visualization datadrivendecisions filtering python3 redbus selenium selenium-python streamlit streamlit-application travel web-scraping webscrapping

Last synced: 09 May 2026

https://github.com/dogan-the-analyst/developer_survey_analysis

Analysis of the 2024 Stack Overflow developer survey. Tools used include Python, Pandas, Matplotlib, and IBM Cognos.

data-analysis data-visualization ibm-cognos-analytics matplotlib pandas python

Last synced: 09 May 2026

https://github.com/chauxvive/uschildpoverty

An interactive choropleth map visualizing U.S. state-level child poverty data using D3.js. Compare child poverty rates over time with data from KIDS COUNT and the US Census Bureau.

choropleth-map d3 d3js data-visualization dataviz

Last synced: 09 May 2026

https://github.com/douglasvolcato/fiis-analysis-brasilian-market

Brazilian investment fund analysis focused in dividend yield and minimun price variation

data-science data-visualization pandas python scraping scraping-websites

Last synced: 09 May 2026

https://github.com/barrarrr/fly-in

A dynamic, terminal-based drone network simulation application.

42 42school a-star algorithms breadth-first-search data-visualization drone fly-in

Last synced: 10 Jun 2026

https://github.com/vrostbyte/budget-app

Web app to manage personal finances: track expenses, income, bills, and visualize budgets with charts.

bills-management budget css data-visualization expense- finance html income-tracker javascript json personal-finance web-app

Last synced: 10 May 2026

https://github.com/deva-246/business-insights-on-realtime-swiggy-data-using-python

Data analysis for business decision-making and insights of a real time segment of Swiggy data.

data-visualization jupyter pandas python seaborn

Last synced: 10 May 2026

https://github.com/kalelmartinho/7daysofcode

Esse projeto tem como objetivo experienciar o dia a dia de um cientista de dados. É um desafio com duração de uma semana, proposto pela Alura.

7daysofcode alura data-cleaning data-science data-visualization forecasting machine-learning

Last synced: 10 May 2026

https://github.com/neelanjan-chakraborty/custoclarity

CUSTO CLARITY is a customer segmentation model built in Python. Using clustering on real retail datasets, it identifies 5 customer segments that unlocked strategic retail partnerships. Powered by scikit-learn, pandas, seaborn, and Matplotlib.

clustering-algorithm clustering-algorithms customer-analytics customer-segmentation data-visualization kmeans kmeans-clustering pandas python scikit-learn

Last synced: 11 May 2026

https://github.com/mtrebi/d3_cars

Cars Dataset Visualization (PCP) using d3.js

d3js data-visualization javascript

Last synced: 12 May 2026

https://github.com/mpolinowski/victory-data-chart

React.js components for modular charting and data visualization

chart css-grid-layout data-visualization react styled-components victory

Last synced: 13 May 2026

https://github.com/s-sutharsan-20/python-matplotlib-1

This repository contains matplotlib python programs

data-visualization matplotlib python

Last synced: 13 May 2026

https://github.com/vjo/d3-punchcard

D3 Punchcard chart 📊●•●

chart d3js data-visualization library punchcard visualization

Last synced: 13 Jun 2026

https://github.com/topfunky/learning-r-stats

Scripts and data while learning to use the R statistics and charting software program

data-visualization r statistics

Last synced: 16 Jun 2026

https://github.com/kirkalyn13/open-signal-report-generator

Script used to generate results/summary, including the trends of flagged provinces, from the raw excel data file,

data-analysis data-science data-visualization matplotlib numpy pandas python

Last synced: 19 Jun 2026

https://github.com/dorukalkan/3a-superstore-analysis

End-to-end data analysis, machine learning, and visualization project on a Turkish supermarket dataset

data-science data-visualization dbt machine-learning power-bi python sql

Last synced: 20 Jun 2026