An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/abhinav330/customer-behavior-analysis-linear-regression

This repository explores customer behavior data for an NYC clothing company with both a mobile app and website. They want to understand which platform drives higher sales.

data-analysis data-science data-visualization eda exploratory-data-analysis jupyter jupyter-notebook linear-regression machine-learning machine-learning-algorithms machinelearning-python numpy pandas python regression-analysis

Last synced: 06 May 2026

https://github.com/arsh-jafri/econostats

A real-time economic data visualization platform that helps track and analyze key economic indicators through interactive charts, custom datasets, and FRED API integration.

aws data-visualization economic-data economics elasticbeanstalk federal-reserve flask fred-api

Last synced: 06 May 2026

https://github.com/verinverdian/smart-factory

Smart Factory Dashboard – A web-based factory management dashboard to monitor employees, inventories, and productions with real-time data visualization.

admin-dashboard bootstrap dashboard data-visualization factory-management laravel manufacturing php production-management smart-factory

Last synced: 06 May 2026

https://github.com/nicovandenhooff/wids-datathon-2022

This repository contains solution for the 2022 Women in Data Science Kaggle competition that I participated in, which obtained a top 10% leaderboard standing.

catboost data-visualization datascience energy-consumption ensemble-learning exploratory-data-analysis kaggle lightgbm machine-learning scikit-learn women-in-data-science xgboost

Last synced: 07 May 2026

https://github.com/ddihora1604/advanced_business_analytics_on_world_bank_global_financial_inclusion_data_2021

Bridging the Gaps in Financial Inclusion: Understanding the Cash-Credit Paradox, Divide between Cash and Digital Payments, and Financial Resilience.

advanced-excel business-analytics data-analysis data-engineering data-mining data-visualization database exploratory-data-analysis machine-learning preprocessing-data python

Last synced: 07 May 2026

https://github.com/muthukumar0908/-singapore-resale-flat-prices-predicting

This project is to develop a machine learning model and deploy it as a user-friendly web application that predicts the resale prices of flats in Singapore.

data-analysis data-visualization mechine-learing plotly python streamlit

Last synced: 07 May 2026

https://github.com/sammdu/global-warming-hurricane-typhoon

The Effect of Global Warming on Hurricane and Typhoon Occurrence

data-science data-visualization global-warming hurricane-data

Last synced: 07 May 2026

https://github.com/moiri-gamboni/vintedcalculator

📊 Smart pricing calculator for Vinted sellers - optimize your listings based on storage, seasonality, and market dynamics

clothing-resale data-visualization e-commerce inventory-management pricing-calculator react recharts sales-optimization seasonal-pricing shadcn-ui tailwindcss typescript vinted

Last synced: 08 May 2026

https://github.com/tsear/reddit-discourse-project

Mapping emotional and conceptual discourse across Reddit philosophy communities.

data-visualization emotion-detection network-analysis nlp pandas reddit-api sentiment-analysis spacy text-mining tf-idf topic-modeling

Last synced: 08 May 2026

https://github.com/sweta-kaundilya/adventureworks-cycles-powerbi-project

This project was completed to simulate real-world tasks that data professionals encounter every day on the job.

dashboarddesign data-visualization datamodeling dataprep dax exploratory-data-analysis powerbi powerquery

Last synced: 08 Mar 2026

https://github.com/msukmanowsky/datadrawer

A handy little utility for when you want some time series data to prototype and don't want to write code.

d3 data-visualization prototype prototyping vue vuejs

Last synced: 09 May 2026

https://github.com/abhroroy365/market_analysis

This project explores customer segmentation and market analysis in the context of online retail using an online retail dataset. By applying advanced analytics, we aim to uncover insights that can drive strategic decisions and enhance business performance.

clustering data data-analysis data-visualization kmeans-clustering machine-learning market-analysis python silhouette-analysis

Last synced: 09 May 2026

https://github.com/zxjahid/matplotlib

A comprehensive guide to mastering data visualization with Matplotlib through hands-on examples and advanced techniques. 🚀📊

candlestick candlestick-chart cheatsheet data-analysis data-visualization gtk jupyter-notebook maps matplotlib-python pandas thesis-template tk tutorial wx

Last synced: 09 May 2026

https://github.com/nahiyanhkhan/data-processing-and-visualization

Loan Data Processing using Python's numpy and pandas libraries. For data visualization, matplotlib and seaborn are used.

data-analysis-python data-visualization matplotlib numpy pandas seaborn

Last synced: 09 May 2026

https://github.com/piras-s/braincancerclassifier

Classifying brain tumors using Gaussian Naive Bayes with MRI-derived features. Includes feature selection, model evaluation, prediction uncertainty, and probability calibration.

baysian-inference calibrated-classification classification data-visualization feature-selection machine-learning medical-imaging naive-bayes-classifier python scikit-learn uncertainty-estimation

Last synced: 09 May 2026

https://github.com/mituskillologies/ds-ref-cdac-aug24

Program of refresher course on Data Science conducted for CDAC officials at CDAC Headquarters, Pune in August 2024.

data-science data-visualization machine-learning mysql python-programming r-programming sql

Last synced: 10 May 2026

https://github.com/timjjting/escaping-flatland

A demo of optimization techniques for plotting large datasets in a 3D space using Three.js + Svelte integration

big-data data-visualization glsl-shaders lod octree svelte sveltekit three-js

Last synced: 11 May 2026

https://github.com/ericgio/r-d3

Low-level components and examples for rendering data with D3 + React

d3 data-visualization react

Last synced: 11 May 2026

https://github.com/hrosicka/czechpopulationestimation

This GitHub repository contains Python code for data analysis and population prediction in the Czech Republic up to the year 2050. The code is written in Python and utilizes the Pandas and Matplotlib libraries.

data-analysis data-visualization matplotlib matplotlib-figures matplotlib-pyplot pandas pandas-dataframe pandas-library pandas-python python python3

Last synced: 11 May 2026

https://github.com/dannykyungh/data-analytics-portfolio

This is a repository that I have created to showcase skills, share projects and track my progress in Data Analytics / Data Science related topics.

advanced-excel data-cleaning data-modeling data-visualization data-warehousing google-sheets looker-studio python r sql tableau

Last synced: 12 May 2026

https://github.com/magnus0969/gdp-analysis

An in-depth exploration of global GDP trends using Python and data science techniques. This project involves data preprocessing, exploratory data analysis (EDA), statistical insights, and interactive visualizations to understand economic patterns and correlations.

data-science data-visualization gdp-analysis plotly python3

Last synced: 12 May 2026

https://github.com/sricasea/fundraising-insights-mwpccc

Data storytelling meets impact strategy — a nonprofit fundraising analysis project combining SQL, Python, and Deepnote to uncover donor trends and guide smarter decisions.

data-analysis data-storytelling data-visualization deepnote fundraising nonprofit portfolio-project python sql

Last synced: 12 May 2026

https://github.com/travishorn/svplot

Reusable Svelte 5 action for rendering Observable Plot charts.

charts data-visualization observable-plot svelte

Last synced: 13 May 2026

https://github.com/manukot/sturdy-engine-python-

I've leant not only various Theoretical Concepts but also practical projects in my Masters Coursework

data-analysis data-visualization python3

Last synced: 13 May 2026

https://github.com/kinolag/traffic

A geospatial visualisation app showing road traffic information for all areas of Inner London. Built in TypeScript, combining React with D3.

d3 data-visualization geojson geospatial-visualization mapping react responsive-design svg topojson typescript

Last synced: 13 May 2026

https://github.com/chauxvive/fccbarchart

An interactive D3.js bar chart visualizing US GDP growth over time. Built as part of FreeCodeCamp’s Data Visualization certification.

d3 d3js data-visualization dataviz

Last synced: 13 May 2026

https://github.com/alexgenovese/react-charts-covid-19-data

Examples on COVID-19 data using different library charts: G2, G2Plot, Plotly, ApexCharts

data-analysis data-science data-visualization react reactjs

Last synced: 13 May 2026

https://github.com/estnafinema0/housing-price-analysis

Predicting housing prices with regression models and visual analytics. Includes preprocessing, custom pipelines, and visualized performance metrics.

custom-models data-preprocessing data-visualization eda exploratory-data-analysis housing-prices jupyter-notebook machine-learning pipelines regression-models seaborn sklearn

Last synced: 13 May 2026

https://github.com/magnus0969/heart-diease-eda

Exploratory Data Analysis (EDA) on heart disease data to uncover key risk factors and patterns. This project utilizes Python, Pandas, Seaborn, and Matplotlib to visualize trends, correlations, and insights that contribute to heart disease prediction and prevention.

data-insights data-science-projects data-visualization heart-disease-analysis python

Last synced: 14 May 2026

https://github.com/mathyouf/kaggle-notebook-code

Code and Images which I used in Kaggle Notebooks. Mostly for style and code clarity.

data-visualization kaggle

Last synced: 14 May 2026

https://github.com/yashsingh43/cdc-sleep-duration-health-analysis

Analysis of CDC BRFSS 2022 data exploring how sleep duration relates to mental and physical health outcomes.

beautifulsoup brfss cdc data-analysis data-visualization matplotlib pandas plotly public-health python

Last synced: 11 Jun 2026

https://github.com/mohamedmetwalli5/breastcancerdiagnosis

Breast cancer diagnosis using machine learning via the XGBoost Algorithm after visualizing the data set & exploring it.

cancer data-visualization machine-learning

Last synced: 11 Jun 2026

https://github.com/mogalina/graph-rank-dynamics

Computational engine for analyzing how importance propagates through directed graphs.

data-visualization education graph-theory page-rank statistics

Last synced: 12 Jun 2026

https://github.com/danielvartan/estela

🧒🏽🍎 Analysis and Visualization of Food Consumption Data for Brazilian Children Aged 2 to 4, as Monitored by SISVAN in 2019

brazil child-health child-nutrition data-science data-visualization malnutrition nutritional-epidemiology sisvan sus

Last synced: 12 Jun 2026

https://github.com/shashwat9kumar/trends_in_a_country_on_twitter

Finding trending topics in each country on twitter and visualizing them in a WordCloud

data data-visualization trends tweepy twitter-api wordcloud

Last synced: 13 Jun 2026

https://github.com/stephenombuya/automation_scripts

A collection of Python scripts and tools designed to automate various tasks, improve productivity, and simplify repetitive actions. Each script is well-documented and serves a specific purpose, ranging from data visualization to smart home control.

automation-with-python data-visualization productivity python3 smart-home-automation webautomation

Last synced: 13 Jun 2026

https://github.com/sayamalt/fake-news-classification-using-fine-tuned-bert

Successfully developed a text classification model to predict whether a given news text is fake or not by fine-tuning a pretrained BERT transformed model imported from Hugging Face.

bert-embeddings bert-model data-analysis data-visualization deep-learning fine-tuning-bert model-evaluation model-training-and-evaluation text-classification text-preprocessing text-tokenization tokenizer-nlp wordcloud-visualization

Last synced: 05 Apr 2025

https://github.com/jatinnxn/diabetes-prediction

this repository showcases a machine learning model built to predict diabetes using Diabetes dataset. The project walks through data preprocessing, model training, and evaluation, offering a Decision Tree-based solution to classify individuals as diabetic or non-diabetic based on various health metrics. It also supports real-time predictions.

data-cleaning data-preprocessing data-visualization decision-tree-classifier machine-learning

Last synced: 13 Jun 2026

https://github.com/april-jk/stoke-your-code

Trading-terminal style Git history viewer that turns repository activity into candlestick charts and volume bars.

analytics candlestick-chart data-visualization developer-tools git-history git-visualization github lightweight-charts react typescript

Last synced: 15 Jun 2026

https://github.com/marielachirinosr/nyc-taxi-trip-exploration-2019-2020

Explores passenger behavior & impact of COVID-19 on NYC taxi industry (Q1 2019-2020).

bigquery data data-analysis data-visualization python sql tableau

Last synced: 15 Jun 2026

https://github.com/claudiahw/excel-sales-dashboard

Data-driven Excel dashboard visualizing sales trends, top products, and profit breakdowns with dynamic filtering options.

dashboard data-visualization excel excel-dashboard pivot-tables

Last synced: 15 Jun 2026

https://github.com/bpazy/my_running_page

Make your own running home page

codoon data-visualization garmin gpx keep nike strava

Last synced: 17 Jun 2026

https://github.com/hanifheinrich/population-data-visualization

Implementasi Visualisai Data pada Data Kependudukan Nagari Tanjung Balik, Kabupaten Solok, Sumatera Barat Menggunakan Streamlit

data-visualization python streamlit-dashboard

Last synced: 16 Jun 2026

https://github.com/joonarafael/ids-exercises

Repository to store the exercise submissions for the Introduction to Data Science course (University of Helsinki).

course-work data-science data-visualization jupyter-notebook university-assignment

Last synced: 16 Jun 2026

https://github.com/leftcoastnerdgirl/webscraping_and_beautifulsoup

This project uses Beautiful Soup to create scrap data from a news website.

beautifulsoup data-visualization jupyter-notebook splinter webscraping

Last synced: 17 Jun 2026

https://github.com/mattsebastianh/Making-a-Visual-Argument

Data Visualization with Matplotlib | Making a Visual Argument in Matplotlib

data-visualization matplotlib python

Last synced: 18 Jun 2026

https://github.com/mattsebastianh/Make-a-Line-Chart

Data Visualization with Matplotlib | Matplotlib Fundamentals

data-visualization matplotlib pandas-dataframe python

Last synced: 18 Jun 2026

https://github.com/philippmeder/visdata

Useful python tools for data visualisation, e.g. 2D-profiles (also known as profile plots), comparison of measurements, or tables.

data-science data-visualization profile-plot python python-3 python3

Last synced: 18 Jun 2026

https://github.com/rb-thompson/machine-learning-basics

Implement a machine learning pipeline on the Iris flower dataset.

data-preprocessing data-visualization model-training python scikit-learn

Last synced: 18 Jun 2026

https://github.com/aman-codde/credit-card-analytics

A full-stack dashboard for credit card users to analyze spending, track rewards, and download statements securely.

analytics dashboard data-visualization express fullstack jwt-authentication mongodb nodejs react recharts tailwindcss

Last synced: 09 Apr 2026

https://github.com/ianjure/average-precipitation-map

A 3D data visualization of average precipitation using R.

data-visualization philippines r

Last synced: 16 Aug 2025

https://github.com/ashraf-khabar/bank-marketing-data-analysis

This project is focused on analyzing bank marketing data using PyTorch, pandas, numpy, and scikit-learn. The goal is to build a predictive model that can help identify potential customers who are more likely to subscribe to a bank's term deposit.

data-cleaning data-science data-visualization dataset deep-learning deep-neural-networks feedforward-neural-network learning neural-networks numpy pandas python pytorch sklearn

Last synced: 09 Apr 2026

https://github.com/supsi-deass-cpps/multilingual_thematic_analysis

Modular R pipeline for multilingual survey analysis — translate, embed, cluster, and visualize open-ended responses using Google Cloud and tidyverse tools.

clustering data-visualization linguistics multilingual-analysis natural-language-processing qualitative-research r reproducible-research social-science survey-data text-mining thematic-analysis translation

Last synced: 04 Oct 2025

https://github.com/kplanisphere/analysis-of-political-texts

Analysis and Classification System of Political Texts Using Natural Language Processing - Final Project for the Information Retrieval Course

classification-model data-science data-visualization deep-learning information-retrieval machine-learning natural-language-processing neural-network nlp nlp-machine-learning

Last synced: 05 Oct 2025

https://github.com/allanotieno254/powerbi-sales-dashboard

This repository provides a step-by-step guide to building a Power BI sales dashboard. It includes sample data, DAX measures, and visual examples

data-analytics data-visualization dax-expression power-bi sales-analysis

Last synced: 03 Jan 2026

https://github.com/saravanansuriya/phonepe-pulse-data-visualization-and-exploration

Creating a dashboard by using streamlit application. In this app visualizing the data taken from Phonepe pulse Github repository.

data-visualization github-cloning mysql-database pandas-dataframe plotly-express python streamlit-webapp

Last synced: 10 Apr 2026

https://github.com/izhaan0/predict-marks-based-on-study-hours

Student Marks Predictor is a machine learning project that predicts a student’s exam scores based on the number of study hours. It uses Linear Regression to learn the relationship between study hours and marks, and provides both command-line and interactive Streamlit web interfaces for prediction and visualization.

data-visualization joblib jupyter-notebook machine-learning machine-learning-algorithms matplotlib-pyplot numpy pandas pandas-dataframe pickle python scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/hanannazri/predictive-customer-churn-modelling-with-price-sensitivity-insights-

As a part of BCG Data Science Project, I developed a predictive churn-risk model for XYZ energy utility by engineering price, consumption and contract features and training XGBoost and Random Forest models to identify customers most likely to churn at current prices.

bcgx data-visualization exploratory-data-analysis feature-engineering jupyter jupyter-notebook model-training-and-evaluation numpy pandas random-forest scikit-learn xgboost

Last synced: 19 Aug 2025

https://github.com/palakjainanalyst/ecommerce-customer-spending-analysis

An end-to-end Ecommerce analytics project uncovering customer spending trends using Excel, Python, SQL, and Power BI. From raw data to interactive dashboards, this project delivers deep insights on spending patterns, high-value customer segments - showcasing a complete data-to-decisions workflow.

data-analysis data-visualization database ecommerce excel jupyter-notebook powerbi python spending sql

Last synced: 06 May 2026

https://github.com/alvitachen/breathe-retrospective

A dashboard that visualizes AQI and PM job openings across target cities.

aqi beginner charts data-visualization personal-project react vite

Last synced: 10 Apr 2026

https://github.com/raghul-m/stock-price

Simple Stock Price App Using Streamlit and Yfinance

data-science data-visualization streamlit-webapp yfinance-library

Last synced: 04 Oct 2025

https://github.com/kaoutarmi/analyse-des-ventes-pour-optimiser-la-performance

Analyse des données de ventes pour identifier des opportunités d'amélioration des performances commerciales. Utilisation de Pandas pour le traitement des données, et Matplotlib/Seaborn pour la visualisation des tendances et des résultats.

business-intelligence data-analysis data-visualization jupyter-notebook matplotlib pandas sales-optimization seaborn

Last synced: 20 Aug 2025

https://github.com/drod75/nyc-arrests-analysis

This is a simple Data Science Project made to analyze and display data and trends found within the NYC Arrests Year to Date Dataset.

data-analysis data-visualization folium jupyter-notebook matplotlib-pyplot nyc-opendata nypd python scikit-learn seaborn

Last synced: 04 May 2026

https://github.com/mukeshlilawat1/netflix-data-visualization

Netflix Data Visualization – This project explores the Netflix dataset using Pandas for data manipulation and Matplotlib for creating meaningful visualizations. It highlights trends in movies and TV shows, distribution by release year, ratings, duration, and categories, making the data easy to understand through graphical insights.

data-visualization matplotlib pandas pip python

Last synced: 09 Apr 2026

https://github.com/itskshitija/tesla-stock-price-prediction

Welcome to the Tesla Stock Price Forecasting project, where we delve into time-series analysis to predict stock price trends for one of the world's most innovative companies—Tesla Inc.

data-visualization eda python time-series-analysis

Last synced: 12 Aug 2025

https://github.com/fernandogomesfg/sabores-aromas-analytics

Projecto Sabores & Aromas: um dashboard interativo desenvolvido no Power BI, focado em insights de vendas, desempenho por equipe e análise de rentabilidade para optimizar decisões estratégicas.

analise-de-dados data-science data-visualization dataanalytics powerbi storytelling-with-data vendas

Last synced: 13 Feb 2026

https://github.com/anuragagarwal96/hospital-mortality-rate-sql-analysis

In this project, I have taken a hospital dataset from Kaggle, analysed it and predicted the mortality rate of patients who have been admitted in hospitals. I have utilised a combination of SQL, Tableau and Microsoft Excel for this project.

data data-visualization dataanalysis dataanalysisusingsql excel msexcel mssqlserver sql tableau tableau-public

Last synced: 09 Mar 2026

https://github.com/ashish-kr-srivastava/olympic-games-eda---python

About Exploratory Data Analysis of a Historical Olympic Games Dataset, including all the games from Athens 1896 to Rio 2016.

data-visualization datacleaning eda matpotlib numpy pandas python seaborn seaborn-python

Last synced: 09 Apr 2026

https://github.com/rosanafss/r-ladies-bh-workshop-metricas

Como Plotar Métricas e Entregar Valor para Times Ágeis

data-analysis data-visualization r

Last synced: 25 Aug 2025

https://github.com/emilyjspencer/data-visualizations

📊 Creating data visualisations with Chart.js

chartjs data-visualization

Last synced: 27 Aug 2025

https://github.com/davepeck/unsheltered

Experimental/toy project to visualize public data related to unsheltered homelessness in Seattle.

data-visualization homelessness seattle

Last synced: 28 Aug 2025

https://github.com/rajkumargara/bike_rental_data_analysis

Chicago bike rental data analysis for business insights using R programming

data-analysis data-visualization data-wrangling large-dataset machine-learning-algorithms

Last synced: 11 Aug 2025

https://github.com/bayunova28/capital_bikeshare

This repository contains about data analysis case project from capital bikeshare

capital-bikeshare data-analytics data-science data-visualization sql-server

Last synced: 30 Aug 2025

https://github.com/yatharthkumarsaxena/cdac-noida-internship-network-traffic-analysis

Real-time network packet capture and analysis using Moloch (Arkime), Wireshark, and Elastic Stack to detect anomalies, visualize patterns, and enhance cybersecurity.

arkime dashboard data-visualization elasticsearch kibana logstash network-traffic-analysis pcap-files security-information-and-event-management threat-identification ubutnu wireshark

Last synced: 07 Sep 2025