An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/machinelearningzuu/data-engineering-projects

This repository is a curated collection of projects and tools that exemplify best practices in data engineering. It serves as a resource for data professionals seeking to enhance their data infrastructure, optimize data pipelines, and implement cutting-edge data processing techniques.

airflow bigquery data-engineering data-science data-visualization data-warehouse

Last synced: 30 Apr 2026

https://github.com/r-mahesh45/fraud-detection-and-sales-analysis-using-random-forest

This project uses Random Forest to classify fraud risk based on taxable income and analyze key factors driving high sales for a cloth manufacturing company.

classification data-visualization extract-transform-load python3 random-forest

Last synced: 30 Apr 2026

https://github.com/tuni56/ventas_streamlit

interactive sells and KPI's dashboard

dashboard data-visualization kpi python streamlit webapp

Last synced: 24 Apr 2026

https://github.com/dpb24/datakind-2025

📊 Data Analytics: Identifying Actionable Insights to Improve Financial Inclusion in Kenya

data-analytics data-visualization databricks datakind exploratory-data-analysis financial-data geopandas jupyter-notebook kenya matplotlib numpy python seaborn

Last synced: 24 Apr 2026

https://github.com/flazefy2/ds-mobilesdataset

https://www.kaggle.com/datasets/abdulmalik1518/mobiles-dataset-2025

csv data-visualization jupiter-notebook python

Last synced: 24 Apr 2026

https://github.com/abhinav330/instagram-influencers-analysis

This Jupyter Notebook focuses on preprocessing and visualizing data from an Instagram profiles dataset. It includes data loading, inspection, visualization, and some data preprocessing steps.

data data-science data-visualization exploratory-data-analysis exploratory-data-visualizations influncer-products instagram scikit-learn sklearn

Last synced: 08 Jun 2026

https://github.com/pejpero/neural_network_regression_and_classification

This repository contains neural network regression models built from scratch and using Keras for comparison. It visualizes training and testing performance, analyzing MSE, R², and decision boundaries. The project demonstrates learning techniques and optimization for regression tasks.

data-visualization keras machine-learning neural-network regression

Last synced: 09 Feb 2026

https://github.com/huacenxu/predict-loan-status

Using the Cross-Industry Standard Process of Data Mining (CRISP-DM), this project analyzes loan data from Prosper to identify key factors that predict loan status.

bootcamp-project data-science data-visualization data-w loan-prediction-analysis

Last synced: 25 Apr 2026

https://github.com/pyrypp/taxipoint_streamlit

The front-end for the taxi demand prediction service

data-visualization streamlit

Last synced: 24 Apr 2026

https://github.com/marielachirinosr/cyclistic-data-analytics-project

This project explores user behavior within a fictional bike-sharing system, modeled after Cyclistic, operating in Chicago.

data data-visualization pandas powerbi-report powerbi-visuals python

Last synced: 24 Apr 2026

https://github.com/mrdvince/co2-dashboard

A CO2 emissions dashboard visualization using d3.js. https://droid021.github.io/co2-dashboard/

d3 data-visualization

Last synced: 24 Apr 2026

https://github.com/ayushsiloiya619/data-science

Work area for data science project's.

analytics data-science data-visualization python

Last synced: 24 Apr 2026

https://github.com/pedrohdosanjos/economic-data-analysis

This project aims to analyze the export data from various states in the United States to Brazil over time. The data is sourced from the FRED (Federal Reserve Economic Data) API and processed to identify the top 5 exporting states for each year, as well as the states with the highest total export value across all years.

api data-analysis data-visualization jupyter-notebook python

Last synced: 24 Apr 2026

https://github.com/edgarhtt/uber_freight_data_analysis

Uber Freight interview homework. It consisted of solving a 2 warehouse problem and an ETL task

data-analysis data-science data-visualization python

Last synced: 30 Apr 2026

https://github.com/bachtiarashidiqy/ecommercedashboard

An interactive e-commerce analytics dashboard built with Streamlit, providing visualizations for sales performance, product analysis, geographic insights, and delivery status. Includes date filtering, company branding, and comprehensive documentation.

analytics dashboard data-analysis data-visualization e-commerce matplotlib pandas python seaborn streamlit

Last synced: 30 Apr 2026

https://github.com/itskshitija/tesla-stock-price-prediction

Welcome to the Tesla Stock Price Forecasting project, where we delve into time-series analysis to predict stock price trends for one of the world's most innovative companies—Tesla Inc.

data-visualization eda python time-series-analysis

Last synced: 29 Jun 2026

https://github.com/mehmetkahya0/gallstone_dataset_analysis_project

Safra Taşı Hastalığı (Gallstone-1) Veri Seti Analizi (https://archive.ics.uci.edu/dataset/1150/gallstone-1)

analysis analytics data data-analysis data-science data-visualization database graph matplotlib python

Last synced: 25 Apr 2026

https://github.com/chetanmalviya513/assembly-election-analysis-data-insights-engagement

Scraped and analyzed real-time election data to build interactive dashboards showcasing seat trends, vote share distribution, and postal ballot stats. The analysis uncovered insights on voting patterns, winning margins, and candidate forfeitures. Visual storytelling and timely data updates helped the project gain strong engagement on social media.

assembly data-visualization dataanalytics descriptive-statistics election-analysis election-data msexcel news tableau-dashboards webscraping

Last synced: 25 Apr 2026

https://github.com/stefagnone/airbnb-data-analysis

Data analysis and visualization of Airbnb listings using text mining frameworks, Tableau dashboards, and MongoDB to uncover business insights for optimizing strategies.

airbnb-data-analysis business-insights data-visualization mongodb r-programming sentiment-analysis tableau-dashboards text-mining

Last synced: 25 Apr 2026

https://github.com/fbarffmann/belly-button-challenge

Built an interactive JavaScript dashboard to visualize bacterial biodiversity from belly button samples. Analyzed data from 153 participants and identified OTU 1167 as the most common bacteria.

biodiversity dashboard data-analysis data-visualization interactive-charts javascript json plotly

Last synced: 25 Apr 2026

https://github.com/ammahmoudi/statisticallearning

Homework Solutions for Statistical Learning Course as Computer Science B.Sc. Student at Department of Mathematical Sciences, Sharif University of Technology

data-visualization feature-selection logistic-regression random-forest

Last synced: 26 Feb 2026

https://github.com/tmoulik/bikeshare-python

Analysis of Bikeshare data from three major cities

data-analysis data-visualization python udacity-nanodegree

Last synced: 25 Apr 2026

https://github.com/fernandesotero/project-data-exploration

Student Performance Prediction with Data Science

data-visualization jupyter-notebook python

Last synced: 30 Apr 2026

https://github.com/tanyakuznetsova/world-happiness-report-2023-in-europe

Happiness Insight '23: Navigating global joy. Exploring trust's role in life satisfaction with my World Happiness Report analysis.

citizen-science data-storytelling data-visualization global-indicators life-satisfaction social-trust world-happiness-report

Last synced: 25 Apr 2026

https://github.com/cagandemirmr/airbnb_available_houses

In this repo, i create dashboard using Tableau.In this process, i use SQL and Python languages.

dashboard data-visualization dataprocessing python sql tableau

Last synced: 30 Apr 2026

https://github.com/kruthiktr/crop-recommendation-system-using-machine-learning

A machine learning-based system recommending crops based on soil, climate, and environmental conditions to optimize agricultural yields.

ai-in-agriculture crop-recommendation data-visualization machine-learning prediction python python3 recommendation-system

Last synced: 25 Apr 2026

https://github.com/rickyarians/scrappy-do

Projek ini dikembangkan sebagai salah satu capstone project dari Algoritma Academy Data Analytics Specialization. Deliverables yang diharapkan dari projek ini adalah melakukan simple webscrapping untuk mendapatkan informasi.

beautifulsoup4 data-science data-visualization flask flask-application pandas webscraping

Last synced: 15 Apr 2026

https://github.com/arslan3x5/eda-journey

Exploratory data analysis (EDA) and visualization projects focusing on diverse datasets, including Bitcoin price trends and Indian restaurant reviews. Each notebook aims to provide insights and showcase data storytelling through visual exploration.

bitcoin data-science data-visualization eda

Last synced: 29 Jun 2026

https://github.com/gerhynes/d3-character-frequencies

A character frequency analyzer built using D3.js. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 25 Apr 2026

https://github.com/rijul007/smartwatch-data-analysis-using-python

Smartwatch Data Analysis to uncover insights into health and activity patterns using Python for data cleaning, exploratory analysis, and interactive visualizations.

data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas plotly python

Last synced: 30 Apr 2026

https://github.com/aniket965/crime-against-women-india

Some Data collected and visualisations of crime against women in India

crime-data data-visualization india women

Last synced: 06 Jun 2026

https://github.com/dikshadamahe/fossee-project

A hybrid web and desktop application for visualizing chemical equipment parameters, built with Django, React, and PyQt5.

chartjs chemical-engineering dashboard data-visualization django fossee fossee-2026 fossee-internship hybrid-application pyqt5 python react rest-api

Last synced: 25 Apr 2026

https://github.com/ddihora1604/iit_patna

A multifaceted project involving applying ML models like Ridge Classifier, RNN, RIDOR, Rotation Forest and RUSBoost, integrating SMOTE for class balancing, and handling diverse datasets including those for seating arrangement tasks.

data-analysis data-visualization datamodelling machine-learning-algorithms python

Last synced: 25 Apr 2026

https://github.com/fyonietz/infera

IDE For Data Science Or Data Analysis

cpp data-science data-visualization lightweight

Last synced: 25 Apr 2026

https://github.com/dulajkavinda/matplotlib-ml

📊Data visualisation with matplotlib library.

data-visualization jupyter-notebook matplotlib python seaborn

Last synced: 25 Apr 2026

https://github.com/novojitsaha/football-viz

Football Data Visualization using Statsbomb Open Data

data-visualization football-data frontend react typescript

Last synced: 25 Apr 2026

https://github.com/rayxiang03/indeed-job-scraping

Python toolkit for scraping Indeed job listings, preprocessing data, and generating visualizations for market analysis.

cloudscraper data-visualization indeed job-analysis nlp pandas python web-scraping

Last synced: 30 Apr 2026

https://github.com/waleedgeorgy/ml_sklearn

Implementation of various machine learning algorithms for regression and classification & feature engineering.

data-visualization jupyter-notebook machine-learning python

Last synced: 26 Apr 2026

https://github.com/dodji1/streamlit--bootcamp

Bootcamp de formation Streamlit - Initiation - Cas pratiques

data-science data-visualization python streamlit

Last synced: 26 Apr 2026

https://github.com/pedromarquetti/spotify-analyser

Data analyser for Spotify account usage

data-visualization python spotify

Last synced: 18 Mar 2026

https://github.com/mmartin46/county-health-findings-project

Analyze the data set given by United Health Group(UHG) to determine the impact on race, social and demographic factors on health, survival, and mortality.

analysis data-science data-visualization linear-regression machine-learning pandas

Last synced: 30 Apr 2026

https://github.com/odinleepro/airbnbnewyorkcityanalysis

AirbnbNewYorkCityAnalysis is a comprehensive data analysis and visualization project exploring short-term Airbnb rental trends across New York City (2008–2022). Using open source Airbnb data, the project combines data cleaning, statistical summaries, and Tableau dashboards to uncover pricing patterns, borough level distribution, and insights.

airbnb analytics-project data-analysis data-cleaning data-science data-visualization new-york-city real-estate-analytics tableau urban-analysis

Last synced: 27 Apr 2026

https://github.com/gerhynes/d3-histogram

A d3 histogram displaying UN data on worldwide births. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 27 Apr 2026

https://github.com/garcane/exodus_analysis

This project analyses cryptocurrency transaction data exported from the Exodus wallet. The goal is to explore and visualize the inflows and outflows of assets, the types of transactions, and other key metrics over time.

bitcoin btc crypto cryptocurrencies cryptocurrency data-analysis data-visualization eth ethereum pandas seaborn

Last synced: 27 Apr 2026

https://github.com/arda-guler/koerimei

KOERI Mapping Extension Interface. Maps latest earthquakes detected by Kandilli Observatory and Earthquake Research Institude.

data-visualisation data-visualization earthquake earthquake-visualization earthquakes geography map mapping

Last synced: 07 Jun 2026

https://github.com/caesaredia/food-app-user-behavior-analysis

Analyze user behavior and optimize app experience in a food-tech startup through funnel analysis and A/A/B testing. Includes data prep, visualization, and statistical testing in Python.

a-b-testing chi-square data-analysis data-visualization funnel-analysis python statistical-testing user-behavior

Last synced: 27 Apr 2026

https://github.com/benzerinsio/floralspecies-eda

📊 Análise Exploratória de Dados (EDA) - Flores Iris | Exploração de padrões e clustering com K-Means

analise-de-dados analise-exploratoria analise-exploratoria-de-dados botany clustering data-visualization eda exploratory-analysis exploratory-data-analysis python seaborn

Last synced: 27 Apr 2026

https://github.com/franckalbinet/teuvo

Self-Organising Map implemented as Literate Programming

data-visualization dimensionality-reduction neural-network

Last synced: 09 Feb 2026

https://github.com/the-clone-xyz/stats-lapas-pakam

Visualisasi data narapidana berdasarkan jenis kelamin di Lapas Lubuk Pakam menggunakan data BPS Deli Serdang secara otomatis via GitHub Actions.

bps-api data-visualization github-actions lubuk-pakam statistics

Last synced: 30 Apr 2026

https://github.com/bm777/kgraph

linear graph of kanda temperature and humidity data

data-visualization graph nextjs

Last synced: 28 Apr 2026

https://github.com/realvuk/r-for-data-science-by-vuk

My exercise from the book R for Data Science: Import, Tidy, Transform, Visualize, and Model Data 2nd Edition

data-science data-visualization r rstats

Last synced: 13 Jun 2026

https://github.com/shubham200137/spotify-listening-habits-analytics

Spotify Listening Habits Analytics is a project aimed at analyzing personalized Spotify listening habits and music trends. It involves Exploratory Data Analysis (EDA) with Python Pandas, data processing using SQL Server, and creating visualizations with Power BI. The goal is to uncover insights into listening patterns, track popularity, and artist.

data-analysis data-visualization exploratory-data-analysis jupyter-notebook pandas power-bi-dashboard sqlserver

Last synced: 18 Mar 2026

https://github.com/simranshaikh20/diwali-sales-analysis-for-business-insights

A data analyst project on diwali sales . In this state according state , gender, age we are able to know how much sale it done.

data-analysis data-visualization python

Last synced: 28 Apr 2026

https://github.com/ppatrzyk/foreign-tourists

Data visualization built with Svelte and d3.

d3 data-visualization poland svelte

Last synced: 28 Apr 2026

https://github.com/prajakta1321/credit-card-fraud-

credit card fraud detection using LR and data visualization in ML

data-visualization logistic-regression machine-learning outlier-detection python3

Last synced: 28 Apr 2026

https://github.com/gitchaell/computer-scrapping

Tool that extracts data from the pages of companies that sell computers in the city of Trujillo - Peru, exports them in an XLSX file according to a relational data model, and displays them on a Power BI dashboard.

data-analysis data-structures data-visualization database dbdiagram export-excel powerbi scrapper-script scrapping xlsx

Last synced: 01 May 2026

https://github.com/al-chris/whatsapp-dashboard-web

A client-side only web application for analyzing and visualizing WhatsApp chat exports. This version runs entirely in your browser without requiring any server or backend - your data never leaves your device!

data-visualization javascript whatsapp

Last synced: 28 Apr 2026

https://github.com/incalculable-driverslicence975/data-projects-portfolio

📊 Showcase data projects that highlight analytics, machine learning, and MLOps with reproducible code and clear business insights.

ai computer-vision dashboard data-science-projects data-visualization deep-learning etl excel finance hadoop hiveq keras machine-learning nlp pandas portfolio-project scikit-learn tableau-dashboards

Last synced: 28 Apr 2026

https://github.com/matheusafonseca/python-data-visualization-matplotlib-seaborn-masterclass-udemy

This repository is dedicated to storing the code developed during the "Python Data Visualization: Matplotlib & Seaborn Masterclass" course on Udemy.

charts data-analysis data-analysis-python data-science data-visualization database graphics graphics-programming jupyter-notebook matplotlib matplotlib-plots python python3 seaborn seaborn-plots

Last synced: 28 Apr 2026

https://github.com/buabaj/fortran-assignment

code repository for fortran and python climatology assignment.

big-data climatology data-analysis data-visualization fortran90 python

Last synced: 28 Apr 2026

https://github.com/priyanshubiswas-tech/e-commerce_data_analysis

Analyzes 9,994 e-commerce transactions to uncover insights on sales trends, customer behavior, profitability, and logistics using EDA and visualization. Identifies top products, customer segments, and shipping efficiencies to optimize marketing, inventory, and operations, making it valuable for retail, finance, and logistics.

data data-analysis data-visualization pandas pandas-dataframe plotly-analytics-projects plotly-express python

Last synced: 28 Apr 2026

https://github.com/benzerinsio/winequality-eda

📊 Análise Exploratória de Dados (EDA) - Vinhos Tintos | Exploração de características físico-químicas e sua relação com qualidade

analise-de-dados analise-exploratoria analise-exploratoria-de-dados data-visualization eda exploratory-analysis exploratory-data-analysis food-science python quality-control seaborn wine wine-quality

Last synced: 28 Apr 2026

https://github.com/falakrana/data-analysis-visualization

This repository showcases data analysis and visualization projects using Python and Tableau. It includes exploratory data analysis, interactive dashboards, and insightful visual stories derived from real-world datasets.

data-analysis data-visualization python tableau-public

Last synced: 01 May 2026

https://github.com/alexquilis1/news-sentiment-analyzer

A Flask web app that analyzes sentiment in news articles and generates word clouds to visualize emotional trends in current events

data-visualization flask natural-language-processing news-api nlp nltk python sentiment-analysis vader-sentiment wordcloud

Last synced: 19 May 2026

https://github.com/mayurasandakalum/ipl-data-engineering-spark-databricks

Comprehensive data engineering and analytics project using IPL dataset with Amazon S3, Apache Spark, Databricks, and SQL. Includes data storage, transformation, analysis, and visualization.

amazon-s3 apache-spark aws big-data cricket-analytics data-analytics data-engineering data-visualization databricks etl-pipeline ipl-dataset machine-learning python sql

Last synced: 09 Feb 2026

https://github.com/dariush-hassani/pfd-charts

A lightweight, animated and customizable charting library for building Primary Flight Display (PFD) using modular D3.js.

d3js data-visualization drone gcs pfd

Last synced: 08 Jun 2026

https://github.com/gimhanul/parking

Python Data analysis and visualization

data-visualization folium python

Last synced: 28 Jun 2026

https://github.com/emircanakyuzz/veri_gorsellestirilmesi_ve_analizi-analysis_and_visualization_of_dataset

Bu çalışmada numpy, pandas, seaborn ve matplotlib gibi veri biliminde çokca bilinen modülleri kullanarak analiz ve görselleştirme işlemleri gerçekleştirdim.

data-analysis data-science data-visualization jupyter-notebook python

Last synced: 29 Apr 2026

https://github.com/chrispsang/healthcare-dataanalysis

Analyze synthetic patient data to identify trends, improve healthcare delivery, and predict patient outcomes using machine learning models. Includes data exploration, preprocessing, model building, and visualizations.

data-analysis data-science data-visualization healthcare jupyter-notebook machine-learning python

Last synced: 29 Apr 2026

https://github.com/misha-mayskiy/lootbox_analytics

Lootbox Analytics: Your personal dashboard for tracking and analyzing lootbox/gacha opening statistics from popular games. Currently supports Genshin Impact with detailed Pity/luck analysis. (Python, Flask, SQLAlchemy)

chartjs data-visualization flask gacha game-analytics genshin-impact pity-tracker python sqlalchemy statistics

Last synced: 29 Apr 2026

https://github.com/tynandebold/daylight

Amount of daylight in select locations around the world.

data-visualization data-viz daylight javascript react time

Last synced: 29 Apr 2026

https://github.com/anilyigitsel/istanbul-rental-apartments-analysis

This project analyzes the Istanbul Rental Apartments Dataset (2025), which includes rental apartment listings from Istanbul, Turkey.

data-analysis data-visualization jupyter-notebook matplotlib pandas python rental-housing

Last synced: 29 Apr 2026

https://github.com/chauxvive/fccheatmap

A D3.js-driven heatmap visualizing monthly global land surface temperature variations over time. Built as part of FreeCodeCamp’s Data Visualization certification.

d3 d3js data-visualization dataviz

Last synced: 29 Jun 2026

https://github.com/yuvrajsaraogi/sales-prediction-using-python

Sales prediction involves estimating future product sales based on factors like advertising spend, target audience, and platform. Businesses rely on data scientists to forecast sales and optimize advertising costs. Machine learning in Python can be used for this task.

data data-analysis data-science data-visualization machine-learning matplotlib natural-language-processing numpy pandas prediction python sales-prediction-using-python sql

Last synced: 19 Apr 2026

https://github.com/smahala02/materials-science-introduction

Introduction to Materials Science concepts using Python for array manipulation and visualization with NumPy and Matplotlib.

data-visualization materials-science matplotlib numpy python scientific-computing

Last synced: 09 Feb 2026

https://github.com/sanam2405/ahs

This contains the analysis of result of AHS Madhyamik Examination 2022

data-analysis data-visualization jupyter-notebook python

Last synced: 18 Apr 2026

https://github.com/prangonghose/wikipedia-blocking-policies

This study investigates the relationship between editors’ disruptive behavior and regulation policies on English Wikipedia, focusing on the Blocking Policy page. The study collects and analyzes data from 2004 to 2022 using the Wikipedia API, page statistics, and keyword extraction.

data-analysis data-visualization matplotlib open-source pandas python3 seaborn

Last synced: 18 Apr 2026

https://github.com/sevilaymuni/project-no.3-seaborn-plots

Pandas and Seaborn Mediated Comprehensive Analysis on Differentiated Thyroid Cancer

data-analysis data-structures data-visualization mathplotlib pandas python seaborn

Last synced: 18 Apr 2026

https://github.com/gattsu001/telecom-churn-predictor

Predicts which telecom customers are likely to churn with 95% accuracy using engineered features from usage, billing, and support data. Implements Sturges-based binning, one-hot encoding, stratified 80/20 train-test split, and a two-level ensemble pipeline with soft voting. Achieves 94.60% accuracy, 0.8968 AUC, 0.8675 precision, 0.7423 recall.

churn-prediction classification classification-algorithm customer-retention data-science data-visualization feature-engineering joblib jupyter-notebook machine-learning pandas scikit-learn supervised-learning svm

Last synced: 18 Apr 2026

https://github.com/vishal8shah/au-jobs

Interactive treemap of 358 Australian occupations — explore AI exposure, pay, skill level & shortage status across 14.4M workers. Inspired by karpathy/jobs.

ai anzsco australia data-visualization gemini jobs open-source treemap

Last synced: 04 Jun 2026

https://github.com/awanraskall/retail-demand-analysis

Data analysis of retail meal orders, fulfillment centers, and product demand using Python

data-analysis data-visualization jupyter-notebook numpy pandas python

Last synced: 18 Apr 2026