An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/datalopes1/bank_marketing

Este projeto será baseado no Dataset Bank Marketing encontrado na UC Irvine - Machine Learning Repository e disponibilizado por S. Moro, R. Laureano e P. Cortez

data-analysis data-science data-visualization eda python

Last synced: 24 Apr 2026

https://github.com/voidnire/redditviralmysteryposts

Análise de posts de subreddits de mistério. O que define um post viral neste tipo de sub?

data-analysis data-visualization mysteries mystery nlms python-3 reddit

Last synced: 24 Apr 2026

https://github.com/avnigoyal25/ipl_eda

Exploratory Data Analysis on IPL datasets

data-visualization eda python

Last synced: 24 Apr 2026

https://github.com/manisharora96/data-analysis-of-smartwatch

The project is structured with sample data, step-by-step Jupyter notebooks, and modular Python scripts for automated analysis

data-analysis data-visualization jupyter-notebook python smartwatch-analysis

Last synced: 24 Apr 2026

https://github.com/tuni56/ventas_streamlit

interactive sells and KPI's dashboard

dashboard data-visualization kpi python streamlit webapp

Last synced: 24 Apr 2026

https://github.com/dpb24/datakind-2025

📊 Data Analytics: Identifying Actionable Insights to Improve Financial Inclusion in Kenya

data-analytics data-visualization databricks datakind exploratory-data-analysis financial-data geopandas jupyter-notebook kenya matplotlib numpy python seaborn

Last synced: 24 Apr 2026

https://github.com/huacenxu/predict-loan-status

Using the Cross-Industry Standard Process of Data Mining (CRISP-DM), this project analyzes loan data from Prosper to identify key factors that predict loan status.

bootcamp-project data-science data-visualization data-w loan-prediction-analysis

Last synced: 25 Apr 2026

https://github.com/pyrypp/taxipoint_streamlit

The front-end for the taxi demand prediction service

data-visualization streamlit

Last synced: 24 Apr 2026

https://github.com/mrdvince/co2-dashboard

A CO2 emissions dashboard visualization using d3.js. https://droid021.github.io/co2-dashboard/

d3 data-visualization

Last synced: 24 Apr 2026

https://github.com/pedrohdosanjos/economic-data-analysis

This project aims to analyze the export data from various states in the United States to Brazil over time. The data is sourced from the FRED (Federal Reserve Economic Data) API and processed to identify the top 5 exporting states for each year, as well as the states with the highest total export value across all years.

api data-analysis data-visualization jupyter-notebook python

Last synced: 24 Apr 2026

https://github.com/mehmetkahya0/gallstone_dataset_analysis_project

Safra Taşı Hastalığı (Gallstone-1) Veri Seti Analizi (https://archive.ics.uci.edu/dataset/1150/gallstone-1)

analysis analytics data data-analysis data-science data-visualization database graph matplotlib python

Last synced: 25 Apr 2026

https://github.com/fbarffmann/belly-button-challenge

Built an interactive JavaScript dashboard to visualize bacterial biodiversity from belly button samples. Analyzed data from 153 participants and identified OTU 1167 as the most common bacteria.

biodiversity dashboard data-analysis data-visualization interactive-charts javascript json plotly

Last synced: 25 Apr 2026

https://github.com/tmoulik/bikeshare-python

Analysis of Bikeshare data from three major cities

data-analysis data-visualization python udacity-nanodegree

Last synced: 25 Apr 2026

https://github.com/tanyakuznetsova/world-happiness-report-2023-in-europe

Happiness Insight '23: Navigating global joy. Exploring trust's role in life satisfaction with my World Happiness Report analysis.

citizen-science data-storytelling data-visualization global-indicators life-satisfaction social-trust world-happiness-report

Last synced: 25 Apr 2026

https://github.com/kruthiktr/crop-recommendation-system-using-machine-learning

A machine learning-based system recommending crops based on soil, climate, and environmental conditions to optimize agricultural yields.

ai-in-agriculture crop-recommendation data-visualization machine-learning prediction python python3 recommendation-system

Last synced: 25 Apr 2026

https://github.com/ddihora1604/iit_patna

A multifaceted project involving applying ML models like Ridge Classifier, RNN, RIDOR, Rotation Forest and RUSBoost, integrating SMOTE for class balancing, and handling diverse datasets including those for seating arrangement tasks.

data-analysis data-visualization datamodelling machine-learning-algorithms python

Last synced: 25 Apr 2026

https://github.com/leandrocollares/ei-beneficiaries-in-canada

A responsive line chart showing regular Employment Insurance beneficiaries in Canada and its provinces and territories between 2019 and 2021

d3 data-visualization svelte

Last synced: 25 Apr 2026

https://github.com/dingaaling/webcam-mirror

Use a webcam as a mirror to view your NYC/FB data identities

data-identities data-visualization facial-detection facial-keypoints flask-application opencv

Last synced: 25 Apr 2026

https://github.com/dulajkavinda/matplotlib-ml

📊Data visualisation with matplotlib library.

data-visualization jupyter-notebook matplotlib python seaborn

Last synced: 25 Apr 2026

https://github.com/lasyakonduru/web-app-for-sentiment-analyzer-a-comprehensive-tool-for-analyzing-text-and-dataset-sentiments

A powerful and user-friendly tool for analyzing sentiments in text and datasets. This app leverages advanced sentiment analysis techniques to provide real-time insights, helping users classify text as Positive, Negative, or Neutral, and visualize sentiment trends for better decision-making.

data-visualization machine-learning natural-language-processing python sentiment-analysis streamlit vader-sentiment-analysis webapp

Last synced: 26 Apr 2026

https://github.com/syncfusionexamples/building-a-real-time-ecg-monitoring-dashboard-with-syncfusion-wpf-charts

Learn how to build a real-time ECG monitoring dashboard using Syncfusion WPF Charts. Explore the features of Syncfusion charts to create interactive and real-time data visualization for ECG monitoring.

chart-features data-visualization ecg-monitoring export-feature fast-line fast-line-chart health-tech interactive-charts medical-data-visualization real-time-dashboard real-time-updates syncfusion-charts syncfusion-controls wpf wpf-development

Last synced: 27 Apr 2026

https://github.com/tsbarr/citi-bikes-challenge

Citibikes NYC Data Analysis: Uncover insights from over a decade of ride data. Jupyter notebook for data aggregation/cleaning & Tableau dashboards for interactive visualization.

data data-visualization pandas-python python tableau

Last synced: 27 Apr 2026

https://github.com/zonggen/uiuc-cs416-a2

Data visualization assignment with D3.js

d3 data-visualization

Last synced: 27 Apr 2026

https://github.com/tostaylo/bicycle_death_data_visualization

Visualizing bicycle fatality data for the United States

beautifulsoup d3 data-visualization

Last synced: 27 Apr 2026

https://github.com/afinemax/climate_change_bot

@ClimateChangeBot is a BlueSky bot that posts daily Climate-Change plots

climate-change data-visualization global-warming mastodon-bot

Last synced: 27 Apr 2026

https://github.com/gabrieldiem/data_visualization_lifespan_wealth

Little python script that shows a data visualization of life span and wealth worldwide

data-visualization pandas plotly python script

Last synced: 27 Apr 2026

https://github.com/benzerinsio/floralspecies-eda

📊 Análise Exploratória de Dados (EDA) - Flores Iris | Exploração de padrões e clustering com K-Means

analise-de-dados analise-exploratoria analise-exploratoria-de-dados botany clustering data-visualization eda exploratory-analysis exploratory-data-analysis python seaborn

Last synced: 27 Apr 2026

https://github.com/sungj921028/data-analysis-for-aqi

A project that using python to analysis the AQI quality.

aqi data-science data-visualization jupyter-notebook

Last synced: 07 Jun 2026

https://github.com/bm777/kgraph

linear graph of kanda temperature and humidity data

data-visualization graph nextjs

Last synced: 28 Apr 2026

https://github.com/oguzhanfatihkucuk/data-analytics-project-kafka-spark

The data in this project was collected in a database using Apache Kafka and processed with Apache Spark Streaming. The project aims to create a forecasting model and analyze sales forecasts per customer.

big-data data data-visualization hadoop kafka ml mlpipeline plt pyhton spark

Last synced: 28 Apr 2026

https://github.com/jgohel9902/comprehensive-healthcare-analytics

An end-to-end healthcare analytics project integrating SQL, Python, and Power BI to analyze patient data, billing information, and doctor performance. This project showcases skills in data cleaning, advanced querying, visualization, and comprehensive insights generation to support data-driven decision-making in the healthcare industry.

data-visualization pandas powerbi python pythonfordatascience sql

Last synced: 28 Apr 2026

https://github.com/stefagnone/movies-dataset-analysis-project

Comprehensive analysis of the Movies dataset, exploring genre trends, comparisons, and qualitative insights using Python, Pandas, and visualizations. Designed to uncover actionable findings for stakeholders.

data-analysis data-visualization exploratory-data-analysis matplotlib movies-analysis pandas python seaborn storytelling-with-data

Last synced: 28 Apr 2026

https://github.com/bhaveshbhakta/parkinson-disease-prediction

Note* The hosted website link might take some time to load. Please be patient while the application initializes.

data-visualization flask health-prediction machine-learning parkinson-disease prediction web-development

Last synced: 28 Apr 2026

https://github.com/hadson0/chess-live-ratings-data

A study project focused on web scraping the live chess ratings from chess.com, with data analysis and visualization on nearly 5000 players in the classical world ranking.

beautifulsoup chess data-analysis data-visualization numpy pandas python seaborn web-scraping

Last synced: 28 Apr 2026

https://github.com/jeus0522/4-eda-football-ml-app

A ML application focused on exploratory data analysis and football analytics, featuring data visualization and insights using Python and relevant libraries.

data-visualization eda exploratory-data-analysis football-analytics sports-analytics webapp

Last synced: 28 Apr 2026

https://github.com/leotrja/my-book-hands-on-machine-learning-with-scikit-learn-keras-and-tensorflow

📘 Explore the digital translation of "Practical Machine Learning" covering machine learning, deep learning, and neural networks in Persian.

computer-vision data-visualization deep-learning keras keras-tensorflow machi machine-learning neural-networks nlp num panda python reinforcement-learning sci tensorflow2

Last synced: 28 Apr 2026

https://github.com/malbiruk/salesflow-data-pipeline

End-to-end data engineering pipeline using Azure Blob, Data Factory, dbt, Snowflake, and Streamlit for interactive business analytics. (WIP)

azure-data-factory cloud-data-engineering data-visualization dbt etl snowflake streamlit

Last synced: 08 Jun 2026

https://github.com/szapp/candyanalysis

Case study: Analyze the candy power ranking to identify and recommend popular candy characteristics

data-analysis data-visualization feature-selection interaction-terms

Last synced: 28 Apr 2026

https://github.com/robertovicario/uninsubria-datavisualization-project-work

Project Work for the Data Visualization module in the MSc in Computer Science program in Varese.

data-visualization dogecoin elonmusk python

Last synced: 28 Apr 2026

https://github.com/joshuadch/customer-churn-prediction

Predicting customer churn with Python (ETL, feature engineering, ML models, AUC/ROC) and business insights.

classification customer-churn data-science data-visualization feature-engineering machine-learning pandas python sklearn xgboost

Last synced: 28 Apr 2026

https://github.com/marcusrprojects/stock-return-analyzer

Analyze and visualize cumulative stock returns against a benchmark (e.g., S&P 500) across multiple time scopes using Python, yfinance, and Matplotlib.

cumulative-return data-visualization matplotlib pandas python stock-analysis yfinance

Last synced: 29 Apr 2026

https://github.com/chanmeng666/customer-insight

AI-powered customer review analysis platform — sentiment analysis, keyword extraction, topic modeling, and anomaly detection

chinese-nlp customer-feedback customer-insights data-visualization machine-learning nlp python review-analysis sentiment-analysis streamlit text-analysis text-mining topic-modeling

Last synced: 29 Apr 2026

https://github.com/misha-mayskiy/lootbox_analytics

Lootbox Analytics: Your personal dashboard for tracking and analyzing lootbox/gacha opening statistics from popular games. Currently supports Genshin Impact with detailed Pity/luck analysis. (Python, Flask, SQLAlchemy)

chartjs data-visualization flask gacha game-analytics genshin-impact pity-tracker python sqlalchemy statistics

Last synced: 29 Apr 2026

https://github.com/tynandebold/daylight

Amount of daylight in select locations around the world.

data-visualization data-viz daylight javascript react time

Last synced: 29 Apr 2026

https://github.com/frammenti/knowledge-sake

Documentation and code for the course project in Open Access and Digital Ethics, University of Bologna, a.y. 2024/2025.

data-visualization dcat-ap education eurostat observable-plot oecd open-data

Last synced: 29 Apr 2026

https://github.com/ronnienigash/exploring-visualization

Playing with D3, Python, and SQLite3 to create dynamic visualizations of interesting data

d3 data-visualization html python sqlite3 visualization

Last synced: 29 Apr 2026

https://github.com/andresberejnoi/streamlit-apps

A collection of sample web apps built with Streamlit.

data-science data-visualization streamlit

Last synced: 29 Apr 2026

https://github.com/fatihilhan42/starbucks_analysis_turkey_and_world_with_python

In this project, firstly the brands for coffee in the world and then these brands in Turkey were examined. The data from the dataset, which you can find in the repo, was first organized using data cleaning algorithms. These cleaned data were then graphically extracted using data visualization algorithms.

data-analysis data-cleaning data-science data-visualization jupyter-notebook python

Last synced: 29 Apr 2026

https://github.com/mr-dhan/eda-sales-customer-transactions

Dalam dunia bisnis ritel yang kompetitif, pemahaman mendalam terhadap perilaku pelanggan merupakan fondasi penting untuk pengambilan keputusan strategis. Namun, data transaksi pelanggan seringkali berjumlah besar dan kompleks, sehingga memerlukan proses analisis yang efektif untuk mengungkap insight yang berharga.

dashboard data data-analysis data-analysis-python data-science data-visualization eda python

Last synced: 29 Apr 2026

https://github.com/sharoonjoseph11/indian-liver-diseases

Indian Liver Disease Analysis and Prediction This project leverages the Indian Liver Patient Dataset (ILPD) to analyze liver disease trends and develop predictive models for early diagnosis. Through data preprocessing, exploratory analysis, and machine learning, it identifies key risk factors and builds classification models

data-analysis data-science data-visualization logistic-regression machine-learning pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/istinnew/eniac_ab_insight

Dive into a comprehensive analysis aimed at boosting iPhone 13 sales by optimizing the Click-Through Rate (CTR) of the “SHOP NOW” button, compare different button designs and determine the most effective strategy for increasing engagement.

ab-testing data data-analysis data-engineering data-science data-visualization google googlecolab libraries python testing testing-tools visual-studio-code

Last synced: 29 Apr 2026

https://github.com/hazz-i/e-commerce-analysis

FP Dicoding Analisis data dengan python

data-visualization jupyter-notebook python

Last synced: 29 Apr 2026

https://github.com/shariqayan/diwali_sales_analysis_python

The Diwali Sales Analysis project focuses on analyzing sales data during the Diwali festival to gain insights into customer behavior, improve customer experience, and optimize sales strategies.

data-visualization matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/prithviraj-2003/cognifyz-data-science-internship

🎓 Data Science Internship at Cognifyz Technologies 📅 Duration: 2 Months 🧠 Worked on real-world restaurant data 🗂️ Completed structured tasks across 3 levels 📌 Tasks focused on EDA, data preprocessing, visualization, and analysis 📎 Task descriptions provided in an attached PDF

data-analysis data-science data-visualization matplotlib numpy pandas python3

Last synced: 29 Apr 2026

https://github.com/sukitsubaki/image-color-scheme

Extract dominant colors from images and create beautiful color palettes with minimal dependencies. Supports various palette types: monochromatic, analogous, complementary, triadic, and tetradic.

color-extraction color-palette data-visualization design-tools image-analysis minimal python python-library

Last synced: 29 Apr 2026

https://github.com/saravanansuriya/airbnb-analysis

In This project aims to analyze Airbnb data, perform data cleaning and preparation, develop interactive geospatial visualizations, and create dynamic plots to gain insights into pricing variations, availability patterns, and location-based trends with help of Streamlit app.

data-visualization eda mongodb-atlas pandas python-script streamlit-webapp

Last synced: 29 Apr 2026

https://github.com/laipching/sprint6_module1

Exploratory Data Analysis with Python (Pandas/Matplotlib/Seaborn). Business questions, metrics and clear visualizations.

data-visualization eda matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/muhammadusman-khan/e-commerce-store-eda

Exploratory Data Analysis on E-commerce store data to uncover insights about sales trends, customer behavior, and product performance using Python libraries like Pandas, NumPy, and Matplotlib/Seaborn.

data-analysis data-science data-visualization e-commerce eda exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/machinelearningzuu/data-engineering-projects

This repository is a curated collection of projects and tools that exemplify best practices in data engineering. It serves as a resource for data professionals seeking to enhance their data infrastructure, optimize data pipelines, and implement cutting-edge data processing techniques.

airflow bigquery data-engineering data-science data-visualization data-warehouse

Last synced: 30 Apr 2026

https://github.com/r-mahesh45/fraud-detection-and-sales-analysis-using-random-forest

This project uses Random Forest to classify fraud risk based on taxable income and analyze key factors driving high sales for a cloth manufacturing company.

classification data-visualization extract-transform-load python3 random-forest

Last synced: 30 Apr 2026

https://github.com/devprnvk/realestateml

This Python program analyzes a dataset (HousePricePrediction.xlsx) containing information about house prices. It utilizes pandas for data manipulation, matplotlib for plotting, and seaborn for visualizing correlations and distributions.

data-science data-visualization datasets houses npm plotting prediction-model seaborn

Last synced: 30 Apr 2026

https://github.com/tashi-2004/global-ecommerce-retail-trends-analysis

The Global E-commerce & Retail Analysis project involves data preprocessing, dimensionality reduction with PCA, CLV calculation and What-If analysis . Key insights include effective PCA for data reduction, detailed CLV analysis across segments , and the impact of pricing strategies on sales.

boxplot clv-analysis data-science data-visualization dataintegration deep-learning dimensionality-reduction ecommerce heatmap machine-learning normalization outlier-detection outlier-removal pca-analysis preprocessing python scatter-plot whatif-analysis

Last synced: 30 Apr 2026

https://github.com/dina-hosny/import-preprocess-and-visualize-a-dataset-project

A simple project to practice importing a dataset, data cleaning and preparation processes, and visualize the results to answer some given questions.

data-cleaning data-engineering data-science data-visualization jupyter-notebook matplotlib numpy pandas python

Last synced: 30 Apr 2026

https://github.com/mxagar/eda_fe_summary

An 80/20 guide for Data Processing: Data Cleaning, Exploratory Data Analysis, Feature Engineering, Feature Selection.

data-analysis data-cleaning data-modeling data-science data-visualization eda exploratory-data-analysis feature-engineering feature-selection machine-learning pandas

Last synced: 30 Apr 2026

https://github.com/fernandesotero/project-data-exploration

Student Performance Prediction with Data Science

data-visualization jupyter-notebook python

Last synced: 30 Apr 2026

https://github.com/cagandemirmr/airbnb_available_houses

In this repo, i create dashboard using Tableau.In this process, i use SQL and Python languages.

dashboard data-visualization dataprocessing python sql tableau

Last synced: 30 Apr 2026

https://github.com/mayankfreelancer/advanced-sales-analytics-dashboard-power-bi-

This interactive Power BI dashboard provides a comprehensive analysis of sales data across regions, categories, and time periods. The project aims to uncover key trends in total sales, profit, quantity sold, and product performance, using advanced visualizations and forecasting techniques. 🛠 Tools & Techniques Used: Power BI

dashboard data-science data-visualization excel numpy pandas powerbi python sales-analysis sql

Last synced: 30 Apr 2026

https://github.com/tolumie/aviva-insurance-statistics-hypothesis-abtesting-modelling

This project explores the impact of demographic and lifestyle factors on insurance charges. Using statistical hypothesis testing (ANOVA, Chi-Square, T-tests) and predictive modeling (Elastic Net, Random Forest, Gradient Boosting). The analysis is deployed using Streamlit.

anova chi-square-test data-visualization eda gradient-boosting hypothesis-testing insurance-dataset machine-learning predictive-modeling python random-forest statistical-analysis streamlit

Last synced: 30 Apr 2026

https://github.com/ddeepanshu-997/datascience-e-commerce-shopping-details-

in this project i am going to apply data preprocessing technique on the dataset in order to clean the data using libraries, etc. make some insights/analyses to findout the hotpicks of the shopping along with some data visualsation libraries to get the trends and many more aspects in order to make a small contribution to the field of data science

cleaning-data data data-science data-visualization dataframe datapreprocessing dataset libraries matplotlib-pyplot numpy pandas plots python visualization

Last synced: 30 Apr 2026

https://github.com/falakrana/data-analysis-visualization

This repository showcases data analysis and visualization projects using Python and Tableau. It includes exploratory data analysis, interactive dashboards, and insightful visual stories derived from real-world datasets.

data-analysis data-visualization python tableau-public

Last synced: 01 May 2026

https://github.com/kivanc57/explaratory_analysis

Exploratory and Descriptive Data Analysis on Indonesian data using R. This project involves reading data, feature analysis, correlation analysis, logistic regression, PCA, MDS, and clustering. Visualizations include boxplots, scatter plots, corrgrams, and dendrograms. Comprehensive report available in report.docx.

clustering data-science data-visualization descriptive-statistics explanatory-data-analysis mds pca plot r

Last synced: 08 Jun 2026

https://github.com/cdeweyx/bryce-harper-2016-analysis

Notebook analyzing Bryce Harper's disappointing 2016 campaign in historical context through data analytics.

data-analysis data-visualization python

Last synced: 01 May 2026

https://github.com/codeofrahul/python_amazon_sales_analysis

In this repository, I have saved my Python_Amazon_sales_analysis Notebook. To do this Amazon_sales_analysis, I have done end to end process. cleaned the dataset, Did EDA, ploted graph and reached to the conclusion.

amazon analysis data-visualization eda exploratory-data-analysis matplotlib pandas-library python seaborn

Last synced: 01 May 2026

https://github.com/ishmal793/bi-dummy-

An interactive and beginner-friendly data dashboard built using Streamlit. Upload your own CSV or Excel file, apply filters, view key statistics, and generate beautiful visualizations with no coding required.

data-analytics data-visualization eda pandas plotly python-dashboard streamlit

Last synced: 01 May 2026

https://github.com/codesaadumair/data-science-monorepo

Comprehensive Data Science monorepo featuring EDA, Machine Learning, Preprocessing, Feature Engineering, and Visualization projects with Jupyter notebooks and Python.

data-analysis data-science data-science-projects data-visualization eda jupyter-notebook jupyterlab machine-learning python

Last synced: 01 May 2026

https://github.com/abdoomohamedd/python-data-analysis-projects

A collection of data analysis projects that I have worked on. Each project involves cleaning data, performing exploratory data analysis (EDA), and creating visualizations to extract insights. The projects utilize various Python libraries, including pandas, numpy, matp

data-analysis data-analysis-python data-cleaning data-visualization matplotlib matplotlib-pyplot numpy pandas python

Last synced: 01 May 2026

https://github.com/treyhamilton/baccarat-betting-simulator

Streamlit app simulating a baccarat betting strategy with full results visualization and CSV output.

baccarat betting data-visualization gambling machine-learning simulator streamlit

Last synced: 01 May 2026

https://github.com/gabrieldiem/iss_locator

Little python script that plots the ISS (International Space Station) location in a world map at a given time

data-visualization pandas plotly python script

Last synced: 01 May 2026

https://github.com/samia35-2973/world-university-ranking-2023-prediction

This repository is about creating models for predicting world university rankings 2023. The World University Rankings 2023 dataset include 1,799 universities across 104 countries and regions, making them the largest and most diverse university rankings to date. A clean dataset is generated through data preprocessing.

data-cleaning data-preprocessing data-visualization decision-trees machine-learning machine-learning-algorithms model-training prediction world-university-rankings world-university-rankings-2023

Last synced: 01 May 2026