An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/benzerinsio/floralspecies-eda

📊 Análise Exploratória de Dados (EDA) - Flores Iris | Exploração de padrões e clustering com K-Means

analise-de-dados analise-exploratoria analise-exploratoria-de-dados botany clustering data-visualization eda exploratory-analysis exploratory-data-analysis python seaborn

Last synced: 27 Apr 2026

https://github.com/natanast/tidytuesday_python

This repository contains my submissions for the TidyTuesday Python Challenge.

data-science data-visualization posit python quarto tidytuesday

Last synced: 07 Jun 2026

https://github.com/imshakil/machinelearning

Learning machine-learning algorithms, applications, completed projects, completed courses from different online course academy.

coursera data-analyst data-science-notebook data-visualization machine-learning-coursera machinelearning mathematics projects python udemy

Last synced: 28 Apr 2026

https://github.com/ppatrzyk/foreign-tourists

Data visualization built with Svelte and d3.

d3 data-visualization poland svelte

Last synced: 28 Apr 2026

https://github.com/prajakta1321/credit-card-fraud-

credit card fraud detection using LR and data visualization in ML

data-visualization logistic-regression machine-learning outlier-detection python3

Last synced: 28 Apr 2026

https://github.com/incalculable-driverslicence975/data-projects-portfolio

📊 Showcase data projects that highlight analytics, machine learning, and MLOps with reproducible code and clear business insights.

ai computer-vision dashboard data-science-projects data-visualization deep-learning etl excel finance hadoop hiveq keras machine-learning nlp pandas portfolio-project scikit-learn tableau-dashboards

Last synced: 28 Apr 2026

https://github.com/matheusafonseca/python-data-visualization-matplotlib-seaborn-masterclass-udemy

This repository is dedicated to storing the code developed during the "Python Data Visualization: Matplotlib & Seaborn Masterclass" course on Udemy.

charts data-analysis data-analysis-python data-science data-visualization database graphics graphics-programming jupyter-notebook matplotlib matplotlib-plots python python3 seaborn seaborn-plots

Last synced: 28 Apr 2026

https://github.com/sawaira-iqbal/used-cars-price-prediction-ml-project

🚗 The Used Car Price Prediction project uses advanced ML models like Random Forest 🌲, Decision Tree 🌳, XGBoost 🚀, and SVR 🔍 to predict used car prices, enhancing buying and selling decisions.

data-visualization decision-tree machine-learning price-prediction python random-forest-regressor support-vector-machine xgboost

Last synced: 28 Apr 2026

https://github.com/szapp/candyanalysis

Case study: Analyze the candy power ranking to identify and recommend popular candy characteristics

data-analysis data-visualization feature-selection interaction-terms

Last synced: 28 Apr 2026

https://github.com/ezrahsieh/narrativevisualization

This project is an interactive narrative visualization designed to illustrate the impact of the COVID-19 pandemic on global life expectancy. The visualization is implemented using D3.js and follows the Martini glass narrative structure. This serves as the final project for CS416 at UIUC.

d3 data-visualization interactive-visualizations javascript narrative-visualization

Last synced: 28 Apr 2026

https://github.com/orvn/some-visualizations

Just some visualizations of concepts and data

d3js data-visualization math statistics

Last synced: 24 Jun 2026

https://github.com/dariush-hassani/pfd-charts

A lightweight, animated and customizable charting library for building Primary Flight Display (PFD) using modular D3.js.

d3js data-visualization drone gcs pfd

Last synced: 08 Jun 2026

https://github.com/marcusrprojects/stock-return-analyzer

Analyze and visualize cumulative stock returns against a benchmark (e.g., S&P 500) across multiple time scopes using Python, yfinance, and Matplotlib.

cumulative-return data-visualization matplotlib pandas python stock-analysis yfinance

Last synced: 29 Apr 2026

https://github.com/chrispsang/healthcare-dataanalysis

Analyze synthetic patient data to identify trends, improve healthcare delivery, and predict patient outcomes using machine learning models. Includes data exploration, preprocessing, model building, and visualizations.

data-analysis data-science data-visualization healthcare jupyter-notebook machine-learning python

Last synced: 29 Apr 2026

https://github.com/andresberejnoi/streamlit-apps

A collection of sample web apps built with Streamlit.

data-science data-visualization streamlit

Last synced: 29 Apr 2026

https://github.com/mr-dhan/eda-sales-customer-transactions

Dalam dunia bisnis ritel yang kompetitif, pemahaman mendalam terhadap perilaku pelanggan merupakan fondasi penting untuk pengambilan keputusan strategis. Namun, data transaksi pelanggan seringkali berjumlah besar dan kompleks, sehingga memerlukan proses analisis yang efektif untuk mengungkap insight yang berharga.

dashboard data data-analysis data-analysis-python data-science data-visualization eda python

Last synced: 29 Apr 2026

https://github.com/sharoonjoseph11/indian-liver-diseases

Indian Liver Disease Analysis and Prediction This project leverages the Indian Liver Patient Dataset (ILPD) to analyze liver disease trends and develop predictive models for early diagnosis. Through data preprocessing, exploratory analysis, and machine learning, it identifies key risk factors and builds classification models

data-analysis data-science data-visualization logistic-regression machine-learning pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/hazz-i/e-commerce-analysis

FP Dicoding Analisis data dengan python

data-visualization jupyter-notebook python

Last synced: 29 Apr 2026

https://github.com/sanjana-bongale/cancer_survival_data_analysis_and_prediction_using_logistic_regression

This project performs data analysis using Python to predict cancer patient survival outcomes. It involves data cleaning, exploratory analysis, and visualizations to explore factors like cancer type, stage, and treatments. A logistic regression model is built to predict patient survival based on demographic and medical data.

data-analysis data-cleaning data-science data-visualization eda jupyter-notebook kaggle logistic-regression machine-learning matplotlib numpy pandas predictive-modeling python scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/shwetapardhi/assignment-04-simple-linear-regression-1

Q1) Delivery_time -> Predict delivery time using sorting time. Build a simple linear regression model by performing EDA and do necessary transformations and select the best model using R or Python. EDA and Data Visualization, Feature Engineering, Correlation Analysis, Model Building, Model Testing and Model Predictions using simple linear regressi

correlation-analysis data-visualization distplot eda feature-engineering model-building model-prediction model-testing numpy ols-regression p-value pandas python regression-plot rsquare-values seaborn simple-linear-regression smf statsmodel t-score

Last synced: 30 Apr 2026

https://github.com/abhinav330/instagram-influencers-analysis

This Jupyter Notebook focuses on preprocessing and visualizing data from an Instagram profiles dataset. It includes data loading, inspection, visualization, and some data preprocessing steps.

data data-science data-visualization exploratory-data-analysis exploratory-data-visualizations influncer-products instagram scikit-learn sklearn

Last synced: 08 Jun 2026

https://github.com/lu-m-dev/biostatistics-eda

Exploratory data analysis and visualization system for biostatistical research

biostatistics data-analysis data-visualization eda

Last synced: 25 Jun 2026

https://github.com/edgarhtt/uber_freight_data_analysis

Uber Freight interview homework. It consisted of solving a 2 warehouse problem and an ETL task

data-analysis data-science data-visualization python

Last synced: 30 Apr 2026

https://github.com/diogojorgebasso/dataanalysis_r_minesnancy

Les codes et les matériaux des cours d'analyse de données en R à Mines de Nancy. Vous y trouverez également des scripts R, des notebooks et d'autres ressources pour chaque leçon.

analyse-data data-analysis data-science data-visualization estatistics r statistiques statistiques-descriptives

Last synced: 30 Apr 2026

https://github.com/cagandemirmr/airbnb_available_houses

In this repo, i create dashboard using Tableau.In this process, i use SQL and Python languages.

dashboard data-visualization dataprocessing python sql tableau

Last synced: 30 Apr 2026

https://github.com/mayankfreelancer/advanced-sales-analytics-dashboard-power-bi-

This interactive Power BI dashboard provides a comprehensive analysis of sales data across regions, categories, and time periods. The project aims to uncover key trends in total sales, profit, quantity sold, and product performance, using advanced visualizations and forecasting techniques. 🛠 Tools & Techniques Used: Power BI

dashboard data-science data-visualization excel numpy pandas powerbi python sales-analysis sql

Last synced: 30 Apr 2026

https://github.com/samuelpillai/machine-learning-classification-regression-nlp

A curated collection of machine learning mini-projects covering classification, regression, and natural language processing (NLP). This project demonstrates model training, evaluation, feature engineering, and pipeline integration using real-world datasets and Python tools like Scikit-learn, pandas, and NLTK.

classification data-analysis data-science data-visualization feature-engineering jupyter-notebook machine-learning ml-pipeline model-evaluation nlp python regression-models scikit-learn supervised-learning text-mining

Last synced: 30 Apr 2026

https://github.com/mmartin46/county-health-findings-project

Analyze the data set given by United Health Group(UHG) to determine the impact on race, social and demographic factors on health, survival, and mortality.

analysis data-science data-visualization linear-regression machine-learning pandas

Last synced: 30 Apr 2026

https://github.com/01110011011101010110010001101111/tigercosmosbootstrapdash

Sample Repository of Visualisaing TigerGraph Data with Cosmos in a Bootstrap Dashboard

bootstrap cosmos data-visualization graph-visualization tigergraph

Last synced: 30 Apr 2026

https://github.com/the-clone-xyz/stats-lapas-pakam

Visualisasi data narapidana berdasarkan jenis kelamin di Lapas Lubuk Pakam menggunakan data BPS Deli Serdang secara otomatis via GitHub Actions.

bps-api data-visualization github-actions lubuk-pakam statistics

Last synced: 30 Apr 2026

https://github.com/realvuk/r-for-data-science-by-vuk

My exercise from the book R for Data Science: Import, Tidy, Transform, Visualize, and Model Data 2nd Edition

data-science data-visualization r rstats

Last synced: 13 Jun 2026

https://github.com/miguelmedinacastro/trabalho-dados-r

Trabalho final da disciplina Análise Exploratória de Dados

data data-science data-science-projects data-visualization database r rstudio

Last synced: 01 May 2026

https://github.com/gitchaell/computer-scrapping

Tool that extracts data from the pages of companies that sell computers in the city of Trujillo - Peru, exports them in an XLSX file according to a relational data model, and displays them on a Power BI dashboard.

data-analysis data-structures data-visualization database dbdiagram export-excel powerbi scrapper-script scrapping xlsx

Last synced: 01 May 2026

https://github.com/fazatholomew/marlboroplan

In order to contribute to a more inclusive sustainable energy program in Massachusetts, this project is part of my work for a nonprofit organization called All In Energy and undergraduate thesis for my degree.

data-analysis data-visualization energy jupyter-notebook massachusetts python

Last synced: 01 May 2026

https://github.com/codesaadumair/data-science-monorepo

Comprehensive Data Science monorepo featuring EDA, Machine Learning, Preprocessing, Feature Engineering, and Visualization projects with Jupyter notebooks and Python.

data-analysis data-science data-science-projects data-visualization eda jupyter-notebook jupyterlab machine-learning python

Last synced: 01 May 2026

https://github.com/anandvai/ai_rag_chatbot_multi_pdf_support

RAG (Retrieval-Augmented Generation) Chatbot built with Streamlit and LangChain, powered by Groq's blazing-fast LLaMA3-8B. It allows you to upload multiple PDFs, ask questions, and get precise, context-aware answers in a conversational format.

ai data data-science data-visualization data-visualizations dataengineering fastapi langchain langgraph python sql streamlit

Last synced: 01 May 2026

https://github.com/gabrieldiem/iss_locator

Little python script that plots the ISS (International Space Station) location in a world map at a given time

data-visualization pandas plotly python script

Last synced: 01 May 2026

https://github.com/vedantshi/stock-price-prediction-for-maang-companies

This project utilizes Long Short-Term Memory (LSTM) networks to forecast stock prices. It includes steps for data preprocessing, model training, and visualization of predictions using Python in Jupyter Notebook. The project demonstrates proficiency in machine learning, data analysis, and Python programming.

data-analysis data-visualization lstm machine-learning python stock-price-prediction

Last synced: 01 May 2026

https://github.com/vaxdata22/redfin-analytics-etl-using-amazon-emr-by-airflow-on-ec2

This is an end-to-end AWS Cloud ETL project. This data pipeline uses an Amazon EMR cluster managed by Apache Airflow that is running on an AWS EC2 instance. It demonstrates how to build orchestration that would perform data transformation using Amazon EMR as well as automatic data ingestion into a Snowflake via Snowpipe. It also features Power BI.

amazon-emr-cluster apache-airflow apache-spark aws-ec2 aws-s3 business-intelligence dags data-visualization etl-pipeline google-colab-notebook orchestration power-bi pyspark redfin snowflake snowpipe sqs-queue

Last synced: 02 May 2026

https://github.com/faithererer/haokanvideo_spider

好看视频爬取与数据分析

data-analysis data-visualization python spider

Last synced: 02 May 2026

https://github.com/shridhar1504/milk-production-time-series-forecasting-datascience-project

This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast the milk production. The model is evaluated using various metric.

adf arima-model augmented-dickey-fuller-test data-analysis data-analytics data-science data-visualization eda exploratory-data-analysis machine-learning machine-learning-algorithms python python3 residuals sarimax seasonality time-series time-series-forecasting trends

Last synced: 02 May 2026

https://github.com/gerhynes/d3-mobile-subscription-literacy-scatterplot

A D3 scatterplot showing mobile phone subscriptions against literacy rates. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 02 May 2026

https://github.com/hafs96/prediction_consommation-de-carburant

Dans ce projet, l'objectif est de développer un modèle permettant de prédire si une voiture a une consommation de carburant élevée ou faible en fonction de ses caractéristiques techniques.

analysis data data-visualization machine-learning testing training

Last synced: 09 Jun 2026

https://github.com/rorrell/employmentdata

A Jupyter Notebook where I use group by to analyze the average unemployment rate by year

data-analysis data-visualization jupyter-notebook python3

Last synced: 02 May 2026

https://github.com/holy-angel-university/global-cost-index-analysis

This analysis explores the cost of living across various countries, aiming to provide insights into economic disparities and living standards on a global scale. Utilizing a dataset that includes indices for overall cost of living, groceries, restaurant prices, and rent, we investigate the top and least expensive countries worldwide.

data-science data-visualization exploratory-data-analysis jupyter-notebook python3

Last synced: 02 May 2026

https://github.com/debjyotisaha/web-application-projects-streamlit-phase-2

This repository showcases interactive web applications built using the Streamlit framework.

dashboard data-visualization python streamlit

Last synced: 02 May 2026

https://github.com/femincan/d3-treemap-diagram

My solution for the Visualize Data with a Treemap Diagram project on FCC.

css3 d3js data-visualization html5 javascript

Last synced: 02 May 2026

https://github.com/inevolin/multivariate-data-analysis

Showcases of modern multivariate & multidimensional data analysis in industrial and high-tech settings.

analytics data-analysis data-science data-visualization javascript

Last synced: 09 Jun 2026

https://github.com/isinghabhishek/data_analysis_with_python

Introduction to Data Analysis covering the basics of Python, Numpy, Pandas, Data Visualization, and Exploratory Data Analysis.

data-visualization exploratory-data-analysis numpy pandas python

Last synced: 03 May 2026

https://github.com/ronaldkanyepi/southern-africa-health-indicators-analysis

Visualizing Health Trends in Southern Africa: Analyzing World Bank data (2013–2020) to uncover patterns in life expectancy, infant mortality, HIV prevalence, and maternal mortality.

botswana data-visualization dplyr ggplot2 ggtext health healthcare mozambique patchwork r south-africa tidyverse viridis zimbabwe

Last synced: 09 Jun 2026

https://github.com/lut-ful/credit-card-financial-report-dashboard

Visualizing and unlocking different KPI's of a Financials instution depending on credit card data using Power BI.

data-analytics data-visualization dax-query postgresql powe-bi python statics

Last synced: 03 May 2026

https://github.com/saagpatel/sovereign

Browser-based geopolitical simulator — apply policy levers to 18 countries and watch cascading effects over 60 months

d3 data-visualization geopolitics monte-carlo nextjs simulation typescript web-worker

Last synced: 28 Jun 2026

https://github.com/leosolar8/stock-price-prediction-ai-model

This project shows how to use a special type of AI called Long Short-Term Memory (LSTM) to predict stock prices. The project is split into two main parts: Training the AI Model and Making Predictions (Inference)

ai csv-dataset data-science data-visualization deep-learning finance financial-data forecasting keras lstm machine-learning python rnn stock-market stock-prediction tensorflow time-series time-series-forecasting

Last synced: 08 Apr 2026

https://github.com/akhil-krishnan-r/super_market_analysis

The growth of supermarkets in most populated cities are increasing and market competitions are also high. This dataset is one of the historical sales of supermarket company which has recorded in 3 different branches for 3 months data. Predictive data analytics methods are easy to apply with this dataset

data-visualization matplotlib numpy pandas seaborn

Last synced: 03 May 2026

https://github.com/gerhynes/d3-birth-chart

An interactive UN birth rate bar chart built to practice manipulating SVGs with D3. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 03 May 2026

https://github.com/prachipatel15/priceprediction

I've build a real estate price prediction project which predicts a house price in Banglore.

data-science data-visualization machine-learning pandas python

Last synced: 03 May 2026

https://github.com/imgios/dependensee

📊 Minimal relaxed functional dependencies set visualization webapp.

angular d3js data-visualization ionic rfd typescript visual-analytics webapp

Last synced: 03 May 2026

https://github.com/eikospartners/ngcatalyst

Repo for visuzalizations for Angular

angular d3 data-visualization

Last synced: 03 May 2026

https://github.com/raufjatoi/data-visualization

Data Visualization series

data-visualization

Last synced: 29 Jun 2026

https://github.com/fnfkkengine/website-performance-data-analysis-project

🌐 Analyze website performance data to gain insights on user engagement, traffic trends, and optimize your digital presence effectively.

analysis analytics dashboard data-analyst data-collection data-preprocessing data-visualization dax funnel-analysis jupyter-notebook matplotlib mysql numpy paid-search power-query project-management python storytelling

Last synced: 04 May 2026

https://github.com/gerhynes/d3-pie-chart

A D3 pie chart showing UN birth data grouped by continent. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 04 May 2026

https://github.com/leosimoes/udacity-communicate-data-findings-titanic

Project 5 of the Udacity Data Analyst Nanodegree Program. Data analysis with views of the titanic dataset.

data-science data-visualization python

Last synced: 04 May 2026

https://github.com/nitrocosmstudios/temperature-sonification

Experimental sonification of temperature sensor data revealing structured, speech-like patterns coinciding with geomagnetic disturbances. Includes original dataset processing, synthetic signal attempts, and full write-up.

audio-analysis data-visualization electromagnetic-interference emf environmental-monitoring open-science raspberry-pi signal-processing sonification temperature temperature-monitoring temperature-sensor usb-devices weird-science

Last synced: 04 May 2026

https://github.com/ibrahimm7004/supermarket-sales-analysis

This project focuses on Data Mining techniques to gather inisights about customer behaviour regarding Supermarket Sales. Includes: Association Rule Mining, Temporal Patterns in customer behavior, Sequential Pattern Mining, Classification, Regression, and Outlier Detection.

apriori association-rules data-analysis data-mining data-science data-visualization fpgrowth python sales-analysis supermarket-sales

Last synced: 04 May 2026

https://github.com/hyperplasma/olympic-visualization-analysis

Multidimensional analysis and visualization of Olympic medals, economy, and happiness index.

data-analysis data-visualization matplotlib numpy pandas python wordcloud

Last synced: 04 May 2026

https://github.com/jdanielgoh/abuso-escuelas

Codigo fuente del proyecto depredadores en las aulas. Construido con Nuxt.js, three.js y d3.js

d3js data-visualization ddj threejs

Last synced: 09 Jun 2026

https://github.com/ljadhav25/logistic-regression-data-science-

Logistic regression estimates the probability of an event occurring, such as voted or didn’t vote, based on a given data set of independent variables.

data-analysis data-science data-visualization logestic-regression machine-learning

Last synced: 04 May 2026

https://github.com/mugilan1309/csv_analyzer

📊 A simple Streamlit-based CSV Analysis & Preprocessing Tool for quick data insights.

csv-processing data-analysis data-visualization machine-learning python streamlit

Last synced: 04 May 2026

https://github.com/bishopce16/pyber_analysis

The purpose of this project was to complete an exploratory analysis and create visualizations of the 2019 ride sharing data from PyBer.

data-analysis data-visualization jupyter-notebook matplotlib pandas python

Last synced: 04 May 2026

https://github.com/drod75/nyc-arrests-analysis

This is a simple Data Science Project made to analyze and display data and trends found within the NYC Arrests Year to Date Dataset.

data-analysis data-visualization folium jupyter-notebook matplotlib-pyplot nyc-opendata nypd python scikit-learn seaborn

Last synced: 04 May 2026

https://github.com/s4rrar/israel-gaza-war

Human losses of the Israeli genocide in Gaza and the West Bank in numbers (October 7th War Victims)

data-engineering data-science data-visualization datascience jupyter jupyter-notebook python

Last synced: 05 May 2026

https://github.com/badranalyst/residential-unit-prices-data-analysis-application

Python-based analysis of residential unit prices, focusing on data cleaning, visualization, and exploratory data analysis (EDA). Key features include price distribution, and correlation analysis between factors like size, location, and pricing.

data-analysis data-visualization dataset matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/itskshitija/tesla-stock-price-prediction

Welcome to the Tesla Stock Price Forecasting project, where we delve into time-series analysis to predict stock price trends for one of the world's most innovative companies—Tesla Inc.

data-visualization eda python time-series-analysis

Last synced: 29 Jun 2026

https://github.com/ledsouza/covid19

Projeto de análise de dados dos casos de Covid19

data-science data-visualization matplotlib pandas seaborn vitrinedev

Last synced: 05 May 2026

https://github.com/hms75/movie_rating_analysis

A movie rating analysis which identifies trends amongst a dataset of 5000 movies.

data-analysis data-visualization matplotlib-pyplot numpy pandas python

Last synced: 05 May 2026

https://github.com/femincan/d3-bar-chart

My solution for the Visualize Data with a Bar Chart project on FCC.

css3 d3js data-visualization html5 javascript

Last synced: 06 May 2026

https://github.com/priyapuranik/diwali-sales-analysis_with-data-visualization

Diwali sales data analysis with visualizations, highlighting customer demographics, product categories, and purchase trends.

data-visualization matplotlib-pyplot pandas python seaborn

Last synced: 06 May 2026

https://github.com/deaneeth/aqi-monitoring-system

Real-time air quality monitoring system with interactive maps, role-based access control, and automated AQI data simulation. Built with Laravel, MySQL, and Leaflet.js.

air-quality alerts-system data-visualization fullstack geospatial interactive-maps laravel leaflet mysql php phpunit simulation web-application

Last synced: 06 May 2026

https://github.com/sankaran-s2001/us-traffic-accidents-analysis-python-eda

Exploratory data analysis of US traffic accidents from 2016-2023, analyzing patterns by time, location, weather, and severity using Python data science libraries.

data-analysis data-science data-visualization eda matplolib numpy pandas python

Last synced: 06 May 2026

https://github.com/harryrlk/data_analysis_showcase

This repository showcases my data analysis and visualization projects using Excel, Python, R, and Tableau. Some projects are under NDA, so key figures and specific numbers are not included, but brief overviews and methodologies are provided. Feel free to explore and contact me for further details.

data-analysis data-science data-visualization excel portfolio python r tableau

Last synced: 06 May 2026