An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/prsdthkr/viz-design-demo

📈 This repo houses my experiments with D3 (mostly inspired by other's work) for information visualization class project and demo.

d3 data-visualization lgbtq

Last synced: 06 May 2026

https://github.com/harmonicode/filtra

Digital Filter Designer is a powerful application built using PyQt5 and Matplotlib. It allows users to design and visualize digital filters, including standard filters and all-pass filters, and generate corresponding C code. Ideal for students, researchers, and engineers in digital signal processing.

data-visualization digital-signal-processing filter-design pyqt5 real real-time-processing

Last synced: 22 Mar 2025

https://github.com/Akhil-krishnan-r/super_market_analysis

The growth of supermarkets in most populated cities are increasing and market competitions are also high. This dataset is one of the historical sales of supermarket company which has recorded in 3 different branches for 3 months data. Predictive data analytics methods are easy to apply with this dataset

data-visualization matplotlib numpy pandas seaborn

Last synced: 03 Feb 2026

https://github.com/holy-angel-university/student-performance-analysis

This project analyzes student data to understand factors affecting final exam scores. Data includes study habits, extracurriculars, family background, school environment, and demographics. The goal is to identify key contributors to academic success.

data-science data-visualization exploratory-data-analysis jupyter-notebook python3

Last synced: 06 Apr 2025

https://github.com/cudavailable/gdp-data-visualization

A data visualization project for GDP data

data-visualization gdp vue

Last synced: 20 May 2026

https://github.com/vaxdata22/customer-churn-data-analytics-etl-pipeline-by-airflow-on-ec2

This is an end-to-end AWS Cloud ETL project. This orchestration uses Apache Airflow on AWS EC2 as well as AWS Glue. It demonstrates how to build ETL pipeline that would perform data transform using Glue job/crawler as well as loading into a Redshift table. It also shows how to connect Amazon Athena to Glue Data Catalog, and Power BI to Redshift.

amazon-athena amazon-redshift apache-airflow aws-ec2 aws-glue aws-s3 business-intelligence customer-churn-analytics dags data-visualization etl-pipeline orchestration power-bi python3

Last synced: 18 Jun 2025

https://github.com/srinibas-masanta/hotel-revenue-analysis-dashboard

This project focuses on analyzing hotel booking data to uncover key metrics and insights that drive revenue management decisions. By creating an interactive Power BI dashboard, the project aims to improve strategic decision-making, optimize occupancy rates, and enhance overall financial performance within the hospitality industry.

business-analytics data-analysis data-science data-visualization dax-functions hospitality powerbi

Last synced: 12 Jan 2026

https://github.com/prakhar-code/house_sales_analysis

House Sales Analysis Of King County, Washington, USA and Clean Visualization.

data-cleaning data-visualization excel tableau tableau-dashboards tableau-public

Last synced: 12 Jan 2026

https://github.com/iankitnegi/ms-data-analyst-professional-certificate

Journey through the Microsoft Power BI Data Analyst Certificate with notes, projects, and exercises. 🚀

data-visualization microsoft powerbi

Last synced: 24 Jan 2026

https://github.com/archanakokate/bank_term_deposit_prediction

Build a Decision Tree classifier to predict if the client will subscribe to a Term Deposit based on their demographic and behavioral data.

data-analysis data-visualization exploratory-data-analysis machine-learning

Last synced: 14 Sep 2025

https://github.com/shivasairam1706/mlops-project1

End-to-end ML-Ops project using PySpark and AWS, covering environment setup, model training, deployment with data capture, execution, and analysis. CI/CD pipelines (AWS CodePipeline) and monitoring (CloudWatch) ensure automated deployment, performance tracking, and model retraining for production-ready ML solutions.

aws aws-lambda aws-s3 data-engineering data-science data-visualization delta-lake docker forcasting mlops-project pyspark unix-shell

Last synced: 20 May 2026

https://github.com/asimpson/is-steph-mvp

🏀 Compare the 2018 NBA MVP contenders against Steph Curry's historic, unanimous, 2016 MVP season.

data-visualization nba reactjs

Last synced: 20 May 2026

https://github.com/gui-sitton/carsells

In this project I am an analyst on the Crankshaft List. Hundreds of free vehicle advertisements are published on the site every day. I need to study the data collected over the last few years and determine which factors influence the price of a vehicle.

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 20 May 2026

https://github.com/alra-code/data-analytics-com-power-bi

Desafio de projetos do Boocamp Data Analytics realizado pela Dio Me em 2024

analytics data-visualization desafios-resolvidos dio-bootcamp powerbi pt-br

Last synced: 25 Jan 2026

https://github.com/karanch10/fraudshield

FraudShield is a machine learning credit card fraud detection system that analyzes transaction attributes to identify suspicious activities in real time. Built with Python, SQL, and Django, it provides a user-friendly interface for fraud prediction using OpenBanking APIs and advanced detection techniques. Ideal for businesses and individuals.

data-analysis data-science data-visualization machine-learning python3

Last synced: 20 May 2026

https://github.com/youssef-saaed/activity-recognition-using-various-ml-algorithms

This project involves a comprehensive comparative analysis of various machine learning models to classify activities based on a given dataset. The analysis follows a structured approach, including data exploration, model training, model evaluation, and results interpretation to identify the best performing model.

activity-recognition comparative-analysis cross-validation data-exploration data-visualization machine-learning model-evaluation model-training neural-networks

Last synced: 22 Mar 2025

https://github.com/shuddha2021/interactive-data-visualization-app

An interactive web application for visualizing data using Chart.js. Users can explore and analyze data through dynamic charts and customize their view

chart data-visualization event-handling interactive-ui javascript real-time-updates responsive-design web-development

Last synced: 01 Nov 2025

https://github.com/kaustubh-indulkar/te-it-dsbda-assignmnets

This repository contains the solutions for a series of assignments covering Data Science And Big Data Analytics concepts.

big-data big-data-analytics data-analytics data-science data-visualization sppu-2019-pattern sppu-it-dept

Last synced: 29 Mar 2025

https://github.com/patricksferraz/aqw-madrid-data-analysis

Interactive analysis and visualization of Madrid's air quality and weather data (2001-2016) using Python, Dash, and Jupyter. Features interactive maps, statistical analysis, and data visualization tools.

air-quality dash data-analysis data-engineering data-science data-visualization data-wrangling environmental-data environmental-science interactive-dashboard jupyter jupyter-notebook madrid open-data pandas plotly python statistical-analysis time-series weather-data

Last synced: 30 Jan 2026

https://github.com/djsprenk/djsprenk.github.io

GitHub Pages site for DJ Sprenk

d3 d3-visualization data-visualization dj music python

Last synced: 20 May 2026

https://github.com/mohamed-walied/customer-behavior-analysis-using-r

Customer Behavior Analysis project utilizing the "Groceries Market Basket Dataset" from Kaggle. The project employs a data-driven approach to uncover customer purchasing patterns and relationships within the grocery market using K-means Clustering and Association Rules using Apriori-Algorithm. In collaboration with some friends.

apriori-algorithm association-rule-learning dashboard data-cleaning data-visualization k-means-clustering r-programming-language

Last synced: 26 Jul 2025

https://github.com/williamd1k0/metacritic-games

Distribution of Metacritic scores for console games.

data-scraping data-visualization metacritic web-scraping

Last synced: 26 Jun 2025

https://github.com/c2r0b/2q

Manage data and relationships with AI

data-visualization graphql relationships rust tauri

Last synced: 09 Apr 2026

https://github.com/irishmorales/ph-poverty-statistics

An exploratory data analysis of Philippine poverty data. Data includes given 1991-2015 data, appended FIES 2018 & 2021 data, and 2024 & 2027 poverty estimates calculated using ARIMA.

data-visualization exploratory-data-analysis philippines poverty-alleviation

Last synced: 22 Mar 2025

https://github.com/firyanulrizky/ubud-souvenir-center-v1.0

Undergraduate Thesis Project (Mobile Apps Management Sales & E-Marketplace using Apriori Algorithm) Dedicated to Ubud Art Market

apriori-algorithm data-analytics data-mining data-visualization php web-application

Last synced: 27 Jun 2025

https://github.com/ranxi2001/predicting-mental-health-risk

数据分析案例-精神健康预测(数据来源kaggle)

data-analysis data-visualization eda

Last synced: 27 Jun 2025

https://github.com/samruddhi3012/rfm-analysis

Hi there! In this project I have performed Sales Analysis (RFM Analysis) using SQL and Tableau.

data-analysis data-visualization mssqlserver rfm-analysis segmentation tableau

Last synced: 27 Jun 2025

https://github.com/catalina2820/inteligencia-de-negocios

This repository contains materials and resources for the Business Intelligence course. It includes notes, workshops, and practical exercises that cover essential concepts and applications in data science, data visualization, machine learning, and big data.

bigdata data-cleaning data-science data-visualization web-scraping

Last synced: 04 Apr 2025

https://github.com/hannahgsimon/halmodeling2024

Developed code using the Hybrid Automata Library (HAL) to create a spatial agent-based model of radio-immune response to spatially fractionated radiotherapy. This project was in association with the Cleveland Clinic Lerner Research Institute, Jacob Scott Lab.

agent-based-model bifurcation-analysis cancer-models computational-biology data-visualization hybrid-automata immune-response mathematical-modelling ordinary-differential-equations radiation-therapy spatial-model statistics systems-biology

Last synced: 23 Nov 2025

https://github.com/anergictcell/esbmeplots

An extension of the D3.js library for fast and flexible generation of basic plot types

d3js data-visualization javascript plotting

Last synced: 13 Jun 2026

https://github.com/gappeah/solana-ml-forecast

This project uses machine learning, specifically an XGBoost regressor, to predict the price of Solana (SOL) based on historical data and engineered features.

cryptocurrency data-visualization machine-learning solana xgboost

Last synced: 25 Feb 2025

https://github.com/arosas17/mapping_earthquakes

Created a map to demonstrate the correlation between the tectonic plates and earthquakes. Circle were made on a map to indicate earthquakes, changing colors and size based on magnitude of the earthquake.

data-visualization javascript map

Last synced: 20 May 2026

https://github.com/vlad1343/data-visualisation

Python project showcasing interactive and static visualizations using Plotly and Matplotlib. It includes analysis of CSV, JSON, and API data, turning complex datasets into clear, insightful charts.

anova api csv-files data-analysis data-visualization json matplotlib matplotlib-pyplot pandas pandas-python plotly python3 seaborn seaborn-python

Last synced: 08 Apr 2026

https://github.com/faizantkhan/python_matplotlib

Matplotlib is a powerful Python library for creating visualizations and plots. It’s widely used for data representation, making complex information more accessible and interpretable. It offers various types of plots, including line graphs, scatter plots, bar charts, histograms, and more

data-analysis data-analytics data-engineering data-science data-visualization deep-learning graphs line machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot matplotlib-python python

Last synced: 20 May 2026

https://github.com/kmohamedalie/excel-data-visualization

Visualizing and publishing chats with excel

data-visualization excel github html

Last synced: 23 Jun 2026

https://github.com/vzamboulingame/data-portfolio

This repository showcases my projects in Python and SQL, highlighting my skills in data analysis & visualization.

data-analysis data-portfolio data-science data-science-portfolio data-science-projects data-visualization jupyter-notebook portfolio python sql

Last synced: 20 May 2026

https://github.com/traccyyyyy/employeehrwebapp

Modern web application built with Lit, featuring Web Components, real-time data visualization, responsive UI, and RESTful API integration.

api-rest data-visualization developer-tools frontend interactive-dashboard javascript lit real-time state-management ui-ux webapp webcomponents

Last synced: 20 May 2026

https://github.com/fvdavid/d3-in-action

Angular 19 Data Visualization D3js

angular d3-visualization d3js data-visualization typescript

Last synced: 08 May 2026

https://github.com/zhouzhuofei/juliadl

learning Julia, write some notebooks, like machine learning and data science, visualization.

data-science data-visualization julia mxnet

Last synced: 21 Apr 2026

https://github.com/samruddhi3012/health-care-analytics

Hi! This repo involves analyzing the Healthcare analytics using Advanced Microsoft Excel.

dashboard data-analysis data-visualization healthcare microsoft-excel pivot-chart pivot-tables vlookup

Last synced: 29 Mar 2025

https://github.com/maruf-hossen/kaggle-projects-and-learning

Comprehensive data science learning journey through Kaggle courses and exercises. Documenting progress in SQL, Python, ML, and data visualization with practical projects and business applications.

business-intelligence data-cleaning data-science data-visualization kaggle learning-journey machine-learning pandas python sql

Last synced: 05 May 2026

https://github.com/xmen3em/kaggle-competitions

This collection contains various projects and notebooks developed to tackle a range of Kaggle competitions, showcasing different machine learning techniques, data preprocessing methods, and model optimizations.

data data-science data-visualization deep-learning deployment ensemble-learning machine-learning-algorithms python streamlit

Last synced: 09 Apr 2026

https://github.com/alinababer/data-science-and-insight-agent-rag-llama3-lava-llm

Data-Science-and-Insight-Agent-RAG-LLama3-Lava-LLM-Django-WebApplication is an advanced AI-driven chatbot designed to assist in data science, document analysis, and image interpretation. This repository contain the Datascience Agent of this project.

artificial-neural-networks classifcation data-analysis data-engineering data-visualization datascience large-language-models llama2 lstm machine-learning python random-forest regression

Last synced: 01 Jan 2026

https://github.com/arya920/stockpriceforecasting

The project seamlessly melds diverse technologies, including Numpy, Seaborn, Matplotlib, Keras, and more, to seamlessly integrate data manipulation, visualization, and machine learning.

data-visualization keras-tensorflow lstm-neural-networks modelling neural-network stock-market stock-price-prediction streamlit-webapp webapp

Last synced: 26 Mar 2025

https://github.com/heshamoomar/power-bi

visualizing real data from a survey that people took about people's jobs and work fields using Power BI

data-visualization microsoft-power-bi

Last synced: 04 Feb 2026

https://github.com/htanh2003/datamate

DataMate là một công cụ phân tích dữ liệu thông minh, kết hợp sức mạnh của mô hình ngôn ngữ lớn (LLM) và giao diện trực quan, giúp người dùng dễ dàng tải lên tệp CSV, khám phá dữ liệu, và nhận các phân tích thông minh

agent data-visualization deployment docker docker-compose langchain nginx streamlit

Last synced: 08 Apr 2026

https://github.com/gui-sitton/timeseries-taxi

To attract more drivers during peak hours, we need to predict the amount of cab requests for the next hour. Build a model for this prediction.

data-science data-visualization machine-learning ml python time-series time-series-analysis time-series-prediction

Last synced: 20 May 2026

https://github.com/hemant-kumar786/heart-disease-prediction

Heart Disease Analysis project in RStudio using statistical methods and data visualization. Includes data cleaning, exploratory data analysis (EDA), correlation study, and insights on key health indicators influencing heart disease.

correlation-study data-analysis data-visualization eda healthcare heart-disease r rstudio statical-analysis

Last synced: 02 Nov 2025

https://github.com/mondial7/echarts-wc-import

Webcomponent to import Echarts library - current version echarts 3.8.5

data-visualization echarts3 polymer2 webcomponents

Last synced: 03 May 2026

https://github.com/errea/vet_clinic_database

For this project you need special preparation. As the goal of this project is to solve some performance issue, first we need to introduce those issues. In order to do that, you will populate your database with a significant number of data.

data data-analysis data-structures data-visualization database

Last synced: 21 May 2026

https://github.com/kashirin-alex/thither.direct-onamove

an android skeleton-example application for using data from Thither.Direct platform on mobile applications

android-application data data-analysis data-structures data-visualization mobile-development mobility query research-data-management

Last synced: 27 Apr 2026

https://github.com/tanishpoddar/logitrack

LogiTrack is a Python & Streamlit-powered inventory management system for real-time warehouse optimization. It offers multi-warehouse planning, interactive maps, and supply chain analytics, supporting global coordinates, CSV/SQL data, and customizable parameters.

data-visualization database inventory-management logistics optimization python streamlit supply-chain supply-chain-analytics warehouse-optimization

Last synced: 02 Nov 2025

https://github.com/andersoncrs/regularizacion_lasso_en_modelos_de_regresion_lineal

Este repositorio contiene un análisis detallado sobre la implementación de la regularización Lasso en modelos de regresión lineal para predecir el precio de vehículos. Se parte de un conjunto de datos limpio y se aplican diversas transformaciones y modelados para mejorar la precisión de las predicciones.

data-analysis data-science data-visualization jupyter-notebook linear-regression regularization-methods seaborn sklearn

Last synced: 16 May 2026

https://github.com/matheusbcmelo/primeirorelatoriopowerbi

Primeiro relatório desenvolvido em PowerBI no curso DIO - Python Data Analytics

business-intelligence data-visualization powerbi report

Last synced: 25 Jan 2026

https://github.com/spacebakery/make-a-line-chart-for-research

Data Visualization with Matplotlib | Matplotlib Fundamentals

data-visualization line-chart matplotlib python

Last synced: 22 Jul 2025

https://github.com/mvharsh/blinkit-sales-dashboard

An interactive Power BI dashboard visualizing Blinkit's sales performance across outlets, item types, and customer ratings for strategic insights.

blinkitdashboard data-analysis data-visualization powerbi

Last synced: 25 Jan 2026

https://github.com/cassiofb-dev/covid-grafico

Gráficos da COVID-19 (Mortes e Casos) com Chart.js.

chartjs covid-19 data-visualization

Last synced: 25 Jan 2026

https://github.com/as16082023/global-electronics-retailer

Analyzed Maven Electronics' performance data to identify factors driving revenue decline since 2020.

advanced-excel data-analysis data-visualization

Last synced: 03 Feb 2026

https://github.com/anonymo2239/big-data-churn-analyzer

Scalable customer churn prediction using PySpark. Includes EDA, feature engineering, modeling, and real-time inference on new data.

big-data churn-analysis churn-prediction classification-algorithm data-analysis data-science data-visualization modeling pyspark

Last synced: 21 May 2026

https://github.com/ivangrana/data-visualization

Data visualization repository made with Chart.Js, D3,Plotly and Rstudio

d3js data-visualization

Last synced: 20 Jul 2025

https://github.com/gui-sitton/bank-loans

In this project I will prepare a report for a bank's loan division. I find out whether a customer's marital status and number of children have an impact on loan default, as well as other factors

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 21 May 2026

https://github.com/eins51/restaurantanalytics

Comprehensive business analytics project using Python and Tableau. Features include data visualization, interactive dashboards, and data-driven insights for restaurant performance and consumer behavior.

business-analytics data-visualization python-dashboard restaurant-analysis tableau

Last synced: 27 Jun 2025

https://github.com/arekflo2002/analiza_danych-rstudio-_dyskryminacja_kobiet

Wykorzystując rstudio oraz zestawy dane ze strony https://www.gapminder.org/data/ badam tematykę dyskrminacjii kobiet na poszczególnych kontynentach i wyciągam odpowiednie wnioski

data data-preparation-and-analysis data-visualization rstudio statistics

Last synced: 14 Apr 2025

https://github.com/tapas-gope/pizza-sales

This project analyzes Pizza Sales Data to provide insights into customer preferences and sales performance. Key metrics include total revenue, orders, and average order value, with a breakdown by pizza category and size. The dashboard identifies peak sales periods and top-selling items, supporting data-driven business decisions.

business-intelligence dashboard data-analysis data-visualization dax powerbi sales-analysis

Last synced: 02 Jan 2026

https://github.com/driversti/formula1

Formula 1 companion dashboard built on the public live-timing archive — starting with pre-race tyre inventory, more race-weekend insights to follow.

data-visualization f1 formula-1 github-pages python react tailwindcss typescript vite

Last synced: 21 May 2026

https://github.com/shivani8136/bellabeat-smart-device-data-analysis

This project analyzes smart device fitness data to uncover insights into user behavior, engagement, and wellness patterns. Conducted for Bellabeat, a high-tech company specializing in health-focused smart products for women, this analysis supports strategic decisions around product development and feature prioritization.

data-analysis data-visualization r-programming-language

Last synced: 08 Feb 2026

https://github.com/muneeb706/nei_pm2.5_data_analysis

Exploratory Data Analysis of PM2.5 Emission records from EPA National Emission Inventory

data-visualization exploratory-data-analysis r-programming

Last synced: 27 Jun 2025

https://github.com/touradbaba/multi-page_dash_application

This repository contains a Multi-Page Dash Application designed to provide interactive visualizations of geo-spatial data, focusing on population and GDP. The app offers insights into demographic and economic trends through interactive maps and various types of charts. It is built with Python, using Plotly and Dash, and is deployed on Heroku.

dash dashboard data-analysis data-visualization exploratory-data-analysis heroku-deployment plotly pythonanywhere

Last synced: 27 Jul 2025

https://github.com/balajimohan18/power-bi-visualization-project

This repository contains Visualization Projects which is visualized through Power BI Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and by the insights we can reduce damages by accidents & calamities.

data-analysis data-cleaning data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint powerbi powerbi-visuals powerpoint-slides

Last synced: 08 Mar 2026

https://github.com/hayatiyrtgl/cryptocurrency_time_series_rnn

Python script for training a Simple RNN model on cryptocurrency price data to predict future prices, including data exploration and evaluation

data-analysis data-science data-visualization keras pandas pandas-python prediction predictive-modeling python python-script rnn rnn-tensorflow tensorflow time-series time-series-analysis

Last synced: 08 Apr 2026

https://github.com/gabboraron/plague-inc

Plague Inc: An Epidemic Forecast Concept and Data Visualization Tool. Previously accessible at http://20.234.177.167/. You are welcome to host it on your own server.

big-data data-mining data-science data-visualization epidemic-simulations hackathon

Last synced: 06 Apr 2025

https://github.com/saidulalimallick04/smart-traffic-violation-pattern-detector-dashboard

This project is a Streamlit web application designed to analyze traffic violation data. It provides a user-friendly interface to explore, visualize, and gain insights from traffic violation datasets. Users can upload their own data, perform analysis, and view summaries and trends.

dashboard data-analysis data-visualization internship-project pandas python smart-traffic streamlit

Last synced: 18 Apr 2026

https://github.com/gustavo-b-morales-s/footwear-market-data-pipeline

This project uses Python, Scrapy, SQLite3 and Streamlit to extract sports shoe data from Mercado Livre, perform transformations using Pandas, store the data in an SQLite database and create a data visualization interface emphasizing the main KPIs using Streamlit.

data-engineering data-visualization etl-pipeline webscraping

Last synced: 06 Apr 2025

https://github.com/faizantkhan/automated-eda

This repository showcases tools for automatic Exploratory Data Analysis (EDA) in Python. These tools help you quickly understand your datasets and generate insightful reports.

automatic automation autoviz data-analysis data-analysis-python data-science data-visualization dtale dtale-library eda exploratory-data-analysis ml pandas pandas-profiling python python-library sweetviz

Last synced: 18 Apr 2026

https://github.com/ancapitigoi/mushrooms-selection

In order to find which mushrooms are safe to eat, the decision tree data mining method is used.

classification-model data-cleaning data-exploration data-mining data-visualization decision-tree penalty-method r-programming

Last synced: 15 Jun 2026

https://github.com/simranshaikh20/credit-card-dashboard

A Data Visualization Project using Microsoft Power bi

data-analysis data-visualization powerbi

Last synced: 02 Jan 2026