Data visualization
Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.
- GitHub: https://github.com/topics/data-visualization
- Wikipedia: https://en.wikipedia.org/wiki/Data_visualization
- Created by: Charles Joseph Minard
- Aliases: dataviz,
- Last updated: 2026-06-28 00:07:45 UTC
- JSON Representation
https://github.com/pratanup/exploratory-data-analysis-eda-
Objective is to make this data ready for modeling by transforming the given data into clean data by doing EDA
data-analytics data-science data-visualization exploratory-data-analysis python
Last synced: 19 Apr 2026
https://github.com/vanheemstrasystems/flourish-headstart
Flourish - Headstart
data-visualization storytelling
Last synced: 06 Jan 2026
https://github.com/Lightning-Chart/lcjs-point-line-series-3d
JavaScript 3D charts real-time performance benchmark with LightningChart JS
3d benchmark chart charting charts data-visualization graphs javascript lcjs lightningchart-js line performance plot plotting points real-time rendering scatter streaming webgl
Last synced: 27 Dec 2025
https://github.com/josewebdev2000/space-mission-data-analysis
Exploring space mission data and creating graphs in base of it.
csv data-analysis data-science data-visualization matplotlib matplotlib-figures matplotlib-pyplot pandas pandas-dataframe python
Last synced: 30 Apr 2026
https://github.com/johannaschmidle/bookauthors
Explored a book sales database. Cleaned data using Excel and created an interactive dashboard to analyze author popularity, ratings, and sales trends. The project highlighted key insights such as sales performance and rating distributions [Excel]
author-sales book-sales books data-analysis data-visualization excel
Last synced: 04 Feb 2026
https://github.com/prekshivyas/datastreamingetl
Utilizing my background and love for Apache Airflow and Data to build a real-time data streaming pipeline
apache-airflow apache-kafka apache-spark apache-zookeeper cassandra data-engineering data-ingestion data-pipeline data-processing data-visualization docker docker-compose
Last synced: 20 Jan 2026
https://github.com/analysisbyvivek/road-accident
Analyzes road accident patterns, exploring factors like lighting, weather, speed limits, time of day, and road conditions to uncover trends in severity and frequency.
data-analysis data-visualization eda jupyter-notebook kaggle tableau-public
Last synced: 19 Jun 2026
https://github.com/rohan3122k/social-media-sentiment-analysis-of-finance-defence-and-healthcare-in-the-usa
This project provides a comprehensive, data-driven analysis of three critical sectors - Finance, Defense, and Healthcare , under the administrations of Donald Trump and Joe Biden.
api aws data-visualization datamining financial-analysis healthcare-application nytimes-api python reddit-api sentiment-analysis wordcloud-visualization
Last synced: 11 May 2026
https://github.com/syed-m-nofel/global-covid-insights-pipeline
Analyze global COVID-19 trends using Python and Streamlit — includes time-series breakdowns, vaccination stats, and country-specific insights.
analytics-dashboard correlation-analysis covid-dashboard covid19 data-science-projects data-visualization global-health health-informatics interactive-dashboard matplotlib open-data pandas pandemic-analysis public-health python seaborn streamlit time-series-analysis
Last synced: 28 Apr 2026
https://github.com/jdfoster11/northwest_territories_collision_factors
Using Python & Tableau to perform a statistical and regression analysis on a NorthWest Territories Vehicle Collision Dataset
clustering-algorithm co-lab data-science data-visualization heatmap-visualization html python3 tableau
Last synced: 15 Mar 2025
https://github.com/mithoon278/us-visa-approval-prediction-mlops-project
This project presents a ML based solution using ML Algorithm to predict which visa applications will be approved and thus recommend a suitable profile for applicants whose visa have a high chance of approval.
aws classification data-visualization ec2-instance exploratory-data-analysis feature-engineering hyperparameter-tuning machine-learning random-forest-classifier s3-bucket
Last synced: 11 Apr 2026
https://github.com/shellynagar27/mobile-sales-analysis
Analyzed 2024 mobile sales data to uncover product trends, customer behavior, and regional insights using Power BI dashboards and structured data modeling.
cleaning-data data-analysis data-visualization dax eda figma modelling powerbi powerquery storytelling wireframe
Last synced: 16 May 2025
https://github.com/abhi227070/ipl-2024-sold-player-data-analysis
This project analyzes IPL 2024 auctioned players' data, including name, team, cricket type, nationality, and price. Users input a player's name to access team, style, nationality, and auction price, aiding research and fantasy leagues. It offers insights into player dynamics, serving cricket enthusiasts with comprehensive data exploration.
data-analysis data-visualization dataanalytics machine-learning machine-learning-algorithms python3
Last synced: 30 Apr 2026
https://github.com/stopyransky/wdvp
World Government Data Visualisation Prize - submitted work
d3 d3js data-visualization dataviz react svg
Last synced: 18 May 2026
https://github.com/debjyotisaha/power-bi-projects-phase-1
Portfolio projects related to data visualisation in Power BI
data-analysis data-visualization dax-expression powerbi powerquery
Last synced: 18 Jan 2026
https://github.com/patricialjohnson/sql-database-project
SQL Music Database Store
business-analytics chinook-database data-analytics data-visualization database kaggle kaggle-dataset microsoft-excel microsoft-sql-server sql sqlite
Last synced: 12 Mar 2025
https://github.com/anurag-kumar-molankala/sales-performance-dashboard
A Power BI dashboard that analyzes sales trends, product performance, customer segmentation, and payment distribution. It uses DAX, time intelligence, and interactive visuals for data-driven insights. The model includes Sales, Product, and Customer tables for in-depth analysis.
dashboards data-analysis data-visualization dax dax-functions dax-measures dax-query etl-process powerbi powerbi-visuals powerquery sql-query sql-server
Last synced: 03 Apr 2025
https://github.com/thenorthkun/movies-dataset-analysis
Analysis & categorizing of Movies based on Actors, Genres, Gross covered etc 🦸🏼🧜🏼♀️🎧
data-analysis data-visualization filtering
Last synced: 23 Mar 2025
https://github.com/sayamalt/steel-energy-consumption-prediction-using-pyspark
Successfully established a machine learning model using PySpark which can precisely predict the energy consumption of the steel industry, up to an r2 score of approximately 99.5%.
apache-spark big-data-analytics big-data-processing cross-validation data-visualization exploratory-data-analysis hyperparameter-tuning machine-learning model-training-and-evaluation python regression spark sql
Last synced: 10 Mar 2026
https://github.com/vatshayan/ip-address-data-analysis-
Extraction of 100's of IP Address and using Machine Learning algorithm for detecting threats
data data-analysis data-science data-visualization dataset ip ipconfig ipv4-address jupyter-notebook machine-learning machine-learning-algorithms supervised-machine-learning unsupervised-learning
Last synced: 15 Jul 2025
https://github.com/josephbarbierdarnal/matoolkit
matoolkit is a python package containing a toolbox for creating visually appealing graphs/annotations in matplotlib
data-analysis data-visualization matplotlib
Last synced: 31 Mar 2025
https://github.com/jofaval/game-of-thrones
Data Analysis and Predictions of the Game of Thrones' character's survivance from 2016
classification data-analysis data-science data-visualization deep-learning game-of-thrones google-colab kaggle keras machine-learning matplotlib python scikit-learn seaborn tensorflow xgboost
Last synced: 11 Apr 2026
https://github.com/mituskillologies/ds-aug25
Programs of Data Science batch @ MITU Skillologies, August 2025
classification clustering data-analytics data-science data-visualization machine-learning mysql natural-language-processing powerbi python-programming regression supervised-learning unsupervised-learning
Last synced: 18 May 2026
https://github.com/eslamdyab21/data-visualization-using-matplotlib-and-seaborn
This is the last project in the nanodegree udacity program. it's about data visualization.
data data-analysis data-visualization matplotlib pandas python seaborn udacity udacity-data-analyst-nanodegree
Last synced: 09 May 2026
https://github.com/gitchaell/computer-scrapping
Tool that extracts data from the pages of companies that sell computers in the city of Trujillo - Peru, exports them in an XLSX file according to a relational data model, and displays them on a Power BI dashboard.
data-analysis data-structures data-visualization database dbdiagram export-excel powerbi scrapper-script scrapping xlsx
Last synced: 01 May 2026
https://github.com/rodrigojunqueiradev/2025-python-data-analysis-and-visualization-masterclass
2024 Python Data Analysis & Visualization Masterclass
data-analysis data-science data-structures data-visualization pandas python python-3 python3 seaborn
Last synced: 10 May 2026
https://github.com/prgermux/image-scrapper
"Image Scrapper" is a Python application that recursively scrapes images from directories and displays them on an interactive, zoomable, and scrollable canvas. Ideal for organizing and navigating large image datasets.
data-visualization desktop-application file-explorer graphics-view gui-tool image-organization image-processing image-scraper image-viewer interactive-visualization pyqt5 python recursive-directory zoom-and-pan
Last synced: 24 Mar 2025
https://github.com/farhannirzhor/vrinda_store_excel_project
This project is about excel analysis and visualization. In this project, I analyzed Vrinda Store's sales and made an annual sales report
data-analysis data-cleaning data-preprocessing data-visualization microsoft-excel reporting
Last synced: 05 Jan 2026
https://github.com/sayamalt/house-price-prediction
Successfully created a regression model for predicting the price of any house, excluding enormous real estates and mansions, to a significant level of accuracy.
data-visualization exploratory-data-analysis feature-engineering feature-selection machine-learning regression-analysis regression-testing
Last synced: 09 Nov 2025
https://github.com/amirdora/covid19_lockdown_policies_germany
Python visualisation - Covid19 lockdown policy effects and new cases in germany. Using "Oxford policy tracker" and "Coronavirus Source Data - Our World in Data" data.
data-science data-visualization
Last synced: 13 Mar 2025
https://github.com/praths71018/kaggle_datasets
All projects that I did in kaggle
data-analytics data-science data-visualization kaggle kaggle-dataset machine-learning
Last synced: 01 May 2026
https://github.com/mattbixley/tidy_tuesday
A home for some #tidytuesday code, plots table and general upskilling.
data-science data-visualization ggplot r4ds tidytuesday tidyverse
Last synced: 15 Feb 2026
https://github.com/sayamalt/twitter-sentiment-analysis
Successfully established a machine learning model which can accurately classify the sentiment of any particular tweet into either positive, negative or neutral category.
data-visualization exploratory-data-analysis nlp sentiment-analysis supervised-learning text-processing
Last synced: 09 Nov 2025
https://github.com/leosimoes/datascienceacademy-powerbi-clinicadebi
Atividades do curso Análise de Dados com Microsoft Power BI e Clínica de BI da Data Science Academy.
dashboards data-analysis data-visualization microsoft-power-bi power-bi
Last synced: 05 Jan 2026
https://github.com/datastalker/survival-cox
This repository contains an R script for performing survival analysis on breast cancer surgery data from the University of Chicago's Billings Hospital. The analysis includes Kaplan-Meier estimation and Cox Proportional Hazards modeling to assess patient survival.
breast-cancer-prediction cox-model data-analysis data-science data-visualization epidemiology kaplan-meier r survival-analysis
Last synced: 02 Apr 2025
https://github.com/falakrana/data-analysis-visualization
This repository showcases data analysis and visualization projects using Python and Tableau. It includes exploratory data analysis, interactive dashboards, and insightful visual stories derived from real-world datasets.
data-analysis data-visualization python tableau-public
Last synced: 01 May 2026
https://github.com/teragrep/dpf_02
Teragrep Result Aggregation for Apache Spark
aggregation data-aggregation data-science data-summarization data-summary data-visualisation data-visualization teragrep
Last synced: 10 Jan 2026
https://github.com/aniruddha-biswas/jpmorgan-chase-excel-internship
JPMorgan Chase & Co.'s Excel Skills on Forage Virtual Internship
conditional-formatting data-analysis data-cleaning data-visualization excel excel-dashboard macos pivot-tables power-query shortcuts storytelling vba-excel
Last synced: 01 Apr 2025
https://github.com/jiyanshgarg/delhivery-logistics-data-analysis
This project analyzes Delhivery's logistics delivery dataset to understand delivery performance, route efficiency, and operational patterns using data analytics techniques. The analysis focuses on transforming raw segment-level logistics data into meaningful trip-level insights that can help improve delivery efficiency and route planning.
business-insights-and-recommendations data-analysis data-cleaning-and-preprocessing data-visualization exploratory-data-analysis feature-engineering feature-extraction feature-selection hypothesis-testing outlier-detection outlier-treatment
Last synced: 12 Jun 2026
https://github.com/sayamalt/employee-attrition-prediction
Successfully established a machine learning model which can accurately predict whether an employee of a given company will leave it in the impending future or not, based on several employee details and employment metrics.
binary-classification continuous-deployment continuous-integration cross-validation data-exploration-and-preprocessing data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-training-and-evaluation
Last synced: 08 Oct 2025
https://github.com/sayamalt/credit-card-approval-prediction
Successfully developed a machine learning model which can accurately predict up to 100% accuracy whether a credit card application of a given applicant would be approved or not, based on several demographic features such as applicant age, total income, marital status, total years of work experience, etc.
binary-classification cicd-deployment cross-validation data-exploration-and-preprocessing data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-retraining model-selection model-testing model-training-and-evaluation
Last synced: 09 Nov 2025
https://github.com/living-with-machines/machines-interactive
This is the “machines interactive” for the Living with Machines exhibit at Leeds City Museum 2022–23.
data-visualization history-of-technology industrial-revolution machines museum museum-experience museum-installation
Last synced: 20 Jan 2026
https://github.com/githubsolver123/bus-tracker
Real-time bus tracking simulation built with R Shiny and Google Maps API. Visualizes bus movement along Broadway in NYC with 2-second position updates.
data-visualization geospatial gis google-maps-api r r-shiny real-time shiny simulation transportation web-application
Last synced: 01 Apr 2025
https://github.com/beastienerd/dataquest_guided_projects
Portfolio for Guided Projects
data-analysis data-visualization guided-project r
Last synced: 25 Jun 2025
https://github.com/victorolea/react-dashboard
Dashboard
dashboard data-visualization echarts netlify react vite
Last synced: 12 Apr 2026
https://github.com/shivasairam1706/mlops-project1
End-to-end ML-Ops project using PySpark and AWS, covering environment setup, model training, deployment with data capture, execution, and analysis. CI/CD pipelines (AWS CodePipeline) and monitoring (CloudWatch) ensure automated deployment, performance tracking, and model retraining for production-ready ML solutions.
aws aws-lambda aws-s3 data-engineering data-science data-visualization delta-lake docker forcasting mlops-project pyspark unix-shell
Last synced: 20 May 2026
https://github.com/sayamalt/taxi-trip-fare-prediction
Successfully created a machine learning model which can accurately predict the fare of a taxi trip based on several features such as trip duration, tip amount, etc.
cross-validation data-exploration-and-preprocessing data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-selection model-training-and-evaluation regression-modelling
Last synced: 09 Nov 2025
https://github.com/jbalooshie/pyber_analysis
Analysis of ride share data using Matplotlib and pandas, executed in Jupyter Notebook. Breakdowns are provided based on the city size, average fare, and number of rides taken.
data-analysis data-science data-visualization jupyter-notebook matplotlib pandas python
Last synced: 12 May 2026
https://github.com/h-sutiwas/r2de-2025
This repository is related to the Road To Data Engineer Bootcamp by DataTH. It contains all related coursework, some mini projects and other resources within the field of Data Engineering.
data data-engineering data-visualization docker gcp pipeline spark
Last synced: 30 Apr 2026
https://github.com/jfaccioli/citi-bike-tableau
A data analysis of Citi Bike users in Jersey City using Tableau
data-analysis data-visualization tableau tableau-public
Last synced: 26 Jan 2026
https://github.com/gunjanmimo/d3-visualization
D3.js is a JavaScript library for producing dynamic, interactive data visualizations in web browsers. It makes use of Scalable Vector Graphics, HTML5, and Cascading Style Sheets standards. It is the successor to the earlier Protovis framework
d3js data data-science data-visualization reactjs
Last synced: 29 Apr 2026
https://github.com/alfiyafatima09/heuristic_algorithms
This project compares pathfinding algorithms (A*, Greedy Best-First, and Hill Climbing) by visualizing their paths and comparing performance metrics (nodes explored, memory, execution time) on a grid with obstacles.
Last synced: 20 Jan 2026
https://github.com/apelullo/cobalt_health_wellness_platform_ops
Cobalt is a mental health and wellness platform created for Penn Medicine employees that serves as a hub for support services such as therapy, wellness coaching, topic- and population-specific group sessions, and a variety of self-help resources.
academic-research data-cleaning-pipeline data-validation data-visualization decision-support feature-development healthcare-data hipaa key-performance-metrics mental-health-services operations-research product-analytics reporting-pipeline
Last synced: 23 Mar 2025
https://github.com/gautam25raj/data-sync
A powerful platform designed to revolutionize the way teams collaborate and visualize data.
chat collaboration data-visualization express material-tailwind mongodb mongoose nextjs nodejs reactjs redux redux-toolkit tableau tableau-dashboard tailwindcss
Last synced: 11 Apr 2026
https://github.com/memudualimatou/data-visualization-of-wasabi-dataset-using-r
data-visualization r-shiny wasabi
Last synced: 15 Mar 2025
https://github.com/jabonsote/financial-anomaly-detection-with-deepseek-and-isolation-forest
🚀 Financial Anomaly Detection with DeepSeek and Isolation Forest – A powerful, locally-run tool for detecting financial anomalies using Isolation Forest and DeepSeek LLM. Features AI-powered insights, interactive time-series visualization, and automated PDF audit reports. 🔍📊
anomaly-detection chatbot data-visualization deepseek financial-analysis financial-data isolation-forest llm machienlearning ollama report-generator streamlit
Last synced: 12 Apr 2026
https://github.com/syarwinaaa09/hypothesis-testing-with-mens-and-womens-soccer-matches
a data-driven exploration of international men's and women's football (soccer) match results using Python
data-analysis data-visualization football jupyter-notebook men-vs-women pandas python soccer sports-analytics visualization
Last synced: 05 May 2026
https://github.com/pat8901/diskanalyzer-cli
Processes a pdf file holding storage utilization data to automatically create graph visualizations revealing the true demographics hidden in large data.
data-visualization graphs-generation matplotlib
Last synced: 27 Dec 2025
https://github.com/ekenes/elections-timeline
Data visualization showing the results of the previous 5 U.S. presidential elections in a single map.
arcgis-js-api data-visualization elections gis mapping
Last synced: 24 Mar 2025
https://github.com/sanjana-bongale/walmart_retail_data_visualization_using_powerbi
Interactive Power BI dashboard analyzing Walmart sales data. Covers sales trends, customer insights, and branch performance using charts, KPIs, and filters for age, gender, year, and category. Includes a presentation for business storytelling and insights.
customer-analysis-for-retail dashboard data-storytelling data-visualization powerbi sales-analysis
Last synced: 04 Feb 2026
https://github.com/hanzopgp/lolanalysis
League Of Legends game data engineering, analysis, visualization and machine learning. Business intelligence project.
data-analysis data-cleaning data-engineering data-visualization dataiku deep-learning etl machine-learning scraping university
Last synced: 27 May 2026
https://github.com/franloza/contratosdemadrid
This project is an interactive web application for exploring and analyzing public contracts in the Community of Madrid. It allows users to search for companies and view their contract details, aiming to promote transparency and facilitate access to public information.
data-visualization duckdb evidence open-data
Last synced: 23 Jun 2026
https://github.com/codeofrahul/python_amazon_sales_analysis
In this repository, I have saved my Python_Amazon_sales_analysis Notebook. To do this Amazon_sales_analysis, I have done end to end process. cleaned the dataset, Did EDA, ploted graph and reached to the conclusion.
amazon analysis data-visualization eda exploratory-data-analysis matplotlib pandas-library python seaborn
Last synced: 01 May 2026
https://github.com/madhurimarawat/agile-sprint-and-iris-data-explorer
Streamlit app that combines agile sprint planning with data visualization of the Iris dataset. It helps analyze task distribution and explore key data insights interactively.
agile-codes agile-development agile-methodologies agile-metrics agile-planning codes complete-agile-explained data-visualization deployment documentation github-deployment iris-dataset output python readme software-engineering sprint sprint-planning streamlit-deployment streamlit-webapp
Last synced: 14 Aug 2025
https://github.com/kevinandersontech/ecommerce_dashboard_streamlit
A Streamlit dashboard that reads daily revenue metrics from the data pipeline. Provides date filters, summary KPIs, line charts, and a table to explore revenue over time across different statuses (e.g. paid, refunded, failed).
charts dashboard data-visualization duckdb filters metrics python streamlit
Last synced: 01 May 2026
https://github.com/archanakokate/bank_term_deposit_prediction
Build a Decision Tree classifier to predict if the client will subscribe to a Term Deposit based on their demographic and behavioral data.
data-analysis data-visualization exploratory-data-analysis machine-learning
Last synced: 14 Sep 2025
https://github.com/davidchocholaty/bithack_hackathon_2024
This repository contains my personal code tasks for the BIT_Hack hackathon, created in 2024.
data-mining data-science data-visualization exploratory-data-analysis hackaton hackaton-project machine-learning
Last synced: 06 May 2026
https://github.com/carmendev/covid-19-tracker
Data visualization React.js project deployed with Firebase. Daily statistics about current, recovered and closed cases coming from an API.
data-visualization firebase numeral reactjs
Last synced: 11 Apr 2026
https://github.com/karo23361/toy-store-kpi-power-bi
PowerBI Portfolio Project
csv data data-visualization powerbi
Last synced: 03 Feb 2026
https://github.com/mahmoudnamnam/superstore-analysis
This project explores the SuperStore dataset to uncover insights into sales, profit, and customer behavior. It identifies key trends, regional variations, and product performance, using data analysis and machine learning techniques to guide business strategy and optimize performance.
clustering data-analysis data-science data-visualization geopandas jupyter-notebook machine-learning numpy pandas plotly regression seaborn sklearn
Last synced: 12 Apr 2026
https://github.com/vidyadnina/cyclistic-sql-tableau-project
Trip data analysis for a bike-sharing service company using SQL and Tableau.
bigquery dashboard data-analysis data-analytics-sql data-cleaning data-visualization sql
Last synced: 02 Jan 2026
https://github.com/abraham-ny/github-activity-visualizer
Visualize github activity with graph js
commit-visualization data-visualisation data-visualization free-website github-activity github-activity-graph graph-js graphjs html-css-javascript html5 html5-canvas web-development
Last synced: 05 Jan 2026
https://github.com/acdh-oeaw/visartist
Visual Artwork Analysis and Collection Tool
color-clustering color-space data-visualization visual-analysis
Last synced: 13 Jul 2025
https://github.com/sakan811/honkai-star-rail-a-few-fun-insights-with-data-analysis
The project gives insights that delve into the Honkai Star Rail's character's stats of all available characters as of the given date.
data data-analysis data-science data-visualization docker flask game honkai honkai-star-rail honkai-starrail seaborn webscraping webscraping-data webscraping-selenium
Last synced: 10 Jun 2026
https://github.com/chokzb/covid19_vaccination_analysis
An EDA project examining global COVID-19 vaccination progress. The notebook investigates vaccination trends by country, daily vaccination rates, timeline patterns, and dose distribution. The project includes visualisations created with Matplotlib, Seaborn, and Plotly.
covid-19 data-analysis data-visualization jupyter-notebook matplotlib numpy pandas plotly python seaborn vaccination
Last synced: 07 May 2026
https://github.com/pekiiipy/credit-card-fraud-detection
🔍 Detect credit card fraud efficiently using advanced machine learning techniques, achieving high accuracy rates on a large dataset of transactions.
adasyn anomaly-detection class-imbalance credit-card-fraud data-visualization fraud fraud-detection frauddetection kaggle keras logistic-regression plotly-python postgresql random-forest scikit-learn tensorflow tree-model xgboost
Last synced: 11 Apr 2026
https://github.com/yaser-123/energy-consumption-dashboard
A Power BI dashboard to analyze energy consumption for water, gas, and electricity across cities and buildings. Features include interactive charts, drill-down insights, and dynamic filters for easy monitoring and optimization.
dashboard data-analysis data-analytics data-visualization energy-consumption energy-efficiency powerbi
Last synced: 05 Jan 2026
https://github.com/jleung51/visualizations
Javascript & D3.js visualizations of data.
d3js data-visualization javascript
Last synced: 27 Mar 2025
https://github.com/codesaadumair/data-science-monorepo
Comprehensive Data Science monorepo featuring EDA, Machine Learning, Preprocessing, Feature Engineering, and Visualization projects with Jupyter notebooks and Python.
data-analysis data-science data-science-projects data-visualization eda jupyter-notebook jupyterlab machine-learning python
Last synced: 01 May 2026
https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle
Independent Project - Kaggle Competition -- I worked on the obesity classification data set as part of a Kaggle Competition of the same name, scoring (for accuracy) above 0.9
classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml
Last synced: 11 Apr 2026
https://github.com/anandvai/ai_rag_chatbot_multi_pdf_support
RAG (Retrieval-Augmented Generation) Chatbot built with Streamlit and LangChain, powered by Groq's blazing-fast LLaMA3-8B. It allows you to upload multiple PDFs, ask questions, and get precise, context-aware answers in a conversational format.
ai data data-science data-visualization data-visualizations dataengineering fastapi langchain langgraph python sql streamlit
Last synced: 01 May 2026
https://github.com/samjoesilvano/adventureworks_sales_performance_dashboard
Createed an interactive 4-page dashboard for AdventureWorks that visualizes key sales metrics—including revenue, profit, orders, and return rates—across 2020 to 2022. Featuring dynamic geographic analysis and detailed customer insights, this dashboard empowers data-driven decision-making and enhances business performance.
business-intelligence data-analysis-python data-analytics data-driven-decisions data-modeling data-visualization geographic-analysis interactive-dashboards kpi-metrics powerbi sales-performance-analysis
Last synced: 05 Jan 2026
https://github.com/bartosz-ziolkowski/bartosz-ziolkowski.github.io
Blog with short data stories of various data sets
data-science data-visualization jupyter-notebook numpy pandas python
Last synced: 11 Apr 2026
https://github.com/priyanshu7639/data_visualization_dashboard
An Interactive data visualization tool that combines traditional plotting capabilities with modern AI assistance. It allows users to create and modify visualizations through natural language commands, making data exploration accessible to users of all skill levels.
business-analytics data-analysis data-engineering data-exploration data-science data-visualization datapreprocessing datascience interactive-visualizations matplotlib plotly plotting python research-tool streamlit
Last synced: 12 May 2026
https://github.com/akhdandann/itutilizationdashboard-powerbi
Interactive Power BI dashboard for monitoring IT utilization, application uptime, and infrastructure performance at PT PLN (2014-2018).
business-intelligence dashboard data-visualization power-bi reporting
Last synced: 26 Jan 2026
https://github.com/vedantshi/coffee-sales-dashboard
This project analyzes coffee sales data using Excel, featuring data cleaning, trend analysis, and an interactive dashboard. Key insights highlight top-performing products, regional sales trends, and seasonal patterns. Recommendations focus on marketing strategies and inventory optimization. Future plans include Power BI integration for visuals.
business-insights data-analysis data-visualization excel-dashboard pivot-tables sales-trends
Last synced: 05 Jan 2026
https://github.com/tashi-2004/apache-spark-geospatial-air-quality-analysis
This project analyzes air quality data across regions to identify improvement areas, track trends, and classify similar regions using clustering. Leveraging PySpark, it processes sensor data, calculates Air Quality Index (AQI), and visualizes results with histograms and geographic maps to highlight areas with good air quality.
aqi aqi-prediction clustering data-science data-visualization geospatial-visualization kmeans-clustering predictive-modeling sensor-data time-series-analysis
Last synced: 25 Mar 2025
https://github.com/audreyadora/r_data_analytics
RStudio Data Analytics Learning Journal
data-science data-visualization r-studio
Last synced: 04 Feb 2026
https://github.com/aninditaws/investly
Investly: A personal finance platform for young investors, offering tailored portfolio recommendations by integrating user risk profiles, real-time market data, and optimization algorithms.
api-integration data-visualization goal-based-allocation react-frontend supabase-backend
Last synced: 01 Apr 2025
https://github.com/khushi-sabarad/adinsights_dashboard
AdInsights Dashboard: An interactive web dashboard built with Python (Flask, Pandas, Plotly) to visualize and analyze digital advertising performance. Allows filtering by gender, ad type, and location for detailed insights
ad-performance advertising dashboard data-analysis data-visualization flask pandas plotly python web-application
Last synced: 01 May 2026
https://github.com/adithivs/prodigy_ds_01
data-science data-visualization python
Last synced: 17 May 2026
https://github.com/abdelrahmanbayoumi/titanic-machine-learning-from-disasters
Knowing from a training set of samples listing passengers who survived or did not survive the Titanic disaster, can our model determine based on a given test dataset not containing the survival information, if these passengers in the test dataset survived or not.
data-analysis data-science data-visualization machine-learning pandas
Last synced: 09 Apr 2025
https://github.com/mcommer/emtools
A toolbox for geophysical EM-simulation data- and model-file processing, analysis, plotting, and other gimmicks
data-visualization electromagnetics geophysics plotting-scripts shell-scripts
Last synced: 30 Jun 2025
https://github.com/siddharthbadal/kpmgdataanalysisproject
Data Analytics Consulting Virtual Internship
data-analysis data-cleaning data-visualization googlestudio msexcel powerpoint
Last synced: 05 Jan 2026
https://github.com/rafath0ssain/predihome
Data analysis using economic factors affecting living conditions across Canadian provinces.
data-analysis data-visualization dplyr ggplot2 graph kaggle linear-regression prediction-model r shiny tidyr
Last synced: 01 May 2026
https://github.com/haonamnguyen/costumer-shopping-trends-analysis
This project analyzes a synthetic dataset of customer shopping behavior to see key trends and insights. Using SQL and Tableau, the analysis focuses on customer demographics, purchase patterns, and preferences, including age distribution, payment methods, shipping types, and top product categories.
data-analysis data-visualization sql tableau
Last synced: 05 Jan 2026
https://github.com/csoren66/customer-personality-analysis
Predict how different customer segments will respond for a particular product or service.
data-analysis data-visualization python
Last synced: 03 Mar 2025
https://github.com/asuquoaa/bar_chart_visualization_with_confidence_intervals_and_interactive_slider
This project visualizes probabilistic data using bar charts with 95% confidence intervals, allowing users to explore deviations from a Value of Interest (V of I) interactively.
data-visualization interactive-visualizations statistics
Last synced: 01 Sep 2025
https://github.com/jatin-s16/hr_mysql_powerbi
This repository contains raw HR data along with key business questions. I performed data cleaning using MySQL queries and wrote analytical queries to extract meaningful insights. The results were then visualised using Power BI to enhance business understanding.
data-analysis data-science data-visualization mysql powerbi
Last synced: 29 May 2026