Data visualization
Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.
- GitHub: https://github.com/topics/data-visualization
- Wikipedia: https://en.wikipedia.org/wiki/Data_visualization
- Created by: Charles Joseph Minard
- Aliases: dataviz,
- Last updated: 2026-07-02 00:07:43 UTC
- JSON Representation
https://github.com/natanast/tidytuesday_python
This repository contains my submissions for the TidyTuesday Python Challenge.
data-science data-visualization posit python quarto tidytuesday
Last synced: 07 Jun 2026
https://github.com/imshakil/machinelearning
Learning machine-learning algorithms, applications, completed projects, completed courses from different online course academy.
coursera data-analyst data-science-notebook data-visualization machine-learning-coursera machinelearning mathematics projects python udemy
Last synced: 28 Apr 2026
https://github.com/simranshaikh20/diwali-sales-analysis-for-business-insights
A data analyst project on diwali sales . In this state according state , gender, age we are able to know how much sale it done.
data-analysis data-visualization python
Last synced: 28 Apr 2026
https://github.com/amastaneh/network-visualization-insights
Network Visualization Insights presents network data seamlessly, offering clear visuals through charts, graphs, and tables. Dive deep into key metrics across diverse locations.
d3js data-analytics data-processing data-science data-visualization data-visualizations dataviz react-chartjs-2 react-simple-maps
Last synced: 08 Jun 2026
https://github.com/bhaveshbhakta/parkinson-disease-prediction
Note* The hosted website link might take some time to load. Please be patient while the application initializes.
data-visualization flask health-prediction machine-learning parkinson-disease prediction web-development
Last synced: 28 Apr 2026
https://github.com/moha-cm/airbnb-data-analysis
Airbnb Data retrival from MongoDb and Analying the Data
dashboard-application data-preprocessing data-visualization eda mongodb nosql-database plotly python python-script python-scripting streamlit-application
Last synced: 08 Jun 2026
https://github.com/leotrja/my-book-hands-on-machine-learning-with-scikit-learn-keras-and-tensorflow
📘 Explore the digital translation of "Practical Machine Learning" covering machine learning, deep learning, and neural networks in Persian.
computer-vision data-visualization deep-learning keras keras-tensorflow machi machine-learning neural-networks nlp num panda python reinforcement-learning sci tensorflow2
Last synced: 28 Apr 2026
https://github.com/hayatiyrtgl/arima_linearregression_xgboost_time_series_analysis
This Python script conducts various data processing, visualization, and modeling tasks on a dataset.
arima arima-forecasting arima-model data-analysis data-visualization linear-models linear-regression machine-learning machine-learning-algorithms pandas python xgboost xgboost-regression
Last synced: 28 Apr 2026
https://github.com/abdul-wahab-318/black-friday-eda-prediction
EDA and model training on Black Friday dataset
data-analysis data-visualization eda machine-learning sklearn
Last synced: 28 Apr 2026
https://github.com/sawaira-iqbal/used-cars-price-prediction-ml-project
🚗 The Used Car Price Prediction project uses advanced ML models like Random Forest 🌲, Decision Tree 🌳, XGBoost 🚀, and SVR 🔍 to predict used car prices, enhancing buying and selling decisions.
data-visualization decision-tree machine-learning price-prediction python random-forest-regressor support-vector-machine xgboost
Last synced: 28 Apr 2026
https://github.com/malbiruk/salesflow-data-pipeline
End-to-end data engineering pipeline using Azure Blob, Data Factory, dbt, Snowflake, and Streamlit for interactive business analytics. (WIP)
azure-data-factory cloud-data-engineering data-visualization dbt etl snowflake streamlit
Last synced: 08 Jun 2026
https://github.com/ezrahsieh/narrativevisualization
This project is an interactive narrative visualization designed to illustrate the impact of the COVID-19 pandemic on global life expectancy. The visualization is implemented using D3.js and follows the Martini glass narrative structure. This serves as the final project for CS416 at UIUC.
d3 data-visualization interactive-visualizations javascript narrative-visualization
Last synced: 28 Apr 2026
https://github.com/codegeekr/test_datasciencestarter
test Data Science Starter
analytics data data-science data-visualization machine-learning python science starter-kit statistics test
Last synced: 28 Apr 2026
https://github.com/dariush-hassani/pfd-charts
A lightweight, animated and customizable charting library for building Primary Flight Display (PFD) using modular D3.js.
d3js data-visualization drone gcs pfd
Last synced: 08 Jun 2026
https://github.com/neyhere07/music_popularity_prediction
Music popularity prediction involves building machine learning models to estimate the popularity of tracks based on their audio features.
data-science data-visualization eda jupyter-notebook machine-learning python
Last synced: 29 Apr 2026
https://github.com/mauriciovazquezm/data_visualization_course_project
This project implements an interactive data visualization dashboard using R and Shiny. It leverages World Bank development indicators to explore key economic, social, and demographic metrics over time across countries and regions. The web app enables users to select specific indicators, filter by countries or years, and visualize trends through dyn
data-science data-visualization ggplot2 r-programming shiny web-app
Last synced: 29 Apr 2026
https://github.com/salvof88/raspberry-sensor-kit-demo
A lightweight Raspberry Pi sensor logger in Python for HC-SR04 (ultrasound) and DHT11 (temperature/humidity), exporting data to CSV or Google Sheets. Perfect for IoT experiments, smart home logging, or Raspberry Pi Zero DIY kits.
automation csv-logger data-logging data-visualization dht11 google-sheets gpio hc-sr04 hc-sr04-ultrasonic-sensor iot python python-sensors python3 raspberry-pi raspberry-pi-3 raspberry-pi-4 raspberry-pi-gpio raspberry-pi-zero sensor-data
Last synced: 29 Apr 2026
https://github.com/kasraskari/learn-r-codes
A learning repository for R programming, covering data manipulation, visualization, and statistical analysis. (Work in progress!) 🚧
data-analysis data-analysis-r data-visualization r r-examples r-graphics r-statistics statistics
Last synced: 08 Jun 2026
https://github.com/flashdroid15/food_safety_security_dtp3
Modelling food safety and food security using Python
data-processing data-visualization matplotlib multiple-linear-regression numpy pandas python seaborn
Last synced: 29 Apr 2026
https://github.com/vyasdeepti/food-classification-using-yolo11
Objective: To classify food images in different food categories using pre-trained YOLO11 model
classification-model data-visualization deep-learning deep-neural-networks image-classification image-processing jupyter-notebook python-3 ultralytics virtual-environment yolo11 yolo11s
Last synced: 29 Apr 2026
https://github.com/ronnienigash/exploring-visualization
Playing with D3, Python, and SQLite3 to create dynamic visualizations of interesting data
d3 data-visualization html python sqlite3 visualization
Last synced: 29 Apr 2026
https://github.com/fatihilhan42/starbucks_analysis_turkey_and_world_with_python
In this project, firstly the brands for coffee in the world and then these brands in Turkey were examined. The data from the dataset, which you can find in the repo, was first organized using data cleaning algorithms. These cleaned data were then graphically extracted using data visualization algorithms.
data-analysis data-cleaning data-science data-visualization jupyter-notebook python
Last synced: 29 Apr 2026
https://github.com/mfakhriazhar/python-data-analyst-tutorial
A collection of My Python learning files for Data Analyst purposes. Covers fundamental to advanced topics such as data exploration, visualization, statistical analysis, and the use of popular libraries like Pandas, NumPy, Matplotlib, and Seaborn. Suitable for personal documentation or shared learning references.
data-analysis data-science data-visualization exploratory-data-analysis portfolio python
Last synced: 29 Apr 2026
https://github.com/jofaval/melbourne-temperature-timeseries
Timeseries Data Analysis and Forecasting of the daily min temperature in Melbourne from 1981 to 1990
data-analysis data-science data-visualization deep-learning google-colab melbourne python temperature tensorflow timeseries timeseries-analysis
Last synced: 29 Apr 2026
https://github.com/istinnew/eniac_ab_insight
Dive into a comprehensive analysis aimed at boosting iPhone 13 sales by optimizing the Click-Through Rate (CTR) of the “SHOP NOW” button, compare different button designs and determine the most effective strategy for increasing engagement.
ab-testing data data-analysis data-engineering data-science data-visualization google googlecolab libraries python testing testing-tools visual-studio-code
Last synced: 29 Apr 2026
https://github.com/hazz-i/e-commerce-analysis
FP Dicoding Analisis data dengan python
data-visualization jupyter-notebook python
Last synced: 29 Apr 2026
https://github.com/sukitsubaki/image-color-scheme
Extract dominant colors from images and create beautiful color palettes with minimal dependencies. Supports various palette types: monochromatic, analogous, complementary, triadic, and tetradic.
color-extraction color-palette data-visualization design-tools image-analysis minimal python python-library
Last synced: 29 Apr 2026
https://github.com/nishumehta/supermart-grocery-sales-retails-analytics
Tableau Dashboard Link :
data-analysis data-cleaning data-visualization jupyter-notebook matplotlib-pyplot numpy pandas python3 seaborn
Last synced: 29 Apr 2026
https://github.com/amirrezaskh/nyc-taxi-dashboard
A comprehensive data analytics platform that processes NYC taxi trip data from Google BigQuery and visualizes insights through an interactive React dashboard. Features real-time heatmaps, temporal analysis, and geographic intelligence across 263 NYC taxi zones.
bigquery dashboard data-analytics data-science data-visualization geospatial leaflet material-ui nyc-taxi plotly react typescript
Last synced: 29 Apr 2026
https://github.com/christs8920/data-science-py
A collection of data science projects made in python.
data-science data-visualization machine-learning matplotlib nltk numpy pandas python sklearn svm-classifier visualization
Last synced: 29 Apr 2026
https://github.com/fanisgl/video-games-sales-data
Data Analysis of Sales Dataset using Python.
data-analysis data-science data-visualization dataset jupyter-notebooks matplotlib numpy pandas poisson-distribution python python3 sales statistics
Last synced: 29 Apr 2026
https://github.com/muhammadusman-khan/e-commerce-store-eda
Exploratory Data Analysis on E-commerce store data to uncover insights about sales trends, customer behavior, and product performance using Python libraries like Pandas, NumPy, and Matplotlib/Seaborn.
data-analysis data-science data-visualization e-commerce eda exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 29 Apr 2026
https://github.com/gvatsal60/ds-on-kaggle
A collection of data science projects, experiments, and insights from Kaggle competitions and datasets
data data-science data-visualization numpy pandas python3
Last synced: 29 Apr 2026
https://github.com/andryadsm/asset-analyzer
📈 Project Asset Analyzer (Python)
commodities data-analysis data-visualization economics financial-markets investing matplotlib numpy pandas python seaborn stock-market strategy trading
Last synced: 29 Apr 2026
https://github.com/machinelearningzuu/data-engineering-projects
This repository is a curated collection of projects and tools that exemplify best practices in data engineering. It serves as a resource for data professionals seeking to enhance their data infrastructure, optimize data pipelines, and implement cutting-edge data processing techniques.
airflow bigquery data-engineering data-science data-visualization data-warehouse
Last synced: 30 Apr 2026
https://github.com/r-mahesh45/fraud-detection-and-sales-analysis-using-random-forest
This project uses Random Forest to classify fraud risk based on taxable income and analyze key factors driving high sales for a cloth manufacturing company.
classification data-visualization extract-transform-load python3 random-forest
Last synced: 30 Apr 2026
https://github.com/chrka/d3-chessboard-count
Plot per-square frequencies on a chessboard
Last synced: 30 Apr 2026
https://github.com/abhinav330/instagram-influencers-analysis
This Jupyter Notebook focuses on preprocessing and visualizing data from an Instagram profiles dataset. It includes data loading, inspection, visualization, and some data preprocessing steps.
data data-science data-visualization exploratory-data-analysis exploratory-data-visualizations influncer-products instagram scikit-learn sklearn
Last synced: 08 Jun 2026
https://github.com/syed-bakhtawar-fahim/datavisualization
Data Visualization with Python
big-data-analytics data data-analysis data-analysis-python data-science data-visualization pandas pyspark
Last synced: 30 Apr 2026
https://github.com/devprnvk/realestateml
This Python program analyzes a dataset (HousePricePrediction.xlsx) containing information about house prices. It utilizes pandas for data manipulation, matplotlib for plotting, and seaborn for visualizing correlations and distributions.
data-science data-visualization datasets houses npm plotting prediction-model seaborn
Last synced: 30 Apr 2026
https://github.com/edgarhtt/uber_freight_data_analysis
Uber Freight interview homework. It consisted of solving a 2 warehouse problem and an ETL task
data-analysis data-science data-visualization python
Last synced: 30 Apr 2026
https://github.com/samiksha29-patil/hr-employee-data-analysis-visualization-in-python
This project focuses on analyzing an HR Employee Dataset that contains details about employees such as demographics, job status, salaries, performance reviews, satisfaction levels, and attrition reasons.
csv-files data data-visualization dataanalysis matplotlib numpy pandas python seaborn
Last synced: 30 Apr 2026
https://github.com/bachtiarashidiqy/ecommercedashboard
An interactive e-commerce analytics dashboard built with Streamlit, providing visualizations for sales performance, product analysis, geographic insights, and delivery status. Includes date filtering, company branding, and comprehensive documentation.
analytics dashboard data-analysis data-visualization e-commerce matplotlib pandas python seaborn streamlit
Last synced: 30 Apr 2026
https://github.com/mxagar/eda_fe_summary
An 80/20 guide for Data Processing: Data Cleaning, Exploratory Data Analysis, Feature Engineering, Feature Selection.
data-analysis data-cleaning data-modeling data-science data-visualization eda exploratory-data-analysis feature-engineering feature-selection machine-learning pandas
Last synced: 30 Apr 2026
https://github.com/fernandesotero/project-data-exploration
Student Performance Prediction with Data Science
data-visualization jupyter-notebook python
Last synced: 30 Apr 2026
https://github.com/cagandemirmr/airbnb_available_houses
In this repo, i create dashboard using Tableau.In this process, i use SQL and Python languages.
dashboard data-visualization dataprocessing python sql tableau
Last synced: 30 Apr 2026
https://github.com/praveendecode/analytics-for-hospitals-health-care-data
Analytics for Hospitals' Health-Care Data
covid-19 data-analysis data-visualization exploratory-data-analysis ibm-cognos-analytics ibm-watson medical-domain python
Last synced: 30 Apr 2026
https://github.com/rijul007/smartwatch-data-analysis-using-python
Smartwatch Data Analysis to uncover insights into health and activity patterns using Python for data cleaning, exploratory analysis, and interactive visualizations.
data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas plotly python
Last synced: 30 Apr 2026
https://github.com/priyam-hub/covid-19-data-analysis
Explore COVID19 case numbers and deaths related to Coronavirus outbreak 2019/2020 in Pandas and in Jupyter notebook
analysis data data-visualization jupyter-notebook machine-learning python
Last synced: 08 Jun 2026
https://github.com/samuelpillai/machine-learning-classification-regression-nlp
A curated collection of machine learning mini-projects covering classification, regression, and natural language processing (NLP). This project demonstrates model training, evaluation, feature engineering, and pipeline integration using real-world datasets and Python tools like Scikit-learn, pandas, and NLTK.
classification data-analysis data-science data-visualization feature-engineering jupyter-notebook machine-learning ml-pipeline model-evaluation nlp python regression-models scikit-learn supervised-learning text-mining
Last synced: 30 Apr 2026
https://github.com/tolumie/aviva-insurance-statistics-hypothesis-abtesting-modelling
This project explores the impact of demographic and lifestyle factors on insurance charges. Using statistical hypothesis testing (ANOVA, Chi-Square, T-tests) and predictive modeling (Elastic Net, Random Forest, Gradient Boosting). The analysis is deployed using Streamlit.
anova chi-square-test data-visualization eda gradient-boosting hypothesis-testing insurance-dataset machine-learning predictive-modeling python random-forest statistical-analysis streamlit
Last synced: 30 Apr 2026
https://github.com/rayxiang03/indeed-job-scraping
Python toolkit for scraping Indeed job listings, preprocessing data, and generating visualizations for market analysis.
cloudscraper data-visualization indeed job-analysis nlp pandas python web-scraping
Last synced: 30 Apr 2026
https://github.com/mitchellharrison/mitchellharrison.github.io
Welcome to my slice of the internet, where I share the knowledge that Duke gave me, so you don't have to spend the mortgage-sized amount to access it. Built with R, Python, Quarto, and love.
ai algorithms-and-data-structures blog data-analysis data-science data-visualization educational machine-learning portfolio portfolio-website quarto r r-language statistics tutorials
Last synced: 30 Apr 2026
https://github.com/gerhynes/d3-births-pie-chart
A D3 pie chart showing UN birth data grouped by month and quarter. Built for The Advanced Web Developer Bootcamp.
d3 data-visualization javascript
Last synced: 30 Apr 2026
https://github.com/danwild/estimate-error-bar
d3 d3js data-visualization error-bars
Last synced: 08 Jun 2026
https://github.com/abhi227070/ipl-2024-sold-player-data-analysis
This project analyzes IPL 2024 auctioned players' data, including name, team, cricket type, nationality, and price. Users input a player's name to access team, style, nationality, and auction price, aiding research and fantasy leagues. It offers insights into player dynamics, serving cricket enthusiasts with comprehensive data exploration.
data-analysis data-visualization dataanalytics machine-learning machine-learning-algorithms python3
Last synced: 30 Apr 2026
https://github.com/realvuk/r-for-data-science-by-vuk
My exercise from the book R for Data Science: Import, Tidy, Transform, Visualize, and Model Data 2nd Edition
data-science data-visualization r rstats
Last synced: 13 Jun 2026
https://github.com/miguelmedinacastro/trabalho-dados-r
Trabalho final da disciplina Análise Exploratória de Dados
data data-science data-science-projects data-visualization database r rstudio
Last synced: 01 May 2026
https://github.com/gitchaell/computer-scrapping
Tool that extracts data from the pages of companies that sell computers in the city of Trujillo - Peru, exports them in an XLSX file according to a relational data model, and displays them on a Power BI dashboard.
data-analysis data-structures data-visualization database dbdiagram export-excel powerbi scrapper-script scrapping xlsx
Last synced: 01 May 2026
https://github.com/mayhixza/insurance-dataset-analysis
Medical cost insurance EDA project
data-science data-visualization eda linear-regression matplotlib scikit-learn seaborn
Last synced: 01 May 2026
https://github.com/praths71018/kaggle_datasets
All projects that I did in kaggle
data-analytics data-science data-visualization kaggle kaggle-dataset machine-learning
Last synced: 01 May 2026
https://github.com/falakrana/data-analysis-visualization
This repository showcases data analysis and visualization projects using Python and Tableau. It includes exploratory data analysis, interactive dashboards, and insightful visual stories derived from real-world datasets.
data-analysis data-visualization python tableau-public
Last synced: 01 May 2026
https://github.com/kivanc57/explaratory_analysis
Exploratory and Descriptive Data Analysis on Indonesian data using R. This project involves reading data, feature analysis, correlation analysis, logistic regression, PCA, MDS, and clustering. Visualizations include boxplots, scatter plots, corrgrams, and dendrograms. Comprehensive report available in report.docx.
clustering data-science data-visualization descriptive-statistics explanatory-data-analysis mds pca plot r
Last synced: 08 Jun 2026
https://github.com/cdeweyx/bryce-harper-2016-analysis
Notebook analyzing Bryce Harper's disappointing 2016 campaign in historical context through data analytics.
data-analysis data-visualization python
Last synced: 01 May 2026
https://github.com/gerhynes/d3-movie-quotes
A simple page built to practice binding data to elements using D3. Built for The Advanced Web Developer Bootcamp.
d3 data-visualization javascript
Last synced: 01 May 2026
https://github.com/caesaredia/la-cafe-market-analysis
A data-driven feasibility study exploring the potential of launching a robot-staffed café in Los Angeles, based on real F&B business data.
business-intelligence cafe data-analysis data-visualization food-industry franchise los-angeles market-research pandas python
Last synced: 01 May 2026
https://github.com/ishmal793/bi-dummy-
An interactive and beginner-friendly data dashboard built using Streamlit. Upload your own CSV or Excel file, apply filters, view key statistics, and generate beautiful visualizations with no coding required.
data-analytics data-visualization eda pandas plotly python-dashboard streamlit
Last synced: 01 May 2026
https://github.com/fbarffmann/project1
Analyzed factors influencing movie profitability using Python. Cleaned and visualized film industry data to uncover trends in budgets, sales, genres, and ratings.
box-office-analysis data-analysis data-visualization matplotlib movie-industry pandas python regression seaborn
Last synced: 01 May 2026
https://github.com/anandvai/ai_rag_chatbot_multi_pdf_support
RAG (Retrieval-Augmented Generation) Chatbot built with Streamlit and LangChain, powered by Groq's blazing-fast LLaMA3-8B. It allows you to upload multiple PDFs, ask questions, and get precise, context-aware answers in a conversational format.
ai data data-science data-visualization data-visualizations dataengineering fastapi langchain langgraph python sql streamlit
Last synced: 01 May 2026
https://github.com/abdoomohamedd/python-data-analysis-projects
A collection of data analysis projects that I have worked on. Each project involves cleaning data, performing exploratory data analysis (EDA), and creating visualizations to extract insights. The projects utilize various Python libraries, including pandas, numpy, matp
data-analysis data-analysis-python data-cleaning data-visualization matplotlib matplotlib-pyplot numpy pandas python
Last synced: 01 May 2026
https://github.com/jfaccioli/leaflet-earthquake
Geo mapping earthquakes with Leaflet / Javascript / GeoJSON
data-visualization geojson javascript json leaflet
Last synced: 01 May 2026
https://github.com/gabrieldiem/iss_locator
Little python script that plots the ISS (International Space Station) location in a world map at a given time
data-visualization pandas plotly python script
Last synced: 01 May 2026
https://github.com/samia35-2973/world-university-ranking-2023-prediction
This repository is about creating models for predicting world university rankings 2023. The World University Rankings 2023 dataset include 1,799 universities across 104 countries and regions, making them the largest and most diverse university rankings to date. A clean dataset is generated through data preprocessing.
data-cleaning data-preprocessing data-visualization decision-trees machine-learning machine-learning-algorithms model-training prediction world-university-rankings world-university-rankings-2023
Last synced: 01 May 2026
https://github.com/kushshriv/onlinejobpostings-infographic
The Python Data Cleaning Code and Input Dataset For My Telling Stories With Data Project
data-visualization pandas python
Last synced: 01 May 2026
https://github.com/celineboutinon/client-segmentation
CentraleSupélec/OpenClassrooms Data Scientist 2024-2025 - Projet 5
aws client-segmentation cloud-architecture data-analysis data-science data-visualization database e-commerce marketing marketing-analytics marketplace-solution
Last synced: 01 May 2026
https://github.com/enshefalogram/ml-score-pred
This project is designed to make your life... predictable!
ai data-visualization eda machine-learning phyton python3 regression regression-models supervised-machine-learning
Last synced: 01 May 2026
https://github.com/martindambrosio/ba-tree-census-analysis
Analysis and visualization of Buenos Aires urban trees using Python and Tableau, including interactive maps to explore species distribution and characteristics.
data-visualization folium-maps pandas python tableau
Last synced: 01 May 2026
https://github.com/guptakushal03/whatsapp-chat-analyser
The WhatsApp Chat Analyzer is a Python-based tool built with Streamlit for analyzing WhatsApp chat data. It provides insights such as total messages, word count, media shared, links shared, monthly activity timeline, most active users, activity maps, and word clouds.
chat-analysis data-analysis data-visualization python streamlit text-processing whatsapp word-cloud
Last synced: 01 May 2026
https://github.com/sanikamal/machine-learning-atoz
Beginner-friendly machine learning tutorials and mini-projects.
collaborative-filtering data-analysis data-visualization decision-trees kmeans-clustering knn machine-learning machine-learning-algorithms recommender-system regression svm
Last synced: 08 Jun 2026
https://github.com/robwiederstein/covid-19-ky
Monitor US covid-19 cases w/ Johns Hopkins data
data data-visualization leaflet plotly r shell
Last synced: 02 May 2026
https://github.com/prady2309/iris_flower_classification
Random Forest Classification
data-science data-visualization machine-learning python
Last synced: 02 May 2026
https://github.com/vaxdata22/redfin-analytics-etl-using-amazon-emr-by-airflow-on-ec2
This is an end-to-end AWS Cloud ETL project. This data pipeline uses an Amazon EMR cluster managed by Apache Airflow that is running on an AWS EC2 instance. It demonstrates how to build orchestration that would perform data transformation using Amazon EMR as well as automatic data ingestion into a Snowflake via Snowpipe. It also features Power BI.
amazon-emr-cluster apache-airflow apache-spark aws-ec2 aws-s3 business-intelligence dags data-visualization etl-pipeline google-colab-notebook orchestration power-bi pyspark redfin snowflake snowpipe sqs-queue
Last synced: 02 May 2026
https://github.com/bhaveshbhakta/car-auction-prices-prediction-using-ml
Car Auction Price Prediction
auction car-price-prediction data-preprocessing data-visualization machine-learning random-forest
Last synced: 08 Jun 2026
https://github.com/quocduyenanhnguyen/airlines_web_scrapping
I scrapped airline data from a Wiki page with Python, did some data cleaning with Google Sheet and SQL, then visualized the data with Tableau.
airlines csv-files data-cleaning data-visualization mysql python3 tableau tableau-dashboards tableau-public webscraping
Last synced: 15 May 2026
https://github.com/quocduyenanhnguyen/twitter-despicable-me-4-hashtag-engagement-analysis
In this project, I explored Despicable Me 4 hashtag on Twitter to gather engagement metrics for data analysis over a one week period.
csv-files data-analytics data-cleaning data-visualization mysql python3 tableau tableau-dashboards tableau-public twitter twitter-hashtag
Last synced: 16 May 2026
https://github.com/faithererer/haokanvideo_spider
好看视频爬取与数据分析
data-analysis data-visualization python spider
Last synced: 02 May 2026
https://github.com/shridhar1504/milk-production-time-series-forecasting-datascience-project
This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast the milk production. The model is evaluated using various metric.
adf arima-model augmented-dickey-fuller-test data-analysis data-analytics data-science data-visualization eda exploratory-data-analysis machine-learning machine-learning-algorithms python python3 residuals sarimax seasonality time-series time-series-forecasting trends
Last synced: 02 May 2026
https://github.com/teja-1403/ignosis-tech-ml-assignment
Analysis of transaction data to identify the most profitable products and key customer segments, providing insights for targeted marketing strategies.
customer-segmentation data-analysis data-visualization machine-learning marketing-strategy python
Last synced: 02 May 2026
https://github.com/gerhynes/d3-mobile-subscription-literacy-scatterplot
A D3 scatterplot showing mobile phone subscriptions against literacy rates. Built for The Advanced Web Developer Bootcamp.
d3 data-visualization javascript
Last synced: 02 May 2026
https://github.com/kimaruthagna/segmente
A journey through understanding customer segmentation using python with the general goal of encouraging data driven decision making
clustering crosstab customer-segmentation data-science data-visualization knn-classification lifetime-value pandas rfm-analysis seaborn
Last synced: 02 May 2026
https://github.com/ronitjariwala/prodigy_ds_04
Prodigy InfoTech Data Science Internship Task-4
data-analysis data-science data-visualization python
Last synced: 02 May 2026
https://github.com/hamzazafar10/movie-recommendation-system
Content based movie recommendation system using cosine similarity.
cosine-similarity data-analysis data-preprocessing data-science data-structures data-visualization jupyter-notebook machine-learning movie-recommendation python
Last synced: 02 May 2026
https://github.com/bhawnagoyal18/ai-doctor-a-symptom-checker-disease-predictor
AI Doctor is an intelligent healthcare application that utilizes machine learning (ML) and Python to predict potential diseases based on user-input symptoms. The project integrates data from multiple medical datasets and provides an interactive web-based UI for an intuitive user experience.
data-analysis data-engineering data-visualization dataset flask html5 machine-learning python sql stacking statistics
Last synced: 02 May 2026
https://github.com/rorrell/employmentdata
A Jupyter Notebook where I use group by to analyze the average unemployment rate by year
data-analysis data-visualization jupyter-notebook python3
Last synced: 02 May 2026
https://github.com/rajapriya12345/data-visualization
data-visualization java python
Last synced: 02 May 2026
https://github.com/neuro-mechatronics-interfaces/matlab_analyses
Tools for analysis, statistics, and/or simulation in Matlab.
data-analysis data-visualization matlab matlab-codes matlab-functions matlab-gui matlab-scripts neuroscience weber-lab
Last synced: 09 Jun 2026
https://github.com/fatihilhan42/spotify-songs-recommendations-system_with_python
We developed a song recommendation system for the user with the data we received from our Spotify song dataset. Data set and other applications are given in the description. Have a nice day.
data-analysis data-science data-visualization jupyter-notebook python recommendation-engine recommendation-system
Last synced: 02 May 2026
https://github.com/holy-angel-university/global-cost-index-analysis
This analysis explores the cost of living across various countries, aiming to provide insights into economic disparities and living standards on a global scale. Utilizing a dataset that includes indices for overall cost of living, groceries, restaurant prices, and rent, we investigate the top and least expensive countries worldwide.
data-science data-visualization exploratory-data-analysis jupyter-notebook python3
Last synced: 02 May 2026
https://github.com/debjyotisaha/web-application-projects-streamlit-phase-2
This repository showcases interactive web applications built using the Streamlit framework.
dashboard data-visualization python streamlit
Last synced: 02 May 2026