Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
- JSON Representation
https://github.com/greatwoman23/hotel_reservation_analysis
In this project, we delve into the intricate world of hotel reservations, utilizing a multifaceted analytical approach to uncover valuable insights. Through a combination of SQL queries and Tableau visualizations, we meticulously dissect a rich dataset comprising booking details, customer demographics, and reservation statuses.
data-analysis data-science data-visualization hotel hotel-reservation publications sql sql-query sqlite3 tableau
Last synced: 15 May 2026
https://github.com/codeslash21/wrangle-twitter-archive
Wrangle Twitter Archive WeRateDog. WeRateDog has 8M followers and they rate the dogs with funny comments and unique rating system. Also use dog-breed classifier to predict dog's breed in the tweets.
data-analysis data-wrangling neural-networkt twitter-api twitter-archive
Last synced: 10 Apr 2025
https://github.com/codeslash21/wrangle_twitter_archive
Wrangle Twitter Archive WeRateDog. WeRateDog has 8M followers and they rate the dogs with funny comments and unique rating system. Also use dog-breed classifier to predict dog's breed in the tweets.
data-analysis data-wrangling nanodegree-project neural-network twitter-api twitter-archive
Last synced: 10 Apr 2025
https://github.com/samwhaaa/da_portfolio
Showcasing some of my Data Analytics projects
data-analysis data-analytics data-visualization jupyter jupyter-notebook python
Last synced: 01 Mar 2025
https://github.com/wilfordaf/dataanalyst-test
Test task for Junior Data Analyst position
data-analysis pandas python trading-data
Last synced: 28 Feb 2025
https://github.com/dcs-training/introtostatistics
This is a repository which contains all the materials to be used in the introduction to statistics course. Go to the readme file
data-analysis r rmarkdown statistics
Last synced: 26 Mar 2025
https://github.com/xre22zax/airline-analysis
Travel agency and need to know the ins and outs of airline prices for your clients
data-analysis data-visualization python python3 visualization
Last synced: 13 Apr 2026
https://github.com/satyam4229/prediction-of-different-diseases
Prediction of the different diseases with the help of different symptoms express the diseases in the real time. In the dataset, there are 132+ different symptoms on which the model is trained to give the best result of the disease.
data-analysis data-science data-visualization jupyter-notebook kaggle python
Last synced: 13 Apr 2026
https://github.com/mattholy/haka
HaKa is an out-of-the-box tool system designed for data engineers and data analysts in medium-sized enterprises. It is easy to deploy and scale.
celery data-analysis data-engineering fastapi python uvicorn-gunicorn
Last synced: 19 May 2026
https://github.com/samruddhi3012/rfm-sales-analysis
Hi there! In this project I have performed Sales Analysis (RFM Analysis) using SQL and Tableau.
data-analysis data-visualization mssqlserver rfm-analysis segmentation tableau
Last synced: 12 Mar 2025
https://github.com/chen0040/spark-tabular-analytics
Spark statistical inference framework for performing column pair-wise data analytics for large data table
anova chi-square-test confidence-intervals data-analysis hypothesis-testing spark statistical-inference tabular-data
Last synced: 07 Jul 2025
https://github.com/masamallow/jupyterlab-my-local
Configuration to run my personal JupyterLab on my local.
data-analysis jupyter jupyter-notebook jupyterlab
Last synced: 26 Mar 2025
https://github.com/deliprofesor/behavioral-insights-and-data-exploration
This project analyzes Spanish speech data, focusing on acoustic features and demographics. It includes data cleaning, outlier detection, clustering, and time series modeling (ARIMA, Holt-Winters) to uncover patterns in speech duration and word frequency.
acoustic-features arima clustering data-analysis holt-winters k-means machine-learning speech-analysis time-series-analysis
Last synced: 10 Apr 2025
https://github.com/abhipatel35/svm-hyperparameter-optimization-for-breast-cancer
Utilizing SVM for breast cancer classification, this project compares model performance before and after hyperparameter tuning using GridSearchCV. Evaluation metrics like classification report showcase the effectiveness of the optimized model.
breast-cancer cancer-diagnosis classification data-analysis data-science gridsearchcv healthcare hyperparameter-tuning jupyter-notebook machine-learning medical-imaging pycharm python scikit-learn support-vector-machine svm
Last synced: 05 Feb 2026
https://github.com/luciocolonna/cyclistic-bikesharing-2023
Case study on public data from Chicago's Divvy bikeshare, using R
bikesharing capstone-project cyclistic cyclistic-bikshare data-analysis data-visualization geojson ggplot2 google-data-analytics google-data-analytics-capstone-project google-data-analytics-professional leaflet r sf tidyverse
Last synced: 02 Apr 2025
https://github.com/nero103/airbnb-destination
This is and end-to-end project to uncover the ideal destination based on listings and hosts. Strategy included: Data workflow-SQL analysis-Data modeling-Data Visualization-Findings
data-analysis data-modeling data-visualization etl etl-pipeline excel microsoft-sql-server powerpoint sql tableau
Last synced: 27 Mar 2026
https://github.com/seblehner/feldprakt
Collection of plotting routines for a field exercise work using different measurement tools and Hobo weather stations.
data-analysis data-visualization jupyter-notebook python
Last synced: 05 Oct 2025
https://github.com/jianxi-erin/bigdata-machinelearning-lab
本项目是一个综合性的大数据与机器学习实验平台,包含两个主要任务,每个任务涵盖三个关键技术模块:大数据处理、数据分析和机器学习。项目基于真实的竞赛设计,提供完整的数据处理模拟和建模实践。
data-analysis data-visualization hadoop machine-learning python spark sql
Last synced: 03 May 2026
https://github.com/mahmoud2abdallah/improvado-marketing-homework
This Looker Studio dashboard provides a comprehensive analysis of marketing performance for August 2024, transforming raw data into actionable insights for data-driven decision making.
bigquery business-intelligence data-analysis looker-studio marketing
Last synced: 05 Oct 2025
https://github.com/vishal786-commits/target-businesscasestudy-sql
This project analyzes Target’s e-commerce transactions in Brazil between 2016 and 2018 using SQL. The goal was to explore customer behavior, order patterns, payments, delivery times, and freight costs to generate actionable business insights.
Last synced: 05 Oct 2025
https://github.com/manishkaa/google_data_analytics_capstone_case_study
This case study is a part of Google Data Analytics Capstone Project
bigquery data-analysis sql tableau
Last synced: 05 Oct 2025
https://github.com/marielachirinosr/analysis-urgencias-hospital-pitalito
This project involves analyzing emergency room admission data from the E.S.E Hospital Departamental de Pitalito using a star schema model.
bigquery data data-analysis etl-pipeline tableau
Last synced: 21 Jan 2026
https://github.com/chdre/data-analyzer
A small package to analyze and preprocess data.
Last synced: 06 Oct 2025
https://github.com/sora468/best-of-ml-python
🏆 Discover top-ranked Python libraries for machine learning, updated weekly to help you find the best tools for your projects.
airport airport-simulation chatgpt configuration data-analysis data-science data-visualization data-visualizations gpt keras machine-learning nlp python scikit-learn tensorflow transformer usg-ai-training-data usg-artificial-intelligence
Last synced: 09 May 2026
https://github.com/eharshit/end-to-end-vendor-insights
End-to-end analysis of vendor performance for wholesale/retail businesses, featuring data ingestion, cleaning, insights, and interactive Power BI dashboards.
analysis analysis-algorithms analytics dashboard data data-analysis datascience jupyter jupyter-notebook pandas powerbi powerbi-report retail wholesale
Last synced: 07 Oct 2025
https://github.com/npodlozhniy/podlozhnyy-module
One place for the most useful methods for work
data-analysis data-science pypi
Last synced: 21 Jan 2026
https://github.com/aymanmomin/excel-coffee-data-analytics-exploring-coffee-orders-dataset
This project utilizes a coffee orders dataset to perform comprehensive data analytics and gain insights into customer preferences, popular items, and sales trends. The analysis aims to provide valuable information for coffee shop owners and enthusiasts, facilitating data-driven decision-making and improved customer satisfaction.
data-analysis data-visualization excel project
Last synced: 18 Jan 2026
https://github.com/bhaveshbhakta/student-performance-prediction-using-ml
Student Performance Prediction
data-analysis data-visualization linear-regression machine-learning student-performance-analysis student-performance-prediction
Last synced: 08 Oct 2025
https://github.com/sarvesh2304/stellarator_simulation
A comprehensive Julia package for stellarator fusion reactor physics analysis featuring 3D magnetic field calculations, neoclassical transport modelling, quasi-isodynamic optimisation algorithms, and interactive 3D visualisations. Includes tokamak comparison framework and high-resolution plotting capabilities for fusion research.
3d-visualisation data-analysis field-line-tracing fusion-physics fusion-research interactive-3d julia magnetic-confinement magnetic-field-calculations magnetic-surfaces matplotlib neoclassical-transport numerical-methods optimisations physics-simulation plasma-physics plotly quasi-isodynamic stellarator stellarator-optimization
Last synced: 09 Oct 2025
https://github.com/ibromeat/market-orders-analysis
Data analysis of CRM market orders dataset
data-analysis jupyter-notebook machine-learning pandas python visualization
Last synced: 01 May 2026
https://github.com/amish5ingh/cricket-data-analytics-ipl
Data analysis and visualization of IPL 2022 matches using Python, Pandas, Matplotlib, and Seaborn. Includes insights on match outcomes, player performances, toss trends, and venue stats with 12+ charts.
data-analysis data-visualization ipl-data-analysis ipl-data-visualization jupiter-notebook matplotlib-pyplot numpy pandas python seaborn
Last synced: 09 May 2026
https://github.com/l1ght14/tradersentiment_primetrade
Analyzes Bitcoin market sentiment's impact on Hyperliquid trader PnL & behavior. Uncovers patterns using Python (Pandas, Seaborn) to derive actionable trading insights. Junior Data Scientist assignment for PrimeTrade
bitcoin crypto-trading cryptocurrency data-analysis financial-data-analysis jupyter-notebook market-sentiment pandas python trader-behavior web3
Last synced: 20 Oct 2025
https://github.com/jhaayush2004/churncast
Fusion of deep Data Science, Machine Learning and MLOps...
aws data-analysis data-science data-visualization deep-neural-networks docker machine-learning mlops-workflow
Last synced: 09 Oct 2025
https://github.com/sillyash/untappd-viz
A data visualisation page using public datasets and HTML/CSS/JS with D3.js.
beer beer-statistics data data-analysis data-visualization kaggle kaggle-dataset public-dataset school-project
Last synced: 18 May 2026
https://github.com/ninadpatil09/hospital_emergency_room_analysis
This comprehensive analysis delves into the performance and characteristics of the hospital's emergency room over the past year. By scrutinizing key metrics and patient demographics, this study aims to provide valuable insights for optimizing patient care, resource allocation, and overall operational efficiency.
data-analysis tableau-public visualization
Last synced: 15 Feb 2026
https://github.com/atiqisrak/py
This repository houses the code and resources for the **100 Days of Python Challenge** – an intensive learning journey designed to propel you from beginner to a a confident Python programmer in just 100 days.
data-analysis data-science machine-learning python3
Last synced: 10 Oct 2025
https://github.com/loaiwalid07/automation_data_overviwe
This is Streamlit app that gives an overview for a dataset you upload
automation data data-analysis data-exploration data-science data-transformation data-visualization
Last synced: 19 May 2026
https://github.com/anandu-jpg/coffee-shop-sales-analysis
This project analyzes coffee shop sales data to identify trends, patterns, and insights that can help improve operations, boost revenue, and enhance the customer experience.
business-intelligence data-analysis data-visualization exploratory-data-analysis jupyter-notebook pandas phyton
Last synced: 18 May 2026
https://github.com/sharvesh1401/battsense
BattSense is a machine learning project focused on predicting the State of Health (SOH) of lithium-ion batteries using operational parameters such as voltage, current, temperature, and capacity. The model enables accurate, data-driven diagnostics for battery performance monitoring in electric vehicles and portable devices.
battery-diagnostics battery-health battery-health-prediction battery-soh data-analysis electric-vehicles energy-storage machine-learning predictive-maintenance python regression scikit-learn
Last synced: 07 May 2026
https://github.com/salma-mamdoh/a-visual-history-of-nobel-prize-winners-project
My project aims to practice Data Analysis and Data Visualization on DataCamp
data-analysis data-visualization datacamp matplotlib pandas python seaborn
Last synced: 04 May 2026
https://github.com/frankelavsky/security-dash-challenge
I had two 8 hour days to create a visualization dashboard for three datasets. Tab one: Voronoi overlay on line graph. Tab two: Data partitioning method keeps in-memory usage low. Tab three: deals with "Failed" vs "Successful" attempts as positive/negative barcharts over time. I used d3.js, require, MVC pattern, and vanilla js.
client-side complexity css3 d3 d3js dashboard data-analysis data-structures-algorithms data-visualization frontend-app html5 interactive-visualizations javascript modular network-analysis network-monitoring network-security security single-page-app visualization
Last synced: 14 Apr 2026
https://github.com/pranav016/exploratory-data-analysis-of-google-app-store-dataset
This is a data analysis done on the Google app store dataset to answer a few questions related to the data through data visualization techniques.
Last synced: 11 Oct 2025
https://github.com/saifalibaig/covid-19-death-rate-analysis-using-python
Analysis of Covid-19 data along with the world happiness report to identify if there is any relationship between death rate and happiness rate of countries all over the world.
data-analysis data-visualization numpy pandas python3 sns visualization
Last synced: 03 May 2026
https://github.com/pyrypp/koivunen-vastaanottoanalyysi
An analysis on warehouse goods receiving
business-intelligence data-analysis interactive-visualizations
Last synced: 11 Oct 2025
https://github.com/ahsankhizar5/titanic-eda-visualization
Exploratory Data Analysis and Visualization on the Titanic Dataset using Python, Pandas, Matplotlib, and Seaborn to uncover survival patterns.
data-analysis data-science data-visualization eda kaggle machine-learning matplotlib pandas python seaborn titanic-dataset
Last synced: 31 May 2026
https://github.com/bakulwani/data-mart-weekly-sales
Cleaned and analyzed weekly sales data using SQL to build a business-focused data mart with KPIs, customer segmentation, and platform insights.
customer-segmentation data-analysis data-cleaning etl kpi-analysis mysql sales-analysis sql
Last synced: 21 Feb 2026
https://github.com/thinzarhninyu/dap
Notes and Projects for Data Analysis with Python course from FreeCodeCamp.org
data-analysis data-analysis-python ipynb jupyter-notebook python
Last synced: 18 Feb 2026
https://github.com/abeltavares/postql
Python library and command-line interface (CLI) tool for interacting with PostgreSQL databases, providing simplified database management, query execution, and result export functionalities.
cli command-line-interface data-analysis data-engineering data-export data-management data-processing data-visualization database database-administration database-tools etl oop postgres postgresql psycopg2 python sql sqlalchemy wrapper
Last synced: 19 Jan 2026
https://github.com/NurFakhri/scraping-and-analysis-skincare
Scraping and data analysis of Indonesian skincare reviews.
beutifulsoup data-analysis data-scraping python requests review scraping-websites
Last synced: 12 Oct 2025
https://github.com/akash1070/project--uber-data-analysis
To Determine UBER data from the dataset using Python
data-analysis data-science python
Last synced: 09 May 2026
https://github.com/leosimoes/digitalinnovationone-analise-covid
Projeto prático "Criando modelos com Python e Machine Learning para prever a evolução do COVID-19 no Brasil" da Digital Innovation One.
arima-models data-analysis data-science python time-series
Last synced: 09 May 2026
https://github.com/angelalim88/jakarta-air-quality-index-classification
This project classifies Jakarta's Air Quality Index (AQI) from 2010 to 2023 using machine learning models (Random Forest, MLP, SVM) based on pollutant concentrations.
data-analysis data-visua machine-learning scikit-learn tensorflow
Last synced: 13 Oct 2025
https://github.com/szymon-budziak/real_estate_house_prices_prediction
Predicting real estate house prices using various machine learning algorithms, including data exploration, preprocessing, model training, and evaluation.
data-analysis data-preprocessing data-science eda jupyter-notebook machine-learning matplotlib numpy optuna pandas predictive-modeling price-prediction python random-forest regression scikit-learn seaborn
Last synced: 21 Jan 2026
https://github.com/sumit9000/submission-of-web-server-log-analysis-assessment
This project analyzes one year of real-world HTTP access logs from the University of Calgary’s computer science server. Using Python, pandas, and regular expressions, we clean and parse the data to extract meaningful insights and answer 10 analytical questions.
data-analysis data-cleaning eda jupyter-notebook log-parsing pandas python realworld-data regex web-log-analysis
Last synced: 14 Apr 2026
https://github.com/fnu-ankit/8-week-sql-challenge
My attempt on solving Case studies from #8WeeksSQLChallenge
8-week-sql-challenge 8-weeks-sql-challenge 8weeksqlchallenge case-study data data-analysis data-analysis-sql data-analytics database datawithdanny sql sqlserver
Last synced: 19 Apr 2026
https://github.com/asuquoaa/air_bnb_analysis_dashboard-tableau-
Interactive Tableau dashboards to analyze and visualize data, providing actionable insights for better decision-making
dashboard data-analysis interactive-visualization tableau
Last synced: 13 Mar 2026
https://github.com/ankitpoddar07/excel-project_back-office
📊 Coffee Sales Analytics – Back Office Excel Project
Last synced: 05 Feb 2026
https://github.com/supernyv/data_science_projects
Personal Data Science Projects
data-analysis data-science data-visualization exploratory-data-analysis machine-learning
Last synced: 14 Oct 2025
https://github.com/saisurajmatta/e-commerce-sales-advanced-data-analysis
Excel-based e-commerce analytics for FNP, a gift company. It covers data extraction, modeling, and visualization, providing actionable insights on revenue, customer behavior, and operations. Key skills include Excel, Power Query, Power Pivot, and DAX. The analysis culminates in data-driven business recommendations.
data-analysis data-visualization dax excel power-pivot power-query
Last synced: 22 Jan 2026
https://github.com/a26nine/msc-dissertation-bitcoin-dashboard
An interactive data visualisation dashboard built using Tableau Desktop to research and analyse the relationship between the price volatility and adoptability of bitcoin.
data-analysis data-science data-visualization tableau tableau-desktop tableau-prep
Last synced: 17 Feb 2026
https://github.com/jaseel342/bike_sales_analysis_using_excel
data-analysis excel excel-dashboard
Last synced: 24 Feb 2026
https://github.com/zeynepcol/Data-Analysis-Visualization
Data Analysis
data-analysis data-science data-visualization matplotlib pandas plotly python scipy seaborn streamlit
Last synced: 15 Oct 2025
https://github.com/hase3b/flask-dash-interactive-dashboard
An interactive data visualization dashboard created using Flask and Dash. This project includes comprehensive data preparation, exploratory data analysis (EDA), and dynamic visualizations with Seaborn and Plotly. Explore the multi-page Dash app with features like dropdowns and callbacks for updated plots.
callbacks dash dashboard data-analysis data-visualization dropdown eda flask interactive plotly seaborn web-app
Last synced: 19 May 2026
https://github.com/fatihilhan42/nba-players-data-1950-to-2021
In this project, the data of the NBA players between the years 1950-2021 were examined. After the NBA players' season, height, performance, averages of points, teams and positions they played were obtained through csv files, important tables and graphs were created using data cleaning and data visualization algorithms.
data data-analysis data-engineering data-science data-visualization
Last synced: 16 Oct 2025
https://github.com/prakshi-23/itvedant-database-management-system-analysis-using-sql
Data Analysis using SQL
Last synced: 22 Jan 2026
https://github.com/prakshi-23/restaurant-ratings-analysis-dashboard-using-power-bi
A Power BI Report on Restaurant Ratings Analysis
data-analysis data-visualisation dax powerbi powerbi-dashboard
Last synced: 22 Jan 2026
https://github.com/farhad-here/predict_student_performance
Predict Student Performance, is a data analysis and machine learning project aimed at predicting students' final performance (g3) based on demographic, family, and academic features. The project supports both Regression (predicting exact grades) and classification (Pass/Fail categories).
classification data-analysis data-visualization linear-regression machine-learning numpy pandas postgresql powerbi scikit-learn streamlit
Last synced: 14 Apr 2026
https://github.com/lucashomuniz/Project-03
Data-Driven Decision Making: Selecting the Best Regression Model for E-commerce Sales
benchmark-framework data-analysis data-driven data-visualization e-commerce-project language-python lasso-regression linear-regression-models machine-learning python ridge-regression
Last synced: 20 Oct 2025
https://github.com/saisurajmatta/nashville-housing-data-cleaning-project
Clean and standardize Nashville Housing dataset using SQL queries for improved data quality and structure.
azure-data-studio data-analysis mssql mysql sql sql-data-cleaning sql-queries sql-server-management-studio
Last synced: 23 Jan 2026
https://github.com/dcs-training/spatial_dynamics
Use of QGIS and R to analyse first and second order geospatial effects. Go to the Readme file
data-analysis geographical-data gis qgis r statistics
Last synced: 23 Oct 2025
https://github.com/albertobarrago/sentinel
A contribute for the research of Corrado Malanga and Filippo Biondi
Last synced: 24 Oct 2025
https://github.com/sehgal-vishal/sql-nyc-collision-analysis
this analysis is based on the Collisions(Accidents) happend in New York City. I have used Sql Server For EDA(Exploratory Data Analysis
data-analysis database eda sql-server
Last synced: 06 Feb 2026
https://github.com/gjjvdburg/veld
Easy command line analytics
cli command-line-tool data-analysis data-science data-visualization statistics
Last synced: 26 Oct 2025
https://github.com/code-jl/nfl-kicker-predictor
A sophisticated Python application that provides real-time NFL kicker statistics and performance analysis with an intuitive graphical interface.
beautifulsoup data-analysis data-visualization espn football gui nfl prediction python real-time-analytics real-time-data sport-analytics sports-data statistics tkinter web-scraping
Last synced: 01 Jun 2026
https://github.com/alunera-data/alunera-data
Hi, I’m Yvonne – building data solutions at the intersection of BI, SQL & Service Management
business-intelligence data-analysis data-engineering data-science github-profile portfolio rstats sql
Last synced: 28 Jan 2026
https://github.com/valentinoli/swiss-foodprint
Project in Applied Data Analysis, EPFL 2019
carbon-emissions data-analysis diet foodprint swiss switzerland
Last synced: 24 Jan 2026
https://github.com/annnieglez/fraud-detection-eda
Fraud Detection - Exploratory Data Analysis (EDA). Analyzing financial transactions to detect fraud patterns using Python and Tableau. Libraries: Pandas, Seaborn and Matplotlib. Key Focus: Data cleaning, fraud trends, high-risk transactions, time-based patterns
data-analysis data-science data-visualization eda fraud-detection fraud-prevention matplotlib seaborn
Last synced: 28 Jan 2026
https://github.com/tasosfotiadis/time-series-forecasting-for-bitcoin
This project forecasts Bitcoin’s daily closing price using time series models. Data from Jan 2021 to Mar 2022 is processed by converting timestamps, resampling, and handling missing values. LSTM and ARIMA models are evaluated on MAE, RMSE, and MAPE, with LSTM achieving better accuracy while ARIMA is faster in training and inference.
arima bitcoin data data-analysis data-science deep-learning forecasting jupyter-notebook neural-networks python time-series
Last synced: 06 May 2026
https://github.com/dharininadkar/covid-data-dashboard
Data Analysis of Covid-19 Cases
data-analysis data-mining data-science data-visualization ms-excel ms-sql-server
Last synced: 25 Feb 2026
https://github.com/abhi227070/medical-insurance-predictor
This project implements a machine learning regression model to predict medical insurance charges based on user-provided details such as smoking status, number of children, gender, and age. The user-friendly interface allows individuals to estimate their average insurance price before purchasing medical insurance.
data-analysis machine-learning machine-learning-algorithms machinelearning python3 regression-models
Last synced: 04 May 2026
https://github.com/infinitode/duplipy
DupliPy is a quick and easy-to-use package that can handle text formatting and data augmentation tasks for NLP in Python. It now offers support for image augmentation tasks as well.
ai augmentation data-analysis data-preprocessing data-science images language-models nlp preprocessing text-data text-datasets text-formatting
Last synced: 15 Apr 2026
https://github.com/joannescode/regex_with_py
Learning by practicing with Regex (Python)
Last synced: 30 Jan 2026
https://github.com/touchesir/twitter_physicalactivity
Companion Data / Analysis for "Monitoring Physical Activity Levels using Social Media Data"
Last synced: 30 Jan 2026
https://github.com/nehar-2404/airbnb-nyc-eda-ml
This project analyzes Airbnb listings in New York City to uncover key insights about pricing, host activity, and neighborhood trends. It covers data cleaning, EDA, and basic machine learning to predict listing prices.
airbnb data-analysis eda machine-learning matplotlib pandas pyhton seaborn visualization
Last synced: 15 Apr 2026
https://github.com/aavishkarmahajan/sql
SQL code assignments and practice questions from SQL courses, SQL data analysis
Last synced: 07 Feb 2026
https://github.com/jujulis18/olympicsmedalsdashboard
Olympic Dashboard – Paris 2024 est un tableau de bord interactif permettant d’explorer les performances des athlètes médaillés des Jeux Olympiques d’été de Paris 2024.
dashboard data-analysis data-visualization eda olympic python streamlit
Last synced: 31 Jan 2026
https://github.com/traore-07/fedex-sales-analysis
Analysis of the FedEx Sales Transaction
data-analysis data-visualization sales-analysis tabeau
Last synced: 31 Jan 2026
https://github.com/steviecurran/gbt-scripts
IDL scripts for the reduction of Green Bank Telescope data
data-analysis data-compression data-visualization radio-astronomy spectroscopy
Last synced: 31 Jan 2026
https://github.com/alex-pierron/ekip-enedis-genai
Repository for the team "Ekip" during the H-GenAI Hackathon 2025 organized at SIA Partners, Paris, France
amazon-nova artificial-intelligence aws aws-lambda data-analysis database generative-ai mistral nlp
Last synced: 15 Apr 2026
https://github.com/emediongfrancis/unified-data-lake-implementation-gcp-kafka-airflow-snowflake
This project demonstrates the integration of data from multiple sources into a unified data lake. The project showcases the use of Apache Airflow for ETL tasks, Google Cloud Storage as a data lake, Apache Kafka for data movement automation, Snowflake for data warehousing, and Google BigQuery for analysis.
airflow data-analysis data-warehousing etl etl-pipeline gcp-storage kafka snowflake value variety
Last synced: 07 Feb 2026
https://github.com/keneandita/exploratory-data-analysis-eda-
Explore EDA on 5 datasets: Titanic 🚢, Heart Disease ❤️, Wine Quality 🍷, Car Price 🚗, and NBA Players 🏀. Includes data cleaning, preprocessing, and visualizations to uncover insights. Perfect for beginners to learn data analysis with Pandas, Matplotlib, and Seaborn! 🎨📈
data-analysis data-visualization eda matplotlib pandas python seaborn sklearn
Last synced: 15 Apr 2026
https://github.com/yeuner/file-analysis-sql-demo
Streamlit-based application that leverages pandas, sqlite3, and file handling libraries (OpenPyXL and PyArrow) to practice SQL queries, analyze datasets, and export results. A personal project to enhance Python and SQL skills.
data-analysis dataset pandas sql sqlite streamlit vizualization
Last synced: 15 Apr 2026
https://github.com/rodrigojunqueiradev/curso-sql-para-analise-de-dados
data-analysis data-science nosql pg pgadmin4 postgresql sql
Last synced: 08 Feb 2026
https://github.com/sroman0/data-analytics
Data Analytics Exercises is a collection of comprehensive university-level exercises aimed at enhancing skills in data analytics. The repository includes practical notebooks covering data manipulation, exploratory data analysis (EDA), statistical analysis, data visualization, and machine learning fundamentals.
data-analysis data-analytics data-science data-visualization education exercises exploratory-data-analysis hands-on-practice jupyter-notebook machine-learning python statistics
Last synced: 15 Apr 2026
https://github.com/siddhant2105s/airline-performance-analysis-dashboard
Enhancing Airline Performance Analysis for the Department of Transport
data-analysis data-visualization tableau
Last synced: 08 Feb 2026
https://github.com/themihirmathur/uber-data-analytics
The goal of this project is to perform comprehensive data analytics on Uber trip data using a modern data engineering stack on Google Cloud Platform (GCP).
bigquery data-analysis data-engineering etl-pipeline google-cloud-platform looker python
Last synced: 09 Feb 2026