Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-02 00:07:33 UTC
https://github.com/virajbhutada/hr-analytics-excel-sql-tableau-powerbi
Explore a comprehensive HR Analytics portfolio showcasing data analysis and visualization skills. Featuring dashboards in Power BI, Excel, and Tableau, along with SQL queries for deeper insights. A holistic view of expertise in HR analytics, data visualization, and database management. Let's dive into the game of data insights!
data-analysis data-management data-visualization excel hr-analytics interactive-dashboards portfolio-project postgresql powerbi powerbi-visuals sql sql-queries tableau tableau-public
Last synced: 02 Aug 2025
https://github.com/evanwporter/sloth
Faster Pandas Dataframe
cython data-analysis dataframe pandas
Last synced: 14 Mar 2025
https://github.com/malthejorgensen/repx
Python regular expression file transformer
command-line-tool data-analysis text-processing
Last synced: 31 Jan 2026
https://github.com/steviecurran/gbt-scripts
IDL scripts for the reduction of Green Bank Telescope data
data-analysis data-compression data-visualization radio-astronomy spectroscopy
Last synced: 31 Jan 2026
https://github.com/an4pdm/relatorio-de-vendas
This project was built with the tools offered by Power BI in order to improve my knowledge of ETL. The data used came from the "Kaggle" website.
data-analysis data-visualization database etl powerbi
Last synced: 20 Jun 2026
https://github.com/satvikpraveen/numpymasterpro
A hands-on, production-ready toolkit to master NumPy — from first principles to real-world applications. Includes modular Jupyter notebooks, reusable utility scripts, cheatsheets, and advanced projects like K-Means clustering from scratch.
broadcasting data-analysis data-science data-source data-visualization jupyter-notebook kmeans-clustering linear-algebra machine-learning matrix-algebra numerical-computation numpy numpy-broadcasting numpy-examples numpy-tutorial open-source python scientific-computing standardization vectorization
Last synced: 08 May 2026
https://github.com/emediongfrancis/unified-data-lake-implementation-gcp-kafka-airflow-snowflake
This project demonstrates the integration of data from multiple sources into a unified data lake. The project showcases the use of Apache Airflow for ETL tasks, Google Cloud Storage as a data lake, Apache Kafka for data movement automation, Snowflake for data warehousing, and Google BigQuery for analysis.
airflow data-analysis data-warehousing etl etl-pipeline gcp-storage kafka snowflake value variety
Last synced: 07 Feb 2026
https://github.com/farzeen-2001/hr_analytics_dashboard_powerbi
HR data analytics using Power BI
data-analysis data-visualization datacleaning hr powerbi
Last synced: 25 Feb 2026
https://github.com/tr41z/machine-learning
machine learning models
ai artificial-intelligence data-analysis data-preprocessing google-colab jupyter-notebook machine-learning models python tensorflow
Last synced: 01 Feb 2026
https://github.com/keneandita/exploratory-data-analysis-eda-
Explore EDA on 5 datasets: Titanic 🚢, Heart Disease ❤️, Wine Quality 🍷, Car Price 🚗, and NBA Players 🏀. Includes data cleaning, preprocessing, and visualizations to uncover insights. Perfect for beginners to learn data analysis with Pandas, Matplotlib, and Seaborn! 🎨📈
data-analysis data-visualization eda matplotlib pandas python seaborn sklearn
Last synced: 15 Apr 2026
https://github.com/nagar2nd/jenson-usa-mysql-analysis
We are analyzing Jenson USA's dataset to gain valuable insights into customer behavior, staff performance, inventory management, and store operations. By crafting advanced SQL queries, the analysis explores key metrics such as product sales, customer spending, and order patterns, ultimately guiding strategic decision-making and operations.
data-analysis problem-solving sql
Last synced: 01 Feb 2026
https://github.com/yeuner/file-analysis-sql-demo
Streamlit-based application that leverages pandas, sqlite3, and file handling libraries (OpenPyXL and PyArrow) to practice SQL queries, analyze datasets, and export results. A personal project to enhance Python and SQL skills.
data-analysis dataset pandas sql sqlite streamlit vizualization
Last synced: 15 Apr 2026
https://github.com/khanovico/python-stock-analyzer
A web app implemented in Python with several data science frameworks, enabling online stock trend analysis.
amcharts-js-charts data-analysis data-visualization flask javascript pandas python scikit-learn
Last synced: 02 Feb 2026
https://github.com/shubham200137/customer-churn-analysis
In this case study, we analyze customer churn for a telecom company serving Southern California. The company faces increased competition and wants to retain customers by understanding the reasons for churn. Our objectives include improving service quality, identifying churn factors, pinpointing attractive services, and retaining high LTV customers.
data-analysis data-visualization numpy-python pandas-python sqlite tableau
Last synced: 15 Apr 2026
https://github.com/mrgeislinger/bike-data-exploration
Data exploration of bike-related data
bicycle bike data-analysis data-science
Last synced: 08 Feb 2026
https://github.com/sakan811/stress-pattern-occurrence-in-english-words
This project is intended to provide English learners with data that lets them make a data-driven guess when they encounter words whose stress placement they are unsure of.
data-analysis data-visualization english english-language english-learning language powerbi powerbi-report powerbi-visuals
Last synced: 20 Jun 2026
https://github.com/shibbir24/customer-sales-analysis-dashboard-using-tableau
Customer Sales Analysis Dashboard Using Tableau
dashboard data-analysis data-visualization sales-analysis tableau
Last synced: 08 Feb 2026
https://github.com/athari22/applied-data-science-capstone
Applied-Data-Science-Capstone
api classification data-analysis data-cleaning data-collection data-science data-scraping data-visualization data-wrangling knn machine-learning sql
Last synced: 08 Feb 2026
https://github.com/ryan-wong1/analyzing-arrest-patterns-in-chicago-data-analysis
Chicago Police Department (CPD) arrest data on offenses, locations, and demographics
data-analysis data-cleaning data-visualization exploratory-data-analysis matplotlib pandas python seaborn
Last synced: 08 May 2026
https://github.com/josericodata/statisticsapp
Interactive statistics analysis app using Python and Streamlit. Perform key statistical tests, visualise distributions, and explore data with ease.
alpha-value chi-square-test confidence-intervals data-analysis dublin dublin-ireland europe hyphotesis-tests ireland normal-distribution null-hypothesis p-value portfolio python statistics streamlit t-test tech ubuntu z-test
Last synced: 26 Feb 2026
https://github.com/jweinst1/xenon
A processing based language
data-analysis interpreter reactive-programming
Last synced: 15 Apr 2026
https://github.com/amoneva/cacc
An R Package to compute Conjunctive Analysis of Case Configurations (CACC), Situational Clustering Tests, and Main Effects
criminology data-analysis r social-science
Last synced: 15 May 2025
https://github.com/zimmi48/nixpkgs-issues
Analysis on nixpkgs issue lifetime.
data-analysis github-api nixpkgs
Last synced: 10 May 2026
https://github.com/an1mch1k-theone/project_1_hh_analyze
Project: analysis of résumés from HeadHunter
data-analysis data-analysis-project python
Last synced: 15 Apr 2026
https://github.com/sarincr/training-on-artificial-intelligence
Entree Academy's free 10-day training on artificial intelligence. The course is conducted in a blended-learning format, with a daily one-hour online session and three hours of project-based training.
artificial-intelligence artificial-intelligence-algorithms data-analysis data-science data-visualization decision-trees deep-learning deeplearning logistic-regression machine-learning machine-learning-algorithms machinelearning num numpy pandas regression scikit-learn scipy sklearn
Last synced: 10 Apr 2026
https://github.com/pranjalya/hand-washing-data-visualisation
A small data visualization project that analyzes the effect of hand washing after Dr. Semmelweis introduced it to the nurses and midwives attending childbirth.
data-analysis data-visualization jupyter-notebook pandas python3
Last synced: 06 May 2026
https://github.com/shruti23-ui/blinkit-powerbi-dashboard
A comprehensive Power BI dashboard analyzing Blinkit's sales performance, outlet metrics, and multi-tier market analytics with interactive visualizations and business intelligence insights.
data-analysis data-visualization microsoft-excel microsoft-power-bi powerbi sales-analysis sql
Last synced: 09 Feb 2026
https://github.com/jayqi/data-analysis-tools
Presentation on Data Analysis Tools
data-analysis presentation-slides
Last synced: 06 Jan 2026
https://github.com/evanmathew/northwind-traders
SQL-powered analysis of sales, employee performance, and customer behavior using PostgreSQL window functions. This project uncovers key business insights to optimize decision-making.
case-study data-analysis jupyter-notebook northwind-traders postgresql python-postgresql sql
Last synced: 20 Jun 2026
https://github.com/prince-pastakiya/human-resources-tableau-project
👥 Interactive Tableau dashboard for HR analytics — includes workforce overview, demographics, income analysis, and detailed employee records with full filtering.
chatgpt data-analysis data-visualization human-resources numpy python python-faker tableau-dashboards tableau-public
Last synced: 18 Apr 2026
https://github.com/shellynagar27/merchandise-sales-analysis
Merchandise Sales Analysis explores the sales trends of influencer Lee Chatmen’s merchandise using Power BI and Power Query. The project uncovers key insights on revenue, product performance, location impact, shipping trends, and customer reviews.
critical-thinking data-analysis data-visualization figma powerbi powerquery problem-solving
Last synced: 07 Apr 2025
https://github.com/akashprak/socialnetworkads
Predicting customer purchase behavior from the Social Network Ads dataset.
data-analysis machine-learning mlflow pandas python scikit-learn seaborn xgboost
Last synced: 30 Mar 2025
https://github.com/rdrahul123/ecommerce-sales-dashboard
This project focuses on analyzing e-commerce sales data to uncover actionable insights and improve business decision-making. Using interactive dashboards and data analysis techniques, the project evaluates key performance metrics, customer behavior, sales trends, and payment modes across different categories and regions.
data-analysis data-science excel powerbi
Last synced: 22 Mar 2025
https://github.com/haonamnguyen/data-science-job-analysis
Evaluate the factors influencing salary trends in the data science industry, including experience levels, job titles, employment types, company sizes, and remote work arrangements, to help HR teams and hiring managers make data-driven decisions regarding compensation packages and recruitment strategies.
data-analysis data-science data-visualization jupyter-notebook python
Last synced: 16 Apr 2026
https://github.com/praveen-devknight/event-registration-analytics-dashboard
This project presents an interactive and visually-rich Power BI dashboard that analyzes registration data from a college-level technical and non-technical event, Teciton. The dashboard provides comprehensive insights into participant demographics, event preferences, food choices, and time-based trends.
data-analysis data-visualization excel powerbi sql
Last synced: 11 Feb 2026
https://github.com/prakshi-23/tableau
Report using Tableau
dashboard data-analysis data-visualization report tableau
Last synced: 11 Feb 2026
https://github.com/iness000/online-retail-customer-segmentation
This project performs comprehensive customer segmentation analysis on an online retail dataset using machine learning clustering techniques and RFM (Recency, Frequency, Monetary) analysis. The goal is to identify distinct customer segments to drive better customer relationship management strategies and business insights.
customer-segmentation data-analysis k-means
Last synced: 31 Aug 2025
https://github.com/sharmas1ddharth/mode_of_transport_analysis
This project requires you to understand which mode of transport employees prefer for commuting to their office. The data includes information about each employee's mode of transport as well as personal and professional details such as age, salary, and work experience. We need to predict whether or not an employee will use private transport, and which variables are significant predictors of this decision.
Last synced: 11 Feb 2026
https://github.com/scailfin/benchmark-templates
Workflow Templates are parameterized workflow specifications for the Reproducible Open Benchmarks for Data Analysis Platform (ROB)
benchmarks data-analysis reproducibility
Last synced: 16 Jan 2026
https://github.com/thlindustries/mortalidade_neonatal_python_react
A data visualization platform built with Python and React, using Plotly's visualization library
data-analysis data-visualization plotly python python3 react reactjs
Last synced: 16 Apr 2026
https://github.com/mlund2k/project-1-baseball-performance-vs.-attendance
Project assets for my first exploratory data analysis: Baseball Performance vs. Attendance.
bigquery data-analysis data-cleaning data-visualization excel rstudio sql tableau tidyverse
Last synced: 12 Feb 2026
https://github.com/marknature/machine-learning-intern
Machine Learning tasks involving the Titanic Dataset and Breast Cancer Wisconsin (Diagnostic) dataset
data-analysis github jupiter-notebook machine-learning matplotlib numpy pandas python scikit-learn sklearn
Last synced: 10 Apr 2026
https://github.com/als8446/tripleten-data-science-projects
Projects overview: projects made in the Data Scientist course from TripleTen LatAm
data data-analysis hypothesis-tests machine matplotlib numpy pandas python scipy sklearn
Last synced: 10 Apr 2026
https://github.com/koldlight/bluetab-data-science-2017
Repository for sharing material and publishing the challenges
course data-analysis data-science exercises
Last synced: 12 Feb 2026
https://github.com/nabilshadman/power-bi-essential-training
Exercise files for Power BI Essential Training (2024): datasets and dashboards for hands-on learning
dashboard data-analysis data-science data-visualization power-bi power-bi-dashboard
Last synced: 12 Feb 2026
https://github.com/ankit21111/carpredict
This project predicts car prices using machine learning models, including Simple and Multiple Linear Regression. It covers data acquisition, feature selection, and optimization techniques like Ridge Regression. The best model, Multiple Linear Regression, achieved an R² score of 0.84. Check out the full analysis in the repository!
data-analysis data-visualization matplotlib numpy pandas pyhton scipy seaborn sklearn
Last synced: 16 Apr 2026
https://github.com/nmelgar/birthday_sports_dataviz
We analyze how the Matthew effect has influenced professional sports players.
analysis csv data data-analysis data-science data-visualization datavisualization dataviz probability research tableau
Last synced: 08 Jan 2026
https://github.com/rahulsm20/storedata
A data analysis project aimed at analyzing the sales data of the super store and providing useful insight into customer preferences.
data-analysis matplotlib numpy pandas python streamlit
Last synced: 16 Apr 2026
https://github.com/abhirajp595/python
Data Science Project using Python
data-analysis data-science data-visualization eda jyputer-notebook numpy pandas statistics
Last synced: 08 May 2026
https://github.com/kalyan4636/chocos-sales-analysis-report-and-dashboard.-
📊 Built using Power BI, this dashboard delivers actionable insights to boost strategic decision-making.
bussiness-analyst data-analysis data-visualization dataanalyst microsoft-power-bi powerbi
Last synced: 26 Jan 2026
https://github.com/mysftz/statistical-analysis
An in-depth review of statistical analysis in Python from datasets.
data-analysis python python3 statistics university university-project
Last synced: 14 May 2025
https://github.com/mananabbasi/dashboard-power-bi
This repository showcases Power BI projects focused on data visualization and business intelligence. Each project transforms raw data into interactive dashboards and reports, providing actionable insights for decision-making. The repository includes Power BI files, datasets, and documentation for each project.
data-analysis data-science data-visualization powerbi
Last synced: 13 Feb 2026
https://github.com/celineboutinon/la-faim-dans-le-monde
OpenClassrooms Data Analyst 2022-2023 - Project 4
data-analysis data-analytics data-visualisation dataframes matplotlib-pyplot numpy pandas python seaborn
Last synced: 20 Jul 2025
https://github.com/ranagaballah/true-fake-news
True Fake News Detector NLP model
data-analysis data-science data-visualization deployment machine-learning matplotlib nlp numpy pandas python
Last synced: 09 May 2026
https://github.com/drod75/burger_king_analysis
A simple analysis of a Burger King dataset.
data-analysis data-visualization jupyter-notebook pandas python seaborn
Last synced: 09 May 2026
https://github.com/mo-elshamy/machine-learning-practice
This repository is a collection of my work and learning in machine learning during my internship at Cellual-Technologies, including algorithm explanations, data preprocessing workflows, and two projects.
data-analysis data-science dbscan decision-trees eda gradient-boosting gxboost hierarchical-clustering kmeans-clustering knn-classification linear-regression logistic-regression machine-learning model pca polynomial-regression preprocessing random-forest support-vector-machines training
Last synced: 14 Feb 2026
https://github.com/guermoud98/data-analysis-with-python-projects
data-analysis matplotlib pandas python seaborn
Last synced: 16 Apr 2026
https://github.com/fhdsl/seattlestatsummer_r
A 4-day introduction to R programming, focused on Fred Hutch Research Interns
beginner beginner-friendly course data-analysis data-science introduction-to-programming r-programming tidyverse
Last synced: 19 Mar 2026
https://github.com/emanoelcampos/python-onemonth
This repository contains educational materials and projects developed during a Python course offered by OneMonth. It covers Python basics, intermediate concepts, web development with Flask, and data analysis with pandas. The course is structured into weeks, each focusing on a different aspect of Python programming and its applications.
data-analysis flask jupyter-notebook onemonth python python3
Last synced: 09 May 2026
https://github.com/aidan-zamfir/advt-analysis
Web scraping project. Will eventually use character/episode data for NLP and networking/data analysis.
data-analysis nlp python selen webscraping
Last synced: 23 Aug 2025
https://github.com/risdorn/restaurant-delivery-platforms-analysis-bdm-project
This project analyzes restaurant delivery platforms to understand customer preferences, industry competition, and expansion opportunities. Conducted as part of the BDM project from IITM, it includes descriptive stats, distribution, correlation, regression, and geospatial analysis using multiple datasets.
data-analysis data-visualization jupyter-notebook kaggle
Last synced: 15 Feb 2026
https://github.com/celineboutinon/product-classification
CentraleSupélec/OpenClassrooms Data Scientist 2024-2025 - Project 6
api classification-models data-analysis data-science data-visualization e-commerce image-classification marketing marketing-analytics product-classification rgpd scraping-python text-classification
Last synced: 29 Jun 2026
https://github.com/guptaachin/airline-sentiment-analysis-from-twitter-feeds
Sentiment analysis of airline service providers based on Twitter feeds
classification data-analysis data-science jupyter-notebook machine-learning natural-language-processing pandas pca python sklearn-library tf-idf visualization
Last synced: 09 May 2026
https://github.com/swethajoseph/sales-eda-project
Performed an advanced Excel-based exploratory data analysis (EDA) of an E-Commerce sales dataset to create an interactive dashboard for uncovering key business insights.
advancedexcel data-analysis data-visualization datacleaning dataformatting exploratory-data-analysis msexcel pivot-tables
Last synced: 19 Mar 2026
https://github.com/prekshivyas/cis-595-big-data-analytics
Comprehensive real estate price prediction project, integrating socioeconomic indicators and property features.
data-analysis data-cleaning data-mining data-preprocessing data-science data-visualization data-wrangling exploratory-data-analysis web-scraping
Last synced: 16 Feb 2026
https://github.com/lotfiferaga/energeiahub
data-analysis data-visualization energy-consumption python streamlit
Last synced: 09 May 2026
https://github.com/arunesh-tiwari/sales-analysis
Tableau Data Analysis Project.
data-analysis data-visualization tableau
Last synced: 01 Mar 2026
https://github.com/leandrocollares/home-team-advantage-in-epl
Home team advantage in the English Premier League: an exploratory data analysis
data-analysis matplotlib pandas plotly
Last synced: 11 Jun 2026
https://github.com/rachit901109/simppl_task
Social Media Analytics Dashboard
dashboard-application data-analysis data-visualization network-graphs social-network-analysis
Last synced: 16 Apr 2026
https://github.com/mehrab-kalantari/olympics-data-analysis
A streamlit application to analyze the Olympics dataset from several views
data-analysis streamlit-dashboard streamlit-webapp
Last synced: 20 Apr 2026
https://github.com/zxjahid/matplotlib
A comprehensive guide to mastering data visualization with Matplotlib through hands-on examples and advanced techniques. 🚀📊
candlestick candlestick-chart cheatsheet data-analysis data-visualization gtk jupyter-notebook maps matplotlib-python pandas thesis-template tk tutorial wx
Last synced: 09 May 2026
https://github.com/oyebamiji-micheal/data-analysis-with-python-zero-to-pandas
This repository contains all the assignments and the project I completed while taking the course "Data Analysis with Python: Zero to Pandas" on Jovian
data-analysis numpy pandas python
Last synced: 10 Apr 2026
https://github.com/abeltavares/hotel_performance_analysis
A Power BI project that analyzes the performance of a hotel, including revenue, expenses, customer data, hospitality metrics and financial ratios.
business-intelligence data-analysis expenses financial-analysis hospitality-industry power-bi revenue
Last synced: 02 Mar 2026
https://github.com/badranalyst/covid-deaths-dashboard-with-tableau
This project showcases an interactive dashboard developed in Tableau to visualize COVID-19 deaths data. It provides insights into trends, geographical distributions, and key metrics related to mortality during the pandemic. The dashboard aims to enhance understanding of the data, supporting public health analysis and decision-making.
covid-19 dashboard data data-analysis data-visualization dataset tableau tableau-dashboards visualization
Last synced: 02 Mar 2026
https://github.com/mysftz/numerical-methods-in-matlab
MATLAB scripts from multiple data analysis assignments.
data-analysis data-science matlab university university-assignment
Last synced: 14 May 2025
https://github.com/luminati-io/target-dataset-samples
A sample dataset of over 1,000 Target products, extracted using the Bright Data API, ideal for brand reputation monitoring, inventory tracking, and price optimization.
api data-analysis data-mining datasets target web-scraper web-scraping
Last synced: 04 Jan 2026
https://github.com/anuppm9917/data-processing-and-csv-to-json-using-python-project
This project guides you through processing data from CSV to JSON format using Python. You'll learn to cleanse, validate, and transform data with pandas, numpy, csv, and json libraries, ensuring it's ready for POS system integration. This will help improve data integrity and streamline integration.
csv-files data data-analysis data-cleaning data-collection data-transformation data-validation python3 transformation
Last synced: 16 Apr 2026
https://github.com/arraypd/data-analysis-with-python-and-sql
data-analysis grafana matplotlib pandas polars postgresql pyspark python seaborn sql
Last synced: 09 Apr 2026
https://github.com/vadniks/akabigdata
Technologies and tools for big data analysis
applied-mathematics association-rule-learning classification clustering data-analysis data-visualization ensemble-learning machine-learning-algorithms python3 statistics
Last synced: 23 Sep 2025
https://github.com/marvinmarnold/oipm_stop_search
OIPM's analysis on Stop & Search (frisk) activity by the New Orleans Police Department.
data-analysis frisk new-orleans oipm police search stop
Last synced: 22 Jul 2025
https://github.com/kathisnehith/austin-crime-report-analysis
Data analysis and visualization of crime trends in Austin
crime-reporting data-analysis data-visual database reporting sql tableau
Last synced: 25 Feb 2026
https://github.com/beyzabasarir/northwind-traders-analysis
Northwind dataset analysis using PostgreSQL, Python, and Power BI. Focused on sales, customers, shipping, and performance insights.
dashboard data-analysis data-visualization jupyter-notebook matplotlib numpy pandas postgresql powerbi python seaborn
Last synced: 10 Apr 2026
https://github.com/carlosvinimsouza/full-tutorial-python
My completed Python tutorial
data-analysis data-science data-structures django django-framework fastapi fastapi-framework flask flask-web frameworks learn-to-code learning python python3 roadmap tutorial tutorial-code
Last synced: 10 Apr 2026
https://github.com/shrikantnaidu/greyatom-projects
GreyAtom Projects.
data-analysis data-science greyatom machine-learning portfolio
Last synced: 24 Jul 2025
https://github.com/grindelfp/logistic-regression-study
Example of a logistic regression data analysis, with an exercise on it.
data-analysis ipynb logistic-regression python
Last synced: 03 Mar 2026
https://github.com/singhs05/global-youtube-trends
Understand the impact of likes, comments, and dislikes on video consumption for videos that were trending.
data-analysis mssqlserver query sql
Last synced: 18 Mar 2026
https://github.com/soypete/example-go-dataframes-parser
example of https://godoc.org/github.com/kniren/gota/dataframe
data-analysis data-science datastructures golang-examples ml
Last synced: 12 Sep 2025
https://github.com/abhipatel35/gym-performance-analysis
Analyzing gym performance and user engagement in Arizona using Spark SQL, PySpark, and visualization techniques on the Yelp dataset.
apache-spark asu business-insights data-analysis data-processing-at-scale data-visualization dps gym-analysis rating-patterns sql trend-analysis user-insights yelp-dataset
Last synced: 16 Apr 2026
https://github.com/kosuri-indu/allaboutolympics
All About Olympics is an interactive dashboard presenting comprehensive data and insights on Olympic Games from 1896 to 2020.
data-analysis pandas plotly python streamlit
Last synced: 16 Apr 2026
https://github.com/akash-srm/user-engagement-analysis
Analyzed user engagement and feedback data to derive actionable insights for an online learning platform.
analytics-projects data-analysis data-cleaning eda jupyter-notebook pandas python seaborn student-engagement
Last synced: 16 Apr 2026
https://github.com/devanshsahu47/talentscape-glassdoor-analysis
TalentScape is an end-to-end Python project that cleans and analyzes a comprehensive Glassdoor Jobs dataset. It features robust data wrangling and 20 insightful visualizations to uncover trends in job titles, salary ranges, company ratings, and more—providing actionable recommendations to optimize recruitment and compensation strategies.
business-intelligence data-analysis data-vizualisation jupyter-notebook python3
Last synced: 15 May 2026
https://github.com/johannaschmidle/netflix-subscription-analysis
Examined Netflix subscription data to understand market behaviour, predict future trends, and identify consumer preferences. [SQL, Tableau]
data-analysis data-cleaning data-trend data-visualization netflix
Last synced: 05 Mar 2026
https://github.com/e1washere/weather-spark-pipeline
Scalable pipeline using Apache Spark to process and analyze weather data.
apache-spark batch-processing big-data data-analysis data-engineering data-pipeline data-processing etl python spark-sql weather-data
Last synced: 17 Apr 2026
https://github.com/nischay002/us-honey-production-analysis
Analysis of US honey production (1995–2021) using Python & data visualization. Identifies trends in honey yield, pricing, and colony distribution across states.
data-analysis data-visualization exploratory-data-analysis honey-production matplotlib pandas python seaborn us-agriculture
Last synced: 26 Feb 2025
https://github.com/drill-n-bass/ovh-project
The goal of this task is to prepare a statistical analysis of a set of data from disks.
anaconda analysis data-analysis data-analysis-python jupyter-notebook matplotlib-python pandas python3 seaborn-plots
Last synced: 09 May 2026
https://github.com/sdley/cas_pratiques_a_rendre
Practical data processing exercises with Python.
Last synced: 09 May 2026
https://github.com/kheriberto/knn_project
This is a simple project that uses dummy data to practice and demonstrate my knowledge of the KNN algorithm.
data-analysis knn-classifier numpy python scikit-learn seaborn
Last synced: 02 Apr 2026