Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-30 00:07:38 UTC
- JSON Representation
https://github.com/NurFakhri/scraping-and-analysis-skincare
Scraping and data analysis of Indonesian skincare reviews.
beutifulsoup data-analysis data-scraping python requests review scraping-websites
Last synced: 12 Oct 2025
https://github.com/bhaveshbhakta/flight-price-prediction-using-ml
Flight Price Prediction
data-analysis data-visualization flight-price-prediction machne-learning random-forest
Last synced: 12 Oct 2025
https://github.com/vimlesh-gupta/blinkit_data_analytics_project
End-to-end Blinkit data analytics project using Python, SQL Server & Power BI
blinkit data-analysis eda pandas powerbi python sql-server
Last synced: 06 May 2026
https://github.com/jeffbrennan/analysis-templates
Templates of commonly used graphics/functions/settings to help focus on the bigger picture
Last synced: 12 Oct 2025
https://github.com/tzerk/esr
R package 'ESR' for plotting and analysing ESR spectra in dating applications
data-analysis data-visualization electron-spin-resonance geochronology r
Last synced: 13 Mar 2026
https://github.com/alexondata/daan_eda-exploratory-data-analysis_ecommerce
This project presents an Exploratory Data Analysis (EDA) pipeline for an eCommerce dataset, integrating Python, SQL Server, and Power BI to transform raw transactional data into meaningful business insights. The project was developed as part of an academic assignment at Transilvania University of Brașov, Faculty of Mathematics and Computer Science.
data-analysis data-visualization ecommerce microsoft-sql-server powerbi python
Last synced: 18 May 2026
https://github.com/benami171/ml_knn_decision-trees
A ml implementation comparing Decision Trees and k-Nearest Neighbors (k-NN) algorithms for Iris flower classification. Features comprehensive analysis of different approaches including brute-force and entropy-based decision trees, along with k-NN using multiple distance metrics.
classification cross-validation data-analysis decision-trees iris-dataset k-nearest-neighbours machine-learning nearest-neighbors python
Last synced: 30 Jun 2025
https://github.com/kingflow-23/ai-related-article-detector
Create a simple system that determines whether an article is related to AI or not using web scraping, text representation, and a classifier.
data-analysis data-engineering data-science logistic-regression pca-analysis scraping selenium umap
Last synced: 04 May 2026
https://github.com/zulhaditya/web-scraping-python
A repository that stores various source code and web scraping methods using Python.
data-analysis python3 webscraping
Last synced: 12 Oct 2025
https://github.com/vedantshi/coffee-sales-dashboard
This project analyzes coffee sales data using Excel, featuring data cleaning, trend analysis, and an interactive dashboard. Key insights highlight top-performing products, regional sales trends, and seasonal patterns. Recommendations focus on marketing strategies and inventory optimization. Future plans include Power BI integration for visuals.
business-insights data-analysis data-visualization excel-dashboard pivot-tables sales-trends
Last synced: 05 Jan 2026
https://github.com/apfirebolt/numpy-and-pandas-examples
Some examples and sample datasets to learn numpy, pandas and other data science libraries in Python
data-analysis jupyter-notebook numpy pandas python
Last synced: 17 Apr 2026
https://github.com/takshshah-16/spotify_eda
Spotify data analytics and advanced querying
data-analysis eda pgadmin4 postgresql
Last synced: 30 Oct 2025
https://github.com/khushi-sabarad/adinsights_dashboard
AdInsights Dashboard: An interactive web dashboard built with Python (Flask, Pandas, Plotly) to visualize and analyze digital advertising performance. Allows filtering by gender, ad type, and location for detailed insights
ad-performance advertising dashboard data-analysis data-visualization flask pandas plotly python web-application
Last synced: 01 May 2026
https://github.com/sunsided/esc2024
Exploratory Data Analysis on the ESC 2024 results
csv data-analysis eurovision-song-contest scraping
Last synced: 18 Feb 2026
https://github.com/chanmeng666/advanced-neural-network-applications
Practical implementations of perceptron and linear neuron models for classification and regression, with mathematical analysis and visualizations in Jupyter notebooks.
classification data-analysis data-science educational gradient-descent jupyter-notebook linear-neuron machine-learning matplotlib neural-network neural-networks numpy perceptron python regression
Last synced: 03 May 2026
https://github.com/chanmeng666/douban-review-scraper
【One star = One happy developer doing a little dance 💃⭐️】A robust Python scraper for collecting and analyzing movie reviews from Douban.com, featuring comprehensive data processing and analysis capabilities.
beautifulsoup4 data-analysis data-processing douban movie-reviews pandas python sentiment-analysis text-mining web-scraping
Last synced: 02 May 2026
https://github.com/mohit01chugh/edu_sql_analysis
SQL queries used to analyze student data.
data-analysis database education plpgsql postgresql sql
Last synced: 17 May 2026
https://github.com/stefagnone/-employee-salary-analysis-and-insights
Predictive analysis of employee salary determinants for an anonymized dataset, highlighting key factors influencing salary and providing insights for salary policy improvements.
business-intelligence data-analysis data-science employee-salary-analysis excel gender-pay-gap predictive-insights regression-modeling spss statistical-analysis
Last synced: 23 Feb 2026
https://github.com/jedrzej-wydra/data-analysis-associate
Associate Data Analyst Exam by DataCamp
Last synced: 23 Mar 2025
https://github.com/jedrzej-wydra/data-analysis-pro
Professional Data Analyst Exam by DataCamp
Last synced: 23 Mar 2025
https://github.com/korniichuk/pydatan-homework
Python Data Analysis course homework
course data-analysis data-analysis-python python python3
Last synced: 06 May 2026
https://github.com/gmalbert/supreme-court
Data Analysis of the US Supreme Court from 1790 to present
data-analysis data-science supreme-court
Last synced: 31 May 2026
https://github.com/arianarmw/da01-bike-sharing-analysis
🚴♀️ Data analysis project on bike-sharing systems. Includes data wrangling, exploratory data analysis (EDA), visualization, and interactive dashboards built with Streamlit. Explore patterns in bike usage and rental data!
bike-sharing-analysis data-analysis exploratory-data-analysis python streamlit visualization
Last synced: 11 Apr 2026
https://github.com/dhruwsunita/iphones-eda-analysis
EDA analysis on apple products.
data-analysis data-visualization eda matplotlib numpy pandas plotly python seaborn
Last synced: 06 May 2026
https://github.com/gmalbert/rugby
Rugby Data Analysis and Sports Betting
data-analysis rugby sports-betting
Last synced: 31 May 2026
https://github.com/rohithsaji97/toll_gate
This is a electronic toll collection system.
data-analysis digital-image-processing ocr-text-reader opencv python3 trained-models
Last synced: 29 Apr 2026
https://github.com/siddharthbadal/kpmgdataanalysisproject
Data Analytics Consulting Virtual Internship
data-analysis data-cleaning data-visualization googlestudio msexcel powerpoint
Last synced: 05 Jan 2026
https://github.com/shriansh8619/sql_eda
Explored relational databases using SQL to perform comprehensive Exploratory Data Analysis (EDA), covering database exploration, segmentation, trend analysis, and performance ranking. Developed reusable SQL scripts to analyze dimensions, measures, and time-based metrics, helping uncover key business insights.
data-analysis exploratory-data-analysis mysql
Last synced: 20 Aug 2025
https://github.com/haonamnguyen/costumer-shopping-trends-analysis
This project analyzes a synthetic dataset of customer shopping behavior to see key trends and insights. Using SQL and Tableau, the analysis focuses on customer demographics, purchase patterns, and preferences, including age distribution, payment methods, shipping types, and top product categories.
data-analysis data-visualization sql tableau
Last synced: 05 Jan 2026
https://github.com/csoren66/customer-personality-analysis
Predict how different customer segments will respond for a particular product or service.
data-analysis data-visualization python
Last synced: 03 Mar 2025
https://github.com/jatin-s16/hr_mysql_powerbi
This repository contains raw HR data along with key business questions. I performed data cleaning using MySQL queries and wrote analytical queries to extract meaningful insights. The results were then visualised using Power BI to enhance business understanding.
data-analysis data-science data-visualization mysql powerbi
Last synced: 29 May 2026
https://github.com/okdp/platform-packages
Collection of OKDP platform packages
clickops data-analysis data-platform data-science fluxcd gitops helm kubernetes kubocd portal spark
Last synced: 26 Jun 2026
https://github.com/giseletoledo/case-study-wellness-smart
Project from the coursera course Google Data Analytics
data-analysis kaggle-dataset r
Last synced: 14 Oct 2025
https://github.com/samkazan/business-analysis-tableau
Business Analysis on Global/Superstore data using Tableau.
analysis data-analysis tableau visualization
Last synced: 08 Feb 2026
https://github.com/jbizzlefoshizzle/crowdfunding-trends-excel
Excel project examining funding trends for Kickstarter projects
category-breakdown data-analysis excel kickstarter kickstarter-campaigns line-graph pivot-charts pivot-tables trends
Last synced: 05 Jan 2026
https://github.com/anushkundu/london-housing-market-analysis
London Housing Market Analysis: An Insightful Power BI Dashboard"
data-analysis data-visualization powerbi transformation
Last synced: 27 Jan 2026
https://github.com/ayorick23/python-data-science-cheat-sheet
Guía rápida y práctica de sintaxis, comandos y funciones esenciales de Python para Ciencia de Datos. Perfecta para recordar cómo usar las librerías más comunes como NumPy, Pandas, Matplotlib y Scikit-learn en tus análisis diarios.
cheat-sheet data-analysis data-science data-visualization deep-learning jupyter-notebook machine-learning matplotlib ml numpy pandas python scikit-learn scipy seaborn statistics sympy tensorflow
Last synced: 07 Apr 2026
https://github.com/fnu-ankit/8-week-sql-challenge
My attempt on solving Case studies from #8WeeksSQLChallenge
8-week-sql-challenge 8-weeks-sql-challenge 8weeksqlchallenge case-study data data-analysis data-analysis-sql data-analytics database datawithdanny sql sqlserver
Last synced: 19 Apr 2026
https://github.com/pseudomanifold/pump
A generic data flow program
c-plus-plus-11 cplusplus data-analysis data-flow small
Last synced: 14 Oct 2025
https://github.com/siddhantprateek/machine-learning-resources
Machine Learning Resources
best-practices clustering-algorithm data-analysis deep-learning in-progress journey linear-regression machine-learning machine-learning-algorithms neural-language-modelling neural-language-processing neural-network numpy python3 read reinforcement-learning-algorithms tensorflow visualisation
Last synced: 07 May 2026
https://github.com/karlyndiary/coffee-shop-sales-analysis
Comprehensive analysis of coffee shop sales utilizing Pandas for data cleaning and exploratory data analysis (EDA), complemented by Streamlit for creating interactive data visualization dashboards.
data-analysis data-cleaning data-preprocessing data-visualization eda pandas streamlit streamlit-dashboard
Last synced: 07 May 2026
https://github.com/blakeziegler/binary-classification-competition
Binary Classification of Insurance Crosselling Kaggle Competition
data-analysis data-science database kaggle kaggle-competition machine-learning python rstudio scikit-learn xgboost
Last synced: 17 Nov 2025
https://github.com/jaseel342/pizza_sales_report
This Pizza Sales dashboards provide valuable insights, including sales trends, pizza category breakdown, size distribution, top-selling, and least-selling pizzas, enabling data-driven decisions to boost sales and business performance.
data-analysis dax-query power-query powerbi sql sql-server-management-studio visualization
Last synced: 05 Jan 2026
https://github.com/saisurajmatta/healthcare-data-analytics
Power BI project analyzing Emergency Department data, demonstrating skills in data transformation, DAX, and visualization. It focuses on patient flow, wait times, demographics, and satisfaction, providing actionable insights for healthcare improvement. Includes documentation, data dictionary, and code samples.
data-analysis data-modeling data-visualization dax power-bi powerbi-visuals powerquery
Last synced: 22 Jan 2026
https://github.com/marielachirinosr/nyc-taxi-trip-exploration-2019-2020
Explores passenger behavior & impact of COVID-19 on NYC taxi industry (Q1 2019-2020).
bigquery data data-analysis data-visualization python sql tableau
Last synced: 15 Jun 2026
https://github.com/omnipotence-eth/manufacturing-quality-analytics
SQL + Python pipeline for semiconductor NCR analysis — supplier performance, defect Pareto, yield trends
analytics data-analysis etl manufacturing matplotlib pandas postgresql python quality sql
Last synced: 11 Apr 2026
https://github.com/saisurajmatta/e-commerce-sales-advanced-data-analysis
Excel-based e-commerce analytics for FNP, a gift company. It covers data extraction, modeling, and visualization, providing actionable insights on revenue, customer behavior, and operations. Key skills include Excel, Power Query, Power Pivot, and DAX. The analysis culminates in data-driven business recommendations.
data-analysis data-visualization dax excel power-pivot power-query
Last synced: 22 Jan 2026
https://github.com/a26nine/msc-dissertation-bitcoin-dashboard
An interactive data visualisation dashboard built using Tableau Desktop to research and analyse the relationship between the price volatility and adoptability of bitcoin.
data-analysis data-science data-visualization tableau tableau-desktop tableau-prep
Last synced: 17 Feb 2026
https://github.com/jaseel342/bike_sales_analysis_using_excel
data-analysis excel excel-dashboard
Last synced: 24 Feb 2026
https://github.com/zeynepcol/Data-Analysis-Visualization
Data Analysis
data-analysis data-science data-visualization matplotlib pandas plotly python scipy seaborn streamlit
Last synced: 15 Oct 2025
https://github.com/sanjayankur31/20181206-neurofedora
Slides for my NeuroFedora seminar at the UH Biocomputaiton group's weekly seminar
computational-neuroscience data-analysis neurofedora neuroimaging neuroscience open-science
Last synced: 19 Feb 2026
https://github.com/badranalyst/exploratory-data-analysis-on-salaries-dataset
Performing EDA on a dataset related to salaries, exploring relationships between factors like job titles, industries, and locations. Insights are visualized with plots to identify trends and disparities in salary data.
data-analysis dataset eda exploratory-data-analysis pandas python
Last synced: 07 May 2026
https://github.com/whis99/data_analysis_journey
A repositories of my data analysis projects.
data data-analysis data-analysis-python data-visualization dataset jupyter-notebook matplotlib python visualization
Last synced: 07 May 2026
https://github.com/kunalpisolkar24/winequalityprediction
Predicting wine quality using machine learning with matplotlib, numpy, pandas, and seaborn for insightful data analysis. 🍇🤖📊
data-analysis data-science data-visualization machine-learning prediction-model
Last synced: 16 Oct 2025
https://github.com/ashithapallath/r-lab
This repository offers a collection of exercises, assignments, and projects designed for the R Programming course. It focuses on utilizing R for data analysis, statistical modeling, and visualization tasks.
data-analysis exploratory-data-analysis machine-learning r-language visualization
Last synced: 16 Oct 2025
https://github.com/sngr0x0/ranklytics-kr
OP.GG Scraping
data-analysis league-of-legends matplotlib opgg playwright-python scraping visualization
Last synced: 16 Oct 2025
https://github.com/aishwaryahastak/ipl_analysis
Analysis of IPL dataset using PySpark
Last synced: 16 Oct 2025
https://github.com/rita94105/ethereum-fraud-detection
This project focuses on detecting fraudulent transactions in the Ethereum network using both traditional machine learning models and deep learning techniques. By analyzing transaction attributes and interaction patterns, we aim to develop an effective fraud detection model.
data-analysis deep-learning ethereum fraud-detection machine-learning
Last synced: 01 May 2026
https://github.com/bhaveshbhakta/fish-weight-prediction-using-ml
Fish Weight Prediction
data-analysis data-visualization fish-weight-prediction gradient-boosting machine-learning
Last synced: 16 Oct 2025
https://github.com/fatihilhan42/nba-players-data-1950-to-2021
In this project, the data of the NBA players between the years 1950-2021 were examined. After the NBA players' season, height, performance, averages of points, teams and positions they played were obtained through csv files, important tables and graphs were created using data cleaning and data visualization algorithms.
data data-analysis data-engineering data-science data-visualization
Last synced: 16 Oct 2025
https://github.com/supertetelman/coursera-exdata-09
This repo contains several R scripts that were used to analyze, plot, and clean data from various datasets. These projects were part of the Coursera course, Exploratory Data Analysis. The end results of the analysis are included.
big-data course coursera data-analysis r
Last synced: 16 Oct 2025
https://github.com/rupeshtr78/machine_learning
Machine Learning TensorFlow Neural Networks Deep Learning
classification data-analysis deep-learning deep-neural-networks flink jupyter-notebook keras machine-learning machinelearning-python perceptron python3 spark tensorflow
Last synced: 11 Apr 2026
https://github.com/javedali99/machine-learning-hw-solution-notebooks
Machine Learning Homework Solution Notebooks (UCF CAP5610)
data-analysis data-preprocessing data-science decision-trees machine-learning python random-forest recommender-system supervised-learning support-vector-machines titanic-kaggle unsupervised-learning
Last synced: 05 Jan 2026
https://github.com/javedali99/geospatial-and-earth-science-data
A comprehensive collection of global earth science and geospatial datasets 🌍
data-analysis dataset earth-observations earth-science earth-sciences earthscience geography geospatial geospatial-analysis geospatial-analytics geospatial-data open-datasets satellite-data
Last synced: 05 Jan 2026
https://github.com/gaaniruddha/mphil
This repository contains a copy of my final MPhil presentation and panel report.
data-analysis gpu-imager radio-astronomy
Last synced: 03 Mar 2026
https://github.com/mattdelaune/excel_sales_dashboard
Interactive Excel Dashboard for Coffee Sales Analysis: This project leverages Excel to analyze sales data, uncover seasonal trends, regional preferences, and customer behaviors, providing actionable insights for optimizing inventory and marketing strategies.
data-analysis excel pivot-tables sales-dashboard sales-data
Last synced: 27 Jan 2026
https://github.com/dpbm/diabetes-analysis
simple diabete analysis with python
analysis data-analysis data-science data-science-projects data-set diabetes-detection diabetes-prediction machine-learning pandas python
Last synced: 11 Apr 2026
https://github.com/helosantosdesousa/analise-dados-titanic
Análise de dados com o dataset 'Titanic - Machine Learning from disaster'
analise-de-dados analise-exploratoria bootcamp bootcamp-project data-analysis data-girls data-science matplotlib numpy pandas python
Last synced: 07 May 2026
https://github.com/malucor/livros
Programa em Python para fazer uma análise de dados sobre livros, a partir de um arquivo Excel.
analise-de-dados book books bookshelf data-analysis ipynb jupyter-notebook livro livros python
Last synced: 16 May 2026
https://github.com/khulnasoft/data-science-materials
data-analysis data-engineering data-science data-visualization
Last synced: 17 Oct 2025
https://github.com/andersoncrs/arboles_de_decision_calidad_del_vino
Contiene un análisis detallado de la calidad del vino utilizando un modelo de clasificación basado en árboles de decisión. Incluye la exploración de datos, detección y manejo de valores atípicos, análisis Univariado y Bivariado, y la creación y evaluación de un modelo predictivo. El objetivo principal es predecir la calidad del vino.
data-analysis data-science data-visualization machine-learning matplotlib seaborn sklearn tree-decision
Last synced: 20 May 2026
https://github.com/rohansoni45/whatsapp-chat-analysis
This project involves analyzing WhatsApp chat data to extract valuable insights. Using Python and various libraries like Pandas and Matplotlib, the project processes and visualizes chat statistics such as message frequency, most active participants, and sentiment analysis.
chat-analysis data-analysis data-science matplotlib pandas python sentiment-analysis streamlit visualization web-app word-cloud
Last synced: 07 May 2026
https://github.com/prateek5525/online-shopping-analytics-project
The Online Shopping Analytics Project analyzed product trends, and regional sales using SQL and Tableau. Insights from the Sales and Location Dashboards highlighted key trends in demographics, product popularity, and regional performance. These findings empower businesses to optimize strategies, enhance marketing, and improve inventory management.
data-analysis excel kaggle-dataset sql tableau
Last synced: 20 Feb 2026
https://github.com/anderson-andre-p/uber-data-analysis
This repository contains a comprehensive data analysis project focused on Uber rides. The dataset used in this project is a spreadsheet obtained from Uber, containing data related to ride details, such as pick-up and drop-off locations, date and time of the ride, and the fare amount.
data-analysis data-science data-visualization python
Last synced: 15 Jun 2026
https://github.com/prakshi-23/itvedant-database-management-system-analysis-using-sql
Data Analysis using SQL
Last synced: 22 Jan 2026
https://github.com/andersoncrs/analisis_exploratorio_de_datos-eda-_rendimiento_estudiantil
Este análisis exploratorio de datos (EDA) realizado sobre el conjunto de datos de rendimiento estudiantil tiene como objetivo identificar y comprender los factores que influyen en el desempeño académico de los estudiantes. A través de la limpieza, transformación y visualización de datos, se busca descubrir patrones y relaciones significatvas.
data-analysis data-exploration data-exploration-and-preprocessing data-visualization seaborn
Last synced: 30 Mar 2025
https://github.com/antodata/hate_crimes_spain_2014_2017
Analysis of hate crimes in Spain between 2014 and 2017 using official data
chi-square chi-square-test data-analysis data-visualization datascience folium hatecrime json lgtbiq linear-regression maps matplotlib numpy pandas python python3 scipy selenium selenium-webdriver sklearn
Last synced: 14 Apr 2026
https://github.com/codeslash21/analyze-a-b-test-results
Analyze results of an A/B test run by an e-commerce website.
Last synced: 22 Jan 2026
https://github.com/27ahmad/heart-disease-diagnostic-eda
This project conducts Exploratory Data Analysis on a dataset related to heart diagnostic disease, aiming to derive valuable insights from the analysis.
data-analysis data-visualization pandas python
Last synced: 06 May 2026
https://github.com/prakshi-23/restaurant-ratings-analysis-dashboard-using-power-bi
A Power BI Report on Restaurant Ratings Analysis
data-analysis data-visualisation dax powerbi powerbi-dashboard
Last synced: 22 Jan 2026
https://github.com/yandexdataschool/ml-sweights-experiments
Experiments for the "Machine Learning on data with sPlot background subtraction" paper
data-analysis high-energy-physics machine-learning statistics
Last synced: 15 May 2025
https://github.com/eesunmoon/machine_learning
[Spring 2021] Machine Learning
data-analysis kaggle machine-learning ml python scikit-learn sklearn
Last synced: 14 Apr 2026
https://github.com/zulfachafidz/titanic_explorer_predicting_survival_with_classification_using_knn_algorithm
Tracking Life Safety with the KNN Predictive Analysis Approach. Leveraging the Titanic Dataset, we apply classification analysis to predict the fate of passengers based on a variety of features.
algorithm algorithms data data-analysis data-mining data-science datamodeling datapreprocessing dataset knn-algorithm knn-classification machine-learning machine-learning-algorithms prediction-model
Last synced: 01 Sep 2025
https://github.com/ianfelps/jornada_python
Projetos realizados durante a Jornada Python da Hashtag Treinamentos em maio de 2024.
artificial-intelligence automation data-analysis python
Last synced: 28 Apr 2026
https://github.com/souravsuvarna/whatsapp-chat-analyzer-and-visualizer-web-application
The WhatsApp chat analyzer and visualizer uses NLP algorithms to analyze chat data, tracking usage patterns and presenting insights through visually appealing charts and graphs. It helps users understand communication patterns and behaviors on WhatsApp.
data-analysis data-science data-visualization python python3 streamlit
Last synced: 18 Apr 2026
https://github.com/mtimma001/clinical-trial-data-tool-v2
Clinical Trial Data Analysis Tool is a Flask-based web app for healthcare professionals to manage and analyze clinical trial data. It features full CRUD functionality, interactive visualizations (Plotly/Matplotlib), a responsive Bootstrap UI, MySQL database integration, and Heroku deployment for accessible, scalable use.
bootstrap5 clinical-trials crud data-analysis data-visualization flask healthcare heroku mysql pandas plotly python
Last synced: 14 Apr 2026
https://github.com/lucashomuniz/Project-02
Data Analysis and Machine Learning Techniques for Liver Disease Prediction
classification-model data-analysis decision-tree healthcare-application knn-algorithm liver-disease-prediction logistic-regression machine-learning python-language python-script random-forest supervised-learning svm-model
Last synced: 20 Oct 2025
https://github.com/bhavanachitragar/data-analysis-using-pyspark
Working with pyspark module in python and using google colab environment in order to apply some queries to the dataset. The dataset consist of two csv files listening.csv and genre.csv. Also, visualizing query results using matplotlib.
data-analysis google-colab pyspark-sql
Last synced: 30 Jun 2025
https://github.com/ashwin331133/sql-pizza-outlet-sales-analysis
This project analyzes pizza sales data to gain insights into customer behavior and revenue patterns. Key analyses include customer insights, popular pizza types and sizes, revenue generation, and order trends. The findings help optimize menu offerings, staffing, and marketing strategies to boost overall business performance.
Last synced: 24 Feb 2026
https://github.com/badranalyst/student-tests-data-analysis-application
Python-based analysis of student test scores in math, reading, and writing, examining correlations with parental education, lunch type, and test preparation. Includes data cleaning, visualization, and statistical insights into factors influencing academic performance.
data-analysis data-visualization dataset matplotlib numpy pandas python sklearn
Last synced: 05 May 2026
https://github.com/saisurajmatta/nashville-housing-data-cleaning-project
Clean and standardize Nashville Housing dataset using SQL queries for improved data quality and structure.
azure-data-studio data-analysis mssql mysql sql sql-data-cleaning sql-queries sql-server-management-studio
Last synced: 23 Jan 2026
https://github.com/docuvesta/youtube-api-fragrance-channel-analytics
Engagement metrics analysis of perfume Youtube channel using Youtube API 🎀
analysis beauty-products comments data-analysis data-analysis-python engagement-metrics insights jupyter-notebook likes-count marketing marketing-analytics perfume python views-count youtube youtube-api youtube-api-v3
Last synced: 03 May 2026
https://github.com/mudassir-a/vendor-performance-analysis
vendor performance data analysis project using sql, python and power bi
data-analysis powerbi python sql
Last synced: 18 May 2026
https://github.com/shreyaamenon/data-analysis-aiml-mini-projects
mini projects to help me grow skills in data analysis, artificial intelligence and machine learning.
ai data-analysis jupyter-notebook machine-learning python
Last synced: 11 Apr 2026
https://github.com/aksoni07/movie-recommendation
A hybrid movie recommendation system designed to deliver personalized and accurate suggestions by combining user preferences, item attributes, and collaborative patterns, ensuring a seamless and engaging experience.
clustering content-based-filtering data-analysis embeddings jupyter-notebook numpy ollaborative-filtering pandas personalization python recommendation-systems scikit-learn user-item-interactions
Last synced: 11 Apr 2026
https://github.com/amanyadav-07/customer-churn-prediction
Machine Learning project to predict customer churn using Logistic Regression, Random Forest, and XGBoost. Includes data preprocessing, feature engineering, SMOTE balancing, model training, evaluation, and business insights.
accuracy-metrics data-analysis data-visualization logistic-regression machine-learning matplotlib numpy pandas python3 random-forest-classifier seaborn sklearn xgboost-classifier
Last synced: 11 Apr 2026
https://github.com/audy21/datacamp
Learning portfolio documenting my progress, while taking Data Analyst & Data Science certifications from DataCamp.
data-analysis data-science machine-learning matplotlib numpy pandas python scikit-learn seaborn
Last synced: 11 Apr 2026