Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-03 00:07:37 UTC
https://github.com/shafaq-aslam/data-analytics-dairy
A comprehensive repository for Data Analytics learning and projects. It includes MySQL, Python, Power BI, Tableau, and Excel. The goal is to analyze data, generate insights, and create compelling visualizations for real-world datasets.
data-analysis data-visualization excel excel-based-data-analysis powerbi python-scripts sql sql-queries sql-queries-for-data-manipulation sql-query-for-data-visualization tableau
Last synced: 20 Jan 2026
https://github.com/jimohola/azure-flight-price-predict-deployment
Deploying Machine Learning Models via Microsoft Azure
cloud-computing css data-analysis data-preprocessing data-visualization flask html machine-learning python3
Last synced: 09 Mar 2025
https://github.com/virajbhutada/credit-card-transaction-analysis-sql
This project provides a structured database schema and SQL scripts to analyze credit card data. It includes tools for managing and analyzing transaction data, helping to identify spending patterns and trends. The project features visual schema diagrams and supporting documentation for easy understanding.
creditcard customer data-analysis data-cleaning data-modeling database database-management insights performance-optimization postgresql query-language schema-design schema-diagram scripts sql transactions trends
Last synced: 15 May 2026
https://github.com/aicorsair/python-case-study-365-data-science-subscription-purchase-prediction
This repository contains a comprehensive case study on predicting 365 Data Science customer subscriptions using real-world student engagement data.
data-analysis data-analytics data-cleaning data-preprocessing data-science data-visualization decision-tree feature-engineering feature-selection hyperparameter-optimization hyperparameter-tuning k-nearest-neighbors logistic-regression machine-learning purchase-prediction python random-forest scikit-learn statsmodels svc
Last synced: 08 May 2026
https://github.com/petrosdemetrakopoulos/caraccidentsvrilissia
A data project for car accidents in my neighbourhood
article data-analysis data-science data-visualization machine-learning predictive-modeling
Last synced: 23 Mar 2025
https://github.com/praveingk/lipidanalysis
data-analysis data-visualisation
Last synced: 17 Mar 2025
https://github.com/trim0500/fe-stats-classifier
An experiment to create a machine learning model via PyTorch to classify select Fire Emblem unit base stat distributions.
creational-patterns data-analysis data-science data-visualization design-patterns excel jupyter jupyter-notebook matplotlib-pyplot numpy pandas python python-modules python3 pytorch singleton
Last synced: 11 Apr 2026
https://github.com/firdevstorlak/maritime-signature-lab
Prototype of a maritime signature database (acoustics, magnetics, RCS, IR) using Python, SQLite, and simple computer vision.
acoustic-signatures cli-tool computer-vision data-analysis demo-project engineering-prototype infrared-imaging maritime opencv python radar rcs relational-database scientific-computing signal-processing sqlite synthetic-data
Last synced: 07 May 2026
https://github.com/bagusperdanay7/fcc-da-mean-variance-standard-deviation-calculator
One of the Data Analysis with Python (freeCodeCamp) tasks: a Mean-Variance-Standard Deviation Calculator.
data-analysis freecodecamp-project numpy python
Last synced: 06 May 2026
https://github.com/deeksha-dhawan/pizza-outlet-analysis-using-sql
This project analyzes pizza sales data to gain insights into customer behavior and revenue patterns. Key analyses include customer insights, popular pizza types and sizes, revenue generation, and order trends. The findings help optimize menu offerings, staffing, and marketing strategies to boost overall business performance.
coding-challenge data-analysis data-science microsoft my portfolio-project programming project projects sql sql-analysis sql-project sqlproject sqlserver
Last synced: 23 Mar 2025
https://github.com/jesuserro/ab-testing-ui-redesign-vanguard
A/B testing analysis to evaluate the impact of a user interface redesign at Vanguard.
a-b-testing data-analysis eda exploratory-data-analysis testing ui-design ux-design
Last synced: 08 Jul 2025
https://github.com/82luli02/sakila_dvd_rental_database_analysis
Analysis of the Sakila DVD Rental database using SQL
data data-analysis data-science data-visualization sql
Last synced: 10 Mar 2026
https://github.com/fatihilhan42/turkey_earthquake_analysis_1915-2021_python
This project analyzes earthquakes in Turkey from 1915 to 2021. The data set, available in the repo, was first cleaned and organized; the cleaned data were then rendered as charts and animations using data visualization techniques.
data-analysis data-cleaning data-visualization jupyter-notebook
Last synced: 23 Mar 2025
https://github.com/saifalibaig/-online-retail-exploratory-data-analysis-with-python
Exploratory Data Analysis of an online retail store
data-analysis data-visualization matplotlib numpy pandas python3
Last synced: 11 Apr 2026
https://github.com/jedrzej-wydra/improving-accuracy
Improving accuracy of age estimates for insect evidence—calibration of physiological age at emergence (k) using insect size but without “k versus size” model
Last synced: 02 Sep 2025
https://github.com/manisharora96/instagram-reach-analysis
This project provides a detailed approach to analyzing Instagram reach and engagement metrics. By leveraging the code and tools shared here, you can gain valuable insights into your Instagram content's performance and optimize your strategy to grow your audience effectively.
data-analysis data-visualization instagram-reach python-tools
Last synced: 23 Mar 2025
https://github.com/nikhil-donthusaram/heartdiseaseprediction
Heart Disease Prediction App is a machine learning web application that predicts the likelihood of heart disease based on user medical inputs. Built using a Decision Tree Classifier and deployed with Streamlit for an interactive, user-friendly interface.
data-analysis descision-tree joblib jupyter-notebook machine-learning matplotlib numpy pandas python3 seaborn sklearn streamlit vscode
Last synced: 11 Apr 2026
https://github.com/steviecurran/dashboards
Compilation of Links to the dashboards in the other repositories
dashboard data-analysis data-science data-visualization pandas powerbi python-dash tableau
Last synced: 21 Feb 2026
https://github.com/ejw-data/pandas-school
Analysis of school data with Pandas
Last synced: 08 May 2026
https://github.com/mohammad-malik/covid-visualizations-d3
This project provides a dashboard with five different perspectives on the pandemic, from patient-infection relationships to regional trends and hierarchical distributions. This was developed as part of a project for the course Data Analysis and Visualization (DS3001).
covid-19 d3 d3-visualization d3js data data-analysis data-analytics data-science visualization
Last synced: 28 May 2026
https://github.com/al-ghaly/iti-project
ITI Final/Graduation Project.
data-analysis data-cleaning data-visualization data-warehousing machine-learning power-bi python-data-analysis sql statistical-analysis
Last synced: 15 Mar 2025
https://github.com/deller23/hotel_booking_data_cleaning
Efficiently transforming raw hotel booking data into actionable insights! This project leverages Python and Pandas for advanced data cleaning—handling missing values, detecting outliers, and optimizing features—ensuring a high-quality dataset ready for analysis and modeling.
data-analysis data-cleaning data-preprocessing data-visualization data-wrangling pandas python
Last synced: 31 Mar 2025
https://github.com/abdullahashfaqvirk/powerbi-dashboards
A collection of Microsoft Power BI dashboards and reports designed to address business challenges and support data driven decision-making.
dashboards data-analysis data-driven data-science microsoft powerbi reports visualization
Last synced: 10 Mar 2026
https://github.com/kailenroa/dashboad-excel-huisprijzen
This project focuses on developing a dashboard powered by Funda to visualize house pricing in the Netherlands. The dashboard simplifies the home-buying process by allowing users to compare prices, energy labels, number of rooms, and square meters across different provinces, all in one interactive platform.
dashboard data-analysis excel house-prices
Last synced: 05 Jan 2026
https://github.com/abelarduu/power_bi_analyst
Power BI project for financial data reporting, with intuitive navigation and interactive features. It offers a complete user experience, combining sophisticated presentation and effective functionality for data analysis.
dashboard data-analysis data-analytics modelagem-de-dados powerbi tratamento-de-dados
Last synced: 08 Sep 2025
https://github.com/devexpress-examples/wpf-pivot-grid-define-custom-cell-template-to-performing-data-editing
This example shows how to edit a cell with the cell editing template in Pivot Grid for WPF.
data-analysis dotnet dxpivotgrid pivot-grid pivot-grid-for-wpf wpf
Last synced: 02 May 2026
https://github.com/adilshamim8/eda-on-health-and-sleep-data
Exploratory Data Analysis (EDA) on health and sleep data, uncovering patterns and insights using Python and visualization tools.
data-analysis data-visualization eda health healthcare sleep sleep-analysis
Last synced: 15 Mar 2025
https://github.com/shahriarha/sql
Structured query language
data-analysis mysql mysql-database sql
Last synced: 02 Sep 2025
https://github.com/alphatwirl/qtwirl
qtwirl (quick-twirl), one-function interface to AlphaTwirl
alphatwirl data-analysis data-frame pandas r root-cern
Last synced: 11 Apr 2026
https://github.com/reddyprasade/r-program
R is a programming language and free software environment for statistical computing and graphics supported by the R Foundation for Statistical Computing. The R language is widely used among statisticians and data miners for developing statistical software and data analysis.
data-analysis data-science r-programming
Last synced: 11 Apr 2026
https://github.com/syed-amjad-ali/-bank-churn-ml
Predicting bank customer churn using machine learning. This project includes exploratory data analysis (EDA), feature engineering, classification models (Logistic Regression, Random Forest), and customer segmentation using K-Means clustering.
classification data-analysis data-science eda jupyter-notebook k-means-clustering machine-learning ml python segmentation
Last synced: 09 Mar 2025
https://github.com/ginalamp/covid_dashboard_twitternews
Corona Dashboard & report based on Twitter media outlet news.
dashboard data-analysis data-visualization twitter
Last synced: 28 Jan 2026
https://github.com/mradovic38/cnn-dominant-color-recognition
Detecting a dominant color of Tulip images using various convolutional neural networks.
artificial-intelligence cnn computer-vision data-analysis deep-learning dominant-color kmeans-clustering machine-learning neural-network pytorch regression resnet resnet-18 squeeze-and-excitation
Last synced: 01 Jul 2026
https://github.com/yixin0829/web-data-scraping-project
Python web scraping script and MySQL database design :bug:
beautifulsoup4 data-analysis data-visualization database gender-api matplotlib python website
Last synced: 19 Apr 2026
https://github.com/thbaylson/datascience
All of my past data science assignments put into one singular notebook. Most of this comes from my Machine Learning course.
data-analysis data-science data-visualization decision-tree jupyter-notebook k-nearest-neighbors linear-regression machine-learning neural-network pandas-library python3 scikit-learn
Last synced: 09 May 2026
https://github.com/lit26/novel-corona-virus-2019
Data Analysis for Novel Corona Virus 2019
analysis coronavirus-case data-analysis sir-model
Last synced: 10 Jun 2025
https://github.com/lucaso21/euro-2021-player-stats-analysis
A short project analyzing stats for players at the Euro 2021 tournament.
data-analysis data-science r rvest tidyverse
Last synced: 16 Mar 2025
https://github.com/lfariello/atmospheric_reentry
Matlab code for the determination of the reentry trajectory, deceleration profiles, and heat flux of the ARD capsule during orbital reentry into Earth's atmosphere.
data-analysis heat-flux-prediction heat-transfer hypersonic hypersonic-capsule matlab-programming trajectory-prediction
Last synced: 23 Mar 2025
https://github.com/b-varun-reddy/fairwai-bias-detection
Submission for the FairwAI Hospitality Intern Challenge. This project analyzes bias signals in Yelp hospitality reviews using open-source data, Python, and fairness-focused keyword detection.
bias-detection data-analysis ethical-ai fairness hospitality machine-learning natural-language-processing python social-impact yelp-dataset
Last synced: 19 Apr 2025
https://github.com/atharvapathak/rsvp_movies_case_study
SQL queries performed on IMDb database to provide recommendations to RSVP Movies based on insights.
data-analysis data-cleaning data-science imdb-dataset rsvp-movies sql
Last synced: 28 Jan 2026
https://github.com/gabrieladados/analise-ecommerce
SQL Analysis for E-commerce: Growth Strategies to Boost Sales
bigquery data-analysis ecommerce sql
Last synced: 31 Mar 2025
https://github.com/cyberoctane29/epa-carbon-monoxide-aqi-analysis
This project continues my EPA Air Quality AQI Analysis, focusing on carbon monoxide levels in EPA data. Using Python, I applied statistics, probability analysis, outlier detection, sampling, and hypothesis testing to assess pollution and health impacts. Leveraging Pandas, NumPy, SciPy, and Matplotlib, it supports environmental policy decisions.
data-analysis eda hypothesis-testing probability-distribution sampling sampling-distribution statistical-analysis
Last synced: 24 Mar 2025
https://github.com/cescedes/this-is-jeopardy
Writing several functions that investigate a dataset of Jeopardy! questions and answers.
codecademy data-analysis python
Last synced: 11 Apr 2026
https://github.com/curtisalexander/cramisc
Personal R functions for data analysis
Last synced: 12 Mar 2025
https://github.com/shiva16/da
Data Analytics - Study materials
analytics data-analysis data-science data-structures
Last synced: 07 Feb 2026
https://github.com/mansogf/datascience_introduction
Introductory Data Science practice exercises
data-analysis data-science data-visualization graph
Last synced: 04 Apr 2025
https://github.com/anuragmudgal96/data-warehouse-project
Designing and implementing a modern data warehouse on SQL Server, covering ETL pipelines, dimensional modeling, and analytical reporting.
data-analysis data-engineering data-warehouse datawarehousing etl etl-job etl-pipeline sql sql-server
Last synced: 09 Oct 2025
https://github.com/anilyigitsel/tourist-attraction-data-analysis
This project analyzes tourism trends from 2017 to 2021, focusing on visitor numbers, ratings, and attraction popularity during these years.
data-analysis data-visualization excel sql tourism
Last synced: 26 Jan 2026
https://github.com/junpenglao/spafv
SPAFV - Surface Profile Analysis for Free Viewing eye movement experiment in 2AFC task
data-analysis statistics temporal-logic
Last synced: 31 Mar 2025
https://github.com/zachbateman/easy_plot
Easy Statistical Visualization in Python
data-analysis data-visualization graphics matplotlib python seaborn
Last synced: 18 Jan 2026
https://github.com/nahiyanhkhan/sales-insight-dashboard_powerbi
A dashboard that displays insights from a company's sales data over a 4-year period, including revenue and sales quantity by region over the years.
dashboard data-analysis data-analytics data-visualization powerbi salesdashboard
Last synced: 08 Jan 2026
https://github.com/thenazar9/user-behavior-email-campaign-analysis-sql
Analysis of user behavior and email campaign performance using BigQuery and Looker Studio, focusing on account creation trends, email engagement, and user segmentation.
analytics bigquery data-analysis data-visualization etl looker-studio sql structured-query-language
Last synced: 16 Oct 2025
https://github.com/vinitgurjar/r_lang_exp
A collection of my college Data Analytics lab work and assignments; the files here contain programs written in R.
data-analysis data-visualization r
Last synced: 02 Jul 2025
https://github.com/darksoulnelson/json-to-excel-converter
This repository provides a tool to convert JSON data to Excel format (.xlsx). It allows you to easily transform structured JSON data into a well-organized spreadsheet for better analysis and visualization.
automation-script automation-tools data-analysis data-converter data-export data-formatting data-tools data-visualization excel excel-automation excel-converter excel-tools json json-exporter json-parser json-processing json-to-csv json-to-excel programming-tools spreadsheet-tools
Last synced: 05 Jul 2025
https://github.com/divyanshugit/indian-judiciary-analysis
Analysis of Indian district court data across states.
Last synced: 02 Jul 2025
https://github.com/privatepeople/dataanalysis
Repository for studying data analysis
data-analysis data-science data-visualization matplotlib numpy pandas python scikit-learn scipy
Last synced: 26 Jan 2026
https://github.com/bhiogade/hpi-analysis
House price index (HPI) Analysis
data-analysis data-cleaning data-visualization gathering-data numpy pandas-python
Last synced: 30 Apr 2026
https://github.com/odessaz/portfolio-projects
This is a repository I have created to showcase skills, share projects and track my progress in Data Analytics and Data Science
applied-mathematics data-analysis data-science excel jupyter-notebook matplotlib-pyplot pandas portfolio python r r-studio seaborn sql statistics
Last synced: 12 Apr 2026
https://github.com/adrianlardies/from-data-to-insight
This project creates and manages a MySQL database to analyze the performance of Bitcoin, Gold, and the S&P 500 in response to economic factors. It integrates historical data, executes advanced SQL queries, and visualizes key insights, showcasing the power of SQL and Python in financial analysis.
data-analysis data-science matplotlib pandas python seaborn sql
Last synced: 12 Apr 2026
https://github.com/lucashomuniz/Project-05
Statistical Analysis of Hospitalization Costs: Leveraging SQL and R for Insights
anova-analysis anova-test data-analysis finance-analysis-data language-r linear-regression sql stastistical-model statistical-analysis
Last synced: 20 Oct 2025
https://github.com/josephbarbierdarnal/matoolkit
matoolkit is a Python package containing a toolbox for creating visually appealing graphs and annotations in matplotlib.
data-analysis data-visualization matplotlib
Last synced: 31 Mar 2025
https://github.com/mumtaz4118/nlp-course
Programming Assignments and Lectures for Stanford's CS 224: Natural Language Processing with Deep Learning
course data data-analysis data-analytics data-science data-visualization deep-learning education machine-learning natural-language-processing neural-network transfer-learning
Last synced: 24 Nov 2025
https://github.com/krzysikd/apartment-prices-in-poland-analysis-and-visualization
Data Analyst portfolio project that involves cleaning, transforming, and visualizing data to create an insightful dashboard. The project uses SSIS for ETL processes, SSMS for database management and queries, and Power BI for data visualization, focusing on the analysis of rental and sales apartment prices in Poland.
data-analysis data-cleaning data-visualizations powerbi sql sqlserver ssis
Last synced: 04 Feb 2026
https://github.com/3rd-son/movie-streaming-service-analysis
Exploratory Data Analysis of streaming services such as Netflix, Hulu, and Disney+
data-analysis exploratory-data-analysis jupyter-notebook matplotlib numpy pandas plotly python seaborn
Last synced: 18 Apr 2026
https://github.com/prgermux/defect-finder
Defect Finder is an interactive Python-based GUI application for detecting and analyzing mechanical and non-mechanical defects in data. It provides defect visualization, periodicity analysis, and statistical insights, making it ideal for research and quality control workflows.
data-analysis defect-detection gui pyqt5 python quality-control statistics visualization
Last synced: 24 Mar 2025
https://github.com/kathkoeh/pimaindian-kk
Logistic regression analysis of diabetes risk using the Pima Indians dataset. Includes prevalence analysis, modeling, ROC/AUC evaluation, and patient testing in Python.
data-analysis diabetes epidemiology logistic-regression machine-learning public-health python
Last synced: 28 Apr 2026
https://github.com/hilalguleryuz/excel_supermarket_data_analysis_project
Supermarket Data Analysis with Excel
dashboard data-analysis data-visualization excel microsoft-excel supermarket supermarket-data-analysis supermarket-dataset
Last synced: 06 Jan 2026
https://github.com/wardenkenny/data-analyst-portfolio
A repository I have created to show and explore data analytics.
data-analysis excel r spreadsheets sql tableau
Last synced: 02 Apr 2025
https://github.com/parthds02/pizza_sales_sql
SQL project analyzing pizza sales data. Includes creating tables, executing queries, and solving basic to advanced analytical questions to derive insights from sales data.
analytics data-analysis data-science pizza-sales sql sql-query
Last synced: 04 Mar 2026
https://github.com/arthurosipyan/jamascript
JamaScript, your personal data assistant
blueprint data-analysis data-mining data-science exceldatareader jama jama-api python python-3 python-script python3
Last synced: 03 Sep 2025
https://github.com/eudesgccunha/automated-management-panel
Automated management panel using Power BI
data data-analysis data-visualization database excel powerbi
Last synced: 04 Feb 2026
https://github.com/torchstack-ai/cancer-biomarker-discovery
scRNASeq drug discovery and biomarker project
bioinformatics cancer-research data-analysis data-visualization r scrna-seq-analysis startup
Last synced: 01 Apr 2025
https://github.com/danielafishwickinacap/coderhouse_da
Data analyst Final Project files
Last synced: 18 Jan 2026
https://github.com/irsol/udacity-data-foundations-nd
data data-analysis data-visualization exel sql udacity udacity-data udacity-nanodegree
Last synced: 05 Mar 2026
https://github.com/bryanfks-dev/klempoken-analysis
Analysis and forecasting model for Klempoken MSMEs
big-data-analytics data-analysis data-forecast data-visualization
Last synced: 01 Apr 2025
https://github.com/shellynagar27/transportation-and-logistics-challenge
Analyzing logistics data to optimize shipment efficiency, reduce delays, and enhance supply chain visibility using Power BI. Insights include top routes, delays, supplier trends, and peak shipments.
cleaning-data critical-thinking data-analysis data-visualization exploratory-data-analysis feature-engineering powerbi preprocessing-data problem-solving python
Last synced: 16 May 2026
https://github.com/mostafa-ghorab/global-happiness-analysis
An analysis of global happiness rankings based on various factors like GDP, family support, health, and freedom from the World Happiness Report (2015-2017). This project provides data visualizations and statistical insights into how these factors influence happiness scores in different regions.
business-analysis data-analysis data-visualization matplotlib numpy pandas python seaborn
Last synced: 12 Apr 2026
https://github.com/yash-3-bit/online-sales-analysis
Project: merging datasets from different months and performing data cleaning, analysis, and visualization
data-analysis data-visualization pandas-library
Last synced: 27 Mar 2025
https://github.com/robson-python/academic-performance
Project to evaluate students' academic performance.
csv-import data data-analysis data-science jupyter-notebook machine-learning matplotlib pandas python scikit-learn seaborn vscode
Last synced: 12 Apr 2026
https://github.com/ernanej/data-science-dca0131
Files developed throughout the 2024.1 semester of the Data Science course taught at the Federal University of Rio Grande do Norte by the Department of Computer Engineering and Automation (DCA). 📚
big-data data-analysis data-science ia
Last synced: 30 Mar 2025
https://github.com/dulajkavinda/pandas-exploring-data-ml
🐼 Exploring data with pandas library.
data-analysis machine-learning pandas python
Last synced: 09 May 2026
https://github.com/theveryhim/frequent-item-sets-and-lsh
An exercise in finding frequent item sets and similar items using the PySpark framework
big-data data-analysis frequent-itemset-mining locality-sensitive-hashing pyspark text-processing
Last synced: 03 Jul 2025
https://github.com/sarveshdhond/top_25_cad_stocks
In this project I used Python, JupyterLab, and Pandas to import a data set from the Yahoo stocks website: the top 25 most active Canadian stocks on 12 July 2024. The project demonstrates skills in Python, web scraping, and Pandas.
data-analysis pandas-dataframe python webscraping
Last synced: 01 Apr 2025
https://github.com/abdelrahmanbayoumi/titanic-machine-learning-from-disasters
Given a training set of samples listing passengers who survived or did not survive the Titanic disaster, can our model determine, from a test dataset that does not contain the survival information, whether those passengers survived?
data-analysis data-science data-visualization machine-learning pandas
Last synced: 09 Apr 2025
https://github.com/grooviter/tablesaw
Java dataframe and visualization library
data-analysis dataframe java visualization
Last synced: 28 Mar 2025
https://github.com/cassandrajm/reddit-dashboard
INTERACTIVE DASHBOARD: Analyzing Political Discourse on Reddit: A Multi-Faceted NLP Approach to Toxicity, Bias, and Political Stance
capstone data data-analysis data-science politics python reddit
Last synced: 09 Apr 2025
https://github.com/nilayhangarge/data-analysis-with-python
This repository provides a practical introduction to data acquisition and analysis using Pandas. It covers loading datasets, exploring data, manipulating data, and gaining insights through statistical summaries. Ideal for beginners, it offers code examples and explanations to enhance your data manipulation skills using Pandas for Python.
data-acquisition data-analysis data-analytics data-binning data-cleaning data-engineering data-fundamentals data-insights data-integration data-preprocessing data-science data-wrangling numpy pandas python
Last synced: 12 Apr 2026
https://github.com/giatraskon/clustering_algorithms_analytical_and_computational
Analytical and computational exploration of clustering algorithms, focusing on k-means and k-medians, with MATLAB implementations and synthetic dataset analyses.
clustering computational-mathematics data-analysis data-science data-visualization k-means k-means-clustering k-median-clustering k-medians k-medoids k-medoids-clustering machine-learning matlab noise-robustness numerical-methods outlier-detection possibilistic-clustering-algorithms statistical-analysis synthetic-data unsupervised-learning
Last synced: 21 Mar 2025
https://github.com/krypten/nycsubwayturnstileweatheranalysis
Analyzing the NYC Subway Dataset
data-analysis machine-learning machinelearning python
Last synced: 01 Sep 2025
https://github.com/saroshfarhan/dublin_pedestrian_data_analysis
Pedestrian footfall data analysis for the city of Dublin
data-analysis data-visualization r-programming
Last synced: 07 Jan 2026
https://github.com/bhaskarbharati/ibm-datascience-hands-on-lab
A basic hands-on exercise using Jupyter Notebook, completed as part of the IBM course Tools for Data Science.
data-analysis data-science data-visualization datawrangling eda machine-learning
Last synced: 23 Apr 2025
https://github.com/rachit1084/sql-practice-ankit-bansal
Personal SQL problem-solving practice based on Ankit Bansal's YouTube series, with logic-driven solutions for analyst prep.
analytics data-analysis data-analyst interview-preparation logical-reasoning postgresql sql sql-practice
Last synced: 04 Jul 2025
https://github.com/lukeskywalkerii/website
data-analysis data-visualization powerbi python r sql statistics
Last synced: 12 Apr 2026
https://github.com/francois-lenne/eletric_vehicle_usa
This project is purely educational; the main goal is to use Fabric.
data-analysis data-engineering delta-lake fabric jupyter-notebook pyspark python spark
Last synced: 12 Apr 2026
https://github.com/rosanafss/data-visualization-nanodegree
Data Visualization Nanodegree
dashboard data-analysis data-preparation data-visualization design interactive sketch storytelling tableau wireframe
Last synced: 26 Jan 2026
https://github.com/soajala/shopify-sales-analysis-powerbi
End-to-end Power BI dashboard project analyzing Shopify sales data with real-time metrics, DAX, and business insights.
business-intelligence data-analysis data-visualization dax interactive-dashboard powerbi sales-analysis shopify
Last synced: 05 Sep 2025
https://github.com/ymorsi7/caliwageanalysis
California employment and wage analysis on data from the past decade.
data-analysis data-science ipynb jupyter-notebook
Last synced: 21 Jan 2026
https://github.com/smoeding/jmeterplugin-datasketches
A JMeter listener using DataSketches to estimate response time quantiles and histograms
data-analysis jmeter jmeter-listeners jmeter-plugin
Last synced: 06 Mar 2025