Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-02 00:07:33 UTC
- JSON Representation
https://github.com/apsinghanalytics/hranalytics_myersbriggspersonalityinsights
A Excel analytics study exploring the correlation between personality traits and key HR-relevant parameters, including tenure and performance
data-analysis data-visualization excel pivot-tables
Last synced: 30 Jan 2026
https://github.com/fisseha-estifanos/telecom
A showcase repository for a specific telecommunication company. Used to analyze several telecommunication data set features and generate useful insights accordingly. Insights generated could be seen at https://github.com/Fisseha-Estifanos/telecom-visualizer or at https://fisseha-estifanos-telecom-visualizer-home-huxgy0.streamlitapp.com/
data-analysis notebooks-jupyter python visual-studio-code visualization
Last synced: 12 May 2026
https://github.com/farhad-here/id_validator
Iranian National ID Validator. This was one of my data analysis project for the course i had.
data-analysis identity idverification object-oriented-programming oop oops-in-python python streamlit
Last synced: 30 Apr 2026
https://github.com/roland045/smart_fluid_sedimentation_tester
Control program for custom developed smart fluid sedimentation tester system
arduino data-analysis instrumentation measurement sensor
Last synced: 13 May 2026
https://github.com/mfakhriazhar/nlp-movie-recommender-system
This project is a content-based movie recommender system built using Natural Language Processing (NLP) techniques. By extracting and combining important text features from movie metadata, this system suggests movies that are similar to a user's selected title.
data-analysis data-science deep-learning machine-learning natural-language-processing python recommender-system
Last synced: 30 Apr 2026
https://github.com/yankh764/revenue-data-analysis
A take home assignment of improving a revenue data pipeline
data-analysis docker python sql take-home-assignment
Last synced: 30 Apr 2026
https://github.com/mitchellharrison/mitchellharrison.github.io
Welcome to my slice of the internet, where I share the knowledge that Duke gave me, so you don't have to spend the mortgage-sized amount to access it. Built with R, Python, Quarto, and love.
ai algorithms-and-data-structures blog data-analysis data-science data-visualization educational machine-learning portfolio portfolio-website quarto r r-language statistics tutorials
Last synced: 30 Apr 2026
https://github.com/manukot/sturdy-engine-python-
I've leant not only various Theoretical Concepts but also practical projects in my Masters Coursework
data-analysis data-visualization python3
Last synced: 13 May 2026
https://github.com/ahmedtaher10/covid-19-cases
The data we are using contains the data on covid-19 cases and their impact on GDP from December 31, 2019, to October 10, 2020.
data-analysis python visualization
Last synced: 30 Apr 2026
https://github.com/abhi227070/ipl-2024-sold-player-data-analysis
This project analyzes IPL 2024 auctioned players' data, including name, team, cricket type, nationality, and price. Users input a player's name to access team, style, nationality, and auction price, aiding research and fantasy leagues. It offers insights into player dynamics, serving cricket enthusiasts with comprehensive data exploration.
data-analysis data-visualization dataanalytics machine-learning machine-learning-algorithms python3
Last synced: 30 Apr 2026
https://github.com/busra-deveci/kaggle-iris_data_analysis
Exploratory data analysis and visualization of the Iris dataset using Python.
data-analysis iris-dataset kaggle pandas python seaborn visualization
Last synced: 30 Apr 2026
https://github.com/badranalyst/e-commerce-customer-analysis-data-science-foundations-case-study
This case study explores e-commerce customer data through data exploration, pre-processing, and splitting. It includes model building and training to analyze customer behavior. Python libraries like Pandas, NumPy, Matplotlib, and Seaborn are used for the analysis and model development.
data-analysis data-science dataset eda exploratory-data-analysis machine-learning matplotlib ml model-building model-training numpy pandas pre-processing python seaborn
Last synced: 01 May 2026
https://github.com/ladaegorova18/data_analysis
Learning the basics of data analysis in Python
analytics data-analysis data-visualization steam-games
Last synced: 24 Jun 2026
https://github.com/rybakov-ks/particleanalyzer
A Computer Vision-based tool for automatic segmentation and size analysis of particles in Scanning Electron Microscope (SEM) images.
computer-vision data-analysis deep-learning detectron2 electron-microscopy image-segmentation materials-characterization microscopy-images nanotechnology object-detection particle-analysis scanning-electron-microscopy scientific-research sem sem-image-analysis yolo
Last synced: 13 May 2026
https://github.com/falakrana/data-analysis-visualization
This repository showcases data analysis and visualization projects using Python and Tableau. It includes exploratory data analysis, interactive dashboards, and insightful visual stories derived from real-world datasets.
data-analysis data-visualization python tableau-public
Last synced: 01 May 2026
https://github.com/devag2004/electricity-analysis-using-spark
electricity analysis project made using spark
data-analysis spark spark-mllib
Last synced: 01 May 2026
https://github.com/shruti-h/netflix-eda
Exploratory Data Analysis on Netflix Movies & TV Shows dataset using Python, Pandas, Matplotlib, and Seaborn
data-analysis data-science eda matplotlib netflix pandas-library python seaborn
Last synced: 01 May 2026
https://github.com/cdeweyx/bryce-harper-2016-analysis
Notebook analyzing Bryce Harper's disappointing 2016 campaign in historical context through data analytics.
data-analysis data-visualization python
Last synced: 01 May 2026
https://github.com/virajbhutada/music-store-data-analysis-sql
Hands-on SQL data analysis project for music store. Enhance proficiency with database queries. Ideal for practitioners seeking real-world analytics experience. Gain insights into customer behavior, revenue trends, and genre preferences, empowering strategic decision-making in the music industry. Explore the project for a rich learning experience.
data-analysis data-insights data-science database genre-prediction music-industry music-store postgresql postgresql-database query-optimization revenue-trends sql sql-queries
Last synced: 01 May 2026
https://github.com/aksoni07/movie-recommendation
A hybrid movie recommendation system designed to deliver personalized and accurate suggestions by combining user preferences, item attributes, and collaborative patterns, ensuring a seamless and engaging experience.
clustering content-based-filtering data-analysis embeddings jupyter-notebook numpy ollaborative-filtering pandas personalization python recommendation-systems scikit-learn user-item-interactions
Last synced: 11 Apr 2026
https://github.com/nlink-jp/shell-agent-v2
macOS local-first chat & agent tool with interactive data analysis (Wails v2 + React)
data-analysis duckdb golang llm macos react wails
Last synced: 13 May 2026
https://github.com/ariyaarka/result-analysis
A simple analysis of result based on different factors shown in figures
data-analysis jupyter-notebook matplotlib numpy-library pandas-dataframe python seaborn
Last synced: 01 May 2026
https://github.com/bpkaur/a-network-analysis-of-game-of-thrones
A Network analysis of Game of Thrones: To analyze the co-occurrence network of the characters in the Game of Thrones books
data-analysis data-science machine-learning networkx python3
Last synced: 01 May 2026
https://github.com/myounesdev/authorgraphanalyzer
a web-based visualization tool for analyzing and exploring author collaboration networks
algorithms binary-tree bts d3js data-analysis dijkstra-algorithm django exception-handling pandas python scss
Last synced: 08 Jun 2026
https://github.com/caesaredia/la-cafe-market-analysis
A data-driven feasibility study exploring the potential of launching a robot-staffed café in Los Angeles, based on real F&B business data.
business-intelligence cafe data-analysis data-visualization food-industry franchise los-angeles market-research pandas python
Last synced: 01 May 2026
https://github.com/vbhvsingh0/coulombic_dyn_formaltetra
The Python code simulates a formaldehyde tetra-cation molecule using Coulombic forces
data-analysis physics-simulation python shell-scripting
Last synced: 24 Jun 2026
https://github.com/pablo1785/receipt-rs
Receipt processing backend built with Shuttle.rs, Axum and Azure Form Recognizer API
api-rest axum azure backend cognitive-services computer-vision data-analysis rust shuttle-rs sqlx
Last synced: 01 May 2026
https://github.com/ibrahimhabibeg/national-university-of-singapore-sms-analysis
Analysis of SMS messages collected by the National University of Singapore
analytics data-analysis data-science nlp python
Last synced: 13 May 2026
https://github.com/ireneli393/music-recommendation-system-with-listenbrainz-dataset
Recommendation System
alternate-least-squares baseline-model data-analysis data-sceince lightfm-library python recommendation-system
Last synced: 14 May 2026
https://github.com/dhruwsunita/customer-churn-analysis
Customer Churn Analysis using panda library
data-analysis data-cleaning data-manipulation data-science pandas python3
Last synced: 01 May 2026
https://github.com/codesaadumair/data-science-monorepo
Comprehensive Data Science monorepo featuring EDA, Machine Learning, Preprocessing, Feature Engineering, and Visualization projects with Jupyter notebooks and Python.
data-analysis data-science data-science-projects data-visualization eda jupyter-notebook jupyterlab machine-learning python
Last synced: 01 May 2026
https://github.com/kavicastelo/soil-fertilizer-analysis-colab
This repository includes a data analysis and model training practical Jupyter notebooks using a soil fertilizer dataset. (use 4th edition)
data-analysis jupyter-notebook python
Last synced: 01 May 2026
https://github.com/deliprofesor/joblocationmapper
JobLocationMapper is a Python tool that visualizes job listings on an interactive map. It uses city and state data to place job markers accurately and color-codes them by occupation (Software, Marketing, Design). The map clusters markers for better organization, and users can click on them to view job details.
clustrered-markers data-analysis data-visualization folium geocoding geographical-visualization interactive-map job-listings map-visualization pandas python
Last synced: 14 May 2026
https://github.com/rafath0ssain/predihome
Data analysis using economic factors affecting living conditions across Canadian provinces.
data-analysis data-visualization dplyr ggplot2 graph kaggle linear-regression prediction-model r shiny tidyr
Last synced: 01 May 2026
https://github.com/audy21/datacamp
Learning portfolio documenting my progress, while taking Data Analyst & Data Science certifications from DataCamp.
data-analysis data-science machine-learning matplotlib numpy pandas python scikit-learn seaborn
Last synced: 11 Apr 2026
https://github.com/27ahmad/ibm-data-science-capstone
The Capstone is the final course in the IBM Data Science Professional Certificate program. It's a project that combines all the skills and knowledge you've gained throughout the specialization.
data-analysis data-science folium-maps machine-learning plotly-dash python sql
Last synced: 26 May 2026
https://github.com/celineboutinon/client-segmentation
CentraleSupélec/OpenClassrooms Data Scientist 2024-2025 - Projet 5
aws client-segmentation cloud-architecture data-analysis data-science data-visualization database e-commerce marketing marketing-analytics marketplace-solution
Last synced: 01 May 2026
https://github.com/vedantshi/stock-price-prediction-for-maang-companies
This project utilizes Long Short-Term Memory (LSTM) networks to forecast stock prices. It includes steps for data preprocessing, model training, and visualization of predictions using Python in Jupyter Notebook. The project demonstrates proficiency in machine learning, data analysis, and Python programming.
data-analysis data-visualization lstm machine-learning python stock-price-prediction
Last synced: 01 May 2026
https://github.com/guptakushal03/whatsapp-chat-analyser
The WhatsApp Chat Analyzer is a Python-based tool built with Streamlit for analyzing WhatsApp chat data. It provides insights such as total messages, word count, media shared, links shared, monthly activity timeline, most active users, activity maps, and word clouds.
chat-analysis data-analysis data-visualization python streamlit text-processing whatsapp word-cloud
Last synced: 01 May 2026
https://github.com/ujjwalll/get-that-flair
It is a repository for project detecting the flair of reddit post through their links. You can find the working model of it at - https://get-that-flair.herokuapp.com/
data-analysis data-visualization django-application herokuapp machine-learning naive-bayes-classifier praw-reddit python3 random-forest reddit-api sentiment-analysis topic-modeling
Last synced: 01 May 2026
https://github.com/mateusoliveira30/top-intelligent-people
This project performs an exploratory analysis of the top_intelligent_people_in_the_world_5000.csv dataset, featuring some of the world's most intelligent individuals. Using pandas and matplotlib, the analysis includes checking for missing values, describing variables, and visualizing data.
data-analysis graphics kaggle-dataset python3
Last synced: 03 May 2026
https://github.com/bheemisme/icc-t20-world-cup-dashboard
2024 icc t20 world cup dashboard
dashboard data-analysis data-analytics data-science data-visualization matplotlib pandas seaborn
Last synced: 02 May 2026
https://github.com/more-joao/color-distance-luminance
Data analysis project that aims to establish a relation between the Canberra distance between white and any given color in the RGB colorspace and its luminance.
canberra-distance data-analysis luminance python r rgb
Last synced: 02 May 2026
https://github.com/waseemofficial/ml-practice
ML Practice
data data-analysis jupyter-notebook machine-learning ml python
Last synced: 02 May 2026
https://github.com/faithererer/haokanvideo_spider
好看视频爬取与数据分析
data-analysis data-visualization python spider
Last synced: 02 May 2026
https://github.com/shridhar1504/milk-production-time-series-forecasting-datascience-project
This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast the milk production. The model is evaluated using various metric.
adf arima-model augmented-dickey-fuller-test data-analysis data-analytics data-science data-visualization eda exploratory-data-analysis machine-learning machine-learning-algorithms python python3 residuals sarimax seasonality time-series time-series-forecasting trends
Last synced: 02 May 2026
https://github.com/shreeparab1890/unicorns-of-india-till-sep-2022-analysis-eda
This ipython notebook is the Exploratory data analysis (EDA) of the Unicorns of India till Sep 2022.
analysis data-analysis eda exploratory-data-analysis matplotlib-pyplot numpy pandas plotly
Last synced: 02 May 2026
https://github.com/asadiahmad/word-counter-spark
Word counter with spark
data-analysis nlp spark word-counter
Last synced: 02 May 2026
https://github.com/suma-aljudaia/my-portfolio
Suma Aljudaia | Portfolio – AI & Data Analysis Enthusiast
ai css data-analysis html machine-learning portfolio
Last synced: 02 May 2026
https://github.com/jimohola/breast-cancer-detection
Breast Cancer Detection-Machine learning
data-analysis data-visualization exploratory-data-analysis machine-learning python3
Last synced: 02 May 2026
https://github.com/bhawnagoyal18/ai-doctor-a-symptom-checker-disease-predictor
AI Doctor is an intelligent healthcare application that utilizes machine learning (ML) and Python to predict potential diseases based on user-input symptoms. The project integrates data from multiple medical datasets and provides an interactive web-based UI for an intuitive user experience.
data-analysis data-engineering data-visualization dataset flask html5 machine-learning python sql stacking statistics
Last synced: 02 May 2026
https://github.com/rorrell/employmentdata
A Jupyter Notebook where I use group by to analyze the average unemployment rate by year
data-analysis data-visualization jupyter-notebook python3
Last synced: 02 May 2026
https://github.com/chuxinh/our-data-manual
All in one place for our data science learning journey by Chuxin and Melody
data-analysis data-science machine-learning python
Last synced: 09 Jun 2026
https://github.com/lu-m-dev/biostatistics-eda
Exploratory data analysis and visualization system for biostatistical research
biostatistics data-analysis data-visualization eda
Last synced: 25 Jun 2026
https://github.com/balajimohan18/foreign-exchange-rate-time-series-datascience-project
This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA to model the data and forecast the exchange rate.
data-analysis data-analytics data-preprocessing data-science data-transformation data-visualization eda exploratory-data-analysis foreign-exchange-rates machine-learning model-fitting predictive-modeling python3 time-series time-series-analysis
Last synced: 14 May 2026
https://github.com/m0saan/python-for-data-analysis
Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney,
data-analysis data-science ipython-notebook machine-learning matplotlib numpy pandas python
Last synced: 02 May 2026
https://github.com/bhaveshbhakta/gold-price-prediction-using-ml
Gold Price Prediction
data-analysis data-visualization gold-price-prediction machine-learning python
Last synced: 02 May 2026
https://github.com/se7en69/rna-seq-data-processing-and-analysis-pipeline
This pipeline automates essential steps for RNA-Seq data analysis, including quality control, read trimming, alignment to a reference genome, and coverage quantification. It leverages tools like FastQC, fastp, STAR, and bedtools to ensure high-quality results, with MultiQC reports providing an overview at each stage.
bioinformaitcs-scripting bioinformatics bioinformatics-pipeline data-analysis linux scripts shell
Last synced: 02 May 2026
https://github.com/yashsingh43/cdc-sleep-duration-health-analysis
Analysis of CDC BRFSS 2022 data exploring how sleep duration relates to mental and physical health outcomes.
beautifulsoup brfss cdc data-analysis data-visualization matplotlib pandas plotly public-health python
Last synced: 11 Jun 2026
https://github.com/deva-246/datacleaning-excel-powerqueryeditor
data-analysis data-science excel powerquery
Last synced: 04 Jan 2026
https://github.com/faiyaz-zaman/used-car-market-trends-on-bikroy.com
Used Car Market Trends on Bikroy.com
data-analysis python scraping-websites selenium tableau
Last synced: 02 May 2026
https://github.com/maddieemihle/pandas-challenge
Python analysis to create and manipulate school and standardized test data. Scores are calculated, grouped, aggregated, summarized, and organized using pandas.
Last synced: 09 Jun 2026
https://github.com/badranalyst/movie-correlation-analysis-in-python
This project analyzes movie data correlations using Python libraries like Pandas, NumPy, Seaborn, and Matplotlib. It examines relationships between attributes such as ratings, genres, and box office performance to uncover trends that inform recommendations and enhance understanding of movie success factors.
data data-analysis dataset jupyter jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python seaborn
Last synced: 03 May 2026
https://github.com/helenaden/data-science-fundamentals
This project delves into fundamental data science concepts using Python libraries like NumPy and Pandas
data-analysis datascience datasets datavisualization datawrangling heatmap numpy pandas patterns python
Last synced: 03 May 2026
https://github.com/stas1f1/methods-and-models-for-multivariate-data-analysis
Completed tasks for the course on methods of mutivatiate data analysis, 1st year of masters, FDT ITMO
data-analysis multivariate-analysis python
Last synced: 10 Mar 2026
https://github.com/monteirooscar98/tarifas-publicas-sp-dieese
Extração de dados através de WebScraping no site do Dieese e Analise em relação as Tarifas Públicas do Município de São Paulo.
data-analysis data-visualization python webscraping
Last synced: 03 May 2026
https://github.com/aicorsair/python-case-study-imdb-movie-reviews-sentiment-analysis-with-nlp
This repository contains a comprehensive case study on sentiment analysis using the IMDb dataset of movie reviews.
ada-boost artificial-intelligence classification data-analysis data-analytics data-cleaning data-preprocessing data-science data-visualization feature-engineering feature-extraction hyperparameter-tuning logistic-regression machine-learning naive-bayes natural-language-processing nltk python random-forest shap
Last synced: 03 May 2026
https://github.com/skuschel/postexperiment
postprocessor for experimental (event based) data.
data-analysis eventstore hacktoberfest postprocessing
Last synced: 12 Jun 2026
https://github.com/chaedoll/analysis-python-foreignerinfra
국내 외국인 대상 인프라 개선을 위한 보고서 (Report on improving infrastructure for foreigners)
data-analysis python team-project
Last synced: 03 May 2026
https://github.com/sambit-mondal/stockx
StockX is a full-stack application designed to help store owners efficiently manage their inventory, track purchases, and analyze stock levels. The system integrates MongoDB, Express, React, and Flask (Python) to provide a seamless experience.
artificial-intelligence data-analysis inventory-management-system machine-learning mern-stack
Last synced: 12 Jun 2026
https://github.com/neelimabonangi/defect-detection-hot-rolling
Defect Detection in Hot Rolling Using Machine Learning
classification data-analysis data-science defect-detection jupyter-notebook machine-learning manufacturing numpy pandas predictive-analytics python random-forest scikit-learn
Last synced: 12 Jun 2026
https://github.com/baggiponte/pyconpt-polars
@pola-rs talk @pyconpt
apache-arrow data-analysis data-science etl polars python
Last synced: 03 May 2026
https://github.com/saksham-jain177/cryptodataanalysis
A Python powered project that fetches live cryptocurrency data from the CoinMarketCap API, analyzes it, and updates a live Excel sheet every 5 minutes.
api-integration coinmarketcap cryptocurrency data-analysis excel live-data python
Last synced: 12 Jun 2026
https://github.com/mohnish88/e-commerce-data-analysis
I analyzed sales data to identify trends and patterns, which significantly enhanced decision-making processes. Additionally, I created interactive visualizations to present these insights clearly and effectively, facilitating better understanding and communication of the data's implications.
data-analysis data-cleaning jupyter-notebook pandas plotly python python-library sales sales-analysis visulaization
Last synced: 03 May 2026
https://github.com/devlucho/modelos-predictivos
Modelos predictivos utilizando los algoritmos de Regresión Lineal, Regresión Logística y Árboles de Decisión.
data-analysis jupyter-notebook python3
Last synced: 03 May 2026
https://github.com/imosudi/unsupervised-ml-kmeans-analysis
K-Means clustering analysis using synthetic datasets generated with scikit-learn, including meshgrid visualisation, silhouette score evaluation, and investigation of cluster count and random seed effects.
clustering data-analysis jupyter-notebook kmeans kmeans-clustering machine-learning matplotlib python3 scikit-learn silhouette-score unsupervised-learning
Last synced: 25 Jun 2026
https://github.com/codeslash21/tmdb_data_analysis
We analysed TMDB dataset which contains around 11000 movies details. We analyzed to find some interesting facts about the dataset.
data-analysis data-visualization matplotlib nanodegree-project numpy pandas python tmdb-movie
Last synced: 03 May 2026
https://github.com/salma-mamdoh/project-writing-functions-for-product-analysis
My Project to learn the Basics of Analysis on DataCamp
data-analysis data-camp pandas python
Last synced: 03 May 2026
https://github.com/muskanmi/data_analysis_python
Data analysis on students result dataset using python libraries.
boxplot countplots data-analysis numpy pandas pie-chart python3 seaborn
Last synced: 03 May 2026
https://github.com/rock12231/weather-analysis-backend
Weather analysis, visualization & Data science
data-analysis data-science data-visualisation django-rest-framework jyputer-notebook prediction python
Last synced: 15 Mar 2025
https://github.com/elakkiya-u/digital-marketing-campaign
A machine learning project to predict whether a customer will convert based on digital marketing campaign data.
campaigns data-analysis deployment digital-marketing machine-learning predictive-modeling python
Last synced: 30 Jun 2025
https://github.com/rodrigojunqueiradev/data-exploration-and-cleaning
Credit Analysis Data: Foundations for Cleaning and Exploration
data-analysis data-engineering data-science data-visualization datascience matplotlib matplotlib-pyplot numpy pandas python python-3 python3
Last synced: 13 Apr 2026
https://github.com/felinjob/ibm-applied-data-science-capstone
Este projeto, parte da especialização IBM Data Science Professional Certificate, prevê o sucesso do pouso do Falcon 9 da SpaceX. Usando dados da API da SpaceX e Web Scraping, o projeto inclui análise de dados e Machine Learning para gerar insights sobre os lançamentos.
data-analysis data-science data-visualization ibm jupyter-notebook machine-learning numpy pandas python scikit-learn seaborn sql
Last synced: 11 Apr 2026
https://github.com/ljadhav25/swiggy-restaurant-analysis
This repository contains data and analysis related to restaurants listed on Swiggy, one of India's largest online food ordering and delivery platforms. The objective is to explore restaurant trends, customer reviews, pricing strategies, and delivery metrics to gain insights into the food delivery industry.
data-analysis data-visualization matplotlib-pyplot numpy-library pandas-library python seaborn-plots
Last synced: 03 May 2026
https://github.com/matteospanio/speed-analysis
A project to analyze the internet speed
Last synced: 03 May 2026
https://github.com/syarwinaaa09/analyzing-crime-in-los-angeles
Exploratory data analysis of Los Angeles crime data with insights on temporal patterns, locations, and age demographics.
crime-data data-analysis eda los-angeles pandas public-safety python visualization
Last synced: 03 May 2026
https://github.com/nathadriele/world-marathon-run-majors-analytics-challenge
This project presents a complete data engineering, analytics, machine learning, and Streamlit dashboard pipeline focused on the Abbott World Marathon Majors: Tokyo, Boston, London, Berlin, Chicago, and New York City. Covering the 2018 to 2025 seasons, it analyzes more than 628,000 runner records and 86 verified winner entries.
challenge data-analysis data-pipeline gradient-boosting lasso-regression linear-regression machine-learning models predictive-modeling python random-forest ridge-regression run-analytics world-marathon
Last synced: 09 Jun 2026
https://github.com/bpkaur/whats-in-a-name
Exploring dataset of first names of babies born in the US in order to uncover interesting stories
data-analysis datacamp numpy pandas python3
Last synced: 04 May 2026
https://github.com/marialuizaleitao/walmartsalesanalysis
This project explored data collection and preprocessing, advanced application of SQL queries, and feature engineering. Key calculations, such as COGS (Cost of Goods Sold) and VAT (Value Added Tax), were performed to assess the profitability and financial efficiency of the branches.
business-analytics data-analysis mysql-database sql
Last synced: 13 Jun 2026
https://github.com/zkan/python-for-data-scientists
Python for Data Scientists
data-analysis data-science data-scientists machine-learning pandas python
Last synced: 13 Apr 2026
https://github.com/lexiortiz/advanced-data-analytics
Structured learning notes, code snippets, and key takeaways from the Google Advanced Data Analytics Professional Certificate. Serves as a personal reference for reinforcing concepts and as a resource for others on a similar learning journey.
data data-analysis data-engineering google python-3 sql
Last synced: 29 May 2026
https://github.com/ironlegion88/media_bias
An end-to-end NLP pipeline to analyze ideological bias in online news media during elections. Uses sentiment analysis, topic modeling (LDA/NMF), and NER to quantify media framing.
data-analysis machine-learning media-bias nlp nltk political-science python scikit-learn sentiment-analysis spacy topic-modeling
Last synced: 13 Apr 2026
https://github.com/nurulashraf/polynomial-regression-manufacturing
A Python project implementing polynomial regression to analyse and predict manufacturing-related data. Features include data preprocessing, model training, and visualisation of results. Ideal for exploring machine learning applications in manufacturing process optimisation.
data-analysis data-visualization machine-learning manufacturing polynomial-regression predictive-modeling process-optimization python regression-models scikit-learn
Last synced: 16 Apr 2026
https://github.com/r13i/cheapest-phone-call
Small challenge to find the best phone operator to use based on call price
big-data big-data-analytics cheapest data-analysis data-cruncher pandas phone-number pricelist
Last synced: 04 May 2026
https://github.com/xiaohan2012/myunisport
Visualize your Unisport annual training records
data-analysis data-visualization pandas pygal sports-stats tikzposter
Last synced: 04 May 2026
https://github.com/ashleydavis/brisjs-data-analysis-talk
Code for my talk to BrisJS on data analysis in JavaScript
charting data-analysis data-visualization data-viz javascript node node-js nodejs visualization
Last synced: 25 Mar 2025
https://github.com/borjamome/accidentes_madrid
Análisis de Accidentes en Madrid en SQL (2023)
accidentes-coche data-analysis madrid sql
Last synced: 17 Jan 2026
https://github.com/parthds02/e-commerce-data-analysis-with-python
This project focuses on analyzing an e-commerce dataset using Python. The goal is to derive meaningful insights through exploratory data analysis (EDA) and uncover trends and patterns that can drive business decisions.
data-analysis ecommerce exploratory-data-analysis jupyter-notebook pytho sales-analysis visualization
Last synced: 13 Jun 2025
https://github.com/nafiealhilaly/first-dash-app
A simple dash plotly app to explore and analyze imagined students assessment dataset
data-analysis data-analytics data-visualization eda plotly-dash python
Last synced: 02 Apr 2025