Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-02-07 00:07:52 UTC
https://github.com/aalkiyumi/senior-design-project
Web scraper for collecting product and review data from e-commerce sites using ScrapingBee, AWS, Selenium, and Pandas. Focuses on cost-effective solutions, user-friendly interfaces, and efficient data extraction and analysis.
aws cs5001 data-analysis data-extraction data-processing data-storage e-commerce-analytics e-commerce-data pandas product-reviews review-sentiment-analysis scraping-bee selenium senior-design-project uc uc2026 university-of-cincinnati web-crawlers web-scraping
Last synced: 13 Apr 2025
https://github.com/andreaschandra/who-suicides-statistics
Exploratory Data Analysis for Suicides using Python
data-analysis data-science eda python
Last synced: 06 Apr 2025
https://github.com/antononcube/wl-outlieridentifiers-paclet
Wolfram Language (aka Mathematica) paclet that provides outlier identifier functions.
data-analysis hampel outlier-detection outliers
Last synced: 16 Jan 2026
https://github.com/justsecret123/twitter-sentiment-analysis
A sentiment analysis model trained with Kaggle GPU on 1.6M examples, used to make inferences on 220k tweets about Messi and draw insights from their results.
classification data-analysis data-science deep-learning deep-neural-networks docker glove-embeddings kaggle lstm lstm-neural-networks machine-learning natural-language-processing nlp python rnn scikit-learn sentiment-analysis sentiment-classification tensorflow word-embeddings
Last synced: 03 Jan 2026
https://github.com/jesussantana/data-science-with-python-it-academy
Learn how to extract value from data by ingesting, transforming, storing, analyzing, and visualizing data
classification-model clustering-methods dash data-analysis data-mining data-science database machine-learning matplotlib mongodb numpy pandas plotly python3 regression-models seaborn sklearn sql web-scraping
Last synced: 04 Aug 2025
https://github.com/saranshbansal/spam-detection-analytics-tool
A tool that reads chunks of SMS data from a CSV file and shows how different pre-implemented algorithms perform at identifying spam messages.
analytics data-analysis data-science data-visualization mysql spring-boot
Last synced: 24 Feb 2025
https://github.com/tameronline/tameronline
Showcasing Projects on Data Analysis, Programming, and AI — Developed Using Python and Modern Frameworks
data-analysis deep-learning flask machine-learning numpy pandas python3 sql web-development
Last synced: 11 Jun 2025
https://github.com/joanacmbarros/ardm-website
Website to support the R in Pharma 2023 workshop on the ARDM
analysis-results automation clinical-data data-analysis data-model r-in-pharma
Last synced: 03 Apr 2025
https://github.com/shashankbansal6/signal-analysis-for-patient-monitoring
A reliable patient monitoring system which analyzes the correlated physiological signals collected from the patient's body, and generates alarms for abnormalities.
data-analysis patient-monitoring
Last synced: 30 Jan 2026
https://github.com/idaraabasiudoh/vehicle-co2emission_model
Predicts CO2 emissions from vehicle fuel consumption using a multiple linear regression model trained with scikit-learn, based on a dataset of engine sizes and corresponding CO2 emissions in Canada.
data-analysis jupyter-notebook machine-learning python3 scikit-learn
Last synced: 04 Mar 2025
https://github.com/pangeo-data/foss4g-2022
Pangeo tutorial at FOSS4G 2022
data-analysis hvplot pangeo time-series xarray
Last synced: 12 Apr 2025
https://github.com/gallillio/data_science-data_visualizer_tool
Supervised ML Helper is a Python application that streamlines exploratory data analysis (EDA) and preprocessing for supervised machine learning. Featuring a user-friendly Tkinter interface, it enables users to load CSV files, visualize data, and perform essential transformations, making data preparation accessible for all skill levels.
data-analysis data-science data-visualization matplotlib numpy pandas seaborn sklearn
Last synced: 11 Jul 2025
https://github.com/bala-ceg/digital-payment-index
This project aims to develop an index for the digital transactions of India
collaborate data-analysis fintech hacktoberfest machine-learning statistics
Last synced: 20 Jun 2025
https://github.com/kaguya163/ankara_coffee_sales_analysis
Coffee shop sales analysis in Ankara. SQL, Tableau, Python, Data Analytics
data-analysis mysql python sql tableau
Last synced: 06 Jul 2025
https://github.com/iraikov/chicken-dataframe
Tabular data structure for data analysis in Scheme
chicken-scheme chicken-scheme-eggs data-analysis dataframe linear-regression scheme scheme-programming-language
Last synced: 12 Sep 2025
https://github.com/marios-mamalis/mca-visualisation
A script for automatic visualisation of Multiple Correspondence Analysis (MCA) results from FactoMineR in 3 dimensions using Plotly (exported as html)
3d-scatterplots correspondence-analysis data-analysis factominer html mca multiple-correspondence-analysis plotly visualisation
Last synced: 02 Mar 2025
https://github.com/mwoss/mlflow-stock-market-example
Stock market prediction - machine learning pipeline using MLFlow.
anaconda data-analysis databricks example lstm mlflow python stock-market stock-price-prediction tutorial
Last synced: 21 Jul 2025
https://github.com/cosmoduende/r-marvel-vs-dc
DC Comics vs Marvel Comics - Exploratory Data Analysis and Data Visualization with R. Who has the smartest, strongest, fastest, or most powerful hero or villain? How to answer this and more questions with R
comics data-analysis data-analysis-r data-analytics data-visualization dataviz dc-characters dc-comics eda exploratory-analysis exploratory-data-analysis exploratory-data-visualizations marvel-characters marvel-comics marvel-vs-dc shdb superherodb superheroes superheros
Last synced: 11 Apr 2025
https://github.com/dain55788/ibm-data-engineer-lecture-note
Lecture Notes and Practice Materials of IBM Data Engineering Course
data-analysis database dataengineering datawarehouse ibm
Last synced: 03 Apr 2025
https://github.com/yash22222/data-analysis-with-python
This repository provides a practical introduction to data acquisition and analysis using Pandas. It covers loading datasets, exploring data, manipulating data, and gaining insights through statistical summaries. Ideal for beginners, it offers code examples and explanations to enhance your data manipulation skills using Pandas for Python.
binning data data-acquisition data-analysis data-binning data-cleaning data-formatting data-integration data-normalization data-preprocessing data-science data-transformation data-wrangling dataframe description numpy pandas pandas-dataframe python python3
Last synced: 24 Feb 2025
https://github.com/quantumudit/uk-student-accommodation-analysis
This project focuses on scraping student property data from the UK Student Accommodation website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 01 Nov 2025
https://github.com/patex1987/temperature-calibration
Notebook for sensor calibration evaluation
calibration data-analysis jupyter-notebook sensor
Last synced: 20 Jun 2025
https://github.com/armanx200/diabetes_model
🚀 A machine learning model predicting diabetes with logistic regression, feature scaling, and VIF analysis. 📊🩺
arman-kianian classification data-analysis data-science data-visualization feature-engineering healthcare logistic-regression machine-learning model-evaluation predictive-modeling python scaling scikit-learn statistical-analysis statsmodels
Last synced: 18 Mar 2025
https://github.com/chaitanyac22/customer-segmentation-using-rfm-analysis-for-online-retail
This project uses RFM (Recency, Frequency, and Monetary) segmentation to analyze customer behavior and provide insights for targeted marketing campaigns. By classifying customers based on their purchasing patterns, strategies can be tailored to improve customer retention, drive growth, and maximize the lifetime value of each customer.
customer-segmentation data-analysis data-science data-visualization exploratory-data-analysis marketing numpy pandas python python3 rfm rfm-analysis rfm-segmentation
Last synced: 11 Jun 2025
https://github.com/ishanoshada/lottery-predict
Predict lottery numbers with this Flask-powered web app! Upload Excel data, get real-time analysis, and see animated predictions. Try it now! 🎰
data-analysis flask lottery lottery-prediction machine-learning prediction prediction-model predictor python
Last synced: 08 May 2025
https://github.com/dataopstix/modelt
Modelt(mow·delt) is a modern data integration solution that connects data to data for advanced analytics.
airbyte airflow airflow-docker data data-analysis data-visualization database dbt elt etl etl-automation metabase metadata modern modern-dev modernization
Last synced: 28 Mar 2025
https://github.com/kevinyang372/san-francisco-crime-data-analysis
An ARIMA prediction model for forecasting potential crimes based on users' time and location
data-analysis machine-learning
Last synced: 29 Oct 2025
https://github.com/leandronasx/agro-data
Final project for the SoulCode Academy data analyst and dashboard training program.
bigquery data-analysis gcp looker pandas powerbi python
Last synced: 18 Jul 2025
https://github.com/shivamswarnkar/tesla-stock-prediction
Predicts closing prices of Tesla stock using different regression methods.
data-analysis data-visualization plotly regression regularization sklearn stock-price-prediction
Last synced: 09 Sep 2025
https://github.com/hariprashad-ravikumar/ai-datascience-lab
AI‑DataScience‑Lab is a web app for uploading CSV datasets, cleaning with Pandas, and running quick exploratory analyses and regression models using scikit‑learn. Its modular design supports future AI extensions, like deep learning with TensorFlow or insight generation via the OpenAI API.
ai api azure cloudcomputing data data-analysis data-science data-visualization mathplotlib numpy openai pandas python scikit-learn
Last synced: 02 Aug 2025
https://github.com/uts58/international-student-job-insights-usa
Data-driven insights on job hunting for international students in the USA, analyzing listings, roles, and trends.
career-insights cpt data-analysis eb1 eb2 eb3 h1b handshake job-analytics job-trends jobs jupyter-notebook opt python work-visa
Last synced: 13 May 2025
https://github.com/quantumudit/analyzing-suez-services
This project focuses on scraping all the service locations across Australia & New Zealand and their associated attributes from "Suez" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 01 Nov 2025
https://github.com/elysian01/ml-eda-and-modelling-using-streamlit
Beautiful Web interface made using Streamlit for quick Exploratory Data Analysis and building classification models which are implemented from scratch.
data-analysis data-visualization eda exploratory-data-analysis knn-classification logistic-regression matplotlib ml-model-on-web ml-models naive-bayes-classifier pandas seaborn streamlit streamlit-webapp
Last synced: 12 Apr 2025
https://github.com/beckversync/probability-and-statis_computer-parts-cpus-and-gpus-ics_
Probability and statistical analysis techniques are employed to explore data related to computer components, such as CPUs, GPUs, and Integrated Circuits (ICs). The objective is to uncover trends, identify patterns, and extract meaningful insights from real-world hardware data.
Last synced: 23 Jun 2025
https://github.com/juicedata/juicefs-deeplearning-tutorials
Deep Learning and Data Analytics Techniques with the help of JuiceFS.
data-analysis deep-learning filesystem juicefs machine-learning
Last synced: 07 Jul 2025
https://github.com/tushar2704/everyday-sql
Welcome to Everyday SQL Sheets – your go-to resource for everyday SQL cheat sheets, pro tips, interview questions, and more. Whether you're a beginner looking to learn SQL or an experienced developer seeking quick reference materials, this application has got you covered.
artificial-intelligence cheatsheet data-analysis data-science database mysql postgresql query-language sql sqlalchemy streamlit streamlit-tushar2704 tushar2704
Last synced: 30 Dec 2025
https://github.com/praveendecode/product_sentiment_analysis
This project employs NLTK, Prowebscraper, and Python for sentiment analysis on online product reviews. Through web scraping, EDA, and NLP, it evaluates user satisfaction by comparing actual ratings and sentiment scores
data-analysis data-visualization natural-language-processing nltk-python product-analysis python sentiment-analysis
Last synced: 04 Apr 2025
https://github.com/ikanurfitriani/project-data-analysis-python
This repository contains the results of data analysis learning using Python.
data-analysis data-analysis-project data-analysis-python python
Last synced: 21 Mar 2025
https://github.com/rafat-decodis/robust-asr-for-low-resource-languages
Exploring Benchmark Gaps and Real-World Speech Generalization for Low-Resource Languages
artificial-intelligence automatic-speech-recognition data-analysis dataprocessing whisper
Last synced: 23 Jun 2025
https://github.com/sabujxi/python-scraper-and-data-analysts-admin-panel-in-django
A data scraper for a Texas government site and a companion web app for managing, reviewing, and editing the data
analyst data data-analysis data-entry data-scraper django django-application python python-scraper real-estate regex scraper texas
Last synced: 05 Apr 2025
https://github.com/prithivsakthiur/data-board
Data Boards - Visualization of various plots (Analysis)
data-analysis gradio huggingface keras mathplotlib pandas plots pyplot scikit-learn seaborn spaces
Last synced: 28 Oct 2025
https://github.com/alexandregazagnes/unilasalle-public-resources
UniLaSalle-Public-Ressources: This public repository contains the notebooks and data used for both 2nd Year - Practical Statistical Tests and 4th Year - Data Analysis with Python
data data-analysis data-analytics data-cleaning data-storytelling education educational exploratory-data-analysis python python3 r r-programming rstudio statistics visualization
Last synced: 06 Apr 2025
https://github.com/seyedhosseinzadeh/ws_tm
Weather web scraping and a time series model to predict temperature, humidity, and barometric pressure
data-analysis deep-learning lstm-model machine-learning prediction prediction-model weather web-scraping
Last synced: 28 Feb 2025
https://github.com/jimbrig/EDA
Exploratory Data Analysis R Package and Shiny App
data-analysis data-visualization eda r shiny
Last synced: 30 Jul 2025
https://github.com/gher-uliege/liege-colloquium-on-ocean-dynamics
Python tools and LaTeX files for the Colloquium
data-analysis data-assimilation numerical-simulations ocean-modelling oceanography remote-sensing submesoscale turbulence
Last synced: 14 Oct 2025
https://github.com/parisaroozgarian/ibm-data-analyst-professional-certificate
The IBM Data Analyst Professional Certificate, consisting of 9 courses, equips learners with essential skills in Excel, SQL, Python, data visualization, and analysis techniques
big-data business-analysis business-communication communication data-analysis data-management data-structures data-visualization databases general-statistics human-resources planning python-programming spreedsheet sql
Last synced: 27 Jan 2026
https://github.com/nunesma/Health-analytics
Data analysis focusing on health problems
data-analysis epidemiology-analysis health-analytics python r-programming
Last synced: 30 Jul 2025
https://github.com/avinesh-masih/data-analytics-assignment
Complete PW Skills Data Analytics Assignments: This repository contains all PW Skills Data Analytics assignments, covering topics like Python, SQL, Statistics, Data Visualization, and more. It includes well-structured solutions with notebooks and queries, ideal for learners seeking clarity and hands-on practice.
ai api data-analysis data-science data-visualization eda flask jupyter-notebook machine-learning matplotlib numpy pandas pw pw-assignment pw-skills-assignment pwskills python seaborn sql statistics
Last synced: 13 Jun 2025
https://github.com/afondiel/ibm-data-science-professional-certificate-coursera
IBM Data Science Professional Certificate Coursera Notes
ai classification clustering coursera data-analysis data-engineering data-mining data-science data-science-challenges data-science-projects data-scientist data-visualization ibm ibm-certificate ibm-professional-certificate linear-algebra machine-learning python regression statistics
Last synced: 13 Oct 2025
https://github.com/winter000boy/dsa-practice
This repository holds my solutions for LeetCode’s Pandas playlists. Each section includes code and notes on using Pandas to handle real-world data tasks efficiently. Perfect for anyone looking to deepen their understanding of data manipulation with Pandas.
data-analysis data-science leetcode leetcode-python pandas-python python3
Last synced: 06 Feb 2026
https://github.com/yeisonmontoya1815/machine-learning_prediction_can_inflation
We aim to predict trends in the Canadian market basket using sentiment analysis techniques. Sentiment analysis involves analyzing text data to determine the sentiment expressed, whether positive, negative, or neutral.
algorithms-and-data-structures data data-analysis data-science data-visualization feature-engineering machine-learning matplotlib-pyplot numerical-analysis numpy pandas pipelines python sklearn structured-data super unsupervised-learning
Last synced: 05 Feb 2026
https://github.com/hvignolo87/ortex-programming-challenge
Coding challenges required for the Python Developer and Data Engineer job positions.
challenge data-analysis finance pandas python scripting sql sqlalchemy
Last synced: 13 Oct 2025
https://github.com/mindful-ai-assistants/credit-card-prediction
💳 This repository focuses on building a predictive model to assess the likelihood of credit card defaults. The project includes data analysis, feature engineering, and machine learning to provide accurate default predictions.
artificial-intelligence data-analysis data-science jupyter logistic-regression machine-learning oneness-consciousness predictive-modeling python3 scikit-learn
Last synced: 12 Apr 2025
https://github.com/1sumer/sql
This repository contains SQL scripts and data for various analytical and database management tasks. The project is designed to demonstrate SQL capabilities in handling complex queries, data analysis, and database design. It includes datasets related to e-commerce and streaming services, with a focus on real-world scenarios and use cases.
analytics data data-analysis data-storage sql vscode
Last synced: 19 Jan 2026
https://github.com/codebypinar/reta
🍃 Explore the world of renewable energy production, analyze historical data, and predict sustainable energy trends. Join us on the journey to a greener future!
arima clean-energy data-analysis data-science data-visualization energy-future forecasting-models innovation renewable-energy sustainability time-series
Last synced: 12 Oct 2025
https://github.com/gustavohnsv/teamwork_mqa
Repository for group work based on the study of data analysis methods in the course Métodos Quantitativos para Análise Multivariada (Quantitative Methods for Multivariate Analysis).
data-analysis group-project r team-repo
Last synced: 15 Aug 2025
https://github.com/ivangrigorov/neutrino-search-engine
A Java search engine for both HTML and document files
data data-analysis data-knowledge information-extraction information-retrieval java-language search-engine
Last synced: 31 Mar 2025
https://github.com/worst001/note_machine_learning
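A collection of machine learning materials and handbooks, covering mathematical foundations, example implementations of machine learning models, and neural networks.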
ai data-analysis deep-learning development guide learning machine-learning markdown mkdocs note notebook
Last synced: 16 Oct 2025
https://github.com/alejandrodumas/traintestdiff
Explore the distribution of your train/validation/test datasets
data-analysis matplotlib pandas seaborn
Last synced: 13 Jun 2025
https://github.com/erictleung/erictleung.github.io
📝 Source code for my website, portfolio of projects, and more
bioinformatics blog data data-analysis data-science github-jekyll github-page jekyll lanyon open-science open-source software-engineering
Last synced: 21 Jan 2026
https://github.com/camille-maslin/securecard-ai
🛡️ SecureCard-AI: A high-performance credit card fraud detection system implemented in a Jupyter Notebook, achieving 99.97% accuracy.
classification credit-card-fraud-detection data-analysis data-science fraud-detection jupyter-notebook machine-learning matplotlib numpy pandas python scikit-learn
Last synced: 28 Feb 2025
https://github.com/draym/covid19tracker
Coronavirus COVID-19 dashboard to track global cases
covid-19 covid19-tracker dashboard data-analysis
Last synced: 07 Jan 2026
https://github.com/prajwalchapke055/cognizant-artificial-intelligence-job-simulation-forage
Advise one of Cognizant’s clients on a supply chain issue by applying knowledge of machine learning models.
artificial-intelligence cognizant communication data-analysis data-modeling data-visualization development evaluation forage job-simulation machine-learning machine-learning-algorithms machine-learning-engineering model-interpretation presentation problem-statement python quality-assurance skills virtual-internship
Last synced: 12 Oct 2025
https://github.com/giatraskon/clustering-countries-socioeconomic-health-analysis
Exploration and analysis of socio-economic and health data from 167 countries using MATLAB. Application of clustering algorithms to identify development patterns, visualize disparities, and understand global trends.
calinski-harabasz-index clustering country-data data-analysis data-visualization davies-bouldin-index elbow-method feature-selection health-indicators human-development-index k-means-clustering k-median-clustering k-medoids-clustering machine-learning matlab pca pearson-correlation silhouette-score socio-economic-indicators unsupervised-learning
Last synced: 29 Jan 2026
https://github.com/jgekko99/portfolio-optimization-and-backtesting-using-python-a-pragmatic-approach
Modern Portfolio Theory (MPT) and Monte Carlo simulations to optimize and backtest a portfolio of various financial assets
asset-management data-analysis data-cleaning jupyter-notebook modern-portfolio-theory monte-carlo-simulation multiprocessing multithreading numba numba-jit-compiler perfomance-python python
Last synced: 12 Oct 2025
https://github.com/akshat0427/spotify_history
Code to extract insights from Spotify streaming data (work in progress)
data-analysis data-visualization
Last synced: 04 Feb 2026
https://github.com/dcs-training/bayesian-statistics
Materials for the CDCS Introduction to Bayesian Statistics course. Go to the readme file
bayesian-statistics data-analysis r statistics
Last synced: 05 Feb 2026
https://github.com/knyghtmare/msba_projects_public
A repo containing links to my completed projects
data-analysis data-mining data-science data-science-portfolio data-science-projects data-visualization datascience tahsinjahinkhalid
Last synced: 06 Feb 2026
https://github.com/verbasik/yandex.practicum.datascience
Portfolio of Data Science projects completed as part of professional retraining at Yandex.Practicum. Includes studies in finance, real estate, film distribution, and other areas, using statistics, machine learning, and data analysis.
data-analysis data-science machine-learning yandex-praktikum
Last synced: 29 Jan 2026
https://github.com/ehtisham-sadiq/ai-pioneers-datascience-arena
This repository is dedicated to the AI Amigos team's participation in the Artificial Intelligence (AI) competition with a focus on Data Science.
artificial-intelligence competition data-analysis data-science data-visualization machine-learning model-building model-evaluation numpy pandas python3 supervised-learning unsupervised-learning
Last synced: 28 Feb 2025
https://github.com/txn2/mcp-data-platform
A semantic data platform MCP server that composes multiple data tools with bidirectional cross-injection - tool responses automatically include critical context from other services.
data-analysis data-lake data-warehouse golang golang-library mcp mcp-server
Last synced: 05 Feb 2026
https://github.com/mirokeimioniemi/optimizing-insulin-injection-timing
Data processing and analysis for "Determining the optimal timing for insulin injection to minimize glucose level variability after a meal in ideal conditions" - a research project for the IB Standard Level Mathematics Analysis and Approaches course inspired by my type 1 diabetes.
cgm data-analysis data-science dexcom dexcom-g6 diabetes exploration ib insulin insulin-timing international-baccalaureate mathematics optimization python type-1-diabetes
Last synced: 18 Oct 2025
https://github.com/lacerbi/vbmc
Variational Bayesian Monte Carlo (VBMC) algorithm for posterior and model inference in MATLAB (old location)
bayesian-inference data-analysis gaussian-processes machine-learning matlab variational-inference
Last synced: 11 Oct 2025
https://github.com/roland045/road_quality_measurement_analysis
Novel ML-based road quality measurement system for cost-effective pavement monitoring
azure data-analysis data-engineering data-science machine-learning mlops model-deployment python sql unsupervised-learning
Last synced: 24 Jan 2026
https://github.com/wiseaidev/corona-virus-data-analysis-modeling-and-visualization
Data analysis of COVID-19 and SEIRD model implementation.
coronavirus coronavirus-tracking covid-19 data-analysis data-analysis-python data-visualization folium-maps modeling-dynamic-systems numpy ploty population python3 science science-research seird-model seird-simulator simulation
Last synced: 14 Apr 2025
https://github.com/ahmednasef3/titanic-full-eda
Simple EDA for Titanic Dataset.
data-analysis data-visualization eda exploratory-data-analysis matplotlib pandas seaborn titanic titanic-data-analytics
Last synced: 27 Jan 2026
https://github.com/lykmapipo/scala-spark-product-sales-analysis
Scala application to process and analyze product sales using Spark
anomaly-detection apache-spark apache-spark-sql customer-segmentation data-analysis data-processing lykmapipo market-basket-analysis product-sales product-sales-analysis rolling-average running-total sbt scala summary-statistics time-series-analysis
Last synced: 10 Oct 2025
https://github.com/johnsesana/eda-video-game-sales
Exploratory Data Analysis on Public Datasets
data-analysis data-visualization excel
Last synced: 11 Mar 2025
https://github.com/johnsesana/eda-liquor-sales
Exploratory Data Analysis on Public Datasets
data-analysis data-visualization sql tableau-dashboards
Last synced: 11 Mar 2025
https://github.com/andr3w03/bike-sharing-dashboard
Bike Sharing Data Analysis Streamlit Dashboard
dashboard data-analysis data-visualization python streamlit
Last synced: 20 Oct 2025
https://github.com/danielvartan/linear-models
🐧🧊 General Linear Models Using Penguins
data-analysis data-visualization general-linear-models multiple-regression palmer-archipelago penguin-statistics penguins r-programming statistical-modeling
Last synced: 31 Jan 2026
https://github.com/fenghaojiang/ethereum-etl
ETL (Extract, Transform, Load) for data from Ethereum-like EVM blockchains
Last synced: 14 Jan 2026
https://github.com/markmelnic/carsen-desktop
A Python dashboard app for scraping and tracking cars for sale on websites such as mobile.de
automation dashboard data-analysis interface scraping scraping-websites tkinter-python
Last synced: 08 Oct 2025
https://github.com/juliasouz/dashboard-vendas
Interactive Xbox Game Pass sales dashboard, built in Excel for analyzing and visualizing subscription data.
business-intelligence dashboard data-analysis excel sales-data visualizacao-de-dados xbox-game-pass
Last synced: 31 Jan 2026
https://github.com/markmelnic/scalg
List scoring algorithm. Analyse data using a range-based percentage proximity algorithm.
algorithm data-analysis pypi pypi-package score scorer scoring scoring-algorithm
Last synced: 08 Oct 2025
https://github.com/vishrut-b/end-to-end-data-analytics-with-python-and-sql
This project involves the data cleaning and SQL-based analytics of a retail orders dataset using Python and SQL. It focuses on preprocessing data, followed by detailed analytics to extract insights on sales trends and product performance.
data-analysis python retail sql sql-server sqlalchemy
Last synced: 07 Feb 2026
https://github.com/DCS-training/intromachinelearning
This course provides an introduction to machine learning for those with beginner-level Python/RStudio skills. Go to the readme file
data-analysis data-wrangling machine-learning python statistics
Last synced: 25 Apr 2025
https://github.com/souvik09-tech/walmart_sales_dataanalysis
This end-to-end data analysis project leverages Python for processing and SQL for advanced querying to extract key business insights from Walmart sales data. It's designed for data analysts to enhance skills in data manipulation, querying, and pipeline creation.
data-analysis end-to-end etl-pipeline jupyter-notebook mysql mysql-database pandas python
Last synced: 08 Oct 2025
https://github.com/lafayettegabe/g2m-insight-for-cab-investment-firm
📊 Exploratory Data Analysis (EDA) on multiple datasets related to the cab industry in the US, to provide actionable insights and recommendations to a private firm looking to invest in the market. The analysis includes data cleaning, transformation, visualization, and hypothesis testing.
big-data data data-analysis data-science data-visualization eda gotomarket
Last synced: 13 Jun 2025
https://github.com/ultralytics/sandd
data-analysis data-science neutrino particle-physics
Last synced: 25 Apr 2025
https://github.com/rvalla/chessevolution
Some code to analyze my chess games using the Lichess API.
chess data-analysis lichess lichess-api python
Last synced: 23 Oct 2025
https://github.com/emso-c/stream-analyser
A tool that analyses YouTube live streams.
cli data-analysis guessing highlights python youtube-video
Last synced: 18 Jan 2026
https://github.com/ahmednasef3/store-sales-full-eda
Simple EDA for Store Sales.
data-analysis data-visualization eda exploratory-data-analysis matplotlib pandas plotly seaborn store
Last synced: 08 Oct 2025
https://github.com/nelson-gon/nelson-gon.github.io
Biologically Plausible Programming
bioinformatics blog blogdown computational-biology data-analysis data-exploration ghost ghostwriter-theme github github-pages hugo-site hugo-theme programming python3 r side-project
Last synced: 07 Feb 2026
https://github.com/jpquast/icp-ms-data-explorer
A shiny app for the exploration of ICP-MS data.
data-analysis icp-ms r shiny shiny-apps
Last synced: 17 Jan 2026