Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-27 00:07:21 UTC
- JSON Representation
https://github.com/firdevstorlak/maritime-signature-lab
Prototyp einer maritimen Signaturdatenbank (Akustik, Magnetik, RCS, IR) mit Python, SQLite und einfacher Computer-Vision.
acoustic-signatures cli-tool computer-vision data-analysis demo-project engineering-prototype infrared-imaging maritime opencv python radar rcs relational-database scientific-computing signal-processing sqlite synthetic-data
Last synced: 07 May 2026
https://github.com/parthshah02/customer_churn_dashboard
This repository features a comprehensive project showcasing data analysis and interactive dashboard using Python
data-analysis matplotlib numpy pandas python
Last synced: 13 Apr 2026
https://github.com/fbarffmann/mycitibike
Built an interactive Leaflet.js map visualizing over 750 Citi Bike station locations in NYC. Analyzed usage patterns, station density, and user navigation across the network.
citibike data-analysis data-visualization geojson geospatial interactive-map javascript leaflet nyc web-mapping
Last synced: 07 Jul 2025
https://github.com/samruddhi3012/rfm-sales-analysis
Hi there! In this project I have performed Sales Analysis (RFM Analysis) using SQL and Tableau.
data-analysis data-visualization mssqlserver rfm-analysis segmentation tableau
Last synced: 12 Mar 2025
https://github.com/iliyasalve/tiktok_claim_classification_model
Develop a predictive model for classifying videos with claims to reduce the backlog of user reports and optimize the content moderation process.
data-analysis machine-learning python regression-models tiktok
Last synced: 21 May 2026
https://github.com/trim0500/fe-stats-classifier
An experiment to create a machine learning model via PyTorch to classify select Fire Emblem unit base stat distributions.
creational-patterns data-analysis data-science data-visualization design-patterns excel jupyter jupyter-notebook matplotlib-pyplot numpy pandas python python-modules python3 pytorch singleton
Last synced: 11 Apr 2026
https://github.com/george-gca/ai_papers_analysis
Do some analysis based on main AI conferences
conferences data-analysis fasttext fasttext-embeddings fasttext-python python scikit-learn top2vec
Last synced: 29 Apr 2026
https://github.com/vriv06/btk-trials-data-analysis
Data analysis of Bioteksa plant nutrition trials for measure nutrient efficacy, resistance against biotic and abiotic factors, etc.
agriculture-research confluence crops data-analysis quarto r
Last synced: 23 Mar 2025
https://github.com/subratamondal1/heart-attack-prediction
Heart Attack Prediction of patients based on the required data. Data Ingestion - Data Preparation - Exploratory Data Analysis (EDA) - Modelling - Evaluation.
data-analysis data-science data-visualization kaggle-dataset machine-learning matplotlib-pyplot numpy pandas python3 scikit-learn seaborn
Last synced: 09 Apr 2026
https://github.com/chen0040/spark-tabular-analytics
Spark statistical inference framework for performing column pair-wise data analytics for large data table
anova chi-square-test confidence-intervals data-analysis hypothesis-testing spark statistical-inference tabular-data
Last synced: 07 Jul 2025
https://github.com/aalekhpatel07/statcan
StatCAN dataset fetcher and cleaner.
census data-analysis data-science statcan
Last synced: 02 Apr 2025
https://github.com/kzon94/torn-market-analyzer
Streamlit app that parses Torn Add Listing text, matches items with a custom dictionary, fetches market data via the public API, and generates KPIs and price recommendations using a modular Python analytics pipeline.
data-analysis data-engineering fuzzy-matching market-analytics numpy pandas python streamlit torn-city torn-city-api
Last synced: 11 Apr 2026
https://github.com/aicorsair/python-case-study-365-data-science-subscription-purchase-prediction
This repository contains a comprehensive case study on predicting 365 Data Science customer subscriptions using real-world student engagement data.
data-analysis data-analytics data-cleaning data-preprocessing data-science data-visualization decision-tree feature-engineering feature-selection hyperparameter-optimization hyperparameter-tuning k-nearest-neighbors logistic-regression machine-learning purchase-prediction python random-forest scikit-learn statsmodels svc
Last synced: 08 May 2026
https://github.com/virajbhutada/credit-card-transaction-analysis-sql
This project provides a structured database schema and SQL scripts to analyze credit card data. It includes tools for managing and analyzing transaction data, helping to identify spending patterns and trends. The project features visual schema diagrams and supporting documentation for easy understanding.
creditcard customer data-analysis data-cleaning data-modeling database database-management insights performance-optimization postgresql query-language schema-design schema-diagram scripts sql transactions trends
Last synced: 15 May 2026
https://github.com/deliprofesor/k-means-clustering-for-retail-data-analysis
This project uses K-Means clustering to segment wholesale customers based on their spending habits. The data is preprocessed, scaled, and clustered into four groups. The Elbow and Silhouette methods determine the optimal number of clusters, and results are visualized using boxplots and scatter plots to uncover spending patterns.
clustering-visualisation data-analysis elbow-method k-means k-means-clustering r silhouette-score
Last synced: 10 Apr 2025
https://github.com/abhipatel35/svm-hyperparameter-optimization-for-breast-cancer
Utilizing SVM for breast cancer classification, this project compares model performance before and after hyperparameter tuning using GridSearchCV. Evaluation metrics like classification report showcase the effectiveness of the optimized model.
breast-cancer cancer-diagnosis classification data-analysis data-science gridsearchcv healthcare hyperparameter-tuning jupyter-notebook machine-learning medical-imaging pycharm python scikit-learn support-vector-machine svm
Last synced: 05 Feb 2026
https://github.com/rafgpereira/obmep-analise
Código que analisa a retrospectiva das premiações da Obmep em determinada localidade e escola
data-analysis excel pandas python
Last synced: 29 Apr 2026
https://github.com/fatihilhan42/tourist_analysis_in_turkey_with_python
In this project, the number of tourists coming to Turkey between 2008-2021 was analyzed. The data from the data set you can find in the warehouse was first organized using data cleaning algorithms. These cleaned data were then output graphically using data visualization algorithms.
data-analysis data-cleaning data-science data-visualization jupyter-notebook python
Last synced: 03 May 2026
https://github.com/0xnu/data-analyst-training
The repository contains training materials for data analysts.
data data-analysis data-analyst
Last synced: 25 Aug 2025
https://github.com/lankesathwik7/sql-query-assistant
Natural language to SQL query converter using Groq LLM. Ask questions in plain English and get SQL queries, visualized results, and natural language explanations. Built with Streamlit and PostgreSQL.
data-analysis database groq llm natural-language-processing python sql
Last synced: 29 Apr 2026
https://github.com/vdoninav/real_estate_analysis
real estate analysis
data data-analysis data-analysis-python data-science pandas pandas-dataframe pandas-python plotly plotly-express scipy seaborn streamlit streamlit-application streamlit-dashboard streamlit-webapp
Last synced: 12 Apr 2026
https://github.com/hfzdzakii/dicoding-solvinghrproblem
This repo is a master submission for my Dicoding Final Project. Employee Attrition & Performance Dataset was being used to fulfill the submission. Feel free to explore and I hope my work give you some insight!
data-analysis data-visualization
Last synced: 16 May 2025
https://github.com/prajjwol09/data-cleaning-project
This project is dedicated to cleaning, standardizing a dataset, dealing with null values from a CSV file named "layoffs" using MySQL, with MySQL Workbench as the workspace environment. The goal is to prepare the data for analysis.
cleaning-data columns data-analysis database duplicates mysql rows standard
Last synced: 20 Apr 2026
https://github.com/mylena13s/electric-vehicle-data-analysis
data-analysis data-science programming pyspark-notebook
Last synced: 15 Mar 2025
https://github.com/jimohola/movielens_data_analysis
Movielens Data Analysis
data-analysis data-visualization exploratory-data-analysis pyhton3
Last synced: 11 Jun 2025
https://github.com/agrdatasci/climmob-analysis
Workflow for data analysis applied on ClimMob.net
citizen-science data-analysis workflow
Last synced: 24 Jun 2025
https://github.com/aya-jafar/python
Practice files & exercises during the journey of Python leaning 🐍
Last synced: 16 May 2025
https://github.com/nero103/airbnb-destination
This is and end-to-end project to uncover the ideal destination based on listings and hosts. Strategy included: Data workflow-SQL analysis-Data modeling-Data Visualization-Findings
data-analysis data-modeling data-visualization etl etl-pipeline excel microsoft-sql-server powerpoint sql tableau
Last synced: 27 Mar 2026
https://github.com/jnnjenga/video-game-analysis
Performs exploratory data analysis on a video game dataset, examining titles, release dates, teams, ratings, genres, and user engagement metrics to uncover trends, popular genres, and developer insights.
data-analysis data-science dataset eda exploratory-data-analysis game-analysis games genres jupyter-notebook pandas python trends user-engagement video-game visualization
Last synced: 13 Apr 2026
https://github.com/erickchacon/day2day
Functions that can be useful in the day-to-day data analysis. It comprehends functions to find paths for projects, make summaries of databases inside folder and so on.
data-analysis exploratory-data-analysis simulation spatial-analysis
Last synced: 02 Sep 2025
https://github.com/codesaadumair/pandas_exercises_personal
Personalized enhancements to pandas exercises with comprehensive solutions and practical insights for mastering data analysis in Python.
data-analysis data-science pandas python
Last synced: 09 May 2026
https://github.com/seblehner/feldprakt
Collection of plotting routines for a field exercise work using different measurement tools and Hobo weather stations.
data-analysis data-visualization jupyter-notebook python
Last synced: 05 Oct 2025
https://github.com/paphada1103/data-analysis-with-python
📊 Analyze data efficiently using Python’s top libraries. Learn to explore, clean, and visualize data for meaningful insights in your projects.
carpentries data-analysis data-carpentry data-visualisation dataframe-api dataset english hacktoberfest ibm jovian lsl machine-learning matplotlib programming python realtime social-sciences spark
Last synced: 09 May 2026
https://github.com/jianxi-erin/bigdata-machinelearning-lab
本项目是一个综合性的大数据与机器学习实验平台,包含两个主要任务,每个任务涵盖三个关键技术模块:大数据处理、数据分析和机器学习。项目基于真实的竞赛设计,提供完整的数据处理模拟和建模实践。
data-analysis data-visualization hadoop machine-learning python spark sql
Last synced: 03 May 2026
https://github.com/mahmoud2abdallah/improvado-marketing-homework
This Looker Studio dashboard provides a comprehensive analysis of marketing performance for August 2024, transforming raw data into actionable insights for data-driven decision making.
bigquery business-intelligence data-analysis looker-studio marketing
Last synced: 05 Oct 2025
https://github.com/pranavsp108/time-series-forcasting
A time-series forecasting project to predict hourly energy consumption using Python, Pandas, and an XGBoost regression model.
data-analysis data-science energy-consumption forecasting matplotlib numpy pandas python scikit-learn sustainability time-series xgboost
Last synced: 10 Apr 2026
https://github.com/joyceannie/sql-data-with-danny-case-studies
Case study solutions for #8WeekSQLChallenge at https://8weeksqlchallenge.com
8-week-sql-challenge 8weeksqlchallenge case-study data-analysis data-analytics postgresql sql
Last synced: 05 Oct 2025
https://github.com/vishal786-commits/target-businesscasestudy-sql
This project analyzes Target’s e-commerce transactions in Brazil between 2016 and 2018 using SQL. The goal was to explore customer behavior, order patterns, payments, delivery times, and freight costs to generate actionable business insights.
Last synced: 05 Oct 2025
https://github.com/egbe34/sql-portfolio
SQL portfolio showcasing business-focused queries for KPIs, retention, churn, RFM, and Pareto analysis. Built with sample commerce data for analytics and BI use cases.
bigquery business-intelligence churn-analysis cohort-analysis data-analysis kpi postgresql rfmsegmentation sql windowfunction
Last synced: 19 May 2026
https://github.com/aicorsair/python-case-study-imdb-movie-reviews-sentiment-analysis-with-nlp
This repository contains a comprehensive case study on sentiment analysis using the IMDb dataset of movie reviews.
ada-boost artificial-intelligence classification data-analysis data-analytics data-cleaning data-preprocessing data-science data-visualization feature-engineering feature-extraction hyperparameter-tuning logistic-regression machine-learning naive-bayes natural-language-processing nltk python random-forest shap
Last synced: 03 May 2026
https://github.com/josepablodmg/python--linear-regression---housing-exercise
A predictive analysis exploring the relationship between household characteristics and median income in California. Using linear regression, the project investigates whether blocks with fewer households correspond to higher median incomes.
california data-analysis data-science exploratory-data-analysis housing-data linear-regression machine-learning python regression scikit-learn statistics visualization
Last synced: 05 Oct 2025
https://github.com/trismald/eurosoccer1023
Data Analyst - European Soccer 2010 2023
data-analysis data-visualization jupyter-notebook pandas powerbi python
Last synced: 06 May 2026
https://github.com/jimartskenya/ai-code-context
🤖 Automate code documentation with AI to enhance understanding and streamline your workflow, saving time on unfamiliar codebases and projects.
ai claude-code codebase-analysis context-management data-analysis dependency-analysis gemini intellij-plugin jupyterlab-extension llm-integration machine-learning mcp-server open-source pandas prompt-engineering streamlit-component token-reduction vibe-coding
Last synced: 08 May 2026
https://github.com/manishkaa/google_data_analytics_capstone_case_study
This case study is a part of Google Data Analytics Capstone Project
bigquery data-analysis sql tableau
Last synced: 05 Oct 2025
https://github.com/anderson-andre-p/exploratory-data-analysis.roller-coaster
This repository contains an exploratory data analysis (EDA) project focused on roller coasters. The project involved organizing, cleaning, and visualizing the data to gain insights into roller coasters' characteristics and performance.
data-analysis eda exploratory-data-analysis exploratory-data-visualizations notebook
Last synced: 15 Mar 2025
https://github.com/swapnil-jain/tailored-tomes
Web application which shows Top 50 books of all time & recommends similar books if a book name is provided.
book bookrecommendsystem books bootstrap3 cosine-similarity data-analysis html machine-learning python
Last synced: 20 Jan 2026
https://github.com/anderson-andre-p/wine-data-analysis
This repository contains a data analysis project that focuses on a series of wine data. The project was completed using Python libraries such as NumPy, Pandas, Seaborn, and Matplotlib. The goal of this project was to gain insights into the characteristics of the wines and to practice data analysis skills.
data-analysis data-science data-science-portfolio pandas-dataframe wine-dataset
Last synced: 15 Mar 2025
https://github.com/marielachirinosr/analysis-urgencias-hospital-pitalito
This project involves analyzing emergency room admission data from the E.S.E Hospital Departamental de Pitalito using a star schema model.
bigquery data data-analysis etl-pipeline tableau
Last synced: 21 Jan 2026
https://github.com/dcs-training/pca-2023
PCA workshop. In this repo, you are going to find the code and files we are going to use for the practical part of the workshop, together with the ppt associated with this training
data-analysis data-visualisation data-wrangling r statistics
Last synced: 20 Jun 2026
https://github.com/subhamghimire/dataanavis
Learning Data analysis and visualization
data-analysis data-science data-visualization dataset
Last synced: 06 Oct 2025
https://github.com/farhad-here/height-distribution-analysis
Statistical comparison of height distributions in two groups using mean, standard deviation, and boxplots.
coefficient-of-variation data-analysis interquartile-ranges matplotlib mean numpy python scipy standard-deviation variance
Last synced: 13 Apr 2026
https://github.com/chdre/data-analyzer
A small package to analyze and preprocess data.
Last synced: 06 Oct 2025
https://github.com/data-edd/mastering_sql
This is a repo documenting me mastering sql
data-analysis mysql mysql-database sql
Last synced: 06 Oct 2025
https://github.com/codewithmayank-py/covid19-data-analysis-using-python
COVID-19 and Happiness Analysis
data-analysis data-analysis-python data-visualization dataset jupyter-notebooks numpy pandas python3 seaborn
Last synced: 11 Apr 2026
https://github.com/sora468/best-of-ml-python
🏆 Discover top-ranked Python libraries for machine learning, updated weekly to help you find the best tools for your projects.
airport airport-simulation chatgpt configuration data-analysis data-science data-visualization data-visualizations gpt keras machine-learning nlp python scikit-learn tensorflow transformer usg-ai-training-data usg-artificial-intelligence
Last synced: 09 May 2026
https://github.com/taljindergill78/yelp-arizona-analysis
This project analyzes the Yelp dataset for the state of Arizona to extract insights about restaurant businesses and user behavior. Using Apache Spark and PySpark for distributed data processing, the project demonstrates how big data tools can be used to uncover patterns in customer reviews, business performance, and user engagement.
big-data data-analysis data-engineering distributed-computing pyspark spark sql yelp-dataset
Last synced: 29 Apr 2026
https://github.com/alejandrolara11/machinelearningcourse
Machine Learning Basics: From Setup to Clustering
data-analysis data-science machine-learning numpy pandas plotly preprocessing-data python scikit-learn seaborn streamlit
Last synced: 11 Apr 2026
https://github.com/fatihilhan42/starbucks_analysis_turkey_and_world_with_python
In this project, firstly the brands for coffee in the world and then these brands in Turkey were examined. The data from the dataset, which you can find in the repo, was first organized using data cleaning algorithms. These cleaned data were then graphically extracted using data visualization algorithms.
data-analysis data-cleaning data-science data-visualization jupyter-notebook python
Last synced: 29 Apr 2026
https://github.com/pedramjlo/uae_cars_analysis
Analysis of the UAE second-hand car data
data-analysis jupyter-notebook pandas python sql sqlite3
Last synced: 11 May 2026
https://github.com/salma-mamdoh/investigating-netflix-movies-and-guest-stars-in-the-office
My Project to learn the Basics of Analysis & Visualization on DataCamp
data-analysis data-visualization datacamp matplotlib pandas python
Last synced: 11 Apr 2026
https://github.com/ilaxi/lomicontadores
data management tool in reference to number of actions per day in a year
data-analysis gdscript godot godot4 python
Last synced: 19 Apr 2026
https://github.com/myles/notebooks
Some of my random Jupyter Notebooks.
data-analysis data-science jupyter-notebooks
Last synced: 18 Jan 2026
https://github.com/rosanafss/r-journey
Diving into to wonderful see of DATA
Last synced: 19 Nov 2025
https://github.com/surbhi242singh/pizza_sales_project
Used SQL to analyze pizza sales data
data-analysis mysql pizza-sales sql
Last synced: 07 Oct 2025
https://github.com/nuriadevs/informes-powerbi
Este repositorio contiene informes elaborados con Power BI.
Last synced: 18 Feb 2026
https://github.com/shellynagar27/mobile-sales-analysis
Analyzed 2024 mobile sales data to uncover product trends, customer behavior, and regional insights using Power BI dashboards and structured data modeling.
cleaning-data data-analysis data-visualization dax eda figma modelling powerbi powerquery storytelling wireframe
Last synced: 16 May 2025
https://github.com/marcus-v-freitas/media_movel_covid19
Estudo de série temporal com média móvel de casos e óbitos de Covid19 no munícipio de São Paulo
covid-19 data-analysis data-science government-data jupyter-notebook moving-average plotly python sao-paulo temporal-series
Last synced: 17 Jan 2026
https://github.com/michaelcurrin/yahoo-finance-reports
Use the Yahoo Finance API to get info on shares of interest and report on them
data-analysis data-science python reporting shares stock-market yahoo-finance yahoo-finance-api
Last synced: 07 Oct 2025
https://github.com/prarthana-singh/bangalore-house-price-predictor
🏡 Bangalore House Price Prediction – A Machine Learning model to predict house prices in Bangalore using real estate data. Built with Linear Regression, Python, Pandas, NumPy, and Scikit-Learn.
data-analysis eda house-price-prediction linear-regression machine-learning numpy pandas python real-estate regression scikit-learn
Last synced: 19 Apr 2026
https://github.com/eharshit/end-to-end-vendor-insights
End-to-end analysis of vendor performance for wholesale/retail businesses, featuring data ingestion, cleaning, insights, and interactive Power BI dashboards.
analysis analysis-algorithms analytics dashboard data data-analysis datascience jupyter jupyter-notebook pandas powerbi powerbi-report retail wholesale
Last synced: 07 Oct 2025
https://github.com/prajjwol09/sql_retail_analysis_project
This project demonstrates SQL-based data cleaning, exploration, and business analysis on a retail sales dataset. It involves setting up a database, removing null values, performing EDA, and using SQL queries to extract key insights such as top customers, best-selling categories, and monthly sales trends.
data data-analysis datacleaning dataexploration pgadmin4 sql
Last synced: 15 Feb 2026
https://github.com/roydevashish/algo8.ai-data-manipulation-assignment
This assignment performs transaction-level sales data analysis and generates reports using Pandas / SQL / Spark inside a containerized environment. The dataset contains sales transaction records and is used to analyze SKUs, customers, and sales representative performance.
data-analysis duckdb python3 sql uv
Last synced: 15 May 2026
https://github.com/nimomach/skateboarding-in-olympics
Skateboarding made its debut in Olympics at the 2020 Summer Olympics. This is a dashboard focused on "Skateboarding in the Olympics" representing a comprehensive overview of the sport's performance, popularity, and key metrics during the Olympic Games.
data-analysis data-visualization olympics paris skateboarding tokyo
Last synced: 10 Mar 2026
https://github.com/gabboraron/biostatisztika_es_alkalmazasai
"A statisztika a matematika azon ága, melynek feladata, hogy eszközt adjon a politikusok kezébe, mellyel tetszőleges állítás és annak ellentéte is tudományos alapon igazolható"
biostatistics data-analysis data-visualization r statistics statistics-course
Last synced: 24 Oct 2025
https://github.com/shourya1997/boston_housing
In this project, you will apply basic machine learning concepts on data collected for housing prices in the Boston, Massachusetts area to predict the selling price of a new home.
boston-housing-dataset data-analysis jupyter-notebook machine-learning python unsupervised-machine-learning
Last synced: 18 May 2026
https://github.com/mr-dhan/eda-sales-customer-transactions
Dalam dunia bisnis ritel yang kompetitif, pemahaman mendalam terhadap perilaku pelanggan merupakan fondasi penting untuk pengambilan keputusan strategis. Namun, data transaksi pelanggan seringkali berjumlah besar dan kompleks, sehingga memerlukan proses analisis yang efektif untuk mengungkap insight yang berharga.
dashboard data data-analysis data-analysis-python data-science data-visualization eda python
Last synced: 29 Apr 2026
https://github.com/npodlozhniy/podlozhnyy-module
One place for the most useful methods for work
data-analysis data-science pypi
Last synced: 21 Jan 2026
https://github.com/omarsolieman/socialgiveawaydataanalysis
This project involved cleaning, analyzing, and processing data from an Instagram giveaway to ensure a fair and data-driven winner selection process. The primary goal was to automate the process of identifying valid entries, weighting them based on engagement (likes and multiple entries), and performing a post-giveaway analysis
data-analysis data-science data-visualization instagram scraping threejs
Last synced: 14 May 2026
https://github.com/maccccd/sql-proficiency-journey
A technical journey of my SQL understanding.
data-analysis sql systems-analysis-and-design uml-class-diagram
Last synced: 15 Feb 2026
https://github.com/chitranjan806/predicting-on-time-premium-deposits
A Predictive analysis project to predict the success rate of On-Time deposits of Premiums by Policy Holders.
analytics-vidhya analytics-vidhya-competition catboostregressor data-analysis data-science linear-regression logistic-regression python3
Last synced: 16 May 2026
https://github.com/chardyb/prob-and-stats-bmi6106
A repository for Spring 2025 BMI 6106: Statistics and Probability. This repository contains coursework, code examples, and projects exploring statistical methods and probabilistic models in biomedical informatics.
biomedical-informatics data-analysis data-science probability r statistical-modeling
Last synced: 02 Sep 2025
https://github.com/ilovenooodles/probstat-water-potability
Tugas Besar Probabilitas dan Statistika 1
csv data-analysis jupyter-notebooks python
Last synced: 03 May 2026
https://github.com/iwasakiyuuki/data-analysis-platform-etl
A collection of Airflow DAGs for automating data collection into our on-premises data analysis platform.
airflow airflow-dags data-analysis data-collection
Last synced: 01 Jul 2025
https://github.com/ak-abhilash/super-market-sales-data-analysis-and-forecasting-using-power-bi
Power BI project to visualize sales data of a supermarket.
dashboard data-analysis powerbi salesdata visualization
Last synced: 05 Feb 2026
https://github.com/roland045/bike-share-dataset-analysis
User behaviour analysis on a public bike-share dataset
data-analysis data-visualization python time-series-analysis user-behavior-analytics
Last synced: 29 Apr 2026
https://github.com/ndomah1/learning-probability-and-statistics
This repo is a comprehensive learning resource that covers fundamental to advanced topics in probability and statistics, including probability theory, descriptive and inferential statistics, probability distributions, regression analysis, and data exploration techniques.
correlation-analysis data-analysis descriptive-statistics exploratory-data-analysis hypothesis-testing inferential-statistics probability regression statistics
Last synced: 18 Jan 2026
https://github.com/thenorthkun/movies-dataset-analysis
Analysis & categorizing of Movies based on Actors, Genres, Gross covered etc 🦸🏼🧜🏼♀️🎧
data-analysis data-visualization filtering
Last synced: 23 Mar 2025
https://github.com/vishal-verma-96/pre-owned-car-price-prediction-using-streamlit-app
Capstone Project by skill Academy- Exploratory Analysis, Visualization and Prediction of Used Car Prices. Deploying the highest-scoring model with Streamlit web app
data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms matplotlib numpy pandas python3 regression-algorithms scikit-learn seaborn streamlit
Last synced: 11 Apr 2026
https://github.com/vanshuchaudhary/retail-sale
project uses MySQL to analyze retail sales data, focusing on customer behavior, sales trends, and product performance. The dataset includes transactions, customer demographics, and purchase details, helping businesses optimize strategies. Key Insights: 📊 Revenue Analysis – Total sales, top-spending customers 📅 Sales Trends
business-intelligence customer-behavior customer-behavior-analysis data-analysis mysql predictive-analytics retail-analytics sales-analysis sql-queries
Last synced: 23 Mar 2025
https://github.com/tzerk/esr
R package 'ESR' for plotting and analysing ESR spectra in dating applications
data-analysis data-visualization electron-spin-resonance geochronology r
Last synced: 13 Mar 2026
https://github.com/shriansh8619/sql_eda
Explored relational databases using SQL to perform comprehensive Exploratory Data Analysis (EDA), covering database exploration, segmentation, trend analysis, and performance ranking. Developed reusable SQL scripts to analyze dimensions, measures, and time-based metrics, helping uncover key business insights.
data-analysis exploratory-data-analysis mysql
Last synced: 20 Aug 2025
https://github.com/akash1070/project--uber-data-analysis
To Determine UBER data from the dataset using Python
data-analysis data-science python
Last synced: 09 May 2026
https://github.com/alexondata/daan_eda-exploratory-data-analysis_ecommerce
This project presents an Exploratory Data Analysis (EDA) pipeline for an eCommerce dataset, integrating Python, SQL Server, and Power BI to transform raw transactional data into meaningful business insights. The project was developed as part of an academic assignment at Transilvania University of Brașov, Faculty of Mathematics and Computer Science.
data-analysis data-visualization ecommerce microsoft-sql-server powerbi python
Last synced: 18 May 2026
https://github.com/leosimoes/digitalinnovationone-analise-covid
Projeto prático "Criando modelos com Python e Machine Learning para prever a evolução do COVID-19 no Brasil" da Digital Innovation One.
arima-models data-analysis data-science python time-series
Last synced: 09 May 2026
https://github.com/ntaraujo/cleo
Contact data processor for Cléo
contacts-manager data-analysis data-visualization whatsapp whatsapp-web
Last synced: 15 May 2026
https://github.com/tolumie/exploratory-data-analytics-projects
Exploratory Data Analytics – A collection of projects covering data exploration, feature engineering, hypothesis testing, and predictive modeling across diverse datasets, including insurance, real estate, laptops, cars, COVID-19, and the Olympics.
data-analysis data-visualization data-wrangling exploratory-data-analysis-eda feature-engineering hypothesis-testing machine-learning matplotlib numpy pandas predictive-modeling python seaborn statistical-analysis
Last synced: 11 Apr 2026
https://github.com/zulhaditya/web-scraping-python
A repository that stores various source code and web scraping methods using Python.
data-analysis python3 webscraping
Last synced: 12 Oct 2025