Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-29 00:07:38 UTC
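The four stages named in the definition above (inspecting, cleansing, transforming, and modeling) can be sketched on a toy dataset. This is only an illustrative sketch using the Python standard library: the sample CSV, its column names, and the helper functions are hypothetical and are not taken from any repository listed here, and "modeling" is reduced to a per-group mean.

```python
import csv
import io
from statistics import mean

# Hypothetical sample data; column names and values are invented for illustration.
RAW_CSV = """city,price
Paris,120
Paris,
Lyon,80
Lyon,100
paris ,140
"""


def load_rows(text):
    """Inspect: parse the CSV text into a list of dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def clean_rows(rows):
    """Cleanse: drop rows with a missing price and normalize city names."""
    cleaned = []
    for row in rows:
        price = (row["price"] or "").strip()
        if not price:
            continue  # skip rows where the price is blank
        cleaned.append({"city": row["city"].strip().title(), "price": float(price)})
    return cleaned


def group_prices(rows):
    """Transform: group prices by city."""
    groups = {}
    for row in rows:
        groups.setdefault(row["city"], []).append(row["price"])
    return groups


def summarize(groups):
    """Model (in the simplest sense): describe each city by its mean price."""
    return {city: mean(prices) for city, prices in groups.items()}


rows = load_rows(RAW_CSV)
summary = summarize(group_prices(clean_rows(rows)))
print(summary)
```

Many of the projects below perform the same steps with pandas or SQL rather than plain Python; the structure of the pipeline is what carries over.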
https://github.com/hosseinkarimi128/zed-one
An AI-powered assistant that analyzes CSV data using natural language queries to generate pandas code and visualizations.
ai-data-analysis automated-pandas automated-pandas-queries csv data-analysis fastapi langchain machine-learning matplotlib nlp openai pandas restful-api summarization visualization-tools
Last synced: 07 Apr 2026
https://github.com/vishnu-vamshii/layoffs-data-analysis-in-sql
This project focuses on the cleaning and exploratory analysis of a dataset containing layoff information. It includes data deduplication, standardization of columns, handling null and blank values, and analyzing layoffs by company, industry, country, and date. Various SQL queries are used to explore trends and patterns in layoffs over time.
Last synced: 15 Jul 2025
https://github.com/viper373/lol-dataanalytics
Tencent Games: comprehensive data analysis and prediction for League of Legends esports events, 2020/21/22 seasons
crawler-python data-analysis jupyter-notebook lol python spider
Last synced: 15 Jul 2025
https://github.com/karaniwachira/baby_names_analysis
Data Analysis: Baby Names Exploration
data data-analysis quarto quartopub r rstats tidyverse-ggplot2
Last synced: 22 Jun 2025
https://github.com/shrutiijoshi/airbnb-listing-reviews
Airbnb is an online marketplace that connects people who want to rent out their homes with travelers seeking accommodations.
data-analysis matplotlib-pyplot pandas-python python seaborn
Last synced: 17 May 2026
https://github.com/gagan8605/zepto_sql_analysis
This project explores and analyzes the inventory data of Zepto, a rapidly growing 10-minute grocery delivery platform in India. The dataset contains more than 3,000 SKUs across key product categories such as Fruits & Vegetables, Dairy, Beverages, Packaged Foods, and more. The analysis was performed using PostgreSQL, covering both data cleaning and bus
cleaning-data data-analysis database-management postgresql sql
Last synced: 16 Jul 2025
https://github.com/balajimohan18/loan-clustering-datascience-project
This project uses machine learning to cluster loans based on their similarities. It uses a dataset of loan applications that includes the loan amount and balance, then applies a clustering algorithm to group similar loans together.
clustering-algorithm data-analysis data-science data-visualization eda kmeans-clustering machine-learning sql unsupervised-learning
Last synced: 27 Jul 2025
https://github.com/priyadarshinijain/air-quality-data-analysis-and-visualization
🌍 Air Quality Data Analysis and Visualization
data-analysis jupyter-notebook python visualization
Last synced: 06 Feb 2026
https://github.com/amr-yasser226/interactive-sales-analytics-dashboard
An interactive web-based dashboard for visualizing multinational electronics sales data. This project for the DSAI 203 course integrates a Python/Flask backend with an amCharts frontend to provide dynamic insights into product revenues, sales distribution, and employee statistics across different countries.
am5charts amcharts business-intelligence css dashboard data-analysis data-analytics data-visualization flask html javascript python sqlalchemy sqlite web-application
Last synced: 13 Apr 2026
https://github.com/vedantshi/tableau-bike-data-dashboard
London Bike Rides Analysis explores bike usage patterns using data visualization and machine learning. It identifies trends through a dynamic moving average, analyzes weather impact with heatmaps, and provides actionable insights via an interactive Tableau dashboard. Tools: Python, Tableau.
data-analysis data-visualization python tableau weather-data
Last synced: 16 May 2026
https://github.com/maxbiostat/diehl_ebola_cell_2016
Supplementary code and data for Diehl et al., 2016 (Cell)
data-analysis data-visualization disease-spread ebola mutation
Last synced: 11 Jul 2025
https://github.com/purushothamadluru/atlantic-gdp-job-demand-analysis
data-analysis data-visualization powerbi
Last synced: 17 Feb 2026
https://github.com/olympus-terminal/data-processing
Data analysis and processing tools
automation data-analysis data-processing data-science etl machine-learning pdf-extraction python r research statistics web-scraping
Last synced: 16 May 2026
https://github.com/maheera421/pandas
Implementation of essential Pandas functions.
data-analysis data-manipulation pandas-dataframes pandas-datareader pandas-python
Last synced: 17 Jul 2025
https://github.com/namratha2301/starbucks_global_presence
Exploring the global presence of Starbucks.
business-analysis data-analysis data-science data-visualization matplotlib pandas pycountry
Last synced: 19 May 2026
https://github.com/sakan811/gachascope
Evaluate the cost-effectiveness of various in-app purchase bundles available in gacha games.
data data-analysis data-visualization game honkai honkai-star-rail honkai-starrail hoyoverse javascript nextjs tableau tableau-public typescript wutheringwaves
Last synced: 04 May 2026
https://github.com/venkat-023/thyroid-cancer-prediction
This project aims to develop a machine learning pipeline to predict thyroid cancer based on patient data. The dataset was sourced from multiple public repositories, cleaned, and merged to create a comprehensive dataset for modeling. Various classification algorithms were implemented, including Random Forest, Logistic Regression, K-Nearest Neighbors
data-analysis data-cleaning deep-learning ensembling-methods hyperparameter-tuning machine-learning-algorithms nueral-networks
Last synced: 17 May 2026
https://github.com/ofir-frd/predict-success-of-a-restaurant
Apply machine learning to a restaurant database. Study and analyze the data to predict whether a restaurant will be successful.
data-analysis data-science machine-learning visualization
Last synced: 11 Jun 2026
https://github.com/adnanrahin/nlp-with-disaster-tweets
Kaggle Competition: Predict which Tweets are about real disasters and which ones are not. Natural Language Processing.
data-analysis data-science data-visualization kaggle-competition machine-learning natural-language-processing regular-expression tweets
Last synced: 21 Jun 2025
https://github.com/karlyndiary/bellabeat-eda
Bellabeat Case Study - Google Data Analytics Capstone using Python.
bellabeat bellabeat-case-study bellabeat-eda bellebeat-data-analysis case-study case-study-analysis data-analysis data-visualization eda python reports
Last synced: 17 May 2026
https://github.com/nikbarb810/covid_growth_rate_390.51
Exploring Covid Growth Rate of European Population using genetic data analysis
bioinformatics data-analysis r rcpp
Last synced: 07 Apr 2026
https://github.com/pkjjoshi/restaurants-analysis
Performed beginner-level EDA on a restaurant dataset using Python. Analyzed top cuisines, city-wise ratings, price ranges, and online delivery impact using Pandas and Matplotlib. Includes 4 well-structured notebooks with visual insights.
beginner-project data-analysis data-visualization exploratory-data-analysis jupyter-notebook pandas python restaurant-data seaborn
Last synced: 21 Jun 2025
https://github.com/ahmeddhus/exploring-football-data-analysis
Learning and exploring data analysis through real-world datasets using Python, the StatsBomb APIs, and the mplsoccer library
data-analysis jupyter-notebook mplsoccer python statsbomb
Last synced: 17 May 2026
https://github.com/cyblx/clustering
This project explores clustering techniques and supervised learning applied to World Cup team performance analysis. The methodologies include K-Means, DBSCAN, K-Nearest Neighbors, Gaussian Mixture Models (GMM), and Agglomerative Clustering.
clustering data-analysis dbscan gmm kmeans supervised-learning unsupervised-learning world-cup
Last synced: 18 Jul 2025
https://github.com/theveryhim/basic-data-analysis
Working with basic Python tools frequently used in data science
data-analysis data-processing visualization
Last synced: 18 Jul 2025
https://github.com/theveryhim/web-scraping-and-statistical-tests
Crawling the web for data and performing statistical tests to verify judgments
data-analysis hypothesis-testing web-scraping
Last synced: 18 Jul 2025
https://github.com/theveryhim/massive-text-processing
Cleaning, processing, and analysis of a dataset of papers using the PySpark RDD framework
big-data data-analysis frequent-itemsets massive-datasets pyspark text-preprocessing
Last synced: 18 Jul 2025
https://github.com/stynw7/asa-international-data-quest-2025
ASA: International Data Quest 2025 🔥
data-analysis data-mining data-visualization jupyter-notebook python
Last synced: 17 May 2026
https://github.com/abhinavhariyal/diwali-sales-analysis
This project covers data visualization and analysis of Diwali sales data using Python and Jupyter Notebook.
data-analysis data-visualization jupyter python
Last synced: 19 May 2026
https://github.com/abhirajp595/python
Data Science Project using Python
data-analysis data-science data-visualization eda jyputer-notebook numpy pandas statistics
Last synced: 08 May 2026
https://github.com/mothraa/etl-marketanalysis-webscraping
OC project 2
data-analysis etl python web-scraping
Last synced: 15 Aug 2025
https://github.com/amyanchen/sf-airbnb
Exploratory Data Analysis of San Francisco Airbnb listings
data-analysis data-science data-visualization r rmarkdown statistics
Last synced: 18 Jul 2025
https://github.com/atharvkadammm/suicide-prediction-system
A machine learning project predicting suicide risk based on multiple socio-economic and environmental factors using data mining techniques.
csv data-analysis data-science data-visualization datamining exploratory-data-analysis feature-engineering machine-learnin matplotlib mental-health numpy pandas riskassesment seaborn sklearn suicide-prediction supervised-
Last synced: 01 Jul 2025
https://github.com/teditae/data-analysis-with-pandas
Mini data science projects focused on Pandas-powered analysis.
data-analysis data-manipulation pandas python
Last synced: 30 Apr 2026
https://github.com/atharvkadammm/calmlytic
An end-to-end machine learning project that predicts anxiety severity using classification models (Naive Bayes, Decision Tree, SVM, Logistic Regression, XGBoost), based on lifestyle, health, and behavioral features.
anxiety-prediction classification csv data-analysis data-preprocessing-and-cleaning data-science data-visualization ensemble-learning logistic-regression machine-learning-algorithms matplotlib mental-health numpy pandas python sci-kit-learn seaborn supervised-learning svm xgboost
Last synced: 21 Jun 2025
https://github.com/rezowanrahat/netflix_analysis
Data analysis of Netflix content using Python, Pandas, and Seaborn
data-analysis data-visualization netflix pandas python
Last synced: 07 May 2026
https://github.com/bhaveshbhakta/mobile-price-prediction-using-xgboost
Mobile Price Prediction
data-analysis data-visualization machine-learning mobile-price-prediction xgboost
Last synced: 19 Jul 2025
https://github.com/saisurajmatta/data-warehousing-and-advanced-data-analytics
Data Analytics Project: Analyzed Promotions and Provided Tangible Insights to Sales Director
data data-analysis data-architecture data-flow-analysis data-modeling data-pipeline data-segmentation data-visualization data-warehousing docker etl etl-pipeline mssql sql tableau
Last synced: 17 May 2026
https://github.com/kushagrakumar04/visual-age-distribution
A bar chart or histogram that visually depicts the distribution of a categorical or continuous variable, such as the age distribution or gender composition of a population. This graphical representation gives a clear overview of the patterns and trends in the data.
data-analysis data-science google-colab
Last synced: 21 Jun 2025
https://github.com/kwonnayeon/urban-parks-childrens-happiness
Grad thesis on urban parks’ impact on children’s happiness – data, results, and code
causal-inference data-analysis environmental-psychology latex matching propensity-score public-health r social-science statistical-analysis thesis-project urban-studies weighting
Last synced: 17 Feb 2026
https://github.com/alexjackson1/commons-indicative-votes
A cluster analysis of the House of Commons' Indicative Brexit Voting Process on 27 March 2019
Last synced: 19 Jul 2025
https://github.com/jpcadena/malware-analysis
Analysis of malware signatures and their associated Common Vulnerabilities and Exposures (CVEs)
black common-vulnerabilities-and-exposures cve-search data-analysis data-engineering data-reporting data-visualization isort malware-analysis matplotlib mypy numpy pandas plotly poetry pre-commit pydantic python ruff seaborn
Last synced: 03 Mar 2026
https://github.com/alpkanoz/ibm_data_science_professional_certificate
The repository contains projects and training materials carried out throughout the IBM data science professional course.
classification clustering data-analysis data-science data-visualization dataframe ibm ibm-watson machine-learning mathplotlib pandas predictive-modeling python scikit-learn
Last synced: 07 Mar 2026
https://github.com/mh0386/motorcycle_data_analysis
Data analysis applied to motorcycle dataset.
Last synced: 19 Jul 2025
https://github.com/jgohel9902/toronto-airbnb-snowflake
This project analyzes Airbnb listings in Toronto using Snowflake’s cloud data platform. It follows a Bronze → Silver → Gold medallion architecture and leverages Snowflake Cortex to generate AI-driven executive insights.
data-analysis python snowflake sql
Last synced: 07 Mar 2026
https://github.com/marlysson/craw
A system that shows data collected from various sources using Chart.js ⚡️
chartsjs data-analysis data-science web-scraping
Last synced: 21 Jun 2025
https://github.com/bho0920/crime-data-analysis-eu
Crime Data Analysis for Self-Defense Tool Market Entry in the EU.
data data-analysis sql sqlite tableau
Last synced: 21 Jun 2025
https://github.com/malexandersalazar/tools-python-mssql-statistics-descriptor
A lightweight tool based on sweetviz that generates high-density visualizations to kickstart Exploratory Data Analysis within Microsoft Azure SQL Database using ODBC with just one line of code
azure-sql-database data-analysis data-visualization eda python
Last synced: 16 May 2026
https://github.com/dmytrori/himalayan_expeditions
Himalayan expedition stats, 1905–2020
alpinism data-analysis data-visualization pandas-python
Last synced: 21 Jun 2025
https://github.com/preciousclement/maternal-experiences-in-nigeria
This repository contains a Python-based project that generates realistic synthetic data simulating the maternal health journey of 5,000 women in Nigeria.
data-analysis data-generation maternal-health nigeria public-health python
Last synced: 08 May 2025
https://github.com/jm199504/data-analysis-practice
Data analysis practice (Titanic / BankCustomers)
Last synced: 02 May 2026
https://github.com/finnishcancerregistry/directadjusting
Compute estimates of weighted averages with confidence intervals.
biostatistics confidence-intervals data-analysis direct-adjusting direct-adjustment epidemiology health-statistics r r-package statistical-adjusting statistical-adjustment weighted-analysis weighted-average weighted-averages
Last synced: 26 Mar 2025
https://github.com/shubhamgoyal575/tableau-visualization-dashboard
This repository features interactive Tableau dashboards for sales performance and healthcare analysis. It includes insights on revenue trends, regional sales, patient demographics, and hospital occupancy for data-driven decision-making. 🚀
dashborad data-analysis data-cleaning-and-preprocessing healthcare-analysis healthcare-dashboard sales-dashboard sales-data-analysis-project tableau tableau-dashboards tableau-public visualization visualization-tools
Last synced: 20 Feb 2026
https://github.com/patricialjohnson/data-visualization-tableau-project
Tableau Visualization Project
business-analytics business-intelligence data-analysis data-visualization digital-marketing digital-marketing-agency kpi microsoft-excel program-management project-management python search-engine-optimization seo sql tableau
Last synced: 21 Jun 2025
https://github.com/sharoonjoseph321/social_media_eda
Data analysis of social media apps using pandas, Python, and Matplotlib.
data data-analysis data-science data-visualization matplotlib programming-language project python pythonprojects
Last synced: 03 Mar 2025
https://github.com/jabercrombia/invoice-tracker
An invoice tracker with sample data and data visualizations, built with Next.js.
data-analysis nextjs postgres shadcn vercel
Last synced: 07 Apr 2026
https://github.com/josafary-ds/curso_dnc
Repository for storing study files and projects for the DNC Data Scientist course
data-analysis data-science data-visualization machine-learning powerbi python
Last synced: 13 Mar 2025
https://github.com/dsite42/simple_data_visualizer
This is a simple tool for visualizing data during a quick Exploratory Data Analysis (EDA). You can create various plot types as Seaborn or Plotly plots via a GUI in multiple windows (RelPlot, PairPlot, JointPlot, DisPlot, CatPlot, LmPlot, 3DPlot).
data-analysis data-science data-visualisation data-visualization eda exploratory-data-analysis plotly seaborn
Last synced: 12 May 2026
https://github.com/jovicdev97/Financial-Loan-DataScience-Notebook
Using NumPy and pandas to analyze a synthetic loan dataset with Python
data-analysis matlabplot numpy pandas plotting python seaborn
Last synced: 12 Mar 2025
https://github.com/m-biriulova/automated-sales-report
Automated sales & returns report using Python, Excel, and PDF export
automation business-intelligence data-analysis data-visualization excel financial-analysis freelance portfolio-project python report-generation sales-report
Last synced: 20 Jun 2025
https://github.com/blankscreen-exe/tsf_datascience
Repo for all TSF internship tasks
data-analysis data-mining data-mining-algorithms python
Last synced: 17 May 2026
https://github.com/madrury/hot-sauce
Simulation of a Hot Sauce Spiciness Dataset
data-analysis data-science data-visualization dataset machine-learning
Last synced: 16 May 2026
https://github.com/iamber12/stack-overflow-analysis-using-stack-exchange-api
This Python-based project utilizes the Stack Exchange API to analyze StackOverflow data, focusing on the 'R' and 'Dot Net' programming tags.
data-analysis data-visualization python stack-exchange-api
Last synced: 20 Jul 2025
https://github.com/whisplnspace/insightgenie
InsightGenie is an AI-powered data analyst that lets you upload files, ask questions, and get insights with visualizations
data-analysis data-science data-visualization deployment gemini-api huggingface nlp
Last synced: 19 Jun 2025
https://github.com/leoz0214/foodhygieneanalysis
Data analysis regarding Food Hygiene Ratings in England, Wales and Northern Ireland.
data-analysis food-hygiene-ratings pandas python
Last synced: 17 May 2026
https://github.com/dhruvil-26/sql-projects
This repository contains SQL projects focusing on data analysis and insights. Currently, it includes: 1. RSVP Movies Analysis - SQL queries to analyze movie trends, ratings, and genres. 2. Pizza Sales Analysis - SQL queries to explore sales patterns, customer behavior, and profitability in a pizza business.
analysis data-analysis database mysql pizza-sales-analysis rdbms rsvp sql
Last synced: 17 May 2026
https://github.com/fatihilhan42/wnba-draft-player-dataanalysis-1997-2022-with-python
In this project, the statistics of the players in the WNBA drafts from 1997 to 2022 were examined. The dataset, which is available in the repo, was first cleaned and organized, and the cleaned data was then presented graphically using data visualization techniques.
data-analysis data-analysis-python data-visualization jupyter-notebook python
Last synced: 17 May 2026
https://github.com/lucasfloresc/final_project
This is the final project of the Ironhack Bootcamp. In this project I applied the methods and techniques learned in the Bootcamp, such as web scraping and API extraction, data cleaning and processing with Python, Python logic, machine learning, and data visualization, all displayed in Streamlit for a more user-friendly interface.
data-analysis data-visualization machine-learning python streamlit webscraping
Last synced: 08 May 2026
https://github.com/mkk-1817/cvip-ds-exploratory_data_analysis-terrorism
This repository explores global terrorism trends by analyzing the Global Terrorism Database to uncover temporal patterns, identify top terrorist groups, examine attack types, and gain insights into geographical and success/failure dynamics.
coderscave data-analysis data-science data-visualization eda exploratory-data-analysis python terrorism-analysis
Last synced: 19 Jun 2025
https://github.com/celineboutinon/lafleche-et-associes
OpenClassrooms Data Analyst 2022-2023 - Project 7 using KNIME Analytics Platform
data-analysis data-analytics data-visualisation knime-analytics-platform no-code rgpd
Last synced: 08 Feb 2026
https://github.com/aelmah/ibm-applied-ds
A collection of projects I completed throughout the Applied DS Specialization.
applied-data-science-capstone beautifulsoup data-analysis data-visualization machine-learning python-for-ai-and-data-science web-scraping
Last synced: 11 Sep 2025
https://github.com/marcosvbras/udacity-nd109-project-titanic
Data analysis project for the Udacity Nanodegree course: Artificial Intelligence Programming with Python.
data-analysis data-analyst-nanodegree data-science jupyter-notebook machine-learning python udacity
Last synced: 19 May 2026
https://github.com/lorinczakos/sql-projects
A collection of SQL scripts that I wrote and had approved during the GoIT Romania Data Analyst course
bigquery cte data data-analysis dbeaver marketing-analytics postgresql project-repository sql vscode
Last synced: 16 May 2026
https://github.com/deliprofesor/2024-salary-analysis-for-machine-learning-engineers
This project analyzes a salary dataset to explore factors like experience, company size, remote work ratio, and country. It includes data cleaning, group analysis, visualizations, and machine learning models (linear regression and Random Forest) to predict salaries and identify key features.
data-analysis data-cleaning data-visualization ggplot2 linear-regression machine-learning plotly r-programming random-forest salary-prediction salary-trends
Last synced: 07 Mar 2026
https://github.com/gianninijs/dashboard_cury_company
Dashboard
data-analysis data-visualization python statistics streamlit
Last synced: 05 Apr 2025
https://github.com/wesleych3n/my-work-log
A personal project to record and analyze work check-in/check-out times in Google Sheets with a Telegram bot.
data-analysis telegram-bot worklog
Last synced: 20 Jul 2025
https://github.com/nferno55/mock-data-governance
Working with messy data and using data quality practices to clean it up and practice SQL/Python automation. YAML will be used for Metadata validation soon.
data-analysis database-management metadata python sql sqlite3 yaml
Last synced: 16 May 2026
https://github.com/sangampaudel530/bhutan-rainfall-explorer
Interactive dashboard to explore, analyze, and forecast rainfall trends in Bhutan (2021–2025) using Streamlit, Plotly, and Prophet.
bhutan climate-change data-analysis prophet-facebook rainfall-prediction streamlit visualization
Last synced: 17 May 2026
https://github.com/saksham-jain177/data-analysis
A collection of data analysis and machine learning projects across various datasets. Explore predictive modeling, data visualization, and insights from real-world data. Projects include sales predictions, disease detection, customer segmentation, and more.
api data data-analysis data-cleaning data-science data-visualization datamodeling dataset datasets exploratory-data-analysis python python3 web-scraping youtube-api
Last synced: 01 May 2026
https://github.com/tabibyte/aoty-highest-rated-albums-data-analysis
Data Analysis of AOTY Highest Rated Albums
albums aoty data-analysis music
Last synced: 10 Sep 2025
https://github.com/arv-anshul/pw-experience-portal
Data Analysis on PW Skills and Ineuron.ai experience/internship portal.
data-analysis experience ineuron-ai internship physics-wallah portal pw-skills python3
Last synced: 16 Apr 2026
https://github.com/ggarciajavier/udacity-dalf-project1-investigate-dataset
Work performed for the 1st project of Udacity Data Analyst Nanodegree: exploratory data analysis of a football dataset.
data-analysis football-analytics python python36 udacity-data-analyst-nanodegree
Last synced: 15 May 2026
https://github.com/athari22/multivariable_regression_and_valuation_model_
Multivariable regression model using Python to analyze and predict Boston housing prices based on various socioeconomic and environmental features.
data-analysis data-analysis-python housing-prices housing-prices-competition machine-learning pandas pandas-python plotly python regression-models seaborn seaborn-python sklearn
Last synced: 17 Jun 2025
https://github.com/nemat-al/aviation-accidents
Aviation Accidents Analysis
aviation data-analysis data-science data-visualization plotly python
Last synced: 17 May 2026
https://github.com/nafisrayan/crypto-trading-platform
This React Crypto Exchange Template is designed to provide a solid foundation for building a comprehensive cryptocurrency exchange platform. With its sleek and modern design, this template is perfect for anyone looking to create a user-friendly and intuitive trading experience.
crypto dashboard data-analysis data-visualization react template
Last synced: 16 May 2026
https://github.com/j-faria/bicerin
Working on the RV challenge in Torino
data-analysis gp radial-velocity rv-challenge
Last synced: 07 Apr 2026
https://github.com/surajsanap/employee-resigning-analysis-powerbi-dashboard-data-analytics
Effortlessly analyze employee resignations with our concise Power BI dashboard. Download the XML file, open the dashboard, and gain quick insights into resignation trends and reasons for departure. Streamlined and effective
dashboard data-analysis data-analytics powerbi python xml-dataset
Last synced: 08 May 2025
https://github.com/danitilahun/exploratory-data-analysis-projects
This repository contains a collection of my personal Exploratory Data Analysis (EDA) projects. Each project involves exploring various datasets to gain insights, uncover patterns, and visualize trends.
data-analysis data-science data-visualization exploratory-data-analysis python
Last synced: 16 May 2026
https://github.com/davydantoniuk/stackoverflow-graph-analyse-r
data-analysis graph r stackoverflow
Last synced: 13 Mar 2025
https://github.com/jacktheprogrammer/hypothesis-testing-using-data-analytics
Hypothesis testing using data analytics for a yellow trip car ride provider service, aimed at increasing its revenue
data-analysis data-analytics data-analytics-project data-insights data-plotting data-visualization descriptive-analysis hypothesis-testing prescriptive-analysis statistical-analysis statistical-methods
Last synced: 17 Jun 2025
https://github.com/mostafa-bashir/investigating_weather_data
data-analysis ipython jupyter-notebook nump pandas python
Last synced: 07 Apr 2026
https://github.com/dionixius7/titanic-disaster-ml-model
This project predicts the survival of passengers on the Titanic using the Kaggle Titanic Disaster Dataset. The dataset contains passenger information such as age, gender, and class. Different machine learning algorithms have been applied for this predictive model to accomplish an accurate prediction that will define the survival chances
data-analysis data-science data-visualization eda knn-classifier machine-learning neural-network python scikit-learn svm tensorflow titanic-kaggle titanic-survival-prediction
Last synced: 07 Feb 2026
https://github.com/djccnt15/mathematics
data-analysis data-science linear-algebra python statistics
Last synced: 24 Jun 2025
https://github.com/kakri787/alcoholism-and-grade-analysis
A mini project for a university data science module in which we analyzed the relationship between students' alcohol consumption and their academic performance, using exploratory data analysis and machine learning techniques to see whether students' grades can be predicted.
data-analysis data-science data-vizualisation lasso-regression machine-learning neural-network
Last synced: 12 Apr 2025
https://github.com/anandanraju/sql_data_analysis_projects
These two projects analyze pizza data and Walmart sales data using SQL to identify insights and trends. The aim is to take a data-driven approach to understanding sales performance, identifying key factors that influence sales, and providing actionable recommendations for business improvement.
csv data-analysis data-management mysql pizza sql sql-schema walmart
Last synced: 24 Jun 2025
https://github.com/prathmesh2507/ctc-hackthon
A data-driven system designed to reduce overcrowding and optimize urban public transport using real-world geospatial data and intelligent simulation.
dashboard data-analysis data-visualization python streamlit
Last synced: 16 May 2026
https://github.com/monish-nallagondalla/cement_strength_prediction
The Cement Strength Prediction project uses machine learning to predict the compressive strength of cement based on its components, such as Cement, Fly Ash, Water, Superplasticizer, Coarse Aggregate, Fine Aggregate, and Age. The goal is to forecast compressive strength (MPa) for optimized cement production and quality control.
cement-strength-prediction construction-industry data-analysis data-preprocessing data-science data-visualization feature-engineering machine-learning predictive-modeling python regression-analysis scikit-learn
Last synced: 11 May 2026
https://github.com/nitins17/tableauvisualizations
Visualizations I created while learning to work with Tableau
data-analysis data-science data-visualization tableau visualization
Last synced: 01 Mar 2026
https://github.com/edumoraes1/journey_active_users
Base segmentation via SQL for the active sellers journey
bq data-analysis salesforce sql
Last synced: 02 Feb 2026
https://github.com/brownred/python-and-sql
Python and SQL (PostgreSQL & MySQL) for data analysis.
data-analysis databases python3 sql
Last synced: 11 May 2026