Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-01 00:07:23 UTC
- JSON Representation
https://github.com/imrandil/sql_practice_with_analysis
SQL practice using postgres db and docker as a tool to setup postgres, loving the sql way
data-analysis docker markdown postgres sql
Last synced: 10 May 2026
https://github.com/vikktor93/datascience-spotify
Analysis of Spotify dataset containing the top songs currently trending for over 70 countries.
data-analysis data-science data-scientist jupyter-notebook kaggle matplotlib pandas seaborn
Last synced: 10 May 2026
https://github.com/datasqlsantosh/global-energy-consumption-renewable-generation-python-data-analysis-portfolio
This project focuses on analyzing global energy consumption patterns and trends in renewable energy generation using Python data analysis libraries such as Seaborn and NumPy. The analysis aims to explore energy consumption data from various regions worldwide and examine the contribution of renewable energy sources over time
data data-analysis data-visualization pandas seaborn
Last synced: 10 May 2026
https://github.com/vatshayan/research-papers-
Research papers as blue prints, drafts, pre prints and published papers
artificial-intelligence big-data blockchain blockchain-technology data-analysis data-mining data-science data-visualization deep-learning deep-neural-networks image-classification image-manipulation image-processing machine-learning machine-learning-algorithms machine-learning-library python supervised-learning unsupervised-machine-learning webdevelopment
Last synced: 10 Jun 2026
https://github.com/codingvangogh/data-science
Data Science, Machine Learning, Data Exploration, Big Data etc
data-analysis datascience decision-tree-classifier decision-tree-regression heatmap jupyter-notebook machinelearning python python3 ridge-regression seaborn sklearn svm-classifier
Last synced: 11 May 2026
https://github.com/shridhar1504/titanic-survivor-prediction-datascience-classification-project
This projects predicts whether a passenger on the titanic survived or not using machine learning algorithms with the given details of the passenger data.
classification-algorithm data-analysis data-analytics data-cleaning data-preprocessing data-science data-visualization eda exploratory-data-analysis gradient-boosting jupyter-notebook machine-learning machine-learning-algorithms matplotlib naive-bayes-classifier predictive-modeling python3 scikit-learn seaborn supervised-learning
Last synced: 11 May 2026
https://github.com/ceia-prefeitura/urban-lit-tracker-etl
UrbanLitTracker coleta artigos acadêmicos sobre mudanças urbanas via OpenAlex API, processa e armazena em MongoDB. Oferece dashboard interativo com Dash, exibindo dados como trabalhos mais relevantes, autores e palavras-chave frequentes, facilitando a análise e visualização da literatura urbana.
academic-research bibliometrics data-analysis data-pipeline data-visualization etl openalex-api urban-studies
Last synced: 11 May 2026
https://github.com/OdessaZ/Portfolio-Projects
This is a repository I have created to showcase skills, share projects and track my progress in Data Analytics and Data Science
applied-mathematics data-analysis data-science excel jupyter-notebook matplotlib-pyplot pandas portfolio python r r-studio seaborn sql statistics
Last synced: 12 May 2026
https://github.com/ggarciajavier/udacity-dalf-project2-wrangle-openstreetmap-data
Work performed for the 2nd project of Udacity Data Analyst Nanodegree: OpenStreetMap data wrangling and analysis.
data-analysis openstreetmap python sql
Last synced: 12 May 2026
https://github.com/bhaveshbhakta/calories-burnt-prediction-using-xgbregressor
Calories Burnt Prediction
calories-burned data-analysis data-visualization machine-learning python xgboost
Last synced: 12 May 2026
https://github.com/parthds02/-daily-calorie-count-meal-plan-generator-
Welcome to the Daily Calorie Count Meal Plan Generator project! This Streamlit web application is designed to create personalized meal plans based on user inputs such as age, weight, gender, and calorie goals. It also allows users to download their customized meal plans as PDFs.
calories-tracker data-analysis data-science pdf-generation streamlit vscode
Last synced: 13 May 2026
https://github.com/devanshsahu47/prime-content-analytics
Prime Data Explorer analyzes Amazon Prime's content and credits data to uncover trends in release years, genres, and ratings. It cleans, merges, and visualizes the data to provide actionable insights for optimizing content strategy and boosting audience engagement.
data-analysis data-visualization exploratory-data-analysis jupyter-notebook python3
Last synced: 13 May 2026
https://github.com/rybakov-ks/particleanalyzer
A Computer Vision-based tool for automatic segmentation and size analysis of particles in Scanning Electron Microscope (SEM) images.
computer-vision data-analysis deep-learning detectron2 electron-microscopy image-segmentation materials-characterization microscopy-images nanotechnology object-detection particle-analysis scanning-electron-microscopy scientific-research sem sem-image-analysis yolo
Last synced: 13 May 2026
https://github.com/alexgenovese/react-charts-covid-19-data
Examples on COVID-19 data using different library charts: G2, G2Plot, Plotly, ApexCharts
data-analysis data-science data-visualization react reactjs
Last synced: 13 May 2026
https://github.com/ireneli393/music-recommendation-system-with-listenbrainz-dataset
Recommendation System
alternate-least-squares baseline-model data-analysis data-sceince lightfm-library python recommendation-system
Last synced: 14 May 2026
https://github.com/yeonjaee/data-analytics
converts raw data into actionable insights
Last synced: 11 Jun 2026
https://github.com/yashsingh43/lung-cancer-biomarker-analysis
Gene expression analysis to identify biomarkers for early lung cancer detection (SCLC & NSCLC)
bioinformatics biomarkers cancer cytoscape data-analysis gene-expression gsea nsclc r sclc
Last synced: 11 Jun 2026
https://github.com/prgermux/data-plotter
This Python application provides a graphical user interface (GUI) for analyzing and visualizing data from various sources. It uses the PyQt5 framework for the GUI and Matplotlib for plotting data. The application supports multiple file formats, allows users to select any columns for the X and Y axes, and provides dynamic plots.
automation data-analysis plott python
Last synced: 12 Jun 2026
https://github.com/neelimabonangi/defect-detection-hot-rolling
Defect Detection in Hot Rolling Using Machine Learning
classification data-analysis data-science defect-detection jupyter-notebook machine-learning manufacturing numpy pandas predictive-analytics python random-forest scikit-learn
Last synced: 12 Jun 2026
https://github.com/nob101/lotto-analyzer
Ein Node.js & SQLite basiertes Tool zur Analyse und Auswertung der Euromillionenziehung und Joker. A Node.js and SQLite web application to analyze, track, and evaluate lottery (Euromillionen) and Joker results.
backend css data-analysis express html5 javascript nodejs sqlite statistical-analysis
Last synced: 14 Jun 2026
https://github.com/dipeshgoyal013/crypto-currency-dashboard
This project analyzes historical cryptocurrency data and builds an interactive Power BI dashboard. It includes time-series forecasting of Bitcoin and Ethereum using ARIMA and Power BI’s forecasting model.
data-analysis excel power-bi python
Last synced: 15 Jun 2026
https://github.com/anderson-andre-p/uber-data-analysis
This repository contains a comprehensive data analysis project focused on Uber rides. The dataset used in this project is a spreadsheet obtained from Uber, containing data related to ride details, such as pick-up and drop-off locations, date and time of the ride, and the fare amount.
data-analysis data-science data-visualization python
Last synced: 15 Jun 2026
https://github.com/juanse0330/registro-pacientes-terapia-python
Proyecto en Python para automatizar el registro y análisis de pacientes en terapia ocupacional domiciliaria. Herramienta orientada al sector salud.
automatizacion data-analysis python salud terapia-ocupacional
Last synced: 17 Jun 2026
https://github.com/kheriberto/bedu_dc
Ejercicios del curso de "python desde 0" de la plataforma BEDU
Last synced: 18 Jun 2026
https://github.com/ddsuhaimi/turkiye-student-evaluation-eda
A little bit of exploration of well-known Turkiye Student Evaluation dataset
data-analysis data-science data-visualization-project exploratory-data-analysis exploratory-data-visualizations
Last synced: 18 Jun 2026
https://github.com/httpsnooow/graphs-analysis-neo4j
Challenges from the "Neo4J - Data Analysis with Graphs" course by Digital Innovation One (DIO).
challenge data-analysis data-engineering data-science graph neo4j neo4j-database neo4j-graph
Last synced: 18 Jun 2026
https://github.com/mahapeth/invest-track
Реализация инструмента для мониторинга активности пользователей ИС "Инвест" для ВКР по направлению 01.03.02 Прикладная математика и информатика
analitycs app data-analysis data-visualization jupyter-notebook python sites
Last synced: 20 Jun 2026
https://github.com/sakan811/stress-pattern-occurrence-in-english-words
This project is intended to provide English learners with data that allows them to make a data-driven guess when encountering words that they aren't sure where to stress
data-analysis data-visualization english english-language english-learning language powerbi powerbi-report powerbi-visuals
Last synced: 20 Jun 2026
https://github.com/aonurakman/data-analysis-and-ml-algorithms
An exploration of data analysis techniques and standard ML algorithms on QSAR oral toxicity dataset. - 2021 - Yıldız Technical University
classification clustering data-analysis data-mining isolation-forest python regression
Last synced: 20 Jun 2026
https://github.com/jayavarshini-jayakumaran/nba-exploratory-data-analysis
A data analytics project that explores NBA game and player data using Python and Power BI. Features data preprocessing, EDA, feature engineering, and an interactive dashboard for visualizing team and player performance trends.
data-analysis data-visualization exploratory-data-analysis powerbi python3
Last synced: 20 Jun 2026
https://github.com/katiebuntic/research_methods
Data Science Research Methods
analysis data-analysis data-science python research-project
Last synced: 23 Jun 2026
https://github.com/dcs-training/datavisualisationwithr2021
Data Visualisation with R Course (delivered by the Centre in October/November 2021). This workshop is focusing on good practice of creating graphs with R and R Studio. Go to the readme file
data-analysis data-visualisation data-wrangling r
Last synced: 23 Jun 2026
https://github.com/vbhvsingh0/coulombic_dyn_formaltetra
The Python code simulates a formaldehyde tetra-cation molecule using Coulombic forces
data-analysis physics-simulation python shell-scripting
Last synced: 24 Jun 2026
https://github.com/okdp/platform-packages
Collection of OKDP platform packages
clickops data-analysis data-platform data-science fluxcd gitops helm kubernetes kubocd portal spark
Last synced: 26 Jun 2026
https://github.com/shinie19/sql-data-warehouse-project
Build a modern Data Warehouse from scratch with SQL Server, including ETL processes, data modeling and analytics.
data-analysis data-analytics data-cleaning data-engineering data-lake data-lakehouse data-modeling data-normalization data-science data-standardization data-warehouse etl-pipeline medallion-architecture sql-server
Last synced: 29 Jun 2026
https://github.com/drisskhattabi6/meteo-data-mining
This repo contains using Data Mining Techniques to analyze meteorological (meteo) data. The objective is to extract meaningful insights and patterns from the data that can aid in understanding weather phenomena and predicting future weather conditions.
cart data-analysis data-mining data-visualization decision-making decision-tree extract-data extract-insights insights-analytics insights-data k-means knn machine-learning svm
Last synced: 21 Mar 2025
https://github.com/celineboutinon/product-classification
CentraleSupélec/OpenClassrooms Data Scientist 2024-2025 - Projet 6
api classification-models data-analysis data-science data-visualization e-commerce image-classification marketing marketing-analytics product-classification rgpd scraping-python text-classification
Last synced: 29 Jun 2026
https://github.com/maprihoda/learning-spark
apache-spark data-analysis data-science data-wrangling machine-learning pyspark python
Last synced: 19 May 2026
https://github.com/shrunga92/5g_qos_data_transformation_python
Resource Allocation in 5G Network Service
Last synced: 19 May 2026
https://github.com/vubacktracking/freecodecamp-data-analysis-with-python
5 Projects in Data Analysis With Python Course on Freecodecamp
data-analysis freecodecamp freecodecamp-project python
Last synced: 19 May 2026
https://github.com/prady2309/football-players-analysis
The dataset used is available on Kaggle
data-analysis data-science data-visualization fifa football-analytics machine-learning python3
Last synced: 19 May 2026
https://github.com/phanchenh/associationrule_adventurework_sqlproject
Project Title: Analyzing AdventureWorks Dataset with Association Rule Mining in SQL (2011-2014)
adventureworks association-rule-mining association-rules business-analytics business-intelligence data-analysis microsoft-learn mssql mssqlserver
Last synced: 08 Apr 2026
https://github.com/touppercase78/salary-prediction-collection
Salary predictions with ML models and analyses on datasets from several other GitHub repos
data-analysis data-visualization datasets machine-learning python3 regression-models
Last synced: 02 May 2026
https://github.com/eco786786/salaries
This analysis explores the factors influencing salaries for data professionals from 2020 to 2024, including job titles, experience levels, remote work ratios, employment types, company locations and sizes. Using data from Kaggle, the project uncovers trends and insights to guide both companies and professionals in the tech industry.
data-analysis git postgresql powerbi
Last synced: 19 May 2026
https://github.com/jabulente/tanzania-geographical-zones
This project provides a geospatial visualization of Tanzania's geographical zones and regions. It uses geospatial data to map each zone, display regions, and annotate them for easy identification. The visualizations include simulated data to demonstrate thematic mapping techniques.
ai data-analysis data-science data-visualization geopandas geospatial location matplotlib ml python tanzania tanzania-geographic tanzania-locations
Last synced: 19 May 2026
https://github.com/srvcl/lung-cancer-survival-analysis
Data Cleaning of a dataset and Survival Analysis in R Language
data-analysis data-science data-visualization r survival-analysis
Last synced: 11 May 2026
https://github.com/logan722/employee-management-system
An Employee Management System
data-analysis problem-solving pycharm-ide python-library
Last synced: 06 Apr 2025
https://github.com/carvalhoandre/coletor-tweets
Criado para coletar e armazenar tweets utilizando a API do Twitter. Inicialmente inspirado no caso de uso do livro Um Voluntário na Campanha de Obama, este projeto tem como objetivo demonstrar a importância do monitoramento no X. O coletor permite buscar tweets sobre qualquer termo desejado
data-analysis mongodb python twiter-analysis twitter
Last synced: 19 May 2026
https://github.com/sweta-kaundilya/911-calls-capstone-project
For this capstone project we will be analyzing some 911 call data from Kaggle.
data data-analysis data-visualization jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 28 Apr 2026
https://github.com/amishidesai04/interactive-data-visualisation-tool
A Java-based application leveraging JavaFX to create dynamic and interactive charts, including pie charts, bar charts, and line graphs. Ideal for visualizing various datasets, this tool offers customizable features and a user-friendly interface. Easily input and manage data, customize chart styles, and observe trends and patterns effectively.
charts data-analysis data-visualisation data-visualization-project gui java javafx visualization-tools
Last synced: 17 Apr 2026
https://github.com/timkong21/siemens-mobility-operations-industrial-engineer-simulation
Operations Industrial Engineer job simulation with Siemens Mobility. Includes time study analysis to identify assembly bottlenecks (Task 1) and a proposed layout redesign to improve efficiency without automation (Task 2).
data-analysis forage industrial-engineering job-simulation manufacturing process-improvement production-engineering python siemens time-analysis
Last synced: 19 May 2026
https://github.com/shubhamgoyal575/credit-card-fraud-detection
📌 Credit Card Fraud Detection using Machine Learning This project focuses on detecting fraudulent credit card transactions using machine learning models like Random Forest, XGBoost, and Deep Learning. The dataset is preprocessed to handle class imbalance, and multiple models are evaluated based on ROC AUC Score and F1 Score.
adaboost-classifier artificial-neural-networks credit-card-fraud data-analysis data-cleaning data-preprocessing data-science data-visualization deep-learning exploratory-data-analysis lightgbm machine-learning machine-learning-algorithms random-forest-classifer scikit-learn tensorflow xgboost
Last synced: 08 Feb 2026
https://github.com/amarlearning/exploring-the-evolution-of-linux
Data Analysis about the development of the Linux operating system by exploring its Git repository history.
cleaning-data data data-analysis data-wrangling datacamp first-commit git-history linux
Last synced: 12 May 2026
https://github.com/hawmex/aut_data_and_information_analysis_project
This repository contains the files of my project for the "Data & Information Analysis" course at AUT (Tehran Polytechnic).
data-analysis data-science k-means outlier-detection python
Last synced: 19 May 2026
https://github.com/samir-atra/share-lm_dataset_analysis
Analysis, studies and optimizations on the ShareLM extension dataset
data-analysis data-visualization gemma3n huggingface huggingface-transformers pandas
Last synced: 19 May 2026
https://github.com/tusharpandey003/data-science
Data science include Data Analysis, Machine learning , EDA,PCA and Data Structure and Algorithms
algorithms algorithms-and-data-structures data-analysis data-analytics data-cleaning data-science data-structures data-visualization dsa kmeans-clustering machine-learning outlier-detection pca pca-analysis
Last synced: 13 Mar 2025
https://github.com/twistedfrost/best-of-ml-python
Explore the best machine learning libraries in Python. Stay updated with weekly rankings and contributions. Join the community! 🐙🌟
airport airport-simulation awesome breast-cancer-prediction data-analysis data-science data-visualization decision-tree-classifier deep-learning gpt jax nlp random-forest-classifier scikit-learn svm-classifier transformer usg-ai-training-data usg-artificial-intelligence
Last synced: 26 Jun 2025
https://github.com/parthkumarmpatel/sql-exploratory-data-analysis
SQL EDA scripts for sales data warehouse — metrics, insights, and rankings from my data warehouse project.
data-analysis exploratory-data-analysis sql-server
Last synced: 26 Jun 2025
https://github.com/vatshayan/ip-address-data-analysis-
Extraction of 100's of IP Address and using Machine Learning algorithm for detecting threats
data data-analysis data-science data-visualization dataset ip ipconfig ipv4-address jupyter-notebook machine-learning machine-learning-algorithms supervised-machine-learning unsupervised-learning
Last synced: 15 Jul 2025
https://github.com/roma-glushko/magechurn
churn-analytics data-analysis data-science
Last synced: 06 Apr 2025
https://github.com/ryuzen6/kaggle-series
This is a series of Machine Learning/Deep Learning Models made for practice.
artificial-intelligence data-analysis data-science deep-learning machine-learning python3
Last synced: 20 May 2026
https://github.com/balajimohan18/peerloankart-loan-fraud-detection-datascience-project
This project uses machine learning to predict whether a loan applicant will repay their loan. The project uses a dataset of historical loan data from PeerLoanKart, a peer-to-peer lending platform.
classification-model data-analysis data-analytics data-cleaning data-science data-visualization dimensional-analysis eda exploratory-data-analysis feature-engineering gradient-boosting-classifier hyperparameter-tuning jupyter-notebook maachine-learning machine-learning-algorithms predictive-modeling python supervised-learning
Last synced: 20 May 2026
https://github.com/tabibyte/azerbaijani-rapper-lyrics-data-analysis
Lyrics Data Analysis of Azerbaijani Rappers
azerbaijan data-analysis rappers
Last synced: 22 Jul 2025
https://github.com/estherslabbert/final-capstone-unsupervised-ml
Exploration of USArrests data using unsupervised machine learning
arrests correction data data-analysis data-clustering data-visualization jupyter-notebook machine-learning pca-analysis standardised-data usa
Last synced: 26 Jun 2025
https://github.com/steviecurran/prediction-plot
Code to performs machine learning (k-nearest neighbours regression) and plot the predicted versus measured values
astrophysics c data-analysis high-redshift machine-learning pgplot python statistics tensorflow visualization
Last synced: 20 May 2026
https://github.com/nikitalpopov/news
v semester project
data-analysis data-science python scikit-learn
Last synced: 20 May 2026
https://github.com/silasberger/charts-analysis
Data set collection, preprocessing and analysis of singles- and album charts
charts data-analysis data-mining data-science dataset music
Last synced: 14 Sep 2025
https://github.com/ranxi2001/predicting-mental-health-risk
数据分析案例-精神健康预测(数据来源kaggle)
data-analysis data-visualization eda
Last synced: 27 Jun 2025
https://github.com/faizantkhan/python_matplotlib
Matplotlib is a powerful Python library for creating visualizations and plots. It’s widely used for data representation, making complex information more accessible and interpretable. It offers various types of plots, including line graphs, scatter plots, bar charts, histograms, and more
data-analysis data-analytics data-engineering data-science data-visualization deep-learning graphs line machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot matplotlib-python python
Last synced: 20 May 2026
https://github.com/deborangueira/campeonado_kaggle_2025
Desenvolvimento de um modelo de machine learning para prever o sucesso de startups. O objetivo é identificar quais empresas têm maior probabilidade de se tornarem casos de sucesso no mercado.
computacao data-analysis desafio kaggle modulo3 ponderada
Last synced: 16 May 2026
https://github.com/an0n1mity/spamclassifiereval
A repository for evaluating the misclassification rate of spam classification models using a threshold-based approach.
data-analysis machine-learning natural-language-processing python-programming spam-classification text-classification
Last synced: 02 Nov 2025
https://github.com/edanur-y/abalone-age-prediction-with-regression-models
Comparing the performances of simple linear, multiple linear, multi-layer perceptron and k-nearest neighbors regressions on abalone data to predict the age.
data-analysis hyperparameter-tuning missing-values-analysis outlier-analysis python recursive-feature-elimination
Last synced: 20 May 2026
https://github.com/hemant-kumar786/heart-disease-prediction
Heart Disease Analysis project in RStudio using statistical methods and data visualization. Includes data cleaning, exploratory data analysis (EDA), correlation study, and insights on key health indicators influencing heart disease.
correlation-study data-analysis data-visualization eda healthcare heart-disease r rstudio statical-analysis
Last synced: 02 Nov 2025
https://github.com/waynejz/heart-disease-analysis
COMP9321 19T1 Assignment 3
data-analysis machine-learning web-application
Last synced: 04 Apr 2025
https://github.com/habiburrahman-mu/data-wrangling
Data Wrangling is the process of converting data from the initial format to a format that may be better for analysis.
data-analysis data-mining data-science
Last synced: 21 May 2026
https://github.com/teja-1403/forage-tata-data-visualisation-empowering-business-with-effective-insights
This repository contains solutions to the 4 different tasks that must be performed during the Data Visualisation: Empowering Business with Effective Insights virtual internship provided by TATA via Forage.
analysis-and-reporting analytics analytics-and-decision-science charts communications dashboards data-analysis data-cleanup data-interpretation data-storytelling data-visualizations graph insights power-bi visual-basic visualizations
Last synced: 18 Feb 2026
https://github.com/mvharsh/blinkit-sales-dashboard
An interactive Power BI dashboard visualizing Blinkit's sales performance across outlets, item types, and customer ratings for strategic insights.
blinkitdashboard data-analysis data-visualization powerbi
Last synced: 25 Jan 2026
https://github.com/gui-sitton/bank-loans
In this project I will prepare a report for a bank's loan division. I find out whether a customer's marital status and number of children have an impact on loan default, as well as other factors
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 21 May 2026
https://github.com/ejw-data/tableau-drug-study
Brief analysis of drug treatments that were also analyzed with pandas
Last synced: 02 Jan 2026
https://github.com/faizantkhan/automated-eda
This repository showcases tools for automatic Exploratory Data Analysis (EDA) in Python. These tools help you quickly understand your datasets and generate insightful reports.
automatic automation autoviz data-analysis data-analysis-python data-science data-visualization dtale dtale-library eda exploratory-data-analysis ml pandas pandas-profiling python python-library sweetviz
Last synced: 18 Apr 2026
https://github.com/mmzong/gee_lifestyleeffectsonhypertension
Generalized Estimating Equations (GEE), Quasi-likelihood under the Independence Model Criterion (QIC), Longitudinal data, Embedded box plots within violin plots with hypertension risk categories, spaghetti plots, aggregate line plots, histograms, faceted-area plots, box and jitter plots. Investigating the impact of lifestyle on health.
aggregate-line-plot area-faceted-plots box-plots data-analysis data-manipulation data-science data-visualization generalized-estimating-equations histograms jitter-plots longitudinal-data qic quasi-likelihoods r spaghetti-plots violin-plots
Last synced: 29 Jul 2025
https://github.com/ginga1402/car_price_prediction
Predict the price of a car using MS Excel.
college-project data-analysis excel linear-regression
Last synced: 30 Mar 2025
https://github.com/jabulente/t-test-python-implementation
A Python-based implementation of one-sample, two-sample, and paired t-tests for statistical analysis and hypothesis testing.
automation data-analysis data-science eda exploratory-data-analysis hypothesis-testing independent-ttest one-sample-t-test python reporting statistics ttest two-sample-t-test
Last synced: 27 Jun 2025
https://github.com/bhiogade/tlc-trip-analysis
NYC Taxi and Limousine Commission (TLC) Trip Analysis
data-analysis data-cleaning data-collection data-visualization pandas-python tableau tableau-desktop
Last synced: 30 Mar 2025
https://github.com/mr-chang95/datascience_airbnb
Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.
airbnb data-analysis data-science data-visualization jupyter-notebook numpy pandas python sklearn
Last synced: 08 Apr 2026
https://github.com/kunalkumar2001/sales-project-using-excel-and-sql
Comprehensive sales analysis using SQL, Excel, and PowerPoint to uncover insights on top-sellers, peak times, and branch performance.
data-analysis data-analytics excel mssql sql
Last synced: 03 Nov 2025
https://github.com/ishmal793/basic-python-
Beginner-friendly Python code examples and exercises – a strong foundation for aspiring data analysts.
data-analysis data-analytics learning-python-code problem-solving python-basics python-for-beginners
Last synced: 23 Jul 2025
https://github.com/nishumehta/british-airways-reviews-analysis
This project analyzes British Airways reviews using Tableau to create an interactive dashboard. The dashboard visualizes average ratings across multiple metrics and trends over time.
dashboard data-analysis data-visualization tableau tableau-public
Last synced: 12 Jan 2026
https://github.com/lucashomuniz/project-10
Optimizing Sales Forecast Accuracy: Exploratory Analysis and Insights
data-analysis data-munging data-visualization dax-languague exploratory-data-analysis language-r power-bi sales-forecast statistics-modules
Last synced: 30 Mar 2025
https://github.com/achronus/data-exploration
A repository dedicated to interesting data exploration projects I've completed
data-analysis exploratory-data-analysis machine-learning matplotlib pandas python scikit-learn seaborn
Last synced: 02 Jan 2026
https://github.com/maazie-khan/austin-housing-insights-powerbi
Worked with a real estate dataset, we will build a tool to evaluate trends and drivers of house prices around Austin, Texas.
dashboard data-analysis data-science data-visualization database powerbi
Last synced: 02 Jan 2026
https://github.com/admacpherson/admacpherson.github.io
This repository hosts my personal website & portfolio. You can find my work experience, endorsements, contact information, and more on it at andrewmacpherson.dev
data-analysis personal-site portfolio website
Last synced: 15 Sep 2025
https://github.com/akshaypratapsingh09/zomato-blogs-all-links-dataset
Engineering / Culture / Blogs Data gathered for Educational and Learning purposes from Zomato's Blogs and spreading the better problem solving Methodologies adapted by Modern Unicorns
data-analysis dataset regex selenium webdriver zomato-data-analysis
Last synced: 06 Apr 2025
https://github.com/tushar2704/employee-distribution
This repository contains valuable insights and visualizations derived from an extensive HR dataset spanning from 2000 to 2020, with over 22,000 rows.
data-analysis data-visualization excel postgresql powerbi sql tushar2704
Last synced: 04 Nov 2025
https://github.com/omari-kd/environmental-impact-on-food-production
The goal of this project is to assess the environmental impact of food production at both macro and micro levels and propose data-driven insights to mitigate the negative effects of food production on the environment.
data data-analysis data-science data-visualization environmental-impact-analysis r
Last synced: 30 Mar 2025
https://github.com/waheed24-03/-ipl-stats-compare
Comparing Stats of Cricketers in IPL
cricket dashboard data-analysis data-visualization ipl python sports-analytics streamlit
Last synced: 28 Jun 2025
https://github.com/al-ghaly/hotel-revenue-excel-analysis
Excel Dashboard to analyze data of a hotel over the past three years.
dashboard data-analysis data-visualization excel excel-analysis
Last synced: 02 Jan 2026
https://github.com/nymarya/analise-correlacao-sifilis
Código da análise de correlação entre notificações de casos de sífilis e disponibilidade de testes e medicamentos
data-analysis healthcare pandas
Last synced: 03 Jan 2026