Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
https://github.com/alinababer/covid19-timeseries-cases-and-deaths-forecasting-
This study is based on confirmed COVID-19 cases and deaths collected from Pakistan. We apply AI-based forecasting models such as ARIMA, LSTM, Prophet, and VAR. Results demonstrate the promising potential of the time series model in forecasting COVID-19 cases and highlight its superior performance compared to the LSTM.
arima covid-19 data-analysis data-science data-visualization fbprophet forecasting lstm rnn time-series var vectorautoregression
Last synced: 19 Jun 2026
https://github.com/an4pdm/relatorio-de-vendas
This project was built with the tools offered by Power BI in order to improve my knowledge of ETL. The data used came from the Kaggle website.
data-analysis data-visualization database etl powerbi
Last synced: 20 Jun 2026
https://github.com/sakan811/stress-pattern-occurrence-in-english-words
This project is intended to provide English learners with data that allows them to make a data-driven guess when they encounter words whose stress placement they are unsure of.
data-analysis data-visualization english english-language english-learning language powerbi powerbi-report powerbi-visuals
Last synced: 20 Jun 2026
https://github.com/evanmathew/northwind-traders
SQL-powered analysis of sales, employee performance, and customer behavior using PostgreSQL window functions. This project uncovers key business insights to optimize decision-making.
case-study data-analysis jupyter-notebook northwind-traders postgresql python-postgresql sql
Last synced: 20 Jun 2026
https://github.com/jayavarshini-jayakumaran/nba-exploratory-data-analysis
A data analytics project that explores NBA game and player data using Python and Power BI. Features data preprocessing, EDA, feature engineering, and an interactive dashboard for visualizing team and player performance trends.
data-analysis data-visualization exploratory-data-analysis powerbi python3
Last synced: 20 Jun 2026
https://github.com/neeraj08823/bellabeat_case-study
HOW CAN A WELLNESS COMPANY PLAY IT SMART?
data-analysis data-cleaning data-visualization r rmarkdown rstudio tableau tableau-public
Last synced: 25 Jun 2025
https://github.com/vbhvsingh0/coulombic_dyn_formaltetra
The Python code simulates a formaldehyde tetra-cation molecule using Coulombic forces
data-analysis physics-simulation python shell-scripting
Last synced: 24 Jun 2026
https://github.com/mituskillologies/aiml-dypiemr-sep24
Programs conducted at DYPIEMR, Pune in training on AIML during September 2024.
artificial-intelligence data-analysis data-science machine-learning matplotlib neural-network numpy pandas python3
Last synced: 05 Apr 2025
https://github.com/jonathancaleb/adap
📊🌱 Agricultural Data Analysis Platform 🌍🚜 A personal initiative to analyze coffee growth trends in Uganda using Python, data science, and machine learning. This project supports sustainable farming with predictive models and interactive visualizations. 🍃📈
data-analysis data-science python
Last synced: 18 May 2026
https://github.com/data-edd/e-commercestore_analysis
This project analyzes e-commerce data to provide insights into sales performance, profitability, and customer behavior using Power BI.
data-analysis powerbi powerbidashboard
Last synced: 02 Feb 2026
https://github.com/wikidata/purdue-data-mine-2024
Program materials for WMDE's 2024 Purdue Data Mine project
analytics data-analysis data-quality data-science etl open-data python wikidata wikimedia
Last synced: 12 May 2025
https://github.com/pyramidheadshark/ai-mirea-sem1p
Completed set of all MIREA AI and DA practices (1 sem.)
beginner-friendly data-analysis data-science jupyter mirea
Last synced: 05 Apr 2025
https://github.com/alex-petrov-git/petrowiki
My wiki-pages
acoustics aeroacoustics aeromechanics algebra analysis data-analysis fourier-analysis hydrodynamics linear-algebra math ml obsidian physics probability-theory statistics wiki wikipedia
Last synced: 02 Mar 2025
https://github.com/adriangalvanzamora/ecommerce-analytics-olist
Data analysis project based on the Olist Brazilian E-Commerce dataset. Includes data cleaning, exploratory analysis, delivery performance metrics, customer satisfaction modeling, and geospatial insights. Built entirely in Python (Jupyter Notebook) using real-world data from Kaggle.
brazil customer-satisfaction data-analysis data-visualization ecommerce folium geospatial-analysis machine-learning matplotlib notebook pandas plotly python seaborn
Last synced: 06 May 2026
https://github.com/drisskhattabi6/meteo-data-mining
This repo uses data mining techniques to analyze meteorological (meteo) data. The objective is to extract meaningful insights and patterns from the data that can aid in understanding weather phenomena and predicting future weather conditions.
cart data-analysis data-mining data-visualization decision-making decision-tree extract-data extract-insights insights-analytics insights-data k-means knn machine-learning svm
Last synced: 21 Mar 2025
https://github.com/capjamesg/personal-notebooks
Notebooks for personal experiments with machine learning and computer vision.
data-analysis machine-learning notebooks
Last synced: 03 Apr 2025
https://github.com/estherslabbert/data-exploration
Data analysis and data visualizations for different data sets
data data-analysis data-science data-visualization jupyter-notebook titanic-dataset usa-arrests-dataset
Last synced: 06 Apr 2025
https://github.com/maprihoda/learning-spark
apache-spark data-analysis data-science data-wrangling machine-learning pyspark python
Last synced: 19 May 2026
https://github.com/adikahnf/Data-analysis-with-Python
data-analysis numpy pandas python streamlit
Last synced: 31 Dec 2025
https://github.com/kevin-rsj/the-substance-sentiment-analysis
Analyzes Reddit users' comments on the film The Substance (2024) using NLP techniques. The average sentiment score obtained was 0.19, and keywords such as "horror" and "like" stand out among the opinions.
data-analysis notebook python sentiment-analysis tableau visualization
Last synced: 19 May 2026
https://github.com/vubacktracking/freecodecamp-data-analysis-with-python
5 projects from the Data Analysis with Python course on freeCodeCamp
data-analysis freecodecamp freecodecamp-project python
Last synced: 19 May 2026
https://github.com/prady2309/football-players-analysis
The dataset used is available on Kaggle
data-analysis data-science data-visualization fifa football-analytics machine-learning python3
Last synced: 19 May 2026
https://github.com/abdoomohamedd/data-science-projects
A collection of data science projects ranging from exploratory data analysis to predictive modeling and clustering. Each project is designed to solve specific problems or explore particular datasets using various data science techniques and tools.
data-analysis data-analysis-python data-cleaning data-science data-visualization machine-learning machine-learning-algorithms
Last synced: 14 May 2025
https://github.com/sakan811/find-common-japanese-character-from-news
Showcase visualizations about common Japanese characters that appear in the news
beautifulsoup beautifulsoup4 data-analysis dataanalysis japanese japanese-language language news powerbi requests sqlite sqlite3 visualization webscraper webscraping
Last synced: 19 May 2026
https://github.com/jyrki69pro/pdf-insight-agent
📄 Extract insights from PDFs effortlessly with this AI-powered summarizer, transforming documents into structured, actionable points.
agent-based-model agentic-ai agentic-workflow agents ai-agent data-analysis finance-management financial-analysis generative-ai langchain langgraph llama3 llm multiagent-systems pdf phidata python toolcalling
Last synced: 11 Apr 2026
https://github.com/touppercase78/salary-prediction-collection
Salary predictions with ML models and analyses on datasets from several other GitHub repos
data-analysis data-visualization datasets machine-learning python3 regression-models
Last synced: 02 May 2026
https://github.com/galahad20/b244006e_analisis_data
Data analysis project for the Dicoding course "Belajar Analisis Data dengan Python". I learn to analyze data and visualize it to gain meaningful insights.
data-analysis data-analytics python streamlit
Last synced: 06 Apr 2026
https://github.com/srvcl/lung-cancer-survival-analysis
Data Cleaning of a dataset and Survival Analysis in R Language
data-analysis data-science data-visualization r survival-analysis
Last synced: 11 May 2026
https://github.com/jabulente/kruskall-wallis-test
This repository contains a project that provides a reusable Python function to perform the Kruskal-Wallis H-test across multiple continuous variables, grouped by a categorical feature.
data-analysis data-science eda hypothesis-tests kruskal-wallis kruskals-algorithm scipy-stats statistics
Last synced: 22 Jul 2025
https://github.com/logan722/employee-management-system
An Employee Management System
data-analysis problem-solving pycharm-ide python-library
Last synced: 06 Apr 2025
https://github.com/carvalhoandre/coletor-tweets
Created to collect and store tweets using the Twitter API. Initially inspired by the use case in the book Um Voluntário na Campanha de Obama, this project aims to demonstrate the importance of monitoring on X. The collector lets you search for tweets about any desired term.
data-analysis mongodb python twiter-analysis twitter
Last synced: 19 May 2026
https://github.com/chaganti-reddy/ai-prototype-customer-segmentation
Artificial Intelligence Prototype product based model for Customer Segmentation in E-Commerce Industry.
artificial-intelligence cluster-analysis customer-segmentation data-analysis machine-learning product-based prototype
Last synced: 13 Mar 2025
https://github.com/sweta-kaundilya/sql_projects_data_analytics
This repository contains SQL portfolio projects
data-analysis mysql-database mysql-workbench
Last synced: 10 Sep 2025
https://github.com/julie-fliorko/rockbuster-insights-sql-project
Data analysis using PostgreSQL to help Rockbuster Stealth LLC identify revenue trends, customer insights, and rental behavior patterns.
Last synced: 22 Jul 2025
https://github.com/amishidesai04/interactive-data-visualisation-tool
A Java-based application leveraging JavaFX to create dynamic and interactive charts, including pie charts, bar charts, and line graphs. Ideal for visualizing various datasets, this tool offers customizable features and a user-friendly interface. Easily input and manage data, customize chart styles, and observe trends and patterns effectively.
charts data-analysis data-visualisation data-visualization-project gui java javafx visualization-tools
Last synced: 17 Apr 2026
https://github.com/nerooc/device-downtime-detection
Repository for a project from the course "Sztuczne Sieci Neuronowe" (Artificial Neural Networks)
data-analysis detection-model recurrent-neural-networks
Last synced: 22 Mar 2025
https://github.com/sharduljunagade/human-activity-recognition
This repository contains the code for the Assignment-1 of the course ES 335: Machine Learning 2024 at IIT Gandhinagar taught by Prof. Nipun Batra.
data-analysis data-collection decision-trees groq-api human-activity-recognition jupyter langchain-python machine-learning pandas prompt-engineering python sklearn tsfel
Last synced: 08 Apr 2026
https://github.com/drisskhattabi6/exploratory-data-analysis-projects
This repo contains my exploratory data analysis projects for many datasets
data-analysis data-preprocessing data-visualization datasets diabetes-prediction eda exploratory-data-analysis iris-dataset
Last synced: 26 Jun 2025
https://github.com/nagar2nd/zomato-bangalore-analysis-tableau
Analysing restaurant data in Bengaluru to enhance customer satisfaction by optimizing the restaurant experience. The focus is on improving the popularity of different cuisines, enhancing delivery times, and boosting restaurant ratings. An interactive Tableau dashboard has been developed to help Zomato identify key areas for improvement.
data-analysis data-visualization tableau
Last synced: 05 Mar 2026
https://github.com/swatisinghit/e-commerce-trend-analysis-for-target
An exploratory and in-depth study of the E-Commerce sales data for a Brazilian store using SQL.
bigquery data-analysis mysql sql
Last synced: 19 May 2026
https://github.com/imnotamr/datasets-used
A comprehensive collection of datasets for machine learning and data science projects, covering topics from advertising and sales to health and sports analytics
ai classification data-analysis data-science data-visualization deep-learning jupyter-notebook machine-learning models python regression-models
Last synced: 19 May 2026
https://github.com/mulukensholaye/spark_kafka_streaming_csv
Real-time streaming data analysis pipeline that integrates Apache Spark's streaming library to read records from a Kafka topic
apache-kafka apache-spark data-analysis python3 realtime-messaging
Last synced: 19 May 2026
https://github.com/devexpress-examples/wpf-pivotgrid-how-to-display-underlying-data
This example demonstrates how to obtain the records from the control's underlying data source for a selected cell or multiple selected cells.
data-analysis dotnet dxpivotgrid pivot-grid pivot-grid-for-wpf wpf
Last synced: 19 May 2026
https://github.com/samir-atra/share-lm_dataset_analysis
Analysis, studies and optimizations on the ShareLM extension dataset
data-analysis data-visualization gemma3n huggingface huggingface-transformers pandas
Last synced: 19 May 2026
https://github.com/tusharpandey003/data-science
Data science, including data analysis, machine learning, EDA, PCA, and data structures and algorithms
algorithms algorithms-and-data-structures data-analysis data-analytics data-cleaning data-science data-structures data-visualization dsa kmeans-clustering machine-learning outlier-detection pca pca-analysis
Last synced: 13 Mar 2025
https://github.com/sebastianurdaneguibisalaya/colocaciones-de-credito-fondo-mivivienda-peru
I explore the credit placements (Colocaciones de Crédito) of Fondo MIVIVIENDA S.A. between 2018 and 2022, using a dataset downloaded from Peru's national open data portal (Portal Nacional de Datos Abiertos). 🏠
data-analysis jupyter-notebook python
Last synced: 24 Feb 2025
https://github.com/ishitaagl20/nyc-taxi_trip_prediction
Taxi Trip Duration Prediction Using the NYC Dataset
data-analysis data-exploration data-visualisation decision-trees matplotlib nyc-taxi-dataset python3 random-forest seaborn xgboost
Last synced: 19 May 2026
https://github.com/adeebkhan25/dataset_suicide_susceptible
The "Student Suicide Risk Factors Dataset" is a comprehensive collection of data aimed at understanding and mitigating the factors contributing to student suicides.
data-analysis dataset machine-learning supervised-learning
Last synced: 24 Dec 2025
https://github.com/archanakokate/bank_term_deposit_prediction
Build a Decision Tree classifier to predict if the client will subscribe to a Term Deposit based on their demographic and behavioral data.
data-analysis data-visualization exploratory-data-analysis machine-learning
Last synced: 14 Sep 2025
https://github.com/gui-sitton/carsells
In this project I am an analyst on the Crankshaft List. Hundreds of free vehicle advertisements are published on the site every day. I need to study the data collected over the last few years and determine which factors influence the price of a vehicle.
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 20 May 2026
https://github.com/tabibyte/azerbaijani-rapper-lyrics-data-analysis
Lyrics Data Analysis of Azerbaijani Rappers
azerbaijan data-analysis rappers
Last synced: 22 Jul 2025
https://github.com/steviecurran/prediction-plot
Code that performs machine learning (k-nearest neighbours regression) and plots the predicted versus measured values
astrophysics c data-analysis high-redshift machine-learning pgplot python statistics tensorflow visualization
Last synced: 20 May 2026
https://github.com/nikitalpopov/news
v semester project
data-analysis data-science python scikit-learn
Last synced: 20 May 2026
https://github.com/nemat-al/multivariate_data_analysis
Tasks for Multivariate Data Analysis Course @ ITMO University
data-analysis multivariate-analysis python
Last synced: 20 May 2026
https://github.com/ranxi2001/predicting-mental-health-risk
Data analysis case study: mental health prediction (data source: Kaggle)
data-analysis data-visualization eda
Last synced: 27 Jun 2025
https://github.com/gabrielramirezv/rnaseq_2025_notas
Repository for RNA-seq class from the Undergraduate Program in Genomic Sciences.
Last synced: 29 Mar 2025
https://github.com/saravanansuriya/energy-consumption-analysis
This project will analyze the energy usage and greenhouse gas (GHG) emissions of Ontario's Broader Public Sector (BPS) organizations in Power BI, leveraging a comprehensive database of reported data.
data-analysis data-cleaning powerbi python-script
Last synced: 22 Mar 2025
https://github.com/s-narasimman/zepto_inventory_sql_data_analysis
This project focuses on data cleaning, exploration, and analysis of product information from the Zepto dataset using SQL. It provides actionable insights into pricing, stock availability, discounts, and category-level performance.
aggregation categorization csv data-analysis data-cleaning kaggle postgresql sql zepto
Last synced: 16 May 2026
https://github.com/an0n1mity/spamclassifiereval
A repository for evaluating the misclassification rate of spam classification models using a threshold-based approach.
data-analysis machine-learning natural-language-processing python-programming spam-classification text-classification
Last synced: 02 Nov 2025
https://github.com/jakobzmrzlikar/pca-on-genomes
An analysis of human genome mutations from different populations.
data-analysis genome-analysis pca-analysis
Last synced: 16 May 2025
https://github.com/ksharma67/eda-on-ipl
This Python notebook analyzes IPL matches from 2008 to 2020 using Python packages such as pandas, matplotlib, and seaborn.
data-analysis data-science eda matplotlib numpy pandas python seaborn
Last synced: 07 May 2026
https://github.com/swouf/ntds_imdb_team4
data-analysis data-visualization datascience graph-theory
Last synced: 13 May 2025
https://github.com/hemant-kumar786/heart-disease-prediction
Heart Disease Analysis project in RStudio using statistical methods and data visualization. Includes data cleaning, exploratory data analysis (EDA), correlation study, and insights on key health indicators influencing heart disease.
correlation-study data-analysis data-visualization eda healthcare heart-disease r rstudio statical-analysis
Last synced: 02 Nov 2025
https://github.com/erseco/ugr_tratamiento_inteligente_datos
Working repository for the course Tratamiento Inteligente de Datos (Intelligent Data Processing) in the Master's in Computer Engineering at the University of Granada (UGR)
Last synced: 26 Apr 2026
https://github.com/errea/vet_clinic_database
This project needs special preparation. Because its goal is to solve some performance issues, we first need to introduce those issues. To do that, you will populate your database with a significant amount of data.
data data-analysis data-structures data-visualization database
Last synced: 21 May 2026
https://github.com/andersoncrs/regularizacion_lasso_en_modelos_de_regresion_lineal
This repository contains a detailed analysis of implementing Lasso regularization in linear regression models to predict vehicle prices. It starts from a clean dataset and applies various transformations and modeling steps to improve prediction accuracy.
data-analysis data-science data-visualization jupyter-notebook linear-regression regularization-methods seaborn sklearn
Last synced: 16 May 2026
https://github.com/dcs-training/introtodatabases
This repository hosts the material for a training developed by Dave Elsmore (Edina) for CDCS. Go to the readme file.
data-analysis data-wrangling databases sql
Last synced: 10 Jun 2026
https://github.com/daniel-jcvv/daniel-jcvv
👨💻 Data Engineer | 3+ years of enterprise experience with Telcel & Citi Banamex. Develops ETL pipelines, data governance, and cloud solutions. Building scalable data architectures and automated workflows for Fortune 500 clients. Tech Stack: Python, SQL Server, Oracle, Apache Airflow, PySpark
agentic-ai apache-airflow apache-kafka apache-spark automation business-intelligence citi-bank-apis data-analysis data-engineering data-lake data-warehouse etl-pipeline medallion-architecture mlops n8n-workflow python rag sql-server
Last synced: 15 Apr 2026
https://github.com/mvharsh/blinkit-sales-dashboard
An interactive Power BI dashboard visualizing Blinkit's sales performance across outlets, item types, and customer ratings for strategic insights.
blinkitdashboard data-analysis data-visualization powerbi
Last synced: 25 Jan 2026
https://github.com/abhipatel35/moviematcher-movie-recommender-system
A robust movie recommendation system using the MovieLens dataset, employing Collaborative Filtering, Matrix Factorization, and Hybrid Models to enhance recommendation accuracy and diversity.
collaborative-filtering content-based-filtering data-analysis eda hybrid-models machine-learning matrix-factorization movie-recommendations movielens-dataset python recommender-system surprise-library
Last synced: 21 May 2026
https://github.com/kaushik-puttaswamy/food-delivery-time-prediction-using-machine-learning
The Food Delivery Time Prediction Model estimates delivery times using regression algorithms, with XGBoost as the best performer, and is deployed as a real-time application via Streamlit.
data-analysis data-science delivery food-delivery geolocation machine-learning modeldeployment predictive-modeling python realtimeproject regression-models streamlit xgboost
Last synced: 16 Apr 2026
https://github.com/touradbaba/multi-page_dash_application
This repository contains a Multi-Page Dash Application designed to provide interactive visualizations of geo-spatial data, focusing on population and GDP. The app offers insights into demographic and economic trends through interactive maps and various types of charts. It is built with Python, using Plotly and Dash, and is deployed on Heroku.
dash dashboard data-analysis data-visualization exploratory-data-analysis heroku-deployment plotly pythonanywhere
Last synced: 27 Jul 2025
https://github.com/ejw-data/tableau-drug-study
Brief analysis of drug treatments that were also analyzed with pandas
Last synced: 02 Jan 2026
https://github.com/gkn-tech/brisecheck_website
Web Crawler, Visualizations and Game
choropleth-map contact-form data-analysis data-visualization game-development pygame python-flask scatter-plot web-crawler web-scraping
Last synced: 25 Feb 2025
https://github.com/javorraca/unsupervised-ml
A short exercise using R to perform unsupervised machine learning (clustering) on a sample data set.
ade4 clustering clustering-algorithm clustering-analysis data-analysis data-analytics data-science dplyr jupyter k-means-clustering machine-learning machinelearning ml r r-programming sse unsupervised-machine-learning
Last synced: 05 Apr 2025
https://github.com/mmzong/gee_lifestyleeffectsonhypertension
Generalized Estimating Equations (GEE), Quasi-likelihood under the Independence Model Criterion (QIC), Longitudinal data, Embedded box plots within violin plots with hypertension risk categories, spaghetti plots, aggregate line plots, histograms, faceted-area plots, box and jitter plots. Investigating the impact of lifestyle on health.
aggregate-line-plot area-faceted-plots box-plots data-analysis data-manipulation data-science data-visualization generalized-estimating-equations histograms jitter-plots longitudinal-data qic quasi-likelihoods r spaghetti-plots violin-plots
Last synced: 29 Jul 2025
https://github.com/josericodata/josericodata
Adding a cool README file
big-data data-analysis data-science dublin hadoop hadoop-mapreduce hadoop-spark ireland jobsearch jobseeker portfolio portfolio-data-science portfolio-website python sql
Last synced: 26 Aug 2025
https://github.com/abhishekyadav915/diwali_sales_analysis
This project aims to analyze sales data during the Diwali festival using Python. The analysis focuses on identifying key trends, customer purchasing behavior, and sales performance across different segments. By leveraging data visualization and statistical analysis, we uncover insights.
data-analysis data-visualization matplotlib-pyplot numpy-library pandas-dataframe seaborn-python
Last synced: 05 Apr 2025
https://github.com/chaedoll/teamproject-foreignerreport
Report on improving infrastructure for foreigners residing in the country
Last synced: 25 Feb 2025
https://github.com/mr-chang95/datascience_airbnb
Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.
airbnb data-analysis data-science data-visualization jupyter-notebook numpy pandas python sklearn
Last synced: 08 Apr 2026
https://github.com/kunalkumar2001/sales-project-using-excel-and-sql
Comprehensive sales analysis using SQL, Excel, and PowerPoint to uncover insights on top-sellers, peak times, and branch performance.
data-analysis data-analytics excel mssql sql
Last synced: 03 Nov 2025
https://github.com/easonsyc/kc-house-price-prediction
Prediction for House Price in King County.
data-analysis jupyter-notebook machine-learning python
Last synced: 21 May 2026
https://github.com/lucashomuniz/project-01
SALES DATA ANALYSIS WITH POWERBI AND PYTHON
business-analytics business-intelligence data-analysis data-science data-visualization excel powerbi python toy-project
Last synced: 30 Mar 2025
https://github.com/lucashomuniz/project-02
TRACKING USER ACCEPTANCE TESTING WITH POWERBI AND R
data-analysis data-visualization dax-languague powerbi-report powerbi-visuals powerquery r-language shiny-apps visuals
Last synced: 30 Mar 2025
https://github.com/sdley/logiciel-de-deliberation-uam-2022
Del-Annuel is annual deliberation software for higher-education schools or universities
data-analysis pandas python tkinter-gui
Last synced: 08 May 2026
https://github.com/akshaypratapsingh09/zomato-blogs-all-links-dataset
Engineering and culture blog data gathered from Zomato's blogs for educational and learning purposes, to spread the better problem-solving methodologies adopted by modern unicorns
data-analysis dataset regex selenium webdriver zomato-data-analysis
Last synced: 06 Apr 2025
https://github.com/tushar2704/employee-distribution
This repository contains valuable insights and visualizations derived from an extensive HR dataset spanning from 2000 to 2020, with over 22,000 rows.
data-analysis data-visualization excel postgresql powerbi sql tushar2704
Last synced: 04 Nov 2025
https://github.com/sramalhao/sleep_health_analysis
This repository contains a comprehensive project focused on analyzing various factors influencing sleep health, such as BMI, occupation, gender, age, physical activity, and stress levels.
analytics data-analysis eda matplotlib pandas python seaborn sklearn visualization
Last synced: 13 Apr 2026
https://github.com/aditiagrawal04/netflix-insights-mysql-
SQL-based analytical project exploring Netflix’s dataset to extract insights about content type, genre, ratings, country-based distributions, and release trends. Ideal for understanding business intelligence using SQL.
business-intelligence data-analysis data-exploration mysql netflix sql sql-project
Last synced: 28 Jun 2025
https://github.com/waheed24-03/-ipl-stats-compare
Comparing Stats of Cricketers in IPL
cricket dashboard data-analysis data-visualization ipl python sports-analytics streamlit
Last synced: 28 Jun 2025
https://github.com/al-ghaly/hotel-revenue-excel-analysis
Excel Dashboard to analyze data of a hotel over the past three years.
dashboard data-analysis data-visualization excel excel-analysis
Last synced: 02 Jan 2026
https://github.com/nymarya/analise-correlacao-sifilis
Code for the correlation analysis between syphilis case notifications and the availability of tests and medications
data-analysis healthcare pandas
Last synced: 03 Jan 2026
https://github.com/kaushik-puttaswamy/airline-passenger-referral-prediction-using-machine-learning
This project uses a machine learning model to predict if passengers referred by existing customers will book a flight, helping airlines target likely customers. Key factors like service ratings and value for money drive predictions, achieving over 90% accuracy.
airline-marketing customer-referral-prediction customer-satisfaction data-analysis feature-engineering hyperparameter-tuning machine-learning model-evaluation predictive-analytics
Last synced: 22 Mar 2025
https://github.com/bala-1409/peerloankart-loan-fraud-detection-datascience-project
This project uses machine learning to predict whether a loan applicant will repay their loan. The project uses a dataset of historical loan data from PeerLoanKart, a peer-to-peer lending platform.
correlation data-analysis data-cleaning data-science data-visualization dimensional-analysis eda exploratory-data-analysis feature-engineering gradient-boosting-classifier hyperparameter-tuning juypter-notebook machine-learning machine-learning-algorithms numpy pandas predictive-modeling python3 scikitlearn-machine-learning supervised-learning
Last synced: 08 Apr 2026
https://github.com/sanjana-bongale/cancer_survival_data_analysis_and_prediction_using_logistic_regression
This project performs data analysis using Python to predict cancer patient survival outcomes. It involves data cleaning, exploratory analysis, and visualizations to explore factors like cancer type, stage, and treatments. A logistic regression model is built to predict patient survival based on demographic and medical data.
data-analysis data-cleaning data-science data-visualization eda jupyter-notebook kaggle logistic-regression machine-learning matplotlib numpy pandas predictive-modeling python scikit-learn seaborn
Last synced: 08 Apr 2026