Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-01 00:07:23 UTC
- JSON Representation
https://github.com/nmelgar/healthy_child_dataviz
Data visualization project to analyze what a healthy child is.
analysis data data-analysis data-science data-visualization dataviz research tableau visualization
Last synced: 23 Feb 2026
https://github.com/satyam4229/prediction-of-cement-compressive-strength
Prediction of cement compressive strength is a model which is based on Regression model, Here we predict that how much is the compressive strength of the particular cement has with variety of mixtures of its component.
data-analysis data-science data-visualization jupyter-notebook kaggle python
Last synced: 13 Apr 2026
https://github.com/wilfordaf/dataanalyst-test
Test task for Junior Data Analyst position
data-analysis pandas python trading-data
Last synced: 28 Feb 2025
https://github.com/purposeachiever6/discovering_hidden_pattern
Discovering Hidden Patterns in Sequential and Numerical Data
data-analysis r statistical-analysis
Last synced: 28 Feb 2025
https://github.com/pratik-khose/realtime-sales-simulation
Power BI: Realtime Sales Simulation using SQL Server and Direct Query
data-analysis data-analytics data-visualization dax-query powerbi sql sql-server sqlserver
Last synced: 10 Jun 2026
https://github.com/robinmillford/cardiac-care-performance-dashboard
This project presents a comprehensive data analysis and interactive dashboard focused on Cardiac Surgery and Percutaneous Coronary Interventions (PCI) performance by hospital, spanning from 2008 onwards.
cardiac data-analysis data-visualization plotly-express streamlit-dashboard tableau tableau-public
Last synced: 07 Sep 2025
https://github.com/auliannee/customer-analysis-with-tableau
This repository contains the data source and the tableau workbook.
data-analysis data-visualization tableau
Last synced: 12 Mar 2026
https://github.com/firetyrant/sql-portfolio-projects
Documenting my SQL learning journey with hands-on projects focused on data cleaning, analysis, and optimization.
bigquery data-analysis databases etl learning portfolio query-optimization sql
Last synced: 19 Apr 2026
https://github.com/esther-poniatowski/multitask-context-dependent-behavior
Data analysis of neuronal recordings in naive and trained animals performing multiple tasks in active and passive attentional states
cognitive-neuroscience computational-neuroscience data-analysis data-visualization information-processing
Last synced: 26 Mar 2025
https://github.com/satyam4229/prediction-of-different-diseases
Prediction of the different diseases with the help of different symptoms express the diseases in the real time. In the dataset, there are 132+ different symptoms on which the model is trained to give the best result of the disease.
data-analysis data-science data-visualization jupyter-notebook kaggle python
Last synced: 13 Apr 2026
https://github.com/roshaka/samplr
Samplr is a Python decorator for selecting a subset of items from a list, with options for customisation and informative console printouts.
data data-analysis data-engineering decorators list python sampling
Last synced: 14 Jan 2026
https://github.com/agustin-caceres/Arg-Telecom-Analisis
Telecom Argentina Insights
business-analytics business-intelligence data-analysis data-visualization database postgresql python streamlit
Last synced: 29 Apr 2025
https://github.com/mattholy/haka
HaKa is an out-of-the-box tool system designed for data engineers and data analysts in medium-sized enterprises. It is easy to deploy and scale.
celery data-analysis data-engineering fastapi python uvicorn-gunicorn
Last synced: 19 May 2026
https://github.com/parthshah02/customer_churn_dashboard
This repository features a comprehensive project showcasing data analysis and interactive dashboard using Python
data-analysis matplotlib numpy pandas python
Last synced: 13 Apr 2026
https://github.com/scailfin/rob-client
Command line user interface for the Reproducible Open Benchmarks for Data Analysis Platform (ROB)
benchmarks data-analysis reproducibility
Last synced: 14 Jan 2026
https://github.com/mainak-97/weather-data-analysis-using-python
A comprehensive analysis of time-series weather data using Python and Pandas, focusing on data exploration, cleaning, and uncovering insights.
data-analysis jupyter-notebook pandas pandas-dataframe python python3 time-series-analysis
Last synced: 08 May 2026
https://github.com/nimomach/cafe-sales
This analysis focuses on evaluating the sales performance of a cafe by examining key metrics such as total revenue, sales by product category, peak sales times, and many more.
cafe data-analysis data-visualization sales
Last synced: 12 Mar 2026
https://github.com/chen0040/spark-tabular-analytics
Spark statistical inference framework for performing column pair-wise data analytics for large data table
anova chi-square-test confidence-intervals data-analysis hypothesis-testing spark statistical-inference tabular-data
Last synced: 07 Jul 2025
https://github.com/aalekhpatel07/statcan
StatCAN dataset fetcher and cleaner.
census data-analysis data-science statcan
Last synced: 02 Apr 2025
https://github.com/masamallow/jupyterlab-my-local
Configuration to run my personal JupyterLab on my local.
data-analysis jupyter jupyter-notebook jupyterlab
Last synced: 26 Mar 2025
https://github.com/deliprofesor/k-means-clustering-for-retail-data-analysis
This project uses K-Means clustering to segment wholesale customers based on their spending habits. The data is preprocessed, scaled, and clustered into four groups. The Elbow and Silhouette methods determine the optimal number of clusters, and results are visualized using boxplots and scatter plots to uncover spending patterns.
clustering-visualisation data-analysis elbow-method k-means k-means-clustering r silhouette-score
Last synced: 10 Apr 2025
https://github.com/abhipatel35/svm-hyperparameter-optimization-for-breast-cancer
Utilizing SVM for breast cancer classification, this project compares model performance before and after hyperparameter tuning using GridSearchCV. Evaluation metrics like classification report showcase the effectiveness of the optimized model.
breast-cancer cancer-diagnosis classification data-analysis data-science gridsearchcv healthcare hyperparameter-tuning jupyter-notebook machine-learning medical-imaging pycharm python scikit-learn support-vector-machine svm
Last synced: 05 Feb 2026
https://github.com/sabelomkhwanzi/data-alchemist-boot-camp
Built on Covalent's Unified API, Increment has the full historical data set for 40+ chains including every smart contract, event, transaction, address, etc. With access to all this data you can find:
covalent data-analysis increment
Last synced: 11 Mar 2026
https://github.com/edanur-y/airline-customer-satisfaction-prediction-with-multiple-logistic-regression
Performing multiple logistic regression analysis on airline and customer data to predict the satisfaction. 🔵R
data-analysis missing-values-analysis multiple-logistic-regression optimal-cut-off-points r
Last synced: 09 Jun 2026
https://github.com/luciocolonna/cyclistic-bikesharing-2023
Case study on public data from Chicago's Divvy bikeshare, using R
bikesharing capstone-project cyclistic cyclistic-bikshare data-analysis data-visualization geojson ggplot2 google-data-analytics google-data-analytics-capstone-project google-data-analytics-professional leaflet r sf tidyverse
Last synced: 02 Apr 2025
https://github.com/prajjwol09/data-cleaning-project
This project is dedicated to cleaning, standardizing a dataset, dealing with null values from a CSV file named "layoffs" using MySQL, with MySQL Workbench as the workspace environment. The goal is to prepare the data for analysis.
cleaning-data columns data-analysis database duplicates mysql rows standard
Last synced: 20 Apr 2026
https://github.com/jnnjenga/video-game-analysis
Performs exploratory data analysis on a video game dataset, examining titles, release dates, teams, ratings, genres, and user engagement metrics to uncover trends, popular genres, and developer insights.
data-analysis data-science dataset eda exploratory-data-analysis game-analysis games genres jupyter-notebook pandas python trends user-engagement video-game visualization
Last synced: 13 Apr 2026
https://github.com/ilke-kas/multivariate-data-analysis
A curated collection of R-based data analysis projects applying regression modeling, clustering, dimensionality reduction, multivariate statistics, and classification. Each project showcases practical data science techniques, interpretability, and domain insights using real-world and academic datasets.
classification data-analysis data-visualization dimensionality-reduction machine-learning multivariate-analysis r regression statistics
Last synced: 05 Oct 2025
https://github.com/codesaadumair/pandas_exercises_personal
Personalized enhancements to pandas exercises with comprehensive solutions and practical insights for mastering data analysis in Python.
data-analysis data-science pandas python
Last synced: 09 May 2026
https://github.com/seblehner/feldprakt
Collection of plotting routines for a field exercise work using different measurement tools and Hobo weather stations.
data-analysis data-visualization jupyter-notebook python
Last synced: 05 Oct 2025
https://github.com/elcarrillo/computational_bootcamp_material
Material for a Computational Bootcamp
bootcamp-project computational-physics data-analysis data-visualization jupyter-notebooks
Last synced: 05 Oct 2025
https://github.com/jianxi-erin/bigdata-machinelearning-lab
本项目是一个综合性的大数据与机器学习实验平台,包含两个主要任务,每个任务涵盖三个关键技术模块:大数据处理、数据分析和机器学习。项目基于真实的竞赛设计,提供完整的数据处理模拟和建模实践。
data-analysis data-visualization hadoop machine-learning python spark sql
Last synced: 03 May 2026
https://github.com/mahmoud2abdallah/improvado-marketing-homework
This Looker Studio dashboard provides a comprehensive analysis of marketing performance for August 2024, transforming raw data into actionable insights for data-driven decision making.
bigquery business-intelligence data-analysis looker-studio marketing
Last synced: 05 Oct 2025
https://github.com/vishal786-commits/target-businesscasestudy-sql
This project analyzes Target’s e-commerce transactions in Brazil between 2016 and 2018 using SQL. The goal was to explore customer behavior, order patterns, payments, delivery times, and freight costs to generate actionable business insights.
Last synced: 05 Oct 2025
https://github.com/egbe34/sql-portfolio
SQL portfolio showcasing business-focused queries for KPIs, retention, churn, RFM, and Pareto analysis. Built with sample commerce data for analytics and BI use cases.
bigquery business-intelligence churn-analysis cohort-analysis data-analysis kpi postgresql rfmsegmentation sql windowfunction
Last synced: 19 May 2026
https://github.com/josepablodmg/python--linear-regression---housing-exercise
A predictive analysis exploring the relationship between household characteristics and median income in California. Using linear regression, the project investigates whether blocks with fewer households correspond to higher median incomes.
california data-analysis data-science exploratory-data-analysis housing-data linear-regression machine-learning python regression scikit-learn statistics visualization
Last synced: 05 Oct 2025
https://github.com/trismald/eurosoccer1023
Data Analyst - European Soccer 2010 2023
data-analysis data-visualization jupyter-notebook pandas powerbi python
Last synced: 06 May 2026
https://github.com/jimartskenya/ai-code-context
🤖 Automate code documentation with AI to enhance understanding and streamline your workflow, saving time on unfamiliar codebases and projects.
ai claude-code codebase-analysis context-management data-analysis dependency-analysis gemini intellij-plugin jupyterlab-extension llm-integration machine-learning mcp-server open-source pandas prompt-engineering streamlit-component token-reduction vibe-coding
Last synced: 08 May 2026
https://github.com/subhamghimire/dataanavis
Learning Data analysis and visualization
data-analysis data-science data-visualization dataset
Last synced: 06 Oct 2025
https://github.com/farhad-here/height-distribution-analysis
Statistical comparison of height distributions in two groups using mean, standard deviation, and boxplots.
coefficient-of-variation data-analysis interquartile-ranges matplotlib mean numpy python scipy standard-deviation variance
Last synced: 13 Apr 2026
https://github.com/davifeliciano/modern_physics_experiments
Collection of data analysis and visualization scripts developed in Python around some modern physics experiments
data-analysis data-visualization modern-physics physics physics-experiments
Last synced: 18 Jan 2026
https://github.com/swatisinghit/treadmill-customer-profiling-for-aerofit
Create comprehensive customer profiles for each AeroFit treadmill product through descriptive analytics. Develop two-way contingency tables and analyze conditional and marginal probabilities to discern customer characteristics, facilitating improved product recommendations and informed business decisions.
analytics conditional-probability data-analysis data-science data-visualization eda numpy pandas probability statistics
Last synced: 08 May 2026
https://github.com/myles/notebooks
Some of my random Jupyter Notebooks.
data-analysis data-science jupyter-notebooks
Last synced: 18 Jan 2026
https://github.com/surbhi242singh/pizza_sales_project
Used SQL to analyze pizza sales data
data-analysis mysql pizza-sales sql
Last synced: 07 Oct 2025
https://github.com/nuriadevs/informes-powerbi
Este repositorio contiene informes elaborados con Power BI.
Last synced: 18 Feb 2026
https://github.com/marcus-v-freitas/media_movel_covid19
Estudo de série temporal com média móvel de casos e óbitos de Covid19 no munícipio de São Paulo
covid-19 data-analysis data-science government-data jupyter-notebook moving-average plotly python sao-paulo temporal-series
Last synced: 17 Jan 2026
https://github.com/michaelcurrin/yahoo-finance-reports
Use the Yahoo Finance API to get info on shares of interest and report on them
data-analysis data-science python reporting shares stock-market yahoo-finance yahoo-finance-api
Last synced: 07 Oct 2025
https://github.com/thejvdev/ml-from-scratch
Repository for Implementing ML Models from Scratch in Python
classification data-analysis data-mining data-science deep-learning jupyter-notebook machine-learning matplotlib neural-networks numpy pandas prediction python regression seaborn sklearn visualization
Last synced: 02 Apr 2026
https://github.com/prarthana-singh/bangalore-house-price-predictor
🏡 Bangalore House Price Prediction – A Machine Learning model to predict house prices in Bangalore using real estate data. Built with Linear Regression, Python, Pandas, NumPy, and Scikit-Learn.
data-analysis eda house-price-prediction linear-regression machine-learning numpy pandas python real-estate regression scikit-learn
Last synced: 19 Apr 2026
https://github.com/prajjwol09/sql_retail_analysis_project
This project demonstrates SQL-based data cleaning, exploration, and business analysis on a retail sales dataset. It involves setting up a database, removing null values, performing EDA, and using SQL queries to extract key insights such as top customers, best-selling categories, and monthly sales trends.
data data-analysis datacleaning dataexploration pgadmin4 sql
Last synced: 15 Feb 2026
https://github.com/roydevashish/algo8.ai-data-manipulation-assignment
This assignment performs transaction-level sales data analysis and generates reports using Pandas / SQL / Spark inside a containerized environment. The dataset contains sales transaction records and is used to analyze SKUs, customers, and sales representative performance.
data-analysis duckdb python3 sql uv
Last synced: 15 May 2026
https://github.com/rusiru-erandaka/pupil-dilation-signal-classification-pipeline-with-noise-filtering-feature-extraction
In this repository I have worked on Pupil Diameter Time series Dataset. here I have worked on data sampling, Blink detection and Noise Handling, Stimulus Onset Alignment & Ensemble Averaging, Baseline correction, Feature Extraction and finally create a Patient classification ML pipeliner
anomaly-detection classification-pipeline data-analysis data-preprocessing data-science time-series
Last synced: 08 Oct 2025
https://github.com/npodlozhniy/podlozhnyy-module
One place for the most useful methods for work
data-analysis data-science pypi
Last synced: 21 Jan 2026
https://github.com/dcs-training/machinelearning
Introduction to Machine Learning with Python delivered by the centre in the year 2022-23. Go to the read me file
data-analysis data-wrangling machine-learning python statistics
Last synced: 08 Oct 2025
https://github.com/aymanmomin/excel-coffee-data-analytics-exploring-coffee-orders-dataset
This project utilizes a coffee orders dataset to perform comprehensive data analytics and gain insights into customer preferences, popular items, and sales trends. The analysis aims to provide valuable information for coffee shop owners and enthusiasts, facilitating data-driven decision-making and improved customer satisfaction.
data-analysis data-visualization excel project
Last synced: 18 Jan 2026
https://github.com/maccccd/sql-proficiency-journey
A technical journey of my SQL understanding.
data-analysis sql systems-analysis-and-design uml-class-diagram
Last synced: 15 Feb 2026
https://github.com/ak-abhilash/super-market-sales-data-analysis-and-forecasting-using-power-bi
Power BI project to visualize sales data of a supermarket.
dashboard data-analysis powerbi salesdata visualization
Last synced: 05 Feb 2026
https://github.com/pranavsp108/market_basket_analysis-instacart
Customer segmentation and market basket analysis using the Instacart dataset with Python, Pandas, and K-Means clustering.
customer-segmentation-and-buying-behavior data-analysis data-visualization instacart jupyter-notebook kmeans-clustering market-basket-analysis pandas python scikit-learn
Last synced: 05 May 2026
https://github.com/ndomah1/learning-probability-and-statistics
This repo is a comprehensive learning resource that covers fundamental to advanced topics in probability and statistics, including probability theory, descriptive and inferential statistics, probability distributions, regression analysis, and data exploration techniques.
correlation-analysis data-analysis descriptive-statistics exploratory-data-analysis hypothesis-testing inferential-statistics probability regression statistics
Last synced: 18 Jan 2026
https://github.com/alexquilis1/spanish-fuel-stations-analysis
Real-time analysis of Spanish fuel prices using government API data with interactive maps and regional comparisons
data-analysis data-visualization fuel-prices geospatial-analysis ggplot2 government-data leaflet open-data r shiny spain tidyverse
Last synced: 08 Oct 2025
https://github.com/dcs-training/exploratory-data-analysis-and-visualisation-with-observable-plot
This two-hour workshop will teach you how to follow an exploratory data analysis pipeline with Observable Plot, a new JavaScript library based on the Grammar of Graphics, that proposes a simple yet expressive interface to create powerful graphics easily shareable on the web. Go to the Readme file
d3 data-analysis data-visualisation javascript observable-notebook
Last synced: 17 May 2026
https://github.com/tyriek-cloud/power-bi-nyc-housing-financial-report
This report was conducted to provide a comprehensive analysis of various NYC housing and financial data.
dashboard data-analysis data-visualization financial-analysis powerbi statistics
Last synced: 21 Jan 2026
https://github.com/sarvesh2304/stellarator_simulation
A comprehensive Julia package for stellarator fusion reactor physics analysis featuring 3D magnetic field calculations, neoclassical transport modelling, quasi-isodynamic optimisation algorithms, and interactive 3D visualisations. Includes tokamak comparison framework and high-resolution plotting capabilities for fusion research.
3d-visualisation data-analysis field-line-tracing fusion-physics fusion-research interactive-3d julia magnetic-confinement magnetic-field-calculations magnetic-surfaces matplotlib neoclassical-transport numerical-methods optimisations physics-simulation plasma-physics plotly quasi-isodynamic stellarator stellarator-optimization
Last synced: 09 Oct 2025
https://github.com/jlee9503/telecommunication-churn
Analyze key factors influencing customer churn using Python data analytics technique. Explore key factors through data preprocessing, exploratory data analysis (EDA), and predictive modeling.
data-analysis data-visualization matplotlib pandas python scikit-learn
Last synced: 18 Jan 2026
https://github.com/dhruvalbhinsara1/influencer-and-platform-data-analysis
Exploratory analysis of synthetic influencer marketing data — engagement, revenue, and ROI.
campaign-analytics data-analysis data-analysis-project data-analysis-python eda influencer-marketing jupyter-notebook jupyter-notebooks marketing-analytics matplotlib pandas python seaborn synthetic-data visualization
Last synced: 09 May 2026
https://github.com/faisal-khann/ipl-analysis
The IPL Analysis project is a comprehensive data-driven exploration of the Indian Premier League (IPL), analyzing historical match data to uncover patterns in team performance, player statistics, and match outcomes.
data-analysis exploratory-data-analysis jupyter-notebook matplotlib numpy pandas seaborn
Last synced: 08 May 2026
https://github.com/l1ght14/tradersentiment_primetrade
Analyzes Bitcoin market sentiment's impact on Hyperliquid trader PnL & behavior. Uncovers patterns using Python (Pandas, Seaborn) to derive actionable trading insights. Junior Data Scientist assignment for PrimeTrade
bitcoin crypto-trading cryptocurrency data-analysis financial-data-analysis jupyter-notebook market-sentiment pandas python trader-behavior web3
Last synced: 20 Oct 2025
https://github.com/izzyl3333/mosquito_analysis
An exercise using Python and statistical analysis in mosquito data to understand the relationship between the different variables and the mosquito number.
chicago data-analysis data-science exploratory-data-analysis mosquitoes python statistical-analysis west-nile-virus
Last synced: 19 Jan 2026
https://github.com/jhaayush2004/churncast
Fusion of deep Data Science, Machine Learning and MLOps...
aws data-analysis data-science data-visualization deep-neural-networks docker machine-learning mlops-workflow
Last synced: 09 Oct 2025
https://github.com/marianamartiyns/api-logisticregression
Data analysis, modeling, and deployment of a logistic regression model for churn prediction, integrating a FastAPI backend and a Streamlit frontend.
data-analysis data-science fastapi logistic-regression pyhton streamlit
Last synced: 29 Apr 2026
https://github.com/takshshah-16/pizza_sales_sql
SQL-powered pizza sales analytics project using MySQL Workbench to derive business insights through data exploration and queries.
business-intelligence data-analysis database-management mysql sql
Last synced: 09 Oct 2025
https://github.com/debjyotisaha/sql-projects
Designed and implemented SQL-based projects to analyse and manage datasets efficiently. Demonstrated expertise in writing complex queries, optimizing database performance, and performing data extraction, transformation, and loading (ETL) processes.
Last synced: 09 Oct 2025
https://github.com/alokthedataguy/financial-friend-web-app
Financial Friend is a privacy-first web app that takes a user’s payment statement (PhonePe, GPay, bank CSV/PDF), cleans and understands it, and then talks back like a friend—giving simple, human answers (plus a few tiny visuals) to questions people actually care about.
data-analysis data-science data-visualization fastapi finance-management financial-analysis financial-data insights personal-finance-and-data-anlaysis python react
Last synced: 14 Apr 2026
https://github.com/sillyash/untappd-viz
A data visualisation page using public datasets and HTML/CSS/JS with D3.js.
beer beer-statistics data data-analysis data-visualization kaggle kaggle-dataset public-dataset school-project
Last synced: 18 May 2026
https://github.com/adithya2369/safa_public
AI-powered customer feedback analyzer that uses generative AI to transform customer reviews into actionable business insights. Upload review data, get instant summaries, satisfaction scores, detailed reports, and improvement suggestions—all in an easy-to-deploy Docker container.
data-analysis data-visualization docker-containerization full-stack-development generative-ai langchain langchain-groq web-development
Last synced: 10 Oct 2025
https://github.com/samuelsoaress/wkd-default-reduction
reduction of default from 35% to 25% or less with machine learning techniques
data-analysis data-exploration data-science machine-learning-algorithms
Last synced: 10 Oct 2025
https://github.com/sabdikay/analysis-of-biodiversity
This project analyzes biodiversity data from the National Parks Service, focusing on species in various park locations. Conducted in Jupyter Notebook, it uses pandas, matplotlib, NumPy, seaborn, and chi2_contingency for analysis and visualization.
data-analysis data-analysis-python data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 14 Apr 2026
https://github.com/loaiwalid07/automation_data_overviwe
This is Streamlit app that gives an overview for a dataset you upload
automation data data-analysis data-exploration data-science data-transformation data-visualization
Last synced: 19 May 2026
https://github.com/anandu-jpg/coffee-shop-sales-analysis
This project analyzes coffee shop sales data to identify trends, patterns, and insights that can help improve operations, boost revenue, and enhance the customer experience.
business-intelligence data-analysis data-visualization exploratory-data-analysis jupyter-notebook pandas phyton
Last synced: 18 May 2026
https://github.com/filipe-rds/bi-atividade-1
Atividade de análise de dados para a disciplina de Inteligência Empresarial
data-analysis jupyter-notebook python
Last synced: 15 May 2026
https://github.com/sharvesh1401/battsense
BattSense is a machine learning project focused on predicting the State of Health (SOH) of lithium-ion batteries using operational parameters such as voltage, current, temperature, and capacity. The model enables accurate, data-driven diagnostics for battery performance monitoring in electric vehicles and portable devices.
battery-diagnostics battery-health battery-health-prediction battery-soh data-analysis electric-vehicles energy-storage machine-learning predictive-maintenance python regression scikit-learn
Last synced: 07 May 2026
https://github.com/its-ekanshi/sql-analytics-project
Designed relational tables with primary and foreign keys, populated with sample data for real-world testing. Implemented advanced SQL techniques such as CTEs, window functions, aggregates, and filters to extract valuable insights.
business-intelligence data-analysis exploratory-data-analysis microsoft-sql-server sql sql-queries
Last synced: 10 Oct 2025
https://github.com/amirreza81/kaggle-pandas-course-solutions
Kaggle Pandas Course - Solved exercises in another way of sample solution
data data-analysis data-cleaning data-manipulation data-science dataframe jupyter-notebook kaggle machine-learning open-source pandas
Last synced: 14 Apr 2026
https://github.com/salma-mamdoh/a-visual-history-of-nobel-prize-winners-project
My project aims to practice Data Analysis and Data Visualization on DataCamp
data-analysis data-visualization datacamp matplotlib pandas python seaborn
Last synced: 04 May 2026
https://github.com/imrandil/excel_learning_dir
Excel learning practice with some data, the doing
Last synced: 27 Jan 2026
https://github.com/cyberoctane29/diamonds-anova-analysis
This project uses ANOVA in Python to analyze how diamond color and cut affect pricing. By testing for statistical significance and running post hoc comparisons, it reveals key pricing patterns. Built with pandas, statsmodels, and Seaborn, the findings help inform diamond valuation and purchasing decisions.
anova-test data-analysis data-analytics data-science diamonds-dataset regression-analysis statistical-analysis tukey-hsd
Last synced: 10 Oct 2025
https://github.com/scarlet-enlight/ml_project
Comparison of different classifiers (KNN, Naive Bayes, Decision Tree) on Sleep Health and Lifestyle Dataset
data-analysis machine-learning
Last synced: 13 Mar 2026
https://github.com/jrdnbradford/the-office-us
Data concerning NBC's mockumentary series The Office (U.S. version)
csv data-analysis json the-office xml
Last synced: 19 Jan 2026
https://github.com/mysto-007/dog-vision-a-dog-breed-recognizer-kaggle-competition
A solution for Dog Breed Identification on Kaggle competition
colab-notebook data-analysis data-science data-visualization jupyter-notebook kaggle kaggle-competition python
Last synced: 09 May 2026
https://github.com/priyanshubiswas-tech/deloitte-daikibo-telemetry-analysis-task-1
Tableau dashboard analyzing Daikibo telemetry data. Tracks downtime by factory/device with interactive filters. Deloitte task solution with JSON processing.
data-analysis data-visualization deloitte json tableau tableau-public
Last synced: 11 Oct 2025
https://github.com/jiwookseo/natural_language_analysis
api sample for google natural language and ECOS(한국은행 경제통제시스템)
data-analysis google-natural-language-api text-analysis
Last synced: 11 Oct 2025
https://github.com/saifalibaig/covid-19-death-rate-analysis-using-python
Analysis of Covid-19 data along with the world happiness report to identify if there is any relationship between death rate and happiness rate of countries all over the world.
data-analysis data-visualization numpy pandas python3 sns visualization
Last synced: 03 May 2026
https://github.com/azaz9026/email-spam-detection
Welcome to the Email Spam Detection project! This repository provides a machine learning model for detecting spam emails using a Naive Bayes classifier and a simple web interface built with Streamlit.
data-analysis data-cleaning data-structures data-visualization deep-learning machine-learning python sql streamlit
Last synced: 14 Apr 2026
https://github.com/mouadtaoussi/capmpingi-employee-reviews
Analysis of Capmpingi employee reviews using Python/Pandas and Power BI
data-analysis data-science kaggle pandas powerbi python python3
Last synced: 14 Apr 2026