Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-01 00:07:23 UTC
- JSON Representation
https://github.com/anushkundu/student-performance-analysis
Exploring Student Performance Factors
classification-algorithm clustering-algorithm data-analysis data-science exploratory-data-analysis machine-learning matplotlib numpy pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/giyanellow/time-series-analysis-on-philippine-debt-and-inflation
A Time Series Analysis on the Philippine Inflation Rate with some predictions using RandomForest.
data-analysis data-analysis-python machine-learning python random-forest
Last synced: 18 Mar 2026
https://github.com/saro0307/exploratory-data-analysis-terrorism
Phase 1 of Data Science project (program) to perform Exploratory Data Analysis on Terrorism using Python On Google Colab for Coderscave Internship sept 2023
colaboratory data-analysis datascience machine-learning numpy pandas python seaborn skit-learn visualization
Last synced: 13 Apr 2026
https://github.com/tillbiskup/trepr
A Python package based on the ASpecD framework for handling TREPR data.
data-analysis data-processing electron-paramagnetic-resonance reproducible-research reproducible-science spectroscopy time-resolved
Last synced: 06 Sep 2025
https://github.com/tatilimongi/first_python_project
Este repositório contém um estudo de caso de automação de planilhas em Python para análise de vendas de carros por fabricante ao longo dos anos
data-analysis email-sending file-manipulation graphical-visualization spreadsheet-automation
Last synced: 26 Mar 2025
https://github.com/fer-aguirre/covid19-venezuela
Análisis de datos de muertes por covid-19 en Venezuela
covid-19 data-analysis dataviz line-chart
Last synced: 09 Apr 2025
https://github.com/fbarffmann/vba-challenge
Built an Excel VBA script to automate stock market analysis across multiple years. Programmatically calculated and visualized key financial metrics, reducing manual reporting time and improving data accuracy.
automation data-analysis excel excel-vba financial-analysis reporting stock-market vba
Last synced: 04 Feb 2026
https://github.com/sadia-khan13/data-preprocessing
Welcome to the Data preprocessing Repository! This repository is dedicated to showcase the comprehensive resources and implementations related to Data Preprocessing using Python and Jupyter Notebook.
artificial-intelligence data-analysis data-mining data-preprocessing data-science jupyter-notebook matplotlib numpy pandas python seaborn-python sklearn
Last synced: 11 Apr 2026
https://github.com/weisswuerste/polars-eurovision-analytics
Analytics example using both the Pandas and Polars libraries
data-analysis data-analytics pandas polars python python-3 python3
Last synced: 08 May 2026
https://github.com/abhisek-13/whatsapp-chat-analyzer
The WhatsApp Chat Analyzer is a data analysis project that provides insights into WhatsApp chats. It analyzes chat data to show metrics like the number of lines, most used letter, chatting duration, media files shared, most used emojis, and group member activity. The results are displayed on a user-friendly dashboard built with Streamlit.
data-analysis data-mining data-visualization eda machine-learning machine-learning-algorithms matplotlib numpy pandas python seaborn sklearn
Last synced: 13 Apr 2026
https://github.com/fer-aguirre/cookiecutter-data-analysis-lite
A cookiecutter template for data journalism projects that offers a simplified and beginner-friendly structure.
cookiecutter data-analysis data-journalism project-template python
Last synced: 14 Jun 2025
https://github.com/isaqueiros/newspapersales-predictions-linearregression_and_regularisation
This notebook is a study on the sales of newspapers of a local stand, with intention to predict the newspaper sales performance based on the different features available. For this, 4 sklearn models are applied: Linear Regression, Lasso Regression, Ridge Regression and Elastic Net Regression.
data-analysis data-science linear-regression machine-learning python regularization-methods sklearn-library sklearn-linear-regression
Last synced: 02 May 2026
https://github.com/jakubteichman/bullbozer_price_prediction_ml_project
A bulldozer price estimatior from Kaggle competition dataset
data-analysis data-science estimation machine-learning prediction
Last synced: 06 Sep 2025
https://github.com/wsu-carbon-lab/ezfit
Fitting in python made dead simple
data-analysis experimental-physics fitting pandas-accessor
Last synced: 14 Jun 2025
https://github.com/hyperentangledqubit/shellplot
shellplot -- Generate plot(s) directly from terminal via matplotlib or ggplot2 (plotnine)!
data-analysis ggplot2 graphics matplotlib plotnine plotting pyplot terminal
Last synced: 10 May 2026
https://github.com/shruthin4/news-articles-classification
Classifying News Articles using Machine Learning and NLP techniques.. Built an end-to-end text classification pipeline using TF-IDF vectorization and models like Logistic Regression and SVM. Includes exploratory data analysis, model evaluation, and deployment-ready artifacts.
data-analysis data-science logistic-regression machine-learning model news-classification nlp python scikit-learn svm tf-idf-vectorization
Last synced: 13 Apr 2026
https://github.com/grandechowhiskey/fcc-data_analysis-projects
A collection of projects completed as part of the FreeCodeCamp "Data Analysis with Python" certification. These projects cover statistical calculations, data visualization, and trend analysis using real-world datasets.
data-analysis data-visualization matplotlib pandas python3 scikit-learn seaborn
Last synced: 01 May 2026
https://github.com/quocduyenanhnguyen/california-crime-data-analysis
I analyzed crime incident-based data in California in the year 2022. I used SQL for analysis and Tableau for visualization.
2022 california crime-data dashboard data-analysis data-analytics data-manipulation data-modeling data-visualisation data-visualization database fbi mysql mysql-workbench nibrs sql tableau tableau-dashboards tableau-public
Last synced: 29 Apr 2026
https://github.com/srinibas-masanta/electric-vehicle-analysis-dashboard
This repository features an interactive Tableau dashboard that visualizes electric vehicle (EV) adoption trends in the U.S. 🚗⚡ Explore EV growth, top manufacturers, regional distribution, and the impact of incentives—all in one dynamic view. 📊 Use filters to dive deeper into the data and uncover key insights! 🚀
dashboards data-analysis data-visualization tableau
Last synced: 15 Jan 2026
https://github.com/srinibas-masanta/olympics-data-analysis
The Olympics Analysis project explores Olympic data to uncover trends in athlete performance, medal distribution, and participation across countries and demographics. By leveraging detailed datasets, it provides insights into the evolution of the Games, highlighting key patterns and disparities over time.
data-analysis data-science data-visualization olympics olympics-visualization
Last synced: 02 Apr 2025
https://github.com/sehgal-vishal/ev-vehicle-market-analysis-dashboard
This Dashboard is related to EV vehicles adoption
clean-energy data-analysis data-visualization electricvehicles future-technologies
Last synced: 04 Mar 2026
https://github.com/deypadma2020/dataanalysis-mlalgo
Practice repository for data analysis, feature engineering, statistics, web scraping, and building ML model pipelines in Python.
data-analysis eda feature-engineering machine-learning-algorithms ml-pipeline statistics web-scraping
Last synced: 30 May 2026
https://github.com/greatwoman23/hotel_reservation_analysis
In this project, we delve into the intricate world of hotel reservations, utilizing a multifaceted analytical approach to uncover valuable insights. Through a combination of SQL queries and Tableau visualizations, we meticulously dissect a rich dataset comprising booking details, customer demographics, and reservation statuses.
data-analysis data-science data-visualization hotel hotel-reservation publications sql sql-query sqlite3 tableau
Last synced: 15 May 2026
https://github.com/codeslash21/wrangle-twitter-archive
Wrangle Twitter Archive WeRateDog. WeRateDog has 8M followers and they rate the dogs with funny comments and unique rating system. Also use dog-breed classifier to predict dog's breed in the tweets.
data-analysis data-wrangling neural-networkt twitter-api twitter-archive
Last synced: 10 Apr 2025
https://github.com/codeslash21/wrangle_twitter_archive
Wrangle Twitter Archive WeRateDog. WeRateDog has 8M followers and they rate the dogs with funny comments and unique rating system. Also use dog-breed classifier to predict dog's breed in the tweets.
data-analysis data-wrangling nanodegree-project neural-network twitter-api twitter-archive
Last synced: 10 Apr 2025
https://github.com/spacebakery/nba-trends-project
Data Science Foundations I | Exploratory Data Analysis in Python | Summarizing Relationship Between Two Features
categorical-variables data-analysis data-visualization matplotlib nba-dataset quantitative-variables scipy seaborn subset summary-statistics
Last synced: 11 Mar 2025
https://github.com/samruddhi3012/public-health-data-analysis
Hi! This repo involves analyzing the Healthcare analytics using Advanced Microsoft Excel.
dashboard data-analysis data-visualization healthcare microsoft-excel pivot-chart pivot-tables vlookup
Last synced: 05 Feb 2026
https://github.com/karthikmprakash/karthikmprakash
Karthik's Portfolio
bs4 data-analysis data-science keras machine-learning numpy pandas portfolio python selenium skills streamlit
Last synced: 13 Apr 2026
https://github.com/ankitpoddar07/sqlpizzas-saleproject
🍕 Pizza Sales Analysis with SQL
data-analysis database excel mysql powerbi ppt python
Last synced: 09 May 2026
https://github.com/satyam4229/prediction-of-cement-compressive-strength
Prediction of cement compressive strength is a model which is based on Regression model, Here we predict that how much is the compressive strength of the particular cement has with variety of mixtures of its component.
data-analysis data-science data-visualization jupyter-notebook kaggle python
Last synced: 13 Apr 2026
https://github.com/sadratehranian/data-collection-and-machine-learning
create a model using logistic regression to predict whether the fire alarm of a smoke detector should sound or not. Second, predicts whether an electric drive in a production plant may be faulty or not.
data data-analysis data-science datacollection logistic-regression machine-learning ml nn
Last synced: 05 Jan 2026
https://github.com/a-iceberg/whisper_model_evaluator
WER, MER, WIL of Whisper vs Vosk vs Google transcribators comparator
asr audio-to-text automatic-speech-recognition data-analysis evaluation google-speech-recognition python tuning-parameters visualization vosk whisper
Last synced: 11 Mar 2025
https://github.com/robinmillford/cardiac-care-performance-dashboard
This project presents a comprehensive data analysis and interactive dashboard focused on Cardiac Surgery and Percutaneous Coronary Interventions (PCI) performance by hospital, spanning from 2008 onwards.
cardiac data-analysis data-visualization plotly-express streamlit-dashboard tableau tableau-public
Last synced: 07 Sep 2025
https://github.com/dcs-training/introtostatistics
This is a repository which contains all the materials to be used in the introduction to statistics course. Go to the readme file
data-analysis r rmarkdown statistics
Last synced: 26 Mar 2025
https://github.com/firetyrant/sql-portfolio-projects
Documenting my SQL learning journey with hands-on projects focused on data cleaning, analysis, and optimization.
bigquery data-analysis databases etl learning portfolio query-optimization sql
Last synced: 19 Apr 2026
https://github.com/esther-poniatowski/multitask-context-dependent-behavior
Data analysis of neuronal recordings in naive and trained animals performing multiple tasks in active and passive attentional states
cognitive-neuroscience computational-neuroscience data-analysis data-visualization information-processing
Last synced: 26 Mar 2025
https://github.com/mattholy/haka
HaKa is an out-of-the-box tool system designed for data engineers and data analysts in medium-sized enterprises. It is easy to deploy and scale.
celery data-analysis data-engineering fastapi python uvicorn-gunicorn
Last synced: 19 May 2026
https://github.com/scailfin/rob-client
Command line user interface for the Reproducible Open Benchmarks for Data Analysis Platform (ROB)
benchmarks data-analysis reproducibility
Last synced: 14 Jan 2026
https://github.com/samruddhi3012/rfm-sales-analysis
Hi there! In this project I have performed Sales Analysis (RFM Analysis) using SQL and Tableau.
data-analysis data-visualization mssqlserver rfm-analysis segmentation tableau
Last synced: 12 Mar 2025
https://github.com/manditacaos/hypefemme-analise-vendas
Projeto de análise de dados e visualização no Power BI da loja fictícia Hype Femme.
data-analysis jupyter-notebook portfolio powerbi python
Last synced: 10 Apr 2025
https://github.com/subratamondal1/heart-attack-prediction
Heart Attack Prediction of patients based on the required data. Data Ingestion - Data Preparation - Exploratory Data Analysis (EDA) - Modelling - Evaluation.
data-analysis data-science data-visualization kaggle-dataset machine-learning matplotlib-pyplot numpy pandas python3 scikit-learn seaborn
Last synced: 09 Apr 2026
https://github.com/aalekhpatel07/statcan
StatCAN dataset fetcher and cleaner.
census data-analysis data-science statcan
Last synced: 02 Apr 2025
https://github.com/deliprofesor/k-means-clustering-for-retail-data-analysis
This project uses K-Means clustering to segment wholesale customers based on their spending habits. The data is preprocessed, scaled, and clustered into four groups. The Elbow and Silhouette methods determine the optimal number of clusters, and results are visualized using boxplots and scatter plots to uncover spending patterns.
clustering-visualisation data-analysis elbow-method k-means k-means-clustering r silhouette-score
Last synced: 10 Apr 2025
https://github.com/georgehanymilad/sales-and-profit-analysis-using-excel
Excel Project for Data Analysis
dashboard data-analysis data-visualization excel excel-dashboard interactivedashboard pivot-tables pivotcharts profit sales-analysis visuzalization
Last synced: 05 Feb 2026
https://github.com/luciocolonna/cyclistic-bikesharing-2023
Case study on public data from Chicago's Divvy bikeshare, using R
bikesharing capstone-project cyclistic cyclistic-bikshare data-analysis data-visualization geojson ggplot2 google-data-analytics google-data-analytics-capstone-project google-data-analytics-professional leaflet r sf tidyverse
Last synced: 02 Apr 2025
https://github.com/prajjwol09/data-cleaning-project
This project is dedicated to cleaning, standardizing a dataset, dealing with null values from a CSV file named "layoffs" using MySQL, with MySQL Workbench as the workspace environment. The goal is to prepare the data for analysis.
cleaning-data columns data-analysis database duplicates mysql rows standard
Last synced: 20 Apr 2026
https://github.com/nero103/airbnb-destination
This is and end-to-end project to uncover the ideal destination based on listings and hosts. Strategy included: Data workflow-SQL analysis-Data modeling-Data Visualization-Findings
data-analysis data-modeling data-visualization etl etl-pipeline excel microsoft-sql-server powerpoint sql tableau
Last synced: 27 Mar 2026
https://github.com/jnnjenga/video-game-analysis
Performs exploratory data analysis on a video game dataset, examining titles, release dates, teams, ratings, genres, and user engagement metrics to uncover trends, popular genres, and developer insights.
data-analysis data-science dataset eda exploratory-data-analysis game-analysis games genres jupyter-notebook pandas python trends user-engagement video-game visualization
Last synced: 13 Apr 2026
https://github.com/codesaadumair/pandas_exercises_personal
Personalized enhancements to pandas exercises with comprehensive solutions and practical insights for mastering data analysis in Python.
data-analysis data-science pandas python
Last synced: 09 May 2026
https://github.com/elcarrillo/computational_bootcamp_material
Material for a Computational Bootcamp
bootcamp-project computational-physics data-analysis data-visualization jupyter-notebooks
Last synced: 05 Oct 2025
https://github.com/paphada1103/data-analysis-with-python
📊 Analyze data efficiently using Python’s top libraries. Learn to explore, clean, and visualize data for meaningful insights in your projects.
carpentries data-analysis data-carpentry data-visualisation dataframe-api dataset english hacktoberfest ibm jovian lsl machine-learning matplotlib programming python realtime social-sciences spark
Last synced: 09 May 2026
https://github.com/mahmoud2abdallah/improvado-marketing-homework
This Looker Studio dashboard provides a comprehensive analysis of marketing performance for August 2024, transforming raw data into actionable insights for data-driven decision making.
bigquery business-intelligence data-analysis looker-studio marketing
Last synced: 05 Oct 2025
https://github.com/ankitwalimbe/ecommerce-funnel-analysis
SQL-based analysis of the Olist e-commerce dataset — building an order funnel (purchase → approval → delivery) with breakdowns by payment type, product category, region, and monthly trend. Includes insights, CSV exports, and Tableau dashboard.
bigquery business-intelligence data-analysis ecommerce funnel-analysis sql tableau-public
Last synced: 05 Oct 2025
https://github.com/affan005-ai/tesla-stock-prediction
This project analyzes Tesla stock data and builds machine learning models to predict and classify stock movements. The analysis includes EDA, feature correlation, moving averages, and two models
data data-analysis data-science data-visualization-project eda machine-learning matplotlib pandas predictive-analytics predictive-modeling python scikit-learn
Last synced: 05 Oct 2025
https://github.com/trismald/eurosoccer1023
Data Analyst - European Soccer 2010 2023
data-analysis data-visualization jupyter-notebook pandas powerbi python
Last synced: 06 May 2026
https://github.com/manishkaa/google_data_analytics_capstone_case_study
This case study is a part of Google Data Analytics Capstone Project
bigquery data-analysis sql tableau
Last synced: 05 Oct 2025
https://github.com/marielachirinosr/analysis-urgencias-hospital-pitalito
This project involves analyzing emergency room admission data from the E.S.E Hospital Departamental de Pitalito using a star schema model.
bigquery data data-analysis etl-pipeline tableau
Last synced: 21 Jan 2026
https://github.com/farhad-here/height-distribution-analysis
Statistical comparison of height distributions in two groups using mean, standard deviation, and boxplots.
coefficient-of-variation data-analysis interquartile-ranges matplotlib mean numpy python scipy standard-deviation variance
Last synced: 13 Apr 2026
https://github.com/davifeliciano/modern_physics_experiments
Collection of data analysis and visualization scripts developed in Python around some modern physics experiments
data-analysis data-visualization modern-physics physics physics-experiments
Last synced: 18 Jan 2026
https://github.com/ilaxi/lomicontadores
data management tool in reference to number of actions per day in a year
data-analysis gdscript godot godot4 python
Last synced: 19 Apr 2026
https://github.com/nuriadevs/informes-powerbi
Este repositorio contiene informes elaborados con Power BI.
Last synced: 18 Feb 2026
https://github.com/prarthana-singh/bangalore-house-price-predictor
🏡 Bangalore House Price Prediction – A Machine Learning model to predict house prices in Bangalore using real estate data. Built with Linear Regression, Python, Pandas, NumPy, and Scikit-Learn.
data-analysis eda house-price-prediction linear-regression machine-learning numpy pandas python real-estate regression scikit-learn
Last synced: 19 Apr 2026
https://github.com/eharshit/end-to-end-vendor-insights
End-to-end analysis of vendor performance for wholesale/retail businesses, featuring data ingestion, cleaning, insights, and interactive Power BI dashboards.
analysis analysis-algorithms analytics dashboard data data-analysis datascience jupyter jupyter-notebook pandas powerbi powerbi-report retail wholesale
Last synced: 07 Oct 2025
https://github.com/omarsolieman/socialgiveawaydataanalysis
This project involved cleaning, analyzing, and processing data from an Instagram giveaway to ensure a fair and data-driven winner selection process. The primary goal was to automate the process of identifying valid entries, weighting them based on engagement (likes and multiple entries), and performing a post-giveaway analysis
data-analysis data-science data-visualization instagram scraping threejs
Last synced: 14 May 2026
https://github.com/aymanmomin/excel-coffee-data-analytics-exploring-coffee-orders-dataset
This project utilizes a coffee orders dataset to perform comprehensive data analytics and gain insights into customer preferences, popular items, and sales trends. The analysis aims to provide valuable information for coffee shop owners and enthusiasts, facilitating data-driven decision-making and improved customer satisfaction.
data-analysis data-visualization excel project
Last synced: 18 Jan 2026
https://github.com/maccccd/sql-proficiency-journey
A technical journey of my SQL understanding.
data-analysis sql systems-analysis-and-design uml-class-diagram
Last synced: 15 Feb 2026
https://github.com/ak-abhilash/super-market-sales-data-analysis-and-forecasting-using-power-bi
Power BI project to visualize sales data of a supermarket.
dashboard data-analysis powerbi salesdata visualization
Last synced: 05 Feb 2026
https://github.com/pranavsp108/market_basket_analysis-instacart
Customer segmentation and market basket analysis using the Instacart dataset with Python, Pandas, and K-Means clustering.
customer-segmentation-and-buying-behavior data-analysis data-visualization instacart jupyter-notebook kmeans-clustering market-basket-analysis pandas python scikit-learn
Last synced: 05 May 2026
https://github.com/bhaveshbhakta/student-performance-prediction-using-ml
Student Performance Prediction
data-analysis data-visualization linear-regression machine-learning student-performance-analysis student-performance-prediction
Last synced: 08 Oct 2025
https://github.com/inddrsingh/e-commerce_orders
ETL project, with Python for Data cleaning and MySQL for Data analysis
data-analysis etl-pipeline mysql python
Last synced: 18 Apr 2026
https://github.com/tyriek-cloud/power-bi-nyc-housing-financial-report
This report was conducted to provide a comprehensive analysis of various NYC housing and financial data.
dashboard data-analysis data-visualization financial-analysis powerbi statistics
Last synced: 21 Jan 2026
https://github.com/sarvesh2304/stellarator_simulation
A comprehensive Julia package for stellarator fusion reactor physics analysis featuring 3D magnetic field calculations, neoclassical transport modelling, quasi-isodynamic optimisation algorithms, and interactive 3D visualisations. Includes tokamak comparison framework and high-resolution plotting capabilities for fusion research.
3d-visualisation data-analysis field-line-tracing fusion-physics fusion-research interactive-3d julia magnetic-confinement magnetic-field-calculations magnetic-surfaces matplotlib neoclassical-transport numerical-methods optimisations physics-simulation plasma-physics plotly quasi-isodynamic stellarator stellarator-optimization
Last synced: 09 Oct 2025
https://github.com/jlee9503/telecommunication-churn
Analyze key factors influencing customer churn using Python data analytics technique. Explore key factors through data preprocessing, exploratory data analysis (EDA), and predictive modeling.
data-analysis data-visualization matplotlib pandas python scikit-learn
Last synced: 18 Jan 2026
https://github.com/ibromeat/market-orders-analysis
Data analysis of CRM market orders dataset
data-analysis jupyter-notebook machine-learning pandas python visualization
Last synced: 01 May 2026
https://github.com/amish5ingh/cricket-data-analytics-ipl
Data analysis and visualization of IPL 2022 matches using Python, Pandas, Matplotlib, and Seaborn. Includes insights on match outcomes, player performances, toss trends, and venue stats with 12+ charts.
data-analysis data-visualization ipl-data-analysis ipl-data-visualization jupiter-notebook matplotlib-pyplot numpy pandas python seaborn
Last synced: 09 May 2026
https://github.com/jhaayush2004/churncast
Fusion of deep Data Science, Machine Learning and MLOps...
aws data-analysis data-science data-visualization deep-neural-networks docker machine-learning mlops-workflow
Last synced: 09 Oct 2025
https://github.com/takshshah-16/pizza_sales_sql
SQL-powered pizza sales analytics project using MySQL Workbench to derive business insights through data exploration and queries.
business-intelligence data-analysis database-management mysql sql
Last synced: 09 Oct 2025
https://github.com/debjyotisaha/sql-projects
Designed and implemented SQL-based projects to analyse and manage datasets efficiently. Demonstrated expertise in writing complex queries, optimizing database performance, and performing data extraction, transformation, and loading (ETL) processes.
Last synced: 09 Oct 2025
https://github.com/sillyash/untappd-viz
A data visualisation page using public datasets and HTML/CSS/JS with D3.js.
beer beer-statistics data data-analysis data-visualization kaggle kaggle-dataset public-dataset school-project
Last synced: 18 May 2026
https://github.com/priyanshubiswas-tech/priyanshubiswas-tech
SWE-Data Engineer @ EDN | Kubeflow-MLOps | Kubernetes | Databricks | AWS EMR-Lambda-Glue, Eventbridge, SQS-SNS | OCI Multi-Cloud Architect Professional | GCP GA4 | Gen AI | IEEE Brand Amb. | Ex-Chair, PES | Ex-Sec, SB
apache-spark aws data-analysis data-engineering data-visualization dbt hadoop kubernetes python3 sql
Last synced: 21 Jan 2026
https://github.com/samuelsoaress/wkd-default-reduction
reduction of default from 35% to 25% or less with machine learning techniques
data-analysis data-exploration data-science machine-learning-algorithms
Last synced: 10 Oct 2025
https://github.com/brooks-code/toulouse-biblio-chronicle
Snapshot of Toulouse public library customer habits — cleaning raw, messy datasets of musical, cinematic, and literary checkouts; includes data-cleaning steps, analysis notebook revealing cultural tastes in the Pink City.
data-analysis data-cleaning data-cleaning-and-preprocessing data-quality exploratory-data-analysis jupyter-notebook library-data misaligned-data mojibake tutorial
Last synced: 10 Oct 2025
https://github.com/filipe-rds/bi-atividade-1
Atividade de análise de dados para a disciplina de Inteligência Empresarial
data-analysis jupyter-notebook python
Last synced: 15 May 2026
https://github.com/sharvesh1401/battsense
BattSense is a machine learning project focused on predicting the State of Health (SOH) of lithium-ion batteries using operational parameters such as voltage, current, temperature, and capacity. The model enables accurate, data-driven diagnostics for battery performance monitoring in electric vehicles and portable devices.
battery-diagnostics battery-health battery-health-prediction battery-soh data-analysis electric-vehicles energy-storage machine-learning predictive-maintenance python regression scikit-learn
Last synced: 07 May 2026
https://github.com/its-ekanshi/sql-analytics-project
Designed relational tables with primary and foreign keys, populated with sample data for real-world testing. Implemented advanced SQL techniques such as CTEs, window functions, aggregates, and filters to extract valuable insights.
business-intelligence data-analysis exploratory-data-analysis microsoft-sql-server sql sql-queries
Last synced: 10 Oct 2025
https://github.com/salma-mamdoh/a-visual-history-of-nobel-prize-winners-project
My project aims to practice Data Analysis and Data Visualization on DataCamp
data-analysis data-visualization datacamp matplotlib pandas python seaborn
Last synced: 04 May 2026
https://github.com/frankelavsky/security-dash-challenge
I had two 8 hour days to create a visualization dashboard for three datasets. Tab one: Voronoi overlay on line graph. Tab two: Data partitioning method keeps in-memory usage low. Tab three: deals with "Failed" vs "Successful" attempts as positive/negative barcharts over time. I used d3.js, require, MVC pattern, and vanilla js.
client-side complexity css3 d3 d3js dashboard data-analysis data-structures-algorithms data-visualization frontend-app html5 interactive-visualizations javascript modular network-analysis network-monitoring network-security security single-page-app visualization
Last synced: 14 Apr 2026
https://github.com/cyberoctane29/diamonds-anova-analysis
This project uses ANOVA in Python to analyze how diamond color and cut affect pricing. By testing for statistical significance and running post hoc comparisons, it reveals key pricing patterns. Built with pandas, statsmodels, and Seaborn, the findings help inform diamond valuation and purchasing decisions.
anova-test data-analysis data-analytics data-science diamonds-dataset regression-analysis statistical-analysis tukey-hsd
Last synced: 10 Oct 2025
https://github.com/scarlet-enlight/ml_project
Comparison of different classifiers (KNN, Naive Bayes, Decision Tree) on Sleep Health and Lifestyle Dataset
data-analysis machine-learning
Last synced: 13 Mar 2026
https://github.com/mysto-007/dog-vision-a-dog-breed-recognizer-kaggle-competition
A solution for Dog Breed Identification on Kaggle competition
colab-notebook data-analysis data-science data-visualization jupyter-notebook kaggle kaggle-competition python
Last synced: 09 May 2026
https://github.com/pranav016/exploratory-data-analysis-of-google-app-store-dataset
This is a data analysis done on the Google app store dataset to answer a few questions related to the data through data visualization techniques.
Last synced: 11 Oct 2025
https://github.com/montanaz0r/testing-if-mma-math-deduction-works-using-ufc-fighters-data
The probabilistic reasoning about phenomenon called MMA math using UFC fighters data and Python.
bayesian-inference data-analysis data-science graphviz jupyter-notebook pandas python scipy statistics
Last synced: 14 Apr 2026
https://github.com/pyrypp/koivunen-vastaanottoanalyysi
An analysis on warehouse goods receiving
business-intelligence data-analysis interactive-visualizations
Last synced: 11 Oct 2025
https://github.com/kianaasd93/sensors-
Data Analysis of wearable technologies autonomous systems sensor in physiotherapy, Conducted a comprehensive data analysis on Xsens MTx sensor data
classification data-analysis data-science jupyter jupyter-notebook knn machine-learning physiotherapy python sensor svm wearable-devices wearable-technology
Last synced: 19 Feb 2026
https://github.com/vinay-jose/territorial-sales-dashboard
EDA was carried out in the sales data of Atliq Technologies and a Dashboard was created in PowerBI to draw insights.
data-analysis data-visualization powerbi-desktop sql
Last synced: 11 Oct 2025
https://github.com/ahsankhizar5/titanic-eda-visualization
Exploratory Data Analysis and Visualization on the Titanic Dataset using Python, Pandas, Matplotlib, and Seaborn to uncover survival patterns.
data-analysis data-science data-visualization eda kaggle machine-learning matplotlib pandas python seaborn titanic-dataset
Last synced: 31 May 2026