Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-30 00:07:38 UTC
- JSON Representation
https://github.com/sillyash/untappd-viz
A data visualisation page using public datasets and HTML/CSS/JS with D3.js.
beer beer-statistics data data-analysis data-visualization kaggle kaggle-dataset public-dataset school-project
Last synced: 18 May 2026
https://github.com/priyanshubiswas-tech/priyanshubiswas-tech
SWE-Data Engineer @ EDN | Kubeflow-MLOps | Kubernetes | Databricks | AWS EMR-Lambda-Glue, Eventbridge, SQS-SNS | OCI Multi-Cloud Architect Professional | GCP GA4 | Gen AI | IEEE Brand Amb. | Ex-Chair, PES | Ex-Sec, SB
apache-spark aws data-analysis data-engineering data-visualization dbt hadoop kubernetes python3 sql
Last synced: 21 Jan 2026
https://github.com/adithya2369/safa_public
AI-powered customer feedback analyzer that uses generative AI to transform customer reviews into actionable business insights. Upload review data, get instant summaries, satisfaction scores, detailed reports, and improvement suggestions—all in an easy-to-deploy Docker container.
data-analysis data-visualization docker-containerization full-stack-development generative-ai langchain langchain-groq web-development
Last synced: 10 Oct 2025
https://github.com/ninadpatil09/hospital_emergency_room_analysis
This comprehensive analysis delves into the performance and characteristics of the hospital's emergency room over the past year. By scrutinizing key metrics and patient demographics, this study aims to provide valuable insights for optimizing patient care, resource allocation, and overall operational efficiency.
data-analysis tableau-public visualization
Last synced: 15 Feb 2026
https://github.com/atiqisrak/py
This repository houses the code and resources for the **100 Days of Python Challenge** – an intensive learning journey designed to propel you from beginner to a a confident Python programmer in just 100 days.
data-analysis data-science machine-learning python3
Last synced: 10 Oct 2025
https://github.com/sabdikay/analysis-of-biodiversity
This project analyzes biodiversity data from the National Parks Service, focusing on species in various park locations. Conducted in Jupyter Notebook, it uses pandas, matplotlib, NumPy, seaborn, and chi2_contingency for analysis and visualization.
data-analysis data-analysis-python data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 14 Apr 2026
https://github.com/first-coding/smart_analysis
Smart Analysis is an AI-powered data analysis tool that leverages large language models (LLMs) to generate SQL queries from natural language prompts. Upload CSV files, explore the data schema, and retrieve insights with ease. The system ensures error correction in SQL queries, delivering detailed reports and visualizations in a streamlined workflow
data-analysis llm openai prompt-engineering python
Last synced: 08 Mar 2025
https://github.com/ayushsiloiya619/spotify-song-analysis
Data Analytics with Python
data-analysis matplotlib-pyplot pandas-dataframe python3 seaborn
Last synced: 08 May 2026
https://github.com/busra-deveci/kaggle-iris_data_analysis
Exploratory data analysis and visualization of the Iris dataset using Python.
data-analysis iris-dataset kaggle pandas python seaborn visualization
Last synced: 30 Apr 2026
https://github.com/brooks-code/toulouse-biblio-chronicle
Snapshot of Toulouse public library customer habits — cleaning raw, messy datasets of musical, cinematic, and literary checkouts; includes data-cleaning steps, analysis notebook revealing cultural tastes in the Pink City.
data-analysis data-cleaning data-cleaning-and-preprocessing data-quality exploratory-data-analysis jupyter-notebook library-data misaligned-data mojibake tutorial
Last synced: 10 Oct 2025
https://github.com/anandu-jpg/coffee-shop-sales-analysis
This project analyzes coffee shop sales data to identify trends, patterns, and insights that can help improve operations, boost revenue, and enhance the customer experience.
business-intelligence data-analysis data-visualization exploratory-data-analysis jupyter-notebook pandas phyton
Last synced: 18 May 2026
https://github.com/mahmoudwal27/powerbi-projects-for-data-analysis
This project leverages Power BI for data visualization, DAX for custom calculations, and integrates SQL and Excel for data preprocessing, analysis, and reporting, enabling dynamic and interactive insights.
data-analysis data-analysis-project data-analytics-project project
Last synced: 07 Mar 2026
https://github.com/fbarffmann/credit-risk-classification
Classified 19,000+ loans as high-risk or healthy using logistic regression. Achieved 100% precision for healthy loans and 84% precision for high-risk loans.
classification credit-risk data-analysis logistic-regression machine-learning model-evaluation pandas python scikit-learn
Last synced: 30 Apr 2026
https://github.com/bhaskaracharjee/student-results-analysis
Analyzing student results to uncover insights
Last synced: 16 May 2025
https://github.com/sharvesh1401/battsense
BattSense is a machine learning project focused on predicting the State of Health (SOH) of lithium-ion batteries using operational parameters such as voltage, current, temperature, and capacity. The model enables accurate, data-driven diagnostics for battery performance monitoring in electric vehicles and portable devices.
battery-diagnostics battery-health battery-health-prediction battery-soh data-analysis electric-vehicles energy-storage machine-learning predictive-maintenance python regression scikit-learn
Last synced: 07 May 2026
https://github.com/badranalyst/e-commerce-customer-analysis-data-science-foundations-case-study
This case study explores e-commerce customer data through data exploration, pre-processing, and splitting. It includes model building and training to analyze customer behavior. Python libraries like Pandas, NumPy, Matplotlib, and Seaborn are used for the analysis and model development.
data-analysis data-science dataset eda exploratory-data-analysis machine-learning matplotlib ml model-building model-training numpy pandas pre-processing python seaborn
Last synced: 01 May 2026
https://github.com/suhailsallam/tips_dashboard
Dashboard using Python & Streamlit
dashboard data-analysis data-analytics data-science data-scientist data-visualization python streamlit streamlit-dashboard streamlit-webapp
Last synced: 21 Jan 2026
https://github.com/its-ekanshi/sql-analytics-project
Designed relational tables with primary and foreign keys, populated with sample data for real-world testing. Implemented advanced SQL techniques such as CTEs, window functions, aggregates, and filters to extract valuable insights.
business-intelligence data-analysis exploratory-data-analysis microsoft-sql-server sql sql-queries
Last synced: 10 Oct 2025
https://github.com/amirreza81/kaggle-pandas-course-solutions
Kaggle Pandas Course - Solved exercises in another way of sample solution
data data-analysis data-cleaning data-manipulation data-science dataframe jupyter-notebook kaggle machine-learning open-source pandas
Last synced: 14 Apr 2026
https://github.com/surayasumona/test_bowlers_analysis
Data Analysis with Python
data-analysis data-manipulation data-preprocessing numpy pandas
Last synced: 04 May 2026
https://github.com/salma-mamdoh/the-android-app-market-on-google-play-project
My project aims to practice Data Analysis and Data Visualization on DataCamp
data-analysis data-visualization datacamp jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 12 Apr 2026
https://github.com/imrandil/excel_learning_dir
Excel learning practice with some data, the doing
Last synced: 27 Jan 2026
https://github.com/frankelavsky/security-dash-challenge
I had two 8 hour days to create a visualization dashboard for three datasets. Tab one: Voronoi overlay on line graph. Tab two: Data partitioning method keeps in-memory usage low. Tab three: deals with "Failed" vs "Successful" attempts as positive/negative barcharts over time. I used d3.js, require, MVC pattern, and vanilla js.
client-side complexity css3 d3 d3js dashboard data-analysis data-structures-algorithms data-visualization frontend-app html5 interactive-visualizations javascript modular network-analysis network-monitoring network-security security single-page-app visualization
Last synced: 14 Apr 2026
https://github.com/cyberoctane29/diamonds-anova-analysis
This project uses ANOVA in Python to analyze how diamond color and cut affect pricing. By testing for statistical significance and running post hoc comparisons, it reveals key pricing patterns. Built with pandas, statsmodels, and Seaborn, the findings help inform diamond valuation and purchasing decisions.
anova-test data-analysis data-analytics data-science diamonds-dataset regression-analysis statistical-analysis tukey-hsd
Last synced: 10 Oct 2025
https://github.com/benami171/ml_knn_decision-trees
A ml implementation comparing Decision Trees and k-Nearest Neighbors (k-NN) algorithms for Iris flower classification. Features comprehensive analysis of different approaches including brute-force and entropy-based decision trees, along with k-NN using multiple distance metrics.
classification cross-validation data-analysis decision-trees iris-dataset k-nearest-neighbours machine-learning nearest-neighbors python
Last synced: 30 Jun 2025
https://github.com/kingflow-23/ai-related-article-detector
Create a simple system that determines whether an article is related to AI or not using web scraping, text representation, and a classifier.
data-analysis data-engineering data-science logistic-regression pca-analysis scraping selenium umap
Last synced: 04 May 2026
https://github.com/vedantshi/coffee-sales-dashboard
This project analyzes coffee sales data using Excel, featuring data cleaning, trend analysis, and an interactive dashboard. Key insights highlight top-performing products, regional sales trends, and seasonal patterns. Recommendations focus on marketing strategies and inventory optimization. Future plans include Power BI integration for visuals.
business-insights data-analysis data-visualization excel-dashboard pivot-tables sales-trends
Last synced: 05 Jan 2026
https://github.com/apfirebolt/numpy-and-pandas-examples
Some examples and sample datasets to learn numpy, pandas and other data science libraries in Python
data-analysis jupyter-notebook numpy pandas python
Last synced: 17 Apr 2026
https://github.com/takshshah-16/spotify_eda
Spotify data analytics and advanced querying
data-analysis eda pgadmin4 postgresql
Last synced: 30 Oct 2025
https://github.com/syarwinaaa09/investigating-netflix-movies
🎬 investigating netflix movie trends using python and pandas 📊
csv data-analysis matplotlib netflix pandas visualization
Last synced: 01 May 2026
https://github.com/jrdnbradford/the-office-us
Data concerning NBC's mockumentary series The Office (U.S. version)
csv data-analysis json the-office xml
Last synced: 19 Jan 2026
https://github.com/khushi-sabarad/adinsights_dashboard
AdInsights Dashboard: An interactive web dashboard built with Python (Flask, Pandas, Plotly) to visualize and analyze digital advertising performance. Allows filtering by gender, ad type, and location for detailed insights
ad-performance advertising dashboard data-analysis data-visualization flask pandas plotly python web-application
Last synced: 01 May 2026
https://github.com/pranav016/exploratory-data-analysis-of-google-app-store-dataset
This is a data analysis done on the Google app store dataset to answer a few questions related to the data through data visualization techniques.
Last synced: 11 Oct 2025
https://github.com/chanmeng666/advanced-neural-network-applications
Practical implementations of perceptron and linear neuron models for classification and regression, with mathematical analysis and visualizations in Jupyter notebooks.
classification data-analysis data-science educational gradient-descent jupyter-notebook linear-neuron machine-learning matplotlib neural-network neural-networks numpy perceptron python regression
Last synced: 03 May 2026
https://github.com/mindlessmuse666/titanic-data-visualization
Проект по визуализации данных о пассажирах Титаника с использованием библиотек Python Matplotlib, Seaborn и Plotly.
data-analysis data-visualization matplotlib pandas plotly python seaborn titanic
Last synced: 04 May 2026
https://github.com/falakrana/data-analysis-visualization
This repository showcases data analysis and visualization projects using Python and Tableau. It includes exploratory data analysis, interactive dashboards, and insightful visual stories derived from real-world datasets.
data-analysis data-visualization python tableau-public
Last synced: 01 May 2026
https://github.com/jiwookseo/natural_language_analysis
api sample for google natural language and ECOS(한국은행 경제통제시스템)
data-analysis google-natural-language-api text-analysis
Last synced: 11 Oct 2025
https://github.com/saifalibaig/covid-19-death-rate-analysis-using-python
Analysis of Covid-19 data along with the world happiness report to identify if there is any relationship between death rate and happiness rate of countries all over the world.
data-analysis data-visualization numpy pandas python3 sns visualization
Last synced: 03 May 2026
https://github.com/azaz9026/email-spam-detection
Welcome to the Email Spam Detection project! This repository provides a machine learning model for detecting spam emails using a Naive Bayes classifier and a simple web interface built with Streamlit.
data-analysis data-cleaning data-structures data-visualization deep-learning machine-learning python sql streamlit
Last synced: 14 Apr 2026
https://github.com/mouadtaoussi/capmpingi-employee-reviews
Analysis of Capmpingi employee reviews using Python/Pandas and Power BI
data-analysis data-science kaggle pandas powerbi python python3
Last synced: 14 Apr 2026
https://github.com/kianaasd93/sensors-
Data Analysis of wearable technologies autonomous systems sensor in physiotherapy, Conducted a comprehensive data analysis on Xsens MTx sensor data
classification data-analysis data-science jupyter jupyter-notebook knn machine-learning physiotherapy python sensor svm wearable-devices wearable-technology
Last synced: 19 Feb 2026
https://github.com/vineet416/eda-hr-analytics
EDA on HR-Analytics by PW Skills Data Analytics course
data-analysis data-analysis-python data-analytics data-preprocessing data-processing data-visualization exploratory-data-analysis jupyter-notebook matplotlib-pyplot numpy pandas python seaborn statistical-analysis
Last synced: 14 Apr 2026
https://github.com/dhruvil-26/tableau-projects
This repository contains Tableau visualization projects focused on data analysis across different domains. Projects include: 1. IPL Visualization - Insights into IPL match, Team and player statistics. 2. EV Analysis - Visualizations exploring the adoption of electric vehicles. 3. Road Accident Analysis - Analysis of road accident patterns
analysis data data-analysis data-analytics electric-vehicles ipl road-accident-analysis tableau tableau-public
Last synced: 19 Jan 2026
https://github.com/vinay-jose/territorial-sales-dashboard
EDA was carried out in the sales data of Atliq Technologies and a Dashboard was created in PowerBI to draw insights.
data-analysis data-visualization powerbi-desktop sql
Last synced: 11 Oct 2025
https://github.com/chanmeng666/douban-review-scraper
【One star = One happy developer doing a little dance 💃⭐️】A robust Python scraper for collecting and analyzing movie reviews from Douban.com, featuring comprehensive data processing and analysis capabilities.
beautifulsoup4 data-analysis data-processing douban movie-reviews pandas python sentiment-analysis text-mining web-scraping
Last synced: 02 May 2026
https://github.com/mohit01chugh/edu_sql_analysis
SQL queries used to analyze student data.
data-analysis database education plpgsql postgresql sql
Last synced: 17 May 2026
https://github.com/ahsankhizar5/titanic-eda-visualization
Exploratory Data Analysis and Visualization on the Titanic Dataset using Python, Pandas, Matplotlib, and Seaborn to uncover survival patterns.
data-analysis data-science data-visualization eda kaggle machine-learning matplotlib pandas python seaborn titanic-dataset
Last synced: 31 May 2026
https://github.com/jedrzej-wydra/data-analysis-associate
Associate Data Analyst Exam by DataCamp
Last synced: 23 Mar 2025
https://github.com/prince-pastakiya/human-resources-tableau-project
👥 Interactive Tableau dashboard for HR analytics — includes workforce overview, demographics, income analysis, and detailed employee records with full filtering.
chatgpt data-analysis data-visualization human-resources numpy python python-faker tableau-dashboards tableau-public
Last synced: 18 Apr 2026
https://github.com/shruti-h/netflix-eda
Exploratory Data Analysis on Netflix Movies & TV Shows dataset using Python, Pandas, Matplotlib, and Seaborn
data-analysis data-science eda matplotlib netflix pandas-library python seaborn
Last synced: 01 May 2026
https://github.com/silvermete0r/sdu_hackathon_uss_db_analysis
Smart Data Ukimet Hackathon - "Data Modeling" case Solution - Topic: Store Analysis based on Unified Star Schema
data-analysis data-modeling postgresql python sql unified-star-schema
Last synced: 14 Apr 2026
https://github.com/jedrzej-wydra/data-analysis-pro
Professional Data Analyst Exam by DataCamp
Last synced: 23 Mar 2025
https://github.com/thinzarhninyu/dap
Notes and Projects for Data Analysis with Python course from FreeCodeCamp.org
data-analysis data-analysis-python ipynb jupyter-notebook python
Last synced: 18 Feb 2026
https://github.com/navp7/pizzasales_powerbi
This project involves creating a comprehensive sales performance dashboard using Power BI to visualize and analyze the sales data of an Italian pizza company.
data-analysis ms-sql-server ms-word powerbi visualization
Last synced: 13 Mar 2026
https://github.com/arianarmw/da01-bike-sharing-analysis
🚴♀️ Data analysis project on bike-sharing systems. Includes data wrangling, exploratory data analysis (EDA), visualization, and interactive dashboards built with Streamlit. Explore patterns in bike usage and rental data!
bike-sharing-analysis data-analysis exploratory-data-analysis python streamlit visualization
Last synced: 11 Apr 2026
https://github.com/mmfava/lonomia-host-plants-2024
This project investigates the relationship between Lonomia achelous and Lonomia obliqua caterpillars and their host plants. The project uses Docker for a consistent environment and R for statistical analysis, with detailed processes documented in Jupyter notebooks.
data-analysis host-plants lonomia lonomism r
Last synced: 01 May 2026
https://github.com/rohithsaji97/toll_gate
This is a electronic toll collection system.
data-analysis digital-image-processing ocr-text-reader opencv python3 trained-models
Last synced: 29 Apr 2026
https://github.com/siddharthbadal/kpmgdataanalysisproject
Data Analytics Consulting Virtual Internship
data-analysis data-cleaning data-visualization googlestudio msexcel powerpoint
Last synced: 05 Jan 2026
https://github.com/dzakwanalifi/stadata-x
Terminal UI untuk menjelajahi dan mengunduh data BPS Indonesia secara interaktif
bps-api cli-app data-analysis data-visualization indonesia-statistics indonesian-data open-data python statistics terminal-ui textual tui
Last synced: 20 Jan 2026
https://github.com/cdeweyx/bryce-harper-2016-analysis
Notebook analyzing Bryce Harper's disappointing 2016 campaign in historical context through data analytics.
data-analysis data-visualization python
Last synced: 01 May 2026
https://github.com/haonamnguyen/costumer-shopping-trends-analysis
This project analyzes a synthetic dataset of customer shopping behavior to see key trends and insights. Using SQL and Tableau, the analysis focuses on customer demographics, purchase patterns, and preferences, including age distribution, payment methods, shipping types, and top product categories.
data-analysis data-visualization sql tableau
Last synced: 05 Jan 2026
https://github.com/csoren66/customer-personality-analysis
Predict how different customer segments will respond for a particular product or service.
data-analysis data-visualization python
Last synced: 03 Mar 2025
https://github.com/virajbhutada/music-store-data-analysis-sql
Hands-on SQL data analysis project for music store. Enhance proficiency with database queries. Ideal for practitioners seeking real-world analytics experience. Gain insights into customer behavior, revenue trends, and genre preferences, empowering strategic decision-making in the music industry. Explore the project for a rich learning experience.
data-analysis data-insights data-science database genre-prediction music-industry music-store postgresql postgresql-database query-optimization revenue-trends sql sql-queries
Last synced: 01 May 2026
https://github.com/treasarose/us_candy_distribution_analysis_project
This project focuses on advanced data analysis and optimization using SQL. It includes queries for analyzing sales, product margins, and shipping efficiency for a US candy distributor.
data-analysis entity-relationship mssql optimization query sql-server sqlproject us-candy-distributor
Last synced: 12 Oct 2025
https://github.com/jatin-s16/hr_mysql_powerbi
This repository contains raw HR data along with key business questions. I performed data cleaning using MySQL queries and wrote analytical queries to extract meaningful insights. The results were then visualised using Power BI to enhance business understanding.
data-analysis data-science data-visualization mysql powerbi
Last synced: 29 May 2026
https://github.com/marknature/machine-learning-intern
Machine Learning tasks involving the Titanic Dataset and Breast Cancer Wisconsin (Diagnostic) dataset
data-analysis github jupiter-notebook machine-learning matplotlib numpy pandas python scikit-learn sklearn
Last synced: 10 Apr 2026
https://github.com/jbizzlefoshizzle/crowdfunding-trends-excel
Excel project examining funding trends for Kickstarter projects
category-breakdown data-analysis excel kickstarter kickstarter-campaigns line-graph pivot-charts pivot-tables trends
Last synced: 05 Jan 2026
https://github.com/nickenshidqia/uber-new-york-data-analysis
Analyze Uber pickups on New York to get insight from this data
data-analysis data-analyst exploratory-data-analysis python
Last synced: 04 May 2026
https://github.com/rodrigojunqueiradev/the-ultimate-mysql-bootcamp
The Ultimate MySQL Bootcamp
data-analysis data-engineering data-science data-visualization database dataset mysql sql
Last synced: 14 Apr 2026
https://github.com/ariyaarka/result-analysis
A simple analysis of result based on different factors shown in figures
data-analysis jupyter-notebook matplotlib numpy-library pandas-dataframe python seaborn
Last synced: 01 May 2026
https://github.com/akash1070/project--uber-data-analysis
To Determine UBER data from the dataset using Python
data-analysis data-science python
Last synced: 09 May 2026
https://github.com/blakeziegler/binary-classification-competition
Binary Classification of Insurance Crosselling Kaggle Competition
data-analysis data-science database kaggle kaggle-competition machine-learning python rstudio scikit-learn xgboost
Last synced: 17 Nov 2025
https://github.com/leosimoes/digitalinnovationone-analise-covid
Projeto prático "Criando modelos com Python e Machine Learning para prever a evolução do COVID-19 no Brasil" da Digital Innovation One.
arima-models data-analysis data-science python time-series
Last synced: 09 May 2026
https://github.com/jaseel342/pizza_sales_report
This Pizza Sales dashboards provide valuable insights, including sales trends, pizza category breakdown, size distribution, top-selling, and least-selling pizzas, enabling data-driven decisions to boost sales and business performance.
data-analysis dax-query power-query powerbi sql sql-server-management-studio visualization
Last synced: 05 Jan 2026
https://github.com/omnipotence-eth/manufacturing-quality-analytics
SQL + Python pipeline for semiconductor NCR analysis — supplier performance, defect Pareto, yield trends
analytics data-analysis etl manufacturing matplotlib pandas postgresql python quality sql
Last synced: 11 Apr 2026
https://github.com/rita94105/ethereum-fraud-detection
This project focuses on detecting fraudulent transactions in the Ethereum network using both traditional machine learning models and deep learning techniques. By analyzing transaction attributes and interaction patterns, we aim to develop an effective fraud detection model.
data-analysis deep-learning ethereum fraud-detection machine-learning
Last synced: 01 May 2026
https://github.com/rupeshtr78/machine_learning
Machine Learning TensorFlow Neural Networks Deep Learning
classification data-analysis deep-learning deep-neural-networks flink jupyter-notebook keras machine-learning machinelearning-python perceptron python3 spark tensorflow
Last synced: 11 Apr 2026
https://github.com/chirlmin-joo-lab/papylio
Single-molecule fluorescence trace extraction and analysis
biophysics data-analysis fluorescence fret single-molecule sparxs
Last synced: 12 Oct 2025
https://github.com/veronsheva/hr_dashboards
Interactive HR dashboard using Tableau & MySQL – explore employee trends, performance, attrition, and salary insights.
calculated-fields charts cte dashboards data-analysis data-cleaning design eda mysql queries tableau window-functions
Last synced: 24 Jan 2026
https://github.com/agb2k/twitter-analyzer
Project to extract tweets based on searches, analyze it's data and autocorrect potentially incorrect words
data-analysis python tweepy twitter
Last synced: 13 Oct 2025
https://github.com/javedali99/machine-learning-hw-solution-notebooks
Machine Learning Homework Solution Notebooks (UCF CAP5610)
data-analysis data-preprocessing data-science decision-trees machine-learning python random-forest recommender-system supervised-learning support-vector-machines titanic-kaggle unsupervised-learning
Last synced: 05 Jan 2026
https://github.com/madhursinghbhadoriya/atliq_salesdata_analysis-powerbi
AtliqH_SalesData Analysis - Power
dashboard data-analysis powerbi
Last synced: 21 Jan 2026
https://github.com/javedali99/geospatial-and-earth-science-data
A comprehensive collection of global earth science and geospatial datasets 🌍
data-analysis dataset earth-observations earth-science earth-sciences earthscience geography geospatial geospatial-analysis geospatial-analytics geospatial-data open-datasets satellite-data
Last synced: 05 Jan 2026
https://github.com/gaaniruddha/mphil
This repository contains a copy of my final MPhil presentation and panel report.
data-analysis gpu-imager radio-astronomy
Last synced: 03 Mar 2026
https://github.com/dpbm/diabetes-analysis
simple diabete analysis with python
analysis data-analysis data-science data-science-projects data-set diabetes-detection diabetes-prediction machine-learning pandas python
Last synced: 11 Apr 2026
https://github.com/malucor/livros
Programa em Python para fazer uma análise de dados sobre livros, a partir de um arquivo Excel.
analise-de-dados book books bookshelf data-analysis ipynb jupyter-notebook livro livros python
Last synced: 16 May 2026
https://github.com/louisfernando1204/websocket-benchmark
A comprehensive performance testing and analysis suite designed to evaluate and compare different WebSocket server implementations across various programming languages and libraries.
benchmarking broadcast-test coder-websocket csv data-analysis data-visualization echo-test golang gorilla-websocket nodejs python3 socket-io websocket-client websocket-server ws
Last synced: 09 Apr 2026
https://github.com/andersoncrs/arboles_de_decision_calidad_del_vino
Contiene un análisis detallado de la calidad del vino utilizando un modelo de clasificación basado en árboles de decisión. Incluye la exploración de datos, detección y manejo de valores atípicos, análisis Univariado y Bivariado, y la creación y evaluación de un modelo predictivo. El objetivo principal es predecir la calidad del vino.
data-analysis data-science data-visualization machine-learning matplotlib seaborn sklearn tree-decision
Last synced: 20 May 2026
https://github.com/stefagnone/-employee-salary-analysis-and-insights
Predictive analysis of employee salary determinants for an anonymized dataset, highlighting key factors influencing salary and providing insights for salary policy improvements.
business-intelligence data-analysis data-science employee-salary-analysis excel gender-pay-gap predictive-insights regression-modeling spss statistical-analysis
Last synced: 23 Feb 2026
https://github.com/andersoncrs/analisis_exploratorio_de_datos-eda-_rendimiento_estudiantil
Este análisis exploratorio de datos (EDA) realizado sobre el conjunto de datos de rendimiento estudiantil tiene como objetivo identificar y comprender los factores que influyen en el desempeño académico de los estudiantes. A través de la limpieza, transformación y visualización de datos, se busca descubrir patrones y relaciones significatvas.
data-analysis data-exploration data-exploration-and-preprocessing data-visualization seaborn
Last synced: 30 Mar 2025
https://github.com/27ahmad/heart-disease-diagnostic-eda
This project conducts Exploratory Data Analysis on a dataset related to heart diagnostic disease, aiming to derive valuable insights from the analysis.
data-analysis data-visualization pandas python
Last synced: 06 May 2026
https://github.com/dnut/associations
Python 3 library to identify high-dimensional statistical relationships in any data set.
analytics arch-linux association-rules data data-analysis data-mining data-science machine-learning python-modules
Last synced: 01 May 2026
https://github.com/jsimell/sleepanalysis
A Python data analysis project analyzing the sleep quality affecting factors and temporal patterns in the sleeping data of a single subject.
data-analysis matplotlib numpy pandas python scikit-learn seaborn
Last synced: 14 Apr 2026
https://github.com/gmalbert/supreme-court
Data Analysis of the US Supreme Court from 1790 to present
data-analysis data-science supreme-court
Last synced: 31 May 2026
https://github.com/yandexdataschool/ml-sweights-experiments
Experiments for the "Machine Learning on data with sPlot background subtraction" paper
data-analysis high-energy-physics machine-learning statistics
Last synced: 15 May 2025
https://github.com/zulfachafidz/titanic_explorer_predicting_survival_with_classification_using_knn_algorithm
Tracking Life Safety with the KNN Predictive Analysis Approach. Leveraging the Titanic Dataset, we apply classification analysis to predict the fate of passengers based on a variety of features.
algorithm algorithms data data-analysis data-mining data-science datamodeling datapreprocessing dataset knn-algorithm knn-classification machine-learning machine-learning-algorithms prediction-model
Last synced: 01 Sep 2025
https://github.com/gmalbert/rugby
Rugby Data Analysis and Sports Betting
data-analysis rugby sports-betting
Last synced: 31 May 2026
https://github.com/szymon-budziak/real_estate_house_prices_prediction
Predicting real estate house prices using various machine learning algorithms, including data exploration, preprocessing, model training, and evaluation.
data-analysis data-preprocessing data-science eda jupyter-notebook machine-learning matplotlib numpy optuna pandas predictive-modeling price-prediction python random-forest regression scikit-learn seaborn
Last synced: 21 Jan 2026