Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
- JSON Representation
https://github.com/lopez86/rust-mlearn
Machine Learning Tools in Rust
data-analysis data-science machine-learning rust
Last synced: 15 May 2025
https://github.com/itsagurin/data-visualization
Data analysis
data-analysis data-visualization matplotlib pygal python
Last synced: 03 Oct 2025
https://github.com/brunomontezano/digital-interventions-for-depression
📱 "Digital interventions for depressive symptoms: a randomized clinical trial" code
academia clinical-trials cognitive-behavioral-therapy data-analysis digital-health open-science smartphone-app
Last synced: 03 Oct 2025
https://github.com/blackcub3s/msc-finalthesis
The most important programming files, code functions and data processing pipelines for the Machine learning final thesis of my Master's degree. Also, the LaTeX code of the thesis.
data-analysis latex machine-learning numpy python sklearn
Last synced: 09 Apr 2026
https://github.com/dcostachar/telco-customer-churn-dashboard
An interactive Tableau dashboard using the Telco Customer Churn dataset to analyze key drivers of customer churn and develop data-driven retention strategies for the telecommunications industry.
business-intelligence customer-churn-analysis data-analysis data-visualization marketing-analytics tableau
Last synced: 09 Mar 2026
https://github.com/alan-oliveir/state-of-data-2022
Neste projeto faço a análise da distribuição das faixas salariais para os profissionais de nível júnior para o cargo de analista, cientista e engenheiro de dados.
data-analysis jupyter-notebook pandas-python seaborn-python
Last synced: 03 Oct 2025
https://github.com/vetrivel07/flight-price-prediction
Developed a flight price prediction model using Python, analyzing historical data to forecast airfare prices and help travelers make informed booking decisions
data-analysis data-visualization jupyter-notebook numpy pandas python
Last synced: 15 Jun 2025
https://github.com/buildwithlal/introduction-to-data-science-in-python-coursera
introduction to data science in python, part of Applied Data Science using Python Specialization from University of Michigan offered by Coursera
data-analysis matplotlib numpy pandas
Last synced: 03 May 2026
https://github.com/cassiofb-dev/fide-rating-analysis
The plot speaks for itself
chess data-analysis fide hans rating
Last synced: 15 Jun 2025
https://github.com/kineticloom/plydb-fun-nfl-analyst
Analyze NFL data with your AI agent
data-analysis football-analytics nfl
Last synced: 15 May 2026
https://github.com/hemangsharma/hotel-revenue-booking-analysis
This project provides a comprehensive revenue and reservation analysis for Highfield Hotel using historical data exported from booking systems and internal revenue reports. The goal is to derive actionable insights to improve room profitability, understand booking patterns, and support data-driven decision-making.
analysis data-analysis data-visualization hotel
Last synced: 10 Aug 2025
https://github.com/nafisrayan/decentai
A comprehensive platform built using ReactJS and Flask, combining blockchain technology with AI to create a secure and intelligent space for community engagement and policy discussions. Leverages NLP and LLM for meaningful interactions and sentiment analysis while ensuring data security and user privacy.
chatbot data-analysis data-visualization flask gemini gemini-ai gemini-ai-chatbot gemini-api government government-tech llm mongodb nlp polls python react tailwind voting-systems winknlp
Last synced: 12 Apr 2026
https://github.com/ifigeneiatsiflidou/applied-statistics-project
Project for an Applied Statistics course, involving exploratory data analysis and predictive modeling of movie revenue using engineered features and multiple linear regression.
correlation-analysis data-analysis linear-regression python scikit-learn visualization
Last synced: 29 Apr 2026
https://github.com/srummanf/elnino-anomaly-study
Study on El Niño’s impact on Chennai groundwater sustainability
data-analysis machine-learning python satellite-imagery-analysis
Last synced: 15 May 2026
https://github.com/srikarveluvali/dataanalysis
The "Dataset - Extraction, Analysis, and Visualization" project is a Python-based data analysis venture that focuses on exploring and interpreting the "Video Game Sales Analysis" dataset.
css data-analysis html javascript matplotlib numpy pandas python seaborn tableau
Last synced: 09 Apr 2026
https://github.com/farhad-here/data-visualization-analysis-dva
This is my data analysis project. Users can use this project to clean and preprocessing the date or data visualization. Individuals can impute or ecnode ther dataset.
altair bokeh data-analysis data-analysis-python io matplotlib numpy pandas plotly python sklearn streamlit
Last synced: 11 Apr 2026
https://github.com/yassin522/health-insurance-cross-sell-prediction
Prediction of Vehicles Health Insurance
data data-analysis data-science machine-learning plotly python
Last synced: 15 May 2026
https://github.com/12danielll/neurogenomics_project
This project focuses on analyzing sequencing data to understand molecular mechanisms of neurological diseases and predict the effectiveness of immunotherapy in breast cancer patients. It integrates Python and R scripts for data processing, statistical analysis, and visualization, alongside a comprehensive report detailing methods and findings.
bioinformatics biostatistics clustering clustering-algorithms data-analysis data-visualization deseq2 differential-gene-expression functional-analysis immune-therapy machine-learning neurological-disease neuroscience pca-analysis python r seurat single-cell-analysis
Last synced: 06 Apr 2026
https://github.com/gutow/langmuir_trough
Code to run homebuilt Langmuir Trough using Jupyter and Python. Link below for API docs:
data-acquisition data-analysis jupyter langmuir-trough plotting
Last synced: 11 Aug 2025
https://github.com/jofaval/teams-assistance-graph
Visualize in a Graph your Teams Attendance Report
charts data-analysis data-visualization data-viz github-copilot highcharts highcharts-js javascript microsoft-teams
Last synced: 08 Sep 2025
https://github.com/pentalpha/eu-car-emissions-analysis-2015
Analysis of CO² Emissions on Passenger Cars at the E.U. Contries, Year 2015.
data-analysis data-science dataset jupyter-notebook python python3
Last synced: 15 May 2026
https://github.com/pentalpha/bti-performance-study
A series of analysis on a large amount of data about the grades of students in the Technology Information course at UFRN
analysis big-data clustering data-analysis data-science data-visualization ipynb ipython jupyter-notebook performance-analysis plot python python3
Last synced: 15 May 2026
https://github.com/ct83/become-a-data-analyst-udacity
This repository contains all of the code, projects and reports that I wrote as I pursued my Udacity - Data Analyst NanoDegree.
data-analysis data-analysis-python data-analyst data-visualisation data-visualization-project datascience python udacity udacity-data-analyst-nanodegree
Last synced: 12 Aug 2025
https://github.com/dogan-the-analyst/data_analysis_in_the_office
Data analysis with R in the Office.
data-analysis ggplot r theoffice tidyverse
Last synced: 14 Mar 2025
https://github.com/r12habh/canada-imigration-data-analysis
Dataset: Immigration to Canada from 1980 to 2013 - International migration flows to and from selected countries - The 2015 revision from United Nation's website. (Cognitive Class Data Analysis with Python)
canada data-analysis data-science data-visualization datascience python python3
Last synced: 23 May 2026
https://github.com/itrauco/data-dirtying-tool
a simple command line tool to generate dirty data and do common data things in google cloud
data data-analysis data-engineering data-ops data-pipeline data-science data-visualization data-wrangling dirty-data google-cloud machine-learning
Last synced: 24 Feb 2025
https://github.com/dsrodrigovieira/favoritasales
Este repositório contém o projeto desenvolvido para o desafio do kaggle "Store Sales - Time Series Forecasting. Use machine learning to predict grocery sales"
data-analysis data-science kaggle-competition machine-learning python telegram-bot xgboost-regression
Last synced: 05 May 2026
https://github.com/jprmaulion/cholera-gedeo-ethiopia-spatial-analysis
Exploratory spatial analysis and visualization of cholera case clusters in Gedeo Zone, Ethiopia that integrates demographic and geographic data to identify environmental risk patterns and inform public health interventions. Includes geospatial mapping of cholera incidence relative to waterways and administrative boundaries.
cholera data-analysis data-analysis-python epidemiology ethiopia openstreetmap python spatial-analysis
Last synced: 12 Apr 2026
https://github.com/motapinto/agent-based-simulation-conquest
Agent-based simulation modelation of the conquest Battlefield gamemode
agent-based-simulation data-analysis jade java sajas swing
Last synced: 24 Jan 2026
https://github.com/aaisha-nexus/sql_company_insights
A beginner-friendly SQL project for managing employee records, departments, and sales transactions. Includes table creation, optimized queries, stored procedures, and window functions to extract business insights.
business-analytics data data-analysis dataanalysis-projects dataanalytics database-schema mssql-database query relational-databases sql sql-query ssms
Last synced: 12 Aug 2025
https://github.com/samanhur/data_visualization_pcc
First experiences in data visualization with python
data-analysis data-science data-visualization python3
Last synced: 23 Mar 2025
https://github.com/mtholahan/advanced-mysqlquery-tuning-mini-project
Analyzed EuroCup 2016 data with advanced SQL queries. Imported CSV datasets into MySQL, designed schema with match, player, and referee details, and implemented queries covering match outcomes, penalty shootouts, player stats, bookings, substitutions, and referee activity to explore tournament dynamics.
bootcamp data-analysis data-engineering data-modeling database eurocup football mysql queries soccer sports springboard sql
Last synced: 15 May 2026
https://github.com/arun-data-analyst/finance-reporting-sql
End-to-end SQL project for project/portfolio finance: schema, seed data, validation, data-quality checks, business queries, and KPI views (Power BI–ready).
data-analysis data-modeling data-quality database finance kpi portfolio-management powerbi sql sql-server ssms
Last synced: 18 May 2026
https://github.com/farhad-here/adventureworks_interactive_sales_dashboard_powerbi
An interactive Power BI dashboard for Adventure Works sales team to analyze performance, customers, products, and employees. Includes data cleaning, data modeling, DAX measures and advanced visualization features.
business-intelligence chart csv data-analysis data-cleaning data-cleaning-and-preprocessing data-visualization dax powerbi
Last synced: 13 Aug 2025
https://github.com/eesunmoon/genai_cor-recom
[Project] Outfit Coordination Recommender System using KoAlpaca
data-analysis fine-tuning generative-ai huggingface keyword llm numpy pandas python python3 selenium
Last synced: 06 Apr 2026
https://github.com/jovicdev97/Financial-Loan-DataScience-Notebook
using numpy and pandas to analyze a synthetic loan dataset with python
data-analysis matlabplot numpy pandas plotting python seaborn
Last synced: 12 Mar 2025
https://github.com/imgabreuw/minicurso-python-para-financas
Mini curso de Python para finanças, disponibilizado por Varos.
data-analysis financial-analysis python
Last synced: 13 Aug 2025
https://github.com/natgluons/fmcg-data-modeling
SQL, ARIMA, and K-Means Clustering for data analysis dan customer segmentation regarding sales data
arima-forecasting arima-model customer-segmentation data-analysis data-science-projects kmeans-clustering sales-forecasting
Last synced: 13 Aug 2025
https://github.com/itsachrafmansari/moroccan-real-estate-analysis
Scrape, process, analyze, and visualize data from Avito.ma to uncover current trends in Morocco's real estate market.
api-scraping data data-analysis data-mining data-science data-scraping data-visualization eda exploratory-data-analysis morocco real-estate web-scraping
Last synced: 13 Aug 2025
https://github.com/qalita-io/tutorials
Tutorials how to best use Qalita Products
data-analysis data-engineering data-quality data-quality-checks documentation qalita tutorials
Last synced: 17 Jan 2026
https://github.com/baguilar6174/python-jupyter-notebooks
Explore data analysis projects with Python, Jupyter and more tools. Discover stunning visualizations and reveal meaningful information in datasets to make informed decisions.
data-analysis jupyter-notebook kaggle pandas python
Last synced: 09 Apr 2026
https://github.com/danicaalana/sales-review-sentiment-analysis
This project is a sentiment analysis project using a machine learning model. It analyzes Amazon product reviews to determine whether the sentiment expressed is positive, negative, or neutral using Multinomial Naive Bayes Method.
amazon data-analysis data-science machine-learning naive-bayes python sales-review sentiment-analysis
Last synced: 15 May 2026
https://github.com/nishumehta/uber-rides-data-analysis
An in-depth analysis of Uber ride data for the year 2016, to uncover patterns in ride behavior, mileage trends, and frequent start locations to generate actionable insights for business decisions.
data-analysis jupyter-notebook matplotlib-pyplot pandas python tableau-dashboards
Last synced: 09 May 2026
https://github.com/applicativesystem/numpy-builder
code getter and numpty operator for numpy operations
data-analysis numpy numpy-python shell-script
Last synced: 15 Aug 2025
https://github.com/syarwinaaa09/exploring-airbnb-market-trends
a data analysis project exploring NYC Airbnb listings, using data visualization and pandas for price trends, room types, and reviews.
airbnb data-analysis data-science data-visualization jupyter-notebook new-york-city nyc pandas price-analysis reviews room-types
Last synced: 30 Apr 2026
https://github.com/daveornedo/ejercicios-machine-learning
Proyecto académico de Machine Learning realizado con Python
algoritmos data-analysis data-visualization diccionario hyperparameter-tuning jupyter-notebook jupyter-notebooks knn lambda-functions lasso list machine-learning phd rstudio
Last synced: 10 Apr 2025
https://github.com/smsraj2001/sds-datathon
A simple data science project/hackathon done as part of SDS course
data-analysis data-analysis-python data-cleaning data-science statistics statistics-for-data-science
Last synced: 16 Jul 2025
https://github.com/jonnor/acm-2019-dbscan
clustering data-analysis data-science health machine-learning nhanes nutrition
Last synced: 03 Apr 2025
https://github.com/ggarciajavier/udacity-dalf-project1-investigate-dataset
Work performed for the 1st project of Udacity Data Analyst Nanodegree: exploratory data analysis of a football dataset.
data-analysis football-analytics python python36 udacity-data-analyst-nanodegree
Last synced: 15 May 2026
https://github.com/truongnhatbui/automatidata
Automatidata
data data-analysis data-science data-visualization python tableau
Last synced: 08 Jul 2025
https://github.com/shubham200137/icc-women-s-t20-world-cup-data-analytics
Created a Power BI report to identify top 11 players for a T20 cricket team by scraping data from espncricinfo with Python, cleaning and transforming the data with pandas, and evaluating various player performance metrics.
beautifulsoup4 data-analysis data-visualization numpy-python pandas-python powerbi web-scraping
Last synced: 25 Feb 2025
https://github.com/rijul007/market-basket-analysis-using-r
Market Basket Analysis using association rules, leveraging R’s powerful tools for data-driven retail strategies.
Last synced: 02 Apr 2025
https://github.com/annnieglez/computer-vision-parking-lot
This project leverages computer vision techniques to analyze parking lot occupancy. The goal is to detect available parking spaces in real-time using image and video input.
computer-vision data-analysis data-science data-visualization google-colab image-classification image-processing machine-learning python transfer-learning
Last synced: 15 May 2026
https://github.com/douglasdavis/twaml
tW Analysis Machine Learning
data-analysis high-energy-physics machine-learning python
Last synced: 16 Aug 2025
https://github.com/dcs-training/scottishaccounts
This repo contains various examples of analysis that can be performed on the Statistical Accounts of Scotland dataset. Go to the readme file
data-analysis data-visualisation data-wrangling geographical-data r rmarkdown text-analysis
Last synced: 16 Aug 2025
https://github.com/kathisnehith/realestate-sales-analysis
Investigating real estate sales trends to understand market dynamics and inform investment decisions.
data-analysis excel realestate sales sql stastical-analysis-tools tableau
Last synced: 12 Feb 2026
https://github.com/sebastiansauer/hans-hackathon2025
Materials for a course on the evaluation of the AI student learn tool "HaNS"
Last synced: 04 Oct 2025
https://github.com/rachkat/random-foresst-analysis-r-studio-plotting-classification-tree
Classification analysis in R using the birthwt dataset. Built and compared Decision Tree and Random Forest models to predict low birth weight. Both achieved 71.05% accuracy, with Random Forest reducing overfitting and confirming maternal weight and age as key predictors.
classification data-analysis decision-trees machine-learning predictive-modeling r random-forest
Last synced: 04 Oct 2025
https://github.com/madhursinghbhadoriya/data_analysis_sales_insights_using_tableau
• Performed Data Cleaning using MySQL. • Data analysis and ETL in Tableau. • Created an Interactive Dashboard with significant information about the Sales Insights, Profit and Revenue Analysis.
data-analysis data-visualization dataanalysis etl mysql tableau-dashboards tableau-desktop
Last synced: 09 Apr 2025
https://github.com/faisal-fida/box-office-mojo-analysis
Analyzed box office data from Box Office Mojo, exploring relationships between worldwide revenue, release year, and a combined score that considers both factors. It includes visualizations like scatter plots, bar charts, and identifies top and bottom performing movies.
box-office data-analysis data-science python revenue-prediction visualization
Last synced: 06 May 2026
https://github.com/chandkund/loan-eligibility-prediction
This project is designed to predict the eligibility of loan applicants based on various factors such as income, credit history, and marital status. By analyzing historical loan application data, the model helps to determine whether a loan application should be approved or not.
data-analysis data-science data-visualization machine-learning-algorithms matplotlib numpy pandas python seaborn
Last synced: 09 Apr 2026
https://github.com/i-e-b/dynamictimewarp
A quick C# implementation of https://jeremykun.com/2012/07/25/dynamic-time-warping/
data-analysis pattern-matching working
Last synced: 17 Aug 2025
https://github.com/lotfiferaga/google-play-store-sentiment-analysis
Perform sentiment analysis on Google Play Store reviews using Python. Analyze user feedback to determine the overall sentiment (positive, negative, or neutral) towards various apps. Gain insights to aid developers and businesses in understanding user satisfaction levels and improving their products.
data-analysis data-visualization googleplayservices python reviewsanalysis-nlp
Last synced: 26 Feb 2025
https://github.com/bkataru/math-ia
Data and analysis for IB Math IA
data-analysis data-science data-visualization matplotlib modeling plotting regression-analysis regression-models
Last synced: 09 Apr 2025
https://github.com/edoardotosin/january-2025-southern-california-wildfires-burn-severity-sentinel2
Scripts and data for analyzing burn severity of the January 2025 Southern California wildfires using Sentinel-2 satellite imagery. This project explores the use of the Differenced Normalized Burn Ratio (dNBR) and Relativized Burn Ratio (RBR) to classify burn severity, leveraging publicly available satellite data.
burn-severity copernicus data-analysis earth-observation satellite-imagery sentinel-2 wildfire wildfire-detection wildfires
Last synced: 09 Feb 2026
https://github.com/harshindcoder/online_retail_data_clustering_project
This marketing analytics project uses RFM (Recency, Frequency, Monetary) features for customer classification, inspired by the online retail mining paper. The RFM model helps segment customers, identify high-value ones, and optimize marketing strategies.
customer-segmentation data-analysis data-visualization market-analytics
Last synced: 17 Aug 2025
https://github.com/nishumehta/retail-sales-analysis
Retail sales performance analysis using Python and Power BI.
data-analysis ipynb-notebook jupyter-notebook powerbi python
Last synced: 15 May 2026
https://github.com/davidzajac1/four-percent-rule-pandas-analysis
Analysis of the 4% Personal Finance Rule of Thumb
data-analysis data-visualization pandas python
Last synced: 20 Apr 2026
https://github.com/jpgiant/nyc_energy_prediction
A comprehensive code for predicting energy usage in NYC using Machine Learning Algorithms.
data-analysis data-science data-visualization folium jupyter-notebook machine-learning matplotlib numpy pandas python seaborn sklearn
Last synced: 10 Apr 2026
https://github.com/phanchenh/cohortanalysis_adventurework_sqlproject
Customer Retention and Lifetime Value Analysis Using Cohort Analysis on AdventureWorks Dataset (2011-2014)
adventureworks business-analytics business-intelligence clv-analysis customer-lifetime-value data-analysis data-visualization insights matplotlib microsoft-learn mssql mssqlserver pandas python retention-analysis retention-rate seaborn
Last synced: 12 Apr 2026
https://github.com/progati00/marketing-mix-modeling-mmm-for-marketing-budget-optimization
A Marketing Mix Modeling (MMM) project using Python to analyze channel performance, calculate ROI, and simulate marketing budget changes for better business decisions. Includes a trained Linear Regression model, ROI analytics, and a Flask API for revenue prediction.
api budget-optimization data data-analysis data-science ecommerce eda flask jupyter-notebook linear-regression machine-learning marketing-analytics marketing-mix-modeling python roi-analysis vscode
Last synced: 14 Apr 2026
https://github.com/palakjainanalyst/ecommerce-customer-spending-analysis
An end-to-end Ecommerce analytics project uncovering customer spending trends using Excel, Python, SQL, and Power BI. From raw data to interactive dashboards, this project delivers deep insights on spending patterns, high-value customer segments - showcasing a complete data-to-decisions workflow.
data-analysis data-visualization database ecommerce excel jupyter-notebook powerbi python spending sql
Last synced: 06 May 2026
https://github.com/grlyntng/rpims
Django Code and documentation for the Retail Pharmacy Inventory Management System (best final year project award)
data-analysis django erp forecasting-models lstm-neural-networks reporting
Last synced: 26 May 2026
https://github.com/ccoolbaugh/individualized_cooling_data_analysis
Matlab code to analyze data collected during a brown adipose tissue individualized cooling protocol.
brown-adipose-tissue cold-exposure data-analysis ibutton matlab skin-temperature thermoregulation
Last synced: 18 Aug 2025
https://github.com/prakashjha1/whatsapp-chat-analyzer
WhatsApp Analyzer means we are analyzing our WhatsApp group activities. It tracks our conversation and analyses how much time we are spending or saying it as “wasting” on WhatsApp.
data-analysis data-science natural-language-processing pandas pyhton regular-expression
Last synced: 15 May 2026
https://github.com/harmanveer-2546/movie-industry
Investigate the film industry to gain sufficient understanding of what attributes to success and in turn utilize this analysis to create actionable recommendations for companies to enter the industry.
business business-analytics data-analysis datatime film-industry graphs matplotlib movie-database numpy pandas python scraping-websites seaborn visualization web-scraping-python
Last synced: 10 Apr 2026
https://github.com/zborovskaanna/dou-salary-analysis
Python data analysis project focused on improving data manipulation skills using Pandas
Last synced: 26 Feb 2025
https://github.com/lucalullo/italian-justice-workload
Multidimensional analysis of the Italian justice system workload (2003–2024). A study of civil and criminal proceedings using judicial pressure and litigation indicators.
data-analysis italy judicial-workload justice-system kaggle legal-analytics pandas python time-series
Last synced: 24 May 2026
https://github.com/chiamakaukwuoma/portfolio
This repository contains various projects I've been privileged to work on outside of work.
aws-rds azure-fabric bigquery data-analysis docker-container elasticsearch excel grafana hadoop looker-studio mssql mysql postgresql powerbi python sql tableau
Last synced: 10 Apr 2026
https://github.com/farzeennimran/cryptocurrency-price-prediction
Cryptocurrency Price Prediction using machine learning models 💲💰💸📈📉📊₿
artificial-intelligence cryptocurrency data-analysis data-preprocessing data-science data-visualization deep-learning feature-selection machine-learning matplotlib numpy pandas plotly prediction python regression-models scipy seaborn sklearn
Last synced: 07 Jan 2026
https://github.com/jcm-ai/Personal-Data-Science-Projects
This page contains all of my personal data science projects. 📊📈📉👨💻
data-analysis data-visualization exploratory-data-analysis jupyter-notebooks machine-learning-algorithms matplotlib-pyplot numpy-library pandas-python personal-project predictive-modeling programming python3 scikit-learn scipy seaborn statistical-analysis
Last synced: 19 Aug 2025
https://github.com/jcm-ai/Quantium-Data-Analytics-Virtual-Experience-Program
This repository contains all about the proposed solutions to the assignments that I was required to complete as part of the Quantium Data Analytics Virtual Experience Program. 📊📈📉👨💻
commercial-thinking communication-skills data-analysis data-validation data-visualisation data-wrangling jupyter-notebook matplotlib-pyplot numpy-library pandas-python presentation-skills programming python3 scipy-stats seaborn statistical-testing
Last synced: 19 Aug 2025
https://github.com/j-wu1/analyse_ventes_jeuxvideo_python
Analyse Exploratoire de Données (EDA) sur les ventes de jeux vidéo avec Python, Pandas, Matplotlib et Seaborn dans un Jupyter Notebook.
data-analysis eda jupyter-notebook matplotlib pandas python seaborn
Last synced: 19 Aug 2025
https://github.com/apostolis-bloutsos-data/employee-data-eda
Mini EDA project on synthetic employee records using Python, pandas, and matplotlib
data-analysis eda jupyter-notebook matplotlib pandas python seaborn
Last synced: 09 May 2026
https://github.com/rugwiroparfait/alx_sql
This repo is where I save my queries and learning materials in Data Science program from ALX
anaconda data data-analysis jupyter-notebook sql
Last synced: 19 Aug 2025
https://github.com/mjshubham21/ny_yellow_taxi_python_da_project
A data analysis project of New York Yellow Taxi (Feb of 2025) using Python and its libraries for analytics like : NumPy, MatPlotLib, Pandas and Seaborn.
data-analysis jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 04 May 2026
https://github.com/cyberoctane29/epa-air-quality-aqi-analysis
This project involved analyzing air quality data from the EPA, focusing on the Air Quality Index (AQI). I used Python data structures like dictionaries and sets to manage and process the data, simulating real-world data analysis to assess pollution levels and their health implications.
data-analysis numpy pandas python statistics
Last synced: 10 Apr 2026
https://github.com/vara-co/solar-eclipse-2024
Group Project on the 2024 Solar Eclipse's Path over the US with an interactive map and a couple of visualizations on the data gathered.
data-analysis data-visualizations html-css-javascript interactive-map javascript map solar-eclipse
Last synced: 15 May 2026
https://github.com/emcramer/clockplot
Plotting utility for a "clockplot" that puts groups into a time-ordered heterogeneity visualization
biology data-analysis data-visualization heterogeneity pseudotemporal-ordering
Last synced: 10 Mar 2026
https://github.com/k31ner/inmopipeline
Proyecto integral de análisis y modelado predictivo de datos inmobiliarios, que abarca recolección, transformación, visualización y machine learning utilizando Python y herramientas modernas de ingeniería y ciencia de datos.
data-analysis data-engineering data-science fastapi python streamlit
Last synced: 08 May 2026
https://github.com/anas436/data-science-projects
Explore my diverse collection of projects showcasing machine learning, data analysis, and more. Organized by project, each directory contains code, datasets, documentation, and resources. Dive in to discover insights and techniques in data science. Reach out for collaborations and feedback.
data-analysis data-science machine-learning
Last synced: 27 Mar 2025
https://github.com/abhirajp595/python
Data Science Project using Python
data-analysis data-science data-visualization eda jyputer-notebook numpy pandas statistics
Last synced: 08 May 2026
https://github.com/puspacempaka/superstore-analysis-with-sql
This repository showcases various data analyses on the popular Superstore dataset using SQL queries. The analyses cover a range of business insights, including sales performance, customer segmentation, and product profitability. Each analysis is documented with the SQL queries used and explanations of the steps involved.
business-intelligence data-analysis sales-analysis sql superstore-dataset
Last synced: 09 Mar 2026
https://github.com/rubyyy1118/share-price-analysis
The assignment in my MSc Business Analytics course
data-analysis data-preprocessing data-science data-visualization matplotlib numpy pandas python seaborn
Last synced: 10 Apr 2026
https://github.com/aalkiyumi/predicting-hospital-readmission-risk
This project aims to create a predictive model that forecasts the likelihood of a patient being readmitted to the hospital within 30 days of discharge.
big-data-analytics cs5165 data-analysis data-cleaning data-engineering data-science introduction-to-cloud-computing jupyter-notebook machine-learning precipitation-analysis predictive-modeling pyspark statistical-analysis uc uc2026 university-of-cincinnati
Last synced: 11 Oct 2025
https://github.com/shriansh8619/sql_eda
Explored relational databases using SQL to perform comprehensive Exploratory Data Analysis (EDA), covering database exploration, segmentation, trend analysis, and performance ranking. Developed reusable SQL scripts to analyze dimensions, measures, and time-based metrics, helping uncover key business insights.
data-analysis exploratory-data-analysis mysql
Last synced: 20 Aug 2025
https://github.com/kaoutarmi/analyse-des-ventes-pour-optimiser-la-performance
Analyse des données de ventes pour identifier des opportunités d'amélioration des performances commerciales. Utilisation de Pandas pour le traitement des données, et Matplotlib/Seaborn pour la visualisation des tendances et des résultats.
business-intelligence data-analysis data-visualization jupyter-notebook matplotlib pandas sales-optimization seaborn
Last synced: 20 Aug 2025
https://github.com/aalkiyumi/project-3-docker-container-for-data-processing-script
This Dockerized Python application analyzes two text files (IF.txt and AlwaysRememberUsThisWay.txt). It counts total words, identifies the largest file, and finds the top three most frequent words in each. Results are saved to an output file and printed to the console.
cs5165 data-analysis data-engineering data-science docker introduction-to-cloud-computing statistical-analysis text-processing uc uc2026 university-of-cincinnati
Last synced: 17 May 2026