Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
- JSON Representation
https://github.com/palwisha-18/time_series_analysis_lex_vs_gdp
Analyzes how a country’s GDP per capita correlates with the life expectancy of its citizens over a period of about 100+ years
data-analysis data-visualization pandas plotl time
Last synced: 19 May 2026
https://github.com/owenl0000/housepricesproject
Kaggle Project
data-analysis data-science data-visualization gridsearchcv kaggle-competition kaggle-dataset linear-regression machine-learning machine-learning-algorithms numpy onehot-encoding ordinal-encoding pandas python random-forest-regression sckit-learn seaborn streamlit xgboost-regressor
Last synced: 09 Apr 2026
https://github.com/yousefmohammad/american_collage_quickanalysis
Quick Anaylsis about American Colleges
data-analysis data-visualisation data-visualization datanalysis datavisualisation datavisualization excel microsoft-excel
Last synced: 09 Mar 2026
https://github.com/valyaevgeorgiy/r_basic
Работа с основами среды R и тем самым изучения нового языка программирования, связанного непосредственно с анализом данных и построением графиков и диаграмм.
coding data data-analysis r rstudio
Last synced: 12 Dec 2025
https://github.com/aravind2060/employee_engagement_analysis_spark
Using Spark Structured APIs to analyze employee data and extract insights related to employee satisfaction, engagement, concerns, and job titles within an organization.
apache-spark data-analysis data-preprocessing docker docker-compose python
Last synced: 09 Apr 2026
https://github.com/jasoncobra3/finops-copilot
An end-to-end AI-powered FinOps platform that ingests cloud billing data, analyzes cost trends, answers natural-language questions using a RAG pipeline (LangChain + FAISS + sentence-transformers + Groq), and provides actionable cost optimization recommendations. Includes a FastAPI backend and Streamlit dashboard UI - fully containerized with Docker
ai-assistant cloud-cost-optimization cloud-enginee cost-analytics data-analysis devops docker faiss faiss-vector-database fastapi finops groq langchain llm pandas rag rag-pipeline sentence-transformers sqlite3 streamlit
Last synced: 13 Apr 2026
https://github.com/dcs-training/much-ado-about-nothing-missing-data-in-research
Repo for the Much ado about nothing workshop. Go to the Readme file
data-analysis data-cleaning data-wrangling r
Last synced: 15 Jun 2025
https://github.com/leabrodyheine/ml-kaggle-cirrhosis-data
This project showcases skills in machine learning, data preprocessing, and model evaluation using Python libraries such as scikit-learn, XGBoost, and Optuna. It involves implementing various machine learning models, handling imbalanced data, and employing imputation techniques to enhance model performance for predicting cirrhosis outcomes.
data-analysis data-pre imbalanced-data imputation machine-learning optuna pipeline scikit-learn xgboost
Last synced: 14 May 2026
https://github.com/rodolfo-brandao/pos-graduacao
[pt-BR] Repositório para armazenar alguns materiais e projetos de cada módulo da minha especialização em Ciência de Dados (2025–2027)
artificial-intelligence data-analysis data-science data-visualization databases deep-learning jupyter linear-algebra machine-learning python r statistics
Last synced: 09 Apr 2026
https://github.com/0xunkn0wn4m1r/data_engineering_banking_project
🏦 Build a complete data engineering workflow for a banking system, showcasing ETL processes, data transformations, and an interactive financial dashboard.
automation data-analysis data-cleaning data-science feature-engineering fintech-bank flask-api loan-default-prediction machine-learning mlops model-explainability numpy postgresql scikit-learn segmentation shap sql unsupervised-learning
Last synced: 09 Apr 2026
https://github.com/analyst-lochan/flight-delay-and-cancellation-dataset-2019-2023-
This project demonstrates a complete data analytics pipeline starting from raw real-world flight data to professional visual dashboards using SQL Server and Power BI. It showcases data import, cleaning, optimization, transformation, and dynamic DAX-based visual reporting.
airline-performance business-intelligence data-analysis data-cleaning data-modeling data-visualization dax etl flight-data kaggle-dataset portfolio-project powerbi powerbi-dashboard sql sql-server
Last synced: 09 Sep 2025
https://github.com/vishal-bhandary/sql-data-analytics
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis.
analytics business-intelligence customer-segmentation dashboarding data-analysis data-reporting data-visualization data-warehouse etl kpi product-analysis sql sql-server star-schema t-sql
Last synced: 02 Aug 2025
https://github.com/ashwin331133/powerbi-data_professional_survey_breakdown
This project analyzes survey data from individuals interested in transitioning to the data field. The survey aims to understand their backgrounds, motivations, and the challenges they face. Using Power BI for data visualization, the project provides insights into the demographics and preferences of these aspirants.
data-analysis data-visualization powerbi
Last synced: 03 Jan 2026
https://github.com/phanchenh/supplychaindashboard_datacpsupplychain
Tracking Trends in Supply Chain – A Sales and Profit Review (2015-2017)
business-analytics business-intelligence data-analysis data-visualization dax-languague dax-query mssql mssqlserver powerbi supply-chain supply-chain-management
Last synced: 01 Aug 2025
https://github.com/jigyasag18/ai-ml-salaries-and-ai-tools-usage-trends
This repository presents an in-depth Power BI analytics report on the AI job market trends and student AI tool usage from 2020 to 2025. It combines structured datasets (job postings, salaries, surveys) with custom DAX measures to uncover key patterns in salaries, remote work, industry demand, and student engagement. 5 interaractive dashboards made.
analysis data data-analysis data-visualization dataanalysis dataanalytics dataset datavisualization power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-visuals powerbidashboard visualization
Last synced: 16 Feb 2026
https://github.com/jigyasag18/global-terrorism-1970-2017-analysis-using-big-data
This repository explores over 180,000 terrorist incidents across 205 countries using Hadoop and Power BI. The project identifies global and regional patterns in terrorism, analyzes the impact on civilians, and highlights high-risk areas. Key insights include attack trends,weapon usage,top terror groups,& country-specific risks like those in India.
big-data big-data-analytics data data-analysis data-visualization dataanalytics dataset hadoop hive hive-database hive-db hivedb power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-report-validation powerbi-visuals powerbidashboard
Last synced: 19 Feb 2026
https://github.com/jigyasag18/airline-performance-and-passenger-satisfaction-project-using-big-data-analytics
This project analyzes 10 years of U.S. domestic airline data (~3GB) using Hadoop (Cloudera) and Hive for data processing. Power BI dashboards visualize key metrics like delays, on-time rates, air time, and diversions. The solution includes Hive queries, DAX measures, HDFS ingestion scripts, and year-wise insights with recommendations.
big-data big-data-analytics bigdata cloudera cloudera-hadoop cloudera-hadoop-framework data data-analysis data-visualization database hadoop hive power-bi powerbi powerbi-dashboard powerbi-dashboards powerbi-report powerbi-visuals powerbi-visuals-tools powerbidashboard
Last synced: 01 Aug 2025
https://github.com/rajesh9943/visualizing-global-development-trends-an-animated-analysis-of-life-expectancy-and-fertility-rates
To clean and analyze data to find trends in global population, fertility, and life expectancy from 1960 to 2016. This idea was inspired by hans rosling . To analyze the data, I used a scatter bubble chart, which clearly shows how's the population increased and the fertility rate decreased from 1960 to 2016.
data-analysis data-cleaning-and-preprocessing data-exploration expolatory-data-analysis identify-patterns reporting vizualisation
Last synced: 08 Oct 2025
https://github.com/jimohola/flight-price-predict-deployment
Deploying Machine Learning Models via Microsoft Azure
cloud-computing css data-analysis data-preprocessing data-visualization flask html machine-learning python3
Last synced: 05 May 2026
https://github.com/smehra1208/certifications
data-analysis data-visualization excel postgres powerbi python sql
Last synced: 14 May 2026
https://github.com/ryanga09/digitalent_fundamentaldatascience-selfpractice
A repository of hands-on projects from DigiTalent’s Fundamental Data Science training, covering web scraping, data exploration, data cleaning, and data annotation. Includes Jupyter notebooks and example code for practical learning.
data data-analysis data-science data-visualization dataset digitalent komdigi notebook-jupyter notebooks
Last synced: 02 Aug 2025
https://github.com/dlozeve/topological-persistence
Topological persistence diagram (barcode) of a triangulation
data-analysis persistence topology
Last synced: 02 Aug 2025
https://github.com/ausaaf-rh/movie-recommendation-system-collaborative-filtering
🎬 A comprehensive movie recommendation system implementing item-based collaborative filtering with cosine similarity. Features real-time recommendations, performance evaluation metrics (Precision@K, Recall@K), and interactive user interface. Built with Python, scikit-learn, and MovieLens dataset for academic research and learning purposes.
agents data-analysis jupyter-notebook python python3
Last synced: 17 Apr 2026
https://github.com/baslia/zillow_exploration
data-analysis data-science ds machine-learning pandas python
Last synced: 09 Apr 2026
https://github.com/takshak26/predict_blood_donations-
About The title of the project is “Predict Blood Donations”. It uses python as language, data science, and machine learning as the field of operation, TPOT library for model selection, logistic regression for model building, and jupyter notebook as the code editor.
data-analysis data-visualization datascience machine-learning python3
Last synced: 16 May 2026
https://github.com/yuvrajs2003/formula-1-performance-analysis
Analysis of F1 races and their drivers
data-analysis data-science data-visualization hyperparameter-tuning pandas python
Last synced: 09 Apr 2026
https://github.com/bpkaur/exploring-67-years-of-lego
Exploring 67 years of LEGO
data-analysis datacamp pandas python3
Last synced: 10 May 2026
https://github.com/jotstolu/netflix-sql-data-analysis-project
This project explores the Netflix dataset using SQL queries to uncover trends, patterns, and business insights that could help stakeholders understand content distribution, viewer preferences, and platform optimization
data-analysis sql sql-server tsql
Last synced: 02 Aug 2025
https://github.com/nushratjabenaurnima/cse_477_data_mining
A collection of labs, reports, Jupyter notebooks, and project outputs for the CSE 477 Data Mining course. This repository tracks my learning journey through data preprocessing, association rules, clustering, classification, and real-world data analysis with Python.
data data-analysis data-mining data-science google-colab-notebook jupyter-notebook machine-learning python python-3
Last synced: 09 Apr 2026
https://github.com/jedrzej-wydra/competition-cooperation
Competition, cooperation, and parental effects in larval aggregations formed on carrion by communally breeding beetles Necrodes littoralis (Staphylinidae: Silphinae)
data-analysis non-linear-regression r
Last synced: 20 Aug 2025
https://github.com/nahiyanhkhan/stock-market-data-analysis_capstone-project
In this course, learned and solved assignments on SQL and Python. Final capstone project was on analyzing "Stock Market Data". Achieved 100% score in every assignment.
data-analysis data-analytics matplotlib mysql mysql-database numpy pandas python sql
Last synced: 09 Apr 2026
https://github.com/rajesh9943/web-scraping-analysis-of-top-us-company-revenue-growth-in-2023
Explore the landscape of US business growth in 2023 with our dynamic project, 'Web Scraping for US 2023 Revenue Growth.' Utilizing advanced web scraping techniques, we unveil insights into the top companies driving economic expansion.
cleaning-data data data-analysis data-visualization manipulation numpy pandas pre-fill
Last synced: 16 Aug 2025
https://github.com/syed-amjad-ali/restaurant-sales-sql-project
This was a simple SQL project where I analyzed restaurant sales data, showcasing skills in data creation and querying. The project explores menu performance, order trends, and customer insights.
aggregations business-intelligence data-analysis guided-project joins maven-analytics querying restaurant-sales sales-data sql subqueries
Last synced: 03 Jan 2026
https://github.com/kuuhaku86/datmingemastik19
data-analysis data-mining data-science data-visualization
Last synced: 02 Aug 2025
https://github.com/quesocosteno03/data-analysis-projects
This repository serves as a collection of all my projects.
data-analysis jupyter-notebook powerbi
Last synced: 02 Aug 2025
https://github.com/manish506/loan-approval-prediction
Explore predictive modeling in this project by applying classification techniques to a loan approval dataset. Analyze and preprocess the data, then use models like K-Nearest Neighbors, Random Forest, SVC, and Logistic Regression to predict loan outcomes. Gain insights into approval factors and enhance prediction accuracy.
classification classification-models data-analysis data-science jupyter-notebook loan-approval-prediction machine-learning predictive-analytics predictive-modeling project python
Last synced: 19 Jan 2026
https://github.com/yamslam/contentsunderpressure_processing
A repository for data processing and analysis for Contents Under Pressure.
data-analysis data-processing data-visualization game-based-learning judgments process-safety
Last synced: 07 Sep 2025
https://github.com/waghraj1699/car-price-prediction
Implementation of ML algorithm to predict the car price
artificial-intelligence data-analysis data-science data-visualization feature-engineering linear-regression machine-learning machine-learning-algorithms regression-models
Last synced: 02 Aug 2025
https://github.com/idaraabasiudoh/credit_card_fraud_detection
This repository contains a machine learning project focused on detecting credit card fraud using Decision Tree and Support Vector Machine (SVM) classifiers.
data-analysis jupyter-notebook machine-learning python3 scikit-learn snapml
Last synced: 19 Feb 2026
https://github.com/thedevreda/jadaerospace
A Real life project showing how to improve selling aircraftparts and helping salers to focus more on effective products at JadAero
data data-analysis data-cleaning data-visualization jupyter-notebook powerbi python
Last synced: 02 Aug 2025
https://github.com/zeynepcol/data-science-cryptocurrencies-data-analysis-forecasting
Cryptocurrency price analysis and prediction using regression models
artificial-intelligence crytpocurrency data-analysis data-mining data-preprocessing data-processing data-science data-visualization extract-data financial-analysis linear-regression lstm machine-learning regression-algorithms xgboost
Last synced: 07 Jul 2025
https://github.com/abdullahashfaqvirk/PowerBI-Dashboards
A collection of Microsoft Power BI dashboards and reports designed to address business challenges and support data driven decision-making.
dashboards data-analysis data-driven data-science microsoft powerbi reports visualization
Last synced: 27 Sep 2025
https://github.com/borjamome/explorando-madrid
Exploring Madrid: A Data-driven Analysis with R 🐻🌳
data-analysis data-visualization madrid r
Last synced: 26 Mar 2025
https://github.com/lc-rezende/eqx_boston_dataset
Exploratory data analysis, clustering, and forecasting on Boston crime data (2011-2015), revealing key crime trends, hotspots, and temporal patterns to support data-driven insights for urban safety and policing strategies.
data-analysis exploratory-data-analysis jupyter-notebook kmeans matplotlib numpy pandas prophet-facebook python scikit-learn seaborn
Last synced: 09 Apr 2026
https://github.com/borjamome/top-goleadores
Mejores delanteros en Europa según los datos
data-analysis data-visualization football-analytics r
Last synced: 26 Mar 2025
https://github.com/faint-liebfraumilch101/fraud-detection-sql-unsupervised
🕵️♂️ Detect fraud in bank transactions using SQL for feature engineering and Python's Isolation Forest for unsupervised anomaly detection.
anomaly-detection banking-data data-analysis data-science financial-analytics fraud-detection isolation-forest machine-learning portfolio-project python sql sqlite unsupervised-learning
Last synced: 07 May 2026
https://github.com/tashi-2004/apache-flink-spark-data-streaming
This project showcases a real-time data streaming pipeline using Apache Flink, Apache Spark, and Grafana. It streams data, stores it in Parquet format, and performs aggregations for insights, with seamless visualization via Grafana dashboards.
apache-flink apache-spark data-aggregation data-analysis data-science data-streaming data-visualization flink flink-stream-processing flink-streaming grafana-dashboard grafana-plugin pyflink python3
Last synced: 09 Feb 2026
https://github.com/servierhub/adsv
Analyze delimiter-separated values files
csv csv-converter csv-format csv-parser csv-parsing csv-reader csv-reading data data-analysis data-engineering data-mining
Last synced: 28 Sep 2025
https://github.com/0xnu/england-house-prices
Predict house prices for the next five years across all English local authorities.
data-analysis england england-house-prices housing-market housing-market-analysis predictive-modeling regression
Last synced: 03 Aug 2025
https://github.com/mxagar/space_exploration
This repository is a collection of mini-projects and tutorials related to space images and geo-spatial data.
data-analysis deep-learning geospatial machine-learning
Last synced: 29 Sep 2025
https://github.com/mhkamel/ecommerce-targeting-system
A Flask-based E-Commerce Targeting System that provides customer segmentation and personalized product recommendations. Users can upload structured interaction data for analysis, receive AI-driven recommendations, and gain insights into user behavior. The application is built with Flask, Pandas, Scikit-Learn, and integrates an interactive web inter
ai bootstrap csv-processing customer-segmentation data-analysis data-science e-commerce flask machine-learning pandas python recommendation-system scikit-learn user-behavior web-application
Last synced: 09 Apr 2026
https://github.com/hari00887/analysis-of-global-terrorism
Analysis of Global Terrorism Using AHP A quantitative study of GTD data to assess attack severity and evolution across time and space.
data-analysis data-visualization powerbi
Last synced: 02 Mar 2026
https://github.com/bilalhameed248/xg-boost-ts-prediction
Predict/Forecast monthly and daily charges, as well as payments associated with claims generated during the billing process
charges-prediction data-analysis data-analysis-python data-modeling data-science payment-prediction prediction rcm revenue-forecast sql sql-query time-series time-series-analysis xgboost xgboost-model xgboost-regression
Last synced: 09 Mar 2026
https://github.com/rahulsm20/car-data
A data analytics project that involves analyzing a car dataset that includes information on various car brands, years, prices, mileage, and fuel types, in order to gain insights into the car market.
data-analysis data-analytics matplotlib numpy pandas python
Last synced: 09 Apr 2026
https://github.com/asghar-rizvi/hotel_reservation_data_analysis
This project involves a comprehensive data analysis of a hotel reservation dataset using Excel. The primary focus is on examining reservation cancellations. Through detailed analysis and visual representation.
dashboard dashboard-templates data-analysis data-analysis-excel data-representation data-science excel
Last synced: 02 Mar 2026
https://github.com/agustin-caceres/arg-telecom-analisis
Telecom Argentina Insights
business-analytics business-intelligence data-analysis data-visualization database postgresql python streamlit
Last synced: 09 Apr 2026
https://github.com/lyubov0406/data_analyst_portfolio
В репозитории собраны пет-проекты, демонстрирующие мои навыки в аналитике данных
data-analysis matplotlib numpy pandas portfolio python scipy seaborn sql tableau visualization
Last synced: 09 Apr 2026
https://github.com/PanosChatzi/Healthcare_and_Bioinformatics_Analyses
This repo contains the final assignments of the Data Analyst bootcamp by Workearly. Python and SQL were used to complete the assignments.
data-analysis data-cleaning data-visualisation jupyter matplotlib pandas python seaborn
Last synced: 05 Aug 2025
https://github.com/mikhaelmounay/salty-med
Salty Mediterranean - Grade 12 Data Analysis & Visualization Capstone Project
data-analysis data-visualization
Last synced: 02 Feb 2026
https://github.com/phanchenh/datacosupplychain_sqlproject
Supply Chain Optimization – Tackling Delivery Delays and Profitability Challenges (2015-2017)
business-analytics business-intelligence data-analysis insights jupyter-notebook mssql mssqlserver python supply-chain supply-chain-analytics supply-chain-optimization
Last synced: 09 Mar 2026
https://github.com/shrutiijoshi/corporate-campus-hiring-analysis
This project analyzes corporate campus hiring trends for fresh graduates in India.
dashboard data-analysis data-visualization excel powerbi
Last synced: 09 Mar 2026
https://github.com/elissorokin/data-analyst-portfolio
Это репозиторий, в котором я демонстрирую свои навыки, делюсь проектами и отслеживаю прогресс в области анализа данных и Data Science.
ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis
Last synced: 09 Apr 2026
https://github.com/acerbilab/svbmc
Stacking Variational Bayesian Monte Carlo (S-VBMC) algorithm for combining Variational Bayesian Monte Carlo (VBMC) posteriors to boost inference performance.
bayesian-inference data-analysis machine-learning model-fitting python stacking variational-inference
Last synced: 20 Jan 2026
https://github.com/andremenezesds/pa004_health_insurance
Health Insurance Cross-Sell(Learning to Rank Machine Learning Project)
backend backend-api data-analysis data-science data-visualization dataviz lgbm machine-learning matplotlib numpy optuna pandas python scikit-learn shell-script sql webapi xgboost
Last synced: 09 Apr 2026
https://github.com/bhaveshbhakta/wine-quality-prediction-using-ml
Wine Quality Prediction
data-analysis data-visualization machine-learning ml random-forest wine-quality-prediction
Last synced: 07 Aug 2025
https://github.com/theashishmavii/job-trends-analyzer-automation
End-to-end automation: job scraping, data analysis, and trends reporting for job seekers and researchers.
automation beautifulsoup data-analysis open-source pandas python selenium webscraping
Last synced: 07 Aug 2025
https://github.com/fortunewalla/birdstrikes
birdstrikes database created for postgresql with simple sample queries
birdstrikes csv data-analysis data-science database dataset pgsql postgresql practice sample sql sql-query workshop
Last synced: 02 Oct 2025
https://github.com/byte7/fifa17-analysis-and-prediction
⚽ FIFA 17 Analysis and Prediction⚽
data-analysis dreamteam fifa17 fifa17-analysis ultimate-team
Last synced: 26 Mar 2025
https://github.com/omdoshi13/pricing-of-laptops-using-ml
Data Analysis, training Machine Learning models, and Model Evaluation and Refinement for Pricing of Laptops dataset.
data-analysis data-analysis-project datascience google-colab jupyter-notebook machine-learning matplotlib model-evaluation model-refinement numpy pandas python scikit-learn
Last synced: 09 Apr 2026
https://github.com/sebastianofazzino/ibm-data-science-professional-certificate
In this repository I've stored exercises and projects I've been working on while attending IBM Data Science Professional Certificate, using Python and its libraries.
data-analysis data-mining data-science data-structures data-visualization database machine-learning matplotlib numpy pandas python regression seaborn sql
Last synced: 09 Apr 2026
https://github.com/namratagulati/tweets_analysis
This repository focuses on sentiment analysis of Twitter data using Python, Natural Language Processing (NLP), and the Natural Language Toolkit (NLTK). The goal is to extract valuable insights from social media discussions, such as word frequency, hashtag trends, and sentiment patterns.
analysis data-analysis natural-language-processing nlp-machine-learning nltk-corpus nltk-python sentiment-analysis twitter-sentiment-analysis
Last synced: 07 Aug 2025
https://github.com/v41bh4vr4jput/data-analysis-with-python
This repository is a comprehensive collection of data analysis projects and tutorials using Python's most powerful libraries: NumPy, Pandas, Seaborn, and Matplotlib. It is designed to help you explore, clean, visualize, and analyze data efficiently.
api data data-analysis data-visualization matplotlib numpy pandas python sakila-db seaborn
Last synced: 09 Apr 2026
https://github.com/gmasson/datadash
DataDash é uma biblioteca JavaScript e CSS para criar dashboards interativos, para visualização de dados dinâmicos em páginas web.
dashboard dashboard-application dashboards data-analysis data-science data-visualization javascript
Last synced: 08 Aug 2025
https://github.com/sourceduty/data_metrics
📈 Analyzing, sorting and visualizing data.
data data-analysis data-metrics data-sci data-science data-science-projects data-sorting data-visualization database dataset metrics sorting statistics visualization
Last synced: 08 Aug 2025
https://github.com/nurulashraf/linear-regression-insurance-premium
This analysis applies simple linear regression to explore the relationship between age and insurance premium. It includes model training, visualisation, and evaluation using MSE and RMSE to assess prediction accuracy.
beginner-project data-analysis insurance-data linear-regression machine-learning matplotlib predictive-modeling python regression-models scikit-learn
Last synced: 05 May 2026
https://github.com/jagoda11/elastic-vision
This repository contains a full-stack application designed to explore data from ElasticSearch🧐indices and visualize it using charts and graphs. The backend is built using Node.js and the frontend is powered🚀 by React.
backend chartjs dashboard-development data-analysis data-visualization docker elasticsearch frontend fullstack javascript material-ui monorepo mui-x node pie-chart react restful-api tables
Last synced: 09 Apr 2026
https://github.com/debjyotisaha/data-analytics-projects-phase-2
Developed and showcased various data analytics projects, including data preprocessing, exploratory data analysis, and visualization. Utilized tools such as Python, Pandas, NumPy, and Matplotlib to derive actionable insights and demonstrate problem-solving capabilities.
data-analysis data-preprocessing eda matplotlib numpy pandas python seaborn
Last synced: 09 Apr 2026
https://github.com/lorennmarque/logistics-data-exploratory-analysis-iflow
Delivery Data Exploratory Analysis
case-study data-analysis data-science data-visualization eda
Last synced: 20 Jan 2026
https://github.com/thc1006/taiwan-ai-usage-index
台灣 AI 使用指數 (TAUI) - 開源資料分析框架,測量分析台灣各地區 AI 技術採用率 | Taiwan AI Usage Index - Open-source framework for measuring regional AI adoption
ai-adoption anthropic-index bilingual data-analysis human-ai-collaboration onet-classification open-source policy-analysis privacy-protection python research taiwan tdd usage-index visualization
Last synced: 03 Oct 2025
https://github.com/ryan-wong1/analyzing-stress-and-fatigue-drivers-in-railroad-workforces-data-analysis
Railroad dispatcher data on demographics, work, lifestyle, and stress factors
data-analysis data-cleaning data-visualization exploratory-data-analysis sql
Last synced: 25 Jan 2026
https://github.com/devexpress-examples/web-forms-pivot-grid-calculate-running-totals
This example demonstrates how to calculate running totals in Pivot Grid for Web Forms.
asp-net-web-forms data-analysis dotnet pivot-grid pivot-grid-for-web-forms
Last synced: 08 Aug 2025
https://github.com/ibrahimm7004/machine-learning-projects
A collection of my ML projects.
ai artificial-intelligence data-analysis data-science llm machine-learning ml nlp python sklearn tensorflow
Last synced: 09 Apr 2026
https://github.com/muneeb706/human_activity_recognition
This project performs data cleaning and data exploration steps for Human Activity Recognition Using Smartphones Data Set in R programming language.
data-analysis data-cleaning data-exploration r-programming
Last synced: 08 Aug 2025
https://github.com/paraglondhe098/bigmart-sales-prediction
Implemented Xgboost model with optimum hyperparameters to predict sales in a BigMart mall.
data-analysis feature-engineering feature-extraction feature-transformation hyperparameter-tuning linear-regression machine-learning pandas python random-forest xgboost
Last synced: 09 Apr 2026
https://github.com/akunna1/energy-data-analysis-unc-campus
Link to Report: https://adminliveunc-my.sharepoint.com/:w:/r/personal/tadennis_ad_unc_edu/Documents/Capstone%20Group/Final%20Report%20Draft.docx?d=wba9e7182a9b948898133e4f89def1d90&csf=1&web=1&e=fQGAfy
arcgis-pro data-analysis dplyr excel geospatial-data-analysis ggplot ggplot2 lubricants tidyr tidyverse
Last synced: 08 Aug 2025
https://github.com/analyticslover/sales-python-dashboard
Dashboard Ventas Japon 2023
dashboards data data-analysis jupyter-notebook python3 sales streamlit
Last synced: 09 Apr 2026
https://github.com/jakobzmrzlikar/trg-dela
Data analysis of student job offers.
data-analysis ipython-notebook web-scraping
Last synced: 09 Aug 2025
https://github.com/siddhartha-padhy/heart-disease-predictor
data-analysis machine-learning pandas python
Last synced: 20 Apr 2026
https://github.com/busradeveci/odev2-branching
This project is prepared for Artificial Intelligence and Technology Academy Git GitHub Assignment 2. Using the “Wine Reviews” dataset from Kaggle, it converts wine ratings into star ratings and analyzes them.
data-analysis kaggle-dataset python wine-reviews-dataset
Last synced: 03 Oct 2025
https://github.com/arnabushna24/titanic-disaster-analysis
Titanic - Machine Learning from Disaster
data-analysis data-visualization python statistical-analysis
Last synced: 03 Oct 2025
https://github.com/misszeferino/bellabeat-data-analysis
Bellabeat Data Analysis using R
analytics data-analysis ggplot2 lubridate r r-programming tidyverse
Last synced: 09 Aug 2025
https://github.com/lashawnfofung/super-heroes-analysis-project
This portfolio project involves a detailed analysis of 732 superhero records from the heroes_information.csv dataset, comprising 11 columns of unique characteristics for each hero. The primary goal is to showcase key insights derived from this rich dataset, demonstrating proficiency in data analysis using SQL.
data-analysis datasets mysql-database mysql-server mysql-workbench sql
Last synced: 07 Jul 2025
https://github.com/prakhar-ff13/creating-customer-segments
Udacity Machine Learning Engineer Nanodegree project 3
clustering data-analysis data-science machine-learning udacity udacity-machine-learning-nanodegree unsupervised-learning
Last synced: 03 Oct 2025
https://github.com/yash22222/data-analysis-on-real-time-social-media-comments
EngageInsight analyzes user interactions in comment data. It provides insights through visualizations created using Python libraries like Pandas and Matplotlib. The project aims to uncover patterns and trends in user engagement. The visualizations provide an overview of comment lengths, the frequency of different types of replies.
data-analysis data-cleaning-and-preprocessing data-visualization matplotlib pandas pattern-recognition real-time-social-media-data seaborn trend-analysis
Last synced: 14 May 2026
https://github.com/svetlanam/pycon-workshop
Pycon CZ workshop: Better data analyses and product recommendations with Instagram data
data-analysis data-science martinus matplotlib pandas pycon2016 pyconcz python scikit-learn workshop
Last synced: 09 Apr 2026
https://github.com/abhigyan126/prompt2query
A Python desktop application for streamlined data analysis, enabling users to generate and execute Pandas and SQL queries with ease. Focus on reducing analysis time through an intuitive interface and efficient workflows
data-analysis data-science data-visualization database gemini generative-ai ide llm pandas pandas-interface python sql-interface
Last synced: 13 Feb 2026
https://github.com/itsagurin/data-visualization
Data analysis
data-analysis data-visualization matplotlib pygal python
Last synced: 03 Oct 2025
https://github.com/amiraflak/data-mining
Data Mining Course - Spring 2024
classification clustering data-analysis data-mining decision-tree-classifier eda pca
Last synced: 10 Aug 2025
https://github.com/brunomontezano/digital-interventions-for-depression
📱 "Digital interventions for depressive symptoms: a randomized clinical trial" code
academia clinical-trials cognitive-behavioral-therapy data-analysis digital-health open-science smartphone-app
Last synced: 03 Oct 2025