Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
https://github.com/leandrocollares/street-cherry-trees-in-vancouver
Street cherry trees in Vancouver: an exploratory data analysis
data-analysis data-visualization folium pandas plotly-express
Last synced: 17 Sep 2025
https://github.com/dimits-ts/sport-repression-repl-study
A replication study of the recent paper "International Sports Events and Repression in Autocracies: Evidence from the 1978 FIFA World Cup".
data-analysis jupyter regression-models replication-study statistical-analysis
Last synced: 25 Jul 2025
https://github.com/kushalagarwalla/netflix-movie-data-analysis
🚀 Netflix Data Analytics Project 🎬📊 | Analyzed 9K+ movies to uncover insights on genres, popularity, votes & release trends. Includes EDA, KPIs & visualizations using Python (Pandas, NumPy, Matplotlib, Seaborn). Supports data-driven content & engagement strategy.
data-analysis data-visualization jupyter-notebook numpy pandas python seaborn
Last synced: 06 May 2026
https://github.com/labex-labs/numpy-for-beginners
This comprehensive course covers the fundamental concepts and practical techniques of NumPy, the essential library for numerical computing in Python. Learn to create, manipulate, and analyze arrays efficiently.
array-manipulation array-slicing beginner-friendly course data-analysis data-science data-structures fast-computation hands-on labex labs linear-algebra matrix-operations numerical-computing numpy programming python python-programming scientific-computing vectorized-operations
Last synced: 20 Jun 2026
https://github.com/ddjain/jsonl-visualizer
A beautiful web tool for visualizing JSONL files with syntax highlighting and multiple view modes
data-analysis json jsonl viusal
Last synced: 18 Sep 2025
https://github.com/rh01/data-analysis-with-r
Duke University - Data Analysis With R
data-analysis r r-language r-studio rmarkdown
Last synced: 23 May 2026
https://github.com/sharoonjoseph321/indian-liver-diseases
Indian Liver Disease Analysis and Prediction: This project leverages the Indian Liver Patient Dataset (ILPD) to analyze liver disease trends and develop predictive models for early diagnosis. Through data preprocessing, exploratory analysis, and machine learning, it identifies key risk factors and builds classification models
data-analysis data-science data-visualization logistic-regression machine-learning pandas python seaborn
Last synced: 27 Jul 2025
https://github.com/grindelfp/data-analysis-example
One of my UNI Artificial Intelligence Systems course's projects.
data-analysis data-preprocessing ipynb
Last synced: 19 Sep 2025
https://github.com/tbep-tech/tberf-oyster
Materials for evaluating TBERF oyster restoration success
ccmp-bh4 ccmp-bh6 data-analysis tampa-bay tbep tberf
Last synced: 19 Feb 2026
https://github.com/tkhoa2711/twitter-hate-speech
Hate speech detection on Twitter
Last synced: 28 Jul 2025
https://github.com/tbep-tech/seagrass-analysis
Materials for assessing coverage changes and analysis of drivers of change for Tampa Bay seagrass
dashboard data-analysis seagrass tampa-bay water-quality
Last synced: 19 Feb 2026
https://github.com/tbep-tech/rookery-bay-training
Materials for R training at Rookery Bay Monitoring Workshop 2020
data-analysis open-science workshop
Last synced: 19 Feb 2026
https://github.com/rohitblaze10/survey_monkey_analysis--using-ipython
This data analysis project focuses on extracting insights from survey responses. It involves data cleaning, merging, and transformation using IPython (Pandas, OS) and SQL. The goal is to identify trends and patterns in survey data for better decision-making.
data-analysis ipynb ipython-notebook
Last synced: 28 Jul 2025
https://github.com/ashwin331133/sql-project--sales-data-analysis--walmart
This SQL-based Walmart data analysis project aims to identify top-performing branches and products and to optimize sales strategies, using Kaggle's Walmart Sales Forecasting Competition dataset.
Last synced: 03 Jan 2026
https://github.com/lucashomuniz/project-09
SALES DATA ANALYSIS WITH POWERBI AND PYTHON
business-analytics business-intelligence data-analysis data-science data-visualization excel powerbi python toy-project
Last synced: 28 Jul 2025
https://github.com/odinsride/monpl
PL/SQL Data Load Monitoring Tools
data-analysis database etl logging logging-framework monitoring oracle plsql
Last synced: 28 Jul 2025
https://github.com/swethajoseph/statistical-stock-performance-analysis
Conducted a statistical analysis of Microsoft, Tesla, and Apple stock performance compared to the S&P 500, examining price trends, volatility, and correlations to derive investment insights.
advancedexcel comparative-analysis data-analysis data-visualization datapreparation descriptive-statistics moving-average msexcel performance-analysis performance-metrics regression-analysis statistical-analysis
Last synced: 03 Jan 2026
https://github.com/aabbtree77/uci-marketing-analysis-cart
UCI bank marketing data analysis with decision trees (CART).
cart chatgpt commerce conversion-rate data-analysis decision-trees deepseek grok kovnatsky marketing-analytics miniconda scikit-learn-python uci-machine-learning
Last synced: 29 Jul 2025
https://github.com/kanasia-moore/sql-project
bigquery data-analysis data-analysis-project data-analytics dataset homelessness sql
Last synced: 21 Sep 2025
https://github.com/yash22222/literacy-exploration-analysis
Delve into India's literacy landscape through data analysis. Uncover regional disparities, high/low literacy states & gender imbalances.
csv data-analysis data-visualization government-data india literacy literacy-analysis states
Last synced: 29 Jul 2025
https://github.com/nathadriele/transaction_fraud_prevention_pipeline
A solution for detecting and preventing fraud in financial transactions, combining machine learning, business rules, and advanced statistical analysis. The system offers an interactive dashboard for real-time monitoring, data analysis, and fraud alert management.
data-analysis data-visualization docker fraud-prevention machine-learning matplotlib numpy pandas pipeline pytest python scikit-learn scipy seaborn streamlit tensorflow transaction xgboost
Last synced: 10 Apr 2026
https://github.com/karencofre/riesgorelativo-lookerstudio
Data analysis and predictive analysis project in Looker Studio and Google Colab
bigquery data-analysis data-science machine-learning matplotlib python sklearn sql
Last synced: 03 Jan 2026
https://github.com/phomint/udacity_free_datavisualization_with_tableau
Udacity free course using Tableau
data-analysis tableau udacity-course
Last synced: 09 Mar 2026
https://github.com/kartikey2807/bike-classification-1rt700
Binary classification problem involving Logistic regression, SMOTE and feature expansion.
data-analysis data-engineering data-visualization logistic-regression
Last synced: 30 Jul 2025
https://github.com/alekszhs/data-analyst-portfolio
Data Analytics Portfolio
data-analysis data-visualization excel mysql problem-solving python tableau
Last synced: 08 May 2026
https://github.com/alrza2003/google-data-analysis-case-study-cyclistic
This project analyzes Cyclistic’s trip data to identify patterns in bike usage between casual riders and annual members. The findings help optimize marketing strategies and membership conversions.
business-task cyclistic-bike-share-analysis-case-study data-analysis data-science data-visualization google-data-analytics google-data-analytics-capstone-project google-data-analytics-professional jupyter-notebook python rmarkdown tableau
Last synced: 09 May 2026
https://github.com/mainak-97/netflix-content-analysis-project
SQL-based analysis of Netflix’s movies and TV shows dataset to uncover content trends, popular genres, geographical insights, and audience preferences. Includes data queries, findings, and a presentation of key insights.
data-analysis mysql mysql-workbench powerpoint presentation-slides sql
Last synced: 23 Sep 2025
https://github.com/remram44/apex-legends-ocr-data
Get data from Apex Legends streams using OCR
apex-legends data-analysis video-games
Last synced: 31 Jul 2025
https://github.com/pauliorandall/airline-passenger-satisfaction-r
Analysing the Airline Passenger Satisfaction dataset from Maven Analytics
data-analysis data-analytics r
Last synced: 01 Aug 2025
https://github.com/aygp-dr/claude-log-stream
Advanced analytics engine for Claude Code logs with real-time processing capabilities
claude-api clojure data-analysis monitoring
Last synced: 24 Sep 2025
https://github.com/yousefmohammad/american_collage_quickanalysis
Quick Analysis of American Colleges
data-analysis data-visualisation data-visualization datanalysis datavisualisation datavisualization excel microsoft-excel
Last synced: 09 Mar 2026
https://github.com/aravind2060/employee_engagement_analysis_spark
Using Spark Structured APIs to analyze employee data and extract insights related to employee satisfaction, engagement, concerns, and job titles within an organization.
apache-spark data-analysis data-preprocessing docker docker-compose python
Last synced: 09 Apr 2026
https://github.com/jasoncobra3/finops-copilot
An end-to-end AI-powered FinOps platform that ingests cloud billing data, analyzes cost trends, answers natural-language questions using a RAG pipeline (LangChain + FAISS + sentence-transformers + Groq), and provides actionable cost optimization recommendations. Includes a FastAPI backend and Streamlit dashboard UI - fully containerized with Docker
ai-assistant cloud-cost-optimization cloud-enginee cost-analytics data-analysis devops docker faiss faiss-vector-database fastapi finops groq langchain llm pandas rag rag-pipeline sentence-transformers sqlite3 streamlit
Last synced: 13 Apr 2026
https://github.com/vishal-bhandary/sql-data-analytics
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as change-over-time, cumulative, performance, data segmentation, and part-to-whole analysis.
analytics business-intelligence customer-segmentation dashboarding data-analysis data-reporting data-visualization data-warehouse etl kpi product-analysis sql sql-server star-schema t-sql
Last synced: 02 Aug 2025
https://github.com/jigyasag18/airline-performance-and-passenger-satisfaction-project-using-big-data-analytics
This project analyzes 10 years of U.S. domestic airline data (~3GB) using Hadoop (Cloudera) and Hive for data processing. Power BI dashboards visualize key metrics like delays, on-time rates, air time, and diversions. The solution includes Hive queries, DAX measures, HDFS ingestion scripts, and year-wise insights with recommendations.
big-data big-data-analytics bigdata cloudera cloudera-hadoop cloudera-hadoop-framework data data-analysis data-visualization database hadoop hive power-bi powerbi powerbi-dashboard powerbi-dashboards powerbi-report powerbi-visuals powerbi-visuals-tools powerbidashboard
Last synced: 01 Aug 2025
https://github.com/jimohola/flight-price-predict-deployment
Deploying Machine Learning Models via Microsoft Azure
cloud-computing css data-analysis data-preprocessing data-visualization flask html machine-learning python3
Last synced: 05 May 2026
https://github.com/ryanga09/digitalent_fundamentaldatascience-selfpractice
A repository of hands-on projects from DigiTalent’s Fundamental Data Science training, covering web scraping, data exploration, data cleaning, and data annotation. Includes Jupyter notebooks and example code for practical learning.
data data-analysis data-science data-visualization dataset digitalent komdigi notebook-jupyter notebooks
Last synced: 02 Aug 2025
https://github.com/ausaaf-rh/movie-recommendation-system-collaborative-filtering
🎬 A comprehensive movie recommendation system implementing item-based collaborative filtering with cosine similarity. Features real-time recommendations, performance evaluation metrics (Precision@K, Recall@K), and interactive user interface. Built with Python, scikit-learn, and MovieLens dataset for academic research and learning purposes.
agents data-analysis jupyter-notebook python python3
Last synced: 17 Apr 2026
https://github.com/yuvrajs2003/formula-1-performance-analysis
Analysis of F1 races and their drivers
data-analysis data-science data-visualization hyperparameter-tuning pandas python
Last synced: 09 Apr 2026
https://github.com/kuuhaku86/datmingemastik19
data-analysis data-mining data-science data-visualization
Last synced: 02 Aug 2025
https://github.com/quesocosteno03/data-analysis-projects
This repository serves as a collection of all my projects.
data-analysis jupyter-notebook powerbi
Last synced: 02 Aug 2025
https://github.com/idaraabasiudoh/credit_card_fraud_detection
This repository contains a machine learning project focused on detecting credit card fraud using Decision Tree and Support Vector Machine (SVM) classifiers.
data-analysis jupyter-notebook machine-learning python3 scikit-learn snapml
Last synced: 19 Feb 2026
https://github.com/lc-rezende/eqx_boston_dataset
Exploratory data analysis, clustering, and forecasting on Boston crime data (2011-2015), revealing key crime trends, hotspots, and temporal patterns to support data-driven insights for urban safety and policing strategies.
data-analysis exploratory-data-analysis jupyter-notebook kmeans matplotlib numpy pandas prophet-facebook python scikit-learn seaborn
Last synced: 09 Apr 2026
https://github.com/prasannnnn/real-time-share-price-scraping-and-analysis
The Stock Sentiment Analyzer is a web-based application built with Streamlit, BeautifulSoup, and Pandas to help users analyze the sentiment of a stock (BUY, SELL, or HOLD) based on its financial data. The tool extracts key financial metrics like Market Cap, Stock P/E, Dividend Yield, ROCE, ROE, and the 52-week High/Low from Screener.in.
beautifulsoup4 data-analysis python sentiment-analysis streamlit streamlit-dashboard webscraping
Last synced: 03 Aug 2025
https://github.com/tashi-2004/apache-flink-spark-data-streaming
This project showcases a real-time data streaming pipeline using Apache Flink, Apache Spark, and Grafana. It streams data, stores it in Parquet format, and performs aggregations for insights, with seamless visualization via Grafana dashboards.
apache-flink apache-spark data-aggregation data-analysis data-science data-streaming data-visualization flink flink-stream-processing flink-streaming grafana-dashboard grafana-plugin pyflink python3
Last synced: 09 Feb 2026
https://github.com/mxagar/space_exploration
This repository is a collection of mini-projects and tutorials related to space images and geo-spatial data.
data-analysis deep-learning geospatial machine-learning
Last synced: 29 Sep 2025
https://github.com/hari00887/analysis-of-global-terrorism
Analysis of Global Terrorism Using AHP: A quantitative study of GTD data to assess attack severity and evolution across time and space.
data-analysis data-visualization powerbi
Last synced: 02 Mar 2026
https://github.com/bilalhameed248/xg-boost-ts-prediction
Predict/Forecast monthly and daily charges, as well as payments associated with claims generated during the billing process
charges-prediction data-analysis data-analysis-python data-modeling data-science payment-prediction prediction rcm revenue-forecast sql sql-query time-series time-series-analysis xgboost xgboost-model xgboost-regression
Last synced: 09 Mar 2026
https://github.com/asghar-rizvi/hotel_reservation_data_analysis
This project involves a comprehensive data analysis of a hotel reservation dataset using Excel. The primary focus is on examining reservation cancellations through detailed analysis and visual representation.
dashboard dashboard-templates data-analysis data-analysis-excel data-representation data-science excel
Last synced: 02 Mar 2026
https://github.com/PanosChatzi/Healthcare_and_Bioinformatics_Analyses
This repo contains the final assignments of the Data Analyst bootcamp by Workearly. Python and SQL were used to complete the assignments.
data-analysis data-cleaning data-visualisation jupyter matplotlib pandas python seaborn
Last synced: 05 Aug 2025
https://github.com/shrutiijoshi/corporate-campus-hiring-analysis
This project analyzes corporate campus hiring trends for fresh graduates in India.
dashboard data-analysis data-visualization excel powerbi
Last synced: 09 Mar 2026
https://github.com/acerbilab/svbmc
Stacking Variational Bayesian Monte Carlo (S-VBMC) algorithm for combining Variational Bayesian Monte Carlo (VBMC) posteriors to boost inference performance.
bayesian-inference data-analysis machine-learning model-fitting python stacking variational-inference
Last synced: 20 Jan 2026
https://github.com/theashishmavii/job-trends-analyzer-automation
End-to-end automation: job scraping, data analysis, and trends reporting for job seekers and researchers.
automation beautifulsoup data-analysis open-source pandas python selenium webscraping
Last synced: 07 Aug 2025
https://github.com/sourceduty/data_metrics
📈 Analyzing, sorting and visualizing data.
data data-analysis data-metrics data-sci data-science data-science-projects data-sorting data-visualization database dataset metrics sorting statistics visualization
Last synced: 08 Aug 2025
https://github.com/debjyotisaha/data-analytics-projects-phase-2
Developed and showcased various data analytics projects, including data preprocessing, exploratory data analysis, and visualization. Utilized tools such as Python, Pandas, NumPy, and Matplotlib to derive actionable insights and demonstrate problem-solving capabilities.
data-analysis data-preprocessing eda matplotlib numpy pandas python seaborn
Last synced: 09 Apr 2026
https://github.com/devexpress-examples/web-forms-pivot-grid-calculate-running-totals
This example demonstrates how to calculate running totals in Pivot Grid for Web Forms.
asp-net-web-forms data-analysis dotnet pivot-grid pivot-grid-for-web-forms
Last synced: 08 Aug 2025
https://github.com/muneeb706/human_activity_recognition
This project performs data cleaning and data exploration steps for Human Activity Recognition Using Smartphones Data Set in R programming language.
data-analysis data-cleaning data-exploration r-programming
Last synced: 08 Aug 2025
https://github.com/akunna1/energy-data-analysis-unc-campus
Link to Report: https://adminliveunc-my.sharepoint.com/:w:/r/personal/tadennis_ad_unc_edu/Documents/Capstone%20Group/Final%20Report%20Draft.docx?d=wba9e7182a9b948898133e4f89def1d90&csf=1&web=1&e=fQGAfy
arcgis-pro data-analysis dplyr excel geospatial-data-analysis ggplot ggplot2 lubricants tidyr tidyverse
Last synced: 08 Aug 2025
https://github.com/jakobzmrzlikar/trg-dela
Data analysis of student job offers.
data-analysis ipython-notebook web-scraping
Last synced: 09 Aug 2025
https://github.com/busradeveci/odev2-branching
This project is prepared for Artificial Intelligence and Technology Academy Git GitHub Assignment 2. Using the “Wine Reviews” dataset from Kaggle, it converts wine ratings into star ratings and analyzes them.
data-analysis kaggle-dataset python wine-reviews-dataset
Last synced: 03 Oct 2025
https://github.com/prakhar-ff13/creating-customer-segments
Udacity Machine Learning Engineer Nanodegree project 3
clustering data-analysis data-science machine-learning udacity udacity-machine-learning-nanodegree unsupervised-learning
Last synced: 03 Oct 2025
https://github.com/yash22222/data-analysis-on-real-time-social-media-comments
EngageInsight analyzes user interactions in comment data. It provides insights through visualizations created using Python libraries like Pandas and Matplotlib. The project aims to uncover patterns and trends in user engagement. The visualizations provide an overview of comment lengths and the frequency of different types of replies.
data-analysis data-cleaning-and-preprocessing data-visualization matplotlib pandas pattern-recognition real-time-social-media-data seaborn trend-analysis
Last synced: 14 May 2026
https://github.com/ashwani-199/dataanalysis2024
Python Data Analysis
data-analysis data-visualization numpy pandas python seaborn sklearn
Last synced: 09 Apr 2026
https://github.com/mkoeppe/jiawei-computations
Computations supporting Chapters 2 and 3 of Jiawei Wang's dissertation "Subadditivity of Piecewise Linear Functions", UC Davis, Ph.D. program in Mathematics, 2020
benchmark-framework branch-and-bound cluster cutting-planes data-analysis hpc integer-programming reproducible-research sagemath
Last synced: 10 Aug 2025
https://github.com/nafisrayan/decentai
A comprehensive platform built using ReactJS and Flask, combining blockchain technology with AI to create a secure and intelligent space for community engagement and policy discussions. Leverages NLP and LLM for meaningful interactions and sentiment analysis while ensuring data security and user privacy.
chatbot data-analysis data-visualization flask gemini gemini-ai gemini-ai-chatbot gemini-api government government-tech llm mongodb nlp polls python react tailwind voting-systems winknlp
Last synced: 12 Apr 2026
https://github.com/nuraj250/datainsighthub
A Node.js backend application that processes and analyzes personal user data to generate personalized insights and recommendations. It features secure user authentication, data upload and storage, custom algorithms for data analysis, and optional real-time notifications and third-party API integrations. Perfect for showcasing backend development
api-development backend-development bcrypt data-analysis data-analytics data-insights dotenv express jwt-authentication mongodb nodejs passport secure-api user-authentication
Last synced: 09 Apr 2026
https://github.com/ct83/become-a-data-analyst-udacity
This repository contains all of the code, projects and reports that I wrote as I pursued my Udacity - Data Analyst NanoDegree.
data-analysis data-analysis-python data-analyst data-visualisation data-visualization-project datascience python udacity udacity-data-analyst-nanodegree
Last synced: 12 Aug 2025
https://github.com/farhad-here/adventureworks_interactive_sales_dashboard_powerbi
An interactive Power BI dashboard for Adventure Works sales team to analyze performance, customers, products, and employees. Includes data cleaning, data modeling, DAX measures and advanced visualization features.
business-intelligence chart csv data-analysis data-cleaning data-cleaning-and-preprocessing data-visualization dax powerbi
Last synced: 13 Aug 2025
https://github.com/natgluons/fmcg-data-modeling
SQL, ARIMA, and K-Means clustering for data analysis and customer segmentation of sales data
arima-forecasting arima-model customer-segmentation data-analysis data-science-projects kmeans-clustering sales-forecasting
Last synced: 13 Aug 2025
https://github.com/itsachrafmansari/moroccan-real-estate-analysis
Scrape, process, analyze, and visualize data from Avito.ma to uncover current trends in Morocco's real estate market.
api-scraping data data-analysis data-mining data-science data-scraping data-visualization eda exploratory-data-analysis morocco real-estate web-scraping
Last synced: 13 Aug 2025
https://github.com/mothraa/etl-marketanalysis-webscraping
OC project 2
data-analysis etl python web-scraping
Last synced: 15 Aug 2025
https://github.com/douglasdavis/twaml
tW Analysis Machine Learning
data-analysis high-energy-physics machine-learning python
Last synced: 16 Aug 2025
https://github.com/rachkat/random-foresst-analysis-r-studio-plotting-classification-tree
Classification analysis in R using the birthwt dataset. Built and compared Decision Tree and Random Forest models to predict low birth weight. Both achieved 71.05% accuracy, with Random Forest reducing overfitting and confirming maternal weight and age as key predictors.
classification data-analysis decision-trees machine-learning predictive-modeling r random-forest
Last synced: 04 Oct 2025
https://github.com/i-e-b/dynamictimewarp
A quick C# implementation of https://jeremykun.com/2012/07/25/dynamic-time-warping/
data-analysis pattern-matching working
Last synced: 17 Aug 2025
https://github.com/edoardotosin/january-2025-southern-california-wildfires-burn-severity-sentinel2
Scripts and data for analyzing burn severity of the January 2025 Southern California wildfires using Sentinel-2 satellite imagery. This project explores the use of the Differenced Normalized Burn Ratio (dNBR) and Relativized Burn Ratio (RBR) to classify burn severity, leveraging publicly available satellite data.
burn-severity copernicus data-analysis earth-observation satellite-imagery sentinel-2 wildfire wildfire-detection wildfires
Last synced: 09 Feb 2026
https://github.com/harshindcoder/online_retail_data_clustering_project
This marketing analytics project uses RFM (Recency, Frequency, Monetary) features for customer classification, inspired by the online retail mining paper. The RFM model helps segment customers, identify high-value ones, and optimize marketing strategies.
customer-segmentation data-analysis data-visualization market-analytics
Last synced: 17 Aug 2025
https://github.com/davidzajac1/four-percent-rule-pandas-analysis
Analysis of the 4% Personal Finance Rule of Thumb
data-analysis data-visualization pandas python
Last synced: 20 Apr 2026
https://github.com/jpgiant/nyc_energy_prediction
A comprehensive code for predicting energy usage in NYC using Machine Learning Algorithms.
data-analysis data-science data-visualization folium jupyter-notebook machine-learning matplotlib numpy pandas python seaborn sklearn
Last synced: 10 Apr 2026
https://github.com/berkekaragoz/media-investments-data-analysis
Distribution of advertising investments in Turkey by medium
Last synced: 19 Aug 2025
https://github.com/chiamakaukwuoma/portfolio
This repository contains various projects I've been privileged to work on outside of work.
aws-rds azure-fabric bigquery data-analysis docker-container elasticsearch excel grafana hadoop looker-studio mssql mysql postgresql powerbi python sql tableau
Last synced: 10 Apr 2026
https://github.com/apostolis-bloutsos-data/employee-data-eda
Mini EDA project on synthetic employee records using Python, pandas, and matplotlib
data-analysis eda jupyter-notebook matplotlib pandas python seaborn
Last synced: 09 May 2026
https://github.com/alikhalajii/text-classification-life-sciences
Text classification of Life Science apps
data-analysis data-science datasets feature-importance jupyter-notebook pandas sbert scikit-learn word2vec
Last synced: 05 May 2026
https://github.com/kaoutarmi/analyse-des-ventes-pour-optimiser-la-performance
Analysis of sales data to identify opportunities for improving commercial performance. Uses Pandas for data processing and Matplotlib/Seaborn for visualizing trends and results.
business-intelligence data-analysis data-visualization jupyter-notebook matplotlib pandas sales-optimization seaborn
Last synced: 20 Aug 2025
https://github.com/oyebamiji-micheal/data-analysis-with-python-zero-to-pandas
This repository contains all the assignments and the project I completed while taking the course "Data Analysis with Python: Zero to Pandas" on Jovian
data-analysis numpy pandas python
Last synced: 10 Apr 2026
https://github.com/vaishnavipaithane/cyclistic-bike-share-analysis-case-study
This capstone project was done as a part of Google Data Analytics Professional Certificate course.
data-analysis r-programming-language rstudio
Last synced: 24 Aug 2025
https://github.com/harshnevse/performance_analysis_of_solar_plants_in_india
A Data Analysis project using Tableau
Last synced: 03 Jan 2026
https://github.com/gustavo-zamai/analysis_online_shopping_data
Online Shopping Analysis
csv-files data-analysis pandas plotly-express python3
Last synced: 17 Apr 2026
https://github.com/lauratrigo/fft_matlab
📡 Fourier Analysis for Ionospheric Data is a MATLAB script that applies the FFT to generate one-sided and two-sided spectra of ionospheric parameters (hF, f0F2, hmF2), identifying periodicities and comparing spectral signatures at 15-minute resolution. It is useful for studying ionospheric variations and disturbances.
data-analysis fast-fourier-transform fft fourier ionosphere matlab scientific scientific-initiation
Last synced: 29 Aug 2025
https://github.com/ahnaf19/rokomari_price_analysis
This was a job hiring assignment given by rokomari.com. The data was small, obviously generated for test purposes. I tried to express myself while diving as deep as possible.
data-analysis data-cleaning data-visualization etl
Last synced: 30 Aug 2025
https://github.com/luminati-io/walmart-dataset-samples
A sample dataset of over 1000 Walmart products, extracted using the Bright Data API, ideal for consumer market insights and competitor analysis.
api data-analysis dataset walmart walmart-scraper web-scraping
Last synced: 04 Jan 2026
https://github.com/nischay002/us-honey-production-analysis
Analysis of US honey production (1995–2021) using Python & data visualization. Identifies trends in honey yield, pricing, and colony distribution across states.
data-analysis data-visualization exploratory-data-analysis honey-production matplotlib pandas python seaborn us-agriculture
Last synced: 26 Feb 2025
https://github.com/singhs05/global-youtube-trends
Understand the impact of likes, comments, and dislikes on video consumption for trending videos.
data-analysis mssqlserver query sql
Last synced: 18 Mar 2026
https://github.com/mehrab-kalantari/olympics-data-analysis
A streamlit application to analyze the Olympics dataset from several views
data-analysis streamlit-dashboard streamlit-webapp
Last synced: 20 Apr 2026
https://github.com/mysftz/statistical-analysis
An in-depth review of statistical analysis in Python from datasets.
data-analysis python python3 statistics university university-project
Last synced: 14 May 2025
https://github.com/scailfin/benchmark-templates
Workflow Templates are parameterized workflow specifications for the Reproducible Open Benchmarks for Data Analysis Platform (ROB)
benchmarks data-analysis reproducibility
Last synced: 16 Jan 2026
https://github.com/jayqi/data-analysis-tools
Presentation on Data Analysis Tools
data-analysis presentation-slides
Last synced: 06 Jan 2026
https://github.com/virajbhutada/diamond-price-estimator
This project develops a predictive model to estimate diamond prices based on characteristics like carat, cut, color, and clarity. It covers data preprocessing, feature engineering, model selection, training, and evaluation. The final product is a web app where users can input diamond attributes to get accurate and instant price predictions.
cross-validation css data-analysis data-science-projects data-visualization eda feature-engineering html hyperparameter-tuning jupyter-notebooks machine-learning ml-algorithms model-deployment model-selection performance-optimization predictive-modeling python python-app user-interface
Last synced: 14 Apr 2026
https://github.com/moenessgannouni/englandweather
A mini-project that analyzes weather data in England using Linear Regression and Multiple Linear Regression. Ideal for learning and applying statistical analysis and predictive modeling.
data-analysis data-visualization linear-regression multiple-linear-regression rprogramming
Last synced: 22 Mar 2025
https://github.com/ronylpatil/whatsapp-group-chat-analysis
This data analysis project extracts useful information from the chat of our college's official WhatsApp group. Extracted features include the most active members of the group, the most active day of the week, the top 10 media contributors in the group, and many more.
data-analysis data-preprocessing data-wrangling feature-engineering
Last synced: 14 Jun 2025
https://github.com/sisolieri/prova_ds_saloocupacio2024
Admission challenge for Hackató Saló Ocupació by Barcelona Activa
arima barcelona catboost data-analysis data-visualizations forecasting machine-learning pandas public-funding python scikit-learn time-series xgboost
Last synced: 10 Apr 2026