Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
https://github.com/mindlessmuse666/iris-ml-based-on-decision-trees
The project demonstrates the use of machine learning models based on decision trees and random forests to classify the Iris dataset. It includes data loading, model training, performance evaluation, and visualization of results. It is intended for learning the fundamentals of machine learning and data analysis.
classification data-analysis data-visualization decision-trees iris-dataset machine-learning model-evaluation python random-forest scikit-learn
Last synced: 17 Oct 2025
https://github.com/khulnasoft/data-science-materials
data-analysis data-engineering data-science data-visualization
Last synced: 17 Oct 2025
https://github.com/codeslash21/communicate_data_findings
Analyze and visualize Bay Wheel system data covering 2.5M individual trips, and communicate the findings from the dataset in the form of notebook slides.
bay-wheel data-analysis data-visualization explanatory-data-visualization exploratory-data-analysis
Last synced: 22 Jan 2026
https://github.com/antodata/hate_crimes_spain_2014_2017
Analysis of hate crimes in Spain between 2014 and 2017 using official data
chi-square chi-square-test data-analysis data-visualization datascience folium hatecrime json lgtbiq linear-regression maps matplotlib numpy pandas python python3 scipy selenium selenium-webdriver sklearn
Last synced: 14 Apr 2026
https://github.com/codeslash21/analyze-a-b-test-results
Analyze results of an A/B test run by an e-commerce website.
Last synced: 22 Jan 2026
https://github.com/abishek0103/olist-ecommerce-sql-project
SQL Project using Olist Dataset – E-commerce analysis with MS SQL Server to extract business insights.
business-insights data-analysis sql-server
Last synced: 19 Oct 2025
https://github.com/pauliorandall/airline-passenger-satisfaction-r
Analysing the Airline Passenger Satisfaction dataset from Maven Analytics
data-analysis data-analytics r
Last synced: 01 Aug 2025
https://github.com/Kaushik-Puttaswamy/Airline-Passenger-Referral-Prediction-Using-Machine-Learning
This project uses a machine learning model to predict if passengers referred by existing customers will book a flight, helping airlines target likely customers. Key factors like service ratings and value for money drive predictions, achieving over 90% accuracy.
airline-marketing customer-referral-prediction customer-satisfaction data-analysis feature-engineering hyperparameter-tuning machine-learning model-evaluation predictive-analytics
Last synced: 20 Oct 2025
https://github.com/lucashomuniz/Project-02
Data Analysis and Machine Learning Techniques for Liver Disease Prediction
classification-model data-analysis decision-tree healthcare-application knn-algorithm liver-disease-prediction logistic-regression machine-learning python-language python-script random-forest supervised-learning svm-model
Last synced: 20 Oct 2025
https://github.com/jimohola/zomato-restaurant-ratings-ml
Flask Deployment Machine Learning
css data-analysis flask html machine-learning python3
Last synced: 04 May 2026
https://github.com/mahdi-meyghani/movie-recommendation-system
A Python-based movie recommendation system utilizing popularity-based, content-based, and collaborative filtering models with data science and machine learning techniques.
data-analysis data-science machine-learning recommendation-system scikit-learn scikitlearn-machine-learning
Last synced: 23 Jan 2026
https://github.com/dcs-training/spatial_dynamics
Use of QGIS and R to analyse first- and second-order geospatial effects. See the Readme file.
data-analysis geographical-data gis qgis r statistics
Last synced: 23 Oct 2025
https://github.com/gunifiri/duckdb-ghw
🦆 Accelerate analytics with DuckDB's integration for GitHub workflows, enabling efficient data handling and processing directly within your repositories.
analytics analytics-engine big-data columnar-storage data-analysis data-science database duckdb in-memory-database open-source parquet python query-planner r sql
Last synced: 29 Apr 2026
https://github.com/browndwarf/contracosta
Wavelength dependent starspot contrast with Kepler/K2 and TESS
Last synced: 23 Jan 2026
https://github.com/jofaval/sonar
Binary Classification of Sonar Signals of Rocks and Metal cylinders in 1987
data-analysis data-science data-visualization machine-learning python scikit-learn sonar uci
Last synced: 09 Apr 2026
https://github.com/mohamed-khaled0/covid-data-exploration.sql
Covid-19 data
covid19-data data-analysis datacleaning microsoft-sql-server sql
Last synced: 06 Feb 2026
https://github.com/janiavdv/data-spirits
Analysis of alcohol and sports betting data, including a correlation investigation.
correlation data-analysis data-science machine-learning
Last synced: 11 Nov 2025
https://github.com/ljadhav25/linear_regression_data_science
Linear regression analysis is used to predict the value of a variable based on the value of another variable. The variable you want to predict is called the dependent variable. The variable you are using to predict the other variable's value is called the independent variable.
data-analysis data-science linear-regression machine-learning
Last synced: 26 Oct 2025
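The description above can be illustrated with a minimal sketch of simple linear regression with one independent variable, fitted by ordinary least squares. This is not code from the linked repository; the function names `fit_line` and `predict` and the sample data are hypothetical and chosen only for illustration.

```python
def fit_line(x, y):
    """Fit y = slope * x + intercept by ordinary least squares.

    x holds the independent variable, y the dependent variable.
    """
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of squared deviations of x, and co-deviations of x and y.
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(x_new, slope, intercept):
    """Predict the dependent variable for a new independent value."""
    return slope * x_new + intercept


# Hypothetical sample data that lies exactly on y = 2x + 1.
hours = [1, 2, 3, 4]
score = [3, 5, 7, 9]
slope, intercept = fit_line(hours, score)
print(slope, intercept)               # 2.0 1.0
print(predict(5, slope, intercept))   # 11.0
```

The sketch assumes the x values are not all identical, since the slope would otherwise be undefined.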
https://github.com/alaminsframe/weather-dengue-trends-in-dhaka
Analyzing weather-Dengue correlation in Dhaka (2020–2025)
beautifulsoup4 data-analysis data-scraping pandas public-health selenium tableau time-series-analysis
Last synced: 26 Oct 2025
https://github.com/vishalsiingh/deloitte-virtual-internship
Submission for the STEM Virtual Program by Deloitte via Forage.
coding cyber-security data-analysis deloitte development forage forensics
Last synced: 23 Jan 2026
https://github.com/limatix/limatix
Limatix datacollect and processtrak tools
data-analysis python scientific-workflows
Last synced: 23 Jan 2026
https://github.com/9dl/usbfalcon
Automatically copies files from plugged USB drives to a specified location, enabling quick data retrieval for analysis.
automation data-analysis data-retrieval ethical-hacking file-copying usb
Last synced: 27 Oct 2025
https://github.com/code-jl/nfl-kicker-predictor
A sophisticated Python application that provides real-time NFL kicker statistics and performance analysis with an intuitive graphical interface.
beautifulsoup data-analysis data-visualization espn football gui nfl prediction python real-time-analytics real-time-data sport-analytics sports-data statistics tkinter web-scraping
Last synced: 01 Jun 2026
https://github.com/nordszamora/ds-ml-projects
My repository for Data Science & Machine Learning projects.
data-analysis data-science data-visualization jupyter-notebook kaggle machine-learning matplotlib numpy pandas python scikit-learn seaborn
Last synced: 15 Apr 2026
https://github.com/OneMoreDavid/python-like-a-boss
This is where I stash my Python study material.
data data-analysis data-engineering data-science data-visualization datascience ipynb ipynb-jupyter-notebook ipynb-notebook numpy pandas python python3
Last synced: 28 Oct 2025
https://github.com/garcane/unicorn-companies-analysis
Tracking unicorn startups (valued at $1B+) provides valuable insights for investors and analysts to identify high-growth industries and emerging trends.
data-analysis exploratory-data-analysis financial-analysis investor postgresql sql
Last synced: 24 Jan 2026
https://github.com/rahulchouhan1/sql-data-warehouse-project
Building a modern data warehouse with SQL Server, including ETL Processes, data modeling, and analytics.
data-analysis data-cleaning data-engineering data-science data-warehouse datascience etl etl-pipeline sql sql-query sql-server
Last synced: 24 Jan 2026
https://github.com/snigdho8869/numerical-data-analysis-projects
Exploring numerical data analysis with credit card churn, fraud detection, health predictions and more.
adaboost cnn data-analysis deep-learning dnn ensemble-learning exploratory-data-analysis gradient-boosting-classifier keras logistic-regression machine-learning ml numeric numerical-analysis pandas python3 random-forest scikit-learn support-vector-machines tensorflow
Last synced: 15 Apr 2026
https://github.com/tasosfotiadis/time-series-forecasting-for-bitcoin
This project forecasts Bitcoin’s daily closing price using time series models. Data from Jan 2021 to Mar 2022 is processed by converting timestamps, resampling, and handling missing values. LSTM and ARIMA models are evaluated on MAE, RMSE, and MAPE, with LSTM achieving better accuracy while ARIMA is faster in training and inference.
arima bitcoin data data-analysis data-science deep-learning forecasting jupyter-notebook neural-networks python time-series
Last synced: 06 May 2026
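The three error metrics named in the description above (MAE, RMSE, and MAPE) can be sketched as follows. This is an illustrative sketch using their standard definitions, not code from the linked repository; the function names and the sample prices are hypothetical, and the MAPE function assumes no actual value is zero.

```python
import math


def mae(actual, predicted):
    """Mean absolute error."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root mean squared error."""
    mse = sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)
    return math.sqrt(mse)


def mape(actual, predicted):
    """Mean absolute percentage error, in percent (actual values must be non-zero)."""
    total = sum(abs((a - p) / a) for a, p in zip(actual, predicted))
    return 100.0 * total / len(actual)


# Hypothetical closing prices and forecasts, for illustration only.
actual = [100.0, 200.0, 400.0]
forecast = [110.0, 190.0, 380.0]
print(mae(actual, forecast), rmse(actual, forecast), mape(actual, forecast))
```

MAE and RMSE are expressed in the same unit as the forecast quantity, while MAPE is scale-free, which is one reason the three are often reported together.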
https://github.com/srimantapal205/dataengineerwireframedesigns
Data Engineer Wireframe Designs are essential for planning and visualizing data pipelines, architecture, and workflows before implementation.
data-analysis data-engineering dataflow dataflow-programming datapipeline dataprocessing development visualization
Last synced: 29 Jan 2026
https://github.com/wareflowx/excel-toolkit
A powerful command-line toolkit for Excel and CSV data manipulation, analysis, and transformation.
data-analysis data-wrangling excel pandas python uv
Last synced: 29 Jan 2026
https://github.com/smahala02/magnetism-lab
This repository contains Python scripts and data for analyzing inductance in toroidal coils to calculate the magnetic permeability of ferrite materials. The project helps classify materials as soft or hard magnets based on experimental data.
data-analysis inductance jupyter-notebook magnetism python toroids
Last synced: 29 Jan 2026
https://github.com/edumoraes1/comissao-reduzida
Creation of audience segmentation via SQL for enjoei's new reduced-commission feature
bq data-analysis salesforce sql
Last synced: 06 Feb 2026
https://github.com/surajwate/datalab
DataLab is a versatile toolkit designed to simplify data exploration, analysis, and visualization for data scientists.
data-analysis data-science python visualization
Last synced: 30 Jan 2026
https://github.com/mfakhriazhar/healthcare-dashboard-project
This project is a comprehensive data analysis and visualization of healthcare data using Power BI. It focuses on understanding patient distribution, billing trends, and hospital performance through a clean and interactive dashboard.
dashboard dashboardreporting data-analysis datacleaning excel powerbi powerquery
Last synced: 30 Jan 2026
https://github.com/nehar-2404/airbnb-nyc-eda-ml
This project analyzes Airbnb listings in New York City to uncover key insights about pricing, host activity, and neighborhood trends. It covers data cleaning, EDA, and basic machine learning to predict listing prices.
airbnb data-analysis eda machine-learning matplotlib pandas pyhton seaborn visualization
Last synced: 15 Apr 2026
https://github.com/jaseel342/ecommerce_sales_dashboard
The E-commerce Sales Dashboard project offers a comprehensive view of e-commerce sales performance using interactive Power BI dashboards. It focuses on key metrics like YTD Sales, YTD Profit, YTD Profit Margin, and Quantity of Products sold, analyzing data by product categories, states, and regions.
data-analysis data-modelling dax-expression excel power-query powerbi visualization
Last synced: 07 Feb 2026
https://github.com/luminati-io/indeed-dataset-samples
A sample dataset of over 1000 Indeed job listings, extracted using the Bright Data API, ideal for market analysis and growth.
api data-analysis datasets indeed jobs web-scraping
Last synced: 07 Feb 2026
https://github.com/jujulis18/olympicsmedalsdashboard
Olympic Dashboard – Paris 2024 is an interactive dashboard for exploring the performances of medal-winning athletes at the Paris 2024 Summer Olympic Games.
dashboard data-analysis data-visualization eda olympic python streamlit
Last synced: 31 Jan 2026
https://github.com/shafaq-aslam/pandas-lab
A comprehensive collection of Jupyter notebooks exploring Pandas, from Series and DataFrames to data cleaning, aggregation, merging, and visualization. A complete hands-on guide for mastering data manipulation and analysis with Python.
analytics data-analysis data-cleaning data-science data-visualization dataframe jupyter-notebook machine-learning pandas pandas-dataframe pandas-library pandas-series python python3 series
Last synced: 15 Apr 2026
https://github.com/steviecurran/gbt-scripts
IDL scripts for the reduction of Green Bank Telescope data
data-analysis data-compression data-visualization radio-astronomy spectroscopy
Last synced: 31 Jan 2026
https://github.com/tusharpandey003/chat_analysis
Analysis of a group chat with respect to each individual member of the group
chat-analysis chat-analyzer data-analysis data-science streamlit whatsapp whatsapp-chat whatsapp-web
Last synced: 01 Feb 2026
https://github.com/bineet-ratna-shakya/data-science-salary-analysis
Analyzing a dataset containing salaries of data science professionals from 2020 to 2023.
data-analysis data-science data-visualization jupyter numpy pandas python
Last synced: 01 Feb 2026
https://github.com/asghar-rizvi/world-energy-consumption-analysis-1965-2023-
An in-depth analysis of global energy consumption trends from 1965 to 2023, using data from various countries and regions.
data-analysis data-analysis-python data-science python real-world-data real-world-data-analysis real-world-problem-solving real-world-project visulaization
Last synced: 15 Apr 2026
https://github.com/keneandita/exploratory-data-analysis-eda-
Explore EDA on 5 datasets: Titanic 🚢, Heart Disease ❤️, Wine Quality 🍷, Car Price 🚗, and NBA Players 🏀. Includes data cleaning, preprocessing, and visualizations to uncover insights. Perfect for beginners to learn data analysis with Pandas, Matplotlib, and Seaborn! 🎨📈
data-analysis data-visualization eda matplotlib pandas python seaborn sklearn
Last synced: 15 Apr 2026
https://github.com/nagar2nd/jenson-usa-mysql-analysis
We are analyzing Jenson USA's dataset to gain valuable insights into customer behavior, staff performance, inventory management, and store operations. By crafting advanced SQL queries, the analysis explores key metrics such as product sales, customer spending, and order patterns, ultimately guiding strategic decision-making and operations.
data-analysis problem-solving sql
Last synced: 01 Feb 2026
https://github.com/amanraghuvanshi/adidas-western-zone-sales
Adidas United States Sales Report Analysis
data-analysis datatable pandas plotly statsmodels time-series
Last synced: 08 Feb 2026
https://github.com/mrgeislinger/bike-data-exploration
Data exploration of bike-related data
bicycle bike data-analysis data-science
Last synced: 08 Feb 2026
https://github.com/sroman0/data-analytics
Data Analytics Exercises is a collection of comprehensive university-level exercises aimed at enhancing skills in data analytics. The repository includes practical notebooks covering data manipulation, exploratory data analysis (EDA), statistical analysis, data visualization, and machine learning fundamentals.
data-analysis data-analytics data-science data-visualization education exercises exploratory-data-analysis hands-on-practice jupyter-notebook machine-learning python statistics
Last synced: 15 Apr 2026
https://github.com/athari22/applied-data-science-capstone
Applied-Data-Science-Capstone
api classification data-analysis data-cleaning data-collection data-science data-scraping data-visualization data-wrangling knn machine-learning sql
Last synced: 08 Feb 2026
https://github.com/josericodata/statisticsapp
Interactive statistics analysis app using Python and Streamlit. Perform key statistical tests, visualise distributions, and explore data with ease.
alpha-value chi-square-test confidence-intervals data-analysis dublin dublin-ireland europe hyphotesis-tests ireland normal-distribution null-hypothesis p-value portfolio python statistics streamlit t-test tech ubuntu z-test
Last synced: 26 Feb 2026
https://github.com/jweinst1/xenon
A processing based language
data-analysis interpreter reactive-programming
Last synced: 15 Apr 2026
https://github.com/shubham200137/spotify-listening-habits-analytics
Spotify Listening Habits Analytics is a project aimed at analyzing personalized Spotify listening habits and music trends. It involves Exploratory Data Analysis (EDA) with Python Pandas, data processing using SQL Server, and creating visualizations with Power BI. The goal is to uncover insights into listening patterns, track popularity, and artist.
data-analysis data-visualization exploratory-data-analysis jupyter-notebook pandas power-bi-dashboard sqlserver
Last synced: 18 Mar 2026
https://github.com/naninsv/apple-retail-sales-warranty-analysis
An advanced SQL project analyzing over 1 million rows of Apple retail sales data to solve real-world business problems, optimize query performance, and extract actionable insights. The analysis includes sales trends, warranty claims, product performance, and year-over-year growth.
business-intelligence data-analysis data-science etl insights retailanalytics sql sqladvance
Last synced: 26 Feb 2026
https://github.com/ninadpatil09/heart_disease_detection_analysis
The Heart Disease Detection Analysis aims to create a predictive model for identifying individuals at risk of heart disease. Using a dataset with attributes like age, sex, and health metrics, the project focuses on distinguishing patients with and without heart disease.
data-analysis data-cleaning data-science data-visualization machine-learning
Last synced: 15 Apr 2026
https://github.com/evgeniyarbatov/singapore-streets
Exploring Singapore street names
data-analysis geospatial gis mapping osm python singapore street
Last synced: 15 Apr 2026
https://github.com/haroontrailblazer/machine_learning
A curated resource hub for learning machine learning, featuring tutorials, code examples, datasets, and hands-on projects to build foundational skills and explore real-world applications.
data data-analysis data-visualization database dataset gradient-descent machine-learning pandas python3 random-forest sklearn statistics
Last synced: 16 Apr 2026
https://github.com/animesh-chourey/power-bi
Various projects from my attempt to learn Power BI
business-analytics data-analysis data-visualization powerbi
Last synced: 10 Feb 2026
https://github.com/gnneto/nf-analyzer
Python script to extract data from Electronic Invoices (Notas Fiscais Eletrônicas, XML) and generate a consolidated Excel file, focusing on financial information such as due dates and amounts for more detailed and efficient analysis, while preserving numeric formatting.
data-analysis excel finance nf-analyzer pandas python xlm
Last synced: 16 Apr 2026
https://github.com/sreekar0101/bank-financial-loan-performance-trend-analysis
This project analyzes the performance trends of financial loans using SQL for data extraction and Tableau for visualization. The goal was to perform exploratory data analysis (EDA) to understand key metrics like loan applications, funded amounts, interest rates, and debt-to-income ratios.
data-analysis data-visualization sql tableau
Last synced: 27 Feb 2026
https://github.com/georgehanymilad/mobile-usage-behavior-analysis
Excel Project for Data Analysis
data-analysis data-visualization dataanalyst dataanalytics excel-dashboard pivot-tables powerquery storytelling
Last synced: 11 Feb 2026
https://github.com/chinmayee4/sales-analysis-for-ferns-n-petals
Analyzed data by creating an interactive dashboard using MS Excel
data-analysis data-cleaning data-visualization excel pivot-tables powerquery
Last synced: 11 Feb 2026
https://github.com/shrutiijoshi/crm-sales-analysis
The dataset contained records exported from MavenTech's CRM from October 2016 to December 2017. It held details of opportunities with associated information such as product, account, and whether the sale was won or lost.
data-analysis data-visualization dax-functions powerbi powerquery
Last synced: 11 Feb 2026
https://github.com/vikktor93/proyecto-final-python-datascience
Dataset analysis of worldwide sales of video games on different platforms in 2020
data-analysis data-science jupyter-notebook kaggle matplotlib pandas python seaborn
Last synced: 16 Apr 2026
https://github.com/praveen-devknight/event-registration-analytics-dashboard
This project presents an interactive and visually-rich Power BI dashboard that analyzes registration data from a college-level technical and non-technical event, Teciton. The dashboard provides comprehensive insights into participant demographics, event preferences, food choices, and time-based trends.
data-analysis data-visualization excel powerbi sql
Last synced: 11 Feb 2026
https://github.com/joemull/pyjade
A data curation script for the Jane Addams Digital Edition
data-analysis digital-humanities
Last synced: 11 Feb 2026
https://github.com/ancapitigoi/portfolio
This repository is my portfolio containing past and current projects.
analitycs dashboard data-analysis data-cleaning data-mining data-visualization excel exploratory-data-analysis r-programming sql story-telling tableau
Last synced: 12 Feb 2026
https://github.com/shreshthvashisht/hiring-process-analytics
Statistics Using Excel
advanced-excel data-analysis data-science data-visualization excel hr-analytics statistics
Last synced: 27 Feb 2026
https://github.com/nabilshadman/power-bi-essential-training
Exercise files for Power BI Essential Training (2024): datasets and dashboards for hands-on learning
dashboard data-analysis data-science data-visualization power-bi power-bi-dashboard
Last synced: 12 Feb 2026
https://github.com/karlyndiary/data-visualisation-empowering-business-with-effective-insights
This Tata Group Sales Insights Dashboard uses a dataset provided by Forage.
analysis-and-presentation analytics-and-insights dashboard data-analysis data-cleanup data-interpretation data-visualization forage tableau tata-group visualisation
Last synced: 28 Feb 2026
https://github.com/secureauditx/ecommerce-user-behavior-analysis
E-commerce User Behavior Analysis with Streamlit Dashboard
customer-segmentation data-analysis ecommerce python streamlit
Last synced: 28 Feb 2026
https://github.com/muyangli76/covidsql
Global Covid Data analyzed in SQL and visualized in Tableau
data-analysis data-visualization sql tableau
Last synced: 14 Feb 2026
https://github.com/malakaburamila/power-bi-dashboards
A portfolio of interactive Power BI dashboards I developed, showcasing data visualization, analytics, and data-driven insights.
amazonsalesanalysis analytics dashboards data-analysis data-visualization datasets hranalytics power-bi
Last synced: 14 Feb 2026
https://github.com/mo-elshamy/machine-learning-practice
This repository serves as a collection of my work and learning in machine learning during my internship at Cellual-Technologies, including algorithm explanations, data preprocessing workflows, and two projects.
data-analysis data-science dbscan decision-trees eda gradient-boosting gxboost hierarchical-clustering kmeans-clustering knn-classification linear-regression logistic-regression machine-learning model pca polynomial-regression preprocessing random-forest support-vector-machines training
Last synced: 14 Feb 2026
https://github.com/guermoud98/data-analysis-with-python-projects
data-analysis matplotlib pandas python seaborn
Last synced: 16 Apr 2026
https://github.com/fhdsl/seattlestatsummer_r
A 4-day introduction to R programming, focused on Fred Hutch Research Interns
beginner beginner-friendly course data-analysis data-science introduction-to-programming r-programming tidyverse
Last synced: 19 Mar 2026
https://github.com/hlexnc/project-arepo
Data-driven stroke risk assessment & personalized recommendations, powered by machine-learning and an NLU-driven chatbot.
chatbot data-analysis docker docker-compose machine-learning nlu-chatbot python rasa scikit-learn sklearn streamlit
Last synced: 15 Feb 2026
https://github.com/risdorn/restaurant-delivery-platforms-analysis-bdm-project
This project analyzes restaurant delivery platforms to understand customer preferences, industry competition, and expansion opportunities. Conducted as part of the BDM project from IITM, it includes descriptive stats, distribution, correlation, regression, and geospatial analysis using multiple datasets.
data-analysis data-visualization jupyter-notebook kaggle
Last synced: 15 Feb 2026
https://github.com/nmelgar/marathons_data_viz
Data visualization project to analyze finishing times and other data.
csv csv-files data data-analysis data-insight data-visualization data-viz dataset tableau
Last synced: 15 Feb 2026
https://github.com/arunesh-tiwari/sales-analysis
Tableau Data Analysis Project.
data-analysis data-visualization tableau
Last synced: 01 Mar 2026
https://github.com/rachit901109/simppl_task
Social Media Analytics Dashboard
dashboard-application data-analysis data-visualization network-graphs social-network-analysis
Last synced: 16 Apr 2026
https://github.com/johannaschmidle/road-collisions-project
Analyzed UK road accident data from 2019 to 2022 to identify patterns and trends, in support of effective road management [Excel]
data-analysis data-visualization excel pivot-tables traffic-analysis
Last synced: 01 Mar 2026
https://github.com/mayankyadav23/amazon-sales-data-analysis
Diving into Amazon sales data to uncover hidden gems! 📈 Analyzing iNeuron's dataset to optimize sales strategies and boost performance 💡 Driving business growth with data-driven decisions! 💻
amazon data-analysis data-visualization ineuron-ai internship-project
Last synced: 02 Mar 2026
https://github.com/mbarbetti/bachelor-thesis-public
📖 My bachelor thesis at the University of Firenze
bachelor-degree bachelor-degree-thesis bachelor-thesis data-analysis lhcb-experiment particle-physics thesis
Last synced: 02 Mar 2026
https://github.com/dmatking/dtlab
Date Time Lab
csv data-analysis data-quality datetime python timezone
Last synced: 02 Jun 2026
https://github.com/pujolsluis/businessintelligencecourse
Repository for my BI Course projects
business-intelligence data-analysis data-mining data-warehouse
Last synced: 27 Mar 2026
https://github.com/soumya-kushwaha/uber-analysis
data-analysis data-science data-visualization uber-analysis
Last synced: 16 Apr 2026
https://github.com/asghar-rizvi/eda_student_dataset
This repository contains the results of data analysis and exploratory data analysis (EDA) conducted on the Student_Dataset. The analysis focuses on understanding various factors affecting student grades and visualizing these relationships using Matplotlib and Seaborn.
data-analysis data-analysis-python data-science jupyter-notebook python3
Last synced: 16 Apr 2026
https://github.com/adrianlardies/feelms_predict_by_emotion
Feelms is a mood-based movie recommendation app that uses collaborative filtering and machine learning to suggest films based on your emotions. Built with Streamlit and powered by AWS, Feelms personalizes each user's experience through simulated interactions and tailored predictions.
aws-ec2 aws-rds data-analysis data-science machine-learning python streamlit
Last synced: 16 Apr 2026
https://github.com/abhipatel35/gym-performance-analysis
Analyzing gym performance and user engagement in Arizona using Spark SQL, PySpark, and visualization techniques on the Yelp dataset.
apache-spark asu business-insights data-analysis data-processing-at-scale data-visualization dps gym-analysis rating-patterns sql trend-analysis user-insights yelp-dataset
Last synced: 16 Apr 2026
https://github.com/samuelson777/titanic-dataset-analysis
Exploratory data analysis of the Titanic dataset, uncovering insights on passenger survival rates based on gender, age, and class. Includes data cleaning, visualization, and findings.
data-analysis data-visualization exploratory-data-analysis kaggle machine-learning matplotlib pandas python seaborn titanic-dataset
Last synced: 16 Apr 2026
https://github.com/akash-srm/user-engagement-analysis
Analyzed user engagement and feedback data to derive actionable insights for an online learning platform.
analytics-projects data-analysis data-cleaning eda jupyter-notebook pandas python seaborn student-engagement
Last synced: 16 Apr 2026
https://github.com/satyacoder29/e-commerce-sales-analysis
Performed E-commerce Sales Analysis to identify trends, optimize sales, and improve decision-making. Analyzed customer patterns, seasonal trends, and product performance using Python, SQL, and Power BI. Delivered actionable insights to enhance revenue, streamline inventory management, and boost customer engagement.
data-analysis data-visualization datacleaning msexcel pivottables powerquerym visualisation vlookups
Last synced: 05 Mar 2026
https://github.com/dina-hosny/analyze-and-model-airline-system
Analyzing Airline System and Building Data Warehouse Model to Store the Data and Answer Some Business Questions
data-analysis data-modeling data-warehouse datawarehousing dwh plsql sql
Last synced: 05 Mar 2026
https://github.com/ngangawairimu/linear-regression-
This project builds a linear regression model in Python to predict outcomes and derive insights from feature data. It covers data cleaning, feature analysis, and model evaluation, showcasing predictive modeling techniques using scikit-learn, pandas, and visualization libraries.
data-analysis linear-regression machine-learning predictive-modeling python scikit-learn
Last synced: 17 Apr 2026
https://github.com/hugo-hattori/rpa_email_report
Robotic Process Automation Project.
automation data-analysis data-analysis-python data-analytics jupyter jupyter-notebook pandas pandas-dataframe pandas-python pyautogui pyautogui-automation pyperclip python time
Last synced: 17 Apr 2026