Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-29 00:07:38 UTC
- JSON Representation
https://github.com/farzeen-2001/blinkit-sales-analysis-using-powerbi
The project provides an overview about the BlinkIt Sales performances
data-analysis data-visualization datacleaning excel powerbi
Last synced: 24 Jan 2026
https://github.com/nordszamora/ds-ml-projects
My repository for Data Science & Machine Learning projects.
data-analysis data-science data-visualization jupyter-notebook kaggle machine-learning matplotlib numpy pandas python scikit-learn seaborn
Last synced: 15 Apr 2026
https://github.com/gaurabkundu1/road-accident-data-analysis
This is an Excel project on Road Accident Data Analysis in the form of an interactive Dashboard.
dashboard data-analysis data-vizualisation excel road-accidents
Last synced: 24 Jan 2026
https://github.com/alunera-data/alunera-data
Hi, I’m Yvonne – building data solutions at the intersection of BI, SQL & Service Management
business-intelligence data-analysis data-engineering data-science github-profile portfolio rstats sql
Last synced: 28 Jan 2026
https://github.com/garcane/unicorn-companies-analysis
Tracking unicorn startups (valued at $1B+) provides valuable insights for investors and analysts to identify high-growth industries and emerging trends.
data-analysis exploratory-data-analysis financial-analysis investor postgresql sql
Last synced: 24 Jan 2026
https://github.com/diegopino/publibdata_codexhackathon
Public Library Data processing/analysis codex hackathon attempt
data-analysis data-visualization libraries public
Last synced: 24 Jan 2026
https://github.com/hdgiacon/power_bi_projects
Repositório contendo cursos, dashboards e projeto relacionados à análise de dados e Power BI.
data-analysis data-engineering data-visualization microsoft-power-bi
Last synced: 24 Jan 2026
https://github.com/snigdho8869/numerical-data-analysis-projects
Exploring numerical data analysis with credit card churn, fraud detection, health predictions and more.
adaboost cnn data-analysis deep-learning dnn ensemble-learning exploratory-data-analysis gradient-boosting-classifier keras logistic-regression machine-learning ml numeric numerical-analysis pandas python3 random-forest scikit-learn support-vector-machines tensorflow
Last synced: 15 Apr 2026
https://github.com/annnieglez/fraud-detection-eda
Fraud Detection - Exploratory Data Analysis (EDA). Analyzing financial transactions to detect fraud patterns using Python and Tableau. Libraries: Pandas, Seaborn and Matplotlib. Key Focus: Data cleaning, fraud trends, high-risk transactions, time-based patterns
data-analysis data-science data-visualization eda fraud-detection fraud-prevention matplotlib seaborn
Last synced: 28 Jan 2026
https://github.com/yash1882/music-store-data-analysis
A project focuses on analyzing music store data using SQL ♬
begineer-friendly data-analysis music music-store-data music-store-data-analysis sql-project
Last synced: 28 Jan 2026
https://github.com/anurag-ghosh-12/library_management_system_sql
This project showcases the development of a comprehensive Library Management System utilizing Structured Query Language (SQL). It demonstrates a practical application of relational database principles to efficiently manage library resources, member information, and borrowing/returning transactions.
data-analysis data-visualisation dbms-project sql
Last synced: 29 Jan 2026
https://github.com/andreicirciumaru/best-of-breed
CSV fundamentals screener: schema validation + market-cap weights
csv data-analysis finance pandas python screener
Last synced: 15 Apr 2026
https://github.com/anmolian/data_analysis_facebook_api_ads
Big Data Analytics
data-analysis data-visualization pyspark sql
Last synced: 24 Feb 2026
https://github.com/engineertolulope/us_states_living_ranking_analysis
Python script for analyzing and ranking U.S. states based on factors like cost of living, tax burden, diversity, crime rates, and climate. Uses weighted criteria to identify the best states to live in according to these metrics. Ideal for decision-making on relocation.
data-analysis data-science linear-regression machine-learning python scikit-learn
Last synced: 29 Jan 2026
https://github.com/wareflowx/excel-toolkit
A powerful command-line toolkit for Excel and CSV data manipulation, analysis, and transformation.
data-analysis data-wrangling excel pandas python uv
Last synced: 29 Jan 2026
https://github.com/mattdelaune/powerbi_healthcare_dashboard
Interactive Hospital Insights Dashboard built with Power BI, showcasing comprehensive analysis of patient demographics, treatment outcomes, and hospital performance.
data-analysis healthcare power-bi visualization
Last synced: 29 Jan 2026
https://github.com/isaqueiros/newspapersoldout-predictions-logistic_regression
This notebook is a study of the application of sklearn Logistic Regression model and analysis of metric quality with a focus on the impact of imbalanced data. The problem presented is the analysis of sales of newspapers of a local stand in order to classify the probability of the newspaper being Sold Out or Not, given a set of features.
data-analysis data-imbalance data-science logistic-regression machine-learning python sklearn-library sklearn-logistic-regression
Last synced: 18 Apr 2026
https://github.com/shrutiijoshi/marketing-campaign-report
The dataset includes information on campaign types, recipient segments, interactions (clicks, opens, bounces, etc.), and conversion metrics.
dashboard data-analysis data-visualization tableau-public
Last synced: 25 Feb 2026
https://github.com/joannescode/regex_with_py
Learning by practicing with Regex (Python)
Last synced: 30 Jan 2026
https://github.com/mfakhriazhar/us-companies-revenue-dashboard
This project is a data visualization dashboard built using Power BI that highlights lists of the largest companies in the United States by revenue. The goal is to provide an interactive overview of company performance across industries, focusing on revenue, employee metrics, and industry trends.
dashboard data-analysis data-visualization largest-companies-us powerbi revenue united-states
Last synced: 30 Jan 2026
https://github.com/mfakhriazhar/healthcare-dashboard-project
This project is a comprehensive data analysis and visualization of healthcare data using Power BI. It focuses on understanding patient distribution, billing trends, and hospital performance through a clean and interactive dashboard.
dashboard dashboardreporting data-analysis datacleaning excel powerbi powerquery
Last synced: 30 Jan 2026
https://github.com/manishabarse/hr_data_analysis
Used Microsoft SQL Server Management Studio and Power BI
data-analysis powerbi sql ssms
Last synced: 30 Jan 2026
https://github.com/jaseel342/ecommerce_sales_dashboard
The E-commerce Sales Dashboard project offers a comprehensive view of e-commerce sales performance using interactive Power BI dashboards. It focuses on key metrics like YTD Sales, YTD Profit, YTD Profit Margin, and Quantity of Products sold, analyzing data by product categories, states, and regions.
data-analysis data-modelling dax-expression excel power-query powerbi visualization
Last synced: 07 Feb 2026
https://github.com/luminati-io/indeed-dataset-samples
A sample dataset of over 1000 Indeed job listings, extracted using the Bright Data API, ideal for market analysis and growth.
api data-analysis datasets indeed jobs web-scraping
Last synced: 07 Feb 2026
https://github.com/jofaval/titanic-disaster
Data Analysis of the famous Titanic Disaster in 1912 with Machine Learning
classification data-analysis data-science data-visualization google-colab kaggle machine-learning python scikit-learn
Last synced: 15 Apr 2026
https://github.com/jujulis18/olympicsmedalsdashboard
Olympic Dashboard – Paris 2024 est un tableau de bord interactif permettant d’explorer les performances des athlètes médaillés des Jeux Olympiques d’été de Paris 2024.
dashboard data-analysis data-visualization eda olympic python streamlit
Last synced: 31 Jan 2026
https://github.com/amishidesai04/flipkart-mobile-sales-analysis
Flipkart Mobile Sales Analysis is a Tableau project that visualizes mobile sales data from Flipkart. It highlights trends in brand performance, pricing, ratings, and customer preferences. The interactive dashboard helps users explore key insights for data-driven decisions in e-commerce and retail.
dashboard data-analysis data-visualization storyboard tableau
Last synced: 31 Jan 2026
https://github.com/shafaq-aslam/pandas-lab
A comprehensive collection of Jupyter notebooks exploring Pandas, from Series and DataFrames to data cleaning, aggregation, merging, and visualization. A complete hands-on guide for mastering data manipulation and analysis with Python.
analytics data-analysis data-cleaning data-science data-visualization dataframe jupyter-notebook machine-learning pandas pandas-dataframe pandas-library pandas-series python python3 series
Last synced: 15 Apr 2026
https://github.com/allanotieno254/bank-loan-analysis-dashboard-power-bi
An interactive Power BI dashboard that analyzes bank loan data to provide insights into approval trends, default risks, and customer profiles. Designed to assist financial institutions in making data-driven lending decisions.
bank-loans business-intelligence dashboard data-analysis financial-analysis power-bi risk-assessment
Last synced: 31 Jan 2026
https://github.com/malthejorgensen/repx
Python regular expression file transformer
command-line-tool data-analysis text-processing
Last synced: 31 Jan 2026
https://github.com/gastonstat/stat133
STAT 133: Concepts in Computing with Data
data-analysis data-science data-visualization r-programming syllabus
Last synced: 25 Feb 2026
https://github.com/alex-pierron/ekip-enedis-genai
Repository for the team "Ekip" during the H-GenAI Hackathon 2025 organized at SIA Partners, Paris, France
amazon-nova artificial-intelligence aws aws-lambda data-analysis database generative-ai mistral nlp
Last synced: 15 Apr 2026
https://github.com/axsk/geekgraph
parse, cluster and visualize boardgamegeek.com user profiles
Last synced: 01 Feb 2026
https://github.com/bineet-ratna-shakya/data-science-salary-analysis
analyzing a dataset containing salaries of data science professionals from 2020 to 2023.
data-analysis data-science data-visualization jupyter numpy pandas python
Last synced: 01 Feb 2026
https://github.com/farzeen-2001/hr_analytics_dashboard_powerbi
HR data analytics using Power BI
data-analysis data-visualization datacleaning hr powerbi
Last synced: 25 Feb 2026
https://github.com/ludreinsalvador/life-expectancy-data-analysis
Contains Power BI dashboards analyzing global life expectancy trends, mortality rates, and health expenditures. Using a dataset sourced from Google Sheets, the project explores the impact of economic and healthcare factors on longevity.
dashboard data-analysis data-visualization healthcare-analysis life-expectancy powerbi
Last synced: 25 Feb 2026
https://github.com/asghar-rizvi/world-energy-consumption-analysis-1965-2023-
An in-depth analysis of global energy consumption trends from 1965 to 2023, using data from various countries and regions.
data-analysis data-analysis-python data-science python real-world-data real-world-data-analysis real-world-problem-solving real-world-project visulaization
Last synced: 15 Apr 2026
https://github.com/rissh/titanicsurvivalpredictionusingml
Predicting Titanic passenger survival through machine learning. This project includes data preprocessing, exploratory data analysis, feature engineering, and model training using Python. 🚢
data data-analysis data-science data-visualization dataanalysis jupiter-notebook machine-learning machine-learning-algorithms machinelearning matplotlib numpy pandas prediction prediction-model python python3 seaborn tenserflow tflearn titanic
Last synced: 01 Feb 2026
https://github.com/keneandita/exploratory-data-analysis-eda-
Explore EDA on 5 datasets: Titanic 🚢, Heart Disease ❤️, Wine Quality 🍷, Car Price 🚗, and NBA Players 🏀. Includes data cleaning, preprocessing, and visualizations to uncover insights. Perfect for beginners to learn data analysis with Pandas, Matplotlib, and Seaborn! 🎨📈
data-analysis data-visualization eda matplotlib pandas python seaborn sklearn
Last synced: 15 Apr 2026
https://github.com/nagar2nd/jenson-usa-mysql-analysis
We are analyzing Jenson USA's dataset to gain valuable insights into customer behavior, staff performance, inventory management, and store operations. By crafting advanced SQL queries, the analysis explores key metrics such as product sales, customer spending, and order patterns, ultimately guiding strategic decision-making and operations.
data-analysis problem-solving sql
Last synced: 01 Feb 2026
https://github.com/yeuner/file-analysis-sql-demo
Streamlit-based application that leverages pandas, sqlite3, and file handling libraries (OpenPyXL and PyArrow) to practice SQL queries, analyze datasets, and export results. A personal project to enhance Python and SQL skills.
data-analysis dataset pandas sql sqlite streamlit vizualization
Last synced: 15 Apr 2026
https://github.com/vladimiracunadev-create/python-data-science-program
Python Data Science Program — 197 clases en 9 partes. Pauta avanzada derivada de Géron, VanderPlas, Huyen, ISLP y Barocas/Hardt/Narayanan. Recurso personal de aprendizaje, enseñanza y mejora continua.
bootcamp data-analysis data-science education jupyter machine-learning matplotlib numpy pandas python scikit-learn
Last synced: 01 Jun 2026
https://github.com/shubham200137/customer-churn-analysis
In this case study, we analyze customer churn for a telecom company serving Southern California. The company faces increased competition and wants to retain customers by understanding the reasons for churn. Our objectives include improving service quality, identifying churn factors, pinpointing attractive services, and retaining high LTV customers.
data-analysis data-visualization numpy-python pandas-python sqlite tableau
Last synced: 15 Apr 2026
https://github.com/devbigboy/excel-power-query-get-transform
Power Query is a feature in Excel that allows you to quickly import data from multiple sources and easily clean, transform, and reshape it to suit your needs.
data-analysis data-science excel
Last synced: 08 Feb 2026
https://github.com/suhail25/hotel-booking-analysis
Analyzed the cancelling of booking of hotels and summarized insights to the Hotel Manager to increase profit by 30%. Demonstrated data exploration, cleaning, analysis using Python and its libraries: pandas, seaborn, matplot. Documented the results in PDF report: reduced cancellation by 30% and releasing discounts for 10 days in a month.
data-analysis ipynb-notebook matplotlib pandas python seaborn
Last synced: 08 Feb 2026
https://github.com/rodrigojunqueiradev/curso-sql-para-analise-de-dados
data-analysis data-science nosql pg pgadmin4 postgresql sql
Last synced: 08 Feb 2026
https://github.com/shibbir24/customer-sales-analysis-dashboard-using-tableau
Customer Sales Analysis Dashboard Using Tableau
dashboard data-analysis data-visualization sales-analysis tableau
Last synced: 08 Feb 2026
https://github.com/mdaltamashalam/uber-fare-prediction-models
Predicts the fare amount of Uber rides based on various factors such as pickup/drop-off coordinates, passenger count, and trip distance.
catboost data-analysis data-cleaning data-visualization lgbm-regressor machine-learning matplotlib numpy pandas python random-forest regression-models skit-learn xgboost-algorithm
Last synced: 26 Feb 2026
https://github.com/athari22/applied-data-science-capstone
Applied-Data-Science-Capstone
api classification data-analysis data-cleaning data-collection data-science data-scraping data-visualization data-wrangling knn machine-learning sql
Last synced: 08 Feb 2026
https://github.com/djm158/learning-microsoft-r
Working through https://www.gitbook.com/book/smott/introduction-to-microsoft-r-server/details and creating samples
data-analysis data-science microsoft microsoft-sql-server r
Last synced: 15 Apr 2026
https://github.com/josericodata/statisticsapp
Interactive statistics analysis app using Python and Streamlit. Perform key statistical tests, visualise distributions, and explore data with ease.
alpha-value chi-square-test confidence-intervals data-analysis dublin dublin-ireland europe hyphotesis-tests ireland normal-distribution null-hypothesis p-value portfolio python statistics streamlit t-test tech ubuntu z-test
Last synced: 26 Feb 2026
https://github.com/themihirmathur/uber-data-analytics
The goal of this project is to perform comprehensive data analytics on Uber trip data using a modern data engineering stack on Google Cloud Platform (GCP).
bigquery data-analysis data-engineering etl-pipeline google-cloud-platform looker python
Last synced: 09 Feb 2026
https://github.com/shubham200137/spotify-listening-habits-analytics
Spotify Listening Habits Analytics is a project aimed at analyzing personalized Spotify listening habits and music trends. It involves Exploratory Data Analysis (EDA) with Python Pandas, data processing using SQL Server, and creating visualizations with Power BI. The goal is to uncover insights into listening patterns, track popularity, and artist.
data-analysis data-visualization exploratory-data-analysis jupyter-notebook pandas power-bi-dashboard sqlserver
Last synced: 18 Mar 2026
https://github.com/barraharrison/airbnb-price-trends
Looking at how Airbnbs differ in price when it comes to location, room type and host activity
data-analysis data-science pandas plotly python streamlit
Last synced: 09 Feb 2026
https://github.com/27ahmad/amazon-sales-analysis
This repository contains an exploratory data analysis (EDA) and visualization project of Amazon sales data. The goal is to uncover insights and present key metrics through a Tableau dashboard.
data-analysis eda pandas python seaborn tableau
Last synced: 15 Apr 2026
https://github.com/nulltea/kicksware-scrapebot
Web scraping tool to retrieve sneaker details & images from web store sites
bot data-analysis pandas python sneakers web-scraping
Last synced: 15 Apr 2026
https://github.com/mathusanm6/critics-vs-players-analysis
This data analysis examines the relationship between critic scores, sales (owners), player engagement, and pricing to determine the ROI of critic reviews.
data-analysis data-science data-visualization game-reviews games-sales jupyter-notebook python-3 steam-games
Last synced: 16 Apr 2026
https://github.com/shruti23-ui/blinkit-powerbi-dashboard
A comprehensive Power BI dashboard analyzing Blinkit's sales performance, outlet metrics, and multi-tier market analytics with interactive visualizations and business intelligence insights.
data-analysis data-visualization microsoft-excel microsoft-power-bi powerbi sales-analysis sql
Last synced: 09 Feb 2026
https://github.com/purushothamadluru/kpi-driven-insights-dashboard-customer-churn-analysis
This repository features a Power BI project designed to deliver KPI-driven insights into customer churn patterns. Leveraging a robust dataset and advanced data modeling techniques, this project uncovers trends, identifies key drivers of churn, and enables businesses to make data-driven decisions.
customer-churn-analysis data-analysis insights-dashboard kpi powerbi
Last synced: 09 Feb 2026
https://github.com/animesh-chourey/power-bi
Various projects at my attempt to learn Power BI
business-analytics data-analysis data-visualization powerbi
Last synced: 10 Feb 2026
https://github.com/gnneto/nf-analyzer
Script Python para extrair dados de Notas Fiscais Eletrônicas (XML) e gerar Excel consolidado, com foco na extração de informações financeiras, como vencimentos e valores, para uma análise mais detalhada e eficiente. mantendo formatação numérica.
data-analysis excel finance nf-analyzer pandas python xlm
Last synced: 16 Apr 2026
https://github.com/sreekar0101/bank-financial-loan-performance-trend-analysis
About This project analyzes the performance trends of financial loans using SQL for data extraction and Tableau for visualization. The goal was to perform exploratory data analysis (EDA) to understand key metrics like loan applications, funded amounts, interest rates, and debt-to-income ratios using sql and tableau for visualization
data-analysis data-visualization sql tableau
Last synced: 27 Feb 2026
https://github.com/prateekbisht23/inventory_management
This project is an Inventory Management System built using Python (Pandas, NumPy, SciPy) and Jupyter Notebook. It allows efficient tracking of stock, performing data analysis, and generating useful statistical insights (mean, standard error, confidence intervals) to support better decision-making.
data-analysis jupyter-notebook management python3
Last synced: 11 Feb 2026
https://github.com/georgehanymilad/mobile-usage-behavior-analysis
Excel Project for Data Analysis
data-analysis data-visualization dataanalyst dataanalytics excel-dashboard pivot-tables powerquery storytelling
Last synced: 11 Feb 2026
https://github.com/nickenshidqia/startup-venture-funding-dashboard-data-analysis
The Startup Venture Funding Dashboard is a comprehensive visual representation of the dynamic landscape of startup funding, providing valuable insights into the top startups, funding round types, markets, startup statuses, and investor details.
dashboard data-analysis tableau tableau-dashboards
Last synced: 11 Feb 2026
https://github.com/shrutiijoshi/crm-sales-analysis
The dataset contained records exported from MavenTech's CRM from October 2016 to December 2017. It held details of opportunities with associated information such as product, account, and whether the sale was won or lost.
data-analysis data-visualization dax-functions powerbi powerquery
Last synced: 11 Feb 2026
https://github.com/vikktor93/proyecto-final-python-datascience
Dataset analysis of worldwide sales of video games on different platforms in 2020
data-analysis data-science jupyter-notebook kaggle matplotlib pandas python seaborn
Last synced: 16 Apr 2026
https://github.com/praveen-devknight/event-registration-analytics-dashboard
This project presents an interactive and visually-rich Power BI dashboard that analyzes registration data from a college-level technical and non-technical event, Teciton. The dashboard provides comprehensive insights into participant demographics, event preferences, food choices, and time-based trends.
data-analysis data-visualization excel powerbi sql
Last synced: 11 Feb 2026
https://github.com/prakshi-23/tableau
Report using Tableau
dashboard data-analysis data-visualization report tableau
Last synced: 11 Feb 2026
https://github.com/rodrigojunqueiradev/python-exercises
Repositório para armazenar exercícios realizados na linguagem Python / Repository to organize exercises with Python language
data-analysis data-science data-structures data-visualization database math pandas pandas-python python python-3 python3 sql statistics
Last synced: 16 Apr 2026
https://github.com/virajbhutada/telecom-customer-churn-prediction
Predict and prevent customer churn in the telecom industry with this project. Harness the power of advanced analytics and Machine Learning on a diverse dataset to develop a robust classification model. Gain deep insights into customer behavior and identify critical factors influencing churn using interactive Power BI visualizations.
churn-prediction classification-models customer-attrition-analysis customer-churn-prediction data-analysis data-science decision-tree-classifier eda logistic-regression machine-learning machine-learning-algorithms machine-learning-models pandas powerbi powerbi-desktop python random-forest-classifier roc-curve xgboost-classifier
Last synced: 09 Apr 2026
https://github.com/swethajoseph/credit-risk-assessment-eda-case-study
Conducted an Exploratory Data Analysis (EDA) using Python to assess credit risk, identifying key factors that contribute to loan defaults and improving lending decisions
data-analysis data-visualization datacleaning datapreparation exploratory-data-analysis feature-engineering jupyter-notebook matplotlib-pyplot numpy-library pandas-library python-library risk-analysis risk-assessment risk-management seaborn-plots visual-studio-code
Last synced: 27 Feb 2026
https://github.com/sharmas1ddharth/mode_of_transport_analysis
This project requires you to understand what mode of transport employees prefers to commute to their office. The data includes employee information about their mode of transport as well as their personal and professional details like age, salary, and work exp. We need to predict whether or not an employee will use private transport. Also, which variables are a significant predictor behind this decision.
Last synced: 11 Feb 2026
https://github.com/bala-1409/sql-projects
The repository contains Structured Query Language (SQL) Scripts. The Multiple SQL scripts for various projects which includes data cleaning, data pre-processing, data processing, data transformation and insights gaining through Query Language.
data-analysis data-mining data-science data-transformation database eda etl-framework exploratory-data-analysis microsoft-sql-server query-language sql sql-server sql-server-database sql-server-management-studio
Last synced: 27 Feb 2026
https://github.com/thanaraklee/exploring-and-analyzing-data-in-oracle-database
This project focuses on data analysis using SQL with Oracle Database 21c. It aims to familiarize with data management and data analysis using SQL commands and Oracle Database 21c.
data-analysis oracle-database sql sql-developer
Last synced: 12 Feb 2026
https://github.com/mlund2k/project-1-baseball-performance-vs.-attendance
Project assets for my first exploratory data analysis: Baseball Performance vs. Attendance.
bigquery data-analysis data-cleaning data-visualization excel rstudio sql tableau tidyverse
Last synced: 12 Feb 2026
https://github.com/rohitblaze10/-excel-_seller_store_analysis
A collection of data analysis projects showcasing data cleaning, exploration, visualization, and machine learning. Using "Excel" and more to uncover insights and drive data-driven decision-making. Feel free to explore, contribute, or collaborate!
data-analysis data-visualization excel excel-export
Last synced: 12 Feb 2026
https://github.com/andimashkulli/vpms
Vehicle Parking Management System for Gjon Buzuku Gymnasium
backend-api data-analysis databases frontend-react mongodb nodejs software
Last synced: 12 Feb 2026
https://github.com/nhoiyee/other-python-projects
using Python in Jupyter Notebook
data-analysis data-engineering data-mining jupyter jupyter-notebook jupyter-notebooks python python3
Last synced: 12 Feb 2026
https://github.com/yalai92/alfalfa_imp_exp_analysis
This repository covers data cleaning, analysis, and visualization of global alfalfa and pellet imports, focusing on trends from 2003 to 2023. It also includes a predictive analysis of global alfalfa demand for 2024-2029, using data science techniques to provide insights for stakeholders in the alfalfa industry.
data-analysis data-cleaning data-visualization matplotlib numpy pandas python sckiit-learn tableau
Last synced: 12 Feb 2026
https://github.com/ankit21111/carpredict
This project predicts car prices using machine learning models, including Simple and Multiple Linear Regression. It covers data acquisition, feature selection, and optimization techniques like Ridge Regression. The best model, Multiple Linear Regression, achieved an R² score of 0.84. Check out the full analysis in the repository!
data-analysis data-visualization matplotlib numpy pandas pyhton scipy seaborn sklearn
Last synced: 16 Apr 2026
https://github.com/martachesnova/big-data
Finding out whether reviews from Amazon's Vine program are trustworthy. Performed ETL process in the Cloud and uploaded a DataFrame to an RDS instance. Used PySpark and Spark SQL to perform a statistical analysis and uncover "hidden" insights.
big-data data-analysis dataset python spark sql
Last synced: 16 Apr 2026
https://github.com/rahulsm20/storedata
A data analysis project aimed at analyzing the sales data of the super store and providing useful insight into customer preferences.
data-analysis matplotlib numpy pandas python streamlit
Last synced: 16 Apr 2026
https://github.com/karlyndiary/data-visualisation-empowering-business-with-effective-insights
This Tata Group Sales Insights Dashboard uses a dataset provided by Forage.
analysis-and-presentation analytics-and-insights dashboard data-analysis data-cleanup data-interpretation data-visualization forage tableau tata-group visualisation
Last synced: 28 Feb 2026
https://github.com/mananabbasi/dashboard-power-bi
This repository showcases **Power BI projects** focused on data visualization and business intelligence. Each project transforms raw data into interactive dashboards and reports, providing actionable insights for decision-making. The repository includes Power BI files, datasets, and documentation for each project.
data-analysis data-science data-visualization powerbi
Last synced: 13 Feb 2026
https://github.com/m-ah07/text-sentiment-analysis-api
A lightweight Python project for analyzing the sentiment of textual data using the TextBlob library. This project provides a simple and effective way to measure the polarity and subjectivity of any given text.
data-analysis machine-learning python python-project sentiment-analysis text-analysis text-mining
Last synced: 14 Feb 2026
https://github.com/kambleakash0/mubi_eda
Mini Project #1 for EAS503 course at SUNY Buffalo
data-analysis data-visualization eda
Last synced: 16 Apr 2026
https://github.com/chanmeng666/mnist-handwritten-digit-recognition-project
【Sprinkle some star dust on this repo! ⭐️ It's good karma!】A comprehensive implementation and analysis of handwritten digit recognition using multiple neural network architectures on the MNIST dataset. Features basic MLP, optimized feature-selected model, and deep CNN approaches with detailed performance comparisons and visualizations.
cnn computer-vision data-analysis data-visualization deep-learning feature-analysis handwritten-digit-recognition keras machine-learning mlp mnist model-optimization neural-networks python scikit-learn tensorflow
Last synced: 02 Apr 2026
https://github.com/guermoud98/data-analysis-with-python-projects
data-analysis matplotlib pandas python seaborn
Last synced: 16 Apr 2026
https://github.com/edumoraes1/republicacao-produtos
SQL Query realizada para criação de automação de disparo de push via salesforce
bq data-analysis salesforce sql
Last synced: 14 Feb 2026
https://github.com/fhdsl/seattlestatsummer_r
A 4-day introduction to R programming, focused on Fred Hutch Research Interns
beginner beginner-friendly course data-analysis data-science introduction-to-programming r-programming tidyverse
Last synced: 19 Mar 2026
https://github.com/misszeferino/nashville-housing-data-cleaning
Data cleaning using SQL
data-analysis data-cleaning sql
Last synced: 19 Mar 2026
https://github.com/projects-developer/full-stack-network-intrusion-detection-system-using-machine-learning
The project aims to design and develop a full-stack network intrusion detection system using machine learning techniques. Project Includes Source Code, PPT, Synopsis, Report, Documents, Base Research Paper & Video tutorials
algorithms computerscienceproject cybersecurity data-analysis full-stack-development intrusion-detection-system machine-learning network-intrusion-detection network-security web-development
Last synced: 14 Feb 2026
https://github.com/hlexnc/project-arepo
Data-driven stroke risk assessment & personalized recommendations, powered by machine-learning and an NLU-driven chatbot.
chatbot data-analysis docker docker-compose machine-learning nlu-chatbot python rasa scikit-learn sklearn streamlit
Last synced: 15 Feb 2026
https://github.com/achique-luisdan/tops-songs-db
Base de datos de Tops Semanales de Canciones🎵 más reproducidas en Spotify🎶. Prácticas de SQL enfocadas en el Análisis de Datos (Data Analysis).
Last synced: 15 Feb 2026
https://github.com/risdorn/restaurant-delivery-platforms-analysis-bdm-project
This project analyzes restaurant delivery platforms to understand customer preferences, industry competition, and expansion opportunities. Conducted as part of the BDM project from IITM, it includes descriptive stats, distribution, correlation, regression, and geospatial analysis using multiple datasets.
data-analysis data-visualization jupyter-notebook kaggle
Last synced: 15 Feb 2026
https://github.com/nmelgar/marathons_data_viz
Data visualization project to analyze finishing times and other data.
csv csv-files data data-analysis data-insight data-visualization data-viz dataset tableau
Last synced: 15 Feb 2026
https://github.com/siddhant2105s/bring-your-own-device-boyd-system
This repository contains the design and implementation of the Bring Your Own Device (BYOD) System for managing personal devices at Life Insurance Company. It includes an ERD diagram, MySQL scripts for database creation, data insertion, and queries, as well as detailed data definitions and system requirements documentation.
data-analysis database-design database-normalization entity-relationship-diagram entity-relationship-models my-sql relational-databases relational-model sql-queries
Last synced: 15 Feb 2026