Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-03 00:07:37 UTC
- JSON Representation
https://github.com/riju18/data-analysis-and-visualizaton
Most complex data analyzing for clustering, preparing, complex calculation, joining, cross-over & more for Data science.
data-analysis data-mining data-science data-visualization powerbi tableau
Last synced: 04 Jan 2026
https://github.com/fatihilhan42/eda-spacex-launches-falcon9-and-falcon-heavy
In this project, we analyze the space flight data of Spacex space research company Falcon 9 rocket.
data-analysis data-science data-visualization eda elonmusk spacex
Last synced: 23 Mar 2025
https://github.com/wittyicon29/kritika-iit-b-2023
Seletcion task for the summer projects of Kritika IIT-B
data data-analysis data-science
Last synced: 15 Mar 2025
https://github.com/manisharora96/instagram-reach-analysis
This project provides a detailed approach to analyzing Instagram reach and engagement metrics. By leveraging the code and tools shared here, you can gain valuable insights into your Instagram content's performance and optimize your strategy to grow your audience effectively
data-analysis data-visualization instagram-reach python-tools
Last synced: 23 Mar 2025
https://github.com/kernelshreyak/kaggle-notebooks
Collection of my Kaggle notebooks for data analysis and machine learning on a variety of datasets
data-analysis data-science data-visualization kaggle kaggle-competition machine-learning
Last synced: 27 Apr 2026
https://github.com/nikhil-donthusaram/heartdiseaseprediction
Heart Disease Prediction App is a machine learning web application that predicts the likelihood of heart disease based on user medical inputs. Built using a Decision Tree Classifier and deployed with Streamlit for an interactive, user-friendly interface.
data-analysis descision-tree joblib jupyter-notebook machine-learning matplotlib numpy pandas python3 seaborn sklearn streamlit vscode
Last synced: 11 Apr 2026
https://github.com/loginchik/mid_contracts
Анализ контрактов государственных закупок МИДа РФ
data-analysis dataset pandas python
Last synced: 17 Apr 2025
https://github.com/felpzreiz/stockdata_pipeline
Este projeto consiste no desenvolvimento de um pipeline de dados que consome informações financeiras de uma API da Bolsa de Valores Americana (StockData.org) para análise e tratamento. Utilizando Python e bibliotecas como pandas, matplotlib e pyarrow
api data-analysis data-science jupyter-notebook pandas python
Last synced: 19 Apr 2026
https://github.com/mohammad-malik/covid-visualizations-d3
This project provides a dashboard with five different perspectives on the pandemic, from patient-infection relationships to regional trends and hierarchical distributions. This was developed as part of a project for the course Data Analysis and Visualization (DS3001).
covid-19 d3 d3-visualization d3js data data-analysis data-analytics data-science visualization
Last synced: 28 May 2026
https://github.com/k178412/sql-data-warehouse-project
A hands-on data warehouse project using SQL Server, covering ETL processes, and data modeling.
bronze-layer data-analysis data-analytics data-cleaning data-engineering data-warehouse database datalake dataset datawarehouse etl etl-pipeline etl-process gold-layer silver-layer sql sql-query sql-server sqlserver
Last synced: 25 Apr 2026
https://github.com/taralas209/moscow-programmer-salaries-analysis-dvmn
A Python script analyzing the average salaries of programmers in Moscow by popular programming languages using data from HeadHunter and SuperJob.
api data-analysis headhunter job-market-analysis python superjob
Last synced: 15 Mar 2025
https://github.com/abdullahashfaqvirk/powerbi-dashboards
A collection of Microsoft Power BI dashboards and reports designed to address business challenges and support data driven decision-making.
dashboards data-analysis data-driven data-science microsoft powerbi reports visualization
Last synced: 10 Mar 2026
https://github.com/vetronics/data_analisys_by_pandas
piccolo script in python per analisi dei dati sugli incidenti del 2019
accident accidents-analysis car data-analysis data-science data-visualization dataset github github-actions istat maplotlib pandas python python3 scripts windows-11
Last synced: 11 Apr 2026
https://github.com/nurulashraf/customer-segmentation-hierarchical-clustering
A customer segmentation project using hierarchical clustering to group customers based on their spending behaviour and demographics. This helps businesses identify patterns and create targeted marketing strategies.
business-analytics clustering-algorithm customer-segmentation data-analysis hierarchical-clustering machine-learning python unsupervised-learning
Last synced: 18 Apr 2025
https://github.com/devexpress-examples/wpf-pivot-grid-define-custom-cell-template-to-performing-data-editing
This example shows how to edit a cell with the cell editing template in Pivot Grid for WPF.
data-analysis dotnet dxpivotgrid pivot-grid pivot-grid-for-wpf wpf
Last synced: 02 May 2026
https://github.com/devexpress-examples/wpf-pivot-grid-connect-to-an-olap-datasource
This example shows how to specify connection settings to the server and create fields that relate to specific measures and dimensions of the cube for the Pivot Grid for WPF.
data-analysis dotnet dxpivotgrid pivot-grid pivot-grid-for-wpf wpf xpf
Last synced: 06 May 2026
https://github.com/shahriarha/sql
Structured query language
data-analysis mysql mysql-database sql
Last synced: 02 Sep 2025
https://github.com/alphatwirl/qtwirl
qtwirl (quick-twirl), one-function interface to AlphaTwirl
alphatwirl data-analysis data-frame pandas r root-cern
Last synced: 11 Apr 2026
https://github.com/reddyprasade/r-program
R is a programming language and free software environment for statistical computing and graphics supported by the R Foundation for Statistical Computing. The R language is widely used among statisticians and data miners for developing statistical software and data analysis.
data-analysis data-science r-programming
Last synced: 11 Apr 2026
https://github.com/syed-amjad-ali/-bank-churn-ml
Predicting bank customer churn using machine learning. This project includes exploratory data analysis (EDA), feature engineering, classification models (Logistic Regression, Random Forest), and customer segmentation using K-Means clustering.
classification data-analysis data-science eda jupyter-notebook k-means-clustering machine-learning ml python segmentation
Last synced: 09 Mar 2025
https://github.com/sasanthns/sql_data_warehouse_project
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
data data-analysis data-science data-warehouse datacleaning etl etlpipeline sql sqlserver
Last synced: 24 Mar 2025
https://github.com/mradovic38/cnn-dominant-color-recognition
Detecting a dominant color of Tulip images using various convolutional neural networks.
artificial-intelligence cnn computer-vision data-analysis deep-learning dominant-color kmeans-clustering machine-learning neural-network pytorch regression resnet resnet-18 squeeze-and-excitation
Last synced: 01 Jul 2026
https://github.com/yixin0829/web-data-scraping-project
Python web scraping script and MySQL database design :bug:
beautifulsoup4 data-analysis data-visualization database gender-api matplotlib python website
Last synced: 19 Apr 2026
https://github.com/thbaylson/datascience
All of my past data science assignments put into one singular notebook. Most of this comes from my Machine Learning course.
data-analysis data-science data-visualization decision-tree jupyter-notebook k-nearest-neighbors linear-regression machine-learning neural-network pandas-library python3 scikit-learn
Last synced: 09 May 2026
https://github.com/prakashjha1/new-analysis-using-llm-locally
An interactive news analysis tool built with Streamlit and local LLMs. This app allows users to analyze and gain insights from the latest news articles using advanced language models, all running locally. Explore trends, sentiment, and key topics with an intuitive interface.
artificial-intelligence data-analysis data-science llms ollama python streamlit
Last synced: 14 Mar 2025
https://github.com/lucaso21/euro-2021-player-stats-analysis
A short project analyzing stats for players at the Euro 2021 tournament.
data-analysis data-science r rvest tidyverse
Last synced: 16 Mar 2025
https://github.com/aliiahmadi/data-manipulation
Algorithms and solutions
algorithm algorithms data-analysis data-science
Last synced: 30 Jun 2025
https://github.com/tolumie/rfm-marketing-analysis
This project focuses on RFM (Recency, Frequency, and Monetary) Analysis, a powerful customer segmentation technique used in marketing and business analytics. The analysis helps businesses identify their most valuable customers, potential loyalists, at-risk customers, and churned users.
business-analytics customer-behavior-analysis customer-loyalty customer-retention customer-segmentation-analysis data-analysis data-driven-decisions ecommerce marketing-analytics python
Last synced: 18 May 2026
https://github.com/b-varun-reddy/fairwai-bias-detection
Submission for the FairwAI Hospitality Intern Challenge. This project analyzes bias signals in Yelp hospitality reviews using open-source data, Python, and fairness-focused keyword detection.
bias-detection data-analysis ethical-ai fairness hospitality machine-learning natural-language-processing python social-impact yelp-dataset
Last synced: 19 Apr 2025
https://github.com/atharvapathak/rsvp_movies_case_study
SQL queries performed on IMDb database to provide recommendations to RSVP Movies based on insights.
data-analysis data-cleaning data-science imdb-dataset rsvp-movies sql
Last synced: 28 Jan 2026
https://github.com/wojtekdomino/titanic-eda
Exploratory Data Analysis (EDA) of Titanic dataset using Pandas, Matplotlib, and Seaborn.
data-analysis eda matplotlib pandas python seaborn
Last synced: 10 Jun 2025
https://github.com/aalekhpatel07/billie-ai-lish-and-oscar-w-ai-lde
Generating music and poetry using RNNs.
data-analysis generating rnn-tensorflow tensorflow-keras
Last synced: 16 Mar 2025
https://github.com/cyberoctane29/epa-carbon-monoxide-aqi-analysis
This project continues my EPA Air Quality AQI Analysis, focusing on carbon monoxide levels in EPA data. Using Python, I applied statistics, probability analysis, outlier detection, sampling, and hypothesis testing to assess pollution and health impacts. Leveraging Pandas, NumPy, SciPy, and Matplotlib, it supports environmental policy decisions.
data-analysis eda hypothesis-testing probability-distribution sampling sampling-distribution statistical-analysis
Last synced: 24 Mar 2025
https://github.com/cescedes/this-is-jeopardy
Writing several functions that investigate a dataset of Jeopardy! questions and answers.
codecademy data-analysis python
Last synced: 11 Apr 2026
https://github.com/curtisalexander/cramisc
Personal R functions for data analysis
Last synced: 12 Mar 2025
https://github.com/lucashomuniz/Project-04
STATISTICAL ANALYSIS FOR DEMAND PLANNING IN POWERBI
bigquery data-analysis data-structures data-visualization database google-cloud-platform powerbi powerbi-visuals sql sql-query
Last synced: 11 Oct 2025
https://github.com/anuragmudgal96/data-warehouse-project
Designing and implementing a modern data warehouse on SQL Server, covering ETL pipelines, dimensional modeling, and analytical reporting.
data-analysis data-engineering data-warehouse datawarehousing etl etl-job etl-pipeline sql sql-server
Last synced: 09 Oct 2025
https://github.com/bocchio01/skyward_recruitment_assignment
Assignment to join the PoliMi SkyWard software team
data-analysis kalman-filter model-rocket
Last synced: 15 Mar 2025
https://github.com/zachbateman/easy_plot
Easy Statistical Visualization in Python
data-analysis data-visualization graphics matplotlib python seaborn
Last synced: 18 Jan 2026
https://github.com/vinitgurjar/r_lang_exp
This is a collection of my collage Data Analytics lab work and assignment, the files here contains program of R language
data-analysis data-visualization r
Last synced: 02 Jul 2025
https://github.com/0-mostafa-rezaee-0/sandwich_structures
Impact test of Sandwich Structures
composite-materials data-analysis r
Last synced: 09 Aug 2025
https://github.com/neerajcodes888/data-science
This repository is a hub for data science enthusiasts, offering a diverse collection of projects, notebooks, and resources covering topics such as data analysis, machine learning, deep learning, and generative AI. Explore innovative ideas, contribute to cutting-edge research, and enhance your skills in the dynamic field of data science
data-analysis data-science data-visualization deep-learning deep-learning-algorithms eda genai jupyter-notebook machine-learning machine-learning-algorithms openai-api pandas plotting python3 sklearn-library streamlit
Last synced: 01 May 2026
https://github.com/chrispsang/customerchurnanalysis
Predicting customer churn using a RandomForestClassifier with detailed EDA, model evaluation, and visualization. Includes a Tableau dashboard for interactive insights.
customerchurn data-analysis data-visualization datapreprocessing machine-learning python scikit-learn tableau
Last synced: 31 Jan 2026
https://github.com/katiesaund/tidy_tuesday
A weekly data project in R from the R4DS online learning community
data-analysis data-visualization datascience plot r rstats tidytuesday
Last synced: 24 Mar 2025
https://github.com/abhiram-kandiyana/us-bikeshare-analysis
Explorative analsis on a bike-share system (Motivate) to understand it's pain points
data-analysis data-visualization
Last synced: 26 Mar 2025
https://github.com/danielrosehill/data-projects-index
Data apps and datasets deployed to Streamlit Community Cloud, Hugging Face, and elsewhere.
data-analysis data-science data-visualization
Last synced: 16 Mar 2026
https://github.com/ginga1402/youtube_analysis
Exploratory Data Analysis on YouTube data
college-project data-analysis pandas-python
Last synced: 30 Mar 2025
https://github.com/balajimohan18/loan-classification-datascience-project
This project uses machine learning algorithms to predict the classification of loan status. The dataset is loaded and some transformation is done using SQL for getting a proper dataset with some valid informations.
classification data-analysis data-cleaning data-science data-visualization loan-prediction loan-status machine-learning sql supervised-learning
Last synced: 03 Sep 2025
https://github.com/johannaschmidle/bookauthors
Explored a book sales database. Cleaned data using Excel and created an interactive dashboard to analyze author popularity, ratings, and sales trends. The project highlighted key insights such as sales performance and rating distributions [Excel]
author-sales book-sales books data-analysis data-visualization excel
Last synced: 04 Feb 2026
https://github.com/misaghmomenib/shop-revenue-analysis
A Data Analysis Project Aimed at Analyzing and Forecasting Shop Revenue Based on Sales and Other Business Metrics. It Helps to Identify Trends, Patterns, and Key Factors Influencing Revenue to Make Data-driven Decisions for Business Growth.
data-analysis data-visualization python
Last synced: 24 Mar 2025
https://github.com/juliuspinsker/bioconductor-learning-container
🧬 Containerized development environment for Harvard's Professional Certificate in Data Analysis for Genomics (PH525.x series). Streamlined setup for Bioconductor, R, and genomic data analysis with RStudio and DevContainer support.
bioconductor bioinformatics chip-seq data-analysis data-science devcontainer dna-methylation docker edx functional-genomics genomics harvard harvardx ph525 ph525x r reproducible-research rna-seq rstudio single-cell-rna-seq
Last synced: 14 May 2026
https://github.com/codesaadumair/exploratory-data-analysis
A centralized repository showcasing various Exploratory Data Analysis (EDA) projects using Jupyter notebooks, visualizations, and accompanying documentation.
data-analysis data-science data-visualization eda jupyter-notebook jupyterlab python
Last synced: 24 Mar 2025
https://github.com/ayushbaid/football_stats
Analysing the competitiveness in different European football leagues
Last synced: 03 Apr 2025
https://github.com/adrianlardies/from-data-to-insight
This project creates and manages a MySQL database to analyze the performance of Bitcoin, Gold, and the S&P 500 in response to economic factors. It integrates historical data, executes advanced SQL queries, and visualizes key insights, showcasing the power of SQL and Python in financial analysis.
data-analysis data-science matplotlib pandas python seaborn sql
Last synced: 12 Apr 2026
https://github.com/fatihilhan42/hollywood-theatrical-market-synopsis-1995-to-2021
In this project, the data of hollywood film production companies from 1995 to 2021 were examined. Significant tables and graphs were created using data visualization algorithms, with the tickets sold divided into categories.
data data-analysis data-science data-visualization
Last synced: 23 Mar 2025
https://github.com/1401dev/iowa-liquor-retail-sales-analysis
This repository contains the analysis of Iowa liquor retail sales data, aimed at uncovering sales trends and forecasting future sales patterns. The project involves data cleaning, preparation, and advanced time series analysis using Microsoft SQL Server and Google Colab.
customer-behavior data-analysis data-cleaning data-science data-visualization exploratory-data-analysis forecasting google-colab machine-learning microsoft-sql-server pandas prophet python retail-analytics retail-sales sales-forecasting sales-performance sql statsmodels time-series-analysis
Last synced: 16 Feb 2026
https://github.com/leosimoes/nexoseducacao-imersao-powerbi
Atividades realizadas na Imersão PowerBI pela Nexos Educação com Karine Lago e Leticia Smirelli em Setembro de 2023.
business-intelligence dashboards data-analysis microsoft-power-bi
Last synced: 06 Jan 2026
https://github.com/mituskillologies/aiml-pcp-jul25
Programs conducted at AI-ML Training Program at Pimpri Chinchwad Polytechnic, Pune in Jul 2025
artificial-intelligence classification clustering data-analysis data-visualization machine-learning matplotlib pandas regression scikit-learn supervised-learning unsupervised-learning
Last synced: 03 May 2026
https://github.com/mumtaz4118/nlp-course
Programming Assignments and Lectures for Stanford's CS 224: Natural Language Processing with Deep Learning
course data data-analysis data-analytics data-science data-visualization deep-learning education machine-learning natural-language-processing neural-network transfer-learning
Last synced: 24 Nov 2025
https://github.com/andersoncrs/prediccion-del-precio-de-vehiculos-un-enfoque-con-regresion-lineal-y-regularizacion
Este proyecto tiene como objetivo predecir el precio de vehículos usados utilizando técnicas de regresión lineal y regularización Lasso. A través del análisis y procesamiento de datos, se construye un modelo predictivo preciso e interpretable basado en las características más relevantes de cada vehículo.
data-analysis data-exploration lasso-regression machine-learning polinomial-regression regularization-methods
Last synced: 03 Jul 2025
https://github.com/krzysikd/apartment-prices-in-poland-analysis-and-visualization
Data Analyst portfolio project that involves cleaning, transforming, and visualizing data to create an insightful dashboard. The project uses SSIS for ETL processes, SSMS for database management and queries, and Power BI for data visualization, focusing on the analysis of rental and sales apartment prices in Poland.
data-analysis data-cleaning data-visualizations powerbi sql sqlserver ssis
Last synced: 04 Feb 2026
https://github.com/2013xile/sheethub
Organize, import, export, concatenate sheet files on web application.
data-analysis data-wrangler excel sheets
Last synced: 08 Apr 2025
https://github.com/jatin-s16/digital-marketing
This repository contains raw data for Marketing analysis along with key business questions. I performed data cleaning using Python and its libraries and extracted meaningful insights. The results were then visualised using Tableau to enhance business understanding.
data-analysis data-science python3 tableau
Last synced: 16 Mar 2025
https://github.com/phomint/udacity_free_datawragling_with_mongodb
Udacity Free course to use MongoDB
data-analysis mongodb udacity-course
Last synced: 11 Jun 2025
https://github.com/leosimoes/datascienceacademy-python
Atividades do curso Fundamentos de Linguagem Python Para Análise de Dados e Data Science (Com ChatGPT) da DataScienceAcademy.
chatgpt data-analysis data-science python
Last synced: 02 May 2026
https://github.com/kathkoeh/pimaindian-kk
Logistic regression analysis of diabetes risk using the Pima Indians dataset. Includes prevalence analysis, modeling, ROC/AUC evaluation, and patient testing in Python.
data-analysis diabetes epidemiology logistic-regression machine-learning public-health python
Last synced: 28 Apr 2026
https://github.com/ot-code/sql-sabor-y-tradicion
A SQL-driven project that integrates menu and order data to reveal insights on dish performance, customer preferences, and spending trends. It informs pricing strategies, menu adjustments, and targeted promotions, ultimately enhancing the overall customer experience and driving business growth.
analytical-queries data data-aggregation data-analysis database-design join-queries mysql order-analytics relational-databases restaurant-data sql sql-script
Last synced: 08 Apr 2025
https://github.com/hilalguleryuz/excel_supermarket_data_analysis_project
Supermarket Data Analysis with Excel
dashboard data-analysis data-visualization excel microsoft-excel supermarket supermarket-data-analysis supermarket-dataset
Last synced: 06 Jan 2026
https://github.com/hadeel-13/new_home
New Home is a Website for Buying and Selling Real Estate with user preferences, it is my Graduation project with a grade of 93%.
bootstrap5 chartjs css css3 data-analysis data-mining google-maps html html5 javascript jquery
Last synced: 12 Apr 2026
https://github.com/agricolamz/2018_fe_r_statistics
Further Education R course
data-analysis r rstats static teaching teaching-materials
Last synced: 24 Mar 2025
https://github.com/wardenkenny/data-analyst-portfolio
A repository I have created to show and explore data analytics.
data-analysis excel r spreadsheets sql tableau
Last synced: 02 Apr 2025
https://github.com/eubrunoo/beer-consumption-predictor
An R project analyzing the impact of environmental factors on beer consumption in São Paulo, with a predictive linear regression model.
data-analysis data-science data-visualization machine-learning r statistical-analysis statistics
Last synced: 02 Apr 2025
https://github.com/hari7261/data-visualization
Python-based application built using CustomTkinter for the graphical user interface (GUI) and Matplotlib for data visualization. It allows users to import datasets, perform real-time data visualization, and analyze data using various chart types and machine learning techniques.
data-analysis data-visualization export hari7261 import python realtime-visualization
Last synced: 17 Jun 2025
https://github.com/aniruddha-biswas/jpmorgan-chase-excel-internship
JPMorgan Chase & Co.'s Excel Skills on Forage Virtual Internship
conditional-formatting data-analysis data-cleaning data-visualization excel excel-dashboard macos pivot-tables power-query shortcuts storytelling vba-excel
Last synced: 01 Apr 2025
https://github.com/m4tice/qm_project
Bicycle project crowd evaluation.
data-analysis data-engineering data-visualization
Last synced: 16 Mar 2025
https://github.com/parthds02/pizza_sales_sql
SQL project analyzing pizza sales data. Includes creating tables, executing queries, and solving basic to advanced analytical questions to derive insights from sales data.
analytics data-analysis data-science pizza-sales sql sql-query
Last synced: 04 Mar 2026
https://github.com/zenithclown/finfolio
A Personal Finance Management Tool for the Developers, by the Developer
data-analysis data-science finance finance-application finance-management good-habits personal-finance portfolio
Last synced: 04 Feb 2026
https://github.com/ashwin331133/hospital_allpatients_waitinglist_data
This Power BI project analyzes patient waiting lists across various medical specialties and case types (Day Case, Inpatient, Outpatient). The goal is to gain insights to improve healthcare management and resource allocation.
data-analysis data-visualization powerbi
Last synced: 03 Jan 2026
https://github.com/arthurosipyan/jamascript
JamaScript, your personal data assistant
blueprint data-analysis data-mining data-science exceldatareader jama jama-api python python-3 python-script python3
Last synced: 03 Sep 2025
https://github.com/eudesgccunha/automated-management-panel
Automated management panel using Power BI
data data-analysis data-visualization database excel powerbi
Last synced: 04 Feb 2026
https://github.com/beastienerd/dataquest_guided_projects
Portfolio for Guided Projects
data-analysis data-visualization guided-project r
Last synced: 25 Jun 2025
https://github.com/torchstack-ai/cancer-biomarker-discovery
scRNASeq drug discovery and biomarker project
bioinformatics cancer-research data-analysis data-visualization r scrna-seq-analysis startup
Last synced: 01 Apr 2025
https://github.com/saiteja-talluri/data-analytics-assignement
Report on World Happiness Data (Data Analysis and Visualisation of the data)
data-analysis data-visualization ipynb-jupyter-notebook
Last synced: 20 Jan 2026
https://github.com/danielafishwickinacap/coderhouse_da
Data analyst Final Project files
Last synced: 18 Jan 2026
https://github.com/ajay1214/credit-card-transaction-dashboard
Credit Card weekly dashboard that provides real-time insights into key performance metrics and trends
Last synced: 04 Feb 2026
https://github.com/aran203/fluxease
Python package for eddy flux data post processing
data-analysis data-science eddy-covariance python
Last synced: 03 Apr 2025
https://github.com/farzeen-2001/financial-analysis-report-using-powerbi
comprehensive analysis of financial report
data-analysis data-visualization datacleaning dax powerbi
Last synced: 17 Feb 2026
https://github.com/pedramjlo/car_sales_analysis
Car sales analysis
data-analysis jupyter-notebook pandas python
Last synced: 01 Apr 2025
https://github.com/bryanfks-dev/klempoken-analysis
Analysis and forcasting model for Klempoken MSMEs
big-data-analytics data-analysis data-forecast data-visualization
Last synced: 01 Apr 2025
https://github.com/ashwin331133/gorkha_earthquake_damage_prediction
The main objective is to predict the level of damage to buildings caused by the 2015 Gorkha earthquake in Nepal.
data-analysis data-visualization machine-learning python
Last synced: 29 Apr 2026
https://github.com/parth-jatav/super-store-analysis-project
The Super Store Analysis project leverages Python libraries such as pandas, matplotlib, and numpy to perform a comprehensive analysis of a retail store's data. This project includes data cleaning, visualization, and statistical analysis to identify key trends, optimize inventory, enhance decision-making processes for improved business performance.
data-analysis matplotlib numpy pandas python super-store
Last synced: 12 Apr 2026
https://github.com/pranav016/exploratory-data-analysis-of-sp500-dataset
This a data-analysis that I performed on the S&P 500 dataset and answered a few questions through data visualization techniques.
Last synced: 03 Jul 2026
https://github.com/dhanyasri20/credit-risk-prediction
Credit Risk Prediction using Python, SQL, and Flask. Trained ML models (Random Forest) to identify high-risk loan applicants with 86% accuracy, automated SQL reporting, and deployed a Flask web app for real-time predictions.
classification credit-risk data-analysis financial-data flask loan-prediction machine-learning python random-forest sql
Last synced: 28 Apr 2026
https://github.com/satvikpraveen/rsvp_case_study
A comprehensive IMDB dataset analysis using SQL. Includes database setup, advanced queries, and actionable insights. Organized with files for database creation, queries, and solutions. Features an Entity-Relationship Diagram (ERD), executive summary, and SQL scripts. Perfect for SQL workflows and business intelligence in the film industry.
aggregate-functions business-intelligence common-table-expressions data-analysis data-driven-decisions data-querying database-design entity-relationship-diagram imdb-dataset relational-database sql subqueries-and-joins
Last synced: 11 Jan 2026
https://github.com/filip-kustura/python-covid-19-behaviors-analysis
Using Jupyter Notebook, this university project analyzes attitudes and behaviors related to the COVID-19 pandemic using a two-year survey from Imperial College London and YouGov research company. Utilizing Pandas, NumPy and Matplotlib, the data analysis focuses on three countries, exploring trends and insights throughout the pandemic.
covid-19 data-analysis data-visualization jupyter-notebook matplotlib numpy pandas python university-project
Last synced: 12 Apr 2026
https://github.com/robson-python/academic-performance
Project to evaluate students' academic performance.
csv-import data data-analysis data-science jupyter-notebook machine-learning matplotlib pandas python scikit-learn seaborn vscode
Last synced: 12 Apr 2026
https://github.com/hemangsharma/breast-cancer-patient-dashboard
This interactive Streamlit dashboard visualizes insights from the SEER Breast Cancer Dataset (2006-2010)
data-analysis streamlit streamlit-dashboard streamlit-webapp
Last synced: 05 May 2026
https://github.com/tanaybhadula/twitter-trends-dashboard
An interactive dashboard to visualizes data on current Twitter trends by country and globally. Collects data of over 60 countries using the python Tweepy library, processed it,and visualized it in the form of bar chart and pie chart using the Plotly Dash framework.
dash dashboard data-analysis data-visualization plotly python trends twitter
Last synced: 31 May 2026
https://github.com/theveryhim/frequent-item-sets-and-lsh
A practice on finding frequent item sets and similar items in pysaprk framework
big-data data-analysis frequent-itemset-mining locality-sensitive-hashing pyspark text-processing
Last synced: 03 Jul 2025