Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-03 00:07:37 UTC
https://github.com/jnyambok/epl_dashboard
English Premier League Dashboard summarizing match data from 2009-2024
data-analysis data-science gcp powerbi
Last synced: 04 Sep 2025
https://github.com/nxion/sql-data-warehouse-project
Building a modern data warehouse with MS SQL Server, ETL processes, data modeling, and analytics.
data data-analysis data-analytics data-engineering data-lakehouse data-warehouse datalake datascience etl etl-job medallion-architecture ms mssql sql sql-query sql-server
Last synced: 05 Jun 2026
https://github.com/edprice25/us-states-analysis
Presents a series of visualizations for folks looking to relocate to more affordable areas in the US. Click on my link below to see a full analysis.
data-analysis jupyter-notebook matplotlib pandas python us-states
Last synced: 04 Jul 2025
https://github.com/meerantajalli/networksecuritydefense
This network security defense system acts as an indicator against SMP floods, UDP floods, and ICMP floods. The model is trained on packets captured with Wireshark and can differentiate between normal network traffic and traffic targeted at the machine by an attacker, using the packet transfer rate and the source IP.
anomaly-detection classification cyber-security data-analysis ddos-detection icmp-flood intrusion-detection machine-learning network-security packet-analysis python random-forest security smp-flood udp-flood wireshark
Last synced: 21 Apr 2026
https://github.com/jackieocham/rest-metrics-data-analysis
Data analysis on sleep and health tracking data collected over many years
data-analysis data-cleaning data-manipulation data-preparation data-project exploratory-data-analysis initial-data-analysis mysql mysql-database sql
Last synced: 01 Apr 2025
https://github.com/tmmvn/analytics-notebooks
A bunch of data analytics notebooks made while testing out JetBrains Datalore
ai algorithms data-analysis datalore elements-of-ai helsinki-university-mooc python
Last synced: 22 Apr 2026
https://github.com/thinogueiras/jornada-python
Jornada Python (Python Journey) - Hashtag Programação.
data-analysis data-science inteligencia-artificial python rpa
Last synced: 22 Apr 2026
https://github.com/kgotsosm/epl-analysis
Preparing data for machine learning algorithms to predict English Premier League match winners.
data-analysis data-cleaning data-modeling
Last synced: 22 Apr 2026
https://github.com/devexpress-examples/web-forms-pivot-grid-export-additional-captions-header-or-footer
This example illustrates how to add a custom header to the document exported to PDF in Pivot Grid for Web Forms.
asp-net-web-forms data-analysis dotnet pivot-grid pivot-grid-for-web-forms
Last synced: 22 Apr 2026
https://github.com/vasulab/knightshock
Shock tube experiment planning and data analysis package.
cantera data-analysis matplotlib numpy shock-tube
Last synced: 18 Jul 2025
https://github.com/al-ogr/sf_pr1_job_analysis_hh
SkillFactory DataScience PROJECT-1. Analysis of résumés from HeadHunter
data-analysis data-science ipynb plotly python
Last synced: 23 Apr 2026
https://github.com/ihnokim/datk
Data Analysis Toolkit (DATK)
data-analysis data-engineering data-science deep-learning image-processing pandas signal-processing
Last synced: 24 Apr 2026
https://github.com/yashsingh43/cdc-sleep-duration-health-analysis
Analysis of CDC BRFSS 2022 data exploring how sleep duration relates to mental and physical health outcomes.
beautifulsoup brfss cdc data-analysis data-visualization matplotlib pandas plotly public-health python
Last synced: 11 Jun 2026
https://github.com/datalopes1/bank_marketing
This project is based on the Bank Marketing dataset found in the UC Irvine Machine Learning Repository and made available by S. Moro, R. Laureano, and P. Cortez
data-analysis data-science data-visualization eda python
Last synced: 24 Apr 2026
https://github.com/voidnire/redditviralmysteryposts
Analysis of posts from mystery subreddits. What defines a viral post in this type of sub?
data-analysis data-visualization mysteries mystery nlms python-3 reddit
Last synced: 24 Apr 2026
https://github.com/lightning-chart/lcjs-example-0507-dashboardfiberanalysis
A demo application showcasing using LightningChart JS to visualize fiber analysis data.
area-plot area-series chart charts dashboard data-analysis demo heatmap javascript lcjs lightningchart-js performance visualization webgl
Last synced: 24 Apr 2026
https://github.com/prakhargpt/sql-data-warehouse-project
Building a data warehouse project using SQL Server, including ETL processes, data modelling, and analytics.
analytics data data-analysis data-cleaning data-engineering data-engineering-pipeline data-lakehouse data-science data-warehouse etl etl-job etl-pipeline medallion-architecture sql sql-server
Last synced: 12 Jun 2026
https://github.com/edwinrlambert/emomap-sentiment-analysis
Analyzes public sentiment related to specific locations in a city (e.g., parks, transit stations, restaurants, neighborhoods) using geo-tagged social media posts, reviews, and comments. The goal is to visualize how people feel across different areas and times.
data-analysis jupyter-notebook python sentiment-analysis
Last synced: 24 Apr 2026
https://github.com/rachel-xmr/data-analysis-of-hand-drawn-symbols
Data analysis of hand-drawn symbols
cross-validation data-analysis decision-trees k-means-clustering machine-learning model-evaluation r
Last synced: 12 Jun 2026
https://github.com/pedrohdosanjos/economic-data-analysis
This project aims to analyze the export data from various states in the United States to Brazil over time. The data is sourced from the FRED (Federal Reserve Economic Data) API and processed to identify the top 5 exporting states for each year, as well as the states with the highest total export value across all years.
api data-analysis data-visualization jupyter-notebook python
Last synced: 24 Apr 2026
https://github.com/mariann95/sql_data_warehouse_and_analytics_project
Building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics. This repository also contains a collection of SQL scripts demonstrating various analytical techniques, such as change-over-time, cumulative, performance, data segmentation, and part-to-whole analysis.
data-analysis data-analytics data-cleaning data-engineering data-lakehouse data-science data-science-portfolio data-warehouse data-warehousing datalake datawarehouse datawarehousing etl etl-job etl-pipeline medallion-architecture sql sql-query sql-server sqlserver
Last synced: 06 Jun 2026
https://github.com/fbarffmann/belly-button-challenge
Built an interactive JavaScript dashboard to visualize bacterial biodiversity from belly button samples. Analyzed data from 153 participants and identified OTU 1167 as the most common bacteria.
biodiversity dashboard data-analysis data-visualization interactive-charts javascript json plotly
Last synced: 25 Apr 2026
https://github.com/sambit-mondal/stockx
StockX is a full-stack application designed to help store owners efficiently manage their inventory, track purchases, and analyze stock levels. The system integrates MongoDB, Express, React, and Flask (Python) to provide a seamless experience.
artificial-intelligence data-analysis inventory-management-system machine-learning mern-stack
Last synced: 12 Jun 2026
https://github.com/abdelrahmanbayoumi/titanic-machine-learning-from-disasters
Given a training set of samples listing passengers who did or did not survive the Titanic disaster, can our model determine, from a test dataset that does not contain the survival information, whether the passengers in that test dataset survived?
data-analysis data-science data-visualization machine-learning pandas
Last synced: 09 Apr 2025
https://github.com/ddihora1604/iit_patna
A multifaceted project involving applying ML models like Ridge Classifier, RNN, RIDOR, Rotation Forest and RUSBoost, integrating SMOTE for class balancing, and handling diverse datasets including those for seating arrangement tasks.
data-analysis data-visualization datamodelling machine-learning-algorithms python
Last synced: 25 Apr 2026
https://github.com/aastopher/mma_outcome
Simple exploratory analysis of UFC Fights and Vegas fight odds from 1993 to 2021
data-analysis data-visualization
Last synced: 06 Jun 2026
https://github.com/mysto-007/world_population_growth_analysis
World Population Growth Analysis
data-analysis data-science data-visualization kaggle matplotlib
Last synced: 25 Apr 2026
https://github.com/devexpress-examples/wpf-pivotgrid-customize-the-cell-template
This example demonstrates how to customize the cell appearance in Pivot Grid for WPF.
data-analysis dotnet dxpivotgrid pivot-grid pivot-grid-for-wpf wpf
Last synced: 26 Apr 2026
https://github.com/madrury/commute-times
Simulated Commute Times Data
data-analysis data-science data-visualization dataset
Last synced: 12 Jun 2026
https://github.com/sarveshdhond/top_25_cad_stocks
In this project I used Python, JupyterLab, and Pandas to import a data set from the Yahoo stocks website. I imported the top 25 most active Canadian stocks on 12th July 2024. This project shows skills such as Python, web scraping, and Pandas.
data-analysis pandas-dataframe python webscraping
Last synced: 01 Apr 2025
https://github.com/prady2309/unemployement-analysis
Data Science Project
colab-notebook data-analysis data-science data-visualization eda jupyter-notebook machine-learning python3
Last synced: 26 Apr 2026
https://github.com/pararang/nams-thesis-fuzzy
A specialized data processing tool designed to help with Fuzzy Delphi Method calculations for thesis research data analysis. It was later extended with new features for data processing with different methods.
data-analysis dematel hacktoberfest hacktoberfest-accepted house-of-quality python sustainability vibecoding
Last synced: 27 Apr 2026
https://github.com/arush-codes/paris-olympic-de
Data engineering project on the Paris Olympics 2024
azure data-analysis data-engineering microsoft-azure olympics2024 pipeline
Last synced: 27 Apr 2026
https://github.com/saksham-jain177/cryptodataanalysis
A Python powered project that fetches live cryptocurrency data from the CoinMarketCap API, analyzes it, and updates a live Excel sheet every 5 minutes.
api-integration coinmarketcap cryptocurrency data-analysis excel live-data python
Last synced: 12 Jun 2026
https://github.com/garcane/exodus_analysis
This project analyses cryptocurrency transaction data exported from the Exodus wallet. The goal is to explore and visualize the inflows and outflows of assets, the types of transactions, and other key metrics over time.
bitcoin btc crypto cryptocurrencies cryptocurrency data-analysis data-visualization eth ethereum pandas seaborn
Last synced: 27 Apr 2026
https://github.com/jofaval/supertsore-timeseries
Timeseries Data Analysis and Forecast of the sales from a superstore in 2015-2018
data-analysis data-science deep-learning deep-neural-networks forecasting google-colab lstm openml python tensroflow time-series time-series-analysis time-series-forecasting
Last synced: 27 Apr 2026
https://github.com/airdac/ml-palmerpenguins
Classification and analysis of the palmerpenguins dataset in Python. Team project from UPC's Master's Degree in Data Science
classification data-analysis data-science machine-learning palmer-penguin python upc
Last synced: 07 Jun 2026
https://github.com/lu-m-dev/biostatistics-eda
Exploratory data analysis and visualization system for biostatistical research
biostatistics data-analysis data-visualization eda
Last synced: 25 Jun 2026
https://github.com/bheemisme/employee-attrition-analysis
A Dashboard on employee-attrition-analysis
dashboard data-analysis data-science plotly plotly-dash python
Last synced: 28 Apr 2026
https://github.com/sujata-adhikari/data-analysis
Data analysis of market sales data using Power BI, with a dashboard created to present the analysis.
data-analysis excel pandas powerbi
Last synced: 12 Jun 2026
https://github.com/danielrosehill/data-projects-index
Data apps and datasets deployed to Streamlit Community Cloud, Hugging Face, and elsewhere.
data-analysis data-science data-visualization
Last synced: 16 Mar 2026
https://github.com/hadson0/chess-live-ratings-data
A study project focused on web scraping the live chess ratings from chess.com, with data analysis and visualization on nearly 5000 players in the classical world ranking.
beautifulsoup chess data-analysis data-visualization numpy pandas python seaborn web-scraping
Last synced: 28 Apr 2026
https://github.com/emmanuelletocs/steam-game-recommender
A powerful recommendation system for Steam games, combining Content-Based and Collaborative Filtering techniques. Built with Python, Scikit-learn, and Streamlit to deliver accurate, real-time game recommendations. Perfect for gamers and data scientists interested in building intelligent recommendation engines.
als-algorithm data-analysis gaming-industry knn machine-learning mds mysql ncf neural-network pyspark recommendation-engine recommendation-system scikit-learn spark
Last synced: 28 Apr 2026
https://github.com/jovicdev97/financial-loan-datascience-notebook
Using NumPy and pandas to analyze a synthetic loan dataset with Python
data-analysis matlabplot numpy pandas plotting python seaborn
Last synced: 28 Apr 2026
https://github.com/ovuiproduction/tabletalk
TableTalk - Your Data, Your Language. Query tabular data using natural language, no SQL required! Upload your data, ask questions, and get instant insights. 🔹 Convert Natural Language to SQL 🔹 Handle Complex Queries & Aggregations 🔹 Upload CSVs for Easy Analysis 🔹 React + Flask + SQLite3 Backend 🔹 Powered by LLMs for Accuracy
ai data-analysis flask llm machine-learning natural-language-processing prompt-engineering react sql sqlite
Last synced: 28 Apr 2026
https://github.com/sufyan14/weather-data-analysis
A Streamlit dashboard that forecasts 30-day weather trends using uploaded CSV data and Facebook Prophet.
data-analysis python streamlit
Last synced: 28 Apr 2026
https://github.com/shreeparab1890/indian-elections-2019-analysis-eda
This IPython notebook contains an exploratory data analysis (EDA) of the Indian Lok Sabha Elections 2019.
data data-analysis data-science data-visualization eda exploratory-data-analysis matplotlib numpy pandas plotly python python3 visualization
Last synced: 28 Apr 2026
https://github.com/hayatiyrtgl/arima_linearregression_xgboost_time_series_analysis
This Python script conducts various data processing, visualization, and modeling tasks on a dataset.
arima arima-forecasting arima-model data-analysis data-visualization linear-models linear-regression machine-learning machine-learning-algorithms pandas python xgboost xgboost-regression
Last synced: 28 Apr 2026
https://github.com/edanur-y/agricultural-yield-prediction-with-multiple-linear-regression
Performing multiple linear regression analysis on agricultural data to predict the yield.
data-analysis missing-data-imputation multiple-linear-regression outlier-analysis r
Last synced: 13 Jun 2026
https://github.com/theveryhim/dimensionality-reduction-and-clustering
Simple ML-like data analysis and processing.
autoencoder clustering data-analysis dimensionality-reduction pca
Last synced: 10 Sep 2025
https://github.com/hemangsharma/breast-cancer-patient-dashboard
This interactive Streamlit dashboard visualizes insights from the SEER Breast Cancer Dataset (2006-2010)
data-analysis streamlit streamlit-dashboard streamlit-webapp
Last synced: 05 May 2026
https://github.com/wei-rongrong2/openfoodfactclustering
A project that explores clustering food products based on nutritional attributes using K-Means, Fuzzy C-Means, and DBSCAN algorithms, with a Streamlit dashboard for visualizing results.
clustering dashboard data-analysis dbscan food-products fuzzy-cmeans k-means machine-learning nutrition nutrition-clustering open-food-facts streamlit
Last synced: 28 Apr 2026
https://github.com/josedanielchg/efficient-data-storage-for-predictive-modeling
DataCamp project from the Associate Data Scientist track, focusing on optimizing dataset storage by transforming data types and filtering. Prepares data for efficient machine learning workflows.
cleaning-dataset data-analysis jupyter-notebook python
Last synced: 28 Apr 2026
https://github.com/ricram2/column-name-extractor
Jupyter Notebook. Takes a folder with one or more CSV files and returns a single CSV with a compendium of column names and 3 example values (first, random, random).
Last synced: 29 Apr 2026
https://github.com/luizassimoes/q5ga-latency-and-throughput
Quick 5G Analyser: PyQt5 software developed to help with simple graphical analysis and chart generation for ping and iperf3 tests.
data-analysis data-visualization pyqt5 python
Last synced: 13 Jun 2026
https://github.com/thanaraklee/pyspark-dataframe-operations
This project focuses on utilizing PySpark DataFrames to analyze and visualize data sourced from external datasets, such as CSV files. It provides a practical example of how to manipulate, transform, and gain insights from large datasets using the PySpark framework.
data-analysis dataframe pyspark python
Last synced: 29 Apr 2026
https://github.com/devexpress-examples/winforms-visualize-pivot-grid-data-in-chart
The following example shows how to integrate the Pivot Grid with the Chart control.
charting data-analysis dotnet pivot-grid-for-winforms winforms
Last synced: 29 Apr 2026
https://github.com/nivasharmaa/spiderverse
A comprehensive Java program for analyzing and managing events and data points within a fictional spiderverse. Features event handling, anomaly detection, cluster management, and robust file I/O operations.
advanced-algorithms anomaly-detection clustering data-analysis file-io object-oriented-programming
Last synced: 29 Apr 2026
https://github.com/jakebrehm/ezpz-plotting
📈 Easily visualize and manipulate plots from multiple data files.
data-analysis data-visualization engineering matplotlib matplotlib-pyplot pandas plotting python python-3 software software-engineering tkinter tkinter-gui
Last synced: 29 Apr 2026
https://github.com/imosudi/unsupervised-ml-kmeans-analysis
K-Means clustering analysis using synthetic datasets generated with scikit-learn, including meshgrid visualisation, silhouette score evaluation, and investigation of cluster count and random seed effects.
clustering data-analysis jupyter-notebook kmeans kmeans-clustering machine-learning matplotlib python3 scikit-learn silhouette-score unsupervised-learning
Last synced: 25 Jun 2026
https://github.com/saroshfarhan/kaggle-playground-s4e11
An old Kaggle competition, just for practice
data-analysis data-science data-visualization jupiter-notebook python3
Last synced: 29 Apr 2026
https://github.com/parsabordbar/ctx3docs
The documentation for the Context Tree project.
ai-tools context ctx3 ctx3-docs data-analysis documentation tree workflow
Last synced: 25 Jun 2026
https://github.com/lankesathwik7/sql-query-assistant
Natural language to SQL query converter using Groq LLM. Ask questions in plain English and get SQL queries, visualized results, and natural language explanations. Built with Streamlit and PostgreSQL.
data-analysis database groq llm natural-language-processing python sql
Last synced: 29 Apr 2026
https://github.com/taljindergill78/yelp-arizona-analysis
This project analyzes the Yelp dataset for the state of Arizona to extract insights about restaurant businesses and user behavior. Using Apache Spark and PySpark for distributed data processing, the project demonstrates how big data tools can be used to uncover patterns in customer reviews, business performance, and user engagement.
big-data data-analysis data-engineering distributed-computing pyspark spark sql yelp-dataset
Last synced: 29 Apr 2026
https://github.com/ernanej/data-science-dca0131
Files developed throughout the 2024.1 semester of the Data Science course taught at the Federal University of Rio Grande do Norte by the Department of Computer Engineering and Automation (DCA). 📚
big-data data-analysis data-science ia
Last synced: 30 Mar 2025
https://github.com/teja-1403/forage-standard-bank-data-science
This repository contains solutions to the 4 different tasks that must be performed during the Data Science virtual internship provided by Standard Bank via Forage.
automl communication-skills data-analysis data-science machine-learning python sql
Last synced: 29 Apr 2026
https://github.com/noodleslove/house-of-representatives-analysis-ii
In this project, we want to estimate if a transaction will have capital gains exceeding $200 using the provided dataset.
coursework data-analysis data-science eda feature-engineering pandas python3
Last synced: 12 Apr 2026
https://github.com/valikmorinko/ecommerce-sales-analysis
E-commerce sales analysis: data, visualizations, analytical findings.
data-analysis e-commerce jupyter matplotlib pandas python seaborn
Last synced: 29 Apr 2026
https://github.com/alunera-data/sql-use-cases
Practical SQL use cases for Business Intelligence and IT Service Management (BI & ITSM)
business-intelligence dashboards data-analysis data-quality eda itsm kpis postgresql process-monitoring query reporting sql sqlserver
Last synced: 29 Apr 2026
https://github.com/istinnew/eniac_ab_insight
Dive into a comprehensive analysis aimed at boosting iPhone 13 sales by optimizing the Click-Through Rate (CTR) of the “SHOP NOW” button. Compare different button designs and determine the most effective strategy for increasing engagement.
ab-testing data data-analysis data-engineering data-science data-visualization google googlecolab libraries python testing testing-tools visual-studio-code
Last synced: 29 Apr 2026
https://github.com/shimaa83/eda-repo
Exploratory data analysis of the police and retail datasets on Kaggle
Last synced: 29 Apr 2026
https://github.com/dindagustiayu/data-processing
A digital textbook on interpreting characterisation results.
characterisation data-analysis gitbook latex-package myst qualitative-analysis quantitative-analysis
Last synced: 08 Jun 2026
https://github.com/abhipatel35/diabetes_ml_classification
Predict diabetes using machine learning models. Experiment with logistic regression, decision trees, and random forests to achieve accurate predictions based on health indicators. Complete lifecycle of ML project included.
classification data-analysis data-science data-visualization descision-tree diabetes-prediction jupiter-notebook logistic-regression machine-learning model-evaluation open-source pandas pycharm-ide python random-forest scikit-learn
Last synced: 20 Jan 2026
https://github.com/katiesaund/tidy_tuesday
A weekly data project in R from the R4DS online learning community
data-analysis data-visualization datascience plot r rstats tidytuesday
Last synced: 24 Mar 2025
https://github.com/akorablov/project_data_jobs
Looking into the data job market across Czechia, Slovakia, and the US.
data-analysis data-science git job-market jupyter learn-to-code matplotlib pandas python seaborn skills visual-studio-code visualisation
Last synced: 29 Apr 2026
https://github.com/muhammadusman-khan/e-commerce-store-eda
Exploratory Data Analysis on E-commerce store data to uncover insights about sales trends, customer behavior, and product performance using Python libraries like Pandas, NumPy, and Matplotlib/Seaborn.
data-analysis data-science data-visualization e-commerce eda exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 29 Apr 2026
https://github.com/filip-kustura/python-covid-19-behaviors-analysis
Using Jupyter Notebook, this university project analyzes attitudes and behaviors related to the COVID-19 pandemic using a two-year survey from Imperial College London and YouGov research company. Utilizing Pandas, NumPy and Matplotlib, the data analysis focuses on three countries, exploring trends and insights throughout the pandemic.
covid-19 data-analysis data-visualization jupyter-notebook matplotlib numpy pandas python university-project
Last synced: 12 Apr 2026
https://github.com/okdp/platform-packages
Collection of OKDP platform packages
clickops data-analysis data-platform data-science fluxcd gitops helm kubernetes kubocd portal spark
Last synced: 26 Jun 2026
https://github.com/shruti-h/sales_data_analysis
Sales Data Analysis | Pandas & Matplotlib
data-analysis data-science data-vi matplotlib pandas-library python
Last synced: 30 Apr 2026
https://github.com/prady2309/email-spam-detection-with-machine-learning
Implemented using Naive Bayes Algorithm
data-analysis data-science machine-learning python
Last synced: 30 Apr 2026
https://github.com/avazasgarov/soccer-hypothesis-testing
Statistical analysis comparing goal-scoring patterns in Men’s vs. Women’s FIFA World Cups using hypothesis testing.
data-analysis eda hypothesis-testing matplotlib-pyplot pandas pingouin python scipy
Last synced: 30 Apr 2026
https://github.com/mxagar/eda_fe_summary
An 80/20 guide for Data Processing: Data Cleaning, Exploratory Data Analysis, Feature Engineering, Feature Selection.
data-analysis data-cleaning data-modeling data-science data-visualization eda exploratory-data-analysis feature-engineering feature-selection machine-learning pandas
Last synced: 30 Apr 2026
https://github.com/divyanshugit/indian-judiciary-analysis
Analysis of Indian district court data across states.
Last synced: 02 Jul 2025
https://github.com/celineboutinon/laplace-immo
ENSAE-ENSAI Formation Continue (Cepe)/OpenClassrooms Data Analyst 2022-2023 - Project 3
data-analysis data-analytics data-structures database-design database-schema databases mysql-connector-python mysql-workbench python sql
Last synced: 30 Apr 2026
https://github.com/yankh764/revenue-data-analysis
A take-home assignment on improving a revenue data pipeline
data-analysis docker python sql take-home-assignment
Last synced: 30 Apr 2026
https://github.com/anilyigitsel/tourist-attraction-data-analysis
This project analyzes tourism trends from 2017 to 2021, focusing on visitor numbers, ratings, and attraction popularity during these years.
data-analysis data-visualization excel sql tourism
Last synced: 26 Jan 2026
https://github.com/aniketmondal/dataanalysis
Contains cleaning, transformation, and exploratory analysis of various data sets using Python Pandas, NumPy, re, random, etc.
analysis data-analysis data-science pandas python
Last synced: 30 Apr 2026
https://github.com/fbarffmann/credit-risk-classification
Classified 19,000+ loans as high-risk or healthy using logistic regression. Achieved 100% precision for healthy loans and 84% precision for high-risk loans.
classification credit-risk data-analysis logistic-regression machine-learning model-evaluation pandas python scikit-learn
Last synced: 30 Apr 2026
https://github.com/devexpress-examples/wpf-pivot-grid-provide-custom-summary-values
This example demonstrates how to determine the value type when you calculate custom summary values in Pivot Grid for WPF.
data-analysis dotnet dxpivotgrid pivot-grid pivot-grid-for-wpf wpf
Last synced: 01 May 2026
https://github.com/dhanyasri20/credit-risk-prediction
Credit Risk Prediction using Python, SQL, and Flask. Trained ML models (Random Forest) to identify high-risk loan applicants with 86% accuracy, automated SQL reporting, and deployed a Flask web app for real-time predictions.
classification credit-risk data-analysis financial-data flask loan-prediction machine-learning python random-forest sql
Last synced: 28 Apr 2026
https://github.com/cdeweyx/bryce-harper-2016-analysis
Notebook analyzing Bryce Harper's disappointing 2016 campaign in historical context through data analytics.
data-analysis data-visualization python
Last synced: 01 May 2026
https://github.com/ariyaarka/result-analysis
A simple analysis of results based on different factors, shown in figures
data-analysis jupyter-notebook matplotlib numpy-library pandas-dataframe python seaborn
Last synced: 01 May 2026
https://github.com/filip-kustura/data-warehouse-olympics
This project, part of the elective Advanced Database Systems course, involved building a data warehouse based on an existing PostgreSQL database. It focuses on analyzing Olympic Games data across time, covering athletes' performance by discipline, location, and other dimensions. Implemented in Spring 2022.
data-analysis data-warehouse database extract-transform-load olympic-games postgresql sql star-schema university-project
Last synced: 01 May 2026
https://github.com/sairupeshl/leo-orbital-congestion-analysis
Geospatial data analysis of the UCS Satellite Database using Python to map active LEO space assets, validate orbital parameters, and isolate mega-constellation traffic bottlenecks.
aerospace-engineering data-analysis geospatial-analysis orbital-mechanics pandas python satellite-data seaborn
Last synced: 08 Jun 2026
https://github.com/monish-nallagondalla/sensor_fault_detection
This repo contains sensor data for analysis, focusing on sensor readings, their attributes, and classification (Good/Bad). It includes 500+ sensors with features for predictive modeling, anomaly detection, and sensor failure prediction.
anomaly-detection classification data-analysis data-science machine-learning predictive-modeling python sensor-data
Last synced: 01 May 2026
https://github.com/pratanup/solar-power-generation-prediction
A solar power generation company wants to optimize solar power production and needs a model to predict ‘Clearsky DHI’, ‘Clearsky DNI’, and ‘Clearsky GHI’.
anaconda data-analysis data-science google-colab jupiter-notebook machine-learning machine-learning-algorithms machinelearning-python prediction prediction-model python
Last synced: 01 May 2026