Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2025-02-18 00:08:03 UTC
- JSON Representation
https://github.com/malexandersalazar/tools-python-mssql-statistics-descriptor
A lightweight tool based on sweetviz that generates high-density visualizations to kickstart Exploratory Data Analysis within Microsoft Azure SQL Database using ODBC with just one line of code
azure-sql-database data-analysis data-visualization eda python
Last synced: 17 Feb 2025
https://github.com/jhrcook/protein-language-models
Experimenting with protein language model predictions
data-analysis protein-language-model variant-effect-prediction
Last synced: 13 Jan 2025
https://github.com/marwan-ahmed-23/text-sentiment-analysis-api
A lightweight Python project for analyzing the sentiment of textual data using the TextBlob library. This project provides a simple and effective way to measure the polarity and subjectivity of any given text.
data-analysis machine-learning python python-project sentiment-analysis text-analysis text-mining
Last synced: 05 Jan 2025
https://github.com/yash22222/literacy-exploration-analysis
Delve into India's literacy landscape through data analysis. Uncover regional disparities, high/low literacy states & gender imbalances.
csv data-analysis data-visualization government-data india literacy literacy-analysis states
Last synced: 05 Jan 2025
https://github.com/yash22222/data-analysis-on-real-time-social-media-comments
EngageInsight analyzes user interactions in comment data. It provides insights through visualizations created using Python libraries like Pandas and Matplotlib. The project aims to uncover patterns and trends in user engagement. The visualizations provide an overview of comment lengths, the frequency of different types of replies.
data-analysis data-cleaning-and-preprocessing data-visualization matplotlib pandas pattern-recognition real-time-social-media-data seaborn trend-analysis
Last synced: 05 Jan 2025
https://github.com/yash22222/cinesphere-crafting-personalized-movie-experiences
"CineSphere" is a groundbreaking project developing a personalized movie recommendation engine. By analyzing user preferences and viewing history, CineSphere suggests movies tailored to individual tastes, revolutionizing the movie-watching experience.
cinesphere data-analysis imdb machine-learning movie-recommendation-engine movie-recommendation-system movielens real-time
Last synced: 05 Jan 2025
https://github.com/yash22222/british-airways-data-science-internship
All 2 Task Assigned By British Airways Data Science Virtual Internship Programme
csv data-analysis data-science data-visualization google-colaboratory jyputer-notebook machine-learning microsoft-excel microsoft-powerpoint python
Last synced: 05 Jan 2025
https://github.com/yash22222/pwc-power-bi-virtual-case-experience
The Power BI PwC Virtual Case Experience is an exciting and educational program designed to provide participants with hands-on exposure to Power BI, a prominent business intelligence and data visualization tool, within the context of consulting at PwC.
business-analyst business-analytics business-intelligence dashboard data-analysis data-analyst data-analytics dax microsoft-power-bi powerbi powerbi-dashboards powerbi-visuals pwc
Last synced: 05 Jan 2025
https://github.com/arthurosipyan/jamascript
JamaScript, your personal data assistant
blueprint data-analysis data-mining data-science exceldatareader jama jama-api python python-3 python-script python3
Last synced: 13 Jan 2025
https://github.com/justinhennis1/hackathon24
Hofstra's Hacknology Competition 2024 - Team Null Pointers
data data-analysis data-science data-visualization data-visualization-python dataanalysis dataanalytics traveling web webapplication
Last synced: 13 Jan 2025
https://github.com/andrewzgheib/football-database-analysis
Football database utilizing PostgreSQL and Pandas for data management, with PowerBI for intuitive KPI visualization
data-analysis data-visualization database pandas pgsql postgr powerbi sql
Last synced: 10 Feb 2025
https://github.com/luminati-io/walmart-dataset-samples
A sample dataset of over 1000 Walmart products, extracted using the Bright Data API, ideal for consumer market insights and competitor analysis.
api data-analysis dataset walmart walmart-scraper web-scraping
Last synced: 23 Jan 2025
https://github.com/lopez86/rust-mlearn
Machine Learning Tools in Rust
data-analysis data-science machine-learning rust
Last synced: 17 Feb 2025
https://github.com/lopez86/datascienceexamples
Examples of various data science & data analysis topics using various sources of data.
data-analysis data-science pandas scikit-learn tutorial visualization
Last synced: 17 Feb 2025
https://github.com/shreshthvashisht/instgram-user-analytics
SQL Fundamentals
data data-analysis data-science mysql social-network-analysis
Last synced: 06 Jan 2025
https://github.com/shreshthvashisht/abc-call-volume-trend-analysis
Customer Experience Analysis
advanced-excel call-centre-analysis call-volume-trend data-analysis data-visualisation experience-analytics pivot-tables
Last synced: 06 Jan 2025
https://github.com/shreshthvashisht/hiring-process-analytics
Statistics Using Excel
advanced-excel data-analysis data-science data-visualization excel hr-analytics statistics
Last synced: 06 Jan 2025
https://github.com/shreshthvashisht/xyz-ads-airing-report_analysis
Ad data analysis using Advanced Excel
ad-airing-analysis advanced-excel data-analysis data-visualization pivot-tables
Last synced: 06 Jan 2025
https://github.com/nilayhangarge/data-analysis-with-python
This repository provides a practical introduction to data acquisition and analysis using Pandas. It covers loading datasets, exploring data, manipulating data, and gaining insights through statistical summaries. Ideal for beginners, it offers code examples and explanations to enhance your data manipulation skills using Pandas for Python.
data-acquisition data-analysis data-analytics data-binning data-cleaning data-engineering data-fundamentals data-insights data-integration data-preprocessing data-science data-wrangling numpy pandas python
Last synced: 06 Jan 2025
https://github.com/motapinto/agent-based-simulation-conquest
Agent-based simulation modelation of the conquest Battlefield gamemode
agent-based-simulation data-analysis jade java sajas swing
Last synced: 17 Feb 2025
https://github.com/quantumudit/groceries-basket-analysis
This project performs market basket analysis using Power BI and Python to reveal associations between grocery items. It involves transforming raw transaction data into a processed dataset, creating interactive Power BI reports, and generating key insights through Python, enabling data-driven decision-making.
data-analysis data-visualization pandas powerbi python
Last synced: 17 Feb 2025
https://github.com/mosalem149/pythonutilities
A collection of Python scripts for common utility tasks including file manipulation, word counting, longest word detection, and grade categorization. Perfect for quick and easy solutions to everyday programming problems.
data-analysis educational-tools file-io file-manipulation grade-calculation python text-analysis text-processing utility word-counting
Last synced: 06 Jan 2025
https://github.com/andremenezesds/pa004_health_insurance
Health Insurance Cross-Sell(Learning to Rank Machine Learning Project)
backend backend-api data-analysis data-science data-visualization dataviz lgbm machine-learning matplotlib numpy optuna pandas python scikit-learn shell-script sql webapi xgboost
Last synced: 10 Jan 2025
https://github.com/alphatwirl/qtwirl
qtwirl (quick-twirl), one-function interface to AlphaTwirl
alphatwirl data-analysis data-frame pandas r root-cern
Last synced: 20 Jan 2025
https://github.com/sebastianurdaneguibisalaya/enfermedades-fissal
Análisis holístico de atenciones por enfermedades raras, huérfanas y transplantes coberturados por FISSAL en el Perú.
data-analysis data-visualization python
Last synced: 06 Jan 2025
https://github.com/sebastianurdaneguibisalaya/colocaciones-de-credito-fondo-mivivienda-peru
Exploro las Colocaciones de Crédito del Fondo MIVIVIENDA S.A. entre 2018 y 2022, con un conjunto de datos descargado del Portal Nacional de Datos Abiertos del Perú. 🏠
data-analysis jupyter-notebook python
Last synced: 06 Jan 2025
https://github.com/satyacoder29/crowdfunding-in-sql
Crowdfunding is a method of raising funds for projects or causes by collecting small contributions from a large group of people, usually through online platforms. It enables individuals, startups, and nonprofits to secure funding, offering rewards or recognition in exchange, and helps bring ideas to life without traditional financing.
data-analysis data-cleaning database-management mysql-database quries sql sql-functions sql-server views
Last synced: 28 Dec 2024
https://github.com/hugo-hattori/rpa_email_report
Robotic Process Automation Project.
automation data-analysis data-analysis-python data-analytics jupyter jupyter-notebook pandas pandas-dataframe pandas-python pyautogui pyautogui-automation pyperclip python time
Last synced: 28 Dec 2024
https://github.com/hugo-hattori/watercraft_values_ai_prediction
Data Science Project.
ai-model artificial-intelligence artificial-intelligence-algorithms data-analysis data-analytics data-science jupyter jupyter-notebook machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot pandas pandas-dataframe pandas-python python seaborn sklearn sklearn-library sklearn-metrics
Last synced: 28 Dec 2024
https://github.com/jillie-wink/sql-portfolio
SQL Data Analysis Projects
data-analysis data-manipulation portfolio sql sqlite
Last synced: 20 Jan 2025
https://github.com/al-ogr/sf_pr1_job_analysis_hh
SkillFactory DataScience PROJECT-1. Анализ резюме из HeadHunter
data-analysis data-science ipynb plotly python
Last synced: 20 Jan 2025
https://github.com/eshaagarwa/sales_insight_project
Sales insights project using Powerbi and SQL
data-analysis data-visualization databse datacleaning datamodeling microsoft-power-bi mysql-database powerbi sales-insights sql
Last synced: 28 Dec 2024
https://github.com/vavarm/data-analysis-french-electric-automobile-infrastructure
Data analysis realized in R Shiny and Python about the French electric vehicle and charging station infrastructure
data-analysis data-science data-visualization factominer geojson ggplot2 plotly python r rshiny
Last synced: 20 Jan 2025
https://github.com/nikbarb810/covid_growth_rate_390.51
Exploring Covid Growth Rate of European Population using genetic data analysis
bioinformatics data-analysis r rcpp
Last synced: 01 Jan 2025
https://github.com/vara-co/solar-eclipse-2024
Group Project on the 2024 Solar Eclipse's Path over the US with an interactive map and a couple of visualizations on the data gathered.
data-analysis data-visualizations html-css-javascript interactive-map javascript map solar-eclipse
Last synced: 02 Feb 2025
https://github.com/drisskhattabi6/exploratory-data-analysis-projects
This Repo contains My Exploratory Data Analysis Projects for many datasets
data-analysis data-preprocessing data-visualization datasets diabetes-prediction eda exploratory-data-analysis iris-dataset
Last synced: 26 Jan 2025
https://github.com/drisskhattabi6/data-analysis-and-ml-app
A Python desktop application using CustomTkinter for data analysis and machine learning.
custom-tkinter data-analysis data-processing data-visualization desktop-application machine-learning machine-learning-models machine-learning-pipeline tkinter
Last synced: 26 Jan 2025
https://github.com/gurpreet17/uc-davis-sql-for-data-science-specialization
Completed the SQL Basics for Data Science Specialization from the University of California, Davis, gaining proficiency in Data Analysis, SQL, Apache Spark, and Delta Lake.
apache-spark bigdata data-analysis data-science delta-lake sqlite
Last synced: 28 Dec 2024
https://github.com/codeonthespectrum/web-scrap
Este projeto realiza o web scraping da Wikipédia para obter dados sobre os municípios mais populosos do estado do Rio de Janeiro.
data-analysis data-visualization webscraping
Last synced: 17 Feb 2025
https://github.com/sferez/gradient_descent
Multiple Linear Regression, Gradient Descent with Python
data-analysis data-science gradient-descent linear-regression python
Last synced: 13 Jan 2025
https://github.com/sferez/simple_linear_regression
Simple Linear Regression using Python
data-analysis data-science linear-regression python regression
Last synced: 13 Jan 2025
https://github.com/jethronap/jstat-gui
Web-based GUI application for data analysis
data-analysis data-visualization java jstat mongodb
Last synced: 06 Jan 2025
https://github.com/rileynwong/forecasting-coffee-prices
Predict coffee prices in Kenya
data data-analysis data-scraping data-visualization forecasting forecasting-models forecasting-prices jupyter-notebook prophet prophet-model
Last synced: 27 Jan 2025
https://github.com/analyticalnahid/pandas-tutorial
A complete tutorial on Pandas for Data Science and Machine Learning
data-analysis pandas pandas-python pandas-tutorial
Last synced: 02 Feb 2025
https://github.com/pseudomanifold/pump
A generic data flow program
c-plus-plus-11 cplusplus data-analysis data-flow small
Last synced: 17 Feb 2025
https://github.com/nicodupont/kaggle
All my Data analysis and Competitions
data-analysis data-science data-visualization jupyter-notebook kaggle python
Last synced: 17 Feb 2025
https://github.com/rahulsm20/car-data
A data analytics project that involves analyzing a car dataset that includes information on various car brands, years, prices, mileage, and fuel types, in order to gain insights into the car market.
data-analysis data-analytics matplotlib numpy pandas python
Last synced: 06 Jan 2025
https://github.com/rahulsm20/storedata
A data analysis project aimed at analyzing the sales data of the super store and providing useful insight into customer preferences.
data-analysis matplotlib numpy pandas python streamlit
Last synced: 06 Jan 2025
https://github.com/inevolin/multivariate-data-analysis
Showcases of modern multivariate & multidimensional data analysis in industrial and high-tech settings.
analytics data-analysis data-science data-visualization javascript
Last synced: 11 Jan 2025
https://github.com/mr-chang95/udacity_movie_project
Movie Data Analysis and Visualization Project for Udacity's Data Analyst Program. Using Python in Jupyter Notebook.
data-analysis data-visualization jupyter-notebook movie python
Last synced: 26 Jan 2025
https://github.com/mr-chang95/datascience_airbnb
Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.
airbnb data-analysis data-science data-visualization jupyter-notebook numpy pandas python sklearn
Last synced: 26 Jan 2025
https://github.com/mr-chang95/sf_data_visualization
In this personal project, I am interested in examining all of the active businesses in the San Francisco Bay Area while performing some simple data visualizations, mainly on categorical variables.
business data-analysis data-visualization jupyter-notebook pandas python san-francisco
Last synced: 26 Jan 2025
https://github.com/vubacktracking/freecodecamp-data-analysis-with-python
5 Projects in Data Analysis With Python Course on Freecodecamp
data-analysis freecodecamp freecodecamp-project python
Last synced: 06 Jan 2025
https://github.com/imannoferesti/3d_datavisualizer_vr
🚀 3D Scatter Plot Visualizer in Unity VR: Explore large datasets in 3D with dynamic point dispersion and value grouping (round, ceil, floor). Perfect for trend analysis and dense data visualization! 🎨📊
3d-scatter-plot big-data continuous-data-handling customizable-visualization data-analysis data-science-tools data-visualization dense-data-visualization discrete-data-handling exploratory-data-analysis immersive-analytics interactive-visualization overlapping-data-points trend-analysis unity value-grouping virtual-reality vr vr-data-visualization
Last synced: 10 Feb 2025
https://github.com/kenatsf/basic_data_analysis
Basic data science project: ETL, forecast and data visualization.
analysis data data-analysis data-science logistic-regression matplotlib matplotlib-pyplot numpy pandas powerbi python scikit-learn time-series time-series-analysis time-series-forecasting
Last synced: 06 Jan 2025
https://github.com/rahulsm20/trackbyte
A full-stack web application that helps users keep track of their playlist and provides analytics based on their music taste. Built using React, Node.js, Express.js, MySQL and Bootstrap.
bootstrap data-analysis expressjs mysql nodejs reactjs sql
Last synced: 06 Jan 2025
https://github.com/rahulsm20/insurance-data
A data analytics project dealing with risk assessment and it's effects in health insurance.
data-analysis data-analytics machine-learning matplotlib numpy pandas python scikit-learn
Last synced: 06 Jan 2025
https://github.com/sieunnnn/data-lab
데이터 분석을 연습하는 Repository 입니다.
data-analysis jupyter-notebook python
Last synced: 13 Jan 2025
https://github.com/patricialjohnson/data-visualization-tableau-project
Tableau Visualization Project
business-analytics business-intelligence data-analysis data-visualization digital-marketing digital-marketing-agency kpi microsoft-excel program-management project-management python search-engine-optimization seo sql tableau
Last synced: 06 Jan 2025
https://github.com/ryuzen6/kaggle-series
This is a series of Machine Learning/Deep Learning Models made for practice.
artificial-intelligence data-analysis data-science deep-learning machine-learning python3
Last synced: 20 Jan 2025
https://github.com/yandexdataschool/ml-sweights-experiments
Experiments for the "Machine Learning on data with sPlot background subtraction" paper
data-analysis high-energy-physics machine-learning statistics
Last synced: 17 Feb 2025
https://github.com/abdul-wahab-318/black-friday-eda-prediction
EDA and model training on Black Friday dataset
data-analysis data-visualization eda machine-learning sklearn
Last synced: 17 Feb 2025
https://github.com/shrutiijoshi/coffee_sales
This project aims to analyze coffee sales data to identify key trends, patterns, and factors influencing sales performance.
Last synced: 18 Jan 2025
https://github.com/rainbowatcher/simple
Make data work easier, saving your working time
Last synced: 15 Feb 2025
https://github.com/datalopes1/manufacturing_defects
Projeto de EDA utilizando o Manufacturing Defects que pode ser encontrado no Kaggle
data-analysis data-visualization eda exploratory-data-analysis python
Last synced: 02 Feb 2025
https://github.com/datalopes1/fifa21_datacleaning
Neste projeto será feito o processo de limpeza e manipulação a partir do dataset FIFA 21 messy, raw dataset for cleaning/ exploring, que pode ser encontrado no Kaggle, com licensa CC0: Public Domain e enviado por Rachit Toshniwal.
data-analysis data-cleaning python
Last synced: 02 Feb 2025
https://github.com/danielrosehill/data-projects-index
Data apps and datasets deployed to Streamlit Community Cloud, Hugging Face, and elsewhere.
data-analysis data-science data-visualization
Last synced: 20 Jan 2025
https://github.com/datalopes1/warehouse_rfv
Neste projeto será realizada uma análise do tipo RFV (Recência, Frequência e Valor) com dados que encontrei neste video no Youtube do canal Jie Jenn.
analise-rfv data-analysis data-science kmeans python rfm-analysis
Last synced: 02 Feb 2025
https://github.com/datalopes1/bank_marketing
Este projeto será baseado no Dataset Bank Marketing encontrado na UC Irvine - Machine Learning Repository e disponibilizado por S. Moro, R. Laureano e P. Cortez
data-analysis data-science data-visualization eda python
Last synced: 02 Feb 2025
https://github.com/chdre/data-analyzer
A small package to analyze and preprocess data.
Last synced: 25 Jan 2025
https://github.com/rosanafss/data-visualization-nanodegree
Data Visualization Nanodegree
dashboard data-analysis data-preparation data-visualization design interactive sketch storytelling tableau wireframe
Last synced: 28 Dec 2024
https://github.com/rosanafss/predictive-analytics-for-business-nanodegree
Predictive Analytics for Business Nanodegree
alteryx alteryx-server alteryx-workflow clustering data-analysis data-visualization datacleaning experiment filtering join modeling prediction preparation randomized regression segmentation summarization tableau testing workflow
Last synced: 28 Dec 2024
https://github.com/rosanafss/sql-journey
SQL, practicing for Udacity Data Track.
data-analysis database datascience jupyter-notebook python queries relational-databases sql sql-server
Last synced: 28 Dec 2024
https://github.com/onemoredavid/python-like-a-boss
This is where I stash my Python study material.
data data-analysis data-engineering data-science data-visualization datascience ipynb ipynb-jupyter-notebook ipynb-notebook numpy pandas python python3
Last synced: 10 Feb 2025
https://github.com/alan-oliveir/state-of-data-2022
Neste projeto faço a análise da distribuição das faixas salariais para os profissionais de nível júnior para o cargo de analista, cientista e engenheiro de dados.
data-analysis jupyter-notebook pandas-python seaborn-python
Last synced: 13 Jan 2025
https://github.com/chanmeng666/advanced-neural-network-applications
【Stars make the code shine brighter! ⭐️】Educational project demonstrating practical applications of neural networks through perceptron-based fish classification and linear neuron heat influx prediction, implemented in Python with detailed Jupyter notebook examples and documentation.
classification data-analysis jupyter-notebook linear-neuron machine-learning neural-networks perceptron python
Last synced: 17 Feb 2025
https://github.com/chanmeng666/water-quality-testing-data-analysis
【Every star adds a spark to our coding journey!⭐️】A Python-based data analysis toolkit for water quality monitoring and prediction. Provides comprehensive analysis of key water quality parameters including pH, temperature, turbidity, dissolved oxygen, and conductivity.
data-analysis data-visualization environmental-monitoring jupyter-notebook machine-learning pandas python seaborn statistics water-quality
Last synced: 17 Feb 2025
https://github.com/roland045/smart_fluid_sedimentation_tester
Control program for custom developed smart fluid sedimentation tester system
arduino data-analysis instrumentation measurement sensor
Last synced: 06 Jan 2025
https://github.com/vasilescur/decsci-group-9
DECSCI 101 - Final Project (Group 9)
data-analysis decision-science qualtrics statistics visualization
Last synced: 16 Jan 2025
https://github.com/mrfoxak/movie-recommender-system-project
This is a Machine Learning Recommendation System Project
data-analysis machine-learning python recommender-system regression tokenization
Last synced: 26 Jan 2025
https://github.com/apfirebolt/numpy-and-pandas-examples
Some examples and sample datasets to learn numpy, pandas and other data science libraries in Python
data-analysis jupyter-notebook numpy pandas python
Last synced: 25 Jan 2025
https://github.com/touradbaba/multi-page_dash_application
This repository contains a Multi-Page Dash Application designed to provide interactive visualizations of geo-spatial data, focusing on population and GDP. The app offers insights into demographic and economic trends through interactive maps and various types of charts. It is built with Python, using Plotly and Dash, and is deployed on Heroku.
dash dashboard data-analysis data-visualization exploratory-data-analysis heroku-deployment plotly pythonanywhere
Last synced: 21 Jan 2025
https://github.com/aurcode/breast_cancer_predict_api
API that can classify whether breast cancer is benign or malignant based on measured characteristics.
Last synced: 08 Feb 2025
https://github.com/chanmeng666/douban-review-scraper
【One star = One happy developer doing a little dance 💃⭐️】A robust Python scraper for collecting and analyzing movie reviews from Douban.com, featuring comprehensive data processing and analysis capabilities.
beautifulsoup4 data-analysis data-processing douban movie-reviews pandas python sentiment-analysis text-mining web-scraping
Last synced: 17 Feb 2025
https://github.com/bhaskaracharjee/student-results-analysis
Analyzing student results to uncover insights
Last synced: 17 Feb 2025
https://github.com/mrham17/spotify_streaming_analytics
Project is stable & documentation will be completed soon. Thank you for your understanding and patience.
big-data-analytics data-analysis google-colab music-data r-programming spotify streaming-analytics
Last synced: 31 Jan 2025
https://github.com/xre22zax/roller-coaster
Explore award-winning wood and steel coasters from 2013-2018 Golden Ticket Awards & Captain Coaster, all powered by Python and interactive visualizations.
analytics data-analysis data-visualization pandas python python-lambda python3 visualization
Last synced: 17 Feb 2025
https://github.com/siddhartha-padhy/heart-disease-predictor
data-analysis machine-learning pandas python
Last synced: 23 Jan 2025
https://github.com/vvipjain/bike-sales-dashboard
Bike Sales Dashboard
dashboards data data-analysis data-cleaning data-normalisation data-visualization excel pivot-chart pivot-tables
Last synced: 10 Feb 2025
https://github.com/vvipjain/skipline-super-store-dashboard
SkipLine Super Store Dashboard
csv-files data-analysis data-visualization powerbi visualization
Last synced: 10 Feb 2025
https://github.com/vvipjain/bank-loan-report
Bank Loan Reports
data data-analysis data-visualization powerbi sql
Last synced: 10 Feb 2025
https://github.com/vvipjain/ev-data-analysis
EV Data Analysis
data data-analysis data-visualisation tableau tableau-public
Last synced: 10 Feb 2025
https://github.com/vvipjain/ecommerce-sales-analysis
Ecommerce Sales Analysis
data-analysis pandas pandas-dataframe python sql sqlalchemy
Last synced: 10 Feb 2025
https://github.com/brianlesko/r_data_science_stat5730
Written by Brian Lesko, the repository contains R Scripts demonstrating data science topics largely originating from study at Ohio State. Contents are written in R studio using the R markdown file. As of 1/21/23 Future projects concerning data science, statistics, and machine learning will be in python in my machine learning Repository
data data-analysis flight-data ggplot2 olympics-data r-markdown tidyverse
Last synced: 17 Feb 2025
https://github.com/gaurav-van/optimizing-rate-of-penetration-in-geothermal-drilling-a-digital-twin-approach
Let’s explore something interesting together. In this project, we developed a machine learning digital twin using Intel-optimized XGBoost and daal4py to simulate and optimize the Rate of Penetration (ROP) in geothermal drilling. We leveraged SHAP for Explainable AI (XAI) to interpret model predictions.
data-analysis data-science digital-twin explainable-ai geothermal geothermal-energy jupyter-notebook machine-learning python shap xai xgboost
Last synced: 02 Feb 2025
https://github.com/gaurav-van/data-analysis-projects
Collections of Projects that involves Data Analysis and Informed Decision Making
data-analysis database powerbi sql
Last synced: 02 Feb 2025