Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/marta-barea/hsgcms-paraffinwax-odor-ml
Data processing approaches for paraffin wax odor discrimination and quantification
https://github.com/marta-barea/hsgcms-paraffinwax-odor-ml
chemistry chemoinformatics chemometrics gc-ms gc-ms-data machine-learning machine-learning-algorithms petroleum petroleum-science r
Last synced: 21 days ago
JSON representation
Data processing approaches for paraffin wax odor discrimination and quantification
- Host: GitHub
- URL: https://github.com/marta-barea/hsgcms-paraffinwax-odor-ml
- Owner: Marta-Barea
- License: gpl-3.0
- Created: 2023-01-11T12:57:02.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2024-11-20T17:10:17.000Z (about 1 month ago)
- Last Synced: 2024-11-20T18:21:33.906Z (about 1 month ago)
- Topics: chemistry, chemoinformatics, chemometrics, gc-ms, gc-ms-data, machine-learning, machine-learning-algorithms, petroleum, petroleum-science, r
- Language: R
- Homepage:
- Size: 1.99 MB
- Stars: 0
- Watchers: 1
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# **Parffin Wax Odor detection using HS-GC/MS data combined with ML**
The **machine-learning_HSGCMS_ParaffinWax_Odor** repository contains data processing approaches and machine learning techniques applied to discriminate and quantify the odor of paraffin wax samples. The study focuses on the classification of paraffin wax based on hydroprocessing levels, which significantly affect their olfactory characteristics.
---
## ๐ Objective
The main goal of this project is to classify paraffin wax samples into five categories based on the degree of hydroprocessing:
- **None**: Non-hydrotreated paraffin.
- **Low**: 75% hydrotreated + 25% non-hydrotreated paraffin.
- **Moderate**: 50% hydrotreated + 50% non-hydrotreated paraffin.
- **High**: 25% hydrotreated + 75% non-hydrotreated paraffin.
- **Full**: Fully hydrotreated paraffin.---
## ๐ ๏ธ **System Requirements**
### Software
- **R version 4.2.0**
- RStudio (optional but recommended)1. **Hierarchical Clustering Analysis (HCA)**:
- Performed using the `hclust` function from the **stats** package (version 4.1.2).
- The agglomerative coefficient was calculated using the `agnes` function from the **cluster** package (version 2.1.3).
- Results were visualized with the `fviz_dend` function from the **factoextra** package (version 1.0.7).2. **Supervised Learning Models**:
- Developed using the `trainControl` and `train` functions from the **caret** package (version 6.0-93).
- Regression metrics were calculated using the **MLmetrics** package (version 1.1.1).
- Important variables for PLS and RF models were extracted with the `varImp` function from the **caret** package.3. **Data Visualization**:
- Plots generated using **ggplot2** (version 3.3.6) and **graphics** (version 4.1.2).4. **Interactive Web Application**:
- Developed with the **shiny** package (version 1.7.2).---
## ๐ Repository Structure
- `App/`: Contains the interactive web application developed using Shiny.
- `method development/`: Scripts and notebooks for developing data processing methods.
- `supervised algorithms/`: Scripts for supervised machine learning models.
- `unsupervised algorithms/`: Scripts for unsupervised learning models, such as clustering.
- `tis/`: Analysis scripts for Total Ion Spectra of paraffin wax samples.
- `.gitattributes` and `.gitignore`: Repository configuration files.
- `LICENSE`: Project license.
- `README.md`: This file.---
## โ๏ธ How to Use This Repository
### Clone the Repository
```bash
git clone https://github.com/Marta-Barea/hsgcms-paraffinwax-odor-ml
cd hsgcms-paraffinwax-odor-ml### Running the Shiny Application
1. Place `app.R`, `svm.rds`, `svr.rds`and `test_data.xlsx` in the same folder.
2. In your R console, run:
```R
shiny::runApp("app.R")3. Use the web interface to:
- ๐ **Upload** `.csv` or `.xlsx` data files.
- ๐ ๏ธ **Preprocess** data using advanced filtering techniques.
- ๐ค **Predict** hydroprocessing grades with AI.---
### ๐ **Example Dataset**
A sample dataset (`test_data.xlsx`) is included for demonstration purposes. It contains Vis-NIR spectral readings and hydroprocessing grades for various wax samples.---
### ๐ค **Contributors**
- **University of Cรกdiz (AGR-291 Research Group)**
- Specializing in hydrocarbon characterization and spectroscopy.---
### ๐ **License**
This project is licensed under the GNU GENERAL PUBLIC License. See `LICENSE` for details.