https://github.com/singjc/redeem

Repository for Deep Learning Models for Mass Spectrometry written in Rust
https://github.com/singjc/redeem

deep-learning mass-spectrometry rust semi-supervised-learning

Last synced: 7 months ago
JSON representation

Repository for Deep Learning Models for Mass Spectrometry written in Rust

Host: GitHub
URL: https://github.com/singjc/redeem
Owner: singjc
Created: 2024-10-09T18:13:53.000Z (about 1 year ago)
Default Branch: master
Last Pushed: 2025-03-13T19:27:01.000Z (7 months ago)
Last Synced: 2025-03-13T19:31:42.320Z (7 months ago)
Topics: deep-learning, mass-spectrometry, rust, semi-supervised-learning
Language: Rust
Homepage:
Size: 326 KB
Stars: 11
Watchers: 4
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

ReDeem Logo

---

# ReDeeM: Repository for Deep Learning Models for Mass Spectrometry

ReDeeM is a Rust crate designed for implementing deep learning models specifically tailored for mass spectrometry data. The primary goal of this project is to facilitate the prediction of peptide properties and to develop classifier scoring models (TDA).

### Usage

The ReDeeM crates are designed to be used as a library in other projects, i.e. in Sage. To use the ReDeeM crates, add the following to your `Cargo.toml` file:

```toml
[dependencies]
redeem-properties = { git = "https://github.com/singjc/redeem.git", branch = "master" }
redeem-classifiers = { git = "https://github.com/singjc/redeem.git", branch = "master" }
```

**Note**: The ReDeeM crates are still under development and are not yet available on crates.io.

### Current Crates

The ReDeeM project consists of two primary crates:

1. **redeem-properties**:
- This crate focuses on deep learning models for peptide property prediction. It implements models for predicting retention time (RT), ion mobility (IM), and MS2 fragment intensities using the Candle library.
- The models can be fine-tuned on new data and can be saved in the safetensor format for later use.

- Current Models

Model | Name | Architecture | Implemented
--- | --- | --- | ---
AlphaPept RT Model | `redeem_properties::RTCNNLSTMModel` | CNN-LSTM | :heavy_check_mark:
AlphaPept MS2 Model | `redeem_properties::MS2BertModel` | Bert | :heavy_check_mark:
AlphaPept IM Model | `redeem_properties::CCSCNNLSTMModel` | CNN-LSTM | :heavy_check_mark:

2. **redeem-classifiers**:
- This crate is aimed at developing semi-supervised scoring classifier models. The goal is to create models for separating target peptides from decoys.

- Current Models

Model | Name | Architecture | Implemented
--- | --- | --- | ---
XGBoost Classifier | `redeem_classifiers::XGBoostClassifier` | XGBoost | :heavy_check_mark:
GBDT Classifier | `redeem_classifiers::GBDTClassifier` | GBDT | :heavy_check_mark:
SVM Classifier | `redeem_classifiers::SVMClassifier` | SVM | :heavy_check_mark:

> [!NOTE]
> To use the XGBoost classifier, or the SVM classifier, you need to compile with the `--features xgboost` or `--features linfa` flag respectively.

> [!IMPORTANT]
> The XGBoost crate is a wrapper around the original XGBoost library, which requires clang/c++ to be installed on the system. On Ubuntu, you can do the following:

```bash
sudo apt update
sudo apt install build-essential
sudo apt install clang
sudo apt install libstdc++-12-dev
```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/singjc/redeem

Awesome Lists containing this project

README