Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/SGDDNB/translational_regulation

Detection of differential translated genes using Ribo-seq
https://github.com/SGDDNB/translational_regulation

Last synced: about 1 month ago
JSON representation

Detection of differential translated genes using Ribo-seq

Awesome Lists containing this project

README

        

## Detection and classification of differential translation-efficiency genes with DTEG.R

Citation: Chothani, S., Adami, E., Ouyang, J. F., Viswanathan, S., Hubner, N., Cook, S. A., Schafer, S., & Rackham, O. J. L. (2019). deltaTE: Detection of translationally regulated genes by integrative analysis of Ribo-seq and RNA-seq data. Current Protocols in Molecular Biology, 129, e108. doi.org/10.1002/cpmb.108

Link to publication
https://currentprotocols.onlinelibrary.wiley.com/doi/full/10.1002/cpmb.108

**Preparing input files**
Calculating differentially-TE genes (DTEGs) requires the count matrices for both Ribo-seq and RNA-seq. These should be the raw counts obtained from feature counts (or any other read counting software), they should not be normalized or batch corrected. Each row should represent a gene and each column represents a sample. The matrix should have a header as shown below.

Ribo-seq count matrix (RPFs):

| Gene ID | Sample 1 | Sample 2 | Sample 3 | Sample 4 |
| --------|----------|----------|----------|----------|
| Gene 1 | 1290 | 130 | 2 | 1000 |
| Gene 2 | 2 | 10 | 5 | 1 |
| .. | | | | |
| .. | | | | |
| .. | | | | |
| .. | | | | |
| Gene Z | 200 | 140 | 15 | 11 |

RNA-seq count matrix (mRNA counts):

| Gene ID | Sample 5 | Sample 6 | Sample 7 | Sample 8 |
| --------|----------|----------|----------|----------|
| Gene 1 | 1290 | 130 | 2 | 1000 |
| Gene 2 | 2 | 10 | 5 | 1 |
| .. | | | | |
| .. | | | | |
| .. | | | | |
| .. | | | | |
| Gene Z | 200 | 140 | 15 | 11 |

Next, we also need to prepare a also requires a sample information file which should follow the same sample order as count matrices. This file outlines the condition, sequencing type and batch for each sample. This script requires the same header names as shown below (case sensitive). If your experiment has more covariates it is recommended to use the Alternate protocol.

Sample information file:

| SampleID | Condition | SeqType | Batch |
| --------|----------|----------|----------|
| Sample 1 | 1 | RIBO | 1 |
| Sample 2 | 1 | RIBO | 2 |
| Sample 3 | 2 | RIBO | 3 |
| Sample 4 | 2 | RIBO | 4 |
| Sample 5 | 1 | RNA | 1 |
| Sample 6 | 1 | RNA | 2 |
| Sample 7 | 2 | RNA | 3 |
| Sample 8 | 2 | RNA | 4 |

**Running script DTEG.R for detection and classification of DTGs and DTEGs**
Once the input files are ready the script DTEG.R can be executed on the bash shell prompt as follows:

$ Rscript --vanilla DTEG.R arg1 arg2 arg3 arg4 arg5 arg6
where,
Argument 1 (arg1): Ribo-seq count matrix
Argument 2 (arg2): RNA-seq count matrix
Argument 3 (arg3): Sample information file
Argument 4 (arg4): Batch effect covariate yes=1, or no=0
Argument 5 (arg5): Default = 1, Save Rdata
Argument 6 (arg6): Default = 0, Verbose mode

Example:
$ Rscript --vanilla DTEG.R ribo_counts.txt rna_counts.txt sample_info.txt 1

**Note: Log fold shrinkage function usage is updated now to adapt to latest DESeq2 version updates**
Please install apeglm R package for smooth functioning of DTEG.R
https://bioconductor.org/packages/release/bioc/html/apeglm.html