Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/SGDDNB/translational_regulation
Detection of differential translated genes using Ribo-seq
https://github.com/SGDDNB/translational_regulation
Last synced: 3 months ago
JSON representation
Detection of differential translated genes using Ribo-seq
- Host: GitHub
- URL: https://github.com/SGDDNB/translational_regulation
- Owner: SGDDNB
- Created: 2017-11-27T10:57:12.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2021-04-10T03:25:20.000Z (over 3 years ago)
- Last Synced: 2024-07-01T18:34:58.202Z (4 months ago)
- Language: R
- Size: 20.1 MB
- Stars: 12
- Watchers: 3
- Forks: 11
- Open Issues: 9
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- awesome-riboseq - Code
README
## Detection and classification of differential translation-efficiency genes with DTEG.R
Citation: Chothani, S., Adami, E., Ouyang, J. F., Viswanathan, S., Hubner, N., Cook, S. A., Schafer, S., & Rackham, O. J. L. (2019). deltaTE: Detection of translationally regulated genes by integrative analysis of Ribo-seq and RNA-seq data. Current Protocols in Molecular Biology, 129, e108. doi.org/10.1002/cpmb.108
Link to publication
https://currentprotocols.onlinelibrary.wiley.com/doi/full/10.1002/cpmb.108**Preparing input files**
Calculating differentially-TE genes (DTEGs) requires the count matrices for both Ribo-seq and RNA-seq. These should be the raw counts obtained from feature counts (or any other read counting software), they should not be normalized or batch corrected. Each row should represent a gene and each column represents a sample. The matrix should have a header as shown below.Ribo-seq count matrix (RPFs):
| Gene ID | Sample 1 | Sample 2 | Sample 3 | Sample 4 |
| --------|----------|----------|----------|----------|
| Gene 1 | 1290 | 130 | 2 | 1000 |
| Gene 2 | 2 | 10 | 5 | 1 |
| .. | | | | |
| .. | | | | |
| .. | | | | |
| .. | | | | |
| Gene Z | 200 | 140 | 15 | 11 |RNA-seq count matrix (mRNA counts):
| Gene ID | Sample 5 | Sample 6 | Sample 7 | Sample 8 |
| --------|----------|----------|----------|----------|
| Gene 1 | 1290 | 130 | 2 | 1000 |
| Gene 2 | 2 | 10 | 5 | 1 |
| .. | | | | |
| .. | | | | |
| .. | | | | |
| .. | | | | |
| Gene Z | 200 | 140 | 15 | 11 |Next, we also need to prepare a also requires a sample information file which should follow the same sample order as count matrices. This file outlines the condition, sequencing type and batch for each sample. This script requires the same header names as shown below (case sensitive). If your experiment has more covariates it is recommended to use the Alternate protocol.
Sample information file:
| SampleID | Condition | SeqType | Batch |
| --------|----------|----------|----------|
| Sample 1 | 1 | RIBO | 1 |
| Sample 2 | 1 | RIBO | 2 |
| Sample 3 | 2 | RIBO | 3 |
| Sample 4 | 2 | RIBO | 4 |
| Sample 5 | 1 | RNA | 1 |
| Sample 6 | 1 | RNA | 2 |
| Sample 7 | 2 | RNA | 3 |
| Sample 8 | 2 | RNA | 4 |**Running script DTEG.R for detection and classification of DTGs and DTEGs**
Once the input files are ready the script DTEG.R can be executed on the bash shell prompt as follows:$ Rscript --vanilla DTEG.R arg1 arg2 arg3 arg4 arg5 arg6
where,
Argument 1 (arg1): Ribo-seq count matrix
Argument 2 (arg2): RNA-seq count matrix
Argument 3 (arg3): Sample information file
Argument 4 (arg4): Batch effect covariate yes=1, or no=0
Argument 5 (arg5): Default = 1, Save Rdata
Argument 6 (arg6): Default = 0, Verbose modeExample:
$ Rscript --vanilla DTEG.R ribo_counts.txt rna_counts.txt sample_info.txt 1**Note: Log fold shrinkage function usage is updated now to adapt to latest DESeq2 version updates**
Please install apeglm R package for smooth functioning of DTEG.R
https://bioconductor.org/packages/release/bioc/html/apeglm.html