Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/greenelab/gbm_immune_validation

Validating glioblastoma immune cell immunohistochemsitry using computational deconvolution of TCGA tumors
https://github.com/greenelab/gbm_immune_validation

analysis cancer data-science gene-expression glioblastoma machine-learning survival-analysis tool

Last synced: about 2 months ago
JSON representation

Validating glioblastoma immune cell immunohistochemsitry using computational deconvolution of TCGA tumors

Awesome Lists containing this project

README

        

# Investigating immune cell profiles across Glioblastoma subtypes

**Gregory Way and Casey Greene**
**University of Pennsylvania**

[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.195238.svg)](https://doi.org/10.5281/zenodo.195238)

[![Build Status](http://165.123.67.152/api/badges/greenelab/gbm_immune_validation/status.svg)](http://165.123.67.152/greenelab/gbm_immune_validation)

## Summary

The amount and type of immune cell infiltration into tumors is an important
determinant of disease progression and survival. Cancer subtypes can often be
distinguished by the rate of immune cell infiltration since this tends to be one
of the more dominant observable signatures in gene expression data. Current
methods for directly observing immune cell profiles in a given population
include laboriously quantifying immune cell proportions by flow cytometry or
other technically challenging cell labeling techniques. Therefore, deconvolution
methods are being developed to automatically extract immune cell proportions
from full tumor gene expression data.

We used ssGSEA ([Barbie et al. 2009](http:/doi.org/10.1038/nature08460 "ssGSEA"))
to deconvolute immune cell signatures from glioblastoma multiforme (GBM) tumors
from The Cancer Genome Atlas. Briefly, ssGSEA is a simple rank based test that
evaluates the empirical cumulative distribution function of input gene sets
compared to the eCDF of the remaining genes.

We used LM22.txt as defined by
[Newman et al. 2015](http://doi.org/10.1038/nmeth.3337 "CIBERSORT") as input
genelists to ssGSEA.

## Reproducibility

Our end to end analysis from downloading data to generating publication ready
figures is provided in this github repository. We implement an automatic
[reproducible workflow using continuous analysis](http://doi.org/10.1101/056473 "Beaulieu-Jones and Greene 2016")
to ensure a stable compute environment and consistent reproducibility.

We use the ssGSEA implementation available on
[bioconductor](https://bioconductor.org/packages/release/bioc/html/GSVA.html)
(Guinney and Castelo 2016).

```bash
# To reproduce the pipeline independently simply run:
bash run_pipeline.sh
```

For exact instructions on how to reproduce our analysis see `run_pipeline.sh`.

## Results

Our _in silico_ deconvolution of CD4+ cells, CD8+ cells, and Macrophages in TCGA
data matches very closely to immunohistochemistry estimates of the same cell types
in a separate dataset. The proportions of immune cell infiltrate across subtypes
corresponds strongly.

![immune deconvolution and IHC](figures/boxplot_validation_TCGA_summary.png?raw=true)

We also observed that high macrophage infiltration was associated with worse outcomes
in the TCGA dataset. This relationship was strengthed after adjusting for several
covariates including age, gender, and gene expression based subtype.

![TCGA macrophage survival](figures/TCGA_kaplanmeier_Macrophages.png?raw=true)

## Contact

For all code related questions, bug reporting, or feature requests please file a
[GitHub issue](https://github.com/greenelab/GBM_immune_profiles/issues "file an issue")

## Dependencies

All analyses were performed in R version 3.2.3 and packages were versioned with the
checkpoint package (version 0.3.18) set to a snapshot date of "2016-08-16". The checkpoint
package will automatically download all the specified packages at the versions they
existed in on that specific date. See [install.R](install.R) for more details.
The versions for each package are specified in [sessionInfo.txt](sessionInfo.txt).

We also provide a [Docker image](https://hub.docker.com/r/gregway/gbm_immune_validation/)
to recreate the compute environment. See the [Dockerfile](Dockerfile) for more details.