Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/cbg-ethz/netics
NetICS: network-based integration of multi-omics data for prioritizing cancer genes
https://github.com/cbg-ethz/netics
Last synced: about 8 hours ago
JSON representation
NetICS: network-based integration of multi-omics data for prioritizing cancer genes
- Host: GitHub
- URL: https://github.com/cbg-ethz/netics
- Owner: cbg-ethz
- License: gpl-3.0
- Created: 2017-02-21T13:08:21.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2021-06-27T21:13:17.000Z (over 3 years ago)
- Last Synced: 2024-08-09T00:21:41.528Z (3 months ago)
- Language: MATLAB
- Homepage:
- Size: 6.6 MB
- Stars: 15
- Watchers: 3
- Forks: 8
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-multi-omics - NetICS - Christos Dimitrakopoulos - Network-based integration of multi-omics data for prioritizing cancer genes - [paper](https://pubmed.ncbi.nlm.nih.gov/29547932/) (Software packages and methods / Multi-omics networks)
README
# NetICS: Network-based integration of multi-omics data for prioritizing cancer genes
NetICS performs a per sample bidirectional network diffusion technique to prioritize genes based on their proximity to genetically aberrant and differentially expressed genes. It provides rank aggregation techniques for integrating the sample-specific gene lists into an overall ranked list of genes.
The method is called as follows:
```
>> [ ranked_list_genes, ranked_scores ] = netics_fun( 'mutation_data_breast.txt', adj_lar_com, restart_prob, 'RANK_AGGREG=SUM', 'network_genes.txt', 'RNA_diff_expr_breast.txt', 'protein_diff_expr_breast.txt');
```'mutation_data_breast.txt' --> tab delimited file that contains the genetically aberrant genes of each sample. It contains two columns that map each gene (1st column) to the samples that it is genetically aberrant (2nd column).
'network_genes.txt' --> input file that contains the list of the genes that are present in the network. They should be in the same order as in the rows of the diffused matrices F and F_opp. An example file is given that contains the gene names of the network described in (Wu et al, 2010).
'RNA_diff_expr_breast.txt' --> tab delimited file with two columns. First column contains the genes for which differential expression between the tumor and normal samples at the RNA level was measured. Second column contains the p-values of these measurements. This file can be the result of a tool for differential expression analysis such as DESeq2. Each gene should have only one entry in this file.
'protein_diff_expr_breast.txt' --> tab delimited file with two columns. First column contains the proteins for which differential expression between the tumor and normal samples at the protein level was measured. Second column contains the p-values of these measurements. Each gene should have only one entry in this file.
The two files that contain the differentially expressed genes at the RNA and proteome levels (for example, 'RNA_diff_expr_breast.txt' and 'protein_diff_expr_breast.txt') are optional. If not provided, NetICS only uses the labels of the genetically aberrant genes for network diffusion.
The p-values in files 'RNA_diff_expr_breast.txt' and 'protein_diff_expr_breast.txt' should be provided unadjusted because they are combined by using the Fisher's method. After that, NetICS adjusts them for multiple testing by using Benjamini & Hochberg FDR correction. The function for computing FDR correction can be derived from https://brainder.org/2011/09/05/fdr-corrected-fdr-adjusted-p-values/.
Example files for 'mutation_data_breast.txt', 'RNA_diff_expr_breast.txt' and protein_diff_expr_breast.txt' are given based on the breast invasive carcinoma dataset in TCGA (BRCA).
'RANK_AGGREG' determines the rank aggregation scheme to be used. It can take the values "SUM", "MEDIAN" or "RRA". "SUM" computes the summation of the per sample ranks and "MEDIAN" computes the median. "RRA" implements the robust rank aggregation technique as described in (Kolde et al, 2012). The matlab code of the RRA method can be derived from http://ch.mathworks.com/matlabcentral/fileexchange/41835-rank-aggregation. You will need to include the files betaScores.m, correctBetaPvalues.m, rhoScores.m and thresholdBetaScore.m.
_adj_lar_com_ -> The adjacency matrix for the directed functional network described in (Wu et al, 2010). It is given as a .mat file (_adj_lar_com.mat_). Every entry of the matrix should be 1 in _A[i,j]_ if there is an edge from node _i_ to node _j_. You can load the adjacency matrix by typing:
```
>> load('adj_lar_com');
```_restart_prob_ -> The restart probability to be used in the insulated diffusion. For the (Wu et al., 2010) network, a restart probability of _0.4_ should be used. In general, a reasonable value should be chosen that depends on the network. See the HotNet2 algorithm (Leiserson et al., 2015) for selecting the restart probability based on the inflection point. Define your restart probability as follows:
```
>> restart_prob=0.4;
```You can also average the ranks over several values of the restart probability. The script _netics_fun_all.m_ gives an example for the TCGA breast invasize carcinoma dataset, for restart probability values between 0.2 and 0.8 with a step size of 0.1. After the execution of the _netics_fun_, we can access the 10 highest ranked genes of the method by typing:
```
>> ranked_list_genes(1:10)
```The files _pchisq.m_ and _pgamma.m_ were derived from https://ch.mathworks.com/matlabcentral/fileexchange/15171-jennrich-test/content/Jennrich/pchisq.m.
Whenever the word 'sample' is mentioned above, we mean one paired observation for which measurements for tumor and normal tissues are available.
Dependencies:
- Matlab (at least R2015a)### Contributions
- [Christos Dimitrakopoulos](https://www.bsse.ethz.ch/cbg/group/people/person-detail.html?persid=197642)
- [Niko Beerenwinkel](http://www.bsse.ethz.ch/cbg/group/people/person-detail.html?persid=149417)### Contact
```
Christos Dimitrakopoulos
christos.dimitrakopoulos (at) bsse.ethz.ch
```