https://github.com/maayanlab/druglib_enrichment_comparison

Comparison of drug target libraries using enrichment analysis.
https://github.com/maayanlab/druglib_enrichment_comparison

Last synced: 4 months ago
JSON representation

Comparison of drug target libraries using enrichment analysis.

Host: GitHub
URL: https://github.com/maayanlab/druglib_enrichment_comparison
Owner: MaayanLab
License: gpl-3.0
Created: 2018-01-13T19:08:48.000Z (over 7 years ago)
Default Branch: master
Last Pushed: 2018-06-18T07:48:01.000Z (about 7 years ago)
Last Synced: 2025-01-22T03:16:00.649Z (6 months ago)
Language: HTML
Homepage:
Size: 43.4 MB
Stars: 2
Watchers: 5
Forks: 1
Open Issues: 1
Metadata Files:
- Readme: README.mdown
- License: LICENSE

Awesome Lists containing this project

README

# Summary

In this project, I evaluated the accuracy of different drug target libraries by measuring their agreement with drug perturbation signature libraries. The drug target libraries I looked at are DrugBank, the Target Central Resource Database, the Drug Repurposing Hub, the Drug Gene Interaction Database (DGIdb), DrugCentral, STITCH, and the Drug Target Commons. The drug perturbation signature libraries I used are the CRowd Extracted Expression of Differential Signatures (CREEDS) and the LINCS L1000. For all drug target libraries, since their average number of gene targets was small, I used hu.MAP and BioGRID, protein-protein interaction (PPI) databases, and ARCHS4, a protein co-expression database, to create expanded gene sets from the target lists.

I measured agreement using enrichment analysis–specifically, the Fisher’s exact test for overrepresentation. I treated each (drug, expanded target list) from the drug target libraries as an (annotation, gene set) pair, and each (drug sample, list of most-perturbed genes) from the drug perturbation signature libraries also as an (annotation, gene set) pair. I then performed enrichment into one library using the gene sets from the other library, and observed how highly the matching annotations, i.e. those which correspond to the same drug as the input, were ranked. If two libraries agree with one another, then these matching annotations should often be highly ranked.

The full description and results of this project can be found here:

* [With code](http://htmlpreview.github.io/?https://github.com/MaayanLab/DrugLib_Enrichment_Comparison/blob/master/writeup_with_code.html)
* [Without code](http://htmlpreview.github.io/?https://github.com/MaayanLab/DrugLib_Enrichment_Comparison/blob/master/writeup.html)

# How to Use

Here is how to run the code and replicate my results:

1. Clone this repository to your computer.
2. Download the library files from [this zip file](https://drive.google.com/file/d/1RZVz7W-MuikJbX0IlJvQwbuMsBjtYD5k/view?usp=sharing). It contains three folders, one for each format: "original_drug-gene_libs", the original format, "gvms", gene vector matrices, and "gmts", gene matrix transpose. Place "original_drug-gene_libs" as a subfolder within the main repository folder. The other two folders will be re-created in the next step.
3. Run `writeup.rmd` to convert and expand the drug libraries.
4. Run `perform_enrichment.py` to perform enrichment analysis.
5. Run `visualize_results` to produce the bridge plots visualizing the results of enrichment.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/maayanlab/druglib_enrichment_comparison

Awesome Lists containing this project

README