https://github.com/maayanlab/druglib_enrichment_comparison
Comparison of drug target libraries using enrichment analysis.
https://github.com/maayanlab/druglib_enrichment_comparison
Last synced: about 1 month ago
JSON representation
Comparison of drug target libraries using enrichment analysis.
- Host: GitHub
- URL: https://github.com/maayanlab/druglib_enrichment_comparison
- Owner: MaayanLab
- License: gpl-3.0
- Created: 2018-01-13T19:08:48.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2018-06-18T07:48:01.000Z (almost 7 years ago)
- Last Synced: 2025-01-22T03:16:00.649Z (3 months ago)
- Language: HTML
- Homepage:
- Size: 43.4 MB
- Stars: 2
- Watchers: 5
- Forks: 1
- Open Issues: 1
-
Metadata Files:
- Readme: README.mdown
- License: LICENSE
Awesome Lists containing this project
README
# Summary
In this project, I evaluated the accuracy of different drug target libraries by measuring their agreement with drug perturbation signature libraries. The drug target libraries I looked at are DrugBank, the Target Central Resource Database, the Drug Repurposing Hub, the Drug Gene Interaction Database (DGIdb), DrugCentral, STITCH, and the Drug Target Commons. The drug perturbation signature libraries I used are the CRowd Extracted Expression of Differential Signatures (CREEDS) and the LINCS L1000. For all drug target libraries, since their average number of gene targets was small, I used hu.MAP and BioGRID, protein-protein interaction (PPI) databases, and ARCHS4, a protein co-expression database, to create expanded gene sets from the target lists.
I measured agreement using enrichment analysis–specifically, the Fisher’s exact test for overrepresentation. I treated each (drug, expanded target list) from the drug target libraries as an (annotation, gene set) pair, and each (drug sample, list of most-perturbed genes) from the drug perturbation signature libraries also as an (annotation, gene set) pair. I then performed enrichment into one library using the gene sets from the other library, and observed how highly the matching annotations, i.e. those which correspond to the same drug as the input, were ranked. If two libraries agree with one another, then these matching annotations should often be highly ranked.
The full description and results of this project can be found here:
* [With code](http://htmlpreview.github.io/?https://github.com/MaayanLab/DrugLib_Enrichment_Comparison/blob/master/writeup_with_code.html)
* [Without code](http://htmlpreview.github.io/?https://github.com/MaayanLab/DrugLib_Enrichment_Comparison/blob/master/writeup.html)# How to Use
Here is how to run the code and replicate my results:
1. Clone this repository to your computer.
2. Download the library files from [this zip file](https://drive.google.com/file/d/1RZVz7W-MuikJbX0IlJvQwbuMsBjtYD5k/view?usp=sharing). It contains three folders, one for each format: "original_drug-gene_libs", the original format, "gvms", gene vector matrices, and "gmts", gene matrix transpose. Place "original_drug-gene_libs" as a subfolder within the main repository folder. The other two folders will be re-created in the next step.
3. Run `writeup.rmd` to convert and expand the drug libraries.
4. Run `perform_enrichment.py` to perform enrichment analysis.
5. Run `visualize_results` to produce the bridge plots visualizing the results of enrichment.