Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/quantixed/volcanoplot
Making volcano plots from proteomic data in IgorPro
https://github.com/quantixed/volcanoplot
igor-pro proteomics-data statistics volcano
Last synced: about 2 months ago
JSON representation
Making volcano plots from proteomic data in IgorPro
- Host: GitHub
- URL: https://github.com/quantixed/volcanoplot
- Owner: quantixed
- License: mit
- Created: 2017-10-21T05:50:04.000Z (about 7 years ago)
- Default Branch: master
- Last Pushed: 2024-05-09T12:10:48.000Z (8 months ago)
- Last Synced: 2024-05-09T13:39:11.625Z (8 months ago)
- Topics: igor-pro, proteomics-data, statistics, volcano
- Language: IGOR Pro
- Size: 2.56 MB
- Stars: 0
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# VolcanoPlot
[![DOI](https://zenodo.org/badge/107754967.svg)](https://zenodo.org/doi/10.5281/zenodo.12570017)
Making volcano plots from proteomic data in IgorPro
- [Using data direct from MaxQuant](#from-maxquant)
- [Doing manual analysis](#manual-analysis)
- [Volcano Plot outputs](#outputs)
- [Subcellular analysis](#subcellular-analysis)
- [Further notes](#further-notes)## From MaxQuant
You can load data from a `proteinGroups.txt` file using the *Load MaxQuant Data...* option. This will load the data and do the analysis.
This procedure will do a comparison of "LFQ Intensity" from proteomics data using two conditions.Tell Igor which two conditions you want to compare (give Igor a wildcard searchstring to specify waves), e.g. `ctrl*` and `test*` will find ctrl_1,ctrl_2 etc. and test_1,test_2 etc.
Also tell Igor what the basevalue is. This is the intensity assigned to any proteins which were not detected. Default is 0 (for MaxQuant).
If you have several `proteinGroups.txt` files, it is possible to analyse them together using the *Load Multiple MaxQuant...* option.
Due to potential differences in LFQ values between files, the default is to display the mean of ratios rather than the ratio of means.
Paired T-test is the default test, otherwise this option will give equivalent outputs as for a single file.For both types of analysis it there is an option to give Perseus style processing.
When this option is checking the difference is calculated by subtraction using log2 transformed LFQ data, rather than a ratio of the linear data.## Manual analysis
Analysis can also be started manually from the Proteomics menu.
In this case, supply data (intensity, LFQ_Intensty or peptides) for the conditions you want to analyse (with logical naming) and also the protein names. These need to be in TextWaves with names `NAME` and `SHORTNAME`.Minimum data to run (all in `root:`):
- Three waves of condition1
- Three waves of condition2 - logical naming advised
- NAME and SHORTNAME textwaves## Outputs
Igor will make a Volcano Plot of the comparison. Proteins are coloured according to magnitude of change from control and according to P-value. Proteins can be clicked on to reveal their `SHORTNAME`.
Transforms and imputation are done exactly as described for the default settings in Perseus.
## Secondary analysis
An option for using Gene Ontology terms to filter the data is included...
### Subcellular analysis
Once the data has been processed, you can do further (subcellular) analysis. For example, you may want to only view the hits that are associated with a certain GO Term or group of GO Terms. This is possible but involves some manual steps.
Select _Proteomics > Subcellular Analysis > Make List to Retrieve Uniprot Data_
Follow the instructions and paste the data into Uniprot's [ID mapping tool](https://www.uniprot.org/uploadlists). Select the appropriate species as shown, and submit.
![Screen01](img/uniprot01.png?raw=true)
You will see your results, but we need additional columns. Click here. Select _Gene Ontology (cellular component)_ and _Subcellular location [CC]_ as shown and save. You only need to do this on your browser once and your choice will be remembered for your next visit if you have cookies enabled.
![Screen02](img/uniprot02.png?raw=true)
![Screen03](img/uniprot03.png?raw=true)
You should see these columns. Click save.
![Screen04](img/uniprot04.png?raw=true)
We need to filter for Reviewed records, click here.
You should now have fewer records (only the reviewed ones). Now click download.
Select tab-separated file and Uncompressed.
Now, back in Igor select _Proteomics > Subcellular Analysis > Load and Match Uniprot Data..._
Find the file that you downloaded from Uniprot. Igor will now present you with a list of all the GO Terms that Uniprot found from your hits. Select using cmd-click (on a Mac) and then click _Filter_. Shift-click will select contiguous rows. You can reset the selection by clicking the button or by simply click-selecting one new row.
After clicking _Filter_, a new Volcano Plot will be generated together with a table to show you the hits that matched the GO terms you selected. If you want to change your selection, there is no need to reload just select _Proteomics > Subcellular Analysis > Filter for GO Term(s)_
## Further notes
It is possible to do pairwise comparisons for the volcano plot by clicking the checkbox.
The default is to look at a 2-fold change, but this can be changed in the dialog box.
A PCA is generated for the selected data.
This works by comparing intensities of each detected protein per replicate, in the datasets.
The PCA is designed to be equivalent to SIMCA-P+.Note that on loading of MaxQuant data, any rows that have no information for Gene Name or Protein Name (i.e. contaminants) are filtered out during the load procedure.