Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

awesome-proteomics

An awesome list of proteomics tools and resources
https://github.com/lyons89/awesome-proteomics

Last synced: 3 days ago
JSON representation

  • 1A. Leaning Resources - Proteomics

  • 1. Leaning Resources

  • 2. Databases

    • Table of Contents

      • ProteomeXchange
      • massIVE
      • Pride database
      • Pastel BioScience
      • Biogrid - protein-protein interaction database
      • KEGG - biological pathway database
      • Reactome - nicer looking biological pathway database
      • CPTAC - python/R - API interaface to publically available cancer datasets - [paper](https://pubs.acs.org/doi/10.1021/acs.jproteome.0c00919)
      • ppx - python - Python interface to proteomics data repositories - [paper](https://pubs.acs.org/doi/10.1021/acs.jproteome.1c00454)
      • Uniprot - Has all the information you will ever need to know for individual proteins and the go to for protein FASTA databases.
    • Table of Contents

      • Peptide-shaker
      • Fragpipe - Java - It's a very fast search engine with a nice GUI. The software is modular, it consists of [MSfragger](https://msfragger.nesvilab.org/) the database search algorithm, [Philosopher](https://philosopher.nesvilab.org/) that analyzes the database results, as well as others for PTM and TMT integration.
      • Comet - C++ - Free and open-source search engine, lately it's had several - [paper](https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/abs/10.1002/pmic.201200439)
      • DIA-NN - C/C++ - free and open source search tool for DIA data that uses neural networks, works using either a library or a FASTA database. - [paper](https://www.nature.com/articles/s41592-019-0638-x)
      • Sage - Rust - most likely the current fasest search engine, it's completely terminal based but if you learn to use it, it will be worth it - [paper](https://pubs.acs.org/doi/10.1021/acs.jproteome.3c00486?ref=PDF)
      • MaxQuant - [paper](https://www.nature.com/articles/nbt.1511)
      • skyline - software for targeted proteomics - [paper](https://pubmed.ncbi.nlm.nih.gov/20147306/)
  • 4. Assorted pipeline Tools

    • Table of Contents

      • MaxQuant Live
      • RawBeans - [paper](https://pubs.acs.org/doi/10.1021/acs.jproteome.0c00956?goto=supporting-info)
      • DIAproteomics - python - a module that can be added to a openMS workflow for the analysis of DIA data - [paper](https://pubs.acs.org/doi/10.1021/acs.jproteome.1c00123)
      • PAW_pipeline - python - a pretty much stock python raw file protoemics pipeline tool. It includes functions, to convert files, run comet, produce histograms. Can also do TMT - [paper](https://pubmed.ncbi.nlm.nih.gov/20157357/)
      • Ursgal - python - combines multiple search engine algorithms, postprocessing algorithms, and stastis on the output from multiple search engines - [paper1](https://pubs.acs.org/doi/10.1021/acs.jproteome.5b00860) [paper2](https://pubs.acs.org/doi/10.1021/acs.jproteome.0c00799)
      • DIAlignR - R - DIA retention time alignment of targetd MS data, including DIA and SWATH-MS - [paper](https://www.mcponline.org/article/S1535-9476(20)31843-0/fulltext)
      • Monocle - C# - for monoisotopic peak and accurate precursor m/z detection in shotgun proteomics experiments. - [paper](https://pubs.acs.org/doi/10.1021/acs.jproteome.0c00563)
      • mokapot - python - Semisupervised Learning for Peptide Detection - [paper](https://pubs.acs.org/doi/10.1021/acs.jproteome.0c01010)
      • qcloud2 - cloud based quality control pipeline, can be integrated with nextflow and openMS - [paper](https://pubs.acs.org/doi/10.1021/acs.jproteome.0c00853)
  • 5. Raw Data Analysis

    • Table of Contents

      • pyteomics - python - proteomics framework tools - [paper](https://pubs.acs.org/doi/10.1021/acs.jproteome.8b00576)
      • MSstatsQC - R - provides methods for multiple peptide monitoring using raw MS files, works for DDA and DIA data - [paper](https://pubs.acs.org/doi/full/10.1021/acs.jproteome.8b00732)
      • RforMassSpectrometry - R - a massive project that contains multiple helpful packages including RforMassSpectrometry, MsExperiment, Spectra, QFeatures, PSMatch, Chromatograms, MsCoreUtils, and MetaboCoreUtils.
      • MSnbase - R - provides MS data structures, allows you to process, quantify, visualize raw data - [paper](https://academic.oup.com/bioinformatics/article/28/2/288/199094?login=false)
      • MaRaCluster - C++ - clustering technique to identify fragment spectra stemming from the same peptide species - [paper](https://pubs.acs.org/doi/10.1021/acs.jproteome.5b00749)
      • pyproteome - python - analyzes proteomics data, can filter, normalize, perform motif and pathway enrichment. Currently only supports ProteomeDiscoverer .msf search files - [paper](https://pyproteome.readthedocs.io/en/latest/)
      • RawTools - C# - quality control checking of raw files, can assist in method development and insturment quality control - [paper](https://pubs.acs.org/doi/full/10.1021/acs.jproteome.8b00721)
      • rawDiag - R - Package that can be used in conjustion with rawrr - [paper](https://pubs.acs.org/doi/10.1021/acs.jproteome.8b00173)
      • COSS - java - user-friendly spectral library search tool - [paper](https://pubs.acs.org/doi/10.1021/acs.jproteome.9b00743)
      • rawrr - R - A great package that can read in raw thermo files! Thats great to me, because I always find it tedious to convert a raw file into a mzML or mzXML file - [paper](https://pubs.acs.org/doi/10.1021/acs.jproteome.0c00866)
      • PSpecteR - R - User Friendly and Interactive for Visualizing the quality of Top-Down and Bottom-Up Proteomics - [paper](https://pubs.acs.org/doi/10.1021/acs.jproteome.0c00857)
      • mpwR - R - package that allows you to directly compare the output of raw search engines such as MQ, DIANN, spectronaut and I think PD. It's also helpful if you're testing out different settings within your search engine and you want to quickly see how each performs. - [paper](https://pubmed.ncbi.nlm.nih.gov/37267150/)
  • 7. Protein Pathway Enrichment

    • Table of Contents

      • lipidR - R - lipidomics data analysis - [paper](https://pubs.acs.org/doi/10.1021/acs.jproteome.0c00082)
      • fgsea - R - fast gene set enrichment analysis - [paper](https://www.biorxiv.org/content/10.1101/060012v2.full)
      • pathfindR - R - active subnetwork oriented pathway enrichment analyses that uses protein-protein ineteraction networks to enchance the standard pathway analysis method - [paper](https://www.frontiersin.org/articles/10.3389/fgene.2019.00858/full)
      • phosphoRWHN - R - pathway enrichment for phosphoproteomics data - [paper](https://pubs.acs.org/doi/10.1021/acs.jproteome.1c00150)
      • leapR - R - package for multiple pathway analysis - [paper](https://pubs.acs.org/doi/10.1021/acs.jproteome.0c00963)
  • Miscellaneous

    • Table of Contents

      • cytoscape - visualizing protein-protein interaction netweorks
      • ProteoWizard - Great software for converting one MS file type to another. I mostly use it ot convert thermo .raw files to mzML - [paper](https://www.nature.com/articles/nbt.2377)
      • IPSC - Interactive Peptide Spectrum Annotator, web based utility for shotgun mass spectrum annotation - [paper](https://www.mcponline.org/article/S1535-9476(20)32771-7/fulltext)
      • PeCorA - R - peptide correlation analysis - [paper](https://pubs.acs.org/doi/10.1021/acs.jproteome.0c00602)
      • ProteaseGuru - C# - tool for In Silico Database Digestion, optimize bottom up experiments - [paper](https://pubs.acs.org/doi/10.1021/acs.jproteome.0c00954)
      • DeepLC - python - predicts retention times for peptides that have unseen modifications - [paper](https://www.nature.com/articles/s41592-021-01301-5)
  • 1B. Learning Resources - Programming

  • 6. Stastical Analysis

    • Table of Contents

      • MSstats - R - DDA/shotgun, bottom-up, SRM, DIA - [paper](https://academic.oup.com/bioinformatics/article/30/17/2524/2748156?login=false)
      • PaDuA - python - proteomics and phosphoproteomics data analysis - [paper](https://pubs.acs.org/doi/10.1021/acs.jproteome.8b00576)
      • MSstatsTMT - R - TMT shotgun proteomics - [paper](https://www.mcponline.org/article/S1535-9476(20)35114-8/fulltext)
      • proteiNorm - R - TMT and unlabeled, has multiple options for normalization and statistical analysis - [paper](https://pubs.acs.org/doi/10.1021/acsomega.0c02564)
      • DEqMS - R - Developed ontop of limma, but takes into account variability in PSMs. Works on both labelled and unlabelled samples - [paper](https://www.mcponline.org/article/S1535-9476(20)34997-5/fulltext)
      • MSstatsPTM - labeled and unlabeled PTM data analysis - [paper](https://www.mcponline.org/article/S1535-9476(22)00285-7/fulltext)
      • PermFDP - R - Package to perform multiple hypothesis correction using permutation based FDP. One of the better performing methods for multiple test corrections. - [paper](https://pubs.acs.org/doi/full/10.1021/acs.analchem.2c03719?casa_token=4CgZMMnAmjgAAAAA%3A-8SyKwz2Hs3L-yRXXQDkq45ZBPW8nexcpqaYzDB5Ok-Kgp0C_W9KPscLE-zUfN2nZUv8uiNYZZcCxy-7) the paper isn't on the tool, it's just a paper that uses it and compares it to other methods.
  • 8. Kinase Motif/Activity Analysis

    • Table of Contents

      • KSEAapp - R - Kinase substrate enrichment analysis. I would recommend using with a freshly downloaded kinase-substarte database from phosphositeplus - [paper](https://pubmed.ncbi.nlm.nih.gov/28655153/)
      • rnotifx - R - motif enrichment analyssis of PTMs on proteins, probably mostly used for phosphorylation - [paper](https://pubmed.ncbi.nlm.nih.gov/26572964/)
      • KSEAapp - R - Kinase substrate enrichment analysis. I would recommend using with a freshly downloaded kinase-substarte database from phosphositeplus - [paper](https://pubmed.ncbi.nlm.nih.gov/28655153/)
  • 9. Top down data analysis

    • Table of Contents

      • ClipsMS - python - analysis of terminal and internal fragments in top-down mass spectrometry data - [paper](https://pubs.acs.org/doi/10.1021/acs.jproteome.0c00952)
  • 10. Multi-Omics data analysis

    • Table of Contents

      • MOGSA - R - Multiple omics data integrative clustering and gene set analysis - [paper](https://www.mcponline.org/article/S1535-9476(20)32768-7/fulltext)
      • moCluster - R - Integration of multiple omics datasets to identify patterns - [paper](https://pubs.acs.org/doi/10.1021/acs.jproteome.5b00824)