An open API service indexing awesome lists of open source software.

https://github.com/sebastiandha/depy

A hybrid Python-R proteomics DEA package
https://github.com/sebastiandha/depy

bioinformatics differential-expression-analysis metabolomics proteomics python r transcriptomics

Last synced: 29 days ago
JSON representation

A hybrid Python-R proteomics DEA package

Awesome Lists containing this project

README

          

# DEPy

A differential expression analysis package for bulk proteomics (and metabolomics) data, which leverages transcriptomics tools.
Inspired by R tools like DEP and SummarizedExperiment, it brings the power of Bioconductor to Python.
All you need is a matrix of features and their intensity values.

* PyPI package: https://pypi.org/project/summarizedpy/
* GitHub: [SebastianDHA/DEPy](https://github.com/SebastianDHA/DEPy)
* Free software: MIT License

## Features

* SummarizedPY: A container for your -omics data, much like SummarizedExperiment or DEP in R.
* Filtering and subsetting your samples and features
* Missing value filtering
* Imputation using ImputeLCMD (many methods)
* Transforming (log, centering, standardizing, vsn)
* Leverage surrogate variable analysis (sva) to adjust for latent batch effects
* Use the flexibility and power of limma-trend to improve your DEA results and accommodate mixed effects
* Limma arrayWeights to adjust variable sample quality (often an issue in human and animal datasets)
* Visualize your DEA results with elegant volcano plots
* Highly-variable feature selection
* PCA plots
* Saving & loading SummarizedPy objects to & from disk

## Installation
### conda
This is the best way to install DEPy.
```Sh
conda env create -f environment.yml
```
Note that DEPy (summarizedpy) must be run within the [depy conda environment](environment.yml) or a cloned version of it.
This is because summarizedpy needs an isolated environment to run R in due to the complex loading behavior of Bioconductor packages.

## Using pip
```Sh
pip install summarizedpy
```

## Quick start
```Py
import depy as dp

sp = dp.SummarizedPy()
sp = sp.import_from_delim_file(path="path/to/pgroup.tsv", delim="\t")
```
See the full [tutorial](docs/usage.md) for more.

## Documentation
- [GitHub pages](https://sebastiandha.github.io/DEPy/)
- [ReadTheDocs](https://depy.readthedocs.io/en/latest/)

## Citation
DEPy and its theoretical foundations are described in the following paper:
```
Dohm-Hansen, S., et al. (2026).
Expanding the Proteomics and Metabolomics Toolkit with Methods for Differential Expression Analysis from Transcriptomics.
Journal of Proteome Research Article ASAP.
https://doi.org/10.1021/acs.jproteome.5c00719
```

## Credits
This package leverages amazing packages from the R and Bioconductor community, including [limma](https://bioconductor.org/packages/3.20/bioc/html/limma.html), [vsn](https://bioconductor.org/packages/release/bioc/html/vsn.html), [sva](https://bioconductor.org/packages/release/bioc/html/sva.html), [ImputeLCMD](https://cran.r-project.org/package=imputeLCMD), and [Tidyverse](https://www.tidyverse.org/).
This package was created with [Cookiecutter](https://github.com/audreyfeldroy/cookiecutter) and the [audreyfeldroy/cookiecutter-pypackage](https://github.com/audreyfeldroy/cookiecutter-pypackage) project template.