An open API service indexing awesome lists of open source software.

https://github.com/JonnyTran/OpenOmics

A bioinformatics API to interface with public multi-omics bio databases for wicked fast data integration.
https://github.com/JonnyTran/OpenOmics

data-integration data-manipulation genomics multi-omics python

Last synced: 2 months ago
JSON representation

A bioinformatics API to interface with public multi-omics bio databases for wicked fast data integration.

Awesome Lists containing this project

README

        





[![PyPI version](https://badge.fury.io/py/openomics.svg)](https://badge.fury.io/py/openomics)
[![Documentation Status](https://readthedocs.org/projects/openomics/badge/?version=latest)](https://openomics.readthedocs.io/en/latest/?badge=latest)
[![OpenOmics](https://github.com/JonnyTran/OpenOmics/actions/workflows/python-package.yml/badge.svg?branch=master)](https://github.com/JonnyTran/OpenOmics/actions/workflows/python-package.yml)
[![codecov](https://codecov.io/gh/JonnyTran/OpenOmics/branch/master/graph/badge.svg?token=6N1UZ27MPH)](https://codecov.io/gh/JonnyTran/OpenOmics)
[![openomics](https://snyk.io/advisor/python/openomics/badge.svg)](https://snyk.io/advisor/python/openomics)

[![pyOpenSci](https://tinyurl.com/y22nb8up)](https://github.com/pyOpenSci/software-review/issues/31)
[![status](https://joss.theoj.org/papers/aca43e3c2989a803b514faef72dd3294/status.svg)](https://joss.theoj.org/papers/aca43e3c2989a803b514faef72dd3294)
[![DOI](https://zenodo.org/badge/125549505.svg)](https://zenodo.org/badge/latestdoi/125549505)

**OpenOmics is currently under active development and we may break API compatibility in the future.**

This Python package provide a series of tools to integrate and explore the genomics, transcriptomics, proteomics, and
clinical data (aka multi-omics data). With interfaces to popular annotation databases and scalable data-frame
manipulation tools, OpenOmics facilitates the common
data wrangling tasks when preparing data for RNA-seq bioinformatics analysis.

Documentation ([Latest](https://openomics.readthedocs.io/en/latest/)
| [Stable](https://openomics.readthedocs.io/en/stable/))
| [OpenOmics at a glance](https://openomics.readthedocs.io/en/latest/usage/getting-started.html)

## Features
OpenOmics assist in integration of heterogeneous multi-omics bioinformatics data. The library provides a Python API as well as an interactive Dash web interface.
It features support for:
- Genomics, Transcriptomics, Proteomics, and Clinical data.
- Harmonization with 20+ popular annotation, interaction, disease-association databases.

OpenOmics also has an efficient data pipeline that bridges the popular data manipulation Pandas library and Dask distributed processing to address the following use cases:

- Providing a standard pipeline for dataset indexing, table joining and querying, which are transparent and customizable
for end-users.
- Providing Efficient disk storage for large multi-omics dataset with Parquet data structures.
- Integrating various data types including interactions and sequence data, then exporting to NetworkX graphs or data generators for down-stream machine learning.
- Accessible by both developers and scientists with a Python API that works seamlessly with an external Galaxy tool interface or the built-in Dash web interface (WIP).

## Installation:
### PyPI
```console
pip install openomics
```

### Conda
```sh
conda install openomics -c jonnytran # Work in progress
```

### From source
```sh
git clone https://github.com/JonnyTran/OpenOmics/
cd OpenOmics
pip install -e .
```

##

## Citations
The journal paper for this scientific package was reviewed by JOSS at , and can be cited with:

# BibTeX
@article{Tran2021,
doi = {10.21105/joss.03249},
url = {https://doi.org/10.21105/joss.03249},
year = {2021},
publisher = {The Open Journal},
volume = {6},
number = {61},
pages = {3249},
author = {Nhat C. Tran and Jean X. Gao},
title = {OpenOmics: A bioinformatics API to integrate multi-omics datasets and interface with public databases.},
journal = {Journal of Open Source Software}
}

## Credits

Thank you for extremely helpful feedback and guidance from the pyOpenSci reviewers. This package was created with the [pyOpenSci/cookiecutter-pyopensci](https://github.com/pyOpenSci/cookiecutter-pyopensci) project template, based off [audreyr/cookiecutter-pypackage](https://github.com/audreyr/cookiecutter-pypackage).