https://github.com/bionode/bionode-watermill
💧 Bionode-Watermill: A (Not Yet Streaming) Workflow Engine
- Host: GitHub
- URL: https://github.com/bionode/bionode-watermill
- Owner: bionode
- License: mit
- Created: 2016-06-20T17:13:21.000Z (about 10 years ago)
- Default Branch: master
- Last Pushed: 2018-06-27T16:45:21.000Z (almost 8 years ago)
- Last Synced: 2024-04-28T09:22:07.907Z (about 2 years ago)
- Topics: bioinformatics, bionode, nodejs, pipeline, tool
- Language: JavaScript
- Homepage: https://bionode.gitbooks.io/bionode-watermill/content/
- Size: 679 KB
- Stars: 37
- Watchers: 19
- Forks: 11
- Open Issues: 30
Metadata Files:
- Readme: README.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
README
# bionode-watermill
> Bionode-watermill: A (Not Yet Streaming) Workflow Engine
## Table of Contents
* [What is bionode-watermill](#what-is-bionode-watermill)
* [Main features](#main-features)
* [Who is this tool for?](#who-is-this-tool-for)
* [Installation](#installation)
* [Documentation](#documentation)
* [Tutorial](#tutorial)
* [Example pipelines](#example-pipelines)
* [Why bionode-watermill?](#why-bionode-watermill)
* [Contributing](#contributing)
## What is bionode-watermill
**Bionode-watermill** is a workflow engine that lets you assemble and run
bioinformatic pipelines with ease and less overhead. Bionode-watermill
pipelines are essentially Node.js scripts in which
[tasks](docs/BeginnerWalkthrough.md#task) are the modules that are
assembled into the final *pipeline* using [orchestrators](docs/BeginnerWalkthrough.md#orchestrators).
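To make the idea of tasks assembled by orchestrators concrete, here is a minimal, self-contained sketch in plain JavaScript. It does not use bionode-watermill and is not its API: the helpers `task`, `join`, and `junction` below are hypothetical toy stand-ins that only illustrate the shape of the approach (small reusable steps, composed in sequence or side by side).

```javascript
'use strict'

// Toy stand-ins, NOT the bionode-watermill API.
// A "task" is a named function from an input value to an output value.
const log = []
const task = (name, operation) => (input) => {
  log.push(name) // record execution order for inspection
  return operation(input)
}

// Toy orchestrators (hypothetical):
// join runs tasks one after another, feeding each output to the next task.
const join = (...tasks) => (input) =>
  tasks.reduce((value, run) => run(value), input)
// junction hands the same input to every task and collects all results.
const junction = (...tasks) => (input) => tasks.map((run) => run(input))

// Reusable tasks operating on an array of sequence strings.
const trim = task('trim', (reads) => reads.map((r) => r.trim()))
const upper = task('upper', (reads) => reads.map((r) => r.toUpperCase()))
const count = task('count', (reads) => reads.length)
const longest = task('longest', (reads) =>
  reads.reduce((a, b) => (b.length > a.length ? b : a), ''))

// The pipeline is itself just a function built from tasks.
const pipeline = join(trim, upper, junction(count, longest))
const result = pipeline([' acgt ', 'ttga', ' ac '])

console.log(result) // [ 3, 'ACGT' ]
console.log(log) // [ 'trim', 'upper', 'count', 'longest' ]
```

Because each task is an ordinary value, the same `trim` or `upper` step could be dropped into a second pipeline unchanged, which is the kind of reuse the feature list below describes. See the linked walkthrough for the real task and orchestrator definitions.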
### Main features
* Lots of **modularity** - tasks can be recycled as many times as you want!
* **Reusability** - tasks can be reused many times within and between pipelines.
* **Automated Input/Output handling** - no need to worry about input/output
location, bionode-watermill does that for you.
* Ability to **run programs using Unix shell** - as demonstrated by `myTask`.
There is no need to reinvent the wheel: you can use your existing
scripts and programs within the bionode-watermill framework.
* **Node.js integration** - not explored here, but you can use JavaScript
alongside bionode-watermill tasks and pipelines, and even inside tasks
instead of Unix commands.
* [Streamable tasks](docs/Task.md#streamable-tasks-potential) (still not
implemented - Issue [#79](https://github.com/bionode/bionode-watermill/issues/79))
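As a rough illustration of mixing Unix commands with JavaScript in one script, the sketch below uses only Node.js built-ins (`child_process`, `fs`, `os`, `path`). It is not how bionode-watermill runs tasks internally; the file names and the `tr` command are assumptions chosen for the example, and it expects a POSIX shell to be available.

```javascript
'use strict'

const { execSync } = require('child_process')
const fs = require('fs')
const os = require('os')
const path = require('path')

// Work in a fresh temporary directory so the example leaves no stray files
// in the current folder. File names here are hypothetical.
const workDir = fs.mkdtempSync(path.join(os.tmpdir(), 'shell-step-'))
const input = path.join(workDir, 'reads.txt')
const output = path.join(workDir, 'reads.upper.txt')
fs.writeFileSync(input, 'acgt\nttga\n')

// Step 1: an existing Unix tool does the work. The command is an ordinary
// template string, so input and output paths are filled in from JavaScript.
execSync(`tr 'a-z' 'A-Z' < "${input}" > "${output}"`)

// Step 2: plain JavaScript picks up where the shell command left off.
const lines = fs.readFileSync(output, 'utf8').trim().split('\n')
const lineCount = lines.length

console.log(lines) // [ 'ACGT', 'TTGA' ]
console.log(lineCount) // 2
```

The point of the sketch is only that a shell step and a JavaScript step can share the same inputs and outputs; in bionode-watermill the input/output bookkeeping is handled for you, as noted in the list above.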
### Who is this tool for?
Bionode-watermill is for **biologists** who understand it is important to
experiment with sample data, parameter values, and tools. Compared to other
workflow systems, the ease of swapping around parameters and tools is much
improved, allowing you to iteratively compare results and construct more
confident inferences. Consider the ability to construct your own
[Teaser](https://genomebiology.biomedcentral.com/articles/10.1186/s13059-015-0803-1)
for *your data* with a *simple syntax*, and to get the utmost performance out of the box.
Bionode-watermill is for **programmers** who desire an efficient and
easy-to-write methodology for developing complex and dynamic data pipelines,
while handling parallelization as much as possible. Bionode-watermill is an npm
module, and is accessible to anyone willing to learn a little JavaScript. This
is in contrast to other tools that develop their own DSL
(domain-specific language), which is not useful outside the tool. By leveraging
the npm ecosystem and JavaScript on the client, Bionode-watermill can be built
upon for inclusion in web APIs, modern web applications, as well as native
applications through [Electron](http://electron.atom.io/). Look forward to
seeing Galaxy-like applications backed by a completely configurable Node API.
## Installation
Local installation:
```npm install bionode-watermill```
Global installation:
```npm install bionode-watermill -g```
## Documentation
Our documentation is available [here](https://bionode.gitbooks.io/bionode-watermill/content/).
There you may find how to **use** bionode-watermill to construct and **run**
your pipelines. Moreover, you will also find the description of the API to help
anyone willing to **contribute**.
## Tutorial
- [Try the bionode-watermill tutorial!](https://github.com/bionode/bionode-watermill-tutorial)
## Example pipelines
- [Toy pipeline with shell/node](https://github.com/bionode/bionode-watermill/blob/master/examples/pipelines/pids/pipeline.js)
- [Simple capitalize task](https://github.com/bionode/bionode-watermill/blob/master/examples/pipelines/capitalize/capitalize.js)
- [Simple SNP calling](https://github.com/bionode/bionode-watermill/blob/master/examples/pipelines/variant-calling-simple/pipeline.js)
- [SNP calling with filtering and fork](https://github.com/bionode/bionode-watermill/blob/master/examples/pipelines/variant-calling-filtered/pipeline.js)
- [Mapping with bowtie2 and bwa (with tutorial)](https://github.com/bionode/bionode-watermill/tree/master/examples/pipelines/two-mappers)
## Why bionode-watermill?
[This blog post](https://jmazz.me/blog/NGS-Workflows)
compares the available tools to deal with NGS workflows, explaining the
advantages of each one, including **bionode-watermill**.
## Contributing
We welcome all kinds of contributions at all levels of experience. Please
refer to the [Issues section](https://github.com/bionode/bionode-watermill/issues).
You can also always reach us on [gitter](https://gitter.im/bionode/bionode-watermill).
### Feel free to submit your pipeline to us
Just open a PR that adds a pipeline under `./examples/pipelines/`.
You can check some of the already existing examples [here](examples/pipelines).