https://github.com/thierrygosselin/stackr
stackr: an R package to run stacks software pipeline
https://github.com/thierrygosselin/stackr
filter gbs genetics genomic-data-analysis genomics genomics-visualization genotype-likelihoods genotyping-by-sequencing rad radseq radseq-data stackr
Last synced: 5 months ago
JSON representation
stackr: an R package to run stacks software pipeline
- Host: GitHub
- URL: https://github.com/thierrygosselin/stackr
- Owner: thierrygosselin
- Created: 2015-05-23T03:21:36.000Z (almost 10 years ago)
- Default Branch: master
- Last Pushed: 2020-11-05T13:51:25.000Z (over 4 years ago)
- Last Synced: 2024-08-13T07:12:08.438Z (8 months ago)
- Topics: filter, gbs, genetics, genomic-data-analysis, genomics, genomics-visualization, genotype-likelihoods, genotyping-by-sequencing, rad, radseq, radseq-data, stackr
- Language: R
- Homepage: http://thierrygosselin.github.io/stackr/
- Size: 17.7 MB
- Stars: 27
- Watchers: 12
- Forks: 8
- Open Issues: 0
-
Metadata Files:
- Readme: README.Rmd
Awesome Lists containing this project
- jimsghstars - thierrygosselin/stackr - stackr: an R package to run stacks software pipeline (R)
README
---
output: github_document
---```{r, echo = FALSE}
description <- readLines("DESCRIPTION")
rvers <- stringr::str_match(grep("R \\(", description, value = TRUE), "[0-9]{1,4}\\.[0-9]{1,4}\\.[0-9]{1,4}")[1,1]
version <- gsub(" ", "", gsub("Version:", "", grep("Version:", description, value = TRUE)))
```[](https://tidyverse.org/lifecycle/#experimental)
[](http://cran.r-project.org/package=stackr)
[](http://www.repostatus.org/#active)
[](https://zenodo.org/badge/latestdoi/14548/thierrygosselin/stackr)
[](commits/master)
[)`-brightgreen.svg)](/commits/master)
[](https://cran.r-project.org/)```{r, echo = FALSE}
knitr::opts_chunk$set(
collapse = TRUE,
comment = "#>",
fig.path = "README-"
)
```# stackr: an R package to run stacks software pipeline
This is the development page of the **stackr**.
**What's the difference with running stacks directly in the terminal?**
Besides running stacks within R, not much, tiny differences here and there
that speed up my RADseq workflow:* The philosophy of working by project with pre-organized folders.
* Some important steps are **parallelized**.
* You have more than 1 sequencing chip/lane ? This workflow will save you lots of time.
* **Technical replicates**, inside or across chip/lanes are managed uniquely.
* Noise reduction.
* Data normalization.
* **nightmares because of a crashed computer/cluster/server?** stackr manage
stacks unique integer (previously called SQL IDs) throughout the pipeline. It's
integrated from the start, making it a breeze to just re-start your pipeline after a crash!
* **mismatch testing:** *de novo* mismatch threshold series is integrated
inside `run_ustacks` and stackr will produce tables and figures automatically.
* **catalog**: for bigger sampling size project, breaking down the catalog into
several separate *cstacks* steps makes the pipeline more rigorous if your
computer/cluster/server crash.
* **logs** generated by stacks are read and transferred in human-readable tables/tibbles.
Detecting problems is easier.
* summary of different stacks modules: available automatically inside stackr
pipeline, but also available for users who didn't use stackr to run stacks.
* For me all this = increased reproducibly.**Who's it for?**
* It's currently developed with my own projects in mind.
* To help collaborators to get the most out of stacks.It's not for R or stacks beginners. stacks related issues should be highlighted
on [stacks google group](https://groups.google.com/forum/?fromgroups#!forum/stacks-users).## Installation
To try out the dev version of **stackr**, copy/paste the code below:```r
if (!require("devtools")) install.packages("devtools")
devtools::install_github("thierrygosselin/stackr")
library(stackr)
```## Citation:
To get the citation, inside R:
```r
citation("stackr")
```Web site with additional info: [http://thierrygosselin.github.io/stackr/](http://thierrygosselin.github.io/stackr/)
* [Computer setup and troubleshooting](https://thierrygosselin.github.io/radiator/articles/rad_genomics_computer_setup.html)
* [Vignettes](https://thierrygosselin.github.io/radiator/articles/index.html)## Life cycle
stackr is maturing, but in order to make the package better, changes are
inevitable. Argument names are very stable and follows stacks development closely.* Philosophy, major changes and deprecated functions/arguments are documented in
life cycle section of functions.
* The latest changes are documented in [changelog, versions, new features and bug history](http://thierrygosselin.github.io/stackr/news/index.html)
* [issues](https://github.com/thierrygosselin/stackr/issues/new/choose) and [contributions](https://github.com/thierrygosselin/stackr/issues/new/choose)## Stacks modules and RADseq typical workflow
**stackr** package provides wrapper functions to run
[STACKS](http://catchenlab.life.illinois.edu/stacks/) *process_radtags*,
*ustacks*, *cstacks*, *sstacks*, *rxstacks* and *populations* inside R.Below, a flow chart
showing the corresponding stacks modules and stackr corresponding functions.
