https://github.com/bcgsc/chromeqc

ChromeQC: Summarize sequencing library quality of 10x Genomics Chromium linked reads

genome genome-sequencing linked-reads quality-control


Host: GitHub
URL: https://github.com/bcgsc/chromeqc
Owner: bcgsc
License: mit
Created: 2017-10-10T22:45:13.000Z (almost 8 years ago)
Default Branch: master
Last Pushed: 2019-04-30T17:17:18.000Z (over 6 years ago)
Last Synced: 2024-07-31T20:28:40.345Z (about 1 year ago)
Topics: genome, genome-sequencing, linked-reads, quality-control
Language: HTML
Homepage: https://bcgsc.github.io/chromeqc
Size: 1.98 MB
Stars: 15
Watchers: 18
Forks: 3
Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

- awesome-linked-reads: ChromeQC (Tools)

README

![ChromeQC logo](chromeqc-logo.png)

# ChromeQC: Summarize library quality of 10x Genomics Chromium linked reads

This tool produces a quick report on the quality of a 10x Genomics Chromium linked-reads library. The report summarizes the sizes of the molecules, the number of reads per molecule, the number of molecules per barcode, and the amount of DNA per barcode. The goal is a tool as fast as [FastQC](https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) that reports the information shown on the Summary page of the [Loupe software of 10x Genomics](https://support.10xgenomics.com/genome-exome/software/visualization/latest/what-is-loupe). ChromeQC is developed in Python 3, R, AWK, RMarkdown, and Flexdashboard, and uses BWA-MEM for read alignment.
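To make the four summaries concrete, here is a minimal Python sketch that computes a median for each of them from a list of inferred molecules. The `Molecule` record, its field names, and the `summarize` function are hypothetical and chosen only for illustration; they are not ChromeQC's own data format or code.

```python
from collections import defaultdict
from statistics import median
from typing import NamedTuple


class Molecule(NamedTuple):
    """Hypothetical record for one inferred molecule (not ChromeQC's format)."""
    barcode: str
    start: int  # 0-based start coordinate on the reference
    end: int    # end coordinate, exclusive
    reads: int  # number of reads assigned to this molecule


def summarize(molecules):
    """Return the median of each of the four quantities the report describes."""
    sizes = [m.end - m.start for m in molecules]
    reads = [m.reads for m in molecules]
    molecules_per_barcode = defaultdict(int)
    dna_per_barcode = defaultdict(int)
    for m in molecules:
        molecules_per_barcode[m.barcode] += 1
        # DNA per barcode is taken here as the summed molecule sizes.
        dna_per_barcode[m.barcode] += m.end - m.start
    return {
        "median_molecule_size": median(sizes),
        "median_reads_per_molecule": median(reads),
        "median_molecules_per_barcode": median(molecules_per_barcode.values()),
        "median_dna_per_barcode": median(dna_per_barcode.values()),
    }


# Made-up example data: two molecules share one barcode, one has its own.
example = [
    Molecule("AAAC", 0, 50_000, 20),
    Molecule("AAAC", 900_000, 930_000, 10),
    Molecule("GGTT", 100, 40_100, 16),
]
summary = summarize(example)
```

The actual report presents distributions rather than single medians; the sketch only shows how the four quantities relate to per-molecule records.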

# Usage

```
-w --whitelist      : default='whitelist_barcodes', type=str
-k --subsample_size : default=4000, type=int
-i --in             : default='-', type=str
-o --out            : default='stdout', type=str
-s --seed           : default=1334, type=int
-m --max_read_pairs : default=-1, type=int, note: -1 means all read pairs
-p --stats_out_path : default='.', type=str, note: the directory must already exist
-v --verbose        : default=False, no value, note: if supplied, set to true; otherwise false
```
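
As an illustration of how the options above could be declared, here is a sketch using Python's standard `argparse` module. The flag names, types, and defaults are copied from the list; the `build_parser` function, the description string, and the `infile` attribute name are assumptions and may not match the actual script.

```python
import argparse


def build_parser():
    """Build a parser mirroring the documented options (illustrative only)."""
    parser = argparse.ArgumentParser(
        description="Subsample linked reads by barcode (hypothetical description)."
    )
    parser.add_argument("-w", "--whitelist", type=str, default="whitelist_barcodes")
    parser.add_argument("-k", "--subsample_size", type=int, default=4000)
    # `in` is a Python keyword, so the value is stored under another name.
    parser.add_argument("-i", "--in", dest="infile", type=str, default="-")
    parser.add_argument("-o", "--out", type=str, default="stdout")
    parser.add_argument("-s", "--seed", type=int, default=1334)
    # -1 means all read pairs.
    parser.add_argument("-m", "--max_read_pairs", type=int, default=-1)
    # The directory is expected to exist already.
    parser.add_argument("-p", "--stats_out_path", type=str, default=".")
    # A flag with no value: present means True, absent means False.
    parser.add_argument("-v", "--verbose", action="store_true")
    return parser


# Parse an explicit argument list instead of sys.argv for demonstration.
args = build_parser().parse_args(["-k", "100", "-v"])
```

Unspecified options fall back to the defaults listed above, so `args.seed` is 1334 and `args.infile` is `'-'` in this example.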

# Examples

+ [Sample ChromeQC report](http://bcgsc.github.io/chromeqc/report)
+ [Sample MultiQC report](https://bcgsc.github.io/chromeqc/multiqc/)

```sh
# Install Homebrew on macOS, Linux, or Windows: https://brew.sh
which pigz || brew install pigz
python3 select_random_subset/random_sampling_from_whitelist.py -v \
  -w data/4M-with-alts-february-2016.txt \
  -i data/read-RA_si-GAGTTAGT_lane-001-chunk-0002.fastq.gz \
  | pigz -p4 >data/subsampled.fq.gz
```

The pipeline starts with raw FASTQ files of interleaved paired-end reads produced by the 10x Chromium platform.
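
The subsampling step can be sketched as follows, assuming it draws a seeded random subset of whitelist barcodes and keeps only read pairs whose barcode is in that subset. This is not the code of `random_sampling_from_whitelist.py`: the function names are hypothetical, and the sketch assumes the barcode is the first 16 bases of read 1, which may differ from how the script extracts it.

```python
import random
from itertools import islice

BARCODE_LENGTH = 16  # assumption: barcode is the first 16 bases of read 1


def sample_barcodes(whitelist, k, seed):
    """Pick k barcodes reproducibly; sorting makes the result order-independent."""
    rng = random.Random(seed)
    return set(rng.sample(sorted(whitelist), k))


def read_pairs(lines):
    """Yield (read1, read2) from interleaved FASTQ lines, 4 lines per read.

    A trailing incomplete pair is silently dropped.
    """
    it = iter(lines)
    while True:
        chunk = list(islice(it, 8))
        if len(chunk) < 8:
            return
        yield chunk[:4], chunk[4:]


def filter_pairs(lines, barcodes):
    """Keep read pairs whose read 1 starts with a selected barcode."""
    for read1, read2 in read_pairs(lines):
        if read1[1][:BARCODE_LENGTH] in barcodes:
            yield read1, read2


# Made-up data: two read pairs with different barcodes.
whitelist = ["A" * 16, "C" * 16, "G" * 16]
fastq = [
    "@r1/1", "A" * 16 + "TTTT", "+", "I" * 20,
    "@r1/2", "GGGG", "+", "IIII",
    "@r2/1", "C" * 16 + "TTTT", "+", "I" * 20,
    "@r2/2", "GGGG", "+", "IIII",
]
kept = list(filter_pairs(fastq, {"A" * 16}))
chosen = sample_barcodes(whitelist, 2, seed=1334)
```

Fixing the seed makes the selected barcode set reproducible across runs, which is presumably why the script exposes `--seed`.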

# Dependencies

```sh
pip3 install -r requirements.txt
brew bundle
```

+ BWA or Minimap2
+ Pysam
+ Python 3
+ Samtools

# Prerequisites

The analysis and report will be created using R, the Tidyverse, RMarkdown, and Flexdashboard. Familiarity with some of these tools is useful, but not necessary to participate in this project. Non-technical participants are welcome to design the aesthetics of the report, prepare and deliver the presentation, and coordinate writing a brief paper about the tool.

Team Lead: [Shaun Jackman](http://sjackman.ca) | sjackman@gmail.com | @sjackman | Grad Student | BC Cancer Agency Genome Sciences Centre
