Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/bcgsc/chromeqc
ChromeQC: Summarize sequencing library quality of 10x Genomics Chromium linked reads
https://github.com/bcgsc/chromeqc
genome genome-sequencing linked-reads quality-control
Last synced: about 2 months ago
JSON representation
ChromeQC: Summarize sequencing library quality of 10x Genomics Chromium linked reads
- Host: GitHub
- URL: https://github.com/bcgsc/chromeqc
- Owner: bcgsc
- License: mit
- Created: 2017-10-10T22:45:13.000Z (about 7 years ago)
- Default Branch: master
- Last Pushed: 2019-04-30T17:17:18.000Z (over 5 years ago)
- Last Synced: 2024-07-31T20:28:40.345Z (5 months ago)
- Topics: genome, genome-sequencing, linked-reads, quality-control
- Language: HTML
- Homepage: https://bcgsc.github.io/chromeqc
- Size: 1.98 MB
- Stars: 15
- Watchers: 18
- Forks: 3
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-linked-reads - ChromeQC - commit/bcgsc/chromeqc?label=%20) (Tools)
README
![ChromeQC logo](chromeqc-logo.png)
# ChromeQC: Summarize library quality of 10x Genomics Chromium linked reads
This tool provides a quick report on the quality of a 10x Genomics Chromium linked reads library. The report summarizes the sizes of the molecules, the number of reads per molecule, the number of molecules per barcode, and the amount of DNA per barcode. The idea is to provide a [FastQC](https://www.bioinformatics.babraham.ac.uk/projects/fastqc/)-like tool in terms of speed but to contain information provided by the Summary page of the [Loupe software of 10x Genomics](https://support.10xgenomics.com/genome-exome/software/visualization/latest/what-is-loupe). ChromeQC is developed in Python 3, R, AWK, RMarkdown, and Flexdashboard, and uses BWA-MEM for read alignment.
# Usage
```
-w --whitelist : default='whitelist_barcodes', type=str
-k --subsample_size: default=4000 , type=int
-i --in : default='-' , type=str
-o --out : default='stdout' , type=str
-s --seed : default=1334 , type=int
-m --max_read_pairs: default=-1 , type=int , note: -1 means all read pairs
-p --stats_out_path: default='.' , type=str , note: the directory needs to be created already
-v --verbose : default=False , no value , note: If supplied, will be set to true, else will be false.
```# Examples
+ [Sample ChromeQC report](http://bcgsc.github.io/chromeqc/report)
+ [Sample MultiQC report](https://bcgsc.github.io/chromeqc/multiqc/)```sh
# Install Homebrew on macOS, Linux, or Windows: https://brew.sh
which pigz || brew install pigz
python3 select_random_subset/random_sampling_from_whitelist.py -v -w data/4M-with-alts-february-2016.txt -i data/read-RA_si-GAGTTAGT_lane-001-chunk-0002.fastq.gz | pigz -p4 >data/subsampled.fq.gz
```The pipeline starts with raw FASTQ files of interleaved paired end reads provided by the 10x Chromium platform.
# Dependencies
```sh
pip3 install -r requirements.txt
brew bundle
```+ BWA or Minimap2
+ Pysam
+ Python 3
+ Samtools# Prerequisites
The analysis and report will be created using R, the Tidyverse, RMarkdown, and Flexdashboard. Familiarity with some of these tools is useful, but not necessary to participate in this project. Non-technical participants are welcome to design the aesthetics of the report, prepare and deliver the presentation, and coordinate writing a brief paper about the tool.
Team Lead: [Shaun Jackman](http://sjackman.ca) | [email protected] | @sjackman | Grad Student | BC Cancer Agency Genome Sciences Centre