https://github.com/bcgsc/link_str

Analysis scripts developed for genotyping STRs in linked-read data

# Genotyping STRs in linked-read data

This repository contains Python scripts developed to:
- extract in-repeat reads (IRRs) using barcodes from linked-read alignments ([IRR extraction](irr))
- estimate the sizes of genomic intervals by calculating the Jaccard index (JI) of barcode sharing ([distance estimate](jaccard_index))
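
As a rough illustration of the Jaccard index mentioned above, the sketch below computes it for two sets of barcodes. It is not the code in [jaccard_index](jaccard_index): the function name `jaccard_index` and the barcode strings are hypothetical, and how barcodes are collected from the alignments is left out.

```python
def jaccard_index(barcodes_a, barcodes_b):
    """Return |A ∩ B| / |A ∪ B| for two collections of barcodes."""
    a, b = set(barcodes_a), set(barcodes_b)
    union = a | b
    if not union:
        # Convention chosen for this sketch: two empty sets share nothing.
        return 0.0
    return len(a & b) / len(union)


# Made-up barcodes observed in two genomic windows (illustrative only).
left_window = {"AAAC-1", "AAAG-1", "AACT-1", "ACGT-1"}
right_window = {"AAAG-1", "AACT-1", "TTTT-1"}

# 2 shared barcodes out of 5 distinct barcodes in total.
print(jaccard_index(left_window, right_window))  # 0.4
```

A JI of 1 means the two windows carry exactly the same barcodes, and 0 means they share none.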

## Dependencies
- [NumPy](https://numpy.org/)
- [Pandas](https://pandas.pydata.org/)
- [pybedtools](https://daler.github.io/pybedtools/)
- [pysam](https://github.com/pysam-developers/pysam)
- [TRF](https://tandem.bu.edu/trf/trf.html) (for IRR extraction)
- [blastn](https://ftp.ncbi.nlm.nih.gov/blast/executables/blast+/LATEST/) (for IRR extraction)

Author: Readman Chiu

:copyright: Canada's Michael Smith Genome Sciences Centre, BC Cancer