An open API service indexing awesome lists of open source software.

https://github.com/danko-lab/glimpse2_setup

Scripts to create a phased reference panel from 1kGP for imputation with GLIMPSE
https://github.com/danko-lab/glimpse2_setup

Last synced: 10 months ago
JSON representation

Scripts to create a phased reference panel from 1kGP for imputation with GLIMPSE

Awesome Lists containing this project

README

          

# GLIMPSE2_setup

This repo contains a snakemake pipeline to create a phased reference panel from 1kGP for imputation with GLIMPSE2. This pipeline follows their [tutorial](https://odelaneau.github.io/GLIMPSE/docs/tutorials/getting_started/#3-split-the-genome-into-chunks) and use the most recent high coverage integrated phased panel from [1kGP](http://ftp.1000genomes.ebi.ac.uk/vol1/ftp/data_collections/1000G_2504_high_coverage/working/20220422_3202_phased_SNV_INDEL_SV/).

## Requirements

Software requirements are detailed in the conda environment file (`spec/snakemake.yml`). Paths to the base directory where everything will be downloaded/written and to GLIMPSE should be set by editing `spec/file_spec.json` (by default, this will be written to a subdirectory `outputs` where this repo is downloaded). GLIMPSE uses genetic maps to help decide how to chunk the genome to. The install path for GLIMPSE and parallel must be provided in the spec file.

Note that the X chromosome is bugged, and I have not yet figured out how to fix it. For now, I am just excluding it entirely (both non-PAR and the two PARs).