{"id":23768541,"url":"https://github.com/nci-gdc/somaticsniper-tool","last_synced_at":"2026-05-03T13:42:35.055Z","repository":{"id":52691753,"uuid":"49169878","full_name":"NCI-GDC/somaticsniper-tool","owner":"NCI-GDC","description":"GDC SomaticSniper docker","archived":false,"fork":false,"pushed_at":"2022-12-08T11:57:11.000Z","size":60,"stargazers_count":0,"open_issues_count":1,"forks_count":1,"subscribers_count":10,"default_branch":"master","last_synced_at":"2025-01-01T01:37:20.824Z","etag":null,"topics":["bioinformatics","docker","workflow-tool"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":false,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/NCI-GDC.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":"CONTRIBUTING.md","funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2016-01-07T00:16:55.000Z","updated_at":"2021-04-20T21:50:12.000Z","dependencies_parsed_at":"2023-01-25T13:45:52.048Z","dependency_job_id":null,"html_url":"https://github.com/NCI-GDC/somaticsniper-tool","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/NCI-GDC%2Fsomaticsniper-tool","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/NCI-GDC%2Fsomaticsniper-tool/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/NCI-GDC%2Fsomaticsniper-tool/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/NCI-GDC%2Fsomaticsniper-tool/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/NCI-GDC","download_url":"https://codeload.github.com/NCI-GDC/somaticsniper-tool/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":239946915,"owners_count":19723018,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["bioinformatics","docker","workflow-tool"],"created_at":"2025-01-01T01:37:23.075Z","updated_at":"2026-03-27T18:30:19.234Z","avatar_url":"https://github.com/NCI-GDC.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# GDC SomaticSniper\n![Version badge](https://img.shields.io/badge/SomaticSniper-1.0.5.0-\u003cCOLOR\u003e.svg)\n\nThe purpose of SomaticSniper is to identify single nucleotide positions that are different between tumor and normal. It takes a tumor bam and a normal bam and compares the two to determine the differences.\n\nOriginal SomaticSniper: http://gmt.genome.wustl.edu/packages/somatic-sniper/index.html\n\n## Docker\n\nThere are three `Dockerfile`s for different purposes:\n\n* Vanilla BAM-readcount\n  * `/docker/bam-readcount/Dockerfile` : BAM-readcount docker without additional features. The `bam-readcount` will be used for the fpfilter.\n* Vanilla SomaticSniper\n  * `/docker/somaticsniper/Dockerfile` : SomaticSniper docker without additional features.\n* Multi-threading SomaticSniper\n  * `/docker/multi_muse/Dockerfile` : A python multi-threading implementation on SomaticSniper with builtin `LOH` and `High confidence` filtering. To achieve `scatter/gather` method on Docker level, input needs an array of tumor-normal mpileup from `samtools mpileup -f`.\n\n## How to build\n\nhttps://docs.docker.com/engine/reference/builder/\n\nThe docker images are tested under multiple environments. The most tested ones are:\n* Docker version 19.03.2, build 6a30dfc\n* Docker version 18.09.1, build 4c52b90\n* Docker version 18.03.0-ce, build 0520e24\n* Docker version 17.12.1-ce, build 7390fc6\n\n## For external users\nThe repository has only been tested on GDC data and in the particular environment GDC is running in. Some of the reference data required for the workflow production are hosted in [GDC reference files](https://gdc.cancer.gov/about-data/data-harmonization-and-generation/gdc-reference-files \"GDC reference files\"). For any questions related to GDC data, please contact the GDC Help Desk at support@nci-gdc.datacommons.io.\n\nThere is a production-ready CWL example at https://github.com/NCI-GDC/somaticsniper-cwl which uses the docker images that are built from the `Dockerfile`s in this repo.\n\nTo use docker images directly or with other workflow languages, we recommend to build and use either vanilla SomaticSniper or multi-threading SomaticSniper.\n\nTo run multi-threading SomaticSniper:\n\n```\n[INFO] [20200110 04:32:14] [multi_somaticsniper] - --------------------------------------------------------------------------------\nusage: Internal multithreading SomaticSniper calling. [-h] -f REFERENCE_PATH\n                                                      -t TUMOR_BAM -n\n                                                      NORMAL_BAM -c\n                                                      THREAD_COUNT -m MPILEUP\n                                                      [-q MAP_Q] [-Q BASE_Q]\n                                                      [-L] [-G] [-p] [-J]\n                                                      [-s PPS] [-T THETA]\n                                                      [-N NHAP] [-r PD]\n                                                      [-F FOUT]\n\noptional arguments:\n  -h, --help            show this help message and exit\n  -f REFERENCE_PATH, --reference_path REFERENCE_PATH\n                        Reference path.\n  -t TUMOR_BAM, --tumor_bam TUMOR_BAM\n                        Tumor bam file.\n  -n NORMAL_BAM, --normal_bam NORMAL_BAM\n                        Normal bam file.\n  -c THREAD_COUNT, --thread_count THREAD_COUNT\n                        Number of thread.\n  -m MPILEUP, --mpileup MPILEUP\n                        A list of normal-tumor samtools mpileup files on\n                        different region. Created by \"samtools mpileup -f\".\n                        The file name must contain region. e.g.\n                        chr1-1-248956422.mpileup\n  -q MAP_Q, --map_q MAP_Q\n                        filtering reads with mapping quality less than this\n                        value.\n  -Q BASE_Q, --base_q BASE_Q\n                        filtering somatic snv output with somatic quality less\n                        than this value.\n  -L, --loh             do not report LOH variants as determined by genotypes\n                        (T/F).\n  -G, --gor             do not report Gain of Reference variants as determined\n                        by genotypes (T/F).\n  -p, --psc             disable priors in the somatic calculation. Increases\n                        sensitivity for solid tumors (T/F).\n  -J, --ppa             Use prior probabilities accounting for the somatic\n                        mutation rate (T/F).\n  -s PPS, --pps PPS     prior probability of a somatic mutation (implies -J).\n  -T THETA, --theta THETA\n                        theta in maq consensus calling model (for -c/-g).\n  -N NHAP, --nhap NHAP  number of haplotypes in the sample.\n  -r PD, --pd PD        prior of a difference between two haplotypes.\n  -F FOUT, --fout FOUT  output format (classic/vcf/bed).\n```\n\n## For GDC users\n\nSee https://github.com/NCI-GDC/gdc-somatic-variant-calling-workflow.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fnci-gdc%2Fsomaticsniper-tool","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fnci-gdc%2Fsomaticsniper-tool","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fnci-gdc%2Fsomaticsniper-tool/lists"}