{"id":20921147,"url":"https://github.com/czheluo/mbsa","last_synced_at":"2025-05-13T15:34:07.731Z","repository":{"id":165921439,"uuid":"179829485","full_name":"czheluo/MBSA","owner":"czheluo","description":"Multple methods for BSA Pipeline","archived":false,"fork":false,"pushed_at":"2019-12-22T16:13:51.000Z","size":2529,"stargazers_count":7,"open_issues_count":1,"forks_count":4,"subscribers_count":3,"default_branch":"master","last_synced_at":"2023-10-20T19:09:46.416Z","etag":null,"topics":["bsa","bsr","fine-mapping","ngs-analysis","qtl-analyses","rnaseq","stats"],"latest_commit_sha":null,"homepage":"","language":"R","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/czheluo.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null}},"created_at":"2019-04-06T12:16:59.000Z","updated_at":"2023-10-20T19:09:51.367Z","dependencies_parsed_at":null,"dependency_job_id":"3104c1cc-4131-4f41-9d6e-74c05873031c","html_url":"https://github.com/czheluo/MBSA","commit_stats":null,"previous_names":[],"tags_count":0,"template":null,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/czheluo%2FMBSA","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/czheluo%2FMBSA/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/czheluo%2FMBSA/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/czheluo%2FMBSA/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/czheluo","download_url":"https://codeload.github.com/czheluo/MBSA/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":225241567,"owners_count":17443140,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["bsa","bsr","fine-mapping","ngs-analysis","qtl-analyses","rnaseq","stats"],"created_at":"2024-11-18T19:42:34.763Z","updated_at":"2024-11-18T19:42:35.404Z","avatar_url":"https://github.com/czheluo.png","language":"R","funding_links":[],"categories":[],"sub_categories":[],"readme":"\n\u003c!-- README.md is generated from README.Rmd. Please edit that file --\u003e\n## BSA workflow\n\n![BSA Schematic](pipeline.jpg \"BSA Schematic\")\n\n\u003eSO this pipeline that aiming to use [MutMap](https://www.nature.com/articles/nbt.2095) and [QTL-seq](https://onlinelibrary.wiley.com/doi/full/10.1111/tpj.12105) method which rapidly mapping of quantitative trait loci in animal or plant species by whole genome resequencing of DNA from two bulked populations (F1, F2, DH or RIL). For more details, Please see the mutmap and QTL-seq paper. \n\nQTL-seq method:\n\u003e Takagi, H., Abe, A., Yoshida, K., Kosugi, S., Natsume, S., Mitsuoka, C., Uemura, A., Utsushi, H., Tamiru, M., Takuno, S., Innan, H., Cano, L. M., Kamoun, S. and Terauchi, R. (2013), QTL-seq: rapid mapping of quantitative trait loci in rice by whole genome resequencing of DNA from two bulked populations. *Plant J*, 74: 174–183.[doi:10.1111/tpj.12105](https://onlinelibrary.wiley.com/doi/full/10.1111/tpj.12105)\n\nG prime method:\n\n\u003e Magwene PM, Willis JH, Kelly JK (2011) The Statistics of Bulk Segregant Analysis Using Next Generation Sequencing. *PLOS Computational Biology* 7(11): e1002255.[doi.org/10.1371/journal.pcbi.1002255](http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1002255)\n\nEuclidean distance calculation (ED-\u003eMAPPR)\n\u003eHill J T , Demarest B L , Bisgrove B W , et al. MMAPPR: Mutation Mapping Analysis Pipeline for Pooled RNA-seq. [Genome Research](https://genome.cshlp.org/content/23/4/687.long), 2013, 23(4):687-697.\nBulked Segregant RNA-Seq (BSR-Seq)\n\u003e Liu S, Yeh CT, Tang HM, Nettleton D, Schnable PS (2012) Gene Mapping via Bulked Segregant RNA-Seq (BSR-Seq). PLOS ONE 7(5): e36406. https://doi.org/10.1371/journal.pone.0036406\n\n\n## Installation \nMBSA pipeline depends on some R packages, before using it make sure you alread install these packages :\n\n\u003c!-- install packages --\u003e\n\n\n``` r\n#install QTLseqr\n# install devtools first to download packages from github\ninstall.packages(\"devtools\")\n# use devtools to install QTLseqr\ndevtools::install_github(\"bmansfeld/QTLseqr\")\n# and basics R packages\nlibrary(magrittr)\nlibrary(dplyr)\nlibrary(tidyr)\nlibrary(parallel)\nlibrary(grid)\nlibrary(ggplot2)\nlibrary(gridExtra)\n```\n### USAGE\n\n```perl\n$perl bsa.pipeline.pl\nContact:        czheluo@gmail.com\nScript:                 bsa.pipeline.pl\nDescription:\n        Multple methods for BSA Pipeline \nUsage:\n  Options:\n        -vcf   pop.final.vcf\n        -ann pop.summary or anno.summary\n        -ref ref.fa file or if you want to use the GATK to generate TABLE (default is NO)\n        -pid  parental name A,B\n        -bid  bulk name C,D\n        -out out dir\n        -bs    bulk size default was 30\n        -ws window size default was 1M\n        -pta pemutation test confidence interval\n        -ptb pemutation test confidence interval\n        -tgl  total genetic length (default was 2000)\n        -bs Set N to be the number of individuals in the mutant pool\n        -step which step you want\n        -stop control the step\n        -h      Help\n```\n1.methods\n\nMBSA provides several different statistics for analysis as follows:\n* ED^4 = EuclideanDist (default) -\u003e Euclidean Distance\n* SNPindex -\u003e Delta SNPindex :\nThe analysis is based on calculating\nthe allele frequency differences, or ∆(SNP-index), from the allele depths at each SNP. To determine regions\nof the genome that significantly differ from the expected ∆(SNP-index) of 0.\n\n* G’ analysis :\nAn alternate approach to determine statistical significance of QTL from NGS-BSA was proposed by Magwene et al. (2011) – calculating a modified G statistic for each SNP based on the observed and expected allele\ndepths and smoothing this value using a tricube smoothing kernel. Using the smoothed G statistic, or G’, Magwene et al.\n* Bulked Segregant RNA-Seq (BSR-Seq) : \nAn empirical Bayesian approach was used to estimate, for each SNP, the conditional probability of no recombination between the SNP marker and the causal gene in the mutant pool, given the SNP allele-specific counts.\n\n2.Inputs\n\nThe main input file is the VCF file which contains genomic variants for  two bulks and parental bulks, and your species genome annotation (blast to GO, KEGG, interproscan, NT/NR etc.). For the genomic variant calling, for WGS, I'd love to recommendate using [GATK](https://software.broadinstitute.org/gatk/), and RNAseq you can just using the [samtools](http://samtools.sourceforge.net/).\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fczheluo%2Fmbsa","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fczheluo%2Fmbsa","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fczheluo%2Fmbsa/lists"}