Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/mrc-ide/mlscluster
mlscluster: multilevel selection clustering algorithm
https://github.com/mrc-ide/mlscluster
Last synced: 10 days ago
JSON representation
mlscluster: multilevel selection clustering algorithm
- Host: GitHub
- URL: https://github.com/mrc-ide/mlscluster
- Owner: mrc-ide
- License: mit
- Created: 2022-07-27T13:38:12.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2024-03-15T10:33:13.000Z (10 months ago)
- Last Synced: 2024-11-06T02:31:33.338Z (about 2 months ago)
- Language: R
- Homepage: https://doi.org/10.12688/wellcomeopenres.20704.1
- Size: 76.4 MB
- Stars: 1
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# mlscluster
[![DOI](https://zenodo.org/badge/518473195.svg)](https://zenodo.org/doi/10.5281/zenodo.10520061)
**[m]ulti[l]evel [s]election [cluster]ing**: tree-based variant fitness clustering algorithm
*mlscluster* is an algorithm that computes three statistics (ratio of descendant
clade sizes, ratio of persistence times, and logistic growth rates) for sister clades
in huge time-scaled phylogenies. Clade-defining homoplasies are matched with these
statistics and results for different cluster thresholds are produced.
Therefore, this method highlights mutations (or transmission fitness polymorphisms [TFPs])
which likely present different selective pressures within and between hosts (multilevel selection).
This package contains functions to:* Compute the multilevel selectives pressures across thresholds;
* Perform statistical analysis and graphical visualisation for all thresholds to identify lineages
and genomic regions enriched for TFPs;
* Quantify the false discovery rate of the algorithm for each threshold;
* Generate multiple lineage and gene-specific plots of TFPs for the threshold selected by the user.For a complete description of the statistical methods incorporated into this
package, see our paper:Bonetti Franceschi, V., & Volz, E. (2024). Phylogenetic signatures reveal multilevel selection and fitness costs in SARS-CoV-2. Wellcome Open Research, 9, 85. [https://doi.org/10.12688/wellcomeopenres.20704.1](https://doi.org/10.12688/wellcomeopenres.20704.1)
## Installation
In R, install the `devtools` package and run:
```
devtools::install_github('mrc-ide/mlscluster')
```## Documentation
See the [SARS-CoV-2 computational experiments from our paper](https://github.com/vinibfranc/mlscluster-experiments) for examples of how to use *mlscluster*.
If you need guidance on how to run some functions, see their help pages. For example, within R you can try:
```
help(mlsclust)
```## Maintenance
The package currently support SARS-CoV-2 analyses only. Further support for other pathogens, especially HIV, is planned. This involves adding genomic coordinates and specific sanity checks for known sequencing artifacts of each pathogen. Watch this space for future releases or contact us if you are interested in such applications.