https://github.com/nylander/resample_fasta
Resample large FASTA alignments - fast!
- Host: GitHub
- URL: https://github.com/nylander/resample_fasta
- Owner: nylander
- License: mit
- Created: 2018-04-12T12:35:32.000Z (about 8 years ago)
- Default Branch: main
- Last Pushed: 2025-03-28T16:46:15.000Z (about 1 year ago)
- Last Synced: 2025-07-21T13:42:11.354Z (11 months ago)
- Language: Perl
- Size: 147 KB
- Stars: 0
- Watchers: 1
- Forks: 1
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
# Resample FASTA alignments
- Last modified: Fri Mar 28, 2025 05:24
- Sign: Johan Nylander
## Description
Sample random columns from a set of aligned, FASTA-formatted sequences.
That is, sample positions in the first sequence, then sample
exactly the same positions in all subsequent sequences.
Written with low memory requirements and speed in mind.
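
To illustrate the idea (not the actual `refast` implementation), here is a minimal Python sketch of column resampling. It assumes that columns are drawn without replacement, that the sampled columns keep their original order, and that the number of columns is the alignment length times the fraction, rounded. None of these details are confirmed by this README. The function names `read_fasta`, `pick_columns`, and `resample` are hypothetical.

```python
import random


def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def pick_columns(length, fraction, rng):
    """Pick round(length * fraction) distinct column indices, in ascending order.

    Assumption: sampling without replacement, original column order kept.
    """
    k = round(length * fraction)
    return sorted(rng.sample(range(length), k))


def resample(records, fraction=0.5, wrap=60, seed=None):
    """Return FASTA text in which every sequence keeps the same sampled columns."""
    rng = random.Random(seed)
    columns = None
    aln_len = None
    out = []
    for header, seq in records:
        if columns is None:
            # Positions are chosen once, from the first sequence ...
            aln_len = len(seq)
            columns = pick_columns(aln_len, fraction, rng)
        elif len(seq) != aln_len:
            raise ValueError(f"sequence '{header}' has a different length")
        # ... and reused for every following sequence.
        sampled = "".join(seq[i] for i in columns)
        out.append(">" + header)
        out.extend(sampled[i:i + wrap] for i in range(0, len(sampled), wrap))
    return "\n".join(out)


if __name__ == "__main__":
    demo = [">a", "ACGTAC", "GT", ">b", "TTGGCCAA"]
    print(resample(read_fasta(demo), fraction=0.5, wrap=60, seed=1))
```

Because sequences are consumed one at a time and only the list of chosen column indices is kept between them, a streaming approach like this needs little memory beyond a single sequence.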
## Install
See file [INSTALL](INSTALL)
## Run
    $ refast data/infile > outfile.fas
## Help and options
    $ refast -h
    Usage: refast [options]
    Options:
      -x fraction   Set the fraction to sample (default: 0.50)
      -w wrap       Set the line wrap length (default: 60)
      -h            Show this help message
      -v            Show version information
    infile is an alignment in fasta format
## Scripts
The file [`scripts/resample_fasta.pl`](scripts/resample_fasta.pl) is the first
implementation. It still works, but is too slow on very large data sets.
## License and copyright
[MIT License](LICENSE)