https://github.com/vmikk/ucs
UC file summarizer
- Host: GitHub
- URL: https://github.com/vmikk/ucs
- Owner: vmikk
- License: mit
- Created: 2024-05-23T12:03:09.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2025-02-16T18:46:38.000Z (3 months ago)
- Last Synced: 2025-02-16T19:32:15.241Z (3 months ago)
- Language: Go
- Homepage:
- Size: 216 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# `ucs` - UC file processing tool
The `ucs` tool reads USEARCH cluster format (UC) files,
which are tab-separated text files commonly used in clustering and database searches.
Each line in a UC file represents a record corresponding to an input sequence,
describing cluster-membership information, alignments, and sequence identities.
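
As a rough illustration of the record layout described above, the following Go sketch parses a single UC line. It assumes the ten-column layout given in the USEARCH manual (record type, cluster number, length or cluster size, percent identity, strand, two unused columns, compressed alignment, query label, target label). The `UCRecord` type and the `parseUCLine` and `intOrMissing` helpers are hypothetical names chosen for this example; they are not taken from the `ucs` source code.

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// UCRecord holds the UC fields that are most often needed.
// Illustrative type only; not part of ucs.
type UCRecord struct {
	Type     string  // S (centroid), H (hit), C (cluster summary), N (no hit)
	Cluster  int     // 0-based cluster number, -1 if given as "*"
	Size     int     // sequence length (S, H, N) or cluster size (C), -1 if "*"
	Identity float64 // percent identity, -1 if given as "*"
	Query    string  // query sequence label
	Target   string  // target sequence label, "*" if there is none
}

// intOrMissing parses an integer field, returning -1 for the "*" placeholder.
func intOrMissing(s string) (int, error) {
	if s == "*" {
		return -1, nil
	}
	return strconv.Atoi(s)
}

// parseUCLine splits one tab-separated UC line into a UCRecord.
func parseUCLine(line string) (UCRecord, error) {
	f := strings.Split(strings.TrimRight(line, "\r\n"), "\t")
	if len(f) < 10 {
		return UCRecord{}, fmt.Errorf("expected 10 fields, got %d", len(f))
	}
	rec := UCRecord{Type: f[0], Identity: -1, Query: f[8], Target: f[9]}
	var err error
	if rec.Cluster, err = intOrMissing(f[1]); err != nil {
		return rec, fmt.Errorf("cluster number %q: %w", f[1], err)
	}
	if rec.Size, err = intOrMissing(f[2]); err != nil {
		return rec, fmt.Errorf("length or size %q: %w", f[2], err)
	}
	if f[3] != "*" {
		if rec.Identity, err = strconv.ParseFloat(f[3], 64); err != nil {
			return rec, fmt.Errorf("identity %q: %w", f[3], err)
		}
	}
	return rec, nil
}

func main() {
	// A made-up hit record: query1 matches centroid1 at 98.4% identity.
	rec, err := parseUCLine("H\t0\t250\t98.4\t+\t0\t0\t250M\tquery1\tcentroid1")
	if err != nil {
		panic(err)
	}
	fmt.Printf("%s %s -> %s (%.1f%%)\n", rec.Type, rec.Query, rec.Target, rec.Identity)
}
```
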
For detailed information on the UC format, refer to the [USEARCH manual](https://drive5.com/usearch/manual/opt_uc.html).

## Usage
Check clustering summary
(estimates the number of unique query and target sequences,
as well as the number of queries with identical names mapped to multiple targets):

```bash
ucs -i test.uc.gz -s
```

Extract mapping results (only Query and Target columns),
remove redundant records and duplicates,
and save the results to a text file:

```bash
ucs -i test.uc.gz -o mappings.txt
```
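
To make the two operations above more concrete, here is a minimal Go sketch of the general idea: collect distinct query/target pairs, then count unique queries, unique targets, and queries mapped to more than one target. It is not the `ucs` implementation. It assumes that only `H` (hit) records count as mappings, which may differ from how `ucs` treats `S`, `C`, and `N` records, and it reads plain text rather than gzip-compressed input. The `Mapping` and `Summary` types and the `uniqueMappings` and `summarize` functions are hypothetical names used only for this example.

```go
package main

import (
	"fmt"
	"strings"
)

// Mapping is one query-to-target pair. Illustrative type only.
type Mapping struct{ Query, Target string }

// Summary holds counts similar to those described for the summary mode.
type Summary struct {
	Queries     int // distinct query labels
	Targets     int // distinct target labels
	MultiMapped int // queries linked to more than one target
}

// uniqueMappings returns the distinct query/target pairs found in H records,
// in order of first appearance.
// Assumption: only H records are mappings; ucs itself may behave differently.
func uniqueMappings(uc string) []Mapping {
	seen := make(map[Mapping]bool)
	var out []Mapping
	for _, line := range strings.Split(uc, "\n") {
		f := strings.Split(strings.TrimRight(line, "\r"), "\t")
		if len(f) < 10 || f[0] != "H" {
			continue // skip blank, malformed, and non-hit lines
		}
		m := Mapping{Query: f[8], Target: f[9]}
		if !seen[m] {
			seen[m] = true
			out = append(out, m)
		}
	}
	return out
}

// summarize counts unique queries, unique targets, and multi-mapped queries.
func summarize(ms []Mapping) Summary {
	targets := make(map[string]bool)
	targetsPerQuery := make(map[string]map[string]bool)
	for _, m := range ms {
		targets[m.Target] = true
		if targetsPerQuery[m.Query] == nil {
			targetsPerQuery[m.Query] = make(map[string]bool)
		}
		targetsPerQuery[m.Query][m.Target] = true
	}
	s := Summary{Queries: len(targetsPerQuery), Targets: len(targets)}
	for _, ts := range targetsPerQuery {
		if len(ts) > 1 {
			s.MultiMapped++
		}
	}
	return s
}

func main() {
	// Made-up records for demonstration.
	uc := "H\t0\t100\t99.0\t+\t0\t0\t100M\tq1\tt1\n" +
		"H\t0\t100\t99.0\t+\t0\t0\t100M\tq1\tt1\n" + // exact duplicate
		"H\t1\t100\t97.0\t+\t0\t0\t100M\tq1\tt2\n" + // q1 maps to a second target
		"H\t1\t100\t98.0\t+\t0\t0\t100M\tq2\tt2\n"
	ms := uniqueMappings(uc)
	for _, m := range ms {
		fmt.Printf("%s\t%s\n", m.Query, m.Target)
	}
	fmt.Printf("%+v\n", summarize(ms))
}
```

Using the `Mapping` struct directly as a map key keeps deduplication to a single lookup per record, at the cost of holding every distinct pair in memory.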