{"id":22188439,"url":"https://github.com/vmikk/distwiz","last_synced_at":"2025-03-24T20:14:46.230Z","repository":{"id":231125014,"uuid":"780531805","full_name":"vmikk/distwiz","owner":"vmikk","description":"distwiz - Sparse to full matrix converter","archived":false,"fork":false,"pushed_at":"2024-04-03T18:23:11.000Z","size":9,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2024-04-04T16:02:45.364Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":"Go","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/vmikk.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-04-01T17:14:31.000Z","updated_at":"2024-06-19T06:33:12.788Z","dependencies_parsed_at":"2024-04-02T15:47:05.348Z","dependency_job_id":"f779a026-e73c-4bfa-b28e-dfcc1afc074d","html_url":"https://github.com/vmikk/distwiz","commit_stats":null,"previous_names":["vmikk/distwiz"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vmikk%2Fdistwiz","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vmikk%2Fdistwiz/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vmikk%2Fdistwiz/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vmikk%2Fdistwiz/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/vmikk","download_url":"https://codeload.github.com/vmikk/distwiz/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":245344005,"owners_count":20599867,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-12-02T11:10:35.902Z","updated_at":"2025-03-24T20:14:46.196Z","avatar_url":"https://github.com/vmikk.png","language":"Go","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Sparse to full matrix converter\n\n## Overview\n\nThis utility converts a sparse matrix \n(such as those produced by `usearch -calc_distmx`) \ninto a full square matrix.\n\n## Notes and limitations\n\nThis project is experimental and may require further optimization to enhance performance.\nSelection between the in-memory or disk-based processing methods \nis based on the number of unique labels in the input data (10,000 sequences by default).  \n\n- The in-memory solution requires approximately 100GB of RAM for 30,000 objects (equivalent to around 450 million pairwise distances)  \n- The disk-based solution, while significantly more memory-efficient, is I/O intensive and much slower.  \n\n## Usage\n\n```shell\ndistwiz --input mx.txt --output dist.txt.gz\n```\n\nSupported arguments:  \n- `--input`: Path to the input file containing the sparse distance matrix  \n- `--output`: Path to the output file (GZIP-compressed)  \n- `--mode`: Processing mode: `auto`, `mem` (in-memory), or `disk` (disk-based). Default is `auto`  \n- `--compresslevel`: GZIP compression level (1-9). The default is 4  \n\n## Installation\n\nCompile the program using `go build`.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fvmikk%2Fdistwiz","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fvmikk%2Fdistwiz","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fvmikk%2Fdistwiz/lists"}