{"id":21961748,"url":"https://github.com/daac-tools/crawdad","last_synced_at":"2025-10-10T04:33:59.576Z","repository":{"id":38417696,"uuid":"472002903","full_name":"daac-tools/crawdad","owner":"daac-tools","description":"🦞 Rust library of natural language dictionaries using character-wise double-array tries.","archived":false,"fork":false,"pushed_at":"2025-01-13T20:01:16.000Z","size":3951,"stargazers_count":31,"open_issues_count":0,"forks_count":2,"subscribers_count":2,"default_branch":"main","last_synced_at":"2025-08-21T18:43:38.506Z","etag":null,"topics":["cjk-characters","data-structures","double-array","no-std","rust","search","trie"],"latest_commit_sha":null,"homepage":"https://docs.rs/crawdad","language":"Rust","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/daac-tools.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":"CONTRIBUTING.md","funding":null,"license":"LICENSE-APACHE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2022-03-20T14:22:49.000Z","updated_at":"2025-05-26T20:22:20.000Z","dependencies_parsed_at":"2025-03-02T09:10:07.339Z","dependency_job_id":"b691039b-c7cf-409b-986e-d3ea630adb26","html_url":"https://github.com/daac-tools/crawdad","commit_stats":null,"previous_names":[],"tags_count":4,"template":false,"template_full_name":null,"purl":"pkg:github/daac-tools/crawdad","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/daac-tools%2Fcrawdad","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/daac-tools%2Fcrawdad/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/daac-tools%2Fcrawdad/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/daac-tools%2Fcrawdad/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/daac-tools","download_url":"https://codeload.github.com/daac-tools/crawdad/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/daac-tools%2Fcrawdad/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":279002674,"owners_count":26083442,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-10T02:00:06.843Z","response_time":62,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["cjk-characters","data-structures","double-array","no-std","rust","search","trie"],"created_at":"2024-11-29T10:17:48.118Z","updated_at":"2025-10-10T04:33:59.556Z","avatar_url":"https://github.com/daac-tools.png","language":"Rust","funding_links":[],"categories":[],"sub_categories":[],"readme":"# 🦞 Crawdad: ChaRActer-Wise Double-Array Dictionary\n\n[![Crates.io](https://img.shields.io/crates/v/crawdad)](https://crates.io/crates/crawdad)\n[![Documentation](https://docs.rs/crawdad/badge.svg)](https://docs.rs/crawdad)\n![Build Status](https://github.com/daac-tools/crawdad/actions/workflows/rust.yml/badge.svg)\n[![Slack](https://img.shields.io/badge/join-chat-brightgreen?logo=slack)](https://join.slack.com/t/daac-tools/shared_invite/zt-1pwwqbcz4-KxL95Nam9VinpPlzUpEGyA)\n\n## Overview\n\nCrawdad is a library of natural language dictionaries using character-wise double-array tries.\nThe implementation is optimized for strings of multibyte-characters,\nand you can enjoy fast text processing on strings such as Japanese or Chinese.\n\nFor example, on a large Japanese dictionary of IPADIC+Neologd, Crawdad has a better time-space tradeoff than other Rust libraries.\n\n![](./figures/neologd.svg)\n\nThe detailed experimental settings and other results are available on [Wiki](https://github.com/daac-tools/crawdad/wiki/Performance-Comparison).\n\n### What can do\n\n- **Key-value mapping**: Crawdad stores a set of string keys with mapping arbitrary integer values.\n- **Exact match**: Crawdad supports a fast lookup for an input key.\n- **Common prefix search**: Crawdad supports fast *common prefix search* that can be used to enumerate all keys appearing in a text.\n\n### Data structures\n\nCrawdad contains the two trie implementations:\n\n- `crawdad::Trie` is a standard trie form that often provides the fastest queries.\n- `crawdad::MpTrie` is a minimal-prefix trie form that is memory-efficient for long strings.\n\n## Slack\n\nWe have a Slack workspace for developers and users to ask questions and discuss a variety of topics.\n\n * https://daac-tools.slack.com/\n * Please get an invitation from [here](https://join.slack.com/t/daac-tools/shared_invite/zt-1pwwqbcz4-KxL95Nam9VinpPlzUpEGyA).\n\n## License\n\nLicensed under either of\n\n * Apache License, Version 2.0\n   ([LICENSE-APACHE](LICENSE-APACHE) or http://www.apache.org/licenses/LICENSE-2.0)\n * MIT license\n   ([LICENSE-MIT](LICENSE-MIT) or http://opensource.org/licenses/MIT)\n\nat your option.\n\n## Acknowledgment\n\nThe initial version of this software was developed by LegalOn Technologies, Inc.,\nbut not an officially supported LegalOn Technologies product.\n\n## Contribution\n\nSee [the guidelines](./CONTRIBUTING.md).\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdaac-tools%2Fcrawdad","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdaac-tools%2Fcrawdad","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdaac-tools%2Fcrawdad/lists"}