{"id":31753787,"url":"https://github.com/servicenow/bytesteady","last_synced_at":"2025-10-09T17:54:31.768Z","repository":{"id":38334353,"uuid":"388223032","full_name":"ServiceNow/bytesteady","owner":"ServiceNow","description":"A fast classification and tagging tool using byte-level n-gram embeddings. bytesteady is a ServiceNow Research project that was started at Element AI.","archived":false,"fork":false,"pushed_at":"2023-07-06T15:37:39.000Z","size":382,"stargazers_count":9,"open_issues_count":1,"forks_count":2,"subscribers_count":6,"default_branch":"master","last_synced_at":"2024-03-30T05:22:37.159Z","etag":null,"topics":["genomics","machine-learning","n-gram","text-classification"],"latest_commit_sha":null,"homepage":"https://arxiv.org/abs/2106.13302","language":"C++","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/ServiceNow.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2021-07-21T19:21:14.000Z","updated_at":"2023-09-27T08:46:05.000Z","dependencies_parsed_at":"2022-07-25T20:30:58.450Z","dependency_job_id":null,"html_url":"https://github.com/ServiceNow/bytesteady","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/ServiceNow/bytesteady","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ServiceNow%2Fbytesteady","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ServiceNow%2Fbytesteady/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ServiceNow%2Fbytesteady/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ServiceNow%2Fbytesteady/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/ServiceNow","download_url":"https://codeload.github.com/ServiceNow/bytesteady/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ServiceNow%2Fbytesteady/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":279001804,"owners_count":26083197,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-09T02:00:07.460Z","response_time":59,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["genomics","machine-learning","n-gram","text-classification"],"created_at":"2025-10-09T17:54:26.514Z","updated_at":"2025-10-09T17:54:31.763Z","avatar_url":"https://github.com/ServiceNow.png","language":"C++","funding_links":[],"categories":[],"sub_categories":[],"readme":"*ServiceNow completed its acquisition of Element AI on January 8, 2021. All references to Element AI in the materials that are part of this project should refer to ServiceNow.*\n\n# byteSteady\n\n**A fast classification and tagging tool using byte-level n-gram embeddings**\n\n## Reference\n\nPlease read our [[paper]](https://arxiv.org/abs/2106.13302) for details on byteSteady.\n\n```\n@article{zhang2021bytesteady,\n  title={byteSteady: Fast Classification Using Byte-Level n-Gram Embeddings},\n  author={Zhang, Xiang and Drouin, Alexandre and Li, Raymond},\n  journal={arXiv preprint arXiv:2106.13302},\n  year={2021}\n}\n```\n\n## Dependencies\n\n1. GNU/Linux (`byteswap.h` from glibc used in CityHash)\n2. C++17 compiler (`::std::variant`, `::std::filesystem`, etc)\n3. [Thunder](http://thunder.nyc) (tensor math)\n4. [Google googletest](https://github.com/google/googletest) (unit tests)\n5. [Google gflags](https://github.com/gflags/gflags) (command-line option parsing)\n6. [Google glog](https://github.com/google/glog) (logging and error handling)\n\n## Compile\n\nMake sure all the dependencies are installed, and then simply `make`.\n\nThere will be a few outputs:\n- `bytesteady/bytesteady`: the byteSteady executable\n- `bytesteady/libbytesteady.so`: the byteSteady dynamic library\n- `bytesteady/*_test`: unit tests of different modules for byteSteady\n\n## Command-line options\n\nbyteSteady is built with [Google gflags](https://github.com/gflags/gflags) to support command-line flag parsing. The definition of all available flags can be found in `bytesteady/flags.cpp`. You can also query these flags by\n\n```\n$ bytesteady/bytesteady -helpon bytesteady/flags\n```\n\nThe `-helpon` is provided by Google gflags to show help for flags only defined in some source code file. For full help information, including flags from the other parts of the program (such as Google glog), simply use `-help`.\n\n\n## Gene classification dataset\n\nThe gene classification dataset used in the paper can be downloaded at https://zenodo.org/record/5181235#.YWc4bG3MJb8.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fservicenow%2Fbytesteady","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fservicenow%2Fbytesteady","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fservicenow%2Fbytesteady/lists"}