{"id":17166680,"url":"https://github.com/willf/compression_classification","last_synced_at":"2025-04-13T15:25:50.144Z","repository":{"id":181212734,"uuid":"666192304","full_name":"willf/compression_classification","owner":"willf","description":"Using compression to classify","archived":false,"fork":false,"pushed_at":"2023-08-06T22:29:36.000Z","size":2530,"stargazers_count":4,"open_issues_count":1,"forks_count":0,"subscribers_count":2,"default_branch":"main","last_synced_at":"2025-04-10T20:13:00.069Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/willf.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":"contributing.md","funding":null,"license":null,"code_of_conduct":"code-of-conduct.md","threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":"SECURITY.md","support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2023-07-13T23:55:08.000Z","updated_at":"2023-07-31T16:46:11.000Z","dependencies_parsed_at":null,"dependency_job_id":"c1b05b88-f2bb-4a71-ad3d-a439931e3115","html_url":"https://github.com/willf/compression_classification","commit_stats":null,"previous_names":["willf/compression_classification"],"tags_count":0,"template":false,"template_full_name":"willf/joyfully","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/willf%2Fcompression_classification","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/willf%2Fcompression_classification/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/willf%2Fcompression_classification/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/willf%2Fcompression_classification/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/willf","download_url":"https://codeload.github.com/willf/compression_classification/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248734134,"owners_count":21153155,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-10-14T23:06:22.864Z","updated_at":"2025-04-13T15:25:50.138Z","avatar_url":"https://github.com/willf.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# compression_classification\n\n[![Python package](https://github.com/willf/compression_classification/actions/workflows/test.yml/badge.svg)](https://github.com/willf/compression_classification/actions/workflows/test.yml)\n\nCompression Classification is a Python package for classifying via compression.\n\nIt is inspired by my talk on \"[Stupid Language Tricks](https://www.entish.org/lang-id-preso/lang-id-preso.pdf)\" and  [“Low-Resource” Text Classification: A Parameter-Free Classification\nMethod with Compressors](https://aclanthology.org/2023.findings-acl.426.pdf)\n\nSimple example:\n\n```python\nfrom compression_classification import compression_classification\nclr = compression_classification.CompressionClassifier()\nclr.train(\"FilterGenie 的基础设施旨在处理大量数据而不影响性能。 无论您拥有小型项目还是大型企业应用程序，我们 的 API 都可以轻松扩展以满足您的需求。\", \"zh\")\nclr.train(\"FilterGenie's infrastructure is built to handle high volumes of data without compromising performance. Whether you have a small-scale project or a large enterprise application, our API scales effortlessly to meet your needs.\", \"en\")\n\nclr.predict(\"This is the day they give babies away\")\n'en'\n\nclr.predict(\"这一天是他们送孩子的日子\")\n'zh'\n```\n\nIn general, you'll want a lot more data, though.\n\n\n## Contributing\n\nWe welcome contributions to compression_classification. Please see our [contributing guidelines](contributing.md) for more information.\n\nTo install the package for development, install [poetry](https://python-poetry.org/) and then run:\n\n```bash\ngh repo clone willf/compression_classification\ncd compression_classification\npoetry install\npoetry shell\n```\n\n## Code of Conduct\n\nWe expect project participants to adhere to our [Code of Conduct](code-of-conduct.md).\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fwillf%2Fcompression_classification","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fwillf%2Fcompression_classification","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fwillf%2Fcompression_classification/lists"}