{"id":17632918,"url":"https://github.com/kenorb/bloom","last_synced_at":"2025-10-28T07:31:04.868Z","repository":{"id":216125093,"uuid":"740535277","full_name":"kenorb/bloom","owner":"kenorb","description":"Command utility to filter out duplicated lines using bloom filter method.","archived":false,"fork":false,"pushed_at":"2024-10-28T15:45:02.000Z","size":103,"stargazers_count":1,"open_issues_count":15,"forks_count":1,"subscribers_count":1,"default_branch":"main","last_synced_at":"2024-12-20T14:16:34.040Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Rust","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/kenorb.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-01-08T14:44:16.000Z","updated_at":"2024-12-04T19:18:56.000Z","dependencies_parsed_at":"2024-01-08T16:21:49.427Z","dependency_job_id":"4f878a1c-e51f-48db-bc47-3e8b1b12bba9","html_url":"https://github.com/kenorb/bloom","commit_stats":null,"previous_names":["kenorb/bloom"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kenorb%2Fbloom","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kenorb%2Fbloom/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kenorb%2Fbloom/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kenorb%2Fbloom/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/kenorb","download_url":"https://codeload.github.com/kenorb/bloom/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":238610915,"owners_count":19500745,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-10-23T01:46:26.970Z","updated_at":"2025-10-28T07:31:04.863Z","avatar_url":"https://github.com/kenorb.png","language":"Rust","readme":"# Bloom\n\nCommand utility to filter out duplicated lines using bloom filter method.\n\n## Overview\n\nBloom is a command-line tool aims at printing only unique lines from the\nstandard input. It uses [Bloom filter][bf-wiki], a space-efficient\nprobabilistic data structure used to test whether a given element is a member\nof a set.  The tool provides functionality to read lines from standard input,\ncalculate check sums, and insert them into a Bloom filter. Additionally, it\nsupports the option to load and update existing Bloom filter files, set\nlimits on the number of lines inserted, and more.\n\n## Build\n\nTo build from source, clone this repository and run:\n\n    cargo build\n\nTo build using optimized profile, run:\n\n    cargo build --profile optimized\n\n## Install\n\nTo install, run:\n\n    cargo install --git https://github.com/kenorb/bloom.git\n\n## Examples\n\n    # Filters out duplicated lines.\n    $ (seq 10; seq 10) | bloom | wc -l\n    10\n    # Prints lines only when present in bloom filter.\n    $ (seq 10; seq 10) | bloom -v | wc -l\n    10\n    # Store maximum 9 lines.\n    $ seq 10 | bloom -l 9 | wc -l\n    10\n    # Writes bloom filter into the file, then use it again to filter out lines.\n    $ seq 10 | bloom -f 10.blf -w; seq 10 | bloom -f 10.blf | wc -l\n    0\n\n\u003c!-- Named links --\u003e\n\n[bf-wiki]: https://en.wikipedia.org/wiki/Bloom_filter\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fkenorb%2Fbloom","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fkenorb%2Fbloom","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fkenorb%2Fbloom/lists"}