{"id":19492082,"url":"https://github.com/kylemclaren/scrub","last_synced_at":"2025-06-16T21:38:26.277Z","repository":{"id":153848615,"uuid":"623891625","full_name":"kylemclaren/scrub","owner":"kylemclaren","description":"A Python package to scrub PII","archived":false,"fork":false,"pushed_at":"2023-04-21T10:11:26.000Z","size":43,"stargazers_count":24,"open_issues_count":0,"forks_count":6,"subscribers_count":2,"default_branch":"master","last_synced_at":"2025-05-26T20:16:01.014Z","etag":null,"topics":["pii","python","sanitization"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/kylemclaren.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2023-04-05T10:04:09.000Z","updated_at":"2025-04-04T10:41:19.000Z","dependencies_parsed_at":null,"dependency_job_id":"6e0a05be-c650-4935-8424-307d809507a8","html_url":"https://github.com/kylemclaren/scrub","commit_stats":null,"previous_names":[],"tags_count":3,"template":false,"template_full_name":null,"purl":"pkg:github/kylemclaren/scrub","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kylemclaren%2Fscrub","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kylemclaren%2Fscrub/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kylemclaren%2Fscrub/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kylemclaren%2Fscrub/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/kylemclaren","download_url":"https://codeload.github.com/kylemclaren/scrub/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kylemclaren%2Fscrub/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":260246214,"owners_count":22980355,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["pii","python","sanitization"],"created_at":"2024-11-10T21:19:17.206Z","updated_at":"2025-06-16T21:38:26.243Z","avatar_url":"https://github.com/kylemclaren.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"\u003ch1 align=\"center\"\u003e🫧 scrub 🫧\u003c/h1\u003e\n\n**scrub** is a lightweight, extensible Python package designed to remove Personally Identifiable Information (PII) from any text input. It leverages advanced Machine Learning algorithms to detect PII and provides multiple levels of scrubbing to ensure optimal anonymization of sensitive information. safeguarding user privacy.\n\n## Installation\n\nInstall the latest version of scrub from [PyPI](https://pypi.org/project/scrubadubdub/):\n\n```bash\npip install scrubadubdub\n```\n\n## Key Features\n\n- Advanced PII detection using Machine Learning algorithms.\n- Support for a wide range of PII types (names, email addresses, phone numbers, etc.).\n- Customizable detection threshold and levels of sanitization.\n- Extensible architecture to allow for the addition of new PII types and algorithms.\n\n## Basic Usage\n\n```python\nfrom scrubadubdub import Scrub\n\nscrubber = Scrub()\ninput_text = \"My name is John Doe and my email address is john.doe@email.com, and my phone number is 123-456-7890\"\nscrubbed_text = scrubber.scrub(input_text)\nprint(scrubbed_text)\n```\n\n```txt\nMy name is [REDACTED-NAME] and my email address is [REDACTED-EMAIL], and my phone number is [REDACTED-PHONE]\n```\n\n\u003c!-- ## Advanced Usage\n\nWith scrub, you can customize the detection threshold, sanitization levels, and even integrate additional scrubbing functionalities based on your needs.\n\n```python\n# Set custom threshold and level\nscrubber.set_threshold(0.9)\nscrubber.set_sanitization_level(scrub.SanitizationLevels.MEDIUM)\n\n# Enable additional sanitization features\nscrubber.enable_email_sanitization()\n\n# Add custom PII detection function\ndef custom_detection(text):\n    ...\n\nscrubber.add_detection_function(custom_detection)\n``` --\u003e\n\n\u003c!-- ## Documentation\n\nMore details on setting up and using scrub can be found in the [Documentation](https://scrub.readthedocs.io).\n\n## Contributing\n\nWe love contributions! If you'd like to contribute to scrub, please read our [Contributing Guidelines](./CONTRIBUTING.md) for more information on how to get started. --\u003e\n\n## License\n\nscrub is licensed under the MIT License. See [LICENSE](./LICENSE) for more details.","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fkylemclaren%2Fscrub","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fkylemclaren%2Fscrub","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fkylemclaren%2Fscrub/lists"}