https://github.com/microsoft/presidio
An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
https://github.com/microsoft/presidio
anonymization data-anonymization data-masking data-obfuscation data-privacy data-redaction de-identification guardrails image-redactor named-entity-recognition nlp personally-identifiable-information phi pii pii-detection privacy python sensitive-data spacy transformers
Last synced: about 1 hour ago
JSON representation
An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
- Host: GitHub
- URL: https://github.com/microsoft/presidio
- Owner: microsoft
- License: mit
- Created: 2018-05-04T11:08:58.000Z (about 7 years ago)
- Default Branch: main
- Last Pushed: 2025-04-27T10:27:10.000Z (16 days ago)
- Last Synced: 2025-04-30T12:48:52.256Z (13 days ago)
- Topics: anonymization, data-anonymization, data-masking, data-obfuscation, data-privacy, data-redaction, de-identification, guardrails, image-redactor, named-entity-recognition, nlp, personally-identifiable-information, phi, pii, pii-detection, privacy, python, sensitive-data, spacy, transformers
- Language: Python
- Homepage: https://microsoft.github.io/presidio
- Size: 222 MB
- Stars: 4,511
- Watchers: 68
- Forks: 638
- Open Issues: 80
-
Metadata Files:
- Readme: README.MD
- Changelog: CHANGELOG.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT
- Codeowners: .github/CODEOWNERS
- Security: SECURITY.md
- Support: docs/supported_entities.md
Awesome Lists containing this project
- awesome-privacy-engineering - Presidio - Context aware, pluggable and customizable PII anonymization service for text and images, developed by Microsoft. (Awesome Privacy Engineering [](https://awesome.re) / De-Identification and Anonymization)
- awesome-rainmana - microsoft/presidio - An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines. (Python)