An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with data-privacy

A curated list of projects in awesome lists tagged with data-privacy.

https://github.com/countly/countly-server

Countly is a product analytics platform that helps teams track, analyze, and act on user actions and behaviour in mobile, web, and desktop applications.

analytics coppa crash-analytics crash-reports dashboard data data-ownership data-privacy feature-flags gdpr hipaa insights mobile-analytics product-analytics product-management push-notifications remote-configuration tracking user-feedback web-analytics

Last synced: 07 Jan 2026

https://github.com/Countly/countly-server

Countly is a product analytics platform that helps teams track, analyze, and act on user actions and behaviour in mobile, web, and desktop applications.

analytics coppa crash-analytics crash-reports dashboard data data-ownership data-privacy feature-flags gdpr hipaa insights mobile-analytics product-analytics product-management push-notifications remote-configuration tracking user-feedback web-analytics

Last synced: 13 Mar 2025

https://github.com/microsoft/presidio

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

anonymization data-anonymization data-masking data-obfuscation data-privacy data-redaction de-identification guardrails image-redactor named-entity-recognition nlp personally-identifiable-information phi pii pii-detection privacy python sensitive-data spacy transformers

Last synced: 13 May 2025

https://github.com/ibm/differential-privacy-library

Diffprivlib: The IBM Differential Privacy Library

data-privacy differential-privacy machine-learning python

Last synced: 21 Oct 2025

https://github.com/IBM/differential-privacy-library

Diffprivlib: The IBM Differential Privacy Library

data-privacy differential-privacy machine-learning python

Last synced: 27 Mar 2025

https://github.com/hypersign-protocol/hid-node

A permissionless blockchain network to manage digital identity and access rights

blockchain data-privacy did ssi

Last synced: 24 Dec 2025

https://github.com/datafog/datafog-python

Lightweight Python SDK for PII detection, redaction, and LLM guardrails, with fast regex defaults and optional NLP/OCR/Spark extras.

anonymization compliance data-privacy data-protection de-identification gliner guardrails llm ner nlp ocr pii pii-detection pii-redaction privacy-engineering python redaction sdk spacy

Last synced: 31 May 2026

https://github.com/nightfallai/nightfall_dlp_action

GitHub Data Loss Prevention (DLP) Action: Scan Pull Requests for sensitive data, like credentials & secrets, PII, credit card numbers, and more.

code-scanner data-loss-prevention data-privacy data-protection data-security devsecops github-action nightfall secrets-detection

Last synced: 16 Jan 2026

https://github.com/edyan/neuralyzer

Neuralyzer is a library and a command line tool to anonymize databases (by updating existing data or populating a table with fake data)

anonymisation anonymization anonymize data-generation data-generator data-privacy database dgpr private-life rgpd

Last synced: 04 Apr 2025

https://github.com/ethyca/fidesops

Privacy as Code for DSAR Orchestration: Privacy Request automation to fulfill GDPR, CCPA, and LGPD data subject requests.

compliance compliance-as-code compliance-automation data-privacy gdpr gdpr-compliant privacy privacy-as-code

Last synced: 04 Apr 2025

https://github.com/yaojin17/Unlearning_LLM

[ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models"

copyright-protection data-privacy llm llm-unlearning machine-unlearning unlearning

Last synced: 24 Mar 2025

https://github.com/k1m0ch1/big-brother-test

A list of applications and other things that "probably" steal data or break data privacy.

application-usage data-privacy hacktoberfest hactoberfest

Last synced: 30 Apr 2025

https://github.com/thyrisAI/safe-zone

TSZ (Thyris Safe Zone) is an open-source PII detection and guardrails engine that prevents sensitive data from leaking to LLMs and third-party APIs.

ai-security data-privacy data-security guardrails guardrails-redaction pii-detection secrets-detection zero-trust

Last synced: 20 Jan 2026

https://github.com/subrose/thorn

🌹 Thorn is an open-source, data privacy vault to store and manage PII in a fully compliant manner.

ccpa data-privacy encryption gdpr hipaa pci pci-dss privacy privacy-by-default privacy-by-design privacy-engineering security subrose

Last synced: 10 Mar 2026

https://github.com/nightfallai/nightfall-go-sdk

Go Data Loss Prevention (DLP) SDK - Nightfall Developer Platform

api data-loss-prevention data-privacy data-security dlp go golang sdk secrets-detection

Last synced: 13 Mar 2026

https://github.com/nightfallai/nightfall-java-sdk

Java Data Loss Prevention (DLP) SDK - Nightfall Developer Platform

api data-loss-prevention data-privacy data-protection data-security dlp java nightfall secrets-detection

Last synced: 14 Jan 2026

https://github.com/senzing-garage/sz-semantics

Transform JSON output from Senzing SDK for use with graph technologies, semantics, and downstream LLM integration

context-engineering data-privacy entity-resolution entity-resolved-knowlege-graph ontology rdf semantic-layer semantics senzing-test-linux-darwin-windows senzing-test-weekly skos

Last synced: 28 Feb 2026

https://github.com/fvaleye/delta-buddy

Introducing Delta-Buddy: Your ultimate Delta Lake companion! 🚀 Streamline your data journey with an AI-powered chatbot. Ask Delta-Buddy anything about your Delta Lake.

chromadb data-privacy databricks-notebooks delta-lake dolly langchain llm python

Last synced: 14 Feb 2026

https://github.com/nightfallai/nightfall-nodejs-sdk

NodeJS Data Loss Prevention (DLP) SDK - Nightfall Developer Platform

api data-loss-prevention data-privacy data-protection dlp javascript nodejs sdk secrets-detection typescript

Last synced: 16 Jan 2026

https://github.com/lvndry/clausea

Transform complex legal documents into clear, actionable insights with AI-powered analysis.

agents ai compliance crawling data-privacy gdpr legal llm ocr rag summarization

Last synced: 18 Jan 2026

https://github.com/synthesized-io/synthesized-notebooks

Discover the art of enhancing your data using generative modelling in these notebooks.

data-privacy data-science generative-modelling ml notebooks synthetic-data

Last synced: 14 Jul 2025

https://github.com/arcana-network/storage

Arcana Storage SDK offers data privacy and lets dApp users control all access to the data they upload to the Arcana Store. Data owners can share, revoke, delete, or transfer ownership to other users, backed by blockchain.

data-privacy decentralized-storage distributed-storage private-nft web3

Last synced: 12 Jan 2026

https://github.com/strmprivacy/cli

This is the STRM Privacy Command Line Interface, to define and manage your privacy streams, data schemas, event contracts and much more.

cli data data-pipeline data-privacy data-privacy-compliance data-processing privacy

Last synced: 23 Jun 2025

https://github.com/cs-joy/blockchain_exploration

All about blockchain technology, especially data privacy and security

blockchain-technology cia data-privacy security

Last synced: 06 Feb 2026

https://github.com/linkdd/secure-user-data

Infrastructure to store, manage and access user data in a very secure manner.

data-privacy data-protection microservice python3 security user-data

Last synced: 15 Sep 2025

https://github.com/sushegaad/semantic-privacy-guard

Semantic Privacy Guard: A Java middleware that intercepts text, identifies PII using a three-layer hybrid pipeline (Regex + Naive Bayes ML + Apache OpenNLP NER), and redacts it before it reaches an LLM or leaves the corporate network — with stream-based processing for memory-efficient handling of large files and log streams.

ai-firewall ai-safety compliance data-privacy eu-ai-act gdpr java java-library llm-privacy llm-security machine-learning maven middleware naive-bayes nlp pii-detection pii-redaction prompt-engineering regex zero-dependency

Last synced: 12 Apr 2026
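
To make the description above more concrete, here is a minimal Python sketch of a regex-only redaction layer applied line by line, so that large files or log streams never need to be held in memory. It is not the project's code (the project is a Java library), it covers only the regex layer and not the Naive Bayes or NER layers, and the patterns, labels, and function names are all assumptions chosen for illustration.

```python
import re
from typing import Iterable, Iterator

# Illustrative patterns only (assumptions); real PII rules are far broader.
PATTERNS = {
    "EMAIL": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}


def redact_line(line: str) -> str:
    """Replace every pattern match in one line with a bracketed label."""
    for label, pattern in PATTERNS.items():
        line = pattern.sub(f"[{label}]", line)
    return line


def redact_stream(lines: Iterable[str]) -> Iterator[str]:
    """Lazily redact an iterable of lines, yielding one line at a time."""
    for line in lines:
        yield redact_line(line)
```

Because `redact_stream` is a generator, it can wrap an open file object directly and write each redacted line out as it is produced.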

https://github.com/nfdi4health/syndat-dashboard

A Dashboard for evaluation & visualization of synthetic patient level data

dashboard-application data-evaluation data-privacy data-quality synthetic-data

Last synced: 20 Feb 2026

https://github.com/amirhosseinhonardoust/teaching-neural-networks-to-imagine-tables

A comprehensive deep dive into how Variational Autoencoders (VAEs) learn to generate realistic synthetic tabular data. This project explores latent space learning, probabilistic modeling, and neural creativity, combining data privacy, interpretability, and generative AI techniques in a structured format.

autoencoder data-augmentation data-privacy data-science deep-learning explainable-ai feature-learning generative-model latent-space machine-learning neural-networks python pytorch representation-learning research simulation statistical-learning synthetic-data tabular-data vae

Last synced: 17 Nov 2025

https://github.com/yanisvdc/opendp_baseline

Baseline solution for the privacy part of the PETs for Public Health Challenge 2024

challenge data-privacy differential-privacy epidemiology open-source opendp privacy-enhancing-technologies

Last synced: 20 May 2026

https://github.com/dwain-barnes/flowise-private-doc-chat-rag-blog

A private, local RAG (Retrieval-Augmented Generation) system using Flowise, Ollama, and open-source LLMs to chat with your documents securely and offline.

data-privacy flowise flowise-ai local-llm offline-ai ollama open-source pdf-chatbot private-doc-chat rag retrieval-augmented-generation self-hosted-ai

Last synced: 02 May 2026

https://github.com/deco31416/facialtrustpay-deco31416

MVP designed for facilitating micropayments using facial recognition and blockchain technology, ensuring secure and fast transactions while prioritizing user privacy by not storing biometric data.

blockchain dapp data-privacy decentralized facial-recognition hash-based-identification micropayments mvp payments privacy security solidity

Last synced: 25 Jan 2026

https://github.com/recheck-io/vue-recheck-authorizer

A set of Vue.js components for authentication and data interaction with Recheck Platform.

data-exchange data-privacy digital-proofs recheck-authorizer recheck-io

Last synced: 20 Mar 2026

https://github.com/ssciwr/mailcom

Recognize and pseudonymize named entities in emails

anonymization data-privacy llm-inference pseudonymization text-preprocessing

Last synced: 21 Apr 2025

https://github.com/tdiprima/visionguard

Detecting, masking, and protecting sensitive information in images

data-privacy dicom image-analysis image-processing medical-imaging ocr text-detection

Last synced: 26 Nov 2025

https://github.com/zlovtnik/dispo-rusty

Built to solve real-world SaaS and managed platform pain points—compliance, scale, and speed—dispo-rusty is an open-source foundation for secure, high-performance, tenant-isolated REST API services. Whether you're a fast-moving founder, a platform CTO, or an enterprise engineering team, dispo-rusty helps you reduce costs, onboard clients instantly.

actix-web ant-design compliance data-privacy database-isolation enterprise-security role-based-access-control rust saas

Last synced: 13 Apr 2026

https://github.com/zlovtnik/backend

Built to solve real-world SaaS and managed platform pain points—compliance, scale, and speed—dispo-rusty is an open-source foundation for secure, high-performance, tenant-isolated REST API services. Whether you're a fast-moving founder, a platform CTO, or an enterprise engineering team, dispo-rusty helps you reduce costs, onboard clients instantly,

actix-web compliance data-privacy database-isolation enterprise-security role-based-access-control rust saas

Last synced: 13 Apr 2026

https://github.com/dishine-digital-agency/dishine-data-safe-usb

Privacy is non-negotiable, but so is data-driven growth. This tool ensures no real phone numbers or IBANs are uploaded to external servers, allowing your AI models to perform grouping and trend detection on safe, anonymized datasets. A one-click Mac-native experience designed for non-technical teams.

anonymization consulting-tools data-privacy gdpr local-ai pii-redaction

Last synced: 14 Apr 2026

https://github.com/daniel-elston/data-privacy-and-cyber-security

A report detailing the current problems with Data Privacy, the GDPR and how Cyber Security can be improved.

consent cyber-security data-breach data-privacy data-science gdpr

Last synced: 19 Jan 2026

https://github.com/4xyy/cli-password-manager

A command-line tool for securely managing passwords. It allows users to store, retrieve, and generate strong passwords, all encrypted with a master key.

automation cli-tool credential-management cross-platform cybersecurity data-privacy encryption fernet-encryption key-management open-source password-generator password-manager python secure-storage security

Last synced: 15 Oct 2025

https://github.com/taoq-ai/wuming

无名 (The Nameless) is a zero-config PII detection and redaction library for Go. 75+ detectors across 14 locales with compliance presets for GDPR, HIPAA, LGPD, APPI, PIPL, and more.

anonymization compliance data-privacy data-protection gdpr go golang hexagonal-architecture hipaa open-source personally-identifiable-information pii pii-detection pii-redaction privacy regex security sensitive-data

Last synced: 01 Apr 2026

https://github.com/bespredel/encryption-form

A Laravel package to securely encrypt form fields on the client-side using public key encryption and decrypt them on the server-side using the private key.

client-side-encryption data-privacy encryption form-encryption laravel open-source php privacy-protection rsa-encryption security

Last synced: 25 Apr 2026

https://github.com/gigameshgarages/reefnet

Privacy Preserving Metadata Proof Powered Data Streaming Token Vaults with zkRollups on Ocean Protocol

balancer-pools circom data-privacy defi erc1620 erc20 metadata metadata-management ocean-protocol randao snarkjs starkware starkware-veedo-vdf streaming vdf zkrollup zokrates

Last synced: 29 Apr 2026

https://github.com/b-macker/naab-passage

🔒 Sovereign data gateway & PII protection - Zero leakage to LLMs and APIs with self-synthesizing architecture. HIPAA/GDPR compliant. Part of the NAAb Ecosystem.

anthropic api-gateway audit compliance data-privacy data-protection encryption gateway gdpr hipaa llm-proxy naab naab-ecosystem openai pii-protection polyglot privacy security soc2 zero-trust

Last synced: 06 Mar 2026

https://github.com/rabelais88/consent-nextjs

Cookie and private data consent management in Next.js with lightning-fast setup

compliance cookie cookie-consent cookie-consent-banner data-consent data-privacy gdpr next nextjs

Last synced: 14 Mar 2026

https://github.com/akimuddinshaikh/data-governance-2

An ethical AI approach for liver disease detection using machine learning and deep learning models. The project ensures data privacy, fairness, and transparency, using the Indian Liver Patient Records dataset for scalable healthcare diagnostics.

bias-mitigation data-privacy fairness transparency

Last synced: 26 Jan 2026

https://github.com/vypher-io/cli

Fast, parallelized CLI for detecting PII and PHI in files. Built for Finance (PCI DSS) and Healthcare (HIPAA) compliance.

cli compliance data-privacy devtools go golangci hipaa pci-dss phi pii sarif scanner security

Last synced: 14 Apr 2026

https://github.com/mele0/air2lung

SQL and Python pipeline for analyzing the impact of air pollution and lifestyle factors on lung disease using a synthetic UK cohort.

air-pollution data-privacy encryption k-anonymity lung-health-severity mysql python sql

Last synced: 19 Aug 2025

https://github.com/lucianoscarpaci/cryptography

Embark on a journey of cryptographic exploration as we delve into the intricate worlds of RSA and ElGamal encryption, crafted entirely from scratch without the aid of OpenSSL. Delve into the inner workings of these fundamental encryption algorithms, unraveling the complexities of secure data transmission.

algorithm-implementation asymmetric-cryptography cryptographic-algorithms cryptographic-security cryptography data-privacy elgamal-cryptosystem encryption-de key-generation rsa-algorithm secure-communication

Last synced: 14 Aug 2025
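
As a rough idea of what a from-scratch RSA implementation involves, the following Python sketch builds a key pair from two primes and encrypts and decrypts a small integer. It is not taken from the repository: the function names are hypothetical, the primes are deliberately tiny, there is no padding, and ElGamal is not covered, so it is only a teaching aid and not usable for real security.

```python
from math import gcd
from typing import Tuple

Key = Tuple[int, int]  # (exponent, modulus)


def make_keypair(p: int, q: int, e: int = 65537) -> Tuple[Key, Key]:
    """Derive (public, private) RSA keys from primes p and q (toy sizes only)."""
    n = p * q
    phi = (p - 1) * (q - 1)
    if gcd(e, phi) != 1:
        raise ValueError("e must be coprime with (p - 1) * (q - 1)")
    d = pow(e, -1, phi)  # modular inverse of e; needs Python 3.8+
    return (e, n), (d, n)


def encrypt(message: int, public: Key) -> int:
    """Textbook RSA: message ** e mod n, for an integer message < n."""
    e, n = public
    return pow(message, e, n)


def decrypt(cipher: int, private: Key) -> int:
    """Textbook RSA: cipher ** d mod n."""
    d, n = private
    return pow(cipher, d, n)


public, private = make_keypair(61, 53, e=17)
cipher = encrypt(65, public)
assert decrypt(cipher, private) == 65  # round trip recovers the message
```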

https://github.com/sergiovzambelli/clearview-ai-gdpr-case-study

Case study on Clearview AI's potential GDPR violations and the impact of the AI Act. This project, done as part of our cybersecurity master's program at the University of Pisa, includes an analysis of Clearview's data processing practices and their legal implications under European law.

ai-ethics clearview-ai cybersecurity data-privacy european-law gdpr

Last synced: 26 Feb 2026

https://github.com/xdcobra/react-native-sherpa-onnx

Offline Speech Processing SDK for React Native using sherpa-onnx. Supports Speech-to-Text, Text-to-Speech, Speaker Diarization, Speech Enhancement, Source Separation & VAD.

android ctc data-privacy funasr ios paraformer react-native sdk sensevoice sherpa-onnx source-separation speaker-diarization speech-enhancement speech-to-text text-to-speech vad voice-activity-detection whisper zipformer

Last synced: 02 Apr 2026

https://github.com/zafrem/pattern-engine

A comprehensive collection of regex patterns and verification functions for detecting and validating PII and sensitive data across multiple global regions and languages.

compliance data-privacy multi-region pii-detection regex-patterns security-tooling sensitive-data verification-functions

Last synced: 11 Feb 2026

https://github.com/zafrem/data-detector

Data-detector is a Python-based PII detection and protection framework featuring multi-language NLP support, RAG security, and data tokenization capabilities.

data-privacy detector docker fake hash masking nlp pii pii-detection rag rag-security synthetic-data testpypi

Last synced: 21 Feb 2026

https://github.com/dwain-barnes/local-low-code-chatbot-ollama-flowise

Create a local, customizable chatbot using Flowise's visual builder and Ollama's open-source LLMs. This low-code project ensures privacy, scalability, and interactive AI features through an easy-to-use workflow. Ideal for developers seeking a secure, efficient chatbot solution.

ai-chatbot ai-workflow chatbot data-privacy flowise llm local-chatbot low-code ollama open-source privacy-focused

Last synced: 03 Apr 2025

https://github.com/faisalzia-bit1/ethstorage-ceremony-x-earn-by-abhi

🚀 Join the EthStorage V1 Trusted Setup Ceremony to enhance blockchain security while earning rewards through a straightforward, step-by-step setup process.

blockchain ceremony cloud-storage crypto data-privacy decentralized-storage digital-assets distributed-systems earn-rewards eth-storage ethereum incentives peer-to-peer smart-contracts web3

Last synced: 13 Sep 2025

https://github.com/datareporter-gmbh/module-core

Core Magento2 module containing base containers for config and news

data-privacy magento2-module

Last synced: 16 Jan 2026

https://github.com/dahmansphi/differential_privacy_with_ai_and_ml

Review of Data Privacy Techniques: Concepts, Scenarios & Architectures, Simulations, Challenges, and Future Directions

ai data-privacy differential-privacy machine-learning

Last synced: 31 Mar 2025

https://github.com/ai-sdc/sumit

SUMiT: Statistical Uncertainty Management Toolkit

data-privacy privacy statistical-disclosure-control

Last synced: 21 Feb 2026

https://github.com/filiprokita/tobase64

This Python program encodes a file in base64 format and saves the result to a new file with a ".b64" extension. It is a command-line tool that can be used to automate file encoding tasks.

base64 command-line data data-conversion data-manipulation data-privacy data-prottection data-security encoding file file-conversion file-handling python python-script python3 tobase64

Last synced: 30 Jun 2025
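
The behaviour described for this tool can be sketched in a few lines of Python with the standard library. This is an illustration under stated assumptions, not the repository's code: the function name `encode_file_to_b64` is hypothetical, and the whole file is read into memory, which suits small files only.

```python
import base64
from pathlib import Path


def encode_file_to_b64(source: str) -> Path:
    """Write a base64 copy of `source` next to it, with ".b64" appended."""
    src = Path(source)
    dest = src.with_name(src.name + ".b64")
    # b64encode returns ASCII bytes, so the output is written in binary mode.
    dest.write_bytes(base64.b64encode(src.read_bytes()))
    return dest
```

Base64 is a reversible encoding rather than encryption, so anyone holding the `.b64` file can recover the original with `base64.b64decode`.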