Projects in Awesome Lists tagged with data-privacy
A curated list of projects in awesome lists tagged with data-privacy .
https://github.com/countly/countly-server
Countly is a product analytics platform that helps teams track, analyze and act-on their user actions and behaviour on mobile, web and desktop applications.
analytics coppa crash-analytics crash-reports dashboard data data-ownership data-privacy feature-flags gdpr hipaa insights mobile-analytics product-analytics product-management push-notifications remote-configuration tracking user-feedback web-analytics
Last synced: 07 Jan 2026
https://github.com/Countly/countly-server
Countly is a product analytics platform that helps teams track, analyze and act-on their user actions and behaviour on mobile, web and desktop applications.
analytics coppa crash-analytics crash-reports dashboard data data-ownership data-privacy feature-flags gdpr hipaa insights mobile-analytics product-analytics product-management push-notifications remote-configuration tracking user-feedback web-analytics
Last synced: 13 Mar 2025
https://github.com/microsoft/presidio
An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
anonymization data-anonymization data-masking data-obfuscation data-privacy data-redaction de-identification guardrails image-redactor named-entity-recognition nlp personally-identifiable-information phi pii pii-detection privacy python sensitive-data spacy transformers
Last synced: 13 May 2025
https://github.com/ibm/differential-privacy-library
Diffprivlib: The IBM Differential Privacy Library
data-privacy differential-privacy machine-learning python
Last synced: 21 Oct 2025
https://github.com/IBM/differential-privacy-library
Diffprivlib: The IBM Differential Privacy Library
data-privacy differential-privacy machine-learning python
Last synced: 27 Mar 2025
https://github.com/BUAA-BDA/OpenHuFu
OpenHuFu is an open-sourced data federation system to support collaborative queries over multi databases with security guarantee.
data-federation data-privacy data-security data-sharing differential-privacy federated-learning mpc multiparty-computation privacy-preserving secure-computation spatial-analysis spatial-data-analysis spatial-queries swarm-intelligence transportation
Last synced: 20 Nov 2025
https://github.com/privacytrustlab/ml_privacy_meter
Privacy Meter: An open-source library to audit data privacy in statistical and machine learning algorithms.
data-privacy data-protection data-protection-impact-assessment explainable-ai gdpr inference information-leakage machine-learning membership-inference-attack privacy privacy-audit
Last synced: 21 Oct 2025
https://github.com/ethyca/fides
The Privacy Engineering & Compliance Framework
data data-privacy data-privacy-compliance developer-tools gdpr hacktoberfest privacy-as-code python
Last synced: 06 Feb 2026
https://github.com/hypersign-protocol/hid-node
A permissionless blockchain network to manage digital identity and access rights
blockchain data-privacy did ssi
Last synced: 24 Dec 2025
https://github.com/bibinprathap/VeritasGraph
VeritasGraph: Enterprise-Grade Graph RAG for Secure, On-Premise AI with Verifiable Attribution
data-privacy enterprise-ai explainable-ai fine-tuning generative-ai generativeai graph-rag information-retrieval knowledge-graph langchain llamaindex llm lora multi-hop-reasoning neo4j nlp ollama on-premise question-answering rag
Last synced: 21 Sep 2025
https://github.com/pluginkollektiv/statify
Statify – statistics plugin for WordPress
dashboard-widget data-privacy hacktoberfest statistics web-statistics wordpress wordpress-plugin wp-plugin
Last synced: 09 Apr 2025
https://github.com/kiprotect/data-privacy-for-data-scientists
A workshop on data privacy methods for data scientists.
anonymization data-privacy data-science education jupyter-notebooks tutorial
Last synced: 26 Jun 2025
https://github.com/bibinprathap/veritasgraph
VeritasGraph: Enterprise-Grade Graph RAG for Secure, On-Premise AI with Verifiable Attribution
data-privacy enterprise-ai explainable-ai fine-tuning generative-ai generativeai graph-rag information-retrieval knowledge-graph langchain llamaindex llm lora multi-hop-reasoning neo4j nlp ollama on-premise question-answering rag
Last synced: 18 Sep 2025
https://github.com/datafog/datafog-python
Lightweight Python SDK for PII detection, redaction, and LLM guardrails, with fast regex defaults and optional NLP/OCR/Spark extras.
anonymization compliance data-privacy data-protection de-identification gliner guardrails llm ner nlp ocr pii pii-detection pii-redaction privacy-engineering python redaction sdk spacy
Last synced: 31 May 2026
https://github.com/nightfallai/nightfall_dlp_action
GitHub Data Loss Prevention (DLP) Action: Scan Pull Requests for sensitive data, like credentials & secrets, PII, credit card numbers, and more.
code-scanner data-loss-prevention data-privacy data-protection data-security devsecops github-action nightfall secrets-detection
Last synced: 16 Jan 2026
https://github.com/edyan/neuralyzer
Neuralyzer is a library and a command line tool to anonymize databases (by updating existing data or populating a table with fake data)
anonymisation anonymization anonymize data-generation data-generator data-privacy database dgpr private-life rgpd
Last synced: 04 Apr 2025
https://github.com/hharcolezi/multi-freq-ldpy
Multiple Frequency Estimation Under Local Differential Privacy in Python
data-privacy dbitflippm differential-privacy differential-privacy-algorithm frequency-estimation google ldp-protocols local-differential-privacy longitudinal-data microsoft multidimensional-data rappor
Last synced: 14 Jan 2026
https://github.com/ethyca/fidesops
Privacy as Code for DSAR Orchestration: Privacy Request automation to fulfill GDPR, CCPA, and LGPD data subject requests.
compliance compliance-as-code compliance-automation data-privacy gdpr gdpr-compliant privacy privacy-as-code
Last synced: 04 Apr 2025
https://github.com/yaojin17/Unlearning_LLM
[ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models"
copyright-protection data-privacy llm llm-unlearning machine-unlearning unlearning
Last synced: 24 Mar 2025
https://github.com/dsietz/test-data-generation
Test Data Generation
algorithm archconf data data-privacy generate json machine-learning markov-decision-processes nfjs privacy profile rust-lang testing
Last synced: 17 Mar 2025
https://github.com/k1m0ch1/big-brother-test
list of applications or some things that "probably" steal or broke the data privacy.
application-usage data-privacy hacktoberfest hactoberfest
Last synced: 30 Apr 2025
https://github.com/aliengiraffe/deidentify
Simple yet powerful tool for identifying and anonymizing personal information in various formats.
anonymization compliance data-anonymization data-masking data-privacy data-protection data-security deidentification gdpr go golang llm pii pii-detection privacy privacy-tools redaction security-tools sensitive-data zero-dependency
Last synced: 13 Mar 2026
https://github.com/nightfallai/nightfall-python-sdk
Python Data Loss Prevention (DLP) SDK - Nightfall Developer Platform
api data-classification data-loss-prevention data-privacy data-protection nightfall pii python sdk secrets-detection
Last synced: 16 Jan 2026
https://github.com/thyrisAI/safe-zone
TSZ (Thyris Safe Zone) is an open-source PII detection and guardrails engine that prevents sensitive data from leaking to LLMs and third-party APIs.
ai-security data-privacy data-security guardrails guardrails-redaction pii-detection secrets-detection zero-trust
Last synced: 20 Jan 2026
https://github.com/subrose/thorn
🌹 Thorn is an open-source, data privacy vault to store and manage PII in a fully compliant manner.
ccpa data-privacy encryption gdpr hipaa pci pci-dss privacy privacy-by-default privacy-by-design privacy-engineering security subrose
Last synced: 10 Mar 2026
https://github.com/nightfallai/nightfall-go-sdk
Go Data Loss Prevention (DLP) SDK - Nightfall Developer Platform
api data-loss-prevention data-privacy data-security dlp go golang sdk secrets-detection
Last synced: 13 Mar 2026
https://github.com/acuciureanu/spidertrap-rs
A simple trap for web crawlers
anti-bot anti-scraping bot-detection bot-trap crawler-trap cybersecurity data-privacy ethical-hacking intrusion-detection network-security rust server-defense spider-trap web-crawler web-monitoring web-robots web-scraping web-security web-spider website-protection
Last synced: 24 Jun 2025
https://github.com/nightfallai/nightfall-java-sdk
Java Data Loss Prevention (DLP) SDK - Nightfall Developer Platform
api data-loss-prevention data-privacy data-protection data-security dlp java nightfall secrets-detection
Last synced: 14 Jan 2026
https://github.com/senzing-garage/sz-semantics
Transform JSON output from Senzing SDK for use with graph technologies, semantics, and downstream LLM integration
context-engineering data-privacy entity-resolution entity-resolved-knowlege-graph ontology rdf semantic-layer semantics senzing-test-linux-darwin-windows senzing-test-weekly skos
Last synced: 28 Feb 2026
https://github.com/fvaleye/delta-buddy
Introducing Delta-Buddy: Your ultimate Delta Lake companion! 🚀 Streamline your data journey with an AI-powered chatbot. Ask Delta-Buddy anything about your Delta Lake.
chromadb data-privacy databricks-notebooks delta-lake dolly langchain llm python
Last synced: 14 Feb 2026
https://github.com/nightfallai/nightfall-nodejs-sdk
NodeJS Data Loss Prevention (DLP) SDK - Nightfall Developer Platform
api data-loss-prevention data-privacy data-protection dlp javascript nodejs sdk secrets-detection typescript
Last synced: 16 Jan 2026
https://github.com/lvndry/clausea
Transform complex legal documents into clear, actionable insights with AI-powered analysis.
agents ai compliance crawling data-privacy gdpr legal llm ocr rag summarization
Last synced: 18 Jan 2026
https://github.com/synthesized-io/synthesized-notebooks
Discover the art of enhancing your data using generative modelling in these notebooks.
data-privacy data-science generative-modelling ml notebooks synthetic-data
Last synced: 14 Jul 2025
https://github.com/arcana-network/storage
Arcana Storage SDK offers data privacy and enables dApp users to control any and all accesses to their data uploaded in the Arcana Store. Data owners can share, revoke, delete or transfer ownership to other users powered by blockchain.
data-privacy decentralized-storage distributed-storage private-nft web3
Last synced: 12 Jan 2026
https://github.com/strmprivacy/cli
This is the STRM Privacy Command Line Interface, to define and manage your privacy streams, data schemas, event contracts and much more.
cli data data-pipeline data-privacy data-privacy-compliance data-processing privacy
Last synced: 23 Jun 2025
https://github.com/cs-joy/blockchain_exploration
All about of Blockchain Technology - Especially about Data Privacy and Security
blockchain-technology cia data-privacy security
Last synced: 06 Feb 2026
https://github.com/dsietz/pbd
Privacy by Design SDK
actix-web best-practices data data-privacy development-kit nfjs pbd pbd-sdk privacy privacy-by-design rust rust-lang sdk sdk-rust strategies
Last synced: 09 Apr 2025
https://github.com/edoardottt/gdpr
General Data Protection Regulation
data data-privacy data-protection gdpr gdpr-checklist gdpr-compliant gdpr-component gdpr-consent gdpr-cookie gdpr-cookies gdpr-dashboard gdpr-presentation-slides gdpr-tracker general-data-protection-regulation privacy privacy-protection
Last synced: 04 Mar 2026
https://github.com/linkdd/secure-user-data
Infrastructure to store, manage and access user data in a very secure manner.
data-privacy data-protection microservice python3 security user-data
Last synced: 15 Sep 2025
https://github.com/sushegaad/semantic-privacy-guard
Semantic Privacy Guard: A Java middleware that intercepts text, identifies PII using a three-layer hybrid pipeline (Regex + Naive Bayes ML + Apache OpenNLP NER), and redacts it before it reaches an LLM or leaves the corporate network — with stream-based processing for memory-efficient handling of large files and log streams.
ai-firewall ai-safety compliance data-privacy eu-ai-act gdpr java java-library llm-privacy llm-security machine-learning maven middleware naive-bayes nlp pii-detection pii-redaction prompt-engineering regex zero-dependency
Last synced: 12 Apr 2026
https://github.com/zaahidali/data-privacy-fundamentals-by-ibm-on-cognitive-class-ai
Data Privacy Course By IBM on Cognitive Class.ai
cognitive-class cognitive-class-course data-privacy data-privacy-course ibm ibm-data-privacy-course
Last synced: 20 Feb 2026
https://github.com/nfdi4health/syndat-dashboard
A Dashboard for evaluation & visualization of synthetic patient level data
dashboard-application data-evaluation data-privacy data-quality synthetic-data
Last synced: 20 Feb 2026
https://github.com/amirhosseinhonardoust/teaching-neural-networks-to-imagine-tables
A comprehensive deep dive into how Variational Autoencoders (VAEs) learn to generate realistic synthetic tabular data. This project explores latent space learning, probabilistic modeling, and neural creativity, combining data privacy, interpretability, and generative AI techniques in a structured format.
autoencoder data-augmentation data-privacy data-science deep-learning explainable-ai feature-learning generative-model latent-space machine-learning neural-networks python pytorch representation-learning research simulation statistical-learning synthetic-data tabular-data vae
Last synced: 17 Nov 2025
https://github.com/yanisvdc/opendp_baseline
Baseline solution for the privacy part of the PETs for Public Health Challenge 2024
challenge data-privacy differential-privacy epidemiology open-source opendp privacy-enhancing-technologies
Last synced: 20 May 2026
https://github.com/dwain-barnes/flowise-private-doc-chat-rag-blog
A private, local RAG (Retrieval-Augmented Generation) system using Flowise, Ollama, and open-source LLMs to chat with your documents securely and offline.
data-privacy flowise flowise-ai local-llm offline-ai ollama open-source pdf-chatbot private-doc-chat rag retrieval-augmented-generation self-hosted-ai
Last synced: 02 May 2026
https://github.com/deco31416/facialtrustpay-deco31416
MVP designed for facilitating micropayments using facial recognition and blockchain technology, ensuring secure and fast transactions while prioritizing user privacy by not storing biometric data.
blockchain dapp data-privacy decentralized facial-recognition hash-based-identification micropayments mvp payments privacy security solidity
Last synced: 25 Jan 2026
https://github.com/recheck-io/vue-recheck-authorizer
A set of Vue.js components for authentication and data interaction with Recheck Platform.
data-exchange data-privacy digital-proofs recheck-authorizer recheck-io
Last synced: 20 Mar 2026
https://github.com/secnnet/encryptor
cryptography cryptography-library data-encryption data-privacy data-protection data-security encryption-algorithm encryption-library encryption-tool encryptor fernet-algorithm file-encryption file-encryption-python file-protection file-security password-based-encryption python-script scrypt-key-derivation secure-communications secure-file-storage
Last synced: 25 Feb 2025
https://github.com/ssciwr/mailcom
Recognize and pseudonymize named entities in emails
anonymization data-privacy llm-inference pseudonymization text-preprocessing
Last synced: 21 Apr 2025
https://github.com/tdiprima/visionguard
Detecting, masking, and protecting sensitive information in images
data-privacy dicom image-analysis image-processing medical-imaging ocr text-detection
Last synced: 26 Nov 2025
https://github.com/zlovtnik/dispo-rusty
Built to solve real-world SaaS and managed platform pain points—compliance, scale, and speed—dispo-rusty is an open-source foundation for secure, high-performance, tenant-isolated REST API services. Whether you're a fast-moving founder, a platform CTO, or an enterprise engineering team, dispo-rusty helps you reduce costs, onboard clients instantly.
actix-web ant-design compliance data-privacy database-isolation enterprise-security role-based-access-control rust saas
Last synced: 13 Apr 2026
https://github.com/zlovtnik/backend
Built to solve real-world SaaS and managed platform pain points—compliance, scale, and speed—dispo-rusty is an open-source foundation for secure, high-performance, tenant-isolated REST API services. Whether you're a fast-moving founder, a platform CTO, or an enterprise engineering team, dispo-rusty helps you reduce costs, onboard clients instantly,
actix-web compliance data-privacy database-isolation enterprise-security role-based-access-control rust saas
Last synced: 13 Apr 2026
https://github.com/dishine-digital-agency/dishine-data-safe-usb
Privacy is non-negotiable, but so is data-driven growth. This tool ensures no real phone numbers or IBANs are uploaded to external servers, allowing your AI models to perform grouping and trend detection on safe, anonymized datasets. A one-click Mac-native experience designed for non-technical teams.
anonymization consulting-tools data-privacy gdpr local-ai pii-redaction
Last synced: 14 Apr 2026
https://github.com/daniel-elston/data-privacy-and-cyber-security
A report detailing the current problems with Data Privacy, the GDPR and how Cyber Security can be improved.
consent cyber-security data-breach data-privacy data-science gdpr
Last synced: 19 Jan 2026
https://github.com/4xyy/cli-password-manager
A command-line tool for securely managing passwords. It allows users to store, retrieve, and generate strong passwords, all encrypted with a master key.
automation cli-tool credential-management cross-platform cybersecurity data-privacy encryption fernet-encryption key-management open-source password-generator password-manager python secure-storage security
Last synced: 15 Oct 2025
https://github.com/taoq-ai/wuming
无名 (The Nameless) is a zero-config PII detection and redaction library for Go. 75+ detectors across 14 locales with compliance presets for GDPR, HIPAA, LGPD, APPI, PIPL, and more.
anonymization compliance data-privacy data-protection gdpr go golang hexagonal-architecture hipaa open-source personally-identifiable-information pii pii-detection pii-redaction privacy regex security sensitive-data
Last synced: 01 Apr 2026
https://github.com/bespredel/encryption-form
A Laravel package to securely encrypt form fields on the client-side using public key encryption and decrypt them on the server-side using the private key.
client-side-encryption data-privacy encryption form-encryption laravel open-source php privacy-protection rsa-encryption security
Last synced: 25 Apr 2026
https://github.com/gigameshgarages/reefnet
Privacy Preserving Metadata Proof Powered Data Streaming Token Vaults with zkRollups on Ocean Protocol
balancer-pools circom data-privacy defi erc1620 erc20 metadata metadata-management ocean-protocol randao snarkjs starkware starkware-veedo-vdf streaming vdf zkrollup zokrates
Last synced: 29 Apr 2026
https://github.com/b-macker/naab-passage
🔒 Sovereign data gateway & PII protection - Zero leakage to LLMs and APIs with self-synthesizing architecture. HIPAA/GDPR compliant. Part of the NAAb Ecosystem.
anthropic api-gateway audit compliance data-privacy data-protection encryption gateway gdpr hipaa llm-proxy naab naab-ecosystem openai pii-protection polyglot privacy security soc2 zero-trust
Last synced: 06 Mar 2026
https://github.com/rabelais88/consent-nextjs
cookie and private data consent management in Next.js with lightning-fast setup
compliance cookie cookie-consent cookie-consent-banner data-consent data-privacy gdpr next nextjs
Last synced: 14 Mar 2026
https://github.com/akimuddinshaikh/data-governance-2
An ethical AI approach for liver disease detection using machine learning and deep learning models. The project ensures data privacy, fairness, and transparency, using the Indian Liver Patient Records dataset for scalable healthcare diagnostics
bias-mitigation data-privacy fairness transparency
Last synced: 26 Jan 2026
https://github.com/vypher-io/cli
Fast, parallelized CLI for detecting PII and PHI in files. Built for Finance (PCI DSS) and Healthcare (HIPAA) compliance.
cli compliance data-privacy devtools go golangci hipaa pci-dss phi pii sarif scanner security
Last synced: 14 Apr 2026
https://github.com/mele0/air2lung
SQL and Python pipeline for analyzing the impact of air pollution and lifestyle factors on lung disease using a synthetic UK cohort.
air-pollution data-privacy encryption k-anonymity lung-health-severity mysql python sql
Last synced: 19 Aug 2025
https://github.com/lucianoscarpaci/cryptography
Embark on a journey of cryptographic exploration as we delve into the intricate worlds of RSA and ElGamal encryption, crafted entirely from scratch without the aid of OpenSSL. Delve into the inner workings of these fundamental encryption algorithms, unraveling the complexities of secure data transmission.
algorithm-implementation asymmetric-cryptography cryptographic-algorithms cryptographic-security cryptography data-privacy elgamal-cryptosystem encryption-de key-generation rsa-algorithm secure-communication
Last synced: 14 Aug 2025
https://github.com/sergiovzambelli/clearview-ai-gdpr-case-study
Case study on Clearview AI's potential GDPR violations and the impact of the AI Act. This project, done as part of our cybersecurity master's program at the University of Pisa, includes an analysis of Clearview's data processing practices and their legal implications under European law.
ai-ethics clearview-ai cybersecurity data-privacy european-law gdpr
Last synced: 26 Feb 2026
https://github.com/xdcobra/react-native-sherpa-onnx
Offline Speech Processing SDK for React Native using sherpa-onnx. Supports Speech-to-Text, Text-to-Speech, Speaker Diarization, Speech Enhancement, Source Separation & VAD.
android ctc data-privacy funasr ios paraformer react-native sdk sensevoice sherpa-onnx source-separation speaker-diarization speech-enhancement speech-to-text text-to-speech vad voice-activity-detection whisper zipformer
Last synced: 02 Apr 2026
https://github.com/zafrem/pattern-engine
A comprehensive collection of regex patterns and verification functions for detecting and validating PII and sensitive data across multiple global regions and languages.
compliance data-privacy multi-region pii-detection regex-patterns security-tooling sensitive-data verification-functions
Last synced: 11 Feb 2026
https://github.com/zafrem/data-detector
Data-detector is a Python-based PII detection and protection framework featuring multi-language NLP support, RAG security, and data tokenization capabilities.
data-privacy detector docker fake hash masking nlp pii pii-detection rag rag-security synthetic-data testpypi
Last synced: 21 Feb 2026
https://github.com/dwain-barnes/local-low-code-chatbot-ollama-flowise
Create a local, customizable chatbot using Flowise's visual builder and Ollama's open-source LLMs. This low-code project ensures privacy, scalability, and interactive AI features through an easy-to-use workflow. Ideal for developers seeking a secure, efficient chatbot solution
ai-chatbot ai-workflow chatbot data-privacy flowise llm local-chatbot low-code ollama open-source privacy-focused
Last synced: 03 Apr 2025
https://github.com/faisalzia-bit1/ethstorage-ceremony-x-earn-by-abhi
🚀 Join the EthStorage V1 Trusted Setup Ceremony to enhance blockchain security while earning rewards through a straightforward, step-by-step setup process.
blockchain ceremony cloud-storage crypto data-privacy decentralized-storage digital-assets distributed-systems earn-rewards eth-storage ethereum incentives peer-to-peer smart-contracts web3
Last synced: 13 Sep 2025
https://github.com/datareporter-gmbh/module-core
Core Magento2 module containing base containers for config and news
Last synced: 16 Jan 2026
https://github.com/dahmansphi/differential_privacy_with_ai_and_ml
Review of Data Privacy Techniques: Concepts, Scenarios & Architectures, Simulations, Challenges, and Future Directions
ai data-privacy differential-privacy machine-learning
Last synced: 31 Mar 2025
https://github.com/ai-sdc/sumit
SUMiT: Statistical Uncertainty Management Toolkit
data-privacy privacy statistical-disclosure-control
Last synced: 21 Feb 2026
https://github.com/filiprokita/tobase64
This Python program encodes a file in base64 format and saves the result to a new file with a ".b64" extension. It is a command-line tool that can be used to automate file encoding tasks.
base64 command-line data data-conversion data-manipulation data-privacy data-prottection data-security encoding file file-conversion file-handling python python-script python3 tobase64
Last synced: 30 Jun 2025