Projects in Awesome Lists tagged with data-integrity
A curated list of projects in awesome lists tagged with data-integrity.
https://github.com/kanmaytacker/fundamentals
A collection of tutorials on computer fundamentals
data-integrity database mysql normalization query-optimization relational-databases sql
Last synced: 22 Jan 2026
https://github.com/trapexit/scorch
Silent CORruption CHecker and filesystem audit tool
bitrot corruption data-integrity filesystem md5 md5sum sha1sum
Last synced: 02 Mar 2026
https://github.com/datajoint/datajoint-python
Relational data pipelines for the science lab
data-engineering data-integrity data-lineage data-pipelines datajoint declarative metadata-management mysql neuroscience object-storage postgresql python relational-model reproducibility research-software schema-management scientific-computing workflow-management
Last synced: 17 Feb 2026
https://laktak.github.io/chkbit/
Check your files for data corruption and run quick file deduplication
backup bitrot-detection btrfs cloud-backup data-degradation data-integrity dedup dedupe deduper deduplication disk-check storage-media
Last synced: 03 Apr 2026
https://github.com/laktak/chkbit
Check your files for data corruption and run quick file deduplication
backup bitrot-detection btrfs cloud-backup data-degradation data-integrity dedup dedupe deduper deduplication disk-check storage-media
Last synced: 04 Apr 2025
https://github.com/anishkny/integrify
🤝 Enforce referential and data integrity in Cloud Firestore using triggers
data-integrity firebase firebase-functions firebase-trigger firestore referential-integrity serverless
Last synced: 09 Apr 2025
https://github.com/serradura/u-attributes
Create "immutable" objects with no setters, just getters.
activemodel change-detection data-integrity data-validation getters immutability no-setters ruby ruby-gem
Last synced: 07 Apr 2025
https://github.com/great-expectations/great_expectations_action
A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.
actions continuous-integration data-integrity data-quality data-science mlops
Last synced: 07 Apr 2025
https://github.com/s-d-adams/muxfs
A mirroring, checksumming, and self-healing filesystem layer for OpenBSD.
c data-integrity fuse-filesystem openbsd
Last synced: 10 Apr 2025
https://github.com/zachalam/bitfact
🛡️ Robust data integrity tool. Prove data, text, & files using the Ethereum blockchain.
blockchain crypto data-integrity ethereum ethereum-blockchain fingerprint fingerprint-data hashing javascript-library
Last synced: 01 Jul 2025
https://github.com/fair-research/bdbag
Big Data Bag Utilities
bagit checksum data-integrity file-container file-transfer hashing identifiers profile-validation pypi python research-object ro-profiles
Last synced: 21 Oct 2025
https://github.com/cinar/checker
Effortless input validation in Go with the power of struct tags. No dependencies, just pure simplicity. ✨ See how! 👀
checker customizable data-integrity data-validation form-validation go golang input-validation library lightweight localization no-dependencies normalization security struct-tags validation validator
Last synced: 12 Jan 2026
https://github.com/byte271/6cy
High-performance, streaming-first container format with per-block codec polymorphism and robust data recoverability. Reference implementation in Rust.
codec-polymorphism compression container-format data-integrity lz4 rust specification storage-engine streaming-data zstd
Last synced: 23 Feb 2026
https://github.com/trustbloc/vc-go
Verifiable Credentials (and related code) for Go
data-integrity go jwt sd-jwt verifiable-credentials
Last synced: 15 Apr 2025
https://github.com/averagehoarder/flacr
Multithreaded script to recursively recompress and verify FLAC files and calculate ReplayGain tags
audioconverter data-integrity flac-files multithreading replaygain-tags
Last synced: 13 May 2025
https://github.com/tiagohm/checkdigit
A set of check digit algorithms implemented in Dart
check-digit cpf damm dart data-integrity iban isbn isin luhn verhoeff
Last synced: 10 Aug 2025
https://github.com/serviejs/keysign
Data signing and verification for rotating credentials and algorithms
cryptography data-integrity hmac hmac-sha256 sign
Last synced: 19 Jul 2025
https://github.com/tsmx/object-hmac
Create and verify HMACs for JSON objects.
data-integrity hash hmac json json-objects object objects-hmac sha-256 sign verify
Last synced: 24 Aug 2025
https://github.com/w3c/vc-di-quantum-resistant
Data Integrity Quantum Safe Cryptography Suite
cryptography data-integrity quantum vc verifiable-credentials w3c
Last synced: 29 May 2026
https://github.com/itzshoaib/hashtegrity
A library for generating hashes, validating data integrity, monitoring file and directory integrity, and checking off-chain data integrity
crypto-hash data data-integrity hacktoberfest hash integrity
Last synced: 07 Mar 2026
https://github.com/inc44/thesync
Synchronize files between a source directory and a destination directory
automation cli command-line-tool conflict-resolution cross-platform data-backup data-integrity file-synchronization linux macos nodependence parallel-processing python windows
Last synced: 18 Apr 2026
https://github.com/pirate-emperor/enigma
Enigma is a Java-based cryptographic system that integrates symmetric (AES, DES, Blowfish), asymmetric (Diffie-Hellman, RSA, DSA), and hashing algorithms (MD5, SHA-1, SHA-224, SHA-256, SHA-384, SHA-512). It ensures data confidentiality, integrity, and authentication with modern cryptographic standards.
aes asymmetric-algorithm authentication blowfish cryptography data-integrity diffie-hellman encryption encryption-decryption enigma hashing java md5 rsa security sha-256 symmetric-algorithms
Last synced: 27 Jan 2026
https://github.com/i5heu/ouroboros-db
A versioned, distributed key-value store designed with a focus on data integrity. Each value boasts a comprehensive history, ensuring eventual consistency across the system. It features seamless merging capabilities to harmonize divergent data states.
browser-compatible browser-database data-integrity data-merging distributed-systems key-value-store versioned-datastore- wasm
Last synced: 07 Mar 2026
https://github.com/ndxdeveloper/slashsum
Fast multi-threaded checksum calculator (CRC32, MD5, SHA1, SHA256, SHA512)
checksum cli-tool concurrent-processing crc32 cross-platform cryptography data-integrity file-integrity file-validation file-verification hash-calculator md5 multi-threaded parallel-computing performance rust sha1 sha256 sha512 utility
Last synced: 08 Apr 2026
https://github.com/blievrouw/crc64iso
64-bit CRC (ISO 3309) checksum generator using Python 3.x
checksum crc-64 crc64 cyclic-redundancy-check data-integrity digest iso3309 python3
Last synced: 20 Feb 2026
https://github.com/welovejeff/tamper-evident-verification
Tamper Signal: signed receipts for vibe-coded data pipelines. Proves nobody changed your data, and shows the exact link if they did.
analytics data-integrity data-pipelines ed25519 hash-chain provenance python signed-receipts tamper-evident tamper-signal verification vibe-coding
Last synced: 11 Jun 2026
https://github.com/mohamedsci/jira-to-azure-devops-migration-script
A robust Python script to migrate issues from Jira to Azure DevOps, ensuring data integrity and formatting consistency.
azure azuredevops csv-processing data-integrity data-migration-automation formatting-consistency jira jira-to-azure-devops-migration-script migration migration-tool python-script
Last synced: 16 May 2026
https://github.com/lucianoscarpaci/ethereum-cryptography
This project uses secp256k1, keccak256 hashing, and BIP39 for generating vanity addresses, implementing secure cryptographic operations and creating mnemonic phrases.
algorithm-implementation bip39 blockchain-technology cryptocurrency-wallets data-integrity digital-signatures hash-functions keccak public-key-cryptography secp256k1 secure-communication security-protocols vanity-address
Last synced: 27 Aug 2025
https://github.com/glueops/vault-backup-validator
This repository contains a Go application that validates HashiCorp Vault backups. It provides an API endpoint to ensure the integrity and validity of Vault backup snapshots using provided keys and expected data values. The tool assists in verifying that backups can be restored correctly and data remains consistent.
allow-auto-merge api backup backup-validation backup-validator data-integrity glueops-platform go go-application hashicorp hashicorp-vault rest-api validation vault vault-backup vault-backup-validation
Last synced: 29 May 2026
https://github.com/ivonnebenitesrodriguez/bookingform
Full-Stack Web Application
cia cia-triad cybersecurity-projects data-integrity entity-relationship-models github-pages javascript mvc-architecture postgres rails react react-bootstrap react-icons reliability render ruby security security-principles version-control
Last synced: 05 Apr 2026
https://github.com/kytmanov/semantic-dog
Your NAS media library’s smart watchdog. Parses images, RAW, videos, PDFs & docs to catch real corruption — not just checksums. CLI + FastAPI + alerts + Claude-ready MCP for AI agents.
ai-agents bitrot-detection cli corruption-detection data-integrity file-integrity-monitoring homelab integrity-checker mcp media-library nas python self-hosted
Last synced: 22 Apr 2026
https://github.com/tibolpol/sealgood
100% DIY Document Authenticator
bash-script data-integrity digital-signature diy-tool document-signing document-verification non-repudiation openssl pdf-signature timestamping tsa
Last synced: 28 Apr 2026
https://github.com/apeleghq/ts-memcached-encrypted-store
A secure and efficient encrypted store of keys and values for memcached
caching data-integrity encrypted encrypted-store memcached store
Last synced: 24 Jul 2025
https://github.com/abdulvahapmutlu/reprokit-ml
One-command determinism + manifest for ML projects.
cli data-integrity dataset-versioning determinism github-actions hashing jax manifest merkle-tree mlops pre-commit provenance python pytorch reproducibility reproducible-research tensorflow xxhash
Last synced: 14 Jan 2026
https://github.com/labex-labs/sqlite-intermediate-to-advanced
In this course, delve into advanced SQLite techniques. Master constraints, indexing, joins, subqueries, transactions, triggers, views, full-text search, JSON, backups, PRAGMA tuning, CTEs, window functions, and more!
advanced-sql course data-analysis data-integrity data-manipulation data-modeling database database-design hands-on labex labs performance-tuning programming query-optimization relational-database schema-management sql sqlite stored-procedures transaction-management
Last synced: 18 May 2026
https://github.com/pjsheals/aivo-divm
DIVM — Data Integrity & Verification Methodology v1.0. Governance-grade framework for reproducible AI visibility assurance.
ai-visibility ai-visibility-data ai-visibility-tracking aivo aivo-standard data-integrity divm governance llm reproducibility verification
Last synced: 31 May 2026
https://github.com/imsatyasaiteja/visual-crypto-coding
Encrypts and decrypts an image using four of the most important encryption methods
aes-encryption assymetric-key-cryptography data-integrity des-encryption encryption-decryption java rsa-encryption symmetric-key-cryptography triple-des visual-cryptographic-schemes
Last synced: 25 Feb 2025
https://github.com/fabriciopsouza/metacognition-framework
Universal prompt engineering framework for critical AI validation
ai-validation business-intelligence chatgpt claude data-integrity data-science llm metacognition prompt-engineering python quality-assurance
Last synced: 31 May 2026
https://github.com/jhartyharr23/fors33-verifier
Cryptographic verifier for high-integrity data. Prove provenance and ensure immutable audit trails. Automate at fors33.com/products
audit-trail chain-of-custody compliance cryptographic-attestation cryptography cybersecurity cybersecurity-tools data-integrity data-lineage data-protection data-provenance digital-forensics evidence-management forensics immutable-ledger python security-tools sha256 tamper-evident zero-trust
Last synced: 01 Mar 2026
https://git.crates.im/mirrors/muxfs
A mirroring, checksumming, and self-healing filesystem layer for OpenBSD.
c data-integrity fuse-filesystem openbsd
Last synced: 08 Oct 2025
https://github.com/therealdreg/disk_fill_test
CLI tool that fills a target drive with a single large file, showing live progress/ETA, then reads it back to verify data integrity via SHA-256. Reports precise write/read times and throughput, and deletes the test file by default
chatgpt cross-platform data-integrity disk-fill disk-verify storage-benchmark storage-test
Last synced: 26 May 2026
https://github.com/masonedotcloud/xml-validator
XML Validator: a web app for uploading and validating XML files against XSD or DTD.
data-integrity dtd javascript php validator web-app xml xml-schemma xsd
Last synced: 18 May 2026
https://github.com/srodriguezamarillo/mundotrade
SQL project for managing equipment and Internet links for the logistics company "Mundo Trade". Implements advanced SQL techniques to ensure the integrity and efficiency of the system and the correct administration of data.
data-integrity database-management equipment-management logistics optimization sql sql-functions stored-procedures triggers views
Last synced: 23 Jan 2026
https://github.com/salahealer9/salah-gherbi-3i_atlas_anomaly_2025
v2.6 refines 3I/ATLAS analysis with automated peak detection, improved Δv modelling, Ni/Fe compositional anomaly, colour–brightness evolution, and multi-station validation of post-perihelion noise events.
astronomy blockchain blockchain-verification color-indices comet-science cryptographic-timestamping data-integrity delta-v interstellar-object minor-planet-center non-grav-forces non-gravitational-forces open-science optical-acceleration optical-analysis photometry post-perihelion-evolution python-data-analysis reproducible-research trajectory-modelling
Last synced: 01 Mar 2026
https://github.com/mehnaz2004/data-cleaning-casestudy
This repository demonstrates data cleaning with a layoffs dataset. It covers handling missing values, detecting outliers, and encoding categorical data, using visualizations like boxplots and distplots to enhance data quality. Check out the code to see these techniques in action.
categorical-data-encoding data-cleaning data-integrity data-visualization missing-value-handling outlier-detection-and-removal pandas seaborn sklearn
Last synced: 01 Mar 2026
https://github.com/diegopau/powerdirhasher
Comprehensive Windows data integrity tool to hash, verify and sync hashes for your files, keeping a history of any file modifications for a given folder or set of folders. Differentiates file modification from silent file corruption. Uses PowerShell. No GUI.
data-integrity hashes hashing integrity powershell sha-256
Last synced: 11 Sep 2025
https://github.com/vspaone/cephalodb
CephaloDB is an intelligent, real-time document-based database system designed to build and manage complex relationships dynamically. Inspired by the adaptability of an octopus, CephaloDB leverages fuzzy logic and automation to establish and update connections between nodes in documents, ensuring flexibility and efficiency.
access-control adaptive-database cephalodb data-integrity data-management database docker document-database dynamic-relations encryption event-driven express fuzzy-logic json-storage nodejs real-time relationships scalable-database secure-database socketio
Last synced: 11 Apr 2026
https://github.com/cmangun/agentic-artifacts
Artifact manifests and provenance rules for integrity-preserved agent outputs
agentic-systems data-integrity data-lineage manifest provenance slsa verifiable-ai
Last synced: 03 May 2026
https://github.com/altxriainc/janus
Janus is a Python library designed for robust data validation, serialization, and schema versioning. It offers a comprehensive toolkit to handle input validation, data transformation, and API schema evolution, making it ideal for modern Python applications that require data integrity and compatibility.
api-schema data-integrity data-validation input-validation json-serialization nested-data-validation python-library python-validation schema-versioning yaml-serialization
Last synced: 06 Apr 2026
https://github.com/entropy-tamer/reynard-validation
Data validation and schema management for Reynard - comprehensive validation utilities with type safety
data-integrity data-validation error-handling form-validation frontend input-validation javascript json-schema reynard runtime-validation sanitization schema security solidjs type-safety typescript validation web-development
Last synced: 04 May 2026
https://github.com/redx94/quantumblockchainautomation
A revolutionary initiative combining Quantum Computing and Blockchain Technology to establish a secure, decentralized system for quantum-powered data integrity. This project pioneers the fusion of advanced quantum randomness with blockchain's immutability, ensuring unmatched reliability and security for distributed data processes
blockchain blockchain-immutability cryptography data-integrity decentralization distributed-systems ethereum flask-dashboard ganache ibm-quantum-experience qiskit quantum quantum-blockchain-fusion quantum-randomness quantum-security quatum-cryptography smart-contracts zeromq
Last synced: 24 Apr 2026
https://github.com/pzingg/integrity_proofs
Some functions to implement Verifiable Credential Data Integrity 1.0 using ED25519 and JCS
canonicalization data-integrity decentralized-identifiers ed25519 fediverse integrity-proofs jcs jcs-eddsa-2022 multikey
Last synced: 07 Jul 2025
https://github.com/aliciusschroeder/json_segments
A robust C library for efficient segmentation and reassembly of large JSON objects in IoT and resource-constrained environments. Ensures data integrity and efficiency in network communication.
cjson data-integrity data-processing data-segmentation embedded-systems iot-communication json network-communication resource-constrained-devices
Last synced: 29 Apr 2026
https://github.com/dlr-pa/file_hook_server_timestamping
`file_hook_server_timestamping.py` is a file hook for a GitLab instance. It is used to automatically create timestamped commits for every push to the default branch of a repository. This can be useful for a number of reasons.
data-integrity gitlab gitlab-file-hook gpg python server-side timestamping
Last synced: 30 Apr 2026
https://github.com/filiprokita/tosha256
This is a Python script that prompts the user to enter a string of text and then calculates its SHA-256 hash using the hashlib library. The resulting hash is then printed to the console.
cryptography data-integrity hash hashing python security sha256 tosha
Last synced: 24 Mar 2025
https://github.com/ramy-badr-ahmed/merkle-dag-matlab
Merkle-Directed Acyclic Graph (DAG) in MATLAB - https://doi.org/10.5281/zenodo.12808889
cryptographic-hash-functions dag data-integrity data-structures-and-algorithms data-verification directed-acyclic-graph graph-algorithms graph-algorithms-and-data-sturcture integrity-check java-security matlab matlab-oop merkle-dag
Last synced: 30 Mar 2025
https://github.com/zyd8/hash-transfer-cli
A command-line tool for securely copying and moving files between locations. This tool supports a variety of hash algorithms to ensure data integrity during the transfer process.
csharp data-integrity dotnet hashing python
Last synced: 10 Apr 2026