{"id":19260916,"url":"https://github.com/mitre/data-owner-tools","last_synced_at":"2025-04-21T17:30:32.067Z","repository":{"id":37898989,"uuid":"246928169","full_name":"mitre/data-owner-tools","owner":"mitre","description":"Tools for the Childhood Obesity Data Initiative (CODI) data owners and partners to use in record linkage","archived":false,"fork":false,"pushed_at":"2024-03-20T16:59:27.000Z","size":383,"stargazers_count":5,"open_issues_count":10,"forks_count":8,"subscribers_count":8,"default_branch":"master","last_synced_at":"2025-04-01T14:45:51.985Z","etag":null,"topics":["codi"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/mitre.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2020-03-12T20:48:03.000Z","updated_at":"2023-07-12T01:43:40.000Z","dependencies_parsed_at":"2024-11-09T19:24:24.539Z","dependency_job_id":"aa68956a-0675-43ce-95e1-d9286833a3aa","html_url":"https://github.com/mitre/data-owner-tools","commit_stats":null,"previous_names":[],"tags_count":16,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mitre%2Fdata-owner-tools","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mitre%2Fdata-owner-tools/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mitre%2Fdata-owner-tools/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mitre%2Fdata-owner-tools/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/mitre","download_url":"https://codeload.github.com/mitre/data-owner-tools/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":250100138,"owners_count":21374878,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["codi"],"created_at":"2024-11-09T19:23:27.768Z","updated_at":"2025-04-21T17:30:31.817Z","avatar_url":"https://github.com/mitre.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Data Owner Tools\n\n[![.github/workflows/style.yml](https://github.com/mitre/data-owner-tools/actions/workflows/style.yml/badge.svg)](https://github.com/mitre/data-owner-tools/actions/workflows/style.yml)\n\nTools for Clinical and Community Data Initiative (CODI) Data Owners to extract personally identifiable information (PII) from the CODI Data Model and garble PII to send to the linkage agent for matching. These tools facilitate hashing and Bloom filter creation part of a Privacy-Preserving Record Linkage (PPRL) process.\n\nThis software package is specifically for use by CODI \"Data Owners\" and \"Data Partners\" to garble their data, which is then sent to a Linkage Agent, a third-party organization that links the hashes (see \"PPRL Overivew/CODI Roles\" in the wiki for more information). To view the software package used by the CODI Linkage Agent, see [Linkage Agent Tools](https://github.com/mitre/linkage-agent-tools).\n\nFor more information about PPRL and CODI, visit the wiki: https://github.com/mitre/data-owner-tools/wiki/PPRL-Overview\n\n![pprl_example](https://user-images.githubusercontent.com/13512036/208981398-a3e206b3-5366-494e-99bd-63d4bc8ea27f.png)\n\n\n## Quick Start\nThis section provides a very brief overview of the sequence of steps required of a Data Owner within CODI. For more detailed instructions, see our [wiki](https://github.com/mitre/data-owner-tools/wiki).\n\n#### Installation\nDetails at: https://github.com/mitre/data-owner-tools/wiki/Installation\n\n```sh\ngit clone https://github.com/mitre/data-owner-tools.git\ncd data-owner-tools/\npip install -r requirements.txt\n```\n\n#### Extracting Data from a CODI Record Linkage Data Model\nDetails at: https://github.com/mitre/data-owner-tools/wiki/Data-Extraction,-Validation,-and-Cleaning\n```sh\npython extract.py -s v2 postgresql://codi:codi@localhost/codi\n```\n\n#### Garbling PII\nDetails at: https://github.com/mitre/data-owner-tools/wiki/Garbling-PII\n```sh\npython garble.py temp-data/pii.csv ./example-schema/ deidentification_secret.txt\n```\n\n#### Mapping LINKIDs to PATIDs\nDetails at: https://github.com/mitre/data-owner-tools/wiki/Mapping-LINK-IDs-to-PATIDs\n```sh\npython linkid_to_patid.py --sourcefile pii-20220304.csv --linkszip sitename.zip --hhsourcefile households_pii-20220304.csv --hhlinkszip sitename_households.zip\n```\n\n\n## Notice\n\nCopyright 2020-2023 The MITRE Corporation.\n\nApproved for Public Release; Distribution Unlimited. Case Number 19-2008\n\n#### Licence\n\n[Apache License 2.0](https://github.com/mitre/data-owner-tools/blob/master/LICENSE)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmitre%2Fdata-owner-tools","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmitre%2Fdata-owner-tools","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmitre%2Fdata-owner-tools/lists"}