{"id":19338447,"url":"https://github.com/gbv/k10plus-cleanup","last_synced_at":"2026-05-16T02:38:52.583Z","repository":{"id":188658502,"uuid":"679164634","full_name":"gbv/K10plus-cleanup","owner":"gbv","description":"Scripts to clean up PICA records in K10plus catalogue","archived":false,"fork":false,"pushed_at":"2023-08-28T10:43:32.000Z","size":10,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":2,"default_branch":"main","last_synced_at":"2025-01-06T10:31:02.841Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Shell","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/gbv.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null}},"created_at":"2023-08-16T08:33:45.000Z","updated_at":"2023-08-18T07:30:49.000Z","dependencies_parsed_at":"2023-08-16T10:23:36.306Z","dependency_job_id":null,"html_url":"https://github.com/gbv/K10plus-cleanup","commit_stats":null,"previous_names":["gbv/k10plus-cleanup"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gbv%2FK10plus-cleanup","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gbv%2FK10plus-cleanup/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gbv%2FK10plus-cleanup/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gbv%2FK10plus-cleanup/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/gbv","download_url":"https://codeload.github.com/gbv/K10plus-cleanup/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":240441952,"owners_count":19801793,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-10T03:17:29.434Z","updated_at":"2026-05-16T02:38:47.550Z","avatar_url":"https://github.com/gbv.png","language":"Shell","funding_links":[],"categories":[],"sub_categories":[],"readme":"# K10plus cleanup\n\nThis repository contains scripts to clean up PICA records in K10plus catalogue.\n\nAnalysis of K10plus catalogue with **QA Catalogue** reveals errors such as\ntypos and wrongly used fields. Some of these can automatically be fixed using\nCatmandu and [Catmandu::PICA].\n\n## Installation\n\nRequires Catmandu and current version of [Catmandu::PICA] as listed in `cpanfile`. Install with:\n\n    cpanm --installdeps .\n\n## Usage\n\nCreate or reuse a `.fix` file in Catmandu fix language. Either call `catmandu`\nas documented with selected data source (e.g. records retrieved via unAPI or\nSRU or a plain PICA dump file) or use the tiny shell script `fix`:\n\n    ./fix $FIX_FILE [$INPUT_FILE] [$OUTPUT_FILE]\n\nThe default output file name is fix file name with extension changed to `.patch`changed to .\n\nSome file extensions of input file are detected:\n\n- `*.dat.gz` is gzipped [normalized PICA]\n- `*.dat` is [normalized PICA]\n- everything else is [PICA Plain]\n\nScript `kxpo` can be used to get K10plus records via SRU in online expansion form:\n\n    ./kxpo 1543420281                           # get via PPN\n    ./kxpo pica.isb=9783894017316 --total 1     # get first matching record by ISBN\n\nTo validate a Patch file and give statistics run (given `$patchfile` has extension `.patch`):\n\n    picadata count $patchfile\n    picadata fields $patchfile\n\n## Examples\n\nSee `*.fix` files in this repository for examples:\n\nfix file | description | JIRA issue\n---------|-------------|------------\n[ddc-edition.fix](ddc-edition.fix) | unify DDC edition statementin `045F$e` | \u003chttps://jira.gbv.de/browse/CBS-1765\u003e\n[bk-74.50.fix](bk-74.50.fix) | deleted BK class `74.50X` | \u003chttps://jira.gbv.de/browse/CBS-1766\u003e\n[bk-remove-invalid.fix](bk-remove-invalid.fix) | remove invalid BK notation `XX.XX` | \u003chttps://jira.gbv.de/browse/CBS-1767\u003e\n\n## See also\n\n- [Einführung in die Verarbeitung von PICA-Daten](https://pro4bib.github.io/pica/)\n  with a section on editing PICA with Catmandu::PICA\n\n[Catmandu::PICA]: https://metacpan.org/release/Catmandu-PICA\n[normalized PICA]: https://format.gbv.de/pica/normalized\n[PICA Plain]: https://format.gbv.de/pica/plain\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgbv%2Fk10plus-cleanup","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fgbv%2Fk10plus-cleanup","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgbv%2Fk10plus-cleanup/lists"}