https://github.com/gbv/k10plus-ddc
Analyze, convert and publish DDC numbers found in the K10plus catalog
- Host: GitHub
- URL: https://github.com/gbv/k10plus-ddc
- Owner: gbv
- Created: 2023-08-14T09:23:10.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-04-10T13:53:04.000Z (9 months ago)
- Last Synced: 2024-11-10T03:18:06.312Z (2 months ago)
- Topics: coli-conc
- Language: JavaScript
- Homepage:
- Size: 1.95 MB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 2
Metadata Files:
- Readme: README.md
# K10plus DDC
This repository contains scripts to analyze, convert and publish Dewey Decimal Classification (DDC) numbers found in the K10plus catalogue. The analysis is mainly based on [coli-ana] DDC number decomposition.
[coli-ana]: https://github.com/gbv/coli-ana
[K10plus Subjects]: https://github.com/gbv/k10plus-subjects
[jskos]: https://format.gbv.de/jskos

**See the subdirectory `publication` for the script to generate the data publication.**
## Installation
~~~sh
npm ci
~~~

## Usage
### analyze.js
Reads a list of (numerically sorted) DDC numbers and generates an analysis with coli-ana.
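
The following is a minimal sketch of that idea, not the code of `analyze.js` itself. It assumes one DDC number per input line and a coli-ana endpoint at `https://coli-ana.gbv.de/analyze` that takes a `notation` query parameter; the function names `parseNotations`, `analysisUrl` and `analyzeAll` are hypothetical, and the HTTP client is passed in so the sketch does not depend on a particular library.

```javascript
// Assumed coli-ana endpoint; not stated in this README, adjust as needed
const DEFAULT_API = "https://coli-ana.gbv.de/analyze"

// Split input text into DDC numbers (assumption: one number per line)
function parseNotations(text) {
  return text.split("\n").map(line => line.trim()).filter(line => line !== "")
}

// Build the request URL for one DDC number (assumed `notation` query parameter)
function analysisUrl(notation, api = DEFAULT_API) {
  const url = new URL(api)
  url.searchParams.set("notation", notation)
  return url.toString()
}

// Analyze the numbers one by one, keeping input order.
// `fetchJson` is any async function that maps a URL to parsed JSON.
async function analyzeAll(notations, fetchJson, api = DEFAULT_API) {
  const results = []
  for (const notation of notations) {
    const analysis = await fetchJson(analysisUrl(notation, api))
    results.push({ notation, analysis })
  }
  return results
}
```

With Node.js 18 or later, `fetchJson` could be `url => fetch(url).then(res => res.json())`.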
### k10plus-patch.js
The script `bin/k10plus-patch.js`
1. reads PICA+ records (or PPNs to retrieve records from K10plus)
2. extracts DDC fields from the records
3. retrieves DDC analysis (cached in a local database)
4. and emits [PICA Patch](https://format.gbv.de/pica/patch/specification) files to modify records

~~~
Usage: k10plus-patch [options] < file

Check and extend DDC numbers in PICA records of K10plus catalogue

Options:
  -a, --api       coli-ana API endpoint
  -c, --continue  continue after given PPN (expect sorted)
  -f, --format    PICA+ serialization (default: plain)
  -i, --input     input file (default: - for STDIN)
  -d, --database  optional SQLite file for caching
  -p, --ppns      input is list of PPNs instead of PICA records
  -h, --help      display help for command
~~~

### simplify-for-pdf.jq
Given the full analysis from coli-ana API in JSKOS format as published at
, this jq script can be used to
simplify the JSKOS records for creation of PDF files for each DDC number:

~~~sh
zcat ddc-decomposition.ndjson.gz | jq -c -f simplify-for-pdf.jq -c > ddc-pdf-data.ndjson
~~~

### count.js
Calculates the frequency of individual DDC elements in the analysis result and emits it as CSV or as a JSKOS concept list.
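
As a rough illustration of this kind of counting (not the actual `count.js`), the sketch below tallies element notations across analysis records and serializes the result as CSV. It assumes each record carries a JSKOS `memberList` whose entries have a `notation` array; the function names `countElements` and `toCsv` and the CSV column names are hypothetical.

```javascript
// Count how often each DDC element occurs across analysis records.
// Assumption: each record has a `memberList` of concepts with a `notation` array.
function countElements(records) {
  const counts = new Map()
  for (const record of records) {
    for (const member of record.memberList || []) {
      const notation = member && member.notation && member.notation[0]
      if (!notation) continue // skip null entries and members without notation
      counts.set(notation, (counts.get(notation) || 0) + 1)
    }
  }
  return counts
}

// Serialize counts as CSV, most frequent first (ties ordered by notation)
function toCsv(counts) {
  const rows = [...counts.entries()]
    .sort((a, b) => b[1] - a[1] || a[0].localeCompare(b[0]))
    .map(([notation, count]) => `${notation},${count}`)
  return ["notation,count", ...rows].join("\n")
}
```

Records could be read line by line from an NDJSON file, parsed with `JSON.parse`, and passed to `countElements` as an array.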
## Data in this repository
- `ddcs`: sorted DDC numbers found in K10plus, each with its number of occurrences. The data is generated from [K10plus Subjects].
## See Also
- [coli-ana] API to analyze DDC numbers
- [K10plus Subjects] to analyze, extract and publish subject indexing data (including DDC but also other systems) from K10plus