Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/gbv/k10plus-ddc

Analyze, convert and publish DDC numbers found K10plus catalog
https://github.com/gbv/k10plus-ddc

coli-conc

Last synced: 4 days ago
JSON representation

Analyze, convert and publish DDC numbers found K10plus catalog

Awesome Lists containing this project

README

        

# K10plus DDC

This repository contains scripts to analyze, convert and publish Dewey Decimal Classification (DDC) numbers found K10plus catalogue. The analysis is mainly based on [coli-ana] DDC number decomposition.

[coli-ana]: https://github.com/gbv/coli-ana
[K10plus Subjects]: https://github.com/gbv/k10plus-subjects
[jskos]: https://format.gbv.de/jskos

**See subdirectory `publication` for script to generate the data publication **

## Installation

~~~sh
npm ci
~~~

## Usage

### analyze.js

Read a list of (numerically sorted) DDC numbers and generate analysis with coli-ana

#### k10plus-patch.js

The script `bin/k10plus-patch.js`

1. reads PICA+ records (or PPNs to retrieve records from K10plus)
2. extracts DDC fields from the records
3. retrieves DDC analysis (cached in a local database)
4. and emits [PICA Patch](https://format.gbv.de/pica/patch/specification) files to modify records

~~~
Usage: k10plus-patch [options] < file

Check and extend DDC numbers in PICA records of K10plus catalogue

Options:
-a, --api coli-ana API endpoint
-c, --continue continue after given PPN (expect sorted)
-f, --format PICA+ serialization (default: plain)
-i, --input input file (default: - for STDIN)
-d, --database optional SQLite file for caching
-p, --ppns input is list of PPNs instead of PICA records
-h, --help display help for command
~~~

### simplify-for-pdf.jq

Given the full analysis from coli-ana API in JSKOS format as published at
, this jq script can be used to
simplify the JSKOS records for creation of PDF files for each DDC number:

~~~sh
zcat ddc-decomposition.ndjson.gz | jq -c -f simplify-for-pdf.jq -c > ddc-pdf-data.ndjson
~~~

### count.js

Calculate frequency of individual DDC elements in analysis result and emit as CSV or JSKOS concept list

## Data in this repository

- `ddcs` sorted DDC numbers found in K10plus with number of occurrences. Data generated from [K10plus Subjects].

## See Also

- [coli-ana] API to analyze DDC numbers
- [K10plus Subjects] to analyze, extract and publish subject indexing data (including DDC but also other systems) from K10plus