https://github.com/swh/classification-gold-standard
Gold standard for the evaluation of machine classification of patent data
https://github.com/swh/classification-gold-standard
Last synced: 4 months ago
JSON representation
Gold standard for the evaluation of machine classification of patent data
- Host: GitHub
- URL: https://github.com/swh/classification-gold-standard
- Owner: swh
- License: bsd-3-clause
- Created: 2019-06-27T15:54:36.000Z (about 6 years ago)
- Default Branch: master
- Last Pushed: 2019-09-23T08:23:09.000Z (almost 6 years ago)
- Last Synced: 2025-01-12T17:47:47.789Z (6 months ago)
- Size: 403 KB
- Stars: 7
- Watchers: 1
- Forks: 2
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# classification-gold-standard
Gold standard for the evaluation of machine classification of patent dataThe data for the gold standards can be found in the data/ directory, as TSV files. The columns are:
* Class - whether the example is positive or negative
* DocDB Family ID - the ID for the family from the DOCDB system, to aid grouping into families
* Serial no. - the serial number of the publication, to allow identification of the publication, in the EPO [country-code][number][kind-code] format.
* Title - the title as published, to aid cross-checking
* Publication date - date the patent was published by the POAlongside the classification data are the scope notes, in a .txt file with the same base name.