Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/nichtich/wikidata-dumps
Scripts for processing Wikidata JSON dumps
https://github.com/nichtich/wikidata-dumps
Last synced: 22 days ago
JSON representation
Scripts for processing Wikidata JSON dumps
- Host: GitHub
- URL: https://github.com/nichtich/wikidata-dumps
- Owner: nichtich
- Created: 2014-08-31T20:30:45.000Z (over 10 years ago)
- Default Branch: master
- Last Pushed: 2014-08-31T20:34:08.000Z (over 10 years ago)
- Last Synced: 2024-10-30T16:24:50.103Z (2 months ago)
- Language: Perl
- Size: 105 KB
- Stars: 1
- Watchers: 4
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Scripts for processing Wikidata JSON dumps
[Wikidata](https://wikidata.org) provides weekly dumps of Wikidata items and
properties at .Each dump consists of a compressed JSON file containing a JSON array of
items/properties such as returned by the MediaWiki API.# Processing Wikidata dumps with Catmdandu
To start:
zcat ... | catmandu convert JSON --multiline 1 to YAML
See `string-statements` to extract all string statements.
zcat 20140825.json.gz | ./string-statements | gzip > statements.csv.gz
## Resources