Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/nichtich/wikidata-dumps

Scripts for processing Wikidata JSON dumps
https://github.com/nichtich/wikidata-dumps

Last synced: 22 days ago
JSON representation

Scripts for processing Wikidata JSON dumps

Awesome Lists containing this project

README

        

# Scripts for processing Wikidata JSON dumps

[Wikidata](https://wikidata.org) provides weekly dumps of Wikidata items and
properties at .

Each dump consists of a compressed JSON file containing a JSON array of
items/properties such as returned by the MediaWiki API.

# Processing Wikidata dumps with Catmdandu

To start:

zcat ... | catmandu convert JSON --multiline 1 to YAML

See `string-statements` to extract all string statements.

zcat 20140825.json.gz | ./string-statements | gzip > statements.csv.gz

## Resources