An open API service indexing awesome lists of open source software.

https://github.com/bashkirtsevich-llc/wiki-dump-parser

Wiki dump parser (jupyter)
https://github.com/bashkirtsevich-llc/wiki-dump-parser

bz2 demos jupyter jupyter-notebook jupyter-notebooks parser python python3 tutorial tutorial-code tutorials wiki wikia wikipedia wikipedia-corpus wikipedia-dump wiktionary xml xml-parser

Last synced: about 1 month ago
JSON representation

Wiki dump parser (jupyter)

Awesome Lists containing this project

README

          

# Wikipedia bz2 dump parser

* Wikipedia dumps basic parser: [wiki-parser](wiki-parser.ipynb).
* Wiktionary nouns parser: [wiktionary-parser](wiktionary-parser.ipynb)