Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/bgamari/mediawiki-parser

Haskell parsers for MediaWiki markup
https://github.com/bgamari/mediawiki-parser

mediawiki parser

Last synced: 4 months ago
JSON representation

Haskell parsers for MediaWiki markup

Awesome Lists containing this project

README

        

# mediawiki-parser

Parser for the MediaWiki markup language for GHC Haskell.

This package only provides the library --- the development of executables that convert MediaWiki to another format is left to the dependee.

# Link Graph Utility

The only executable offered in this package is the `mediawiki-links` utility which
accepts a MediaWiki XML dump file on `stdin` and writes a set of link graph
edges to `stdout`. The output format is a tab-separated text file with the
following columns,

* source node name
* link target node name
* link target namespace
* link anchor text

## Installation

1. Download and install [Haskell Platform](https://www.haskell.org/platform/)
2. Run `cabal update`
3. Clone this repository, `git clone git://github.com/bgamari/mediawiki-parser`
4. Run `cabal install` from within the repository
5. The `mediawiki-import` executable can be found in `~/.cabal/bin`