https://github.com/bgamari/mediawiki-parser
Haskell parsers for MediaWiki markup
- Host: GitHub
- URL: https://github.com/bgamari/mediawiki-parser
- Owner: bgamari
- License: bsd-3-clause
- Created: 2016-08-08T22:13:18.000Z (over 8 years ago)
- Default Branch: master
- Last Pushed: 2023-12-22T22:08:18.000Z (about 1 year ago)
- Last Synced: 2023-12-22T23:20:45.020Z (about 1 year ago)
- Topics: mediawiki, parser
- Language: Haskell
- Homepage:
- Size: 176 KB
- Stars: 5
- Watchers: 5
- Forks: 5
- Open Issues: 3
Metadata Files:
- Readme: README.mkd
- Changelog: ChangeLog.md
- License: LICENSE
README
# mediawiki-parser
Parser for the MediaWiki markup language for GHC Haskell.
Apart from the link graph utility described below, this package provides only the library; executables that convert MediaWiki markup to another format are left to downstream packages.
# Link Graph Utility
The only executable offered in this package is the `mediawiki-links` utility which
accepts a MediaWiki XML dump file on `stdin` and writes a set of link graph
edges to `stdout`. The output format is a tab-separated text file with the
following columns:

* source node name
* link target node name
* link target namespace
* link anchor text

## Installation
1. Download and install [Haskell Platform](https://www.haskell.org/platform/)
2. Run `cabal update`
3. Clone this repository: `git clone https://github.com/bgamari/mediawiki-parser` (GitHub no longer serves the unauthenticated `git://` protocol)
4. Run `cabal install` from within the repository
5. The `mediawiki-import` executable can be found in `~/.cabal/bin`
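
Once the tools are installed, the edge list written by `mediawiki-links` can be consumed from Haskell. The following is a minimal sketch, not part of this package: the `LinkEdge` record, its field names, and the helper functions are hypothetical names chosen for illustration. It assumes one edge per line, exactly four tab-separated fields in the documented order, and no tab characters inside the anchor text. The namespace column is kept as an uninterpreted string because its exact encoding is not specified here.

```haskell
import Data.Maybe (mapMaybe)

-- | One edge of the link graph, mirroring the four documented columns.
-- The record and field names are hypothetical, chosen for this sketch.
data LinkEdge = LinkEdge
  { edgeSource    :: String  -- source node name
  , edgeTarget    :: String  -- link target node name
  , edgeNamespace :: String  -- link target namespace (left uninterpreted)
  , edgeAnchor    :: String  -- link anchor text
  } deriving (Show, Eq)

-- | Split a string on tab characters, keeping empty fields.
splitTabs :: String -> [String]
splitTabs s = case break (== '\t') s of
  (field, [])       -> [field]
  (field, _ : rest) -> field : splitTabs rest

-- | Parse one line; 'Nothing' unless it has exactly four fields.
parseEdge :: String -> Maybe LinkEdge
parseEdge line = case splitTabs line of
  [src, tgt, ns, anchor] -> Just (LinkEdge src tgt ns anchor)
  _                      -> Nothing

-- | Parse a whole edge listing, skipping malformed lines.
parseEdges :: String -> [LinkEdge]
parseEdges = mapMaybe parseEdge . lines

-- | Read edges from stdin and print each as "source -> target".
main :: IO ()
main = do
  edges <- parseEdges <$> getContents
  mapM_ (\e -> putStrLn (edgeSource e ++ " -> " ++ edgeTarget e)) edges
```

Saved as, say, `Edges.hs` (a hypothetical file name), this could be run as `mediawiki-links < dump.xml | runghc Edges.hs`. Malformed lines are dropped silently to keep the sketch short; a real consumer would more likely report them.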