Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/frankier/opus-parse

This Rust library can parse OPUS's monolingual XML files.
https://github.com/frankier/opus-parse

corpus corpus-linguistics nlp opus rust subtitles

Last synced: 8 days ago
JSON representation

This Rust library can parse OPUS's monolingual XML files.

Awesome Lists containing this project

README

        

# opus-parse

[![Crates.io](https://img.shields.io/crates/v/opus-parse.svg)](https://crates.io/crates/opus-parse) [![Docs.rs](https://docs.rs/opus-parse/badge.svg)](https://docs.rs/opus-parse/)

This library can parse [OPUS's](http://opus.nlpl.eu/) monolingual XML files.
Currently it's only been tested on the OpenSubtitles corpus.

See also
[opus_tools](https://github.com/emk/subtitles-rs/tree/master/opus_tools) which
has an overlapping purpose.