https://github.com/frankier/opus-parse
This Rust library can parse OPUS's monolingual XML files.
- Host: GitHub
- URL: https://github.com/frankier/opus-parse
- Owner: frankier
- Created: 2018-02-14T16:13:42.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2018-02-21T10:32:56.000Z (over 7 years ago)
- Last Synced: 2025-09-06T02:27:51.460Z (about 1 month ago)
- Topics: corpus, corpus-linguistics, nlp, opus, rust, subtitles
- Language: Rust
- Homepage:
- Size: 12.7 KB
- Stars: 1
- Watchers: 2
- Forks: 1
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# opus-parse
[crates.io](https://crates.io/crates/opus-parse) [docs.rs](https://docs.rs/opus-parse/)
This library can parse [OPUS's](http://opus.nlpl.eu/) monolingual XML files.
So far it has only been tested on the OpenSubtitles corpus. See also
[opus_tools](https://github.com/emk/subtitles-rs/tree/master/opus_tools), which
has an overlapping purpose.