Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/frankier/opus-parse
This Rust library can parse OPUS's monolingual XML files.
https://github.com/frankier/opus-parse
corpus corpus-linguistics nlp opus rust subtitles
Last synced: 8 days ago
JSON representation
This Rust library can parse OPUS's monolingual XML files.
- Host: GitHub
- URL: https://github.com/frankier/opus-parse
- Owner: frankier
- Created: 2018-02-14T16:13:42.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2018-02-21T10:32:56.000Z (over 6 years ago)
- Last Synced: 2024-10-11T17:50:15.044Z (about 1 month ago)
- Topics: corpus, corpus-linguistics, nlp, opus, rust, subtitles
- Language: Rust
- Homepage:
- Size: 12.7 KB
- Stars: 1
- Watchers: 2
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# opus-parse
[![Crates.io](https://img.shields.io/crates/v/opus-parse.svg)](https://crates.io/crates/opus-parse) [![Docs.rs](https://docs.rs/opus-parse/badge.svg)](https://docs.rs/opus-parse/)
This library can parse [OPUS's](http://opus.nlpl.eu/) monolingual XML files.
Currently it's only been tested on the OpenSubtitles corpus.See also
[opus_tools](https://github.com/emk/subtitles-rs/tree/master/opus_tools) which
has an overlapping purpose.