https://github.com/fostroll/rucor_to_conllu
RuCor corpus to CoNLL-U format conversion
- Host: GitHub
- URL: https://github.com/fostroll/rucor_to_conllu
- Owner: fostroll
- License: cc0-1.0
- Created: 2020-11-02T19:19:14.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2020-11-04T19:39:24.000Z (over 4 years ago)
- Last Synced: 2025-02-14T06:35:59.530Z (3 months ago)
- Language: Jupyter Notebook
- Homepage:
- Size: 124 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# rucor_to_conllu: RuCor corpus to CoNLL-U format conversion
This notebook provides a pipeline to convert the
[*RuCor*](http://rucoref.maimbava.net/) corpus to the
[*CoNLL-U*](https://universaldependencies.org/format.html) format.

## Prerequisites
1. [***Toxine***](https://github.com/fostroll/toxine) preprocessor.
1. **Optional:** the morphological parser
[***MorDL***](https://github.com/fostroll/mordl) with UPOS and FEATS models
already trained.

## Usage
The whole pipeline is contained in the `rucor_to_conllu.ipynb` notebook. Just set the
paths to the *RuCor* corpus and the *MorDL* models and run it. You may also want to
change some variables in cells `2` and/or `4`.

## License
***rucor_to_conllu*** is released under the Creative Commons CC0 1.0 license. See the
[LICENSE](https://github.com/fostroll/srv_zero/blob/master/LICENSE) file for
more details.
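
## Example: CoNLL-U output

For readers unfamiliar with the target format, here is a minimal sketch of how a tokenized sentence can be written as *CoNLL-U*. It is not taken from the notebook. The helper names (`token_to_conllu`, `sentence_to_conllu`) and the sample sentence are hypothetical, and only the ten-column layout, the `_` placeholder and the blank line after each sentence come from the *CoNLL-U* specification.

```python
# Column order defined by the CoNLL-U specification.
CONLLU_FIELDS = ("ID", "FORM", "LEMMA", "UPOS", "XPOS",
                 "FEATS", "HEAD", "DEPREL", "DEPS", "MISC")


def token_to_conllu(token):
    """Serialize one token dict to a tab-separated CoNLL-U line.

    Hypothetical helper: missing or empty fields are written as "_",
    the placeholder the format uses for unspecified values.
    """
    cells = []
    for field in CONLLU_FIELDS:
        value = token.get(field)
        # Compare explicitly so that a valid HEAD of 0 is not lost.
        cells.append("_" if value is None or value == "" else str(value))
    return "\t".join(cells)


def sentence_to_conllu(text, tokens):
    """Serialize a sentence: a '# text' comment, token lines, a blank line."""
    lines = ["# text = " + text]
    lines += [token_to_conllu(t) for t in tokens]
    return "\n".join(lines) + "\n\n"


# Hypothetical sample data, not taken from RuCor.
tokens = [
    {"ID": 1, "FORM": "Он", "UPOS": "PRON"},
    {"ID": 2, "FORM": "спит", "UPOS": "VERB", "MISC": "SpaceAfter=No"},
    {"ID": 3, "FORM": ".", "UPOS": "PUNCT"},
]
conllu = sentence_to_conllu("Он спит.", tokens)
print(conllu, end="")
```

Each token line has exactly ten tab-separated columns. In a setup like the one described above, the UPOS and FEATS columns would presumably be the ones filled in by the *MorDL* models when they are available.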