An open API service indexing awesome lists of open source software.

https://github.com/ololobus/slavic_text_scht

St. Petersburg corpus of hagiographic texts
https://github.com/ololobus/slavic_text_scht

corpora hagiographic-texts linguistics slavic-languages

Last synced: 2 months ago
JSON representation

St. Petersburg corpus of hagiographic texts

Awesome Lists containing this project

README

          

### St. Petersburg Corpus of Hagiographic Texts

Old Church Slavic corpus

http://project.phil.spbu.ru/scat/page.php?page=project

### Parser
Run to get entire xml text.

```
./tei_parser.py xml/Aleksandr_svirskij.xml
```

TODO:
* return text sentence by sentence
* return text clause by clause
* keep info about named entities (`` tag)