https://github.com/jean-baptiste-camps/word_normalisation_data



# Word Normalisation Datasets

Datasets for training word segmentation models, in particular with Pie.

They come from various sources, notably the datasets published by the Oriflamms project (Stutzmann et al., https://github.com/oriflamms/).

## Old French

### Geste

- _Geste: un corpus de chansons de geste_, dir. Jean-Baptiste Camps, with the collaboration of Elena Albarran, Alice Cochet & Lucence Ing, Paris, 2016-…, DOI: 10.5281/zenodo.1744918, [https://github.com/Jean-Baptiste-Camps/Geste/](https://github.com/Jean-Baptiste-Camps/Geste/).

### Oriflamms

Oriflamms projects, dir. Dominique Stutzmann, available at:

- [https://github.com/oriflamms/AlbumMssFrXIII](https://github.com/oriflamms/AlbumMssFrXIII).
- [https://github.com/oriflamms/ECMEN](https://github.com/oriflamms/ECMEN).
- [https://github.com/oriflamms/Pelerinage](https://github.com/oriflamms/Pelerinage).
- [https://github.com/oriflamms/Graal](https://github.com/oriflamms/Graal).

### Wauchier

- Pinche, Ariane, _Li Seint Confessor_, data from the doctoral thesis: _Edition nativement numérique du recueil hagiographique "Li Seint Confessor" de Wauchier de Denain d'après le manuscrit fr. 412 de la Bibliothèque nationale de France_, 2021-09-01, [https://github.com/ArianePinche/EditionLiSeintConfessor](https://github.com/ArianePinche/EditionLiSeintConfessor).

## Latin

### Oriflamms

Oriflamms projects, dir. Dominique Stutzmann, available at:

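- [https://github.com/oriflamms/Fontenay/](https://github.com/oriflamms/Fontenay/).
- [https://github.com/oriflamms/Dated-and-Datable-Manuscripts_AI2A](https://github.com/oriflamms/Dated-and-Datable-Manuscripts_AI2A).
- [https://github.com/oriflamms/PsautierIMS](https://github.com/oriflamms/PsautierIMS).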