https://github.com/cynthia/kosentences
Large-scale unannotated Korean corpus for unsupervised tasks (e.g., language modeling).
- Host: GitHub
- URL: https://github.com/cynthia/kosentences
- Owner: cynthia
- License: other
- Created: 2019-08-05T08:46:40.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2019-08-11T07:24:37.000Z (almost 7 years ago)
- Last Synced: 2025-04-02T23:04:02.007Z (about 1 year ago)
- Topics: corpus, datasets, korean, language-modeling, nlp
- Language: Python
- Size: 15.6 KB
- Stars: 27
- Watchers: 2
- Forks: 6
- Open Issues: 2