Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/famrashel/idn-treebank
Indonesian Treebank
https://github.com/famrashel/idn-treebank
Last synced: 24 days ago
JSON representation
Indonesian Treebank
- Host: GitHub
- URL: https://github.com/famrashel/idn-treebank
- Owner: famrashel
- Created: 2015-08-24T04:27:17.000Z (over 9 years ago)
- Default Branch: master
- Last Pushed: 2022-07-05T12:09:31.000Z (over 2 years ago)
- Last Synced: 2024-08-04T22:13:37.103Z (4 months ago)
- Size: 1.26 MB
- Stars: 36
- Watchers: 2
- Forks: 18
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- Awesome-Indonesia-NLP - Indonesian Treebank
README
# idn-treebank
=================Manually Tagged Indonesian Corpus
* [Bahasa](#readmemd-versi-bahasa).
* [English](#readmemd-english-version).# README.md versi Bahasa
## Format DataSetiap baris berisi parse-tree dari kalimat bahasa Indonesia.
Korpus ini terdiri dari dua jenis berkas, yang satu disertai ID dan yang lainnya RAW. Setiap parse-tree pada berkas dengan ID selalu diawali dengan id kalimat yang dipisahkan sebuah karakter tab (\t).# README.md English version
## Data FormatEach line consists of parse-tree from Indonesian sentence.
The corpus consists of two types of file, the one with ID and the other is RAW. Each parse-tree inside the file with ID has a sentence ID in the beginning of the line separated by a single tab character (\t).### Authors
- Ruli Manurung
- Arawinda Dinakaramani
- Fam Rashel
- Andry Luthfi### Page
For more details about this work, please visit http://bahasa.cs.ui.ac.id/treebank/corpus### License
This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/4.0/.### UPDATE
This work was carried out under the framework of a research project done at [IR-NLP Lab](https://ir.cs.ui.ac.id/new/).
As there is an initiative to bring together and document all the works done in the IR-NLP Lab,
**please refer to the IR-NLP Lab's [repository](https://github.com/ir-nlp-csui/idn-treebank-CSUI) for official updates and future versions of this work**.
This repository will still be available as a personal repository.