https://github.com/stanfordnlp/handparsed-treebank
Extra hand parsed data for training models
https://github.com/stanfordnlp/handparsed-treebank
Last synced: about 1 month ago
JSON representation
Extra hand parsed data for training models
- Host: GitHub
- URL: https://github.com/stanfordnlp/handparsed-treebank
- Owner: stanfordnlp
- Created: 2020-04-14T17:33:13.000Z (about 5 years ago)
- Default Branch: master
- Last Pushed: 2025-01-28T00:16:59.000Z (5 months ago)
- Last Synced: 2025-04-14T12:21:23.940Z (2 months ago)
- Language: Perl
- Size: 17.8 MB
- Stars: 4
- Watchers: 9
- Forks: 2
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# handparsed-treebank
Extra hand parsed data for training modelsenglish-handparsed: PTB style trees with some coverage of words or structures not well represented in WSJ PTB or other common datasets
english-tagged: data which is tagged, but not parsed.
italian-mwt: a collection of Italian phrases tokenized in the style of UD conll datasets.