https://github.com/yuvalpinter/nytwit
New York Times Word Innovation Types dataset
- Host: GitHub
- URL: https://github.com/yuvalpinter/nytwit
- Owner: yuvalpinter
- License: gpl-3.0
- Created: 2020-03-04T18:21:49.000Z (almost 5 years ago)
- Default Branch: master
- Last Pushed: 2020-12-01T14:25:28.000Z (about 4 years ago)
- Last Synced: 2025-01-31T21:17:34.602Z (21 days ago)
- Topics: computational-linguistics, corpus, dataset, news, nlp
- Size: 189 KB
- Stars: 21
- Watchers: 5
- Forks: 5
- Open Issues: 3
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# NYTWIT
This repository hosts the New York Times Word Innovation Types dataset (NYTWIT), as presented in [this report](https://www.aclweb.org/anthology/2020.coling-main.572) at COLING 2020.
## Versions
| Version | Date | Diff | Details |
|-------------------------|----------------|-----------|----------------------------------------------|
| [V1.1](nytwit_v1-1.tsv) | April 24, 2020 | 73 labels | Re-annotation of mostly blends and compounds |
| [V1](nytwit_v1.tsv) | March 7, 2020 | N/A | Initial |

## Citation
If you use our dataset, please cite the following:
```
@inproceedings{nytwit,
title = "{NYTWIT}: A Dataset of Novel Words in the {N}ew {Y}ork {T}imes",
author = "Pinter, Yuval and
Jacobs, Cassandra L. and
Bittker, Max",
booktitle = "Proceedings of the 28th International Conference on Computational Linguistics",
month = dec,
year = "2020",
address = "Barcelona, Spain (Online)",
publisher = "International Committee on Computational Linguistics",
url = "https://www.aclweb.org/anthology/2020.coling-main.572",
pages = "6509--6515",
}
```