https://github.com/yuvalpinter/nytwit

New York Times Word Innovation Types dataset

# NYTWIT

This repository hosts the New York Times Word Innovation Types dataset (NYTWIT), as presented in [this report](https://www.aclweb.org/anthology/2020.coling-main.572) at COLING 2020.

## Versions

| Version | Date | Diff | Details |
|-------------------------|----------------|-----------|----------------------------------------------|
| [V1.1](nytwit_v1-1.tsv) | April 24, 2020 | 73 labels | Re-annotation of mostly blends and compounds |
| [V1](nytwit_v1.tsv) | March 7, 2020 | N/A | Initial |
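
As a rough illustration of how the two versions could be compared, here is a minimal Python sketch. It assumes only that both files are plain tab-separated text, as the `.tsv` extension suggests. It makes no assumption about column names or a header row, and the helper names `load_rows` and `changed_rows` are hypothetical, not part of this repository.

```python
import csv
from pathlib import Path


def load_rows(path):
    """Read a tab-separated file into a list of rows (each a list of strings)."""
    with open(path, newline="", encoding="utf-8") as handle:
        # QUOTE_NONE keeps any quotation marks in the text as literal characters.
        return list(csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE))


def changed_rows(old_rows, new_rows):
    """Return the rows of new_rows that do not appear verbatim in old_rows."""
    old = {tuple(row) for row in old_rows}
    return [row for row in new_rows if tuple(row) not in old]


if __name__ == "__main__":
    # File names taken from the Versions table; run from the repository root.
    v1_path = Path("nytwit_v1.tsv")
    v11_path = Path("nytwit_v1-1.tsv")
    if v1_path.exists() and v11_path.exists():
        v1 = load_rows(v1_path)
        v11 = load_rows(v11_path)
        print(f"V1 rows: {len(v1)}, V1.1 rows: {len(v11)}")
        print(f"V1.1 rows not found verbatim in V1: {len(changed_rows(v1, v11))}")
```

The comparison is purely textual, so its count need not match the "73 labels" figure if anything other than labels differs between the files.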

## Citation

If you use our dataset, please cite the following:
```
@inproceedings{nytwit,
    title = "{NYTWIT}: A Dataset of Novel Words in the {N}ew {Y}ork {T}imes",
    author = "Pinter, Yuval and
      Jacobs, Cassandra L. and
      Bittker, Max",
    booktitle = "Proceedings of the 28th International Conference on Computational Linguistics",
    month = dec,
    year = "2020",
    address = "Barcelona, Spain (Online)",
    publisher = "International Committee on Computational Linguistics",
    url = "https://www.aclweb.org/anthology/2020.coling-main.572",
    pages = "6509--6515",
}
```