Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/orionw/reddithumordetection

Code and datasets for the paper "Humor Detection: A Transformer Gets the Last Laugh"
https://github.com/orionw/reddithumordetection

Last synced: about 1 month ago
JSON representation

Code and datasets for the paper "Humor Detection: A Transformer Gets the Last Laugh"

Awesome Lists containing this project

README

        

# Humor Detection
## Code and Datasets for the Paper ["Humor Detection: A Transformer Gets the Last Laugh"](https://arxiv.org/abs/1909.00252) by Orion Weller and Kevin Seppi
The repository contains the following:
- A way to regenerate the results found in the paper, by running `bash run_bert.sh`.
- The full datasets referenced in the paper (short jokes, puns, and the reddit dataset) are located in `full_datasets` whereas the `data` folder contains the split files used for training and testing. The file `create_data.sh` will create the splits (slightly different from the ones used in the paper - see `create_data.sh`).
- pytorch_pretrained_bert contains files used by the model - these files are from the [huggingface repo](https://github.com/huggingface/pytorch-transformers#Training-large-models-introduction,-tools-and-examples) and are NOT up to date with the current `pytorch-transformers` repo.

**This repository is not maintained and will not be updated.**

## Reference:
If you found this repository helpful, please consider citing the following:
```
@ARTICLE{humorDetection2019,
title={Humor Detection: A Transformer gets the Last Laugh},
author={Weller, Orion and Seppi, Kevin},
journal={"Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing"},
month=Nov,
year = "2019",
}
```