https://github.com/dinhanhx/paraphrased_text_for_hm
Paraphrased text for hateful memes dataset
https://github.com/dinhanhx/paraphrased_text_for_hm
augmentation cli dataset hateful-memes-challenge natural-language-processing paraphrase-generation python-3 python3
Last synced: 7 months ago
JSON representation
Paraphrased text for hateful memes dataset
- Host: GitHub
- URL: https://github.com/dinhanhx/paraphrased_text_for_hm
- Owner: dinhanhx
- Created: 2021-08-07T13:51:13.000Z (about 4 years ago)
- Default Branch: main
- Last Pushed: 2021-08-07T13:52:03.000Z (about 4 years ago)
- Last Synced: 2025-01-28T23:50:06.534Z (9 months ago)
- Topics: augmentation, cli, dataset, hateful-memes-challenge, natural-language-processing, paraphrase-generation, python-3, python3
- Language: Python
- Homepage:
- Size: 3.74 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Paraphrased text for hateful memes dataset
This project provides paraphrased texts and simple code for [Hateful Memes Challenge](https://hatefulmemeschallenge.com/) dataset.
`data_test.jsonl` consists of **ALL** metadata for memes in the dataset.
- `id`
- `img`
- `label`
- `text`so do all other files starting with `data_test_paraphrased_nlpaug`.
## Installation
This simple project uses simple functions from [nlpaug](https://github.com/makcedward/nlpaug#installation), [pandas](https://pandas.pydata.org/getting_started.html), [tqdm](https://tqdm.github.io/). Therefore, one merely go to these links and install as their instructions. It works with Python 3.7
## Reuse `nlp_aug.py`
Just read the file, it's simple.