An open API service indexing awesome lists of open source software.

https://github.com/dinhanhx/paraphrased_text_for_hm

Paraphrased text for hateful memes dataset
https://github.com/dinhanhx/paraphrased_text_for_hm

augmentation cli dataset hateful-memes-challenge natural-language-processing paraphrase-generation python-3 python3

Last synced: 7 months ago
JSON representation

Paraphrased text for hateful memes dataset

Awesome Lists containing this project

README

          

# Paraphrased text for hateful memes dataset

This project provides paraphrased texts and simple code for [Hateful Memes Challenge](https://hatefulmemeschallenge.com/) dataset.

`data_test.jsonl` consists of **ALL** metadata for memes in the dataset.
- `id`
- `img`
- `label`
- `text`

so do all other files starting with `data_test_paraphrased_nlpaug`.

## Installation

This simple project uses simple functions from [nlpaug](https://github.com/makcedward/nlpaug#installation), [pandas](https://pandas.pydata.org/getting_started.html), [tqdm](https://tqdm.github.io/). Therefore, one merely go to these links and install as their instructions. It works with Python 3.7

## Reuse `nlp_aug.py`

Just read the file, it's simple.