https://github.com/dinhanhx/paraphrased_text_for_hm

Paraphrased text for hateful memes dataset
augmentation cli dataset hateful-memes-challenge natural-language-processing paraphrase-generation python-3 python3

Host: GitHub
URL: https://github.com/dinhanhx/paraphrased_text_for_hm
Owner: dinhanhx
Created: 2021-08-07T13:51:13.000Z (about 4 years ago)
Default Branch: main
Last Pushed: 2021-08-07T13:52:03.000Z (about 4 years ago)
Last Synced: 2025-01-28T23:50:06.534Z (9 months ago)
Topics: augmentation, cli, dataset, hateful-memes-challenge, natural-language-processing, paraphrase-generation, python-3, python3
Language: Python
Homepage:
Size: 3.74 MB
Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

README

# Paraphrased text for hateful memes dataset

This project provides paraphrased texts and simple code for the [Hateful Memes Challenge](https://hatefulmemeschallenge.com/) dataset.

`data_test.jsonl` contains **ALL** metadata for the memes in the dataset:
- `id`
- `img`
- `label`
- `text`

All other files whose names start with `data_test_paraphrased_nlpaug` have the same fields.
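
As a minimal sketch of how these files can be read, the snippet below parses a JSON Lines file and checks that every record carries the four fields listed above. The sample rows, the `sample.jsonl` name, and the `load_jsonl` helper are hypothetical and only illustrate the format; real values come from the files in this repository.

```python
import json
import tempfile
from pathlib import Path

# The four metadata fields named in this README.
REQUIRED_KEYS = {"id", "img", "label", "text"}


def load_jsonl(path):
    """Read a JSON Lines file into a list of dicts, one per non-empty line."""
    records = []
    with open(path, encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if line:
                records.append(json.loads(line))
    return records


# Hypothetical sample rows, NOT taken from the real dataset.
sample_rows = [
    {"id": 1, "img": "img/00001.png", "label": 0, "text": "first example caption"},
    {"id": 2, "img": "img/00002.png", "label": 1, "text": "second example caption"},
]

with tempfile.TemporaryDirectory() as tmp:
    sample_path = Path(tmp) / "sample.jsonl"
    sample_path.write_text(
        "\n".join(json.dumps(row) for row in sample_rows) + "\n",
        encoding="utf-8",
    )
    records = load_jsonl(sample_path)

# True only if every record has all four fields.
all_have_keys = all(REQUIRED_KEYS <= set(record) for record in records)
print(len(records), all_have_keys)
```

With pandas installed, `pandas.read_json(path, lines=True)` loads the same kind of file into a DataFrame.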

## Installation

This project uses a few simple functions from [nlpaug](https://github.com/makcedward/nlpaug#installation), [pandas](https://pandas.pydata.org/getting_started.html), and [tqdm](https://tqdm.github.io/). Follow the installation instructions at each of these links. It works with Python 3.7.
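
To check whether the three dependencies are already importable, something like the following sketch may help. The `find_missing` helper is hypothetical and not part of this repository; the suggested `pip install` line assumes the PyPI names match the import names, and the linked instructions remain the authoritative source.

```python
import importlib.util

# Packages named in this README.
REQUIRED_PACKAGES = ["nlpaug", "pandas", "tqdm"]


def find_missing(packages):
    """Return the names of top-level packages that cannot be located."""
    return [name for name in packages if importlib.util.find_spec(name) is None]


missing = find_missing(REQUIRED_PACKAGES)
if missing:
    print("Missing packages:", ", ".join(missing))
    # Assumption: PyPI package names equal the import names.
    print("Try: pip install " + " ".join(missing))
else:
    print("All required packages are importable.")
```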

## Reuse `nlp_aug.py`

Just read the file; it's simple.
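
For orientation, here is a rough sketch of the general pattern such a script might follow: replace the `text` field of each record with a paraphrase and leave the other fields alone. This is not the content of `nlp_aug.py`. The `paraphrase_records` and `toy_paraphrase` names are hypothetical, and the stand-in function only uppercases text so the example runs without extra dependencies.

```python
def paraphrase_records(records, paraphrase_fn, text_key="text"):
    """Return copies of the records with the text field paraphrased."""
    paraphrased = []
    for record in records:
        new_record = dict(record)  # shallow copy, so the input stays unchanged
        new_record[text_key] = paraphrase_fn(record[text_key])
        paraphrased.append(new_record)
    return paraphrased


def toy_paraphrase(text):
    """Stand-in for a real augmenter: just uppercases the text."""
    return text.upper()


# Assumption: a real nlpaug augmenter could be plugged in roughly like this
# (needs nlpaug plus the NLTK WordNet data; which augmenter nlp_aug.py
# actually uses is not stated in this README):
#
#   import nlpaug.augmenter.word as naw
#   aug = naw.SynonymAug(aug_src="wordnet")
#   def nlpaug_paraphrase(text):
#       result = aug.augment(text)
#       # Depending on the nlpaug version, this is a str or a list of str.
#       return result[0] if isinstance(result, list) else result

# Hypothetical record, NOT taken from the real dataset.
records = [{"id": 1, "img": "img/00001.png", "label": 0, "text": "a sample caption"}]
paraphrased = paraphrase_records(records, toy_paraphrase)
print(paraphrased[0]["text"])
```

Passing the paraphrasing function as an argument keeps the loop independent of any particular augmenter, so different nlpaug augmenters can be swapped in without touching the record handling.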
