# NLP Datasets Benchmark
Personal repository that centralizes various NLP datasets used for research.

## Datasets Description

|Dataset|Description|
|:---------|-------------|
|[GLUE](./GLUE/)|Evaluates various text classification tasks|
|[WMT-fr-en14](./wmt_fr_en14/)|Evaluates machine translation (from English to French)|

## How to use this repository

Clone this repository together with its linked submodules using the following command:

```bash
git clone --recurse-submodules https://github.com/simonebel/nlp-datasets-benchmark.git
```

Each folder then contains specific instructions for downloading its dataset.
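
If the repository was already cloned without `--recurse-submodules`, the submodule folders will be empty. A minimal sketch of how to fetch them afterwards is shown below; the helper name `fetch_submodules` and the clone path in the example are assumptions for illustration, not part of this repository.

```bash
# Hypothetical helper: fetch the submodules of an existing clone.
# $1 is the path to the clone (defaults to the current directory).
fetch_submodules() {
    git -C "${1:-.}" submodule update --init --recursive
}

# Example (assumes the clone lives in ./nlp-datasets-benchmark):
# fetch_submodules nlp-datasets-benchmark
```

Running `git submodule status` inside the clone afterwards lists each submodule with the commit it is checked out at.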

## TODO
- [SQuAD](https://rajpurkar.github.io/SQuAD-explorer/)
- [FQuAD](https://fquad.illuin.tech/)
- CNN/Daily Mail
- XSum