https://github.com/simonebel/nlp-datasets-benchmark
NLP Datasets Benchmark
- Host: GitHub
- URL: https://github.com/simonebel/nlp-datasets-benchmark
- Owner: simonebel
- Created: 2021-08-19T13:56:54.000Z (about 3 years ago)
- Default Branch: main
- Last Pushed: 2021-10-12T17:02:38.000Z (about 3 years ago)
- Last Synced: 2024-04-07T23:32:41.273Z (7 months ago)
- Topics: dataset, nlp
- Homepage:
- Size: 21.5 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# NLP Datasets Benchmark
Personal repository that centralizes various NLP datasets used for research.

## Datasets Description
|Dataset|Description|
|:---------|-------------|
|[GLUE](./GLUE/)|Benchmark covering a range of text classification tasks|
|[WMT-fr-en14](./wmt_fr_en14/)|Machine translation benchmark (English to French)|

## How to use this repository
You can clone this repository together with its linked submodules using the following command:
```bash
git clone --recurse-submodules https://github.com/simonebel/nlp-datasets-benchmark.git
```

Each folder then contains specific instructions for downloading its dataset.
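
If the repository was cloned without `--recurse-submodules`, the submodule folders will be empty, but they can still be fetched afterwards. The following is a minimal sketch of that step; `init_submodules` is a hypothetical helper name introduced here for illustration and is not part of the repository, and the directory name passed to it is assumed to be the default clone directory.

```bash
# Hypothetical helper (not part of the repository): fetch the submodules
# of a clone that was made without --recurse-submodules.
# Usage: init_submodules [path-to-clone]   (defaults to the current directory)
init_submodules() {
  local repo_dir="${1:-.}"
  # A repository with submodules records them in a .gitmodules file.
  if [ ! -f "$repo_dir/.gitmodules" ]; then
    echo "no .gitmodules in $repo_dir: nothing to initialize"
    return 1
  fi
  # --init registers the submodules listed in .gitmodules and checks them out;
  # --recursive does the same for any nested submodules.
  git -C "$repo_dir" submodule update --init --recursive
}
```

Assuming the clone lives in the default `nlp-datasets-benchmark` directory, the helper would be called as `init_submodules nlp-datasets-benchmark`. Running `git submodule status` inside the clone afterwards lists the commit checked out for each submodule.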
## TODO
- [SQuAD](https://rajpurkar.github.io/SQuAD-explorer/)
- [FQuAD](https://fquad.illuin.tech/)
- CNN/Daily Mail
- XSum