https://github.com/google-research-datasets/wiki-atomic-edits
A dataset of atomic wikipedia edits containing insertions and deletions of a contiguous chunk of text in a sentence. This dataset contains ~43 million edits across 8 languages.
https://github.com/google-research-datasets/wiki-atomic-edits
deep-learning deep-neural-networks nlp nlp-machine-learning wikipedia
Last synced: 14 days ago
JSON representation
A dataset of atomic wikipedia edits containing insertions and deletions of a contiguous chunk of text in a sentence. This dataset contains ~43 million edits across 8 languages.
- Host: GitHub
- URL: https://github.com/google-research-datasets/wiki-atomic-edits
- Owner: google-research-datasets
- Archived: true
- Created: 2018-09-01T00:40:27.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2019-05-06T03:51:27.000Z (almost 6 years ago)
- Last Synced: 2025-02-21T04:41:57.346Z (2 months ago)
- Topics: deep-learning, deep-neural-networks, nlp, nlp-machine-learning, wikipedia
- Homepage:
- Size: 20.5 KB
- Stars: 106
- Watchers: 7
- Forks: 8
- Open Issues: 2