https://github.com/sandsmark/scp-wiki
Mirror of the SCP wiki, approx. 20 million words. If you just want the text, e.g. for training some version of GPT, download the latest release (half the size, since it omits the git history).
- Host: GitHub
- URL: https://github.com/sandsmark/scp-wiki
- Owner: sandsmark
- Created: 2019-08-04T12:38:28.000Z (about 6 years ago)
- Default Branch: master
- Last Pushed: 2020-08-28T09:51:13.000Z (about 5 years ago)
- Last Synced: 2025-01-25T18:11:41.769Z (9 months ago)
- Topics: dataset, scp, scp-foundation, text-analysis, text-generation, text-mining, text-processing, wikidot
- Homepage: http://www.scpwiki.com/
- Size: 1.46 GB
- Stars: 17
- Watchers: 3
- Forks: 6
- Open Issues: 0
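
The description's advice to download the latest release rather than clone the full history could be sketched as follows. This is a minimal illustration, not something the repository ships: it assumes the standard GitHub REST endpoint for a repository's latest release, and the helper names (`latest_release_url`, `pick_download_url`, `fetch_latest_release`) are hypothetical. Whether the release carries uploaded assets or only the auto-generated source tarball is not stated on this page, so the sketch handles both.

```python
import json
import urllib.request

# Assumption: GitHub's REST endpoint for the latest published release of a repo.
API_URL = "https://api.github.com/repos/{owner}/{repo}/releases/latest"


def latest_release_url(owner: str, repo: str) -> str:
    """Build the API URL for the latest release of owner/repo."""
    return API_URL.format(owner=owner, repo=repo)


def pick_download_url(release: dict) -> str:
    """Prefer the first uploaded asset; otherwise use the source tarball."""
    assets = release.get("assets") or []
    if assets:
        return assets[0]["browser_download_url"]
    return release["tarball_url"]


def fetch_latest_release(owner: str, repo: str) -> dict:
    """Fetch and parse the release metadata (performs a network request)."""
    request = urllib.request.Request(
        latest_release_url(owner, repo),
        headers={"Accept": "application/vnd.github+json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

Calling `pick_download_url(fetch_latest_release("sandsmark", "scp-wiki"))` would return a URL to pass to any downloader; nothing is fetched until `fetch_latest_release` is called.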