https://github.com/eleutherai/pile-ubuntu-irc
A script for collecting the Ubuntu IRC dataset in a language modelling friendly format.
https://github.com/eleutherai/pile-ubuntu-irc
Last synced: 7 months ago
JSON representation
A script for collecting the Ubuntu IRC dataset in a language modelling friendly format.
- Host: GitHub
- URL: https://github.com/eleutherai/pile-ubuntu-irc
- Owner: EleutherAI
- License: mit
- Created: 2020-09-05T20:28:04.000Z (almost 6 years ago)
- Default Branch: master
- Last Pushed: 2020-11-10T22:01:48.000Z (over 5 years ago)
- Last Synced: 2025-02-25T05:30:39.659Z (over 1 year ago)
- Language: Python
- Homepage:
- Size: 5.86 KB
- Stars: 3
- Watchers: 2
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- License: LICENSE