https://github.com/eleutherai/pile-pubmedcentral
A script for collecting the PubMed Central dataset in a language-modelling-friendly format.
- Host: GitHub
- URL: https://github.com/eleutherai/pile-pubmedcentral
- Owner: EleutherAI
- License: MIT
- Created: 2020-09-17T23:01:44.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2021-02-16T03:22:01.000Z (over 5 years ago)
- Last Synced: 2023-03-04T05:33:15.554Z (over 3 years ago)
- Language: Python
- Homepage:
- Size: 18.7 MB
- Stars: 7
- Watchers: 2
- Forks: 1
- Open Issues: 0