https://github.com/eleutherai/pile-uspto
A script for collecting the USPTO Backgrounds dataset in a language modelling friendly format.
https://github.com/eleutherai/pile-uspto
Last synced: about 1 year ago
JSON representation
A script for collecting the USPTO Backgrounds dataset in a language modelling friendly format.
- Host: GitHub
- URL: https://github.com/eleutherai/pile-uspto
- Owner: EleutherAI
- License: mit
- Created: 2020-09-11T02:24:29.000Z (almost 6 years ago)
- Default Branch: master
- Last Pushed: 2020-10-26T06:32:43.000Z (over 5 years ago)
- Last Synced: 2025-04-24T18:48:47.711Z (about 1 year ago)
- Language: Python
- Size: 8.79 KB
- Stars: 8
- Watchers: 2
- Forks: 3
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# pile_uspto
A script for collecting the USPTO Backgrounds dataset in a language modelling friendly format.
To run, do:
```bash
git clone https://github.com/cfoster0/pile_uspto.git
cd pile_uspto
virtualenv env
. env/bin/activate
pip install -r requirements.txt
python main.py
```