https://github.com/atomotic/rclis
https://github.com/atomotic/rclis
Last synced: over 1 year ago
JSON representation
- Host: GitHub
- URL: https://github.com/atomotic/rclis
- Owner: atomotic
- Created: 2024-09-13T06:03:32.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-09-13T06:05:55.000Z (over 1 year ago)
- Last Synced: 2025-01-05T02:24:41.476Z (over 1 year ago)
- Language: Go
- Size: 15.9 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
harvest oai with [metha](https://github.com/miku/metha)
```
go install -v github.com/miku/metha/cmd/metha-sync@latest
METHA_DIR=./data metha-sync -format didl http://eprints.rclis.org/cgi/oai2
```
extract seeds
```
go build
parallel ./rclis {} ::: data/I2RpZGwjaHR0cDovL2VwcmludHMucmNsaXMub3JnL2NnaS9vYWky/*.xml.gz | sort > seeds.txt
```