https://github.com/dataesr/harvest-sudoc
Harvesting the sudoc
https://github.com/dataesr/harvest-sudoc
Last synced: 5 months ago
JSON representation
Harvesting the sudoc
- Host: GitHub
- URL: https://github.com/dataesr/harvest-sudoc
- Owner: dataesr
- License: mit
- Created: 2022-02-25T09:01:05.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2023-10-20T13:06:15.000Z (over 2 years ago)
- Last Synced: 2025-09-11T10:49:51.788Z (9 months ago)
- Language: Python
- Size: 101 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# harvest-sudoc
Harvesting the sudoc
## How to use it ?
1. Clone the repo
`git clone https://github.com/dataesr/harvest-sudoc.git`
2. Enter the repo
`cd harvest-sudoc`
3. Build the docker image
`make docker-build`
4. Start docker image
`make start`
5. Start the harvesting
`curl -X POST -H 'Content-Type: application/json' -d '{"id_refs": ["02825354X", "242241344", "183975154", "059389451"]}' 'http://localhost:5004/harvest'`
## API
| endpoint | method | args | description |
| -------- | ------ | -------- | ----------- |
| harvest | POST | id_refs [str, list]
force_download [bool] | This endpoint will download in ObjectStorage all the sudoc notices for the given id_refs given.
If `force_download` is set to `True`, the notice will be downloaded even if already in DB. |
| harvest_notices | POST | sudoc_ids [str, list]
force_download [bool] | This endpoint will download in ObjectStorage all the sudoc notices
If `force_download` is set to `True`, the notice will be downloaded even if already in DB. |
## Release
To create a new release:
```shell
make release VERSION=X.X.X
```