Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/wragge/trove-journal-issues
https://github.com/wragge/trove-journal-issues
Last synced: about 2 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/wragge/trove-journal-issues
- Owner: wragge
- License: cc0-1.0
- Created: 2023-08-01T02:43:36.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2023-08-01T02:51:16.000Z (over 1 year ago)
- Last Synced: 2024-10-12T15:55:57.124Z (3 months ago)
- Size: 1.17 MB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Trove journal issues
This repository contains a series of CSV files that capture information about digitised journals available on Trove. There is one CSV file per journal. Each file contains a list of available issues. The files were created [using this notebook](https://glam-workbench.net/trove-journals/get-ocrd-text-from-all-journals/) in the GLAM Workbench.
Each CSV file includes the following fields:
* `title` – title of the journal
* `id` – Trove's identifier for the journal
* `details` – publication details of this issue, usually volume/issue numbers and dates
* `pages` – number of pages in the issue
* `text_file` – name of the file containing the full text of this issueTo browse the list of journals (with links to CSV files and full text downloads) [go to the GLAM Workbench](https://glam-workbench.net/trove-journals/journals-with-ocr/). A [machine-readable list of journals with OCRd text](https://glam-workbench.net/trove-journals/csv-journals-with-ocr/) is also available.
The full text contents of each journal issue has been downloaded from Trove and saved in a `zip` file. The `zip` files can be downloaded from an s3 bucket, either using [this human-readable list](https://glam-workbench.net/trove-journals/journals-with-ocr/), or by constructing a url [using this CSV file](https://glam-workbench.net/trove-journals/csv-journals-with-ocr/). Just insert the contents of the `directory` field into a url of the form `https://trove-journals.s3.ap-southeast-2.amazonaws.com/[directory value].zip`. For example: