Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/naghim/opacmultifetcher
Bulk download files from Opac libraries.
https://github.com/naghim/opacmultifetcher
bulk-downloader opac pdf-downloader sapientia-university
Last synced: about 2 months ago
JSON representation
Bulk download files from Opac libraries.
- Host: GitHub
- URL: https://github.com/naghim/opacmultifetcher
- Owner: naghim
- Created: 2024-06-19T14:10:32.000Z (7 months ago)
- Default Branch: main
- Last Pushed: 2024-06-19T15:49:42.000Z (7 months ago)
- Last Synced: 2024-06-20T03:15:39.172Z (7 months ago)
- Topics: bulk-downloader, opac, pdf-downloader, sapientia-university
- Language: Python
- Homepage:
- Size: 2.93 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Opac Multi Fetcher
A Python script designed to **bulk download** items from an Opac online library efficiently. This script automates the process of fetching all PDFs from an Opac website, saving you time and effort while being **highy configurable**.
## Prerequisites
- An **active account to access the Opac online library**.
- Python 3.10+## Installation
1. **Clone the repository**:
```bash
git clone https://github.com/naghim/OpacMultiFetcher.git
cd OpacMultiFetcher
```2. **Install the required Python packages**:
```bash
pip install -r requirements.txt
```## Usage
1. **Configure the script and add your credentials** (see Configuration section). Ensure you have filled out the `settings.json` file with the correct details.
2. **Run the script**:
```bash
python download.py
```3. The script will log in using your credentials and start **downloading PDFs** based on your configuration.
### Configuration
Create a file named `settings.json` in the root folder with the following structure:
```json
{
"url": "http://opac3.ms.sapientia.ro",
"tenant": "marosvasarhely",
"lastRecord": 116323,
"cookies": {
"access_token": "...",
"JSESSIONID": "...."
}
}
```- `url`: The base URL of the Opac online library.
- `tenant`: The tenant identifier for the library.
- `lastRecord`: The last record number you want to fetch. Adjust this based on your requirements.
- `cookies`: Your authentication cookies for accessing the library. You need to include your `access_token` and `JSESSIONID` obtained from logging into the library. **Note:** Some websites _are public and work without cookies set_.