https://github.com/wikimedia-pl/mbc-importer
Script importing "Mazowiecka Biblioteka Cyfrowa" files into Wikimedia Commons
dlibra glam wikimedia-commons
- Host: GitHub
- URL: https://github.com/wikimedia-pl/mbc-importer
- Owner: wikimedia-pl
- Created: 2021-05-10T10:18:52.000Z (about 5 years ago)
- Default Branch: master
- Last Pushed: 2026-03-31T05:22:35.000Z (2 months ago)
- Last Synced: 2026-04-04T04:48:05.240Z (2 months ago)
- Topics: dlibra, glam, wikimedia-commons
- Language: Python
- Homepage: https://commons.wikimedia.org/wiki/Special:Contributions/Mazovian_Digital_Library_Upload
- Size: 120 KB
- Stars: 1
- Watchers: 0
- Forks: 0
- Open Issues: 2
Metadata Files:
- Readme: README.md
# mbc-harvester
A script harvesting the [Mazowiecka Biblioteka Cyfrowa collection "Warszawa w ilustracji prasowej XIX w."](http://mbc.cyfrowemazowsze.pl/dlibra/collectiondescription?dirids=231). The files are then [uploaded to Wikimedia Commons](https://commons.wikimedia.org/wiki/Special:Contributions/Mazovian_Digital_Library_Upload).
It can crawl any e-library that is powered by [OAI-compatible software](https://www.openarchives.org/), for instance [dLibra](https://dingo.psnc.pl/dlibra/).
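For orientation, here is a minimal sketch of what crawling an OAI-PMH endpoint can look like. It is not taken from `harvest.py`. The helper names `list_records_url` and `parse_records`, the `example.org` URL, and the sample response are all hypothetical. It only assumes the standard `ListRecords` verb, the `oai_dc` metadata prefix and the `resumptionToken` paging mechanism from the OAI-PMH 2.0 specification.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# XML namespaces defined by OAI-PMH 2.0 and Dublin Core
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def list_records_url(base_url, set_spec=None, token=None):
    """Build a ListRecords request URL (hypothetical helper)."""
    if token:
        # When resuming, the token is the only argument besides the verb
        params = {"verb": "ListRecords", "resumptionToken": token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": "oai_dc"}
        if set_spec:
            params["set"] = set_spec
    return base_url + "?" + urlencode(params)


def parse_records(xml_text):
    """Return (records, resumption_token) from a ListRecords response."""
    root = ET.fromstring(xml_text)
    records = []
    for record in root.iterfind(".//oai:record", NS):
        records.append({
            "identifier": record.findtext("oai:header/oai:identifier", namespaces=NS),
            "title": record.findtext(".//dc:title", namespaces=NS),
        })
    token = root.findtext(".//oai:resumptionToken", namespaces=NS)
    # An empty or missing token means there are no further pages
    return records, (token or None)


# Made-up response used only to exercise the parser
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Sample illustration</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
    <resumptionToken>page-2</resumptionToken>
  </ListRecords>
</OAI-PMH>"""

records, next_token = parse_records(SAMPLE)
next_url = list_records_url("https://example.org/oai", token=next_token)
```

A real crawl would fetch each URL, parse the response, and repeat with the returned token until none is left.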
[**The importer is executed**](https://github.com/wikimedia-pl/mbc-importer/actions/workflows/do-import.yml) via GitHub Actions cron **twice a week**, at 7:00 AM every Monday and Thursday.
## Install
Set up the Python environment.
```
virtualenv env -ppython38
. env/bin/activate
pip install -r requirements.txt
```
Set up the account that will be used for uploads.
```
$ cat user-password.py
('commons', 'commons', 'Mazovian_Digital_Library_Upload', 'XXX')
```
## Run
```
python harvest.py
```
## GitHub Actions
You need to set up the following secrets in order to run the importer as a cron-triggered action:
* `HTTP_PROXY` (e.g. `socks5://example.com:12345`)
* `PYWIKIBOT_USERNAME`
* `PYWIKIBOT_PASSWORD`
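
How the workflow consumes these secrets is not shown here. One plausible approach, sketched below under that assumption, is to generate the `user-password.py` file from the environment before running the importer. The helpers `password_file_line` and `write_password_file` are hypothetical names, and the tuple layout simply mirrors the example in the Install section.

```python
import os


def password_file_line(username, password, family="commons", lang="commons"):
    """Format one password-file entry (hypothetical helper)."""
    # repr() quotes and escapes the values, so the line stays a valid Python tuple
    return repr((lang, family, username, password))


def write_password_file(path="user-password.py", env=os.environ):
    """Write the password file from PYWIKIBOT_* variables (assumed usage)."""
    line = password_file_line(env["PYWIKIBOT_USERNAME"], env["PYWIKIBOT_PASSWORD"])
    # Create the file readable by its owner only, since it holds a secret
    fd = os.open(path, os.O_WRONLY | os.O_CREAT | os.O_TRUNC, 0o600)
    with os.fdopen(fd, "w", encoding="utf-8") as handle:
        handle.write(line + "\n")
    return path
```

Calling `write_password_file()` in a workflow step that exposes both secrets as environment variables would produce the same one-line file as the manual setup.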