{"id":26403386,"url":"https://github.com/balta2ar/manuscript-dl","last_synced_at":"2025-08-22T18:17:12.075Z","repository":{"id":27692532,"uuid":"31179111","full_name":"balta2ar/manuscript-dl","owner":"balta2ar","description":"Collection of scripts to download digitized manuscripts from various online libraries","archived":false,"fork":false,"pushed_at":"2025-01-02T18:41:34.000Z","size":25,"stargazers_count":25,"open_issues_count":3,"forks_count":4,"subscribers_count":5,"default_branch":"master","last_synced_at":"2025-04-03T11:44:24.755Z","etag":null,"topics":["calligraphy","download","download-digitized-manuscripts","downloader","elibrary","library","manuscript","manuscript-dl","pdf","python"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/balta2ar.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2015-02-22T20:31:59.000Z","updated_at":"2025-02-04T14:06:34.000Z","dependencies_parsed_at":"2024-03-23T12:01:02.126Z","dependency_job_id":"5ce4f206-1347-4c14-a45e-bc56d0f0312d","html_url":"https://github.com/balta2ar/manuscript-dl","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/balta2ar/manuscript-dl","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/balta2ar%2Fmanuscript-dl","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/balta2ar%2Fmanuscript-dl/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/balta2ar%2Fmanuscript-dl/releases","manifes
ts_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/balta2ar%2Fmanuscript-dl/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/balta2ar","download_url":"https://codeload.github.com/balta2ar/manuscript-dl/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/balta2ar%2Fmanuscript-dl/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":271680531,"owners_count":24802074,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-08-22T02:00:08.480Z","response_time":65,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["calligraphy","download","download-digitized-manuscripts","downloader","elibrary","library","manuscript","manuscript-dl","pdf","python"],"created_at":"2025-03-17T15:33:29.541Z","updated_at":"2025-08-22T18:17:12.040Z","avatar_url":"https://github.com/balta2ar.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# manuscript-dl\nCollection of scripts to download digitized manuscripts from different online libraries.\n\nSome online libraries provide a convenient way to download a complete manuscript as a PDF file. Some don't. Mad scripting skills to the rescue!\n\n### Supported libraries\n\n#### [Nasjonalbiblioteket](https://www.nb.no/)\n\nTo download a book:\n\n1. 
Find out its ID, e.g.: https://www.nb.no/items/URN:NBN:no-nb_digibok_2008091504048?page=1\n2. (optional) Copy the curl command from the browser so that you preserve cookies, and adjust it.\n3. Run:\n```bash\n$ python ./nb.no.py -H 'cookie: something' URN:NBN:no-nb_digibok_2008091504048\n```\n\n#### [e-codices - Virtual Manuscript Library of Switzerland](http://www.e-codices.unifr.ch/en)\n\nTo download a book:\n\n1. Go to the book description page, e.g.: http://www.e-codices.unifr.ch/en/list/one/csg/0369\n2. Right click on the link \"IIIF Manifest URL\" and save it to a file, e.g. manifest.json\n3. Run:\n\n``` bash\n$ e-codices.sh manifest.json [size]\n```\n\n`size` is an optional argument. The original images on e-codices are usually far too large and need to be reduced.\n\n#### [British Library Digitised Manuscripts](http://www.bl.uk/manuscripts/)\n\nThis downloader uses the `montage` program (from the `imagemagick` suite) to convert images\nto PDFs and `pdftk` to concatenate the PDFs. Both `pdftk` and\n`montage` must be installed on your system.\n\nUbuntu:\n\n``` bash\nsudo apt-get install pdftk imagemagick\n```\n\nTo download a book you need to find out its short name:\n\n1. Open the manuscript description, e.g.: http://www.bl.uk/manuscripts/FullDisplay.aspx?ref=Add_MS_24686\n2. In this case the name is \"add_ms_24686\" (note the lower case). To double-check, click any of the pictures below to open a new page: http://www.bl.uk/manuscripts/Viewer.aspx?ref=add_ms_24686_f002r\n3. Here, `add_ms_24686_f002r` is the manuscript name plus the page name. You only need the manuscript name.\n4. Run `bl.uk.py` with the manuscript name:\n\n``` bash\n$ python3 bl.uk.py add_ms_24686 --resolution 12\n```\n\nThis will grab all available pages with resolution 12. If you want specific pages, you can set a page range using the `--pages A:B` argument.\n\nAt some point the Library started replying with HTTP 429 (Too Many Requests).\nFaking the user agent helped. 
If the default user agent does not work for you, you can\nreplace it using the `--user-agent` option like this:\n\n``` bash\npython3 bl.uk.py add_ms_24686 --user-agent 'Mozilla/5.0 (X11; OpenBSD i386) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/36.0.1985.125 Safari/537.36'\n```\n\n### Author\n\n(c) 2015-2018 Yuri Bochkarev\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fbalta2ar%2Fmanuscript-dl","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fbalta2ar%2Fmanuscript-dl","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fbalta2ar%2Fmanuscript-dl/lists"}