{"id":22700035,"url":"https://github.com/ub-mannheim/hkb-gt","last_synced_at":"2025-03-29T19:10:40.374Z","repository":{"id":183986196,"uuid":"671122190","full_name":"UB-Mannheim/hkb-gt","owner":"UB-Mannheim","description":"Ground truth for a political newspaper of the Mannheim region (1931–1945) ","archived":false,"fork":false,"pushed_at":"2024-10-27T15:29:09.000Z","size":2447,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":2,"default_branch":"main","last_synced_at":"2025-02-04T19:46:09.805Z","etag":null,"topics":["ground-truth","newspaper","ocr","ocr-d"],"latest_commit_sha":null,"homepage":"","language":"Shell","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"cc-by-sa-4.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/UB-Mannheim.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE.md","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2023-07-26T15:31:02.000Z","updated_at":"2024-10-27T15:29:14.000Z","dependencies_parsed_at":"2025-02-04T19:44:13.837Z","dependency_job_id":"fd0c2236-9a73-4fc3-a20d-94a85b040367","html_url":"https://github.com/UB-Mannheim/hkb-gt","commit_stats":null,"previous_names":["jkamlah/hakenkreuzbanner-gt","ub-mannheim/hkb-gt"],"tags_count":0,"template":false,"template_full_name":"OCR-D/gt-repo-template","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/UB-Mannheim%2Fhkb-gt","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/UB-Mannheim%2Fhkb-gt/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/UB-Mannheim%2Fhkb-gt/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/UB-Mannheim%2Fhkb-gt/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/UB-Mannheim","download_url":"https://codeload.github.com/UB-Mannheim/hkb-gt/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":246230542,"owners_count":20744349,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["ground-truth","newspaper","ocr","ocr-d"],"created_at":"2024-12-10T06:09:36.759Z","updated_at":"2025-03-29T19:10:40.356Z","avatar_url":"https://github.com/UB-Mannheim.png","language":"Shell","funding_links":[],"categories":[],"sub_categories":[],"readme":"# hkb-gt\nThe Hakenkreuzbanner (hkb) was the party newspaper of the NSDAP for the Mannheim region from 1931 to 1945.\nMore information can be found on the website of [Udo Leuschner](https://www.udo-leuschner.de/zeitungsgeschichte/sonstige/hkb.htm).\n\nThe city archive of Mannheim [MARCHIVUM](https://druckschriften-digital.marchivum.de/zd/periodical/titleinfo/74387) has digitised this newspaper.\nIn order to optimise the automated text recognition, this ground truth dataset was created to train new models.\n\nMannheim University Library makes this ground truth available for the purposes of science, research and teaching.\nThe University Library expressly distances itself from all NS, racist and violence-glorifying content.\n\n### Images:\nImages can be downloaded via script to their own image folder\n\n`./download_images_to_folder.sh `\n\nOr to the existing PAGE XML files\n\n`./download_images_to_page.sh `\n\n### Quantity:\n- 36 single newspaper pages\n\n### Period:\n1931-1945\n\n### Font / Writing class:\nFraktur, Antiqua, Latin\n\n### Languages:\nGerman, English, French\n\n### Transcription guidelines:\nAll transcriptions were created using [Transkribus](https://readcoop.eu/transkribus/?sc=Transkribus). The transcription rules are based on the [OCR-D transcription guidelines Level 2](https://ocr-d.de/en/gt-guidelines/trans/trLevels.html) with some exceptions (see below):\n\n**Special characters**:\n- Long s (ſ)\n- Currency symbols: German Mark (ℳ) and Pfennig (₰), $, £\n- Fractions (¼ ½ ¾ ⅐ ⅑ ⅒ ⅓ ⅔ ⅕ ⅖ ⅗ ⅘ ⅙ ⅚ ⅛ ⅜ ⅝ ⅞)\n- R rotunda (ꝛ)\n- Dagger (†)\n- White square (□)\n\n**Normalizations**:\n- Roman numerals Ⅰ Ⅴ Ⅹ Ⅼ Ⅽ Ⅾ Ⅿ --\u003e I V X L C D M\n- Em dash (—) instead of En dash (–)\n- Asterisk (\\*) used for both standard asterisk (\\*) and tear-drop asterisk (✽)\n\n**Additional characters transcribed true to original** (contrary to OCR-D Level 2):\n- Double oblique hyphen (⸗)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fub-mannheim%2Fhkb-gt","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fub-mannheim%2Fhkb-gt","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fub-mannheim%2Fhkb-gt/lists"}