{"id":21660171,"url":"https://github.com/slub/esmarc","last_synced_at":"2026-05-22T05:02:27.892Z","repository":{"id":41279383,"uuid":"236740250","full_name":"slub/esmarc","owner":"slub","description":"marc21 -\u003e rdf mapping tool","archived":false,"fork":false,"pushed_at":"2023-12-14T15:02:15.000Z","size":327,"stargazers_count":1,"open_issues_count":0,"forks_count":1,"subscribers_count":7,"default_branch":"master","last_synced_at":"2025-01-25T07:08:54.977Z","etag":null,"topics":["json-ld","json-ld-context","marc21","python3","rdf","rdflib"],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/slub.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2020-01-28T13:22:24.000Z","updated_at":"2024-04-16T20:07:25.000Z","dependencies_parsed_at":"2023-12-13T13:38:46.498Z","dependency_job_id":"1d0a55ae-a283-4224-83e1-397e7b3d1035","html_url":"https://github.com/slub/esmarc","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/slub%2Fesmarc","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/slub%2Fesmarc/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/slub%2Fesmarc/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/slub%2Fesmarc/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/slub","download_url":"https://codeload.github.com/slub/esmarc/tar.gz/r
efs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":244555937,"owners_count":20471530,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["json-ld","json-ld-context","marc21","python3","rdf","rdflib"],"created_at":"2024-11-25T09:32:24.352Z","updated_at":"2026-05-22T05:02:27.804Z","avatar_url":"https://github.com/slub.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"\u003cimg alt=\"EFRE-Lod logo\" src=\"https://raw.githubusercontent.com/slub/data.slub-dresden.de/master/assets/images/EFRE_EU_quer_2015_rgb_engl.svg\" width=\"300\" \u003e\n\n# Installation\n\nrun:\n```\npip3 install . --user\n```\n\n# esmarc.py\n\nesmarc is a Python3 tool that reads line-delimited MARC21 JSON from an Elasticsearch index, performs a mapping, and writes the output to a directory with one file per mapping type.\n\ndependencies:\npython3-elasticsearch\nefre-lod-elasticsearch-tools\n\nrun:\n\n```\n$ esmarc.py \u003cOPTARG\u003e\n\t-h, --help            show this help message and exit\n\t-host HOST            hostname or IP-Address of the ElasticSearch-node to use. 
If None we try to read ldj from stdin.\n\t-port PORT            Port of the ElasticSearch-node to use, default is 9200.\n\t-type TYPE            ElasticSearch Type to use\n\t-index INDEX          ElasticSearch Index to use\n\t-id ID                map single document, given by id\n\t-help                 print this help\n\t-prefix PREFIX        Prefix to use for output data\n\t-debug                Dump processed Records to stdout (mostly used for debug-purposes)\n\t-server SERVER        use http://host:port/index/type/id?pretty syntax. overwrites host/port/index/id/pretty.\n\t-pretty               output tabbed json\n\t-w W                  how many processes to use\n\t-idfile IDFILE        path to a file with IDs to process\n\t-query QUERY          prefilter the data based on an elasticsearch-query\n\n```\n\n\n\u003ca name=\"entityfacts-bot.py\"/\u003e\n\n# entityfacts-bot.py \n\nentityfacts-bot.py is a Python3 program that enriches (\"links\") your data with more identifiers from EntityFacts. The prerequisite is that your data has a field containing the GND identifier.\n\n\nIt connects to an Elasticsearch node and outputs the enriched data, which can be written back to the index using esbulk.\n\n## Usage\n\n```\n./entityfacts-bot.py\n    -h, --help            show this help message and exit\n    -host HOST            hostname or IP-Address of the ElasticSearch-node to use, default is localhost.\n    -port PORT            Port of the ElasticSearch-node to use, default is 9200.\n    -index INDEX          ElasticSearch Search Index to use\n    -type TYPE            ElasticSearch Search Index Type to use\n    -id ID                retrieve single document (optional)\n    -searchserver SEARCHSERVER use http://host:port/index/type/id?pretty. 
overwrites host/port/index/id/pretty\n    -stdin                get data from stdin\n    -pipeline             output every record (even if not enriched) to put this script into a pipeline\n\n```\n\n\n## Requirements\n\npython3-elasticsearch\n\ne.g. on Ubuntu:\n```\nsudo apt-get install python3-elasticsearch\n```\n\u003ca name=\"entityfacts-bot.py\"/\u003e\n\n# wikidata.py \n\nwikidata.py is a Python3 program that enriches (\"links\") your data with the Wikidata identifier from Wikidata. The prerequisite is that your data has a field containing the GND identifier. Support for other identifiers is planned.\n\n\nIt connects to an Elasticsearch node and outputs the enriched data, which can be written back to the index using esbulk.\n\n## Usage\n\n```\n./wikidata.py\n    -h, --help      show this help message and exit\n    -host HOST      hostname or IP-Address of the ElasticSearch-node to use, default is localhost.\n    -port PORT      Port of the ElasticSearch-node to use, default is 9200.\n    -index INDEX    ElasticSearch Search Index to use\n    -type TYPE      ElasticSearch Search Index Type to use\n    -id ID          retrieve single document (optional)\n    -stdin          get data from stdin\n    -pipeline       output every record (even if not enriched) to put this script into a pipeline\n    -server SERVER  use http://host:port/index/type/id?pretty. overwrites host/port/index/id/pretty\n```\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fslub%2Fesmarc","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fslub%2Fesmarc","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fslub%2Fesmarc/lists"}