{"id":25381762,"url":"https://github.com/isawnyu/oracc2csv","last_synced_at":"2026-04-30T13:34:55.960Z","repository":{"id":142313593,"uuid":"507534326","full_name":"isawnyu/oracc2csv","owner":"isawnyu","description":"The Open Richly Annotated Cuneiform Corpus (ORACC) publishes JSON data for each of its projects. Sometimes you want the catalog data listing each text to be in CSV format. This package does that.","archived":false,"fork":false,"pushed_at":"2022-06-26T09:47:06.000Z","size":9097,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":3,"default_branch":"main","last_synced_at":"2026-04-30T13:34:55.346Z","etag":null,"topics":["csv","cuneiform","json","oracc"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"agpl-3.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/isawnyu.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE.txt","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2022-06-26T09:46:41.000Z","updated_at":"2023-02-06T16:46:04.000Z","dependencies_parsed_at":null,"dependency_job_id":"3ea94245-9b47-44f8-9e49-df78ced8a8d8","html_url":"https://github.com/isawnyu/oracc2csv","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/isawnyu/oracc2csv","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/isawnyu%2Foracc2csv","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/isawnyu%2Foracc2csv/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/isawnyu%2Foracc2csv/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/isawnyu%2Foracc2csv/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/isawnyu","download_url":"https://codeload.github.com/isawnyu/oracc2csv/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/isawnyu%2Foracc2csv/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":32466333,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-04-30T13:12:12.517Z","status":"ssl_error","status_checked_at":"2026-04-30T13:12:06.837Z","response_time":57,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["csv","cuneiform","json","oracc"],"created_at":"2025-02-15T06:33:20.965Z","updated_at":"2026-04-30T13:34:55.943Z","avatar_url":"https://github.com/isawnyu.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"\u003c!--\nThis file is part of \"oracc2csv\"\nby Tom Elliott for the Institute for the Study of the Ancient World (NYU)\n(c) Copyright 2022 by New York University\nLicensed under the AGPL-3.0; see LICENSE.txt file.\n--\u003e\n\n# oracc2csv\n\nThe [Open Richly Annotated Cuneiform Corpus (ORACC)](http://oracc.museum.upenn.edu/) publishes JSON data for each of its projects. Sometimes you want the catalog data listing each text to be in CSV format. This package does that.\n\nThis program was written by [Tom Elliott](https://orcid.org/0000-0002-4114-6677) for the [Institute for the Study of the Ancient World (NYU)](https://isaw.nyu.edu) and is Copyright 2022 by New York University. It is licensed under the GNU Affero General Public License (see LICENSE.txt).\n\n## Install\n\nCreate a python 3.10.4+ virtual environment. Download or clone this package from GitHub. Run:\n\n```\npip install -U -r requirements_dev.txt\n```\n\n## Use\n\nDownload the zip file of the ORACC project you're interested in (e.g., http://oracc.org/json/hbtin.zip). Run the oracc2csv `dump` script:\n\n```\n\u003e python scripts/dump.py -v ~/oracc/hbtin ~/scratch\nINFO:root:logging level changed to INFO via command line option; was WARNING\nINFO:oracc2csv:Loaded corpus from /Users/banana/oracc/hbtin:\nHBTIN: Hellenistic Babylonia: Texts, Iconography, Names\nCuneiform texts, iconography and onomastic data from Hellenistic Babylonia, primarily from Uruk. HBTIN texts form the demonstrator corpus of the \u003ca href=\"http://berkeleyprosopography.org/\"\u003eBerkeley Prosopography Service\u003c/a\u003e (BPS).  Directed by Laurie Pearce at UC Berkeley.\n572 entries\nINFO:oracc2csv:Wrote corpus to /Users/banana/scratch\n```\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fisawnyu%2Foracc2csv","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fisawnyu%2Foracc2csv","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fisawnyu%2Foracc2csv/lists"}