{"id":18541847,"url":"https://github.com/cltk/latin_treebank_perseus","last_synced_at":"2025-11-01T05:30:34.783Z","repository":{"id":15930308,"uuid":"18672337","full_name":"cltk/latin_treebank_perseus","owner":"cltk","description":"Latin treebank from the Perseus Digital Library","archived":false,"fork":false,"pushed_at":"2017-06-22T06:24:05.000Z","size":36147,"stargazers_count":4,"open_issues_count":1,"forks_count":1,"subscribers_count":7,"default_branch":"master","last_synced_at":"2024-12-26T05:12:27.300Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/cltk.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2014-04-11T11:43:44.000Z","updated_at":"2024-11-22T08:15:00.000Z","dependencies_parsed_at":"2022-09-24T05:41:20.959Z","dependency_job_id":null,"html_url":"https://github.com/cltk/latin_treebank_perseus","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cltk%2Flatin_treebank_perseus","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cltk%2Flatin_treebank_perseus/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cltk%2Flatin_treebank_perseus/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cltk%2Flatin_treebank_perseus/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/cltk","download_url":"https://codeload.github.com/cltk/latin_treebank_perseus/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":239262366,"owners_count":19609518,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-06T20:06:30.505Z","updated_at":"2025-11-01T05:30:34.751Z","avatar_url":"https://github.com/cltk.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# About\n\nThis repository contains treebanks for Latin from the [Ancient Latin Dependency Treebank, version 1.7](http://nlp.perseus.tufts.edu/syntax/treebank/). The file `latin_treebank_perseus/ldt-1.5.xml` contains all of the treebank data.\n\n\n# Part of speech\n\nSee `make_pos_models.py` for how models were created for the unigram, bigram, trigram, backoff (1, 2, 3), and crf models were made. They are kept in the `cltk/latin_models_cltk \u003chttps://github.com/cltk/latin_models_cltk\u003e`_ repo.\n\nThe Lapos model was made with the Lapos tagger (`cltk/lapos \u003chttps://github.com/cltk/lapos\u003e`_) and the following command:\n\n``` shell\n$ ./lapos-learn -m ./model latin_training_set.pos\n```\n\n\n# README\n\nThis is a README file for the Latin Dependency Treebank, version 1.5.\n\n\n1. Preamble\n\n\t1.1 Source\n\t\n\t\tThe Latin Dependency Treebank is available at:\n\t\t\n\t\thttp://nlp.perseus.tufts.edu/syntax/treebank/1.5\n\t\t\n\t\t\n\t1.2 License\n\t\n\t\tLDT 1.5 is licensed under a Creative Commons Attribution- \n\t\tNonCommercial-ShareAlike 2.5 License:\n\t\t\n\t\thttp://creativecommons.org/licenses/by-nc-sa/2.5\n\t\t\n\t\t\n2. Documentation\n\n\t2.1 Data Format\n\t\n\t\tThe data given in this treebank is provided as an XML document.  Each \n\t\tword contains six required attributes:\n\t\t\n\t\tid: This is a unique identifier, and corresponds to the word's linear \n\t\tposition in the sentence.  The first word in a sentence is given \n\t\tid 1.\n\t\t\n\t\tform: The token form of the word.\n\t\t\n\t\tlemma: The base lemma from which the word is derived.\n\t\t\n\t\thead: The id of the word's parent.  If a word depends on the sentence \n\t\troot, its head is 0.\n\t\t\n\t\trelation: The syntactic relation between the word and its parent.  A \n\t\tcatalogue of syntactic tags can be found in the syntactic guidelines \n\t\tdescribed below.\n\t\t\n\t\tpostag: The morphological analysis for the word.  This field is 9 \n\t\tcharacters long, and corresponds to the following morphological \n\t\tfeatures:\n\t\t\n\t\t\t1: \tpart of speech\n\t\t\t\n\t\t\t\tn\tnoun\n\t\t\t\tv\tverb\n\t\t\t\tt\tparticiple\n\t\t\t\ta\tadjective\n\t\t\t\td\tadverb\n\t\t\t\tc\tconjunction\n\t\t\t\tr\tpreposition\n\t\t\t\tp\tpronoun\n\t\t\t\tm\tnumeral\n\t\t\t\ti\tinterjection\n\t\t\t\te\texclamation\n\t\t\t\tu\tpunctuation\n\t\t\t\n\t\t\t2: \tperson\n\t\t\t\n\t\t\t\t1\tfirst person\n\t\t\t\t2\tsecond person\n\t\t\t\t3\tthird person\n\t\t\t\n\t\t\t3: \tnumber\n\t\t\t\n\t\t\t\ts\tsingular\n\t\t\t\tp\tplural\n\t\t\t\n\t\t\t4: \ttense\n\t\t\t\n\t\t\t\tp\tpresent\n\t\t\t\ti\timperfect\n\t\t\t\tr\tperfect\n\t\t\t\tl\tpluperfect\n\t\t\t\tt\tfuture perfect\n\t\t\t\tf\tfuture\n\t\t\t\n\t\t\t5: \tmood\n\t\t\t\n\t\t\t\ti\tindicative\n\t\t\t\ts\tsubjunctive\n\t\t\t\tn\tinfinitive\n\t\t\t\tm\timperative\n\t\t\t\tp\tparticiple\n\t\t\t\td\tgerund\n\t\t\t\tg\tgerundive\n\t\t\t\tu\tsupine\n\t\t\t\n\t\t\t6: \tvoice\n\t\t\t\n\t\t\t\ta\tactive\n\t\t\t\tp\tpassive\n\t\t\t\n\t\t\t7:\tgender\n\t\t\t\n\t\t\t\tm\tmasculine\n\t\t\t\tf\tfeminine\n\t\t\t\tn\tneuter\n\t\t\t\n\t\t\t8: \tcase\n\t\t\t\n\t\t\t\tn\tnominative\n\t\t\t\tg\tgenitive\n\t\t\t\td\tdative\n\t\t\t\ta\taccusative\n\t\t\t\tb\tablative\n\t\t\t\tv\tvocative\n\t\t\t\tl\tlocative\n\t\t\t\n\t\t\t9: \tdegree\n\t\t\t\n\t\t\t\tc\tcomparative\n\t\t\t\ts\tsuperlative\n\t\t\t\n\t\t\t---\n\t\t\t\n\t\t\tFor example, the postag for the adjective \"alium\" is \"a-s---ma-\", \n\t\t\twhich corresponds to the following features:\n\t\t\t\n\t\t\t1: a\tadjective\n\t\t\t2: -\n\t\t\t3: s\tsingular\n\t\t\t4: -\n\t\t\t5: -\n\t\t\t6: -\n\t\t\t7: m\tmasculine\n\t\t\t8: a\taccusative\n\t\t\t9: -\n\n\t\t\n\t\n\t2.2 Text\n\t\n\t\tLDT 1.5 is comprised of excerpts from eight texts, in the following \n\t\tdistribution:\n\t\t\n\t\tCaesar:\t1,488 words\n\t\tCicero:\t6,229 words\n\t\tJerome:\t8,382 words\n\t\tOvid: 4,789 words\n\t\tPetronius: 12,474 words\n\t\tPropertius: 4,857 words\n\t\tSallust: 12,311 words\n\t\tVergil:\t2,613 words\n\t\t\n\t\tThe editions of these texts are as follows:\n\t\t\n\t\tCaesar, C. Julius, Commentarii Rerum in Gallia Gestarum VII: A Hirti \n\t\tCommentarius VIII.  T. Rice Holmes (Oxford: Clarendon Press, 1914).\n\t\t\n\t\tCicero, M. Tullius, Orationes.  Recognovit brevique adnotatione critica \n\t\tinstruxit Albertus Curtis Clark (Oxford: Clarendon Press, 1908).\n\t\t\n\t\tJerome, Vulgate Bible.  Bible Foundation and On-Line Book Initiative.  \n\t\tftp.std.com/obi/Religion/Vulgate. \n\t\t\n\t\tOvid, Metamorphoses.  Hugo Magnus (ed.) (Gotha: Friedr. Andr. Perthes, \n\t\t1892).\n\t\t\n\t\tPetronius, Satyricon.  W. H. D. Rouse (ed.) (London: William Heinemann, \n\t\t1913).\n\t\t\n\t\tPropertius, Charm. Vincent Katz (trans.) (Los Angeles: Sun and Moon \n\t\tPress, 1995).\n\t\t\n\t\tVergil, Bucolica, Aeneis, Georgica. The Greater Poems of Virgil. J. B. \n\t\tGreenough (Boston: Ginn \u0026 Co., 1882).\n\t\t\n\t\tC. Sallusti Crispi Catilina, Iugurtha, Orationes et epistulae excerptae\n\t\tde historiis. Axel W. Ahlberg (Leipzig: Teubner, 1919).\n\t\t\n\t\tThe following document_ids in the treebank correspond to the following \n\t\tworks:\n\t\t\n\t\tPerseus:text:1999.02.0002\tCaesar (Commentarii de Bello Gallico)\n\t\tPerseus:text:1999.02.0010\tCicero (In Catilinam)\n\t\tPerseus:text:1999.02.0060\tJerome (Vulgata)\n\t\tPerseus:text:1999.02.0055\tVergil (Aeneid)\n\t\tPerseus:text:1999.02.0029\tOvid (Metamorphoses)\n\t\tPerseus:text:2007.01.0001\tPetronius (Satyricon)\n\t\tPerseus:text:1999.02.0066\tPropertius (Elegies)\n\t\tPerseus:text:2008.01.0002\tSallust (Bellum Catilinae)\n\t\t\n\t2.3 Annotation Standards\n\n\t\tThis release of the treebank has been annotated according to the \n\t\tguidelines specified in version 1.3 of the \"Guidelines for the Syntactic \n\t\tAnnotation of Latin Treebanks,\" found in docs/guidelines.pdf\n\t\t\t  \n\t\t\n\t2.4 Authorship\n\n\t\tEach sentence in the Latin Dependency Treebank is built from the efforts\n\t\tof two independent annotators (marked \"primary\" in the data) reconciled\n\t\tby a third (marked \"secondary\").  We would like to recognize the \n\t\tcontribution of the following individuals toward its creation and thank\n\t\tthem for their commitment to the advancement of Classical scholarship:\n\t\n\t\tJames Artz, Calliopi Dourou, J. F. Gentile, Kenny Hickman, Alex Lessie, \n\t\tViet Luong, Meg Luthin, Molly Miller, Robin Ngo, Skylar Neil and the \n\t\tTufts University LAT-181 class (Spring 2008).\n\n\t\t\n\t\t\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcltk%2Flatin_treebank_perseus","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fcltk%2Flatin_treebank_perseus","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcltk%2Flatin_treebank_perseus/lists"}