{"id":19162814,"url":"https://github.com/centre-for-humanities-computing/odycy","last_synced_at":"2025-05-07T11:21:41.324Z","repository":{"id":65530283,"uuid":"586824323","full_name":"centre-for-humanities-computing/odyCy","owner":"centre-for-humanities-computing","description":"A general-purpose NLP pipeline for Ancient Greek","archived":false,"fork":false,"pushed_at":"2024-03-26T14:29:10.000Z","size":45737,"stargazers_count":22,"open_issues_count":5,"forks_count":3,"subscribers_count":0,"default_branch":"main","last_synced_at":"2025-04-13T02:40:11.121Z","etag":null,"topics":["ancient-greek","machine-learning","natural-language-processing","nlp","python","spacy"],"latest_commit_sha":null,"homepage":"https://centre-for-humanities-computing.github.io/odyCy/","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/centre-for-humanities-computing.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":"CITATION.cff","codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2023-01-09T10:15:23.000Z","updated_at":"2025-03-20T17:38:21.000Z","dependencies_parsed_at":"2024-03-26T15:49:20.024Z","dependency_job_id":null,"html_url":"https://github.com/centre-for-humanities-computing/odyCy","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/centre-for-humanities-computing%2FodyCy","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/centre-for-humanities-computing%2FodyCy/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/centre-for-humanities-computing%2FodyCy/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/centre-for-humanities-computing%2FodyCy/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/centre-for-humanities-computing","download_url":"https://codeload.github.com/centre-for-humanities-computing/odyCy/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":252866122,"owners_count":21816397,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["ancient-greek","machine-learning","natural-language-processing","nlp","python","spacy"],"created_at":"2024-11-09T09:13:14.303Z","updated_at":"2025-05-07T11:21:41.299Z","avatar_url":"https://github.com/centre-for-humanities-computing.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"\u003cp align=\"center\"\u003e\n  \u003cimg width=\"200\" src=\"docs/_static/logo_with_text_below.svg\"\u003e\n  \u003cdiv align=\"center\" style=\"color: #2c5882; font-weight: bold; font-size: 14px; margin-top: -18px;\"\u003e\n    A general-purpose NLP pipeline for Ancient-Greek.\n  \u003c/div\u003e\n\u003c/p\u003e\n\u003cbr\u003e\n\n## Features :mount_fuji:\n\n - [x] Part of speech tagging\n - [x] Lemmatization\n - [x] Dependency parsing\n - [x] Morphological analysis\n - [ ] Named entity recognition (work in progress :construction:)\n\n## Installation :sunrise:\n\nOdyCy models can be directly installed from huggingface:\n\n```bash\n# To install the transformer-based pipeline\npip install https://huggingface.co/chcaa/grc_odycy_joint_trf/resolve/main/grc_odycy_joint_trf-any-py3-none-any.whl\n# To install the tok2vec-based small pipeline\npip install https://huggingface.co/chcaa/grc_odycy_joint_sm/resolve/main/grc_odycy_joint_sm-any-py3-none-any.whl\n```\n\n## Usage :whale:\n\n[![Open in Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/centre-for-humanities-computing/odyCy/blob/main/tutorials/01_odycy_getting_started.ipynb#\u0026offline=true\u0026sandboxMode=true)\n\nOdyCy pipelines can be imported with spaCy.\n\n```python\nimport spacy\n\n# For the transformer-based pipeline\nnlp = spacy.load(\"grc_odycy_joint_trf\")\n\n# For a faster and smaller (but less accurate) tok2vec-based pipeline\nnlp = spacy.load(\"grc_odycy_joint_sm\")\n```\n\nPipelines can then be used as any other spaCy pipeline.\n([spaCy Documentation](https://spacy.io/usage))\n\nCheck out our Documentation on [Basic Usage](https://centre-for-humanities-computing.github.io/odyCy/getting_started.html).\n\n## Performance :boat:\n\nodyCy achieves state of the art performance on multiple tasks on unseen test data from the Universal Dependencies Perseus treebank,\nand performs second best on the PROIEL treebank’s test set on even more tasks.\nIn addition performance also seems relatively stable across the two evaluation datasets in comparison with other NLP pipelines.\n\nFor plots and tables on OdyCy's performance, check out the Documentation page on [Performance](https://centre-for-humanities-computing.github.io/odyCy/performance.html)\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcentre-for-humanities-computing%2Fodycy","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fcentre-for-humanities-computing%2Fodycy","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcentre-for-humanities-computing%2Fodycy/lists"}