{"id":15898567,"url":"https://github.com/ayushexel/spacy_tagger_parser_ud","last_synced_at":"2025-04-02T19:13:30.845Z","repository":{"id":104386966,"uuid":"347302717","full_name":"AyushExel/spaCy_tagger_parser_ud","owner":"AyushExel","description":"spaCy projects integration test with W\u0026B artifacts","archived":false,"fork":false,"pushed_at":"2021-03-13T08:20:28.000Z","size":5,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":3,"default_branch":"main","last_synced_at":"2025-02-08T09:44:54.136Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/AyushExel.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2021-03-13T07:25:01.000Z","updated_at":"2021-07-08T08:06:51.000Z","dependencies_parsed_at":"2023-04-20T20:32:17.803Z","dependency_job_id":null,"html_url":"https://github.com/AyushExel/spaCy_tagger_parser_ud","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AyushExel%2FspaCy_tagger_parser_ud","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AyushExel%2FspaCy_tagger_parser_ud/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AyushExel%2FspaCy_tagger_parser_ud/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AyushExel%2FspaCy_tagger_parser_ud/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/AyushExel","download_url":"https://codeload.github.com/AyushExel/spaCy_tagger_parser_ud/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":246875909,"owners_count":20848048,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-10-06T10:06:08.800Z","updated_at":"2025-04-02T19:13:30.821Z","avatar_url":"https://github.com/AyushExel.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"\u003c!-- SPACY PROJECT: AUTO-GENERATED DOCS START (do not remove) --\u003e\n\n# 🪐 spaCy Project: Part-of-speech Tagging \u0026 Dependency Parsing (Universal Dependencies)\n\nThis project template lets you train a part-of-speech tagger, morphologizer and dependency parser from a [Universal Dependencies](https://universaldependencies.org/) corpus. It takes care of downloading the treebank, converting it to spaCy's format and training and evaluating the model. The template uses the [`UD_English-EWT`](https://github.com/UniversalDependencies/UD_English-EWT) treebank by default, but you can swap it out for any other available treebank. Just make sure to adjust the `lang` and treebank settings in the variables below. Use `xx` for multi-language if no language-specific tokenizer is available in spaCy. Note that multi-word tokens will be merged together when the corpus is converted since spaCy does not support multi-word token expansion.\n\n## 📋 project.yml\n\nThe [`project.yml`](project.yml) defines the data assets required by the\nproject, as well as the available commands and workflows. For details, see the\n[spaCy projects documentation](https://spacy.io/usage/projects).\n\n### ⏯ Commands\n\nThe following commands are defined by the project. They\ncan be executed using [`spacy project run [name]`](https://spacy.io/api/cli#project-run).\nCommands are only re-run if their inputs have changed.\n\n| Command | Description |\n| --- | --- |\n| `preprocess` | Convert the data to spaCy's format |\n| `train` | Train UD_English-EWT |\n| `evaluate` | Evaluate on the test data and save the metrics |\n| `package` | Package the trained model so it can be installed |\n| `clean` | Remove intermediate files |\n\n### ⏭ Workflows\n\nThe following workflows are defined by the project. They\ncan be executed using [`spacy project run [name]`](https://spacy.io/api/cli#project-run)\nand will run the specified commands in order. Commands are only re-run if their\ninputs have changed.\n\n| Workflow | Steps |\n| --- | --- |\n| `all` | `preprocess` \u0026rarr; `train` \u0026rarr; `evaluate` \u0026rarr; `package` |\n\n### 🗂 Assets\n\nThe following assets are defined by the project. They can\nbe fetched by running [`spacy project assets`](https://spacy.io/api/cli#project-assets)\nin the project directory.\n\n| File | Source | Description |\n| --- | --- | --- |\n| `assets/UD_English-EWT` | Git |  |\n\n\u003c!-- SPACY PROJECT: AUTO-GENERATED DOCS END (do not remove) --\u003e\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fayushexel%2Fspacy_tagger_parser_ud","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fayushexel%2Fspacy_tagger_parser_ud","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fayushexel%2Fspacy_tagger_parser_ud/lists"}