{"id":13935775,"url":"https://github.com/explosion/projects","last_synced_at":"2025-05-15T10:02:09.622Z","repository":{"id":36324781,"uuid":"223165649","full_name":"explosion/projects","owner":"explosion","description":"🪐 End-to-end NLP workflows from prototype to production","archived":false,"fork":false,"pushed_at":"2024-10-15T12:32:08.000Z","size":19619,"stargazers_count":1371,"open_issues_count":6,"forks_count":469,"subscribers_count":30,"default_branch":"v3","last_synced_at":"2025-04-11T04:11:22.879Z","etag":null,"topics":["annotations","datasets","natural-language-processing","nlp","prodigy","spacy"],"latest_commit_sha":null,"homepage":"https://spacy.io/usage/projects","language":"Python","has_issues":false,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/explosion.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":".github/FUNDING.yml","license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null},"funding":{"custom":"https://explosion.ai/merch"}},"created_at":"2019-11-21T12:08:52.000Z","updated_at":"2025-04-10T14:38:22.000Z","dependencies_parsed_at":"2023-11-09T15:46:22.784Z","dependency_job_id":"bf16f9b5-e991-4895-afdc-6b511eb885b6","html_url":"https://github.com/explosion/projects","commit_stats":{"total_commits":454,"total_committers":29,"mean_commits":"15.655172413793103","dds":0.7555066079295154,"last_synced_commit":"504e1e428238a08fb038c04af04ea4b3b34756af"},"previous_names":[],"tags_count":6,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/explosion%2Fprojects","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/explosion%2Fprojects/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/explosion%2Fprojects/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/explosion%2Fprojects/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/explosion","download_url":"https://codeload.github.com/explosion/projects/tar.gz/refs/heads/v3","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":254319717,"owners_count":22051072,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["annotations","datasets","natural-language-processing","nlp","prodigy","spacy"],"created_at":"2024-08-07T23:02:05.088Z","updated_at":"2025-05-15T10:02:08.453Z","avatar_url":"https://github.com/explosion.png","language":"Python","readme":"\u003ca href=\"https://explosion.ai\"\u003e\u003cimg src=\"https://explosion.ai/assets/img/logo.svg\" width=\"125\" height=\"125\" align=\"right\" /\u003e\u003c/a\u003e\n\n# 🪐 Project Templates\n\n[Weasel](https://github.com/explosion/weasel), previously\n[spaCy projects](https://spacy.io/usage/projects), lets you manage and share\n**end-to-end workflows** for different **use cases and domains**, and\norchestrate training, packaging and serving your custom pipelines. You can start\noff by cloning a pre-defined project template, adjust it to fit your needs, load\nin your data, train a pipeline, export it as a Python package, upload your\noutputs to a remote storage and share your results with your team.\n\n\u003e ⚠️ Weasel project templates require\n\u003e [**Weasel**](https://github.com/explosion/weasel), which is also included by\n\u003e default with spaCy v3.7+. You can install it from pip with\n\u003e `pip install weasel` or conda with `conda install weasel -c conda-forge`. Make\n\u003e sure to use a fresh virtual environment.\n\u003e\n\u003e See the [`master` branch](https://github.com/explosion/projects/tree/master)\n\u003e for the previous version of this repo.\n\n[![tests](https://github.com/explosion/projects/actions/workflows/tests.yml/badge.svg)](https://github.com/explosion/projects/actions/workflows/tests.yml)\n[![spaCy](https://img.shields.io/static/v1?label=made%20with%20%E2%9D%A4%20and\u0026message=spaCy\u0026color=09a3d5\u0026style=flat-square)](https://spacy.io)\n\n## 🗃 Categories\n\n| Name                           | Description                                                                                                                                                                             |\n| ------------------------------ | --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |\n| [`pipelines`](pipelines)       | Templates for training NLP pipelines with different components on different corpora.                                                                                                    |\n| [`tutorials`](tutorials)       | Templates that work through a specific NLP use case end-to-end.                                                                                                                         |\n| [`integrations`](integrations) | Templates showing integrations with third-party libraries and tools for managing your data and experiments, iterating on demos and prototypes and shipping your models into production. |\n| [`benchmarks`](benchmarks)     | Templates to reproduce our benchmarks and produce quantifiable results that are easy to compare against other systems or versions of spaCy.                                             |\n| [`experimental`](experimental) | Experimental workflows and other cutting-edge stuff to use at your own risk.                                                                                                            |\n\n## 🚀 Quickstart\n\nProjects can be used via the\n[`weasel`](https://github.com/explosion/weasel/blob/main/docs/cli.md) CLI, or\nthrough the [`spacy project`](https://spacy.io/api/cli#project) alias. To find\nout more about a command, add `--help`. For detailed instructions, see the\n[Weasel documentation](https://github.com/explosion/weasel/tree/main#-documentation)\nor [spaCy projects usage guide](https://spacy.io/usage/projects).\n\n1. **Clone** the project template you want to use.\n   ```bash\n   python -m weasel clone tutorials/ner_fashion_brands\n   ```\n2. **Install** any project requirements.\n   ```bash\n   cd ner_fashion_brands\n   python -m pip install -r requirements.txt\n   ```\n3. **Fetch assets** (data, weights) defined in the `project.yml`.\n   ```bash\n   python -m weasel assets\n   ```\n4. **Run a command** defined in the `project.yml`.\n   ```bash\n   python -m weasel run preprocess\n   ```\n5. **Run a workflow** of multiple steps in order.\n   ```bash\n   python -m weasel run all\n   ```\n6. **Adjust** the template for **your specific use case**, load in your own\n   data, adjust the settings and model and share the result with your team.\n\n## 👷‍♀️Repository maintanance\n\nTo keep the project templates and their documentation up to date, this repo\ncontains several scripts:\n\n| Script                                                         | Description                                                                                                                                                                                                               |\n| -------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |\n| [`update_docs.py`](.github/update_docs.py)                     | Update all auto-generated docs in the given root. Calls into [`spacy project document`](https://spacy.io/api/cli#project-document) and only replaces the auto-generated sections, not any custom content before or after. |\n| [`update_category_docs.py`](.github/update_category_docs.py)   | Update the auto-generated `README.md` in the category directories listing the available project templates.                                                                                                                |\n| [`update_configs.py`](.github/update_configs.py)               | Update and auto-fill all `config.cfg` files included in the repo, similar to [`spacy init fill-config`](https://spacy.io/api/cli#init-fill-config). Can be used to keep the configs up to date with changes in spaCy.     |\n| [`update_projects_jsonl.py`](.github/update_projects_jsonl.py) | Update `projects.jsonl` file in the given root. Should be used at the root level of the repo.                                                                                                                             |\n","funding_links":["https://explosion.ai/merch"],"categories":["Python"],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fexplosion%2Fprojects","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fexplosion%2Fprojects","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fexplosion%2Fprojects/lists"}