Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/veldhub/veld_chain__train_spacy_apis_ner
Chain velds encapsulating a spacy NER training setup on APIS data.
https://github.com/veldhub/veld_chain__train_spacy_apis_ner
named-entity-recognition nlp spacy spacy-nlp spacy-nlp-ner
Last synced: 6 days ago
JSON representation
Chain velds encapsulating a spacy NER training setup on APIS data.
- Host: GitHub
- URL: https://github.com/veldhub/veld_chain__train_spacy_apis_ner
- Owner: veldhub
- License: mit
- Created: 2024-12-04T13:37:56.000Z (about 2 months ago)
- Default Branch: main
- Last Pushed: 2025-01-20T16:47:26.000Z (7 days ago)
- Last Synced: 2025-01-20T17:44:07.222Z (7 days ago)
- Topics: named-entity-recognition, nlp, spacy, spacy-nlp, spacy-nlp-ner
- Language: Jupyter Notebook
- Homepage:
- Size: 11.6 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# ![veld chain](https://raw.githubusercontent.com/veldhub/.github/refs/heads/main/images/symbol_V_letter.png) veld_chain__train_spacy_apis_ner
This repo contains [chain velds](https://zenodo.org/records/13322913) encapsulating a spacy NER
training setup on APIS data.## requirements
- git
- docker compose (note: older docker compose versions require running `docker-compose` instead of
`docker compose`)Clone this repo with all its submodules
```
git clone --recurse-submodules https://github.com/veldhub/veld_chain__train_spacy_apis_ner.git
```## how to reproduce
The following chain velds were used. Open the respective veld yaml file for more information.
**[./veld_convert.yaml](./veld_convert.yaml)**
Cleaning and converting json into spaCy docbin
```
docker compose -f veld_convert.yaml up
```**[./veld_create_config.yaml](./veld_create_config.yaml)**
Creates a spacy training config according to passed arguments. See
https://spacy.io/usage/training/#config for the target outcome.```
docker compose -f veld_create_config.yaml up
```**[./veld_train.yaml](./veld_train.yaml)**
A NER trainig setup, utilizing spaCy 3's config system.
```
docker compose -f veld_train.yaml up
```**[./veld_analysis.yaml](./veld_analysis.yaml)**
Analyses out-of vocabulary occurrences of training data.
```
docker compose -f veld_analysis.yaml up
```**[./veld_publish_to_hf.yaml](./veld_publish_to_hf.yaml)**
Pushing spacy model to huggingface.
```
docker compose -f veld_publish_to_hf.yaml up
```