Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/explosion/spacy-ray

☄️ Parallel and distributed training with spaCy and Ray
https://github.com/explosion/spacy-ray

distributed-computing machine-learning natural-language-processing parallel-training ray spacy training

Last synced: 4 months ago
JSON representation

☄️ Parallel and distributed training with spaCy and Ray

Awesome Lists containing this project

README

        

# spacy-ray: Parallel and distributed training with spaCy and Ray

> ⚠️ This repo is still a work in progress and requires the new **spaCy v3.0**.

[Ray](https://ray.io/) is a fast and simple framework for building and running
**distributed applications**. This very lightweight extension package lets you
use Ray for parallel and distributed training with [spaCy](https://spacy.io). If
`spacy-ray` is installed in the same environment as spaCy, it will automatically
add `spacy ray` commands to your spaCy CLI.

The main command is `spacy ray train` for parallel and distributed training, but
we expect to add `spacy ray pretrain` and `spacy ray parse` as well.

[![tests](https://github.com/explosion/spacy-ray/actions/workflows/tests.yml/badge.svg)](https://github.com/explosion/spacy-ray/actions/workflows/tests.yml)
[![Current Release Version](https://img.shields.io/github/v/release/explosion/spacy-ray.svg?include_prereleases&sort=semver&style=flat-square&logo=github)](https://github.com/explosion/spacy-ray/releases)
[![PyPi Version](https://img.shields.io/pypi/v/spacy-ray.svg?include_prereleases&sort=semver&style=flat-square&logo=pypi&logoColor=white)](https://pypi.python.org/pypi/spacy-ray)

## 🚀 Quickstart

You can install `spacy-ray` from pip:

```bash
pip install spacy-ray
```

To check if the command has been registered successfully:

```bash
python -m spacy ray --help
```

Train a model using the same API as `spacy train`:

```bash
python -m spacy ray train config.cfg --n-workers 2
```