Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/CLARIAH/DANE
Utils for working with the Distributed Annotation 'n' Enrichment (DANE) system
https://github.com/CLARIAH/DANE
Last synced: 2 months ago
JSON representation
Utils for working with the Distributed Annotation 'n' Enrichment (DANE) system
- Host: GitHub
- URL: https://github.com/CLARIAH/DANE
- Owner: CLARIAH
- License: apache-2.0
- Created: 2019-11-25T14:28:24.000Z (about 5 years ago)
- Default Branch: main
- Last Pushed: 2024-08-02T06:46:26.000Z (6 months ago)
- Last Synced: 2024-11-05T14:20:40.749Z (2 months ago)
- Language: Python
- Homepage: https://dane.readthedocs.io/en/latest/
- Size: 362 KB
- Stars: 4
- Watchers: 12
- Forks: 1
- Open Issues: 3
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-ai4lam - Distributed Annotation 'n' Enrichment (DANE) - clariah), Norway) (Tools and Frameworks<a title="Suggest an addition to the list!" href="https://forms.gle/aPA41GT5AmbxrTwq5"><img alt="Click button to suggest an addition" align="right" src="https://raw.githubusercontent.com/AI4LAM/awesome-ai4lam/main/.graphics/suggest-addition-small.svg"></a> / Document analysis, transcription, and labeling)
README
# DANE
The Distributed Annotation 'n' Enrichment (DANE) system handles compute task assignment and file storage for the automatic annotation of content.This repository contains contains the building blocks for with DANE, such as creating custom analysis workers or submitting new task.
## Installation
This package can be installed through pip:
pip install dane
### Configuration
DANE components are configured through the dane.config module, which is described here: https://dane.readthedocs.io/en/latest/intro.html#configuration
It is however noteable that, because all DANE components are expected to rely on it, some of the DANE-server, ElasticSearch and RabbitMQ configuration
are included in the default config. As such it is recommended that you create a `$HOME/.dane/config.yml` or `$DANE_HOME/config.yml` which contain machine-wide settings for how to connect to these services, which involves specifying the following settings:```
DANE:
API_URL: 'http://localhost:5500/DANE/'
MANAGE_URL: 'http://localhost:5500/manage/'
RABBITMQ:
HOST: 'localhost'
PORT: 5672
EXCHANGE: 'DANE-exchange'
RESPONSE_QUEUE: 'DANE-response-queue'
USER: 'guest'
PASSWORD: 'guest'
ELASTICSEARCH:
HOST: ['localhost']
PORT: 9200
USER: 'elastic'
PASSWORD: 'changeme'
SCHEME: 'http'
INDEX: 'your_dane_index'
```The values given here are the default values.
### Usage
Examples of how to use DANE can be found in the `examples/` directory.
## Local Development
We moved from `setup.py` & `requirements.txt` to a single `pyproject.toml`. For local builds and publishing we use [poetry](https://python-poetry.org/).
For local installation:
```bash
poetry install
poetry shell
```After installation the following unit test should succeed:
```bash
python -m test.test_dane
```To build a wheel + source package (will end up in `dist` directory):
```bash
poetry build
```The wheel can be conveniently tested in e.g. your own DANE worker by installing it e.g. using `pip`:
```bash
pip install path_to_dane_wheel_file
```or with poetry
```bash
poetry add path_to_dane_wheel_file
```### Breaking changes after 0.3.1
Since version 0.3.1 DANE must be imported in lowercase letters:
```python
import dane
```Before version 0.3.1 you should import using uppercase letters:
```python
import DANE
```