https://github.com/jakelever/kindred

A Python biomedical relation extraction package that uses a supervised approach (i.e. needs training data).
https://github.com/jakelever/kindred

bionlp python relation-extraction text-mining

Last synced: 6 months ago
JSON representation

A Python biomedical relation extraction package that uses a supervised approach (i.e. needs training data).

Host: GitHub
URL: https://github.com/jakelever/kindred
Owner: jakelever
License: mit
Created: 2017-04-14T19:14:44.000Z (over 9 years ago)
Default Branch: master
Last Pushed: 2023-03-12T00:12:46.000Z (over 3 years ago)
Last Synced: 2025-10-28T11:25:26.672Z (9 months ago)
Topics: bionlp, python, relation-extraction, text-mining
Language: Python
Homepage:
Size: 2.38 MB
Stars: 158
Watchers: 5
Forks: 31
Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

          # Kindred





   





   





   





   





   





Kindred is a Python3 package for relation extraction in biomedical texts. Given some training data, it can build a model to identify relations between entities (e.g. drugs, genes, etc) in a sentence.

## Installation

You can install "kindred" via [pip](https://pypi.python.org/pypi/pip/) from [PyPI](https://pypi.org/project/kindred/)

```bash

pip install kindred

```

Kindred relies on the [Spacy](https://spacy.io) toolkit for parsing. After installing kindred (which also installs spacy), you will need to install a Spacy language model. For instance, the command below installs the English language model::

```bash

python -m spacy download en_core_web_sm

```

## Usage

Check out the [tutorial](https://github.com/jakelever/kindred/tree/master/tutorial) that goes through a simple use case of extracting capital cities from text. More details and the full documentation can be found at [readthedocs](http://kindred.readthedocs.io/).

### BioNLP Shared Task Example

```python

import kindred

# Load the SeeDev corpus

trainCorpus = kindred.bionlpst.load('2016-SeeDev-binary-train')

devCorpus = kindred.bionlpst.load('2016-SeeDev-binary-dev')

# Create a copy of the dev corpus to make predictions on

predictionCorpus = devCorpus.clone()

predictionCorpus.removeRelations()

# Create a relation classifier, train it and make predictions

classifier = kindred.RelationClassifier()

classifier.train(trainCorpus)

classifier.predict(predictionCorpus)

# Get the F1 score of the predicted relations

f1score = kindred.evaluate(devCorpus, predictionCorpus, metric='f1score')

```

### PubAnnotation Example

```python

corpus = kindred.pubannotation.load('bionlp-st-gro-2013-development')

```

### PubTator Example

```python

corpus = kindred.pubtator.load([19894120,19894121])

```

### Input Formats

Kindred can load several formats, including BioNLP Shared Task, JSON, BioC XML and a simple tag format. Check out the [file format documentation](https://kindred.readthedocs.io/en/stable/fileformats.html) for example data and code.

### Citing

It would be wonderful if you could cite the [associated paper](http://aclweb.org/anthology/W17-2322) for this package if used in any academic research.

```bibtex

@article{lever2017painless,

   title={Painless {R}elation {E}xtraction with {K}indred},

   author={Lever, Jake and Jones, Steven},

   journal={BioNLP 2017},

   pages={176--183},

   year={2017}

}

```

## Contributing

Contributions are very welcome.

## License

Distributed under the terms of the [MIT](http://opensource.org/licenses/MIT) license, "kindred" is free and open source software

## Issues

If you encounter any problems, please [file an issue](https://github.com/jakelever/kindred/issues) along with a detailed description.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/jakelever/kindred

Awesome Lists containing this project

README