Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/midas-research/dlkp
A deep learning library for identifying keyphrases from text
https://github.com/midas-research/dlkp
dataset deep-learning information-extraction information-retrieval keyphrase-extraction keyphrase-generation machine-learning nlp
Last synced: 3 days ago
JSON representation
A deep learning library for identifying keyphrases from text
- Host: GitHub
- URL: https://github.com/midas-research/dlkp
- Owner: midas-research
- License: mit
- Created: 2021-12-20T19:40:07.000Z (almost 3 years ago)
- Default Branch: main
- Last Pushed: 2022-08-01T19:40:11.000Z (over 2 years ago)
- Last Synced: 2024-08-08T20:55:57.502Z (3 months ago)
- Topics: dataset, deep-learning, information-extraction, information-retrieval, keyphrase-extraction, keyphrase-generation, machine-learning, nlp
- Language: Python
- Homepage:
- Size: 266 KB
- Stars: 25
- Watchers: 11
- Forks: 3
- Open Issues: 4
-
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
Awesome Lists containing this project
README
# dlkp [WIP]
![wip](https://i.gifer.com/origin/82/82a1ed531e333926a8ca2a00c277e0d1.gif)
A transformers based deep learning library for keyphrase identification from text documents.
dlkp is:
* **A deep learning keyphrase extraction and generation library.** dlkp allows you to train and apply state-of-the-art
deep learning models for keyphrase extraction and generation from text documents.* **Transformer based framework.** dlkp framework builds directly on [transformers](https://github.com/huggingface/transformers),
making it easy to train and evaluate your own transformer based keyphrase extraction and generation models and experiment with
new approaches using different contextual embeddings.* **A dataset library for keyphrase extraction and generation.** dlkp has simple interfaces that allow you
to download several benchmark datasets in the domain of keyphrase extraction and generation from
[Huggingface Datasets](https://huggingface.co/docs/datasets/index) and readily use them in your training your models
with the transformer library. It provides easy access to BIO tagged data for several datasets such as Inspec, NUS,
WWW, KDD, KP20K, LDKP and many more suitable for training your keyphrase extraction model as a sequence tagger.* **An evaluation library for keyphrase extraction and generation.** dlkp implements several evaluation metrics for
evaluating keyphrase extraction and generation models and helps to generate evaluation reports of your models.
## Quick Start### Requirements and Installation
The project is based on transformers>=4.6.0 and Python 3.6+. If you do not have Python 3.6, install it first.
[Here is how for Ubuntu 16.04](https://vsupalov.com/developing-with-python3-6-on-ubuntu-16-04/).
Then, in your favorite virtual environment, simply do:```
git clone https://github.com/midas-research/dlkp.git
cd dlkp
pip install -e .
```### Example Usage
#### Keyphrase Extraction
#### Keyphrase Generation
## Tutorials
* [Loading Datasets](resources/docs/TUTORIAL_LOADING_DATASETS.md)
* [Training Models](resources/docs/TUTORIAL_TRAINING_MODELS.md)
* [Evaluating Models](resources/docs1/TUTORIAL_EVALUATING_MODELS.md)
## Citing dlkp
## Contact
Please email your questions or comments to [Amardeep Kumar](https://ad6398.github.io) or [Debanjan Mahata](https://sites.google.com/a/ualr.edu/debanjan-mahata/)
## Contributing
Thanks for your interest in contributing! There are many ways to get involved;
start with our [contributor guidelines](CONTRIBUTING.md) and then
check these [open issues](https://github.com/midas-research/dlkp/issues) for specific tasks.## [License](/LICENSE)
The MIT License (MIT)