https://github.com/f1uctus/ttc
✍ 🗣 A Text-To-Conversation natural language processing toolkit [WIP].
https://github.com/f1uctus/ttc
conversation nlp nlp-apis nlp-library spacy spacy-extension spacy-nlp spacy-pipeline speaker-identification
Last synced: 6 months ago
JSON representation
✍ 🗣 A Text-To-Conversation natural language processing toolkit [WIP].
- Host: GitHub
- URL: https://github.com/f1uctus/ttc
- Owner: F1uctus
- License: gpl-3.0
- Created: 2021-01-15T19:30:27.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2024-09-16T17:52:00.000Z (about 1 year ago)
- Last Synced: 2024-09-16T22:00:09.926Z (about 1 year ago)
- Topics: conversation, nlp, nlp-apis, nlp-library, spacy, spacy-extension, spacy-nlp, spacy-pipeline, speaker-identification
- Language: Python
- Homepage:
- Size: 2.19 MB
- Stars: 4
- Watchers: 4
- Forks: 0
- Open Issues: 3
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
## Text-To-Conversation toolkit (TTC)
This NLP library can help you with:
- Extraction of characters' replicas from literary texts;
- Identification of the actors owning these replicas.### Demo (CLI)

### Progress
We aim to achieve the following goals:
- Better accuracy on the actor classification task
(it is near 80% or worse for now);
- Support for more languages
(only Russian is supported at the moment).### Installation
Install with pip is just a usual `pip install .` from the project dir.
### Usage
**As a library**
You can find an example of using the library in the [`cli.py`](ttc/cli.py) file.
**As a tool for the CLI**
Test output on a text file:
```console
ttc print-play path-to-the-text-file text-language
```**Notes**
- Text must be encoded in UTF-8;
- Text must be sanitized (see https://github.com/F1uctus/ttc/issues/23);
- It is usually better to test on some middle-sized text (e.g a book chapter);
- Supported `text-language`s are:
+ ru (russian)### Development
Please install [Poetry](https://python-poetry.org).
Spawn a new virtual environment for the project:
```console
poetry shell
```Install project dependencies:
```console
poetry install [--with dev,large_models_ru]
```Contributions are very welcome!
#### Implementation notes
[russian/*/actor_classifier.py](ttc/language/russian/pipelines/actor_classifier.py):
