Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/vngrs-ai/vnlp

State-of-the-art, lightweight NLP tools for Turkish language. Developed by VNGRS.
https://github.com/vngrs-ai/vnlp

deasciifier deep-learning dependency-parsing fasttext morphological-analysis morphological-disambiguation named-entity-recognition nlp normalization number-to-words part-of-speech-tagging sentence-splitting sentence-tokenizer sentiment-analysis spelling-correction stemming stopword-removal turkish-nlp word-embeddings word2vec

Last synced: 2 days ago
JSON representation

State-of-the-art, lightweight NLP tools for Turkish language. Developed by VNGRS.

Awesome Lists containing this project

README

        

## VNLP: Turkish NLP Tools
State-of-the-art, lightweight NLP tools for Turkish language.

Developed by VNGRS.

https://vngrs.com/

[![PyPI version](https://badge.fury.io/py/vngrs-nlp.svg)](https://badge.fury.io/py/vngrs-nlp)
[![PyPi downloads](https://static.pepy.tech/personalized-badge/vngrs-nlp?period=total&units=international_system&left_color=grey&right_color=orange&left_text=pip%20downloads)](https://pypi.org/project/vngrs-nlp/)
[![Docs]()](https://vnlp.readthedocs.io/)
[![License]()](https://github.com/vngrs-ai/vnlp/blob/main/LICENSE)
[![Python check](https://github.com/vngrs-ai/vnlp/actions/workflows/test.yml/badge.svg)](https://github.com/vngrs-ai/vnlp/actions/workflows/test.yml)

### Functionality:
- Sentence Splitter
- Normalizer
- Spelling/Typo correction
- Convert numbers to word form
- Deasciification
- Stopword Remover:
- Static
- Dynamic
- Stemmer: Morphological Analyzer & Disambiguator
- Named Entity Recognizer (NER)
- Dependency Parser
- Part of Speech (PoS) Tagger
- Sentiment Analyzer
- Turkish Word Embeddings
- FastText
- Word2Vec
- SentencePiece Unigram Tokenizer
- News Summarization
- News Paraphrasing

- Summarization and Paraphrasing models are available in the demo. Contact us at [email protected] for API.

### Demo:
- Try the [Demo](https://demo.vnlp.io).

### Installation
```
pip install vngrs-nlp
```

### Documentation:
- See the [Documentation](https://vnlp.readthedocs.io) for the details about usage, classes, functions, datasets and evaluation metrics.

### Metrics:

### Usage Example:
**Dependency Parser**
```
from vnlp import DependencyParser
dep_parser = DependencyParser()

dep_parser.predict("Oğuz'un kırmızı bir Astra'sı vardı.")
[("Oğuz'un", 'PROPN'),
('kırmızı', 'ADJ'),
('bir', 'DET'),
("Astra'sı", 'PROPN'),
('vardı', 'VERB'),
('.', 'PUNCT')]

# Spacy's submodule Displacy can be used to visualize DependencyParser result.
import spacy
from vnlp import DependencyParser
dependency_parser = DependencyParser()
result = dependency_parser.predict("Oğuz'un kırmızı bir Astra'sı vardı.", displacy_format = True)
spacy.displacy.render(result, style="dep", manual = True)
```

## Citation

```bibtex
@article{turker2024vnlp,
title={VNLP: Turkish NLP Package},
author={Turker, Meliksah and Ari, Erdi and Han, Aydin},
journal={arXiv preprint arXiv:2403.01309},
year={2024}
}
```