Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/vngrs-ai/vnlp
State-of-the-art, lightweight NLP tools for Turkish language. Developed by VNGRS.
https://github.com/vngrs-ai/vnlp
deasciifier deep-learning dependency-parsing fasttext morphological-analysis morphological-disambiguation named-entity-recognition nlp normalization number-to-words part-of-speech-tagging sentence-splitting sentence-tokenizer sentiment-analysis spelling-correction stemming stopword-removal turkish-nlp word-embeddings word2vec
Last synced: 2 days ago
JSON representation
State-of-the-art, lightweight NLP tools for Turkish language. Developed by VNGRS.
- Host: GitHub
- URL: https://github.com/vngrs-ai/vnlp
- Owner: vngrs-ai
- License: agpl-3.0
- Created: 2021-07-26T14:39:51.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2024-11-27T12:48:00.000Z (3 months ago)
- Last Synced: 2024-11-27T13:32:25.430Z (3 months ago)
- Topics: deasciifier, deep-learning, dependency-parsing, fasttext, morphological-analysis, morphological-disambiguation, named-entity-recognition, nlp, normalization, number-to-words, part-of-speech-tagging, sentence-splitting, sentence-tokenizer, sentiment-analysis, spelling-correction, stemming, stopword-removal, turkish-nlp, word-embeddings, word2vec
- Language: Python
- Homepage: https://vngrs.com
- Size: 392 MB
- Stars: 257
- Watchers: 9
- Forks: 17
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- turkish-nlp-resources - VNLP
README
## VNLP: Turkish NLP Tools
State-of-the-art, lightweight NLP tools for Turkish language.Developed by VNGRS.
https://vngrs.com/
[![PyPI version](https://badge.fury.io/py/vngrs-nlp.svg)](https://badge.fury.io/py/vngrs-nlp)
[![PyPi downloads](https://static.pepy.tech/personalized-badge/vngrs-nlp?period=total&units=international_system&left_color=grey&right_color=orange&left_text=pip%20downloads)](https://pypi.org/project/vngrs-nlp/)
[![Docs]()](https://vnlp.readthedocs.io/)
[![License]()](https://github.com/vngrs-ai/vnlp/blob/main/LICENSE)
[![Python check](https://github.com/vngrs-ai/vnlp/actions/workflows/test.yml/badge.svg)](https://github.com/vngrs-ai/vnlp/actions/workflows/test.yml)### Functionality:
- Sentence Splitter
- Normalizer
- Spelling/Typo correction
- Convert numbers to word form
- Deasciification
- Stopword Remover:
- Static
- Dynamic
- Stemmer: Morphological Analyzer & Disambiguator
- Named Entity Recognizer (NER)
- Dependency Parser
- Part of Speech (PoS) Tagger
- Sentiment Analyzer
- Turkish Word Embeddings
- FastText
- Word2Vec
- SentencePiece Unigram Tokenizer
- News Summarization
- News Paraphrasing- Summarization and Paraphrasing models are available in the demo. Contact us at [email protected] for API.
### Demo:
- Try the [Demo](https://demo.vnlp.io).### Installation
```
pip install vngrs-nlp
```### Documentation:
- See the [Documentation](https://vnlp.readthedocs.io) for the details about usage, classes, functions, datasets and evaluation metrics.### Metrics:
### Usage Example:
**Dependency Parser**
```
from vnlp import DependencyParser
dep_parser = DependencyParser()dep_parser.predict("Oğuz'un kırmızı bir Astra'sı vardı.")
[("Oğuz'un", 'PROPN'),
('kırmızı', 'ADJ'),
('bir', 'DET'),
("Astra'sı", 'PROPN'),
('vardı', 'VERB'),
('.', 'PUNCT')]# Spacy's submodule Displacy can be used to visualize DependencyParser result.
import spacy
from vnlp import DependencyParser
dependency_parser = DependencyParser()
result = dependency_parser.predict("Oğuz'un kırmızı bir Astra'sı vardı.", displacy_format = True)
spacy.displacy.render(result, style="dep", manual = True)
```
## Citation
```bibtex
@article{turker2024vnlp,
title={VNLP: Turkish NLP Package},
author={Turker, Meliksah and Ari, Erdi and Han, Aydin},
journal={arXiv preprint arXiv:2403.01309},
year={2024}
}
```