An open API service indexing awesome lists of open source software.

https://github.com/hiejulia/spark-nlp

Spark NLP
https://github.com/hiejulia/spark-nlp

Last synced: 6 months ago
JSON representation

Spark NLP

Awesome Lists containing this project

README

          

# spark-nlp
Spark NLP

## Dataset
- https://archive.ics.uci.edu/ml/datasets/Twenty+Newsgroups

## Install
- Anaconda
- Spark
-

## NLP related terms

- anotators
DocumentAssembler
A Transformer that creates a column that contains documents.
Sentence Segmenter
An annotator that produces the sentences of the document.
Tokenizer
An annotator that produces the tokens of the sentences.
SpellChecker
An annotator that produces the spelling-corrected tokens.
Stemmer
An annotator that produces the stems of the tokens.
Lemmatizer
An annotator that produces the lemmas of the tokens.
POS Tagger
An annotator that produces the parts of speech of the associated tokens.