Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/euskadi31/go-tokenizer
A Text Tokenizer library for Golang
- Host: GitHub
- URL: https://github.com/euskadi31/go-tokenizer
- Owner: euskadi31
- License: mit
- Created: 2018-03-18T14:22:53.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2024-10-07T13:28:46.000Z (about 1 month ago)
- Last Synced: 2024-10-13T12:23:24.247Z (about 1 month ago)
- Topics: go, golang, golang-library, machine-learning, nlp, text, tokenizer
- Language: Go
- Size: 31.3 KB
- Stars: 10
- Watchers: 3
- Forks: 2
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE.md
README
# Text Tokenizer for Golang ![Last release](https://img.shields.io/github/release/euskadi31/go-tokenizer.svg)
[![Go Report Card](https://goreportcard.com/badge/github.com/euskadi31/go-tokenizer)](https://goreportcard.com/report/github.com/euskadi31/go-tokenizer)
| Branch | Status | Coverage |
| ------ | --------------------------------------------------------------------------------------------------------------------------------------------------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------- |
| master | [![Go](https://github.com/euskadi31/go-tokenizer/actions/workflows/go.yml/badge.svg)](https://github.com/euskadi31/go-tokenizer/actions/workflows/go.yml) | [![Coveralls](https://img.shields.io/coveralls/euskadi31/go-tokenizer/master.svg)](https://coveralls.io/github/euskadi31/go-tokenizer?branch=master) |

```go
package main

import (
	"fmt"

	"github.com/euskadi31/go-tokenizer"
)

func main() {
	// Create a tokenizer with the library's default settings.
	t := tokenizer.New()

	// Split the sentence into word tokens; punctuation is dropped.
	tokens := t.Tokenize("I believe life is an intelligent thing: that things aren't random.")

	fmt.Print(tokens) // []string{"I", "believe", "life", "is", "an", "intelligent", "thing", "that", "things", "aren't", "random"}
}
```
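
To make the result above easier to reason about, here is a minimal standard-library sketch of one splitting rule that would produce the same tokens for that sentence. It is an assumption for illustration only, not go-tokenizer's actual implementation: `simpleTokenize` is a hypothetical helper, and the real library may handle separators, digits, or Unicode differently.

```go
package main

import (
	"fmt"
	"strings"
	"unicode"
)

// simpleTokenize is a hypothetical stand-in, not go-tokenizer's implementation.
// It splits on every rune that is not a letter, a digit, or an apostrophe,
// so contractions such as "aren't" stay in one piece.
func simpleTokenize(text string) []string {
	return strings.FieldsFunc(text, func(r rune) bool {
		return !unicode.IsLetter(r) && !unicode.IsDigit(r) && r != '\''
	})
}

func main() {
	tokens := simpleTokenize("I believe life is an intelligent thing: that things aren't random.")
	fmt.Printf("%q\n", tokens)
}
```

Under this rule the colon and the trailing period act as separators, so the sentence yields the same eleven tokens listed in the README example, with `aren't` preserved as a single token.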
## License
go-tokenizer is licensed under [the MIT license](LICENSE.md).