https://github.com/trappmartin/eurotokenizer.jl
Tokenizer for (DE, EN, ES, FR, IT, and PT) based on the Europarl preprocessing tools
https://github.com/trappmartin/eurotokenizer.jl
julia natural-language-processing tokenizer
Last synced: 11 months ago
JSON representation
Tokenizer for (DE, EN, ES, FR, IT, and PT) based on the Europarl preprocessing tools
- Host: GitHub
- URL: https://github.com/trappmartin/eurotokenizer.jl
- Owner: trappmartin
- License: other
- Created: 2018-06-27T10:06:29.000Z (almost 8 years ago)
- Default Branch: master
- Last Pushed: 2018-06-27T10:07:11.000Z (almost 8 years ago)
- Last Synced: 2025-02-26T07:34:19.588Z (over 1 year ago)
- Topics: julia, natural-language-processing, tokenizer
- Language: Julia
- Size: 2.93 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE.md
Awesome Lists containing this project
README
# EuroTokenizer