An open API service indexing awesome lists of open source software.

https://github.com/wassemgtk/supertokenizer

A high-performance tokenizer built to rival GPT-4, trained on the C4 dataset.
https://github.com/wassemgtk/supertokenizer

tokenizer tokenizer-framework tokenizers

Last synced: about 2 months ago
JSON representation

A high-performance tokenizer built to rival GPT-4, trained on the C4 dataset.

Awesome Lists containing this project