https://github.com/topce/yutf-8
What YUSCII is to ASCII, YUTF-8 is to UTF-8
- Host: GitHub
- URL: https://github.com/topce/yutf-8
- Owner: topce
- License: mit
- Created: 2025-01-05T09:37:38.000Z (5 months ago)
- Default Branch: main
- Last Pushed: 2025-01-05T10:46:17.000Z (5 months ago)
- Last Synced: 2025-01-05T11:23:02.064Z (5 months ago)
- Homepage:
- Size: 3.91 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# YUTF-8
What YUSCII is to ASCII, YUTF-8 is to UTF-8 ;-)

YUTF-8 swaps the YUSCII characters in UTF-8.
https://en.wikipedia.org/wiki/YUSCII

Just as UTF-8 is compatible with ASCII, YUTF-8 is compatible with YUSCII.

YUTF-8 is my character encoding standard. It is similar to UTF-8, except that it swaps the YUSCII characters.

It could be used in 3 variations (use a BOM to select one?) to encode
Serbian Latin (Slovenian and Croatian as well),
Serbian Cyrillic,
Macedonian Cyrillic.

Something similar could be done with KOI-7 for Russian and Ukrainian, but that is out of scope for this project.
https://en.wikipedia.org/wiki/KOI-7

# Why to use YUTF-8
To have less network traffic for Serbian texts without compression, since the YUSCII letters would take one byte each instead of two.
It is also interesting for LLMs: maybe it will use fewer tokens when prompting in, for example, the Serbian language.
It looks like all mainstream LLMs are large English language models (LELMs) ;-)

# TODO
Implement a Rust library (WASM) for UTF-8 <-> YUTF-8 conversion, similar to https://github.com/topce/cirilica,
to make it easy to use and experiment with the new YUTF-8 encoding standard.
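
As a starting point, here is a minimal sketch of what the core of such a library might look like. It is not the project's implementation. It assumes that "swap" means exchanging each YUSCII letter with the ASCII symbol whose 7-bit position it occupies in the Latin variant (JUS I.B1.002, as listed on the YUSCII Wikipedia page), and then encoding the result as ordinary UTF-8. The names `PAIRS`, `swap_char`, `utf8_to_yutf8` and `yutf8_to_utf8` are hypothetical, and only the Latin variant is covered.

```rust
/// Assumed swap table for the Latin variant (JUS I.B1.002):
/// each ASCII symbol trades places with one Serbian Latin letter.
const PAIRS: [(char, char); 10] = [
    ('@', 'Ž'), ('[', 'Š'), ('\\', 'Đ'), (']', 'Ć'), ('^', 'Č'),
    ('`', 'ž'), ('{', 'š'), ('|', 'đ'), ('}', 'ć'), ('~', 'č'),
];

/// Swap one character with its partner; all other characters pass through.
/// Applying the function twice returns the original character.
fn swap_char(c: char) -> char {
    for &(ascii, letter) in PAIRS.iter() {
        if c == ascii {
            return letter;
        }
        if c == letter {
            return ascii;
        }
    }
    c
}

/// Encode text as YUTF-8 bytes (hypothetical API): swap, then UTF-8 encode.
pub fn utf8_to_yutf8(text: &str) -> Vec<u8> {
    text.chars().map(swap_char).collect::<String>().into_bytes()
}

/// Decode YUTF-8 bytes back to text (hypothetical API): UTF-8 decode, then swap.
pub fn yutf8_to_utf8(bytes: &[u8]) -> Result<String, std::str::Utf8Error> {
    let swapped = std::str::from_utf8(bytes)?;
    Ok(swapped.chars().map(swap_char).collect())
}
```

Under these assumptions, `utf8_to_yutf8("Čaša")` yields the 4 bytes `^a{a`, where plain UTF-8 needs 6 bytes. The price is that the ten swapped ASCII symbols now take two bytes each, so text that is heavy on `{`, `}`, `[` or `]` (source code, JSON) would grow instead of shrink.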