https://github.com/below/utfunzombie
A library and command line tool to convert decomposed unicode into its precomposed form
https://github.com/below/utfunzombie
Last synced: about 1 year ago
JSON representation
A library and command line tool to convert decomposed unicode into its precomposed form
- Host: GitHub
- URL: https://github.com/below/utfunzombie
- Owner: below
- License: mit
- Created: 2019-10-28T11:55:56.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2019-11-12T21:46:07.000Z (over 6 years ago)
- Last Synced: 2025-01-26T02:44:08.136Z (about 1 year ago)
- Language: Swift
- Size: 6.84 KB
- Stars: 0
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# utfunzombie
A tool to convert decomposed characters into precomposed characters
## Background
Historically, character sets had a single value for a character. For example, in ISO-8859-1 (aka "Latin 1"), the value `0xE4` represents the character "ä". To maintain compatiblity with such older standards, Unicode also has some single codepoints, like `U+00E4 LATIN SMALL LETTER A WITH DIAERESIS`. This is called the _precomposed_ form.
However, for greater flexibility, Unicode can also represent these characters as a combination, in this case of `LATIN SMALL LETTER A (U+0061)` and `COMBINING DIAERESIS (U+0308)`. This is called the _decomposed_ form.
## The Problem
Some fonts do not render the decomposed form of some characters, even some professionally designed ones. If they reach a decomposed character, most systems will default to a font that can display them, creating a weird looking character.
## utfunzombie
This utility takes a path to a file, and outputs the precomposed form of that file