Projects in Awesome Lists tagged with utf-16
A curated list of projects in awesome lists tagged with utf-16 .
https://github.com/nemtrif/utfcpp
UTF-8 with C++ in a Portable Way
cpp cpp98 header-only internationalization portable text unicode utf-16 utf-8
Last synced: 13 May 2025
https://github.com/janlelis/unibits
Visualize different Unicode encodings in the terminal
ascii cli-command codepoints debugging-tool hacktoberfest ruby-cli terminal unicode utf-16 utf-32 utf-8
Last synced: 10 Oct 2025
https://github.com/bobsteagall/utf_utils
My work on high-speed conversion of UTF-8 to UTF-32/UTF-16
cplusplus-11 cplusplus-14 cplusplus-17 utf-16 utf-32 utf-8
Last synced: 30 Apr 2025
https://github.com/bodigrim/text-rope
A wrapper around Text for fast line/column navigation and logarithmic concatenation
concatenation haskell lines position rope slicing text unicode utf-16
Last synced: 15 Mar 2025
https://github.com/acceis/unisec
Unicode Security Toolkit
cldr confusables homoglyphs icu security surrogates unicode unicode-tools utf-16 utf-32 utf-8
Last synced: 31 Oct 2025
https://github.com/danielkrupinski/stringpool
A performant and memory efficient storage for immutable strings with C++17. Supports all standard char types: char, wchar_t, char16_t, char32_t and C++20's char8_t.
char char16 char32 char8 cpp cpp17 cpp20 memory-optimization memory-pool single-header string string-optimization string-pool string-view strings utf-16 utf-32 utf-8 wchar
Last synced: 09 Jul 2025
https://github.com/contrebande-labs/charred
CHARacter-awaRE Diffusion: Multilingual Character-Aware Encoders for Font-Aware Diffusers That Can Actually Spell
bert canine character-aware controlnet diffusion diffusion-models fonts stable-diffusion tokenization-free transformers transformers-models typography unicode utf-16 utf-8
Last synced: 27 Jun 2025
https://github.com/katahiromz/mcpp
UTF-16 readable C preprocessor (A fork of mcpp 2.7.2)
bom byte-order-mark c-language c-preprocessor preprocessor utf-16 utf-32 utf-8
Last synced: 05 Feb 2026
https://github.com/ehmicky/string-byte-length
Get the UTF-8 byte length of a string.
binary bytes emoji encoding javascript json length library nodejs parsing serialization size string string-manipulation stringify typescript ucs-2 unicode utf-16 utf-8
Last synced: 28 Jul 2025
https://github.com/getsentry/utf16string
UTF-16 string types
hacktoberfest rust tag-production utf-16 wstring
Last synced: 23 Mar 2025
https://github.com/ehmicky/string-byte-slice
Like `string.slice()` but bytewise.
binary bytes emoji encoding javascript json length library nodejs serialization size slice string string-manipulation truncate typescript ucs-2 unicode utf-16 utf-8
Last synced: 13 Jul 2025
https://github.com/danielaparker/unicode_traits
The C++ unicode_traits class template makes using unicode easier
cpp11 unicode unicode-traits utf-16 utf-32 utf-8
Last synced: 22 Apr 2025
https://github.com/katahiromz/utf
C/C++ UTF-8, UTF-16, UTF-32 conversion library
c-language c-plus-plus conversion converter cxx header-only string utf-16 utf-32 utf-8
Last synced: 20 Jun 2025
https://github.com/fornwall/utfdecode
Unicode decoder, encoder and debugger command-line tool.
command-line emoji encoding normalization unicode utf-16 utf-8
Last synced: 11 Apr 2025
https://github.com/efmsoft/utf8
This library contains a set of classes for working with strings in utf8 format, as well as functions for converting strings in utf8, ANSI, utf16, utf32 formats. The most commonly used format conversion operations are converting from ANSI encoding (on Windows), as well as from a Unicode string
ansi conversion utf-16 utf-32 utf-8 utf8
Last synced: 18 Jan 2026
https://github.com/stdlib-js/string-next-grapheme-cluster-break
Return the next extended grapheme cluster break in a string after a specified position.
astral code grapheme javascript node node-js nodejs point pt stdlib str string surrogate unicode utf-16 utf16 util utilities utility utils
Last synced: 11 Apr 2025
https://github.com/fab2s/bom
A purely static zero dependencies Helper to handle unicode BOMs
big-endian bom byte-order-mark little-endian php utf-16 utf-32 utf-8
Last synced: 10 Oct 2025
https://github.com/maxdz-gmbh/mdz_unicode
Very lightweight and portable ANSI C 89/90 compliant library for handling Unicode strings.
android c freebsd library linux macos portable strings unicode-characters unicode-support unicode-symbols utf-16 utf-32 utf-8 utf16 utf32 utf8 wchar-support windows
Last synced: 13 Feb 2026
https://github.com/stdlib-js/string-prev-grapheme-cluster-break
Return the previous extended grapheme cluster break in a string before a specified position.
astral code grapheme javascript node node-js nodejs point pt stdlib str string surrogate unicode utf-16 utf16 util utilities utility utils
Last synced: 15 Oct 2025
https://github.com/monomonedula/temeco
Telegram Bot API message entities to HTML translator
code-units telegram telegram-bot telegram-bot-api unicode utf-16
Last synced: 21 Jun 2025
https://github.com/visualdoj/dunicode
Unicode UTF-8 and UTF-16 units for Free Pascal
fpc freepascal pascal unicode utf-16 utf-8 utf16 utf8
Last synced: 14 Apr 2026
https://github.com/leoovs/unipp
Iterator-based Unicode conversion
cpp header-only unipp utf-16 utf-8
Last synced: 14 Oct 2025
https://github.com/jurgen-kluft/cstring
string library (WIP = work in progress)
cpp cross-platform string utf-16 utf-32 utf-8 view
Last synced: 27 Jan 2026
https://github.com/darkmoon9322/utfcpp
UTF-8 with C++ in a Portable Way
cpp cpp98 header-only internationalization portable text unicode utf-16 utf-8
Last synced: 06 Sep 2025
https://github.com/kmx-systems/conversion-to-utf8
Converter from UTF-16 and UTF-32 to UTF-8 (header only)
conversion cpp cpp17 cpp17-library library utf utf-16 utf-32 utf-8
Last synced: 12 Apr 2025
https://github.com/federicobruzzone/ende
A rust library for encoding/decoding unicode/utf-8/utf-16(ucs-2) code points. 🦀
codepoints rust rust-lang rust-library ucs2 unicode utf-16 utf-8
Last synced: 26 Jan 2026
https://github.com/b4n/wtf8tools
WTF-8 conversion tools
encoding-convertors unicode utf-16 utf-32 utf-8 wtf-8
Last synced: 02 Aug 2025
https://github.com/pranosa/learnutf8sam
Backend For Unicode Calculator
unicode unicode-character-database unicode-converter unicode-data utf-16 utf-32 utf-8
Last synced: 07 Oct 2025
https://github.com/stdbug/unicpp
Just another C++ Unicode library
c-plus-plus cpp cpp17 decoding encoding unicode utf-16 utf-8
Last synced: 27 Jul 2025
https://github.com/pranosa/learnutf8web
Frontend For Unicode Calculator Application
unicode unicode-characters unicode-converter unicode-emoji unicode-symbols utf-16 utf-32 utf-8-encoding
Last synced: 10 May 2026
https://github.com/peelonet/peelo-unicode
Simple Unicode utilities for C++
cpp-library header-only unicode unicode-support utf-16 utf-32 utf-8
Last synced: 07 Apr 2025
https://github.com/rampaa/unicoderangetoutf16compliantregex
Non-Basic Multilingual Plane Regex Ranges to UTF-16 Compliant Regex
regex regex-pattern regexp supplementary-plane surrogate-pairs unicode utf-16
Last synced: 22 Mar 2025
https://github.com/vinser/u8xml
u8xml NewDecoder func can be used to parse XML files with IANA character encodings instead of Go Standard Library xml package Decoder
charset converter decoder iana iso-8859-1 utf-16 windows-1251 xml
Last synced: 26 May 2026
https://github.com/borewit/read-next-line
Utility for reading lines from a `ReadableStream` in JavaScript
bom esm line promise read stream stream-api streamreadable text utf-16
Last synced: 13 Feb 2026
https://github.com/maxdz-gmbh/mdz_string
Very lightweight and portable ANSI C 89/90 compliant library for handling single-byte (ASCII/ANSI) strings and Unicode strings.
android ansible ascii c freebsd library linux macos portable strings unicode utf-16 utf-32 utf-8 utf16 utf32 utf8 wchar windows
Last synced: 12 Apr 2026