Know your Unicode ✀
Describes Unicode characters with their name and shows compositions. UNICODE 15.1*
Uses a similar color coding like its lower-level companion tool unibits.
Make sure you have Ruby installed and installing gems works properly. Then do:
$ gem install uniscribe
Pass the string to debug to uniscribe:
$ uniscribe "test strı̈ng"
require "uniscribe/kernel_method"
uniscribe "test strı̈ng"
0074 ├─ t ├─ LATIN SMALL LETTER T
0065 ├─ e ├─ LATIN SMALL LETTER E
0073 ├─ s ├─ LATIN SMALL LETTER S
0074 ├─ t ├─ LATIN SMALL LETTER T
0020 ├─ ] [ ├─ SPACE
0073 ├─ s ├─ LATIN SMALL LETTER S
0074 ├─ t ├─ LATIN SMALL LETTER T
0072 ├─ r ├─ LATIN SMALL LETTER R
---- ├┬ ı̈ ├┬ Composition
0131 │├─ ı │├─ LATIN SMALL LETTER DOTLESS I
0308 │└─ ◌̈ │└─ COMBINING DIAERESIS
006E ├─ n ├─ LATIN SMALL LETTER N
0067 ├─ g ├─ LATIN SMALL LETTER G
>> uniscribe "நகரத்தில்"
>> uniscribe "ม้าลายหกตัว"
>> uniscribe "辻󠄀㚑󠄁"
(the variation is not visible in the screenshot, because my system does not render it correctly)
>> uniscribe "3️⃣🤸♀"
>> uniscribe "̶̧̨̱̹̭̯ͧ̾ͬC̷̙̲̝͖ͭ̏ͥͮ͟Oͮ͏̮̪̝͍"
>> uniscribe "\0A\u{E01D7}\x7F\r\n\u{D0000}\u{81}\u{FFF9}B\u{FFFB}🏴\u{E0061}\u{E007F}\u{10FFFF}"
>> uniscribe "ᅠ "
Although the gem is generally up to date with Unicode 15.0, the proper detection of compositions / graphemes / combined characters depends on your Ruby version:
You can run uniscribe -v
to check for the Unicode level of your uniscribe version.
Also see
Copyright (C) 2017-2023 Jan Lelis https://janlelis.com. Released under the MIT license.