Tutorial by Examples

A letter with a diacritic may be represented with the letter, and a combining modifier letter. You normally think of é as one character, but it's really 2 code points: U+0065 — LATIN SMALL LETTER E U+0301 — COMBINING ACUTE ACCENT Similarly ç = c + ¸, and å = a + ˚ combined forms To compl...
There is this thing called Zalgo Text which pushes this to the extreme. Here is the first grapheme cluster of the example. It consists of 15 code points: the Latin letter H and 14 combining marks.   H̡̫̤̤̣͉̤ͭ̓̓̇͗̎̀   Although this doesn't show up in normal text, it shows that a “character” r...
A lot of emoji consist of more than one code point. 🇯🇵: A flag is defined as a pair of "regional symbol indicator letters" (🇯 + 🇵) 🙋🏿: Some emoji may be followed by a skin tone modifier: 🙋 + 🏿 😀︎ or 😀️: Windows 10 allows you to specify if an emoji is colored or black/white b...

Page 1 of 1