I am guessing there is no silver bullet here.
In Unicode, a single visible character or glyph is often made up of more than one code point. For example, ❤️ can be repre