I have been looking online for a UTF-8 character table, and all I could find for subscripts were the numbers 1 to 9 and some of the Latin letters.
I need to find S and B a
I know it's an old post, but the chart below lists the characters that have a Unicode subscript or superscript equivalent. For the other letters there isn't one. Hope it helps.
http://www.unicode.org/charts/PDF/U2070.pdf
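To make the chart concrete, here is a minimal Python sketch that maps plain characters to the subscript forms in that block. The names `SUBSCRIPTS` and `to_subscript` are made up for illustration and are not taken from any library. Characters with no subscript form in the table are passed through unchanged, which is what happens to capital S and B, since the block only has small subscript letters.

```python
# Subscript digits 0-9 occupy U+2080..U+2089 in the
# "Superscripts and Subscripts" block.
SUBSCRIPTS = {str(d): chr(0x2080 + d) for d in range(10)}

# Small Latin subscript letters from the same block
# (U+2090..U+209C, leaving out the schwa at U+2094).
SUBSCRIPTS.update({
    "a": "\u2090", "e": "\u2091", "o": "\u2092", "x": "\u2093",
    "h": "\u2095", "k": "\u2096", "l": "\u2097", "m": "\u2098",
    "n": "\u2099", "p": "\u209a", "s": "\u209b", "t": "\u209c",
})


def to_subscript(text: str) -> str:
    """Replace each character that has a subscript form; keep the rest as-is."""
    return "".join(SUBSCRIPTS.get(ch, ch) for ch in text)


print(to_subscript("H2O"))   # the 2 becomes U+2082
print(to_subscript("SB"))    # unchanged: no capital subscripts in the table
```

When a letter has no subscript code point, the usual fallback is markup such as an HTML `<sub>` tag rather than a plain-text character.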
I created a small library to convert subscript/superscript text to its equivalent Unicode characters.
https://github.com/DilipRajkumar/Select-With-Sub-Super-tags/tree/master
Hope it helps.