I have some text data from Twitter, and some of the documents contain codes such as â_x0081_ and âžx009d.
I assume these represent symbols/emojis or other special characters.