Converting special characters such as ü and à back to their original, Latin alphabet counterparts in C#

感情败类 2020-12-30 02:02

I have been given an export from a MySQL database that seems to have had its encoding muddled somewhat over time and contains a mix of HTML char codes such as

5 answers
  •  时光说笑
    2020-12-30 02:29

    It's probably a UTF-8 encoded string that was read as windows-1252.

    As Guffa mentioned, the data has been corrupted.

    Let's take a look at the bytes:
    ö -> C3 B6 in UTF-8

    in windows-1252, C3 -> Ã and B6 -> ¶

    so ö -> Ã¶
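The byte mapping above can be reproduced in a few lines. This is only a minimal sketch, assuming .NET 5 or later (where code page 1252 has to be registered through `CodePagesEncodingProvider`); the class name `MojibakeDemo` and the method `Garble` are made up for illustration.

```csharp
using System;
using System.Text;

public static class MojibakeDemo
{
    // Encodes the text as UTF-8, then (wrongly) decodes those bytes as windows-1252.
    public static string Garble(string text)
    {
        // Assumption: .NET Core / .NET 5+, where code page 1252 is not built in.
        Encoding.RegisterProvider(CodePagesEncodingProvider.Instance);
        byte[] utf8Bytes = Encoding.UTF8.GetBytes(text);
        return Encoding.GetEncoding(1252).GetString(utf8Bytes);
    }

    public static void Main()
    {
        Console.OutputEncoding = Encoding.UTF8;
        // "\u00F6" is ö.
        Console.WriteLine(BitConverter.ToString(Encoding.UTF8.GetBytes("\u00F6"))); // C3-B6
        Console.WriteLine(Garble("\u00F6"));                                        // Ã¶
    }
}
```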

    What about all these "ƒÂ"?

    ƒ -> 83, Â -> C2 (in windows-1252)

    Honestly, I don't know why they appear, but you can try erasing them and doing some conversions, as Guffa mentioned. Good luck.
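One possible explanation for the "ƒÂ", offered as a guess rather than something the export confirms: if text that is already garbled gets garbled a second time, Ã (C3 83 in UTF-8) and ¶ (C2 B6 in UTF-8) turn into "Ãƒ" and "Â¶", which puts "ƒÂ" in the middle. In that case reversing the conversion twice may work better than erasing characters. The sketch below assumes .NET 5 or later; `MojibakeRepair` and `FixOnce` are hypothetical names, and the strict fallbacks are there so that text which is not garbled comes back unchanged.

```csharp
using System;
using System.Text;

public static class MojibakeRepair
{
    // Undoes one round of "UTF-8 bytes read as windows-1252".
    // Returns the input unchanged if it does not look like that kind of damage.
    public static string FixOnce(string text)
    {
        // Assumption: .NET Core / .NET 5+, where code page 1252 must be registered.
        Encoding.RegisterProvider(CodePagesEncodingProvider.Instance);

        // Strict encodings throw instead of silently substituting '?' or U+FFFD.
        Encoding strict1252 = Encoding.GetEncoding(
            1252, EncoderFallback.ExceptionFallback, DecoderFallback.ExceptionFallback);
        Encoding strictUtf8 = new UTF8Encoding(
            encoderShouldEmitUTF8Identifier: false, throwOnInvalidBytes: true);

        try
        {
            byte[] originalBytes = strict1252.GetBytes(text); // recover the raw bytes
            return strictUtf8.GetString(originalBytes);       // decode them as UTF-8
        }
        catch (EncoderFallbackException) { return text; } // contains non-1252 characters
        catch (DecoderFallbackException) { return text; } // bytes are not valid UTF-8
    }

    public static void Main()
    {
        Console.OutputEncoding = Encoding.UTF8;
        Console.WriteLine(FixOnce("\u00C3\u00B6"));                       // "Ã¶" -> ö
        Console.WriteLine(FixOnce(FixOnce("\u00C3\u0192\u00C2\u00B6")));  // "Ãƒ¶" -> ö
        Console.WriteLine(FixOnce("\u00F6"));                             // ö stays ö
    }
}
```

Because the check is applied to the whole string, a field that mixes correct and garbled characters is returned untouched, so it is probably best to run this per field and to inspect a sample of the results by hand before rewriting the data.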
