How can I determine which encoding the file uses before I read the file?

余生分开走 2021-01-21 09:07

I'm facing a problem.

A file can be written in some encoding such as UTF-8, UTF-16, UTF-32, etc.

When I read a UTF-

3 Answers
  •  广开言路
    2021-01-21 09:38

    There is no good way to do that. The question you're asking is like determining the radix of a number by looking at it. For example, what is the radix of 101?

    The best solution is to read the data into a byte array. Then you can use String(byte[] bytes, Charset charset) to test it with multiple encodings, from most likely to least likely.
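
The try-each-encoding idea above can be sketched in Java roughly as follows. One caveat: the String(byte[], Charset) constructor never fails, because it silently replaces malformed bytes with U+FFFD, so this sketch uses a CharsetDecoder configured to report errors instead. The class name EncodingGuess, the method guess, and the candidate list are hypothetical and chosen only for illustration. A clean decode shows that the bytes are valid in that charset, not that it is the charset the writer actually used.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.Charset;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;
import java.util.List;
import java.util.Optional;

final class EncodingGuess {
    // Assumption: candidates are ordered from most likely to least likely.
    // Adjust the list to match the files you expect to see.
    static final List<Charset> CANDIDATES = List.of(
            StandardCharsets.UTF_8,
            StandardCharsets.UTF_16);

    // Returns the first candidate charset that decodes the bytes without error,
    // or an empty Optional if none of them does.
    static Optional<Charset> guess(byte[] bytes) {
        for (Charset candidate : CANDIDATES) {
            // REPORT makes the decoder throw instead of substituting U+FFFD.
            CharsetDecoder decoder = candidate.newDecoder()
                    .onMalformedInput(CodingErrorAction.REPORT)
                    .onUnmappableCharacter(CodingErrorAction.REPORT);
            try {
                decoder.decode(ByteBuffer.wrap(bytes));
                return Optional.of(candidate);
            } catch (CharacterCodingException e) {
                // Not valid in this charset; fall through to the next candidate.
            }
        }
        return Optional.empty();
    }
}
```

For example, EncodingGuess.guess(Files.readAllBytes(path)) would return the first candidate that decodes cleanly. UTF-16 tends to accept almost any even-length byte sequence, so stricter encodings such as UTF-8 are better placed first in the list.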
