How do I find out how many bytes a character has?

孤城傲影 2021-02-13 15:46

How can I find out how many bytes a character takes up?

2 answers
  • 2021-02-13 16:06
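    • ASCII defines 7-bit codes, normally stored one per 8-bit byte.
    • Single-byte encodings for most Western languages (for example ISO-8859-1) use 8 bits (1 byte) per character.
    • Legacy encodings for East Asian languages (for example Shift_JIS for Japanese or GBK for Chinese) use 1 or 2 bytes per character.
    • Unicode itself is a character set, not a byte size; the size depends on the encoding: UTF-8 uses 1 to 4 bytes, UTF-16 uses 2 or 4 bytes, and UTF-32 always uses 4 bytes.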

    How a character is stored and represented depends on the encoding in use, which in turn depends on the programming language and the platform you are using.
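
    To make the dependence on encoding concrete, here is a minimal PHP sketch that re-encodes the same character and counts the resulting bytes. It assumes the mbstring extension is loaded; the helper name bytes_in_encoding is hypothetical and only serves this illustration.

```php
<?php
// Hypothetical helper (not part of PHP): byte length of one character,
// given as UTF-8, after converting it to $encoding.
function bytes_in_encoding(string $char, string $encoding): int
{
    // mb_convert_encoding($string, $to, $from) re-encodes the string;
    // strlen() then counts raw bytes, not characters.
    return strlen(mb_convert_encoding($char, $encoding, 'UTF-8'));
}

// "\u{...}" escapes (PHP 7+) always produce UTF-8, whatever the file encoding.
$samples = ["A", "\u{E9}", "\u{4E2D}"]; // A, e with acute accent, a CJK ideograph

foreach ($samples as $char) {
    foreach (['UTF-8', 'UTF-16LE', 'UTF-32LE'] as $encoding) {
        printf("%s in %s: %d byte(s)\n", $char, $encoding, bytes_in_encoding($char, $encoding));
    }
}
```

    The explicit LE variants are used so that no byte order mark can be counted as part of the character.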

  • 2021-02-13 16:11

    To find out how many UTF-8 bytes the first character of a PHP string takes:

    print strlen(mb_substr($string, 0, 1, "utf-8"));
    

    strlen() returns the raw byte length, while mb_substr() returns a "character" according to the given charset/encoding. In this example it extracts one character starting at position 0.
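
    The same idea can be extended to every character of a string. The following is only a sketch under the assumption that the input is valid UTF-8 and that the mbstring extension is available; utf8_byte_counts is a hypothetical name, not a built-in function.

```php
<?php
// Hypothetical helper: pairs each character of a UTF-8 string with its byte count.
function utf8_byte_counts(string $string): array
{
    $counts = [];
    $length = mb_strlen($string, 'UTF-8'); // number of characters, not bytes
    for ($i = 0; $i < $length; $i++) {
        $char = mb_substr($string, $i, 1, 'UTF-8'); // one character at position $i
        $counts[] = [$char, strlen($char)];         // strlen() gives its raw byte length
    }
    return $counts;
}

// Sample input built from escapes: "a", U+00E9, U+4E2D, U+1F600.
foreach (utf8_byte_counts("a\u{E9}\u{4E2D}\u{1F600}") as [$char, $bytes]) {
    echo "$char: $bytes\n";
}
```

    Calling mb_substr() once per position is simple but not fast on long strings; for large inputs a single pass (for example with mb_str_split() on PHP 7.4 or later) is likely the better choice.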
