Are 6-octet UTF-8 sequences valid?

不思量自难忘° 2021-01-05 00:46

Can UTF-8 encode 5- or 6-byte sequences, allowing all Unicode characters to be encoded? I'm getting conflicting standards. I need to be able to support every Unico

3 Answers
  •  孤城傲影
    2021-01-05 00:59

    There are no Unicode characters beyond U+10FFFF; the BMP covers U+0000 through U+FFFF.

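    UTF-8 is well-defined for U+0000 through U+10FFFF, and every code point in that range fits in at most 4 bytes. The 5- and 6-byte forms come from the older UTF-8 definition (RFC 2279); the current one (RFC 3629) does not allow them.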
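
A quick way to check this is with Python's built-in UTF-8 codec. The sketch below is only an illustration: the helper names `utf8_len` and `is_valid_utf8` are made up for this example, and the two byte strings are hand-built old-style 5- and 6-byte forms for values above U+10FFFF. Other decoders may report the error differently, but a conforming one should reject both.

```python
# Sketch: probe the limits of UTF-8 with Python's built-in codec.
# utf8_len and is_valid_utf8 are hypothetical helpers for illustration.

def utf8_len(code_point: int) -> int:
    """Return the number of bytes UTF-8 uses for one code point."""
    return len(chr(code_point).encode("utf-8"))


def is_valid_utf8(data: bytes) -> bool:
    """Return True if the strict UTF-8 decoder accepts the bytes."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# Largest code point reachable with 1, 2, 3 and 4 bytes.
for cp in (0x7F, 0x7FF, 0xFFFF, 0x10FFFF):
    print(f"U+{cp:04X} -> {utf8_len(cp)} byte(s)")

# Old-style 5-byte form (lead byte 0xF8) for the value 0x200000.
five_byte = bytes([0xF8, 0x88, 0x80, 0x80, 0x80])
# Old-style 6-byte form (lead byte 0xFC) for the value 0x4000000.
six_byte = bytes([0xFC, 0x84, 0x80, 0x80, 0x80, 0x80])

print(is_valid_utf8(five_byte))  # rejected by the strict decoder
print(is_valid_utf8(six_byte))   # rejected by the strict decoder
```

So supporting every Unicode character means handling sequences of 1 to 4 bytes; nothing longer is needed.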
