Which collation to use so that `ş` and `s` are treated as unique values?

问题

The issue is that ş and s are interpreted by MySQL as identical values.

I'm new to MySQL, so I have no idea which collations would view them as unique.

The collations that I've tried using which don't work are:

utf8_general_ci
utf8_unicode_520_ci
utf8mb4_unicode_ci
utf8mb4_unicode_520_ci

Does anybody know which collation to use?

P.S. I also really need the collation to interpret emojis and other non-Latin characters, and, to my knowledge of MySQL and collations, the only collation able to do this is unicode?

回答1:

utf8_turkish_ci and utf8_romanian_ci -- as shown in http://mysql.rjweb.org/utf8_collations.html

(Plus, of course, utf8_bin.)

For your added question: You are looking for a "character set" (not a "collation") that can represent Emoji and other non-Latin characters -- UTF-8 is the one to use. In MySQL, it is utf8mb4. The "collations" that are associated with that are named utf8mb4_.... Collations control ordering and equality, as indicated in the first part of your question about s and ş.

MySQL's CHARACTER SET utf8 is a subset of utf8mb4. Either can handle all the "letters" in the world. But only utf8mb4 can handle Emoji and some Chinese characters.

来源：https://stackoverflow.com/questions/53217875/which-collation-to-use-so-that-%c5%9f-and-s-are-treated-as-unique-values

标签

mysql

collation

mysql-5.7

易学教程内所有资源均来自网络或用户发布的内容，如有违反法律规定的内容欢迎反馈！
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!