SOLR and accented characters

后端 未结 3 2034
生来不讨喜
生来不讨喜 2021-01-27 07:12

I have an index for occupations (identifier + occupation):




        
3条回答
  •  离开以前
    2021-01-27 07:34

    Ok, I have discovered the source problem. I have opened my SQL load script with VI, in hex mode.

    This is the hex content for 'Agrónomo' in an INSERT statement: 41 67 72 6f cc 81 6e 6f 6d 6f.

    6f cc 81!!!! This is "o COMBINING ACUTE ACCENT" UTF code!!!!
    

    So that's the problem... It must be "c3 b3"... I get the literals copy/pasting from a web page, so the source characters on the origin was the problem.

    Thanks to both of you, because I have learning more about SOLR's soul.

    Regards.

提交回复
热议问题