wordbreaker

Is there such a thing as third party SQL Server word breaker for Hungarian language?

可紊 提交于 2019-12-10 11:25:44
问题 I want to use CONTAINS on a fulltext index and use FORMSOF(...) on Hungarian data. Is it possible? I KNOW it is not supported by default in SQL Server. 回答1: SQL Server can load custom word breakers and stemmers, see Word Breakers and Stemmers. If you cannot find a Hungarian stemmer there is always the possibility of creating one your own: Word Breaker and Stemmer Sample, see also Getting the LRSAMPLE custom word-breaker to work on 64-bit SQL Server 2008. You don't have to implement the

Word break in languages without spaces between words (e.g., Asian)?

假如想象 提交于 2019-12-03 03:53:11
问题 I'd like to make MySQL full text search work with Japanese and Chinese text, as well as any other language. The problem is that these languages and probably others do not normally have white space between words. Search is not useful when you must type the same sentence as is in the text. I can not just put a space between every character because English must work too. I would like to solve this problem with PHP or MySQL. Can I configure MySQL to recognize characters which should be their own

Word break in languages without spaces between words (e.g., Asian)?

时光毁灭记忆、已成空白 提交于 2019-12-02 18:11:09
I'd like to make MySQL full text search work with Japanese and Chinese text, as well as any other language. The problem is that these languages and probably others do not normally have white space between words. Search is not useful when you must type the same sentence as is in the text. I can not just put a space between every character because English must work too. I would like to solve this problem with PHP or MySQL. Can I configure MySQL to recognize characters which should be their own indexing units? Is there a PHP module that can recognize these characters so I could just throw spaces