Analyzer for Russian language in Lucene and Lucene.Net

前端 未结 5 1934
囚心锁ツ
囚心锁ツ 2020-12-25 15:14

Lucene has quite poor support for Russian language.

RussianAnalyzer (part of lucene-contrib) is of very low quality.

RussianStemmer module for Snowball is ev

相关标签:
5条回答
  • 2020-12-25 15:52

    I used http://code.google.com/p/russianmorphology/

    0 讨论(0)
  • 2020-12-25 15:54

    My answer is probably too late, but for the record, I've found analyzers from AOT project much better then those shipped with Lucene.

    0 讨论(0)
  • 2020-12-25 15:54

    That's the beauty of open source. You have the source code, so if the current implementations don't work for you, you can always create your own or even better, extend the existing ones. A good start would be the "Lucene in Action" book.

    0 讨论(0)
  • 2020-12-25 16:00

    If all else fails, use Sphinx

    0 讨论(0)
  • 2020-12-25 16:02

    Project http://code.google.com/p/russianmorphology/ moved to https://github.com/AKuznetsov/russianmorphology. Please take into account the new hosting of this project.

    0 讨论(0)
提交回复
热议问题