Lucene Porter Stemmer not public

Anonymous (unverified), posted 2019-12-03 01:06:02

Question:

How do I use the PorterStemmer class in Lucene 3.6.2? Here is what I have:

    import org.apache.lucene.analysis.PorterStemmer;
    ...
    PorterStemmer stemmer = new PorterStemmer();
    term = stemmer.stem(term);

I am being told: PorterStemmer is not public in org.apache.lucene.analysis; cannot be accessed from outside package.

Edit: I have also read extensively about using Snowball, but it isn't encouraged. What is the right way to stem using Lucene in Java?

Answer 1:

1) If you want to use PorterStemmer as part of Lucene's token analysis process, use PorterStemFilter.

Sample code:

    import java.io.Reader;
    import org.apache.lucene.analysis.*;

    class MyAnalyzer extends Analyzer {
        // Lowercase and tokenize the input, then stem each token.
        public final TokenStream tokenStream(String fieldName, Reader reader) {
            return new PorterStemFilter(new LowerCaseTokenizer(reader));
        }
    }

2) If you want to use PorterStemmer on its own in some other application, here is the source code by the author himself: PorterStemmer in Java



Answer 2:

In later versions of Lucene, PorterStemmer is no longer public, so use PorterStemFilter instead:

    class MyAnalyzer extends Analyzer {
        public final TokenStream tokenStream(String fieldName, Reader reader) {
            return new PorterStemFilter(new LowerCaseTokenizer(reader));
        }
    }

Or you can use the Snowball stemmer (note that SnowballAnalyzer is deprecated):

    import org.tartarus.snowball.ext.PorterStemmer;
    ...
    public static String applyPorterStemmer(String input) throws IOException {
        PorterStemmer stemmer = new PorterStemmer();
        stemmer.setCurrent(input);
        stemmer.stem();
        return stemmer.getCurrent();
    }
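
As background to the "not public" error in the question: a top-level Java class declared without the public modifier is package-private, so code in any other package cannot name it. That is why the compiler rejects a direct reference to a non-public class. Below is a minimal sketch of that rule using only the JDK. HiddenStemmer and VisibilityDemo are hypothetical names invented for illustration and are not part of Lucene, and the stem method is a placeholder that does no real stemming.

```java
import java.lang.reflect.Modifier;

// Hypothetical stand-in for a class declared without `public`.
// Such a class is package-private: only code in the same package can refer to it.
final class HiddenStemmer {
    // Placeholder only; returns the term unchanged.
    String stem(String term) {
        return term;
    }
}

// Hypothetical demo class (left non-public so the file name does not matter).
class VisibilityDemo {
    // Reports whether a class carries the `public` modifier.
    static boolean isPublicClass(Class<?> c) {
        return Modifier.isPublic(c.getModifiers());
    }

    public static void main(String[] args) {
        // HiddenStemmer has no `public` modifier.
        System.out.println(isPublicClass(HiddenStemmer.class)); // prints false
        // java.lang.String is a public class.
        System.out.println(isPublicClass(String.class));        // prints true
    }
}
```

Within the same package, HiddenStemmer can be used freely, which is how a public class such as a filter can wrap a package-private helper and expose its behavior to outside callers.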

