Nutch message “No IndexWriters activated” while loading to solr

星月不相逢 2021-01-18 11:26

I ran the Nutch crawler as described in the Nutch tutorial (http://wiki.apache.org/nutch/NutchTutorial), but when I started loading the results into Solr I got this message, i.e. "No I

4 answers
  • 2021-01-18 12:02

    Combining the answers from @Tryskele and @Scott101 worked for me:

    Add the plugin.includes property to both the /conf/nutch-site.xml and runtime/local/conf/nutch-site.xml files:

    <property>
      <name>plugin.includes</name>
      <value>protocol-httpclient|urlfilter-regex|index-(basic|more)|query-(basic|site|url|lang)|indexer-solr|nutch-extensionpoints|protocol-httpclient|urlfilter-regex|parse-(text|html|msexcel|msword|mspowerpoint|pdf)|summary-basic|scoring-opic|urlnormalizer-(pass|regex|basic)|protocol-http|urlfilter-regex|parse-(html|tika|metatags)|index-(basic|anchor|more|metadata)</value>
    </property>
    
  • 2021-01-18 12:15

    Make sure that the indexer-solr plugin is included. Open conf/nutch-site.xml and add the plugin to the plugin.includes property, for instance:

    protocol-http|urlfilter-regex|parse-(html|tika)|index-(basic|anchor)|indexer-solr|scoring-opic|urlnormalizer-(pass|regex|basic)

    After adding the plugin, the "No IndexWriters activated - check your configuration" warning disappeared in my case.

    Check this thread: http://lucene.472066.n3.nabble.com/a-plugin-extending-IndexWriter-td4074353.html
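
    To confirm that a given plugin.includes value really selects indexer-solr, a small check like the one below may help. It is only a sketch: it assumes that Nutch loads a plugin when the plugin's whole id matches the plugin.includes regular expression, and it uses Python's `re` module as a stand-in for Java's regex engine, which should behave the same for simple alternations like these. The helper names `plugin_includes` and `is_plugin_enabled` are hypothetical, and the sample XML is the value from this answer.

```python
import re
import xml.etree.ElementTree as ET


def plugin_includes(site_xml_text):
    """Return the plugin.includes value from nutch-site.xml text, or None."""
    root = ET.fromstring(site_xml_text)
    for prop in root.iter("property"):
        if prop.findtext("name", "").strip() == "plugin.includes":
            return prop.findtext("value", "").strip()
    return None


def is_plugin_enabled(includes_regex, plugin_id):
    """Assumption: a plugin is active only if its entire id matches the regex."""
    return re.fullmatch(includes_regex, plugin_id) is not None


# Sample configuration using the value suggested in this answer.
SAMPLE = """<configuration>
  <property>
    <name>plugin.includes</name>
    <value>protocol-http|urlfilter-regex|parse-(html|tika)|index-(basic|anchor)|indexer-solr|scoring-opic|urlnormalizer-(pass|regex|basic)</value>
  </property>
</configuration>"""

value = plugin_includes(SAMPLE)
print("indexer-solr enabled:", is_plugin_enabled(value, "indexer-solr"))
```

    Because the whole id has to match, a missing `|` between two alternatives would silently disable the plugins on both sides of the gap, so it can be worth running each expected plugin id through the same check.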

  • 2021-01-18 12:15

    I don't know if this is still an issue, but I was having this problem and then realized that my src/plugin/build.xml was missing the indexer-solr plugin. Adding the following line and then recompiling Nutch fixed it for me:

    <ant dir="indexer-solr" target="deploy"/>

  • 2021-01-18 12:18

    Add the following plugin.includes property to conf/nutch-site.xml:

    <property>
    <name>plugin.includes</name>
    <value>protocol-httpclient|urlfilter-regex|index-(basic|more)|query-(basic|site|url|lang)|indexer-solr|nutch-extensionpoints|protocol-httpclient|urlfilter-regex|parse-(text|html|msexcel|msword|mspowerpoint|pdf)|summary-basic|scoring-opic|urlnormalizer-(pass|regex|basic)|protocol-http|urlfilter-regex|parse-(html|tika|metatags)|index-(basic|anchor|more|metadata)</value>
    </property>
    

    Let me know if it solves your problem.
