How do we create a simple search engine using Lucene, Solr or Nutch?

孤城傲影 2021-02-15 11:49

Our company has thousands of PDF documents. How do we create a simple search engine using Lucene, Solr or Nutch? We'll provide a basic Java/JSP web page where people can type

10 answers
  •  旧巷少年郎
    2021-02-15 12:28

    Nutch plus Lucene, with the PDF plugin enabled in Nutch, is your solution. Once the PDF plugin is enabled, Nutch can parse PDFs.

    Lucene will let you index the crawled and parsed data, and Nutch has a servlet that gives you a search interface.

    We use the same setup on our internal LANs.
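
To make the "index the parsed data, then search it" step more concrete, here is a toy inverted index in plain Java. It is only a sketch of the idea behind what Lucene builds, not Lucene's or Nutch's API. It assumes the text has already been extracted from each PDF (the job the answer gives to Nutch's PDF plugin). The class name `TinyIndex` and the file names are made up for illustration.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical toy class: maps each term to the documents that contain it.
public class TinyIndex {
    // term -> sorted set of document names containing that term
    private final Map<String, Set<String>> postings = new HashMap<>();

    // Index one document whose text has already been extracted.
    public void add(String docName, String text) {
        for (String term : tokenize(text)) {
            postings.computeIfAbsent(term, k -> new TreeSet<>()).add(docName);
        }
    }

    // Return the documents that contain every term of the query (AND search).
    public Set<String> search(String query) {
        Set<String> result = null;
        for (String term : tokenize(query)) {
            Set<String> docs = postings.getOrDefault(term, Collections.emptySet());
            if (result == null) {
                result = new TreeSet<>(docs);
            } else {
                result.retainAll(docs);
            }
        }
        return result == null ? new TreeSet<>() : result;
    }

    // Lowercase the text and split it on anything that is not a letter or digit.
    private static List<String> tokenize(String text) {
        List<String> terms = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{Nd}]+")) {
            if (!t.isEmpty()) {
                terms.add(t);
            }
        }
        return terms;
    }

    public static void main(String[] args) {
        TinyIndex index = new TinyIndex();
        index.add("invoice.pdf", "Invoice for consulting services");
        index.add("manual.pdf", "User manual for the billing services module");
        System.out.println(index.search("services"));         // [invoice.pdf, manual.pdf]
        System.out.println(index.search("billing services")); // [manual.pdf]
    }
}
```

A real deployment would leave tokenizing, ranking and on-disk storage to Lucene. The JSP page from the question would then only need to pass the typed query to the search side and render the returned document names.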
