xapian

Document search on partial words

感情迁移 posted on 2019-12-18 15:16:24
Question: I am looking for a document search engine (like Xapian, Whoosh, Lucene, Solr, Sphinx or others) that can search on partial terms. For example, when searching for the term "brit", the engine should return documents containing either "britney" or "britain", or in general any document containing a word that matches the pattern *brit*. Tangentially, I noticed that most engines use TF-IDF (term frequency-inverse document frequency) or its derivatives, which are based on full terms and not partial terms.
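One common way to support substring matches such as *brit* is to index character n-grams of every word and then verify candidates against the original text. The following is a minimal pure-Python sketch of that idea, not the API of Xapian, Whoosh, or any engine named above; the class name `NgramIndex`, its methods, and the choice of trigrams are assumptions made for illustration.

```python
from collections import defaultdict


def ngrams(word, n=3):
    """Return the set of character n-grams of a word (empty if the word is shorter than n)."""
    word = word.lower()
    return {word[i:i + n] for i in range(len(word) - n + 1)}


class NgramIndex:
    """Hypothetical toy index: maps each character n-gram to the documents containing it."""

    def __init__(self, n=3):
        self.n = n
        self.postings = defaultdict(set)  # n-gram -> set of document ids
        self.docs = {}                    # document id -> original text

    def add(self, doc_id, text):
        self.docs[doc_id] = text
        for word in text.lower().split():
            for gram in ngrams(word, self.n):
                self.postings[gram].add(doc_id)

    def search(self, fragment):
        """Return sorted ids of documents that contain a word with `fragment` inside it."""
        fragment = fragment.lower()
        grams = ngrams(fragment, self.n)
        if grams:
            # A matching document must contain every n-gram of the fragment.
            candidates = None
            for gram in grams:
                ids = self.postings.get(gram, set())
                candidates = ids if candidates is None else candidates & ids
        else:
            # Fragment shorter than n: no n-grams to look up, so scan everything.
            candidates = set(self.docs)
        # Verification step: sharing n-grams does not guarantee a contiguous match.
        return sorted(
            doc_id for doc_id in candidates
            if any(fragment in word for word in self.docs[doc_id].lower().split())
        )


index = NgramIndex()
index.add(1, "Britney Spears tour")
index.add(2, "Great Britain votes")
index.add(3, "Brighton pier")
print(index.search("brit"))
```

The sketch splits on whitespace only and ignores punctuation and ranking; a production engine would combine a lookup like this with its normal scoring.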

ElasticSearch, Sphinx, Lucene, Solr, Xapian. Which fits for which usage? [closed]

你离开我真会死。 posted on 2019-12-17 02:01:18
Question: I'm currently looking at other search methods rather than having a huge SQL query. I saw elasticsearch recently and played with whoosh

Py_InitModule4 with Djapian/Xapian

纵然是瞬间 posted on 2019-12-12 21:03:35
Question: I am trying to install Djapian on RedHat5 / Python2.6. I have already installed it successfully on my OSX 10.6 machine. I have built and compiled Xapian and Djapian without issue for Py2.6. I then installed the Python bindings for Xapian, and that step works fine. However, if I open the Python interpreter and type 'import xapian', or try including djapian in my Django app, I get the following error: /usr/lib64/python2.6/site-packages/_xapian.so: undefined symbol: Py_InitModule4 In searching, I have seen
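One common cause of this particular symbol error, offered here as an assumption rather than a confirmed diagnosis of the poster's machine, is an extension module compiled against headers from a different Python build than the interpreter that loads it: on 64-bit builds of Python 2.5 and later, the C headers rename Py_InitModule4 to Py_InitModule4_64. The sketch below only prints the facts worth comparing between the build environment and the runtime; the function name `interpreter_build_info` is hypothetical. It uses the `sysconfig` module, which is not available on Python 2.6, where `distutils.sysconfig.get_python_inc()` reports the same include directory.

```python
import struct
import sys
import sysconfig


def interpreter_build_info():
    """Collect the interpreter details that decide which C-API symbols an extension links against."""
    return {
        "version": "%d.%d" % sys.version_info[:2],
        "pointer_bits": struct.calcsize("P") * 8,         # 32 or 64 on common platforms
        "include_dir": sysconfig.get_paths()["include"],  # headers an extension should be built with
        "executable": sys.executable,
    }


for key, value in sorted(interpreter_build_info().items()):
    print("%s: %s" % (key, value))
```

Running this with the interpreter that fails to import the module, and comparing the output with the Python that the bindings' build step picked up, shows whether the two disagree on version or word size.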

Django-haystack with xapian engine: can't execute update_index if model has ManyToManyField

 ̄綄美尐妖づ posted on 2019-12-12 10:49:23
Question: After upgrading Django to 1.7, I can't execute the management command update_index.
Traceback (most recent call last):
  File "/opt/pycharm-3.4.1/helpers/pydev/pydevd.py", line 1733, in <module>
    debugger.run(setup['file'], None, None)
  File "/opt/pycharm-3.4.1/helpers/pydev/pydevd.py", line 1226, in run
    pydev_imports.execfile(file, globals, locals) # execute the script
  File "/home/tochium/projects/povary/manage.py", line 10, in <module>
    execute_from_command_line(sys.argv)
  File "/home/tochium/projects

solr vs xapian: which one gives you the most meaningful results?

▼魔方 西西 posted on 2019-12-09 17:15:35
Question: I am currently using whoosh to develop a website, and I'll need to choose something more powerful once the website is in production. If any of you have used both of these engines, which one gave you the most meaningful results in the long run?
Answer 1: Solr is the best option. It's well documented and the community is huge. Almost a year ago I benchmarked Xapian vs Solr on a dataset of 8,000+ emails:
Solr: index time 3 s, index size 5.2 MB
Xapian: index time 30 s, index size 154 MB
Another great

Count total search objects count in template using django-haystack

 ̄綄美尐妖づ posted on 2019-12-09 05:39:30
Question: I am using django-haystack with xapian as the backend search engine. I am using FacetedSearchView and FacetedSearchForm for faceting over the search. I have passed a searchqueryset to the FacetedSearchView in my urls.py file, but I cannot access that searchqueryset in the template. All I want to do is count the number of objects found in the searchqueryset. In the shell I can get it with SearchQuerySet().filter(content="foo").count(). How can I do the same in the template?

Django + Haystack how to do this search

China☆狼群 posted on 2019-12-08 07:42:28
Question: I'm new to Haystack and to the search world, so I didn't know how to ask this question. What I want to achieve is the following. Having a search query like: one two I would like to get back any content like: This one one two two one something one here Is this possible with Haystack + solr/xapian? Is it also possible to rank the results by relevance? A result where both words are hit should be more relevant to me. I'm currently using SearchQuerySet in my view but can't achieve that

Django + Haystack how to do this search

丶灬走出姿态 posted on 2019-12-07 09:14:30
Question: I'm new to Haystack and to the search world, so I didn't know how to ask this question. What I want to achieve is the following. Having a search query like: one two I would like to get back any content like: This one one two two one something one here Is this possible with Haystack + solr/xapian? Is it also possible to rank the results by relevance? A result where both words are hit should be more relevant to me. I'm currently using SearchQuerySet in my view but can't achieve that. Cheers
Answer: So you're basically looking for an OR type query, right? By default haystack uses an AND
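To make the OR behaviour and the requested ranking concrete, here is a minimal plain-Python sketch. It is not Haystack code: in Haystack itself an OR query is normally built with SQ objects or SearchQuerySet.filter_or, and scoring is left to the backend. The function name `or_search` and the scoring rule (number of distinct query terms matched) are assumptions made for illustration.

```python
def or_search(query, documents):
    """Return documents matching ANY query term, those matching more terms first."""
    terms = set(query.lower().split())
    scored = []
    for doc in documents:
        words = set(doc.lower().split())
        hits = len(terms & words)  # number of distinct query terms found in the document
        if hits:
            scored.append((hits, doc))
    # Sort by hit count, highest first; Python's sort is stable, so ties keep input order.
    scored.sort(key=lambda pair: -pair[0])
    return [doc for _, doc in scored]


docs = ["This one", "one two", "nothing here", "something two one here"]
print(or_search("one two", docs))
```

With AND semantics only the documents containing both words would be returned; the OR variant also keeps single-word matches and simply ranks them lower.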

solr vs xapian: which one gives you the most meaningful results?

房东的猫 posted on 2019-12-04 04:23:57
Question: I am currently using whoosh to develop a website, and I'll need to choose something more powerful once the website is in production. If any of you have used both of these engines, which one gave you the most meaningful results in the long run?
Answer (Rui Carneiro): Solr is the best option. It's well documented and the community is huge. Almost a year ago I benchmarked Xapian vs Solr on a dataset of 8,000+ emails:
Solr: index time 3 s, index size 5.2 MB
Xapian: index time 30 s, index size 154 MB
Another great read about benchmarks between Xapian and Solr is this document: Cross-instance Search System -

Count total search objects count in template using django-haystack

廉价感情. posted on 2019-12-03 06:54:18
Question: I am using django-haystack with xapian as the backend search engine. I am using FacetedSearchView and FacetedSearchForm for faceting over the search. I have passed a searchqueryset to the FacetedSearchView in my urls.py file, but I cannot access that searchqueryset in the template. All I want to do is count the number of objects found in the searchqueryset. In the shell I can get it with SearchQuerySet().filter(content="foo").count(). How can I do the same in the template? Please guide. I want the total number of objects matching the search.
Answer (Francis Yaconiello): Haystack uses the