Elasticsearch - previous/next functionality

无人及你 2021-01-05 16:37

I created a search engine to search all documents in my Elasticsearch index. When a user selects a document on the search engine results page, he leaves the current page and opens

2 Answers
  • 2021-01-05 17:12

    If your documents use sequential _id values, you can take the current document's _id, add 1, and fetch the document with that ID.

  • 2021-01-05 17:22

    The following works well for me. Make sure your sort definitions are kept in a consistently structured list like this:

    function getSortDefinitions() {
        return [
            'newest' => [
                [ 'created_at' => 'desc' ],
                [ 'id' => 'desc' ],
            ],
            'oldest' => [
                [ 'created_at' => 'asc' ],
                [ 'id' => 'asc' ],
            ],
            'highest' => [
                [ 'price' => 'desc' ],
                [ 'created_at' => 'desc' ],
                [ 'id' => 'desc' ],
            ],
            'lowest' => [
                [ 'price' => 'asc' ],
                [ 'created_at' => 'asc' ],
                [ 'id' => 'asc' ],
            ],
        ];
    }
    

    An aside: adding id as the last sort key gives the result set a predictable order for records with the same timestamp. This often happens with test fixtures, where the records are all saved at the same time.
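
    To see why the trailing id clause matters, here is a minimal sketch that applies one of these definitions in plain PHP. compareBySortDefinition and the sample records are hypothetical and only mimic how the sort clauses are evaluated in order; Elasticsearch does the real sorting.

```php
<?php
// Hypothetical helper (not part of the original code): compares two records
// using a sort definition shaped like the ones in getSortDefinitions().
function compareBySortDefinition(array $a, array $b, array $definition): int
{
    foreach ($definition as $clause) {
        $field = array_key_first($clause);
        $direction = $clause[$field];

        $result = $a[$field] <=> $b[$field];
        if ($result !== 0) {
            return $direction === 'desc' ? -$result : $result;
        }
    }

    return 0; // equal on every sort field
}

// Three fixtures saved in the same second: only the id clause separates them.
$records = [
    ['id' => 1, 'created_at' => '2021-01-05 16:37:00'],
    ['id' => 3, 'created_at' => '2021-01-05 16:37:00'],
    ['id' => 2, 'created_at' => '2021-01-05 16:37:00'],
];

$newest = [['created_at' => 'desc'], ['id' => 'desc']];
usort($records, fn ($a, $b) => compareBySortDefinition($a, $b, $newest));

$orderedIds = array_column($records, 'id'); // [3, 2, 1]
```

    Without the id clause all three records would compare as equal, and their relative order would not be guaranteed from one request to the next.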

    Now whenever someone searches, they have usually selected a few filters, perhaps a query and definitely a sort order. Create a table that stores this so you can generate a search context to work with:

    create table search_contexts (
        id int primary key,
        hash varchar(255) not null,
        query varchar(255) not null,
        filters json not null,
        sort varchar(255) not null,
    
        unique search_contexts_hash_uk (hash)
    );
    

    Use something like the following in your language of choice to insert and get a reference to the search context:

    function saveSearchContext($query, $filters, $sort)
    {
        // Assuming some magic re: JSON encoding of $filters
        $hash = md5(json_encode(compact('query', 'filters', 'sort')));
        return SearchContext::firstOrCreate(compact('hash', 'query', 'filters', 'sort'));
    }
    

    Notice that we only insert a search context if there isn't one already with the same parameters, so we end up with one row per unique search. If you don't mind the extra volume, you can instead save one row per search performed; in that case use uniqid instead of md5 and just create the record.
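
    One detail worth watching if you deduplicate by hash: json_encode is sensitive to key order, so the same filters supplied in a different order would produce a different hash. A minimal sketch, assuming filters arrive as a nested associative array; normalizeFilters and searchContextHash are hypothetical names, not part of the code above.

```php
<?php
// Hypothetical helper: sorts associative keys recursively so that logically
// identical filter sets always serialise to the same JSON string.
function normalizeFilters(array $filters): array
{
    foreach ($filters as $key => $value) {
        if (is_array($value)) {
            $filters[$key] = normalizeFilters($value);
        }
    }
    ksort($filters);

    return $filters;
}

// Hypothetical helper: same md5-over-JSON idea as saveSearchContext(),
// applied after the filters have been normalised.
function searchContextHash(string $query, array $filters, string $sort): string
{
    $filters = normalizeFilters($filters);

    return md5(json_encode(compact('query', 'filters', 'sort')));
}

$a = searchContextHash('lamp', ['color' => 'red', 'brand' => 'acme'], 'newest');
$b = searchContextHash('lamp', ['brand' => 'acme', 'color' => 'red'], 'newest');
// $a === $b, so both searches map to the same search_contexts row.
```

    This only normalises key order; the order of values inside a plain list is left alone, since it may be meaningful.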

    On the results index page, whenever you generate a link to the detail page, use the hash as a query parameter like this:

    http://example.com/details/2456?search=7ddf32e17a6ac5ce04a8ecbf782ca509
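
    A small sketch of building that link, assuming a /details/{id} route as in the example URL; detailUrl is a hypothetical helper.

```php
<?php
// Hypothetical helper: the /details/{id} path and the "search" parameter
// name follow the example URL; the base URL is an assumption.
function detailUrl(string $baseUrl, int $documentId, string $searchHash): string
{
    return rtrim($baseUrl, '/') . '/details/' . $documentId
        . '?' . http_build_query(['search' => $searchHash]);
}

$url = detailUrl('http://example.com', 2456, '7ddf32e17a6ac5ce04a8ecbf782ca509');
// $url is "http://example.com/details/2456?search=7ddf32e17a6ac5ce04a8ecbf782ca509"
```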
    

    In your detail page code, do something like this:

    function getAdjacentDocument($search, $documentId, $next = true) {
        $sortDefinitions = getSortDefinitions();
    
        if (!$next) {
            // Reverse every sort definition by swapping asc and desc.
            // array_map takes the callback first and the array second.
            $sortDefinitions = array_map(function ($defn) {
                return array_map(function ($array) {
                    $field = array_key_first($array);
                    $direction = $array[$field] == 'asc' ? 'desc' : 'asc';
    
                    return [ $field => $direction ];
                }, $defn);
            }, $sortDefinitions);
        }
    
        // Start from the filters stored in the search context (assumed to be
        // decoded to an array), then add a must_not filter which will ensure
        // that the current page's document ID is *not* in the results.
        $filters = $search->filters;
        $filters['blacklist'] = $documentId;
    
        $params = [
            'body' => [
                'query' => generateQuery($search->query, $filters),
                'sort' => $sortDefinitions[$search->sort],
    
                // We are only interested in 1 document adjacent
                // to this one, limit results
                'size' => 1
            ]
        ];
    
        $response = Elasticsearch::search($params);
    
        // A search response has no 'found' key (that belongs to the get API),
        // so check the hits array instead.
        if (!empty($response['hits']['hits'])) {
            return $response['hits']['hits'][0];
        }
    
        return null;
    }
    
    function getNextDocument($search, $documentId) {
        return getAdjacentDocument($search, $documentId, true);
    }
    
    function getPreviousDocument($search, $documentId) {
        return getAdjacentDocument($search, $documentId, false);
    }
    
    // Retrieve the search context given its hash as query parameter
    $searchContext = SearchContext::whereHash(Input::query('search'))->first();
    
    // From the route segment
    $documentId = Input::route('id');
    
    $currentDocument = Elasticsearch::get([
        'id' => $documentId,
        'index' => 'documents'
    ]);
    
    $previousDocument = getPreviousDocument($searchContext, $documentId);
    $nextDocument = getNextDocument($searchContext, $documentId);
    

    The key to this technique is that you are generating two searches in addition to the get for the detail record.
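
    generateQuery is not shown here, so the following is only one possible shape for it: a bool query in which the blacklist entry becomes a must_not clause on the document ID. The field names title and body and the use of term filters are assumptions.

```php
<?php
// Hypothetical generateQuery(): the field names and the way filters are
// mapped to clauses are assumptions made for illustration.
function generateQuery(string $query, array $filters): array
{
    $bool = ['must' => [], 'filter' => [], 'must_not' => []];

    if ($query !== '') {
        // 'title' and 'body' are placeholder field names.
        $bool['must'][] = [
            'multi_match' => ['query' => $query, 'fields' => ['title', 'body']],
        ];
    }

    foreach ($filters as $field => $value) {
        if ($field === 'blacklist') {
            // Exclude the document currently being viewed.
            $bool['must_not'][] = ['ids' => ['values' => (array) $value]];
        } else {
            $bool['filter'][] = ['term' => [$field => $value]];
        }
    }

    // Drop empty clauses; fall back to match_all when nothing is left.
    $bool = array_filter($bool);

    return $bool ? ['bool' => $bool] : ['match_all' => new stdClass()];
}

$dsl = generateQuery('lamp', ['color' => 'red', 'blacklist' => 2456]);
```

    The same function serves both the forwards and the backwards search, because only the sort direction differs between them.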

    One search goes forwards from that record and the other goes backwards from it. Both use the same search context, so they stay consistent with each other.
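
    The direction swap for the backwards search can be isolated in a small helper. A sketch, assuming sort definitions shaped like those in getSortDefinitions(); reverseSortDefinition is a hypothetical name.

```php
<?php
// Hypothetical helper: flips asc/desc on every clause of one sort definition.
function reverseSortDefinition(array $definition): array
{
    return array_map(function (array $clause) {
        $field = array_key_first($clause);

        return [$field => $clause[$field] === 'asc' ? 'desc' : 'asc'];
    }, $definition);
}

$newest = [['created_at' => 'desc'], ['id' => 'desc']];

$forward  = $newest;                        // used for the "next" search
$backward = reverseSortDefinition($newest); // used for the "previous" search
// $backward is [['created_at' => 'asc'], ['id' => 'asc']]
```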

    In both cases, you take the first record that is not the current record, and it should be the adjacent one.
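
    One caveat: with only the current document excluded, the first hit is the first document of the whole result set, which is adjacent only when the current document sits at the very start in that direction. To pin each search to the current position, Elasticsearch's search_after parameter can be given the current document's sort values. A minimal sketch, assuming those values were kept from the sort array of a hit that used the same sort definition; buildAdjacentSearchBody is a hypothetical helper and the query and sample values are placeholders.

```php
<?php
// Sketch only: $currentSortValues is assumed to be the "sort" array that
// Elasticsearch returned for the current document in a hit that used the
// same sort definition, for example [1609864620000, 2456].
function buildAdjacentSearchBody(array $query, array $sortDefinition, array $currentSortValues): array
{
    return [
        'query' => $query,
        'sort' => $sortDefinition,
        // Start strictly after the current document in this sort order.
        'search_after' => $currentSortValues,
        'size' => 1,
    ];
}

$body = buildAdjacentSearchBody(
    ['match' => ['title' => 'lamp']],             // placeholder query
    [['created_at' => 'desc'], ['id' => 'desc']], // the "newest" definition
    [1609864620000, 2456]                         // placeholder sort values
);
```

    For the previous document, the same sort values would be passed together with the reversed sort definition. Because the id tie-breaker makes every sort position unique, the current document itself falls outside the range that search_after returns.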
