Multiple tags search query

后端 未结 4 790
深忆病人
深忆病人 2021-01-02 12:24

I working on a tag based search. I have three tables tag(id,name), tagXmedia(id,tag_id,media_id), and media(id,...). tagXmedia is the mapping table between the tag and media

相关标签:
4条回答
  • 2021-01-02 12:33

    The following should work.

    SELECT media_id
    FROM tagXmedia
    WHERE tag_id IN (SELECT id FROM tag WHERE name IN ('home','hawaii'))
    GROUP BY media_id
    HAVING COUNT(tag_id) = 2;
    

    If you wish to have it match more than just two tags, you can easily add them. Just remember to change the 2 in the HAVING clause.

    I assumed all the rows in tagXmedia are unique. In case they aren't, you will have to add DISTINCT to the COUNT part.

    0 讨论(0)
  • 2021-01-02 12:33
    SELECT
        media1.media_id
    FROM
        (
            SELECT media_id FROM tagXmedia A INNER JOIN
            (SELECT id tag_id FROM tag WHERE name='home') B
            USING (tag_id)
        ) media1
        INNER JOIN
        (
            SELECT media_id FROM tagXmedia C INNER JOIN
            (SELECT id tag_id FROM tag WHERE name='hawaii') D
            USING (tag_id)
        ) media2
        USING (media_id)
    ;
    

    Make sure you have this index in tagXmedia:

    ALTER TABLE tagXmedia ADD UNIQUE INDEX (tag_id,media_id);
    

    Here is a test case:

    drop database if exists tagmediatest;
    create database tagmediatest;
    use tagmediatest
    CREATE TABLE media
    (
       id int not null auto_increment,
       stuff varchar(20),
       PRIMARY KEY (id)
    );
    INSERT INTO media (stuff) VALUES
    ('magazine'),('television'),('iphone'),
    ('ipad'),('IE9 Browser'),('radio');
    CREATE TABLE tag
    (
       id int not null auto_increment,
       name varchar(20),
       PRIMARY KEY (id),
       UNIQUE KEY (name)
    );
    INSERT INTO tag (name) VALUES
    ('away'),('home'),('jersery city'),('hawaii'),('nyc');
    CREATE TABLE tagXmedia
    (
       id int not null auto_increment,
       tag_id INT NOT NULL,
       media_id INT NOT NULL,
       PRIMARY KEY (id),
       UNIQUE KEY (tag_id,media_id)
    );
    INSERT INTO tagXmedia (tag_id,media_id) VALUES
    (1,1),(1,2),(1,3),(1,6),
    (2,1),(2,2),(2,4),(2,5),
    (3,5),(3,4),(3,3),(3,1),
    (4,2),(4,3),(4,5),(4,6),
    (5,2),(5,3),(5,5),(5,4);
    SELECT 
        media1.media_id 
    FROM 
        ( 
            SELECT media_id FROM tagXmedia A INNER JOIN 
            (SELECT id tag_id FROM tag WHERE name='home') B 
            USING (tag_id) 
        ) media1 
        INNER JOIN 
        ( 
            SELECT media_id FROM tagXmedia C INNER JOIN 
            (SELECT id tag_id FROM tag WHERE name='hawaii') D 
            USING (tag_id) 
        ) media2
    USING (media_id)
    ; 
    

    Here is the result:

    mysql> drop database if exists tagmediatest;
    Query OK, 3 rows affected (0.09 sec)
    
    mysql> create database tagmediatest;
    Query OK, 1 row affected (0.00 sec)
    
    mysql> use tagmediatest
    Database changed
    mysql> CREATE TABLE media
        -> (
        ->    id int not null auto_increment,
        ->    stuff varchar(20),
        ->    PRIMARY KEY (id)
        -> );
    Query OK, 0 rows affected (0.05 sec)
    
    mysql> INSERT INTO media (stuff) VALUES
        -> ('magazine'),('television'),('iphone'),
        -> ('ipad'),('IE9 Browser'),('radio');
    Query OK, 6 rows affected (0.05 sec)
    Records: 6  Duplicates: 0  Warnings: 0
    
    mysql> CREATE TABLE tag
        -> (
        ->    id int not null auto_increment,
        ->    name varchar(20),
        ->    PRIMARY KEY (id),
        ->    UNIQUE KEY (name)
        -> );
    Query OK, 0 rows affected (0.08 sec)
    
    mysql> INSERT INTO tag (name) VALUES
        -> ('away'),('home'),('jersery city'),('hawaii'),('nyc');
    Query OK, 5 rows affected (0.06 sec)
    Records: 5  Duplicates: 0  Warnings: 0
    
    mysql> CREATE TABLE tagXmedia
        -> (
        ->    id int not null auto_increment,
        ->    tag_id INT NOT NULL,
        ->    media_id INT NOT NULL,
        ->    PRIMARY KEY (id),
        ->    UNIQUE KEY (tag_id,media_id)
        -> );
    Query OK, 0 rows affected (0.06 sec)
    
    mysql> INSERT INTO tagXmedia (tag_id,media_id) VALUES
        -> (1,1),(1,2),(1,3),(1,6),
        -> (2,1),(2,2),(2,4),(2,5),
        -> (3,5),(3,4),(3,3),(3,1),
        -> (4,2),(4,3),(4,5),(4,6),
        -> (5,2),(5,3),(5,5),(5,4);
    Query OK, 20 rows affected (0.05 sec)
    Records: 20  Duplicates: 0  Warnings: 0
    
    mysql> SELECT
        ->     media1.media_id
        -> FROM
        ->     (
        ->         SELECT media_id FROM tagXmedia A INNER JOIN
        ->         (SELECT id tag_id FROM tag WHERE name='home') B
        ->         USING (tag_id)
        ->     ) media1
        ->     INNER JOIN
        ->     (
        ->         SELECT media_id FROM tagXmedia C INNER JOIN
        ->         (SELECT id tag_id FROM tag WHERE name='hawaii') D
        ->         USING (tag_id)
        ->     ) media2
        -> USING (media_id)
        -> ;
    +----------+
    | media_id |
    +----------+
    |        2 |
    |        5 |
    +----------+
    2 rows in set (0.00 sec)
    
    mysql>
    

    Note that tag_id 2 and 4 reside in media_id 2 and 5. This is why the query works.

    0 讨论(0)
  • 2021-01-02 12:50

    Try this query:

    SELECT
        T1.media_id
    FROM
        tagXmedia as T1
    INNER JOIN media as T2 
    ON T1.media_id =T2.id
    INNER JOIN tag as T3 
    ON T1.id = T3.tag_id AND T3.name IN ('home','hawaii')
    GROUP BY T1.media_id
    
    0 讨论(0)
  • 2021-01-02 12:52

    @kba's answer is correct but you can also do this with a JOIN which is probably more efficient.

    SELECT media_id
      FROM tagXmedia
      LEFT JOIN tag ON tag_id = tag.id
      WHERE tag.name IN ('home', 'hawaii')
      GROUP BY media_id
      HAVING COUNT(tag_id) = 2;
    

    I had a similar problem where I wanted to get not just the media_id but the actual object and wanted to pass in arbitrary comma separated lists of tags. Here's my complete solution using a stored procedure:

    CREATE PROCEDURE FindByTag(IN _tags VARCHAR(256))
    BEGIN
      DECLARE _length INT;
    
      -- Get the length of the list
      SET _tags =  TRIM(BOTH ',' FROM _tags);
      SET _length = LENGTH(_tags) - LENGTH(REPLACE(_tags, ',', '')) + 1;
    
      -- Find media
      SELECT * FROM media
        WHERE id IN (
          SELECT media_id FROM tagXmedia
            LEFT JOIN tag ON tag_id = tag.id
            WHERE FIND_IN_SET(tag.name, _tags)
            GROUP BY media_id
            HAVING COUNT(tag_id) = _length
        )
    END
    
    0 讨论(0)
提交回复
热议问题