Select rows which are not present in other table

抹茶落季 2020-11-21 07:47

I've got two PostgreSQL tables:

table name     column names
-----------    ------------------------
login_log             


        
4 answers
  •  Happy的楠姐
    2020-11-21 07:51

    There are basically 4 techniques for this task, all of them standard SQL.
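
    To try the queries below, a small test setup helps. This is only a sketch: the table names and the ip column are taken from the queries in this answer, while the inet type, the location column, and all sample rows are assumptions made for illustration.

    ```sql
    -- Hypothetical schema: only the table names and the "ip" column appear in
    -- this answer; the inet type, the location column and all rows are assumptions.
    CREATE TEMP TABLE login_log   (ip inet);
    CREATE TEMP TABLE ip_location (ip inet, location text);

    INSERT INTO login_log (ip) VALUES
       ('192.0.2.1'), ('192.0.2.2'), ('192.0.2.3');

    INSERT INTO ip_location (ip, location) VALUES
       ('192.0.2.1', 'Vienna');
    ```

    With this data, 192.0.2.2 and 192.0.2.3 have no match in ip_location, so each of the four queries should return exactly those two addresses.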

    NOT EXISTS

    Often fastest in Postgres.

    SELECT ip 
    FROM   login_log l 
    WHERE  NOT EXISTS (
       SELECT  -- SELECT list mostly irrelevant; can just be empty in Postgres
       FROM   ip_location
       WHERE  ip = l.ip
       );
    

    Also consider:

    • What is easier to read in EXISTS subqueries?

    LEFT JOIN / IS NULL

    Sometimes this is fastest. Often shortest. Often results in the same query plan as NOT EXISTS.

    SELECT l.ip 
    FROM   login_log l 
    LEFT   JOIN ip_location i USING (ip)  -- short for: ON i.ip = l.ip
    WHERE  i.ip IS NULL;
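
    Whether the two forms really get the same plan on your data can be checked with EXPLAIN. A minimal sketch, assuming both tables exist with an ip column; the plans depend on table sizes, indexes, and the Postgres version, so no particular output is implied:

    ```sql
    -- Plan for the NOT EXISTS variant
    EXPLAIN
    SELECT l.ip
    FROM   login_log l
    WHERE  NOT EXISTS (
       SELECT FROM ip_location i
       WHERE  i.ip = l.ip
       );

    -- Plan for the LEFT JOIN / IS NULL variant
    EXPLAIN
    SELECT l.ip
    FROM   login_log l
    LEFT   JOIN ip_location i USING (ip)
    WHERE  i.ip IS NULL;
    ```

    In many cases both plans show an anti join node, but compare the actual output rather than relying on that.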
    

    EXCEPT

    Short. Not as easily integrated in more complex queries.

    SELECT ip 
    FROM   login_log
    
    EXCEPT ALL  -- "ALL" keeps duplicates and makes it faster
    SELECT ip
    FROM   ip_location;
    

    Note that (per documentation):

    duplicates are eliminated unless EXCEPT ALL is used.

    Typically, you'll want the ALL keyword. If you don't care, still use it because it makes the query faster.
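
    One thing to be aware of: EXCEPT ALL subtracts row by row. A row with m copies on the left and n copies on the right appears max(m - n, 0) times in the result. The sketch below uses made-up VALUES lists in place of the real tables to show the difference:

    ```sql
    -- Made-up sample data: '1.1.1.1' appears 3x on the left and 1x on the right,
    -- '3.3.3.3' appears 2x on the left only.
    SELECT ip
    FROM  (VALUES ('1.1.1.1'::text), ('1.1.1.1'), ('1.1.1.1')
                , ('3.3.3.3'), ('3.3.3.3')) AS login_log(ip)
    EXCEPT ALL
    SELECT ip
    FROM  (VALUES ('1.1.1.1'::text)) AS ip_location(ip);
    -- result (order not guaranteed): '1.1.1.1' twice, '3.3.3.3' twice

    -- Same data with plain EXCEPT: duplicates are folded first
    SELECT ip
    FROM  (VALUES ('1.1.1.1'::text), ('1.1.1.1'), ('1.1.1.1')
                , ('3.3.3.3'), ('3.3.3.3')) AS login_log(ip)
    EXCEPT
    SELECT ip
    FROM  (VALUES ('1.1.1.1'::text)) AS ip_location(ip);
    -- result: '3.3.3.3' once
    ```

    So if login_log can hold more copies of an ip than ip_location does, EXCEPT ALL can still return that ip, unlike NOT EXISTS.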

    NOT IN

    Only good if there are no NULL values, or if you know how to handle NULL properly. I would not use it for this purpose. Also, performance can deteriorate with bigger tables.

    SELECT ip 
    FROM   login_log
    WHERE  ip NOT IN (
       SELECT DISTINCT ip  -- DISTINCT is optional
       FROM   ip_location
       );
    

    NOT IN carries a "trap" for NULL values on either side:

    • Find records where join doesn't exist
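
    A small sketch of the trap, with made-up VALUES lists standing in for the real tables. A single NULL in the subquery result makes NOT IN evaluate to NULL instead of true for every row that has no match, so nothing is returned:

    ```sql
    -- NOT IN: returns no rows, although '2.2.2.2' has no match
    SELECT ip
    FROM  (VALUES ('1.1.1.1'::text), ('2.2.2.2')) AS login_log(ip)
    WHERE  ip NOT IN (
       SELECT ip
       FROM  (VALUES ('1.1.1.1'::text), (NULL)) AS ip_location(ip)
       );

    -- NOT EXISTS on the same data: returns '2.2.2.2'
    SELECT l.ip
    FROM  (VALUES ('1.1.1.1'::text), ('2.2.2.2')) AS l(ip)
    WHERE  NOT EXISTS (
       SELECT FROM (VALUES ('1.1.1.1'::text), (NULL)) AS i(ip)
       WHERE  i.ip = l.ip
       );
    ```

    The other side behaves similarly: a NULL ip in login_log is never returned by NOT IN against a non-empty subquery result, while NOT EXISTS does return it, because ip = NULL never matches.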

    Similar question on dba.SE targeted at MySQL:

    • Select rows where value of second column is not present in first column
