I\'m running a MYSQL query in two steps. First, I get a list of ids with one query, and then I retrieve the data for those ids using a second query along the lines of
Starting from a certain number of records, the IN
predicate over a SELECT
becomes faster than that over a list of constants.
See this article in my blog for performance comparison:
If the column used in the query in the IN
clause is indexed, like this:
SELECT *
FROM table1
WHERE unindexed_column IN
(
SELECT indexed_column
FROM table2
)
, then this query is just optimized to an EXISTS
(which uses but a one entry for each record from table1
)
Unfortunately, MySQL
is not capable of doing HASH SEMI JOIN
or MERGE SEMI JOIN
which are yet more efficient (especially if both columns are indexed).
Why do you extract the ids first? You should probably just join the tables. If you use the ids for something else, you can insert them in a temp table before and use this table for the join.