Having the following table (conversations
):
id | record_id | is_response | text |
---+------------+---------------+------------
Here's my take:
SELECT
record_id,
string_agg(text, ' ' ORDER BY id) AS context
FROM (
SELECT
*,
coalesce(sum(incl::integer) OVER (ORDER BY id ROWS BETWEEN UNBOUNDED PRECEDING AND 1 PRECEDING),0) AS grp
FROM (
SELECT *, is_response AND text IN (SELECT text FROM responses) as incl
FROM conversations
) c
) c1
GROUP BY record_id, grp
HAVING bool_or(incl)
ORDER BY max(id);
This will scan the table conversations
once, but I am not sure if it will perform better than your solution. The basic idea is to use a window function to count how maybe preceding rows within the same record, end the conversation. Then we can group by with that number and the record_id
and discard incomplete conversations.