TimescaleDB query to select rows where column value changed from previous row

只愿长相守 提交于 2021-01-27 11:43:23

问题


Just recently started using TimescaleDB with postgrest to handle most requests for data.

However I'm running into an issue where I have a horribly inefficient request for time series of data.

It's a data series that can be any length of time, with specific Integer values.

Most of the time the value will be the same unless there's an anomaly. So rather than fetching +10,000 rows of data. I would like to aggregate this into "time blocks".

Let's say there 97 items in a row where the value is 100 (new item for every 5 minutes) #98 the value is 48 for 5 items in a row and then it goes back up to 100 for another 2,900 rows.

I don't want to fetch 3002 items to display this data. I should only need to fetch 3 items.

  • 1 item that says the value is 100 from a startDate
  • 1 item that says the value is 48 from a startDate after #1
  • 1 item that says the value is 100 again from a startDate after #2

But I'm having some trouble figuring out how I can do this with timescaledb.

basically, if the value is the same as the last value, aggregate it. That's all I need it to do.

Does anyone know how to construct a VIEW for this kind of situation in timescaleDB using continuous aggregation (or if there's a faster way) to fetch this?


回答1:


You can achieve the desired result with window functions and a subselect:

SELECT time, value FROM (
  SELECT 
    time,
    value,
    value - LAG(value) OVER (ORDER BY time) as diff
  FROM hypertable) ht 
WHERE diff IS NULL OR diff != 0;

You use a window function to calculate the diff to the previous row and then filter all the rows where the diff is 0 in the outer query.



来源:https://stackoverflow.com/questions/56331247/timescaledb-query-to-select-rows-where-column-value-changed-from-previous-row

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!