Aggregate values over a range of hours, every hour

你的背包 2021-02-07 18:02

I have a PostgreSQL 9.1 database with a table containing a timestamp and a measured value:

'2012-10-25 01:00'   2
'2012-10-2


        
2 answers
  • 2021-02-07 18:26

    A window function with a custom frame makes this amazingly simple:

    SELECT ts
          ,avg(val) OVER (ORDER BY ts
                          ROWS BETWEEN CURRENT ROW AND 7 FOLLOWING) AS avg_8h
    FROM tbl;
    

    Live demo on sqlfiddle.

    The frame for each average is the current row plus the following 7. This assumes you have exactly one row for every hour. Your sample data seems to imply that, but you did not specify.

    As written, avg_8h for the last 7 rows of the set (ordered by ts) is computed from fewer than 8 rows, because the frame is cut off at the end of the set; for the very last row, the average is simply its own value. You did not specify how to handle this edge case.
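
    If partial averages at the end of the set are not wanted, one possible way to suppress them is sketched below. It assumes the same tbl, ts and val as above; the window name w and the choice to return NULL for incomplete frames are illustrative assumptions, since the question does not say what should happen there.

    ```sql
    SELECT ts
          ,CASE WHEN count(*) OVER w = 8      -- frame holds all 8 rows
                THEN avg(val) OVER w
           END AS avg_8h                      -- NULL for the last 7 rows
    FROM   tbl
    WINDOW w AS (ORDER BY ts
                 ROWS BETWEEN CURRENT ROW AND 7 FOLLOWING);
    ```

    Like the original query, this still relies on there being exactly one row per hour: a ROWS frame counts rows, not hours.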

  • 2021-02-07 18:35

    The key is to build a virtual table of time windows against which to join your data. The generate_series function can help with that, in the following manner:

    SELECT
        start
        , start + interval '8 hours' as end
    FROM (
        SELECT generate_series(
            date'2012-01-01'
            , date'2012-02-02'
            , '1 hour'
        ) AS start
    ) x;
    

    This produces output something like this:

             start          |          end           
    ------------------------+------------------------
     2012-01-01 00:00:00+00 | 2012-01-01 08:00:00+00
     2012-01-01 01:00:00+00 | 2012-01-01 09:00:00+00
     2012-01-01 02:00:00+00 | 2012-01-01 10:00:00+00
     2012-01-01 03:00:00+00 | 2012-01-01 11:00:00+00
    

    This gives you something to join your data to. In this way, the following query:

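    SELECT
        y.start
        , round(avg(ts_val.v), 1)   -- one decimal place, as in the results below
    FROM
        ts_val
        JOIN (
            -- one row per hourly window start, with the end 8 hours later
            SELECT
                start
                , start + interval '8 hours' as end
            FROM (
                SELECT generate_series(
                    date'2012-01-01'
                    , date'2012-02-02'
                    , '1 hour'
                ) AS start
            ) x
        ) y ON ts_val.ts BETWEEN y.start AND y.end   -- BETWEEN includes both endpoints
    GROUP BY
        y.start
    ORDER BY
        y.start
    ;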
    

    run against the following data:

             ts          | v 
    ---------------------+---
     2012-01-01 01:00:00 | 2
     2012-01-01 09:00:00 | 2
     2012-01-01 10:00:00 | 5
    (3 rows)
    

    produces the following results:

             start          | round 
    ------------------------+-------
     2012-01-01 00:00:00+00 |   2.0
     2012-01-01 01:00:00+00 |   2.0
     2012-01-01 02:00:00+00 |   3.5
     2012-01-01 03:00:00+00 |   3.5
     2012-01-01 04:00:00+00 |   3.5
     2012-01-01 05:00:00+00 |   3.5
     2012-01-01 06:00:00+00 |   3.5
     2012-01-01 07:00:00+00 |   3.5
     2012-01-01 08:00:00+00 |   3.5
     2012-01-01 09:00:00+00 |   3.5
     2012-01-01 10:00:00+00 |   5.0
    (11 rows)
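
    One caveat: BETWEEN includes both endpoints, so each window above covers start through start + 8 hours, which is nine hourly timestamps when there is one reading per hour. If exactly eight hours are intended, a half-open interval may be closer to what is wanted. The following is a minimal sketch under that assumption; the alias avg_8h and the shortened series range are illustrative, and v is assumed to be an integer or numeric column so that round(..., 1) applies.

    ```sql
    SELECT y.start
          ,round(avg(t.v), 1) AS avg_8h
    FROM  (
        -- hourly window starts; range shortened to cover the sample data
        SELECT generate_series(timestamp '2012-01-01 00:00'
                              ,timestamp '2012-01-01 10:00'
                              ,interval '1 hour') AS start
    ) y
    JOIN   ts_val t ON  t.ts >= y.start                       -- start included
                    AND t.ts <  y.start + interval '8 hours'  -- end excluded
    GROUP  BY y.start
    ORDER  BY y.start;
    ```

    With the sample data above, the only row that changes is the 02:00 window: the 10:00 reading no longer falls inside it, so its average is 2.0 rather than 3.5.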
    