Best of breed indexing data structures for Extremely Large time-series

后端 未结 3 1275
别那么骄傲
别那么骄傲 2021-01-30 14:46

I\'d like to ask fellow SO\'ers for their opinions regarding best of breed data structures to be used for indexing time-series (aka column-wise data, aka flat linear).

T

3条回答
  •  执笔经年
    2021-01-30 15:03

    General ideas:

    Problem 1 is fairly common: Create an index that fits into your RAM and has links to the data on the secondary storage (datastructure: B-Tree family). Problem 2 / 3 are quite complicated since your data is so large. You could partition your data into time ranges and calculate the min / max for that time range. Using that information, you can filter out time ranges (e.g. max value for a range is 50 and you search for v0>60 then the interval is out). The rest needs to be searched by going through the data. The effectiveness greatly depends on how fast the data is changing.

    You can also do multiple indices by combining the time ranges of lower levels to do the filtering faster.

提交回复
热议问题