How to create history fact table?

前端 未结 1 450
庸人自扰
庸人自扰 2021-02-07 20:59

I have some entities in my Data Warehouse:

  1. Person - with attributes personId, dateFrom, dateTo, and others those can be changed, e.g. last nam

相关标签:
1条回答
  • 2021-02-07 21:38

    Interesting question @Argnist!

    So to create some common language for my example, you want a

    • DimPerson (PK=kcPerson, suggorate key for unique Persons=kPerson, type 2 dim)
    • DimDocument (PK=kcDocument, suggorate key for unique Documents=kDocument, type 2 dim)
    • DimAddress (PK=kcAddress, suggorate key for unique Addresses=kAddress, type 2 dim)

    A colleague has written a short blog on the usage of two surrogate keys to explain the above dims 'Using Two Surrogate Keys on Dimensions'.

    I would always add DimDate with PK in the form yyyymmdd to any data warehouse with extra attribute columns.

    Then you would have your fact table as

    • FactHistory (FKs=kcPerson, kPerson, kcDocument, kDocument, kcPerson, kPerson, kDate) plus any aditional measures.

    Then joining on the "kc"s you can show the current Person/Document/Address dimension information. If you joined on the "k"s you can show the historic Person/Document/Address dimension information.

    The downside of this is that this fact table needs one row for each person/document/address/date combination. But it really is a very narrow table, since the table just has a number of foreign keys.

    The advantage of this is it is very easy to query for the sorts of questions you were asking.

    Alternatively, you could have your fact table as

    • FactHistory (FKs=kcPerson, kPerson, kcDocument, kDocument, kcPerson, kPerson, kDateFrom, kDateTo) plus any aditional measures.

    This is obviously much more compact, but the querying becomes more complex. You could also put a view over the Fact table to make it easier to query!

    The choice of solution depends on the frequency of change of the data. I suspect that it will not be changing that quickly, so teh alternate design of the fact table may be better.

    Hope that helps.

    0 讨论(0)
提交回复
热议问题