Foreign key vs check constraint for integrity

死守一世寂寞 2020-12-09 18:50

I am building a system that is a central repository for storing data from a number of other systems. A sync process is required to update the central repository when the oth

2 Answers
  • 2020-12-09 19:31

    I think you're confusing a foreign key constraint with a check constraint.

    A foreign key constraint is there to enforce referential integrity, while a check constraint restricts a column to valid values. In your case this may seem like a minor difference, but if we abstract it slightly I hope to make it clearer.

    If we consider a table users with the columns user_id, user_name, address_id, join_date, active, last_active_month; I recognise that this is not necessarily the best way of doing things but it'll serve for the point I'm trying to make.

    In this case it's patently ridiculous to have a check constraint on address_id: this column could have any number of values. However, active, assuming we want a boolean y/n, can only have two possible values, and last_active_month can only have 12 possible values. In both of these cases it's completely ridiculous to have a foreign key. There is a fixed set of values and, by the definition of the data you are including, these values cannot change.
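As a rough illustration of the check-constraint side of this, here is a minimal sketch using Python's built-in sqlite3 module. The column types, the 'y'/'n' encoding, and the helper name try_insert are assumptions made for the example; only the column names come from the text above.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Column names follow the example table above; the types and the
# 'y'/'n' encoding of `active` are assumptions for this sketch.
conn.execute("""
    CREATE TABLE users (
        user_id           INTEGER PRIMARY KEY,
        user_name         TEXT NOT NULL,
        address_id        INTEGER,
        join_date         TEXT,
        active            TEXT NOT NULL CHECK (active IN ('y', 'n')),
        last_active_month INTEGER CHECK (last_active_month BETWEEN 1 AND 12)
    )
""")


def try_insert(active, month):
    """Return True if the row is accepted, False if a CHECK rejects it."""
    try:
        conn.execute(
            "INSERT INTO users (user_name, active, last_active_month) "
            "VALUES (?, ?, ?)",
            ("alice", active, month),
        )
        return True
    except sqlite3.IntegrityError:
        return False


ok_row = try_insert("y", 12)         # both values are inside the fixed sets
bad_flag = try_insert("maybe", 12)   # violates the CHECK on active
bad_month = try_insert("y", 13)      # violates the CHECK on last_active_month
```

The set of valid values lives in the table definition itself, which is reasonable here only because neither set is expected to change.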

    In your case, while you could go for a check constraint, unless you can be absolutely certain that the set of actions will never change, a foreign key is the correct way to go.


    On a slightly separate matter, and as @pst mentioned, I see you've been eaten by the surrogate key monster. While this can result in performance improvements, in a table of the size you're envisaging (3 values: insert / update / delete), or even a larger one, all it serves to do is obscure what you're trying to achieve.

    It's not easy to look at

    ID  Action  System
     1     1       1
     2     2       1 
    

    and see what's going on, but:

    ID  Action  System
     1  insert     1
     2  update     1
    

    is far easier to read; you may also want to consider doing the same for the system column - I probably would, though the number of possible values jumps slightly in this. Just my personal thoughts on the matter...

  • 2020-12-09 19:56

    The commenters seem to unanimously agree:

    It's generally better to have a FOREIGN KEY constraint to a (more or less static) reference table. Reasons:

    • The constraint is easily "extendable". To add or remove an option, you only have to add or remove a row from the reference table. You don't have to drop the constraint and recreate it. This matters even more if you have the same constraint on similar columns in other tables.

    • You can have extra information attached (more columns), that can be read by the applications if needed.

    • ORMs can deal better with (Read: be aware of) these constraints. They just have to read a table, not the meta-data.

    • If you want to change the Action codes, cascading updates will take care of the changes in other (possibly many) tables, provided the foreign keys are declared with ON UPDATE CASCADE. No need to write UPDATE queries.

    • One particular DBMS has not yet implemented CHECK constraints (shame), although it does have FK ones.
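Two of the points above (extending the set of options by adding a row, and cascading a code change) can be sketched with Python's built-in sqlite3 module. The table and column names, the helper accepts, and the extra 'M' / 'Merge' action are hypothetical and only serve the illustration. Note that SQLite needs foreign key enforcement switched on per connection.

```python
import sqlite3

# isolation_level=None puts the connection in autocommit mode, so the
# PRAGMA below is not silently ignored inside an open transaction.
conn = sqlite3.connect(":memory:", isolation_level=None)
conn.execute("PRAGMA foreign_keys = ON")  # SQLite does not enforce FKs by default

conn.execute(
    "CREATE TABLE action (action_code TEXT PRIMARY KEY, description TEXT NOT NULL)"
)
conn.execute("""
    CREATE TABLE sync_action (
        id          INTEGER PRIMARY KEY,
        action_code TEXT NOT NULL
            REFERENCES action (action_code) ON UPDATE CASCADE
    )
""")
conn.executemany(
    "INSERT INTO action VALUES (?, ?)",
    [("I", "Insert"), ("U", "Update"), ("D", "Delete")],
)


def accepts(code):
    """Return True if sync_action accepts the code, False on an FK violation."""
    try:
        conn.execute("INSERT INTO sync_action (action_code) VALUES (?)", (code,))
        return True
    except sqlite3.IntegrityError:
        return False


before = accepts("M")  # rejected: 'M' is not in the reference table yet

# Extending the allowed set is one INSERT, with no constraint to drop and recreate.
# 'M' / 'Merge' is a made-up action for this example.
conn.execute("INSERT INTO action VALUES ('M', 'Merge')")
after = accepts("M")   # accepted now

# Renaming a code in the reference table cascades to the referencing rows.
accepts("U")
conn.execute("UPDATE action SET action_code = 'UPD' WHERE action_code = 'U'")
codes = sorted(row[0] for row in conn.execute("SELECT action_code FROM sync_action"))
```

With a CHECK constraint, both changes would instead require altering the table definition and, for the rename, a manual UPDATE on every table that stores the code.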

    As @pst mentioned (and I prefer this approach very much), you can use a sensible code instead of a surrogate integer ID. So, your table could be:

    Table: System

    SystemID Description
     1        Slave System 1
     2        Slave System 2
    

    Table: Action

    ActionCode Description
     I          Insert
     U          Update
     D          Delete
    

    Table: SyncAction

    ID  ActionCode  SystemID
     1     I          1
     2     U          1
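The three tables above could be declared roughly as follows. This is a sketch using Python's built-in sqlite3 module; the column types are assumptions, while the table names, column names, and sample rows are taken from the layout above.

```python
import sqlite3

conn = sqlite3.connect(":memory:", isolation_level=None)
conn.execute("PRAGMA foreign_keys = ON")  # enable FK enforcement in SQLite

# Table and column names follow the answer; the types are assumed.
# "System" and "Action" are quoted to avoid any clash with SQL keywords.
conn.executescript("""
    CREATE TABLE "System" (
        SystemID    INTEGER PRIMARY KEY,
        Description TEXT NOT NULL
    );
    CREATE TABLE "Action" (
        ActionCode  TEXT PRIMARY KEY,
        Description TEXT NOT NULL
    );
    CREATE TABLE SyncAction (
        ID         INTEGER PRIMARY KEY,
        ActionCode TEXT    NOT NULL REFERENCES "Action" (ActionCode),
        SystemID   INTEGER NOT NULL REFERENCES "System" (SystemID)
    );
    INSERT INTO "System" VALUES (1, 'Slave System 1'), (2, 'Slave System 2');
    INSERT INTO "Action" VALUES ('I', 'Insert'), ('U', 'Update'), ('D', 'Delete');
    INSERT INTO SyncAction VALUES (1, 'I', 1), (2, 'U', 1);
""")

# SyncAction is readable on its own thanks to the one-letter codes;
# the full descriptions are one join away when an application needs them.
rows = conn.execute("""
    SELECT sa.ID, a.Description, s.Description
    FROM SyncAction AS sa
    JOIN "Action" AS a ON a.ActionCode = sa.ActionCode
    JOIN "System" AS s ON s.SystemID   = sa.SystemID
    ORDER BY sa.ID
""").fetchall()
```

Using the code itself as the primary key of Action means no extra lookup is needed just to read a SyncAction row, which is the readability argument made above.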
    