Fixed Effects Regression with Interaction Term Causes Error

后端 未结 3 1180
孤独总比滥情好
孤独总比滥情好 2021-01-18 03:08

I am trying to estimate a panel dataset with an interaction term for geographical areas (LoadArea, DischargeArea) which signifies a route. Using the fixed effects specificat

相关标签:
3条回答
  • 2021-01-18 03:40

    Please note that plm() is playing fine all along, its the summary.plm() function that's breaking bad! Delving deeper into the function reveals the trouble in the part where it calculates R^2.

    Read more here on the same problem at stackexchange

    Quick and not so elegant workarounds include:

    (1) Replacing LoadArea:DischargeArea with LoadArea*DischargeArea

    (2) Manually create separate interaction variable

    LoadxDischarge <- LoadArea*DischargeArea 
    
    0 讨论(0)
  • 2021-01-18 03:42

    A way to get at least the standard errors etc. is to use

    library("sandwich")
    library("lmtest")
    coeftest(mult_fe)
    
    0 讨论(0)
  • 2021-01-18 03:51

    This is a problem of collinearity among your variables.

    The lm command automatically places NAs in the beta vector for variables that were not estimated due to colinearity, but PLM does not.

    When you have LoadArea*DischargeArea PLM will three variables to your model:

    LoadArea + DischargeArea + LoadArea:DischargeArea
    

    After that PLM will demean them.

    In this case, and without further information on your data my guess is that one of these variables is perfectly collinear with one of the factors levels in:

    as.factor(Laycan.Day.Diff)
    

    In your case I would try to estimate the model without the factor. If it works you know the factors are causing the problem. If it comes to that you can then convert each factor to a explicit 0/1 dummy and add them one by one until you understand where the problem is coming from.

    To determine which variables are collinear you could try something like:

    require(data.table)
    tmp      <- data.table(var1=1:10,var2=55:64,userid=rep(c(1,2),5))
    cols     <- c('var1','var2')
    newnames <- c('demeaned_var1','demeaned_var2')
    tmp[,(newnames):=.SD-lapply(.SD,mean),.SDcols=cols,by=userid]
    cor(tmp[,newnames,with=F])
    

    Line 5 is the demeaning. This other stack overflow post describes the operations of the data.table that i used above in detail.

    The output of the code above will be:

    > 
                  demeaned_var1 demeaned_var2
    demeaned_var1             1             1
    demeaned_var2             1             1
    

    This will tell you which demeaned vars are perfectly collinear.

    0 讨论(0)
提交回复
热议问题