Creating a loop through a list of variables for an LM model in R

泪湿孤枕 提交于 2020-07-30 09:06:01

问题


I am trying to create multiple linear regression models from a list of variable combinations (I also have them separately as a data-frame if that is more useful!)

The list of variables looks like this:

Vars
x1+x2+x3
x1+x2+x4
x1+x2+x5
x1+x2+x6
x1+x2+x7

The loop I'm using looks like this:

for (i in 1:length(var_list)){
  lm(independent_variable ~ var_list[i],data = training_data)
  i+1
}

However it is not recognizing the string of var_list[i] which gives x1+x2+x3 etc. as a model input.

Does any-one know how to fix it?

Thanks for your help.


回答1:


You don't even have to use loops. Apply should work nicely.

training_data <- as.data.frame(matrix(sample(1:64), nrow = 8))
colnames(training_data) <- c("independent_variable", paste0("x", 1:7))

Vars <- as.list(c("x1+x2+x3",
                "x1+x2+x4",
                "x1+x2+x5",
                "x1+x2+x6",
                "x1+x2+x7"))

allModelsList <- lapply(paste("independent_variable ~", Vars), as.formula)
allModelsResults <- lapply(allModelsList, function(x) lm(x, data = training_data))  

If you need models summaries you can add :

allModelsSummaries = lapply(allModelsResults, summary) 

For example you can access the coefficient R² of the model lm(independent_variable ~ x1+x2+x3) by doing this:

allModelsSummaries[[1]]$r.squared

I hope it helps.




回答2:


We can create the formula with paste

out <- vector('list', length(var_list))

for (i in seq_along(var_list)){
  out[[i]] <- lm(paste('independent_variable',  '~', var_list[i]),
               data = training_data)
 }

Or otherwise, it can be done with reformulate

lm(reformulate(var_list[i], 'independent_variable'), data = training_data)


来源:https://stackoverflow.com/questions/59161144/creating-a-loop-through-a-list-of-variables-for-an-lm-model-in-r

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!