create a sparse matrix; given the indices of non-zero elements for creation of dummy variables of a categorical column of a large dataset

前端 未结 2 692
后悔当初
后悔当初 2021-01-24 13:48

I\'m trying to use a sparse matrix to generate dummy variables for a set of data with 5.8 million rows and two categorical columns.

The structure of the data is:

2条回答
  •  伪装坚强ぢ
    2021-01-24 14:36

    Try this:

    spmat<-Matrix(0,nrow = 210000 ,ncol = 500,sparse = T)
    locs<-Matrix(data=c(mydata$Var_1,mydata$Var_2),byrow=F,ncol=2)
    spmat[locs]=1
    

提交回复
热议问题