Create a sparse matrix, given the indices of the non-zero elements, to build dummy variables for a categorical column of a large dataset

后悔当初 2021-01-24 13:48

I'm trying to use a sparse matrix to generate dummy variables for a set of data with 5.8 million rows and two categorical columns.

The structure of the data is:

2 answers
  •  孤街浪徒
    2021-01-24 14:34

    Why do you want a sparse matrix? For a dummy matrix you can also just use:

    model.matrix(~ . + 0, data = df)
    

    The + 0 removes the intercept, and the . stands for every column of df, so all categorical variables are expanded into dummy columns. Be sure to convert these variables to factors with as.factor() beforehand.
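If memory is a concern at 5.8 million rows, the same formula can probably be passed to sparse.model.matrix() from the Matrix package, which returns a sparse result instead of a dense one. The sketch below is only an illustration: the toy data frame df and its columns a and b are hypothetical stand-ins for the real data, and the factors are assumed to contain no missing values. It also shows one way to build a single factor's indicator matrix directly from row and column indices with sparseMatrix(), which is what the question title describes.

```r
library(Matrix)  # recommended package, bundled with standard R installations

# Hypothetical toy data standing in for the two categorical columns.
df <- data.frame(
  a = factor(c("x", "y", "z", "x")),
  b = factor(c("p", "q", "p", "q"))
)

# Dense dummy matrix, as in the answer above.
dense <- model.matrix(~ . + 0, data = df)

# Sparse counterpart: same formula, but only non-zero entries are stored.
sparse <- sparse.model.matrix(~ . + 0, data = df)

# Building the indicator matrix for one factor directly from indices:
# row i has its single 1 in the column given by the factor's integer code.
# Assumes df$a has no NA values.
codes <- as.integer(df$a)
ind_a <- sparseMatrix(
  i = seq_along(codes),
  j = codes,
  x = rep(1, length(codes)),
  dims = c(length(codes), nlevels(df$a)),
  dimnames = list(NULL, levels(df$a))
)
```

Note that with + 0 only the first factor keeps one column per level; later factors are still coded with the default treatment contrasts and lose their first level. The index-based construction gives every level its own column, so it can be applied to each factor separately and the results combined with cbind() if a full set of dummies is wanted.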
