As the input into the logistic regression model in sklearn, I was trying to make a list of lists, where each sublist is a word embedding of a token (size~100). But I\'m not