I have X as a csr_matrix that I obtained using scikit's tfidf vectorizer, and y, which is an array.
My plan is to create features using LDA, however, I failed to find
Gensim has a semi-well-hidden class that can more or less do this for you:
http://radimrehurek.com/gensim/matutils.html#gensim.matutils.Sparse2Corpus
"class gensim.matutils.Sparse2Corpus(sparse, documents_columns=True) Convert a matrix in scipy.sparse format into a streaming gensim corpus."
I've had some success with it using a corpus extracted with CountVectorizer, then loaded into gensim.