dimension mismatch error in CountVectorizer MultinomialNB

对着背影说爱祢 posted on 2019-12-01 22:53:46

Your CountVectorizer has already been fitted on the training data, so for the test data you only need to call transform(), not fit_transform().

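If you call fit_transform() again on the test data, the vectorizer learns a new vocabulary from the test data alone, so the resulting matrix has a different set of columns from the one MultinomialNB was trained on. That is what triggers the dimension mismatch error. Fit once, on the training data, and only transform everything else.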

X_testcv = cv.transform(X_test)
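
For context, here is a minimal end-to-end sketch of that pattern. The toy sentences, the labels, and the names X_train, y_train, X_traincv, clf and X_test_refit are made up for illustration (only cv, X_test and X_testcv come from the line above), so adapt them to your own data.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB

# Hypothetical toy data, only for illustration.
X_train = ["free prize inside", "meeting at noon",
           "win a free prize", "lunch meeting today"]
y_train = [1, 0, 1, 0]
X_test = ["free lunch today", "prize meeting"]

cv = CountVectorizer()
X_traincv = cv.fit_transform(X_train)  # learns the vocabulary from the training data
X_testcv = cv.transform(X_test)        # reuses that vocabulary, so the columns line up

clf = MultinomialNB()
clf.fit(X_traincv, y_train)
predictions = clf.predict(X_testcv)    # works: same number of columns as in training

# For comparison: refitting on the test data builds a different vocabulary,
# so this matrix does not have the column layout the classifier expects.
X_test_refit = CountVectorizer().fit_transform(X_test)
print(X_traincv.shape, X_testcv.shape, X_test_refit.shape)
```

Words in the test data that were never seen during training are simply ignored by transform(), which is why the column count stays the same.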