I am using TfidfVectorizer and SGDClassifier to classify text into categories (Sport, Business, etc.), as shown below:
text_clf = Pipeline([ ('tfidf', Tf