Determining topics by using Lateral Dirichlet Allocation
tokenized = tweets[\'initial\'] dictionary = corpora.Dictionary(tokenized) dictionary.filter_extremes