Extracting Key-Phrases from text based on the Topic with Python

后端 未结 3 407
被撕碎了的回忆
被撕碎了的回忆 2021-01-03 03:36

I have a large dataset with 3 columns, columns are text, phrase and topic. I want to find a way to extract key-phrases (phrases column) based on the topic. Key-Phrase can b

3条回答
  •  一生所求
    2021-01-03 04:13

    I think what your looking for is called "Topic modeling" in NLP. you should try using LDA for topic modeling. It's one of easiest methods to apply. also as @Mike mentioned, converting word to vector has many approaches. You should first try simple approaches like count vectorizer and then gradually move to something like word-2-vect or glove.

    I am attaching some links for applying LDA to the corpus. 1. https://towardsdatascience.com/nlp-extracting-the-main-topics-from-your-dataset-using-lda-in-minutes-21486f5aa925 2. https://www.machinelearningplus.com/nlp/topic-modeling-visualization-how-to-present-results-lda-models/

提交回复
热议问题