tidytext in R for Spanish text: any alternative?

[愿得一人] 2021-01-19 08:24

I'm doing sentiment analysis on Twitter data, but my tweets are in Spanish, so I can't use tidytext to classify the words. Does anyone know if there is a similar package for Sp

3 answers
  • I ran into the same issue with non-English text mining. I found udpipe, an R package developed by BNOSAC. It is a natural language processing toolkit that provides language-agnostic tokenization, part-of-speech tagging, lemmatization, morphological feature tagging and dependency parsing of raw text. Beware that the package contains no sentiment tags; you will need to find those elsewhere.

    It supports a diverse range of non-English languages.

    You can find out more on their blog, on the udpipe website, or on GitHub.

    P.S. I have no affiliation with them.

  • 2021-01-19 08:47

    The Stanford CoreNLP package is on CRAN and also provides sentiment for Spanish through the get_sentiment function.

  • 2021-01-19 08:49

    Unfortunately, there are not many good open-source sentiment lexicons for non-English languages right now. You can request the NRC lexicon in other languages from its authors. It is translated with Google Translate, which of course adds uncertainty but has been shown to be mostly OK overall. The authors say they give it away for research purposes but charge for commercial use.
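
Once a Spanish lexicon is in hand, scoring amounts to tokenizing the tweets and joining the tokens against the lexicon. The sketch below shows that idea in base R only. The four-word lexicon and the two tweets are made up for illustration; a real lexicon (for example the translated NRC one mentioned above) would be read from a file instead, and its column names may differ. With tidytext and dplyr installed, the same steps would typically be unnest_tokens() followed by inner_join().

```r
# Hypothetical toy lexicon, for illustration only.
lexicon <- data.frame(
  word      = c("feliz", "bueno", "triste", "malo"),
  sentiment = c("positive", "positive", "negative", "negative"),
  stringsAsFactors = FALSE
)

# Made-up example tweets.
tweets <- data.frame(
  id   = 1:2,
  text = c("Hoy estoy muy feliz, el concierto estuvo bueno",
           "El servicio fue malo y estoy triste"),
  stringsAsFactors = FALSE
)

# Lowercase the text, then split on whitespace and punctuation
# so that each tweet becomes a vector of word tokens.
tokens_list <- strsplit(tolower(tweets$text), "[[:space:][:punct:]]+")

# One row per token, keeping track of which tweet it came from.
tokens <- data.frame(
  id   = rep(tweets$id, lengths(tokens_list)),
  word = unlist(tokens_list),
  stringsAsFactors = FALSE
)

# Inner join: only tokens that appear in the lexicon survive.
scored <- merge(tokens, lexicon, by = "word")

# Count positive and negative words per tweet.
counts <- table(id = scored$id, sentiment = scored$sentiment)
print(counts)
```

Matching on surface forms misses inflected words (for example "felices" or "malas"), so lemmatizing the tokens first, as udpipe can do, would likely improve coverage.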
