How to remove non UTF-8 characters from text
Question

I need help removing non-UTF-8 characters from my word cloud. This is my code so far. I've tried gsub and removeWords, but the characters are still there in my word cloud, and I do not know how to get rid of them. Any help would be appreciated. Thank you for your time.

    txt <- readLines("11-0.txt")
    corpus = VCorpus(VectorSource(txt))
    gsub("’","‘","",txt)
    corpus = tm_map(corpus, content_transformer(tolower))
    corpus = tm_map(corpus, removeWords, stopwords("english"))
    corpus = tm_map(corpus,
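
One possible way to approach this, as a rough sketch rather than a definitive fix: clean the character vector before building the corpus, and assign the result back, since gsub returns a new vector instead of modifying txt in place. The sample strings below are made up and stand in for the lines of "11-0.txt"; the variable names txt_quotes and txt_ascii are also just illustrative. It assumes the file is UTF-8 and that the unwanted characters are curly quotes.

```r
# Made-up sample lines standing in for readLines("11-0.txt", encoding = "UTF-8")
txt <- c("Alice\u2019s Adventures", "\u2018Curiouser and curiouser!\u2019")

# Option 1: remove only curly single and double quotes.
# gsub(pattern, replacement, x) returns a new vector, so keep the result.
txt_quotes <- gsub("[\u2018\u2019\u201C\u201D]", "", txt)

# Option 2: drop anything that cannot be represented in ASCII.
# sub = "" asks iconv to replace non-convertible bytes with nothing;
# the exact behaviour can vary with the platform's iconv implementation.
txt_ascii <- iconv(txt, from = "UTF-8", to = "ASCII", sub = "")

print(txt_quotes)
print(txt_ascii)
```

With either option, the cleaned vector (not the original txt) would then be passed to VectorSource when creating the corpus, so that the later tm_map steps operate on text that no longer contains those characters.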