Removing words featured in character vector from string

前端 未结 2 975
傲寒
傲寒 2020-12-31 09:28

I have a character vector of stopwords in R:

stopwords = c(\"a\" ,
            \"able\" ,
            \"about\" ,
            \"above\" ,
            \"abst         


        
2条回答
  •  说谎
    说谎 (楼主)
    2020-12-31 10:19

    Try this:

    str <- c("I have zero a accordance")
    
    stopwords = c("a", "able", "about", "above", "abst", "accordance", "yourself",
    "yourselves", "you've", "z", "zero")
    
    x <- unlist(strsplit(str, " "))
    
    x <- x[!x %in% stopwords]
    
    paste(x, collapse = " ")
    
    # [1] "I have"
    

    Addition: Writing a "removeWords" function is simple so it is not necessary to load an external package for this purpose:

    removeWords <- function(str, stopwords) {
      x <- unlist(strsplit(str, " "))
      paste(x[!x %in% stopwords], collapse = " ")
    }
    
    removeWords(str, stopwords)
    # [1] "I have"
    

提交回复
热议问题