i have the following data frame
|tokenCnt|filtered | |5 |[java,scala, list, java, linkedlist]| |3 |[also, genseq, parseq]