I am trying to do tokenize my text. The type of my data is pyspark.sql.dataframe.DataFrame I read it by the following code:
pyspark.sql.dataframe.DataFrame
df1=spark.read.json(&qu