I am new to Apache Spark and have run into the following problem. I have this dataset:

| label | words |
| -------- | ------------------- |
| 0 | w