I have data consisting of a comment-text and a sentiment-label. I already pre-processed the data using Spacy for an earlier project, so I my text isalready tokenized and without