I\'m trying to run a supervised text classification task with two different classes, and I heard that BERT can work great for those kinds of tasks. So far I have worked only wit