I want to build a sentence classifier that takes the sentence as a sequence of token embeddings. I\'m specifically interested in the methodology for learning the sentence embedd