I have a dataset in which I have often repeating large text features, imagine the following:
user book_name book_contents score
Since many u