I wanna try this model doc_to_vec as my experiment
I want to convert my dataset to the corpus as a t