I am using AllenNLP to train a hierarchical attention network model. My training dataset consists of a list of JSON objects (e