I have a corpus of text files, 9650 in total. Here are just a handful as examples:
['datasets/Autobiography.txt', 'datasets/CoralReefs.txt', 'datasets/DescentofMan