I have over 100 heavy gz tables from Google Ngram data that I need to concatenate and create one data set. I am going to use this data for word embedding modeling. After doing s