Reading a portion of a large xlsx file with Python

野性不改 2020-12-06 21:40

I have a large .xlsx file with 1 million rows. I don't want to open the whole file in one go. I was wondering if I can read a chunk of the file, process it and then read th

2 answers
  •  囚心锁ツ
    2020-12-06 22:21

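    Yes, but not with chunksize. pandas supports that option for CSV files (read_csv); for Excel files it is not implemented, so xl.parse(sheet_name, chunksize=1000) fails. You can get the same effect by reading a window of rows at a time with skiprows and nrows, like so.

    import pandas as pd

    chunksize = 1000
    xl = pd.ExcelFile("myfile.xlsx")
    for sheet_name in xl.sheet_names:
      start = 0
      while True:
        # keep the header (row 0), skip the rows already processed, read one chunk
        chunk = xl.parse(sheet_name, skiprows=range(1, start + 1), nrows=chunksize)
        if chunk.empty:
          break
        # process chunk here
        start += chunksize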
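
If memory is the main concern, another option is to stream the rows with openpyxl's read-only mode and group them into fixed-size batches yourself. The following is a minimal sketch, not a definitive implementation: the helper names iter_chunks and iter_xlsx_chunks are made up for illustration, and it assumes openpyxl is installed and that the first row of the sheet is a header.

```python
from itertools import islice


def iter_chunks(rows, size):
    """Yield lists of up to `size` items from any iterable of rows."""
    it = iter(rows)
    while True:
        chunk = list(islice(it, size))
        if not chunk:
            return
        yield chunk


def iter_xlsx_chunks(path, sheet_name=None, size=1000):
    """Stream one worksheet in chunks without loading it all into memory."""
    # Imported here so the chunking helper above works without openpyxl installed.
    from openpyxl import load_workbook

    wb = load_workbook(path, read_only=True)  # read-only mode reads rows lazily
    try:
        ws = wb[sheet_name] if sheet_name else wb.active
        rows = ws.iter_rows(values_only=True)  # each row is a tuple of cell values
        header = next(rows, None)              # assumption: first row is a header
        for chunk in iter_chunks(rows, size):
            yield header, chunk
    finally:
        wb.close()  # read-only workbooks keep the file open until closed
```

Each iteration of `for header, rows in iter_xlsx_chunks("myfile.xlsx"):` then gives you at most `size` rows, which you can pass to `pd.DataFrame(rows, columns=header)` if you want to keep working in pandas. The sheet is read once from top to bottom, rather than being re-scanned for every chunk.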
    
