Reading Excel file without hidden columns in Python using Pandas or other modules

前端未结

关注

 1  1484

失恋的感觉

Can anyone tell me how to read the Excel file without hidden columns in Python with Pandas or any other modules?

When I try to read excel file using Pandas, for exam

相关标签:

1条回答

眼角桃花

2021-01-05 09:34

I don't think pandas does it out of the box.

Input

You will have to unfortunately do some redundant reading (twice). openpyxl does what you want -

import openpyxl
import pandas as pd

loc = 'sample.xlsx'
wb = openpyxl.load_workbook(loc)
ws = wb.get_sheet_by_name('Sheet1')

hidden_cols = []
for colLetter,colDimension in ws.column_dimensions.items():
    if colDimension.hidden == True:
        hidden_cols.append(colLetter)

df = pd.read_excel(loc)
unhidden = list( set(df.columns) - set(hidden_cols) )
df = df[unhidden]
print(df)

Output

Explanation

Reading the file first using openpyxl -

loc = 'C:/Users/FGB3140/Desktop/sample.xlsx'
wb = openpyxl.load_workbook(loc)
ws = wb.get_sheet_by_name('Sheet1')

Searching for hidden property in cells (this is where the hidden columns are captured)

hidden_cols = []
for colLetter,colDimension in ws.column_dimensions.items():
    if colDimension.hidden == True:
        hidden_cols.append(colLetter)

Read the same file using pandas - df = pd.read_excel(loc)

Find the unhidden columns by subtracting the hidden ones from the rest -

unhidden = list( set(df.columns) - set(hidden_cols) )

Finally, filter out the unhidden columns -

df = df[unhidden]

P.S

I know I could have done colDimension.hidden == False or simple if not colDimension.hidden - The goal here is to capture the hidden columns and then do the filtering accordingly. Hope this helps!

0 讨论(0)