How to skip header and footer data in a pandas DataFrame?

孤街浪徒 2021-01-04 10:44

The first 15 rows of my Excel file are "header data", and everything after row 235 is "footer data". I need to read the data in between the header and footer data.

4 Answers
  • 2021-01-04 11:01

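    The data you want sits in file rows 16 through 235: the first 15 rows are header data and everything after row 235 is footer data.

    You can try this:

    import pandas as pd
    
    # header=None keeps every sheet row as data, so file row n is at position n - 1
    df = pd.read_excel("somefile.xls", header=None)
    
    # positions 15..234 are file rows 16..235; the slice end is exclusive
    df = df.iloc[15:235]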
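
The off-by-one in a positional slice is easy to get wrong, so here is a small sketch that checks it on an in-memory frame instead of a real workbook. The frame `raw` and its layout (15 header rows, 220 data rows, 5 footer rows) are made up for illustration and assume the sheet was read with `header=None`.

```python
import pandas as pd

# Hypothetical stand-in for a sheet read with header=None:
# 15 header rows, 220 data rows (file rows 16..235), 5 footer rows.
raw = pd.DataFrame(
    {"value": ["header"] * 15 + list(range(220)) + ["footer"] * 5}
)

# File row n (1-based) sits at position n - 1,
# so file rows 16..235 are positions 15..234 and the slice end is 235.
data = raw.iloc[15:235].reset_index(drop=True)

print(len(data))               # 220
print(data["value"].iloc[0])   # 0   (first data row)
print(data["value"].iloc[-1])  # 219 (last data row)
```

If the file is read with the default `header=0` instead, the first row becomes the column names and every position shifts by one, so the slice bounds would need adjusting.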
    
  • 2021-01-04 11:05

    To summarize: the header row is at index 15 from the top, and the footer takes up Y rows at the bottom. Here is how to read only the values in between:

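    import pandas as pd

    # header=15: use the row at zero-based index 15 as column names and drop the rows above it
    # Y is a placeholder: the number of footer rows to drop from the bottom of the sheet
    df = pd.read_excel("File.xls", header=15, skipfooter=Y)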
    

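    Do check that the column-name row and your data rows aren't being excluded: header is zero-based, so header=15 takes the 16th row of the sheet as the column names and discards the 15 rows above it.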
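
To see what `header` and `skipfooter` keep and drop without needing an Excel file, here is a minimal sketch that uses `pd.read_csv` on an in-memory string as a stand-in; `read_excel` accepts the same two parameters. The preamble and footer lines, the column names `id` and `value`, and the footer count of 3 are all invented for the example.

```python
import io

import pandas as pd

# Hypothetical file contents: 15 preamble lines, one column-name line,
# 220 data lines, then 3 footer lines.
lines = (
    [f"preamble {i}" for i in range(15)]
    + ["id,value"]
    + [f"{i},{i * 10}" for i in range(220)]
    + [f"footer {i}" for i in range(3)]
)
text = "\n".join(lines)

# header=15 uses the line at zero-based index 15 as column names;
# skipfooter=3 drops the last 3 lines (read_csv needs engine="python" for this).
df = pd.read_csv(io.StringIO(text), header=15, skipfooter=3, engine="python")

print(df.shape)
print(list(df.columns))
```

Checking the shape and the first and last rows like this is a quick way to confirm that no data row was swallowed by the header or footer settings.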

  • 2021-01-04 11:10

    You can also do this after loading the file:

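    # keep the rows whose index label is between 16 and 235 inclusive
    df = df[(df.index > 15) & (df.index < 236)]
    # shift the labels so the first kept row is 0 again
    df.index -= 16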
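
As a sketch of what the filter above does, the following builds a made-up 300-row frame with the default integer index, applies the same mask, and compares the manual index shift with `reset_index(drop=True)`, which renumbers from 0 without hard-coding the offset. The frame and its `value` column are assumptions for illustration only.

```python
import pandas as pd

# Hypothetical data with the default RangeIndex 0..299
df = pd.DataFrame({"value": range(300)})

# Label-based filter: keeps index labels 16..235
kept = df[(df.index > 15) & (df.index < 236)]

# Option 1: shift the labels by the known offset
shifted = kept.copy()
shifted.index -= 16

# Option 2: let pandas renumber the rows from 0
renumbered = kept.reset_index(drop=True)

print(len(kept))
print(list(shifted.index) == list(renumbered.index))
```

Note that the mask compares index labels, not positions, so it only matches row positions while the frame still has its default 0-based index.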
    
  • 2021-01-04 11:15

    Demo:

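    import pandas as pd

    xl = pd.ExcelFile(filepath)  # filepath: path to your workbook
    
    # total rows in the first sheet (index 0); sheet_by_index is an xlrd call,
    # so this assumes the workbook was opened with the xlrd engine
    total_rows = xl.book.sheet_by_index(0).nrows
    
    skiprows = 15        # header rows to skip at the top
    nrows = 235 - 15     # number of data rows to keep
    
    # number of footer rows;
    # the extra -1 accounts for the row pandas uses as column names
    skipfooter = total_rows - nrows - skiprows - 1
    
    df = xl.parse(0, skiprows=skiprows, skipfooter=skipfooter)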
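
The footer arithmetic can be pulled out into a small helper so it is easy to check by hand. This is only a sketch: the function name `footer_rows`, its `header_rows` parameter, and the total of 250 rows are hypothetical, chosen to mirror the calculation in the demo above.

```python
def footer_rows(total_rows: int, skiprows: int, nrows: int, header_rows: int = 1) -> int:
    """Return how many rows remain at the bottom of the sheet after the
    skipped rows, the column-name row(s), and the data rows are accounted for."""
    remaining = total_rows - skiprows - header_rows - nrows
    if remaining < 0:
        raise ValueError("sheet has fewer rows than skiprows + header_rows + nrows")
    return remaining


# Example with a hypothetical sheet of 250 rows:
# 15 skipped rows + 1 column-name row + 220 data rows leaves 14 footer rows.
skipfooter = footer_rows(total_rows=250, skiprows=15, nrows=235 - 15)
print(skipfooter)  # 14
```

When only the number of data rows is known, `pd.read_excel` also takes an `nrows` argument, so something like `pd.read_excel(filepath, skiprows=15, nrows=220)` may avoid counting the footer rows at all.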
    