pd.read_html() imports a list rather than a dataframe

面向向阳花 2020-12-01 16:54

I used pd.read_html() to import a table from a webpage but instead of structuring the data as a dataframe Python imported it as a list. How can I import the da

2 Answers
  • 2020-12-01 17:09

    `pd.read_html()` returns a list of DataFrames, because an HTML source can contain multiple tables. Select the one you want by index. In your case, there is a single DataFrame:

    dfs = pd.read_html(url)
    df = dfs[0]
    print(df)
    

    Note that if the HTML source contains no tables, `read_html()` raises a `ValueError` rather than returning an empty list.

  • 2020-12-01 17:09
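    import pandas as pd

    # read_html() needs an HTML parser installed (lxml, or bs4 + html5lib).
    # pandas loads the parser itself, so `import html5lib` is not required here.
    url = 'http://www.fdic.gov/bank/individual/failed/banklist.html'
    dfs = pd.read_html(url)  # list of DataFrames, one per table found
    df = pd.concat(dfs)      # stack all tables into a single DataFrame
    df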
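
One caveat about `pd.concat(dfs)`: it gives a tidy result only when every table on the page has the same columns. The sketch below uses two small hand-made DataFrames (hypothetical stand-ins for what `read_html()` might return, not data from the FDIC page) to show what happens when the columns differ.

```python
import pandas as pd

# Hypothetical stand-ins for the list that pd.read_html() returns.
dfs = [
    pd.DataFrame({"bank": ["Alpha Bank"], "city": ["Austin"]}),
    pd.DataFrame({"bank": ["Beta Bank"], "state": ["TX"]}),
]

# concat keeps the union of all columns and fills the gaps with NaN.
# ignore_index=True renumbers the rows instead of repeating 0, 0.
combined = pd.concat(dfs, ignore_index=True)
print(combined)
```

If the tables on a page are unrelated, picking one by index (`dfs[0]`) is usually the safer choice.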
    
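To make the first answer's points concrete, here is a minimal sketch that parses an inline HTML string instead of a live URL. The HTML snippet and its column names are made up for illustration, and `flavor="lxml"` assumes the `lxml` package is installed. It shows that `read_html()` returns a list, how to pick a table by index or with the `match` argument, and that a page with no tables raises `ValueError`.

```python
from io import StringIO

import pandas as pd

# Hypothetical page content with two tables (illustration only).
HTML_TWO_TABLES = """
<table>
  <tr><th>bank</th><th>city</th></tr>
  <tr><td>Alpha Bank</td><td>Austin</td></tr>
</table>
<table>
  <tr><th>year</th><th>count</th></tr>
  <tr><td>2019</td><td>4</td></tr>
</table>
"""

# read_html() always returns a list, one DataFrame per <table> found.
# flavor="lxml" is an assumption: it requires lxml to be installed.
tables = pd.read_html(StringIO(HTML_TWO_TABLES), flavor="lxml")
print(type(tables), len(tables))

# Pick the table you need by position.
first = tables[0]
print(first)

# match= keeps only tables containing text that matches the pattern.
year_tables = pd.read_html(StringIO(HTML_TWO_TABLES), match="year", flavor="lxml")

# With no <table> in the source, read_html() raises ValueError
# instead of returning an empty list.
no_table_error = None
try:
    pd.read_html(StringIO("<p>no tables here</p>"), flavor="lxml")
except ValueError as exc:
    no_table_error = exc
    print("ValueError:", exc)
```

Wrapping the input in `StringIO` avoids the deprecation warning that recent pandas versions emit when a literal HTML string is passed directly.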