Using BeautifulSoup to extract specific dl and dd list elements

前端 未结 2 1281
别跟我提以往
别跟我提以往 2021-01-20 01:56

My first time posting. I am using BeautifulSoup 4 and python 2.7 (pycharm). I have a webpage containing elements and I need to extract specific elements where the tags

2条回答
  •  心在旅途
    2021-01-20 02:28

    If order is not important just make some changes:

    ...
    dl_data = soup.find_all("dd")
    for dlitem in dl_data:
        print dlitem.string
    

    Result:

    13 September 2015
    Starting at £40,130 per annum.
    15 December 2015
    Starting at £22,460 per annum.
    10 January 2014
    Starting at £18,160 per annum.
    

    For your latest request:

    for item in list(zip(soup.find_all("dd")[0::3],soup.find_all("dd")[2::3])):
        date, salary = item
        print ', '.join([date.string, salary.string])
    

    Output:

    13 September 2015, 100
    14 September 2015, 200
    

提交回复
热议问题