Identifying large bodies of text via BeautifulSoup or other python based extractors

后端 未结 2 371
独厮守ぢ
独厮守ぢ 2021-01-31 06:25

Given some random news article, I want to write a web crawler to find the largest body of text present, and extract it. The intention is to extract the physical news article on

2条回答
  •  闹比i
    闹比i (楼主)
    2021-01-31 07:25

    You might look at the python-readability package which does exactly this for you.

提交回复
热议问题