发表新帖

发表新帖

Strip HTML from strings in Python

前端未结

关注

 26  2316

难免孤独 2020-11-22 02:50

from mechanize import Browser
br = Browser()
br.open(\'http://somewebpage\')
html = br.response().readlines()
for line in html:
  print line

When p

26条回答

粉色の甜心 (楼主)

2020-11-22 03:16

You can use either a different HTML parser (like lxml, or Beautiful Soup) -- one that offers functions to extract just text. Or, you can run a regex on your line string that strips out the tags. See Python docs for more.

0 讨论(0)

查看其它26个回答
发布评论:

提交评论
- 加载中...

热议问题