Searching through webpage

前端未结

关注

 2  833

Hey I\'m working on a Python project that requires I look through a webpage. I want to look through to find a specific text and if it finds the text, then it prints something ou

相关标签:

2条回答

有刺的猬

2021-02-06 18:14

You could do something simple like:


import urllib2
import re

html_content = urllib2.urlopen('http://www.domain.com').read()

matches = re.findall('regex of string to find', html_content);

if len(matches) == 0: 
   print 'I did not find anything'
else:
   print 'My string is in the html'

0 讨论(0)

梦谈多话

2021-02-06 18:28

lxml is awesome: http://lxml.de/parsing.html

I use it regularly with xpath for extracting data from the html.

The other option is http://www.crummy.com/software/BeautifulSoup/ which is great as well.

0 讨论(0)
发布评论:

提交评论
- 加载中...