Searching through webpage

前端 未结 2 818
[愿得一人]
[愿得一人] 2021-02-06 18:10

Hey I\'m working on a Python project that requires I look through a webpage. I want to look through to find a specific text and if it finds the text, then it prints something ou

相关标签:
2条回答
  • 2021-02-06 18:14

    You could do something simple like:

    
    import urllib2
    import re
    
    html_content = urllib2.urlopen('http://www.domain.com').read()
    
    matches = re.findall('regex of string to find', html_content);
    
    if len(matches) == 0: 
       print 'I did not find anything'
    else:
       print 'My string is in the html'
    
    0 讨论(0)
  • 2021-02-06 18:28

    lxml is awesome: http://lxml.de/parsing.html

    I use it regularly with xpath for extracting data from the html.

    The other option is http://www.crummy.com/software/BeautifulSoup/ which is great as well.

    0 讨论(0)
提交回复
热议问题