Python Scraping JavaScript using Selenium and Beautiful Soup

后端 未结 1 414
花落未央
花落未央 2020-12-09 20:00

I\'m trying to scrape a JavaScript enables page using BS and Selenium. I have the following code so far. It still doesn\'t somehow detect the JavaScript (and returns a null

相关标签:
1条回答
  • 2020-12-09 20:25

    There are some mistakes in your code that are fixed below. However, the class "postText" must exist elsewhere, since it is not defined in the original source code. My revised version of your code was tested and is working on multiple websites.

    from selenium import webdriver  
    from selenium.common.exceptions import NoSuchElementException  
    from selenium.webdriver.common.keys import Keys  
    from bs4 import BeautifulSoup
    
    browser = webdriver.Firefox()  
    browser.get('http://techcrunch.com/2012/05/15/facebook-lightbox/')  
    html_source = browser.page_source  
    browser.quit()
    
    soup = BeautifulSoup(html_source,'html.parser')  
    #class "postText" is not defined in the source code
    comments = soup.findAll('div',{'class':'postText'})  
    print comments
    
    0 讨论(0)
提交回复
热议问题