Why Scrapy returns an Iframe?

后端 未结 1 445
无人及你
无人及你 2021-01-07 10:39

i want to crawl this site by Python-Scrapy

i try this

class Parik(scrapy.Spider):
    name = \"ooshop\"
    allowed_domains = [\"http://www.ooshop.co         


        
相关标签:
1条回答
  • 2021-01-07 11:22

    The website is protected by Incapsula, a website security service. It's providing your "browser" with a challenge that it must perform before being given a special cookie that gives you access to the website itself.

    Fortunately, it's not that hard to bypass. Install incapsula-cracker and install its downloader middleware:

    DOWNLOADER_MIDDLEWARES = {
        'incapsula.IncapsulaMiddleware': 900
    }
    
    0 讨论(0)
提交回复
热议问题