I know that you can scrapy shell -s USER_AGENT='custom user agent' 'http://www.example.com'
to change the USER_AGENT
, but how do you add request headers?
there is no current way to add headers directly on cli, but you could do something like:
$ scrapy shell
...
...
>>> from scrapy import Request
>>> req = Request('yoururl.com', headers={"header1":"value1"})
>>> fetch(req)
This will update the current shell information with that new request.
来源:https://stackoverflow.com/questions/37010524/set-headers-for-scrapy-shell-request