“500 Internal Server Error” when combining Scrapy over Splash with an HTTP proxy

后端 未结 1 1352
广开言路
广开言路 2021-01-07 10:06

I\'m trying to crawl a Scrapy spider in a Docker container using both Splash (to render JavaScript) and Tor through Privoxy (to provide anonymity). Here is the docker-

相关标签:
1条回答
  • 2021-01-07 10:48

    Following the structure of the Aquarium project as suggested by paul trmbrth, I found that it is essential to name the .ini file default.ini, not proxy.ini (otherwise it doesn't get 'picked up' automatically). I managed to get the scraper to work in this way (cf. my self-answer to How to use Scrapy with both Splash and Tor over Privoxy in Docker Compose).

    0 讨论(0)
提交回复
热议问题