How to prevent unauthorized spidering

后端 未结 6 1141
刺人心
刺人心 2021-02-06 04:59

I want to prevent automated html scraping from one of our sites while not affecting legitimate spidering (googlebot, etc.). Is there something that already exists to accomplish

6条回答
  •  遥遥无期
    2021-02-06 05:34

    robots.txt only works if the spider honors it. You can create a HttpModule to filter out spiders that you don't want crawling your site.

提交回复
热议问题