How to protect/monitor your site from crawling by a malicious user

Situation:

  • Site with content protected by username/password (not all controlled since they can be trial/test users)
  • A normal search engine can't get at it
9 answers
  •  囚心锁ツ
    2021-02-06 19:21

    Added comments:

    • I know you can't completely protect something that a normal user should be able to see. I've been on both sides of the problem :)
    • From a developer's side, what do you think is the best ratio of time spent versus cases protected? I'd guess some simple user-agent checks would remove half or more of the potential crawlers, and I know you can spend months developing protection against the last 1%.
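
As an illustration of the "simple user-agent checks" mentioned above, here is a minimal Java sketch. The class name `UserAgentFilter`, the method `looksLikeCrawler`, and the deny-list entries are all hypothetical choices made for this example, not something from the original post. It only catches clients that announce themselves honestly; a crawler that sends a browser-like User-Agent passes straight through.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

final class UserAgentFilter {
    // Hypothetical deny-list of substrings commonly sent by scripted clients.
    // Illustrative only; a real list would need tuning against your own logs.
    private static final List<String> SUSPICIOUS_MARKERS =
            Arrays.asList("curl", "wget", "python-requests", "scrapy");

    // Returns true when the User-Agent header is missing, empty,
    // or contains one of the markers above (case-insensitive).
    static boolean looksLikeCrawler(String userAgent) {
        if (userAgent == null || userAgent.trim().isEmpty()) {
            return true;
        }
        String normalized = userAgent.toLowerCase(Locale.ROOT);
        for (String marker : SUSPICIOUS_MARKERS) {
            if (normalized.contains(marker)) {
                return true;
            }
        }
        return false;
    }
}
```

In a servlet-based application this check could be called from a filter with the value of the request's `User-Agent` header, and the result logged or used to reject the request.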

    Again, from a service provider's point of view, I also want to make sure that one user (crawler) doesn't consume CPU/bandwidth at the expense of others. Are there any good bandwidth/request limiters you can point out?
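
To make the request-limiter idea concrete, here is a minimal per-user token bucket sketched in Java. It is one possible approach, not a recommendation of a specific product: the class name `RequestLimiter`, its parameters, and the choice to key on a user id string are assumptions made for this example. Each user gets `capacity` requests up front and regains `refillPerSecond` requests per second. The map of buckets is never pruned here, so a real deployment would need some eviction.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;

final class RequestLimiter {
    // Per-user state: remaining tokens and the time of the last refill.
    private static final class Bucket {
        double tokens;
        long lastRefillNanos;

        Bucket(double tokens, long nowNanos) {
            this.tokens = tokens;
            this.lastRefillNanos = nowNanos;
        }
    }

    private final double capacity;
    private final double refillPerSecond;
    private final ConcurrentMap<String, Bucket> buckets = new ConcurrentHashMap<>();

    RequestLimiter(double capacity, double refillPerSecond) {
        this.capacity = capacity;
        this.refillPerSecond = refillPerSecond;
    }

    // Convenience overload using the system monotonic clock.
    boolean tryAcquire(String userId) {
        return tryAcquire(userId, System.nanoTime());
    }

    // Returns true if the user may make a request at time nowNanos.
    // The explicit timestamp makes the behaviour easy to test.
    boolean tryAcquire(String userId, long nowNanos) {
        Bucket bucket = buckets.computeIfAbsent(userId, id -> new Bucket(capacity, nowNanos));
        synchronized (bucket) {
            double elapsedSeconds = Math.max(0L, nowNanos - bucket.lastRefillNanos) / 1_000_000_000.0;
            bucket.tokens = Math.min(capacity, bucket.tokens + elapsedSeconds * refillPerSecond);
            bucket.lastRefillNanos = nowNanos;
            if (bucket.tokens >= 1.0) {
                bucket.tokens -= 1.0;
                return true;
            }
            return false;
        }
    }
}
```

For example, `new RequestLimiter(2, 1)` allows a burst of two requests per user and then one more per second; a request that gets `false` could be answered with HTTP 429 or delayed.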

    Response to comment: Platform specifications: the application is based on JBoss Seam running on JBoss AS, with an apache2 in front of it (running on Linux).
