Prevent site data from being crawled and ripped

Front-end  Unresolved  12  835
终归单人心 2020-12-15 06:32

I'm looking into building a content site with possibly thousands of different entries, accessible by index and by search.

What are the measures I can take to preven

12 answers
  •  囚心锁ツ
    2020-12-15 06:51

    I used to have a system that would block or allow requests based on the User-Agent header. It relies on crawlers identifying themselves in their User-Agent, but most of them seem to do so.

    Of course, it won't work if they send a fake header that emulates a popular browser.
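
The User-Agent check described in this answer could look roughly like the sketch below. It is only an illustration, not the answerer's actual system: the blocklist entries, the function names `is_blocked_user_agent` and `block_crawlers`, and the choice to reject requests with no User-Agent at all are assumptions made for the example. It is written as plain WSGI middleware so it does not depend on any particular framework.

```python
# Hypothetical blocklist: these substrings are illustrative, not a vetted list.
BLOCKED_AGENT_SUBSTRINGS = ("scrapy", "python-requests", "curl", "wget")


def is_blocked_user_agent(user_agent):
    """Return True if the User-Agent is missing or contains a blocked substring."""
    if not user_agent:
        # Assumption: a request without a User-Agent is treated as a crawler.
        return True
    lowered = user_agent.lower()
    return any(token in lowered for token in BLOCKED_AGENT_SUBSTRINGS)


def block_crawlers(app):
    """Wrap a WSGI app so that blocked User-Agents receive a 403 response."""
    def middleware(environ, start_response):
        # WSGI exposes the User-Agent request header as HTTP_USER_AGENT.
        if is_blocked_user_agent(environ.get("HTTP_USER_AGENT")):
            start_response("403 Forbidden", [("Content-Type", "text/plain")])
            return [b"Forbidden"]
        return app(environ, start_response)
    return middleware
```

As the answer points out, a crawler that sends a browser-like User-Agent passes this check unchanged, so it only filters out clients that identify themselves honestly.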
