order of directives in robots.txt, do they overwrite each other or complement each other?

后端 未结 1 1420
北海茫月
北海茫月 2020-12-18 13:08
User-agent: Googlebot
Disallow: /privatedir/

User-agent: *
Disallow: /

Now, what are disallowed for Googlebot: /privatedir/, or the whole website

相关标签:
1条回答
  • 2020-12-18 13:58

    According to the original robots.txt specification:

    1. A bot must follow the first record that matches its user-agent name.

    2. If such a record doesn’t exist, it must follow the record with User-agent: * (this line may not appear in more than one record).

    3. If such a record doesn’t exist, it doesn’t have to follow any record.

    So a bot never follows more than one record.


    For your example this means:

    • A bot that matches the name "Googlebot" is not allowed to crawl URLs with a path that starts with /privatedir/.
    • A bot that doesn’t match the name "Googlebot" is not allowed to crawl any URL.
    0 讨论(0)
提交回复
热议问题