How do I disallow specific page from robots.txt

前端 未结 4 1355
孤独总比滥情好
孤独总比滥情好 2020-12-06 09:03

I am creating two pages on my site that are very similar but serve different purposes. One is to thank users for leaving a comment and the other is to encourage users to sub

相关标签:
4条回答
  • 2020-12-06 09:48
    Disallow: /thank-you-for-commenting
    

    in robots.txt

    Take a look at last.fm robots.txt file for inspiration.

    0 讨论(0)
  • 2020-12-06 10:01

    robots.txt files use regular expressions to match pages, so to avoid targeting more pages than you intend, you may need to add a $ to the end of the page name:

    Disallow: /thank-you-for-commenting$
    

    If you don't you'll also disallow page /thank-you-for-commenting-on-this-too

    0 讨论(0)
  • 2020-12-06 10:04

    This is very simple, any page that you want to disallow, just give root url of this file or folder. Just put this into your robots.txt file.

    Disallow: /thank-you-for-commenting
    
    0 讨论(0)
  • 2020-12-06 10:06

    You can also add a specific page with extension in robots.txt file. In case of testing, you can specify the test page path to disallow robots from crawling.

    For examples:

     Disallow: /index_test.php
     Disallow: /products/test_product.html
     Disallow: /products/     
    

    The first one Disallow: /index_test.php will disallow bots from crawling the test page in root folder.

    Second Disallow: /products/test_product.html will disallow test_product.html under the folder 'products'.

    Finally the last example Disallow: /products/ will disallow the whole folder from crawling.

    0 讨论(0)
提交回复
热议问题