Download all files of a particular type from a website using wget stops in the starting url

前端 未结 3 1020
心在旅途
心在旅途 2021-02-09 02:41

The following did not work.

wget -r -A .pdf home_page_url

It stop with the following message:

....
Removing site.com         


        
3条回答
  •  栀梦
    栀梦 (楼主)
    2021-02-09 03:16

    This is certainly because of the links in the HTML don't end up with /.

    Wget will not follow this has it think it's a file (but doesn't match your filter):

    page
    

    But will follow this:

    page
    

    You can use the --debug option to see if it's the actual problem.

    I don't know any good solution for this. In my opinion this is a bug.

提交回复
热议问题