Download all files of a particular type from a website using wget stops in the starting url

前端 未结 3 1017
心在旅途
心在旅途 2021-02-09 02:41

The following did not work.

wget -r -A .pdf home_page_url

It stop with the following message:

....
Removing site.com         


        
3条回答
  •  独厮守ぢ
    2021-02-09 02:58

    It may be based on a robots.txt. Try adding -e robots=off.

    Other possible problems are cookie based authentication or agent rejection for wget. See these examples.

    EDIT: The dot in ".pdf" is wrong according to sunsite.univie.ac.at

提交回复
热议问题