Downloading a web page and all of its resource files in Python

后端 未结 2 1064
盖世英雄少女心
盖世英雄少女心 2020-12-30 11:14

I want to be able to download a page and all of its associated resources (images, style sheets, script files, etc) using Python. I am (somewhat) familiar with urllib2 and k

相关标签:
2条回答
  • 2020-12-30 11:23

    websucker.py doesn't import css links. HTTrack.com is not python, it's C/C++, but it's a good, maintained, utility for downloading a website for offline browsing.

    http://www.mail-archive.com/python-bugs-list@python.org/msg13523.html [issue1124] Webchecker not parsing css "@import url"

    Guido> This is essentially unsupported and unmaintaned example code. Feel free to submit a patch though!

    0 讨论(0)
  • 2020-12-30 11:42

    Websucker? See http://effbot.org/zone/websucker.htm

    0 讨论(0)
提交回复
热议问题