# -*- coding: utf-8 -*-
# Python3
import urllib
import urllib.request as url_req
opener = url_req.build_opener()
url=\'http://zh.wikipedia.org/wiki/\'+\"毛泽东\"
opene
You can not use arbitrary unicode strings as part of an URL. The URL must be properly encoded. See here for details:
http://www.w3schools.com/tags/ref_urlencode.asp
In particular you want to use the urllib.quote() or urllib.quote_plus() method of the Python API for quoting your URL properly.
http://docs.python.org/library/urllib.html
You could use urllib.parse.quote() to encode the path section of URL.
#!/usr/bin/env python3
from urllib.parse import quote
from urllib.request import urlopen
url = 'http://zh.wikipedia.org/wiki/' + quote("毛泽东")
content = urlopen(url).read()
The fantastic requests library does this for you out of the box:
>>> url='http://zh.wikipedia.org/wiki/'+"毛泽东"
>>> import requests
>>> r = requests.get(url)
>>> len(r.content)
818747