urllib2 Error 403: Forbidden

后端 未结 2 619
生来不讨喜
生来不讨喜 2021-01-28 06:10

I have posted to this site and received really helpful guidance, i return with another question.

Where have i gone wrong here, I was prettty sure this is what is require

相关标签:
2条回答
  • 2021-01-28 06:29

    Actually the problem is that they block everyone who doesn't have a user-agent

    import urllib2
    
    request = urllib2.Request("http://www.cmegroup.com/trading/energy/natural-gas/natural-gas.html", None, {'User-Agent': 'Mozilla/5.0'})
    content = urllib2.urlopen(request).read()
    splitted_page = content.split("<span class=\"cmeSubHeading\">", 1);
    splitted_page = splitted_page[1].split("</span>", 1)
    print splitted_page[0]
    
    0 讨论(0)
  • 2021-01-28 06:51

    If you have to do GET requests I reccomend you the Requests python package. You can read its advantages in this post.

    However, if you're getting a 403 message maybe you're trying to access some restricted data (Wikipedia link).

    0 讨论(0)
提交回复
热议问题