How can I create a GzipFile instance from the “file-like object” that urllib.urlopen() returns?

后端 未结 3 2032
悲&欢浪女
悲&欢浪女 2020-12-31 06:10

I’m playing around with the Stack Overflow API using Python. I’m trying to decode the gzipped responses that the API gives.

import urllib, gzip

url = urllib         


        
相关标签:
3条回答
  • 2020-12-31 06:58

    Here is a new update for @stefanw's answer, to whom that might think it too expensive to use that much memory.

    Thanks to this article(https://www.enricozini.org/blog/2011/cazzeggio/python-gzip/, it explains why gzip doesn't work), the solution is to use Python3.

    import urllib.request
    import gzip
    
    response = urllib.request.urlopen('http://api.stackoverflow.com/1.0/badges/name')
    with gzip.GzipFile(fileobj=response) as f:
        for line in f:
            print(line)
    
    0 讨论(0)
  • 2020-12-31 06:59
    import urllib2
    import json
    import gzip
    import io
    
    url='http://api.stackoverflow.com/1.0/badges/name'
    page=urllib2.urlopen(url)
    gzip_filehandle=gzip.GzipFile(fileobj=io.BytesIO(page.read()))
    json_data=json.loads(gzip_filehandle.read())
    print(json_data)
    

    io.BytesIO is for Python2.6+. For older versions of Python, you could use cStringIO.StringIO.

    0 讨论(0)
  • 2020-12-31 07:01

    The urlopen docs list the supported methods of the object that is returned. I recommend wrapping the object in another class that supports the methods that gzip expects.

    Other option: call the read method of the response object and put the result in a StringIO object (which should support all methods that gzip expects). This maybe a little more expensive though.

    E.g.

    import gzip
    import json
    import StringIO
    import urllib
    
    url = urllib.urlopen('http://api.stackoverflow.com/1.0/badges/name')
    url_f = StringIO.StringIO(url.read())
    g = gzip.GzipFile(fileobj=url_f)
    j = json.load(g)
    
    0 讨论(0)
提交回复
热议问题