Generator expression never raises StopIteration

后端 未结 2 1398
礼貌的吻别
礼貌的吻别 2021-01-04 06:20

Inspired by my own answer, I didn\'t even understand how it worked myself, consider the following:

def has22(nums):
    it = iter(nums)
    return any(x == 2         


        
相关标签:
2条回答
  • 2021-01-04 06:51

    Interesting behaviour, but absolutely understandable.

    If you transform your generator expression to a generator:

    def _has22_iter(it):
        for x in it:
            yield x == 2 and x == next(it)
    
    def has22(nums):
        it = iter(nums)
        return any(_has22_iter(it))
    

    your generator raises StopIteration in the following conditions:

    • the generator function reaches its end
    • there is a return statement somewhere
    • there is a raise StopIteration somewhere

    Here, you have the third condition, so the generator is terminated.

    Compare with the following:

    def testgen(x):
        if x == 0:
            next(iter([])) # implicitly raise
        if x == 1:
            raise StopIteration
        if x == 2:
            return
    

    and do

    list(testgen(0)) # --> []
    list(testgen(1)) # --> []
    list(testgen(2)) # --> []
    list(testgen(3)) # --> []
    

    you get the same behaviour in all cases.

    0 讨论(0)
  • 2021-01-04 07:00

    The devs have decided that allowing this was a mistake because it can mask obscure bugs. Because of that, the acceptance of PEP 479 means this is going away.

    In Python 3.5 if you do from __future__ import generator_stop, and in Python 3.7 by default, the example in the question will fail with a RuntimeError. You could still achieve the same effect (allowing nums to not be precomputed) with some itertools magic:

    from itertools import tee, islice
    
    def has22(nums):
        its = tee(nums, 2)
        return any(x == y == 2 for x, y in 
                   zip(its[0], islice(its[1], 1, None)))
    

    The reason it ever worked in the first place has to do with how generators work. You can think of this for loop:

    for a in b:
        # do stuff
    

    As being (roughly) equivalent to this:

    b = iter(b) 
    while True:
        try:
            a = next(b)
        except StopIteration:
            break
        else:
            # do stuff
    

    Now, all the examples have two for loops nested together (one in the generator expression, one in the function consuming it), so that the inner loop iterates once when the outer loop performs its next call. What happens when the '# do stuff' in the inner loop is raise StopIteration?

    >>> def foo(): raise StopIteration
    >>> list(foo() for x in range(10))
    []
    

    The exception propagates out of the inner loop, since it isn't in its guard, and gets caught by the outer loop. Under the new behavior, Python will intercept a StopIteration that is about to propagate out of a generator and replace it with a RuntimeError, which won't be caught by the containing for loop.

    This also has the implication that code like this:

    def a_generator():
         yield 5
         raise StopIteration
    

    will also fail, and the mailing list thread gives the impression that this was considered bad form anyway. The proper way to do this is:

    def a_generator():
        yield 5
        return
    

    As you pointed out, list comprehensions already behave differently:

    >>> [foo() for x in range(10)]
    Traceback (most recent call last):
      File "<stdin>", line 1, in <module>
      File "<stdin>", line 1, in <listcomp>
      File "<stdin>", line 1, in foo
    StopIteration
    

    This is somewhat an implementation detail leaking - list comprehensions don't get transformed into a call to list with an equivalent generator expression, and apparently doing so would cause large performance penalties that the powers that be consider prohibitive.

    0 讨论(0)
提交回复
热议问题