Multiprocessing works in Ubuntu, doesn't in Windows

后端 未结 1 483
盖世英雄少女心
盖世英雄少女心 2021-01-13 16:47

I am trying to use this example as a template for a queuing system on my cherrypy app.

I was able to convert it from python 2 to python 3 (change from Queue im

相关标签:
1条回答
  • 2021-01-13 16:56

    The problem is that parts of the MyBus object are not picklable, and you're saving an instance of MyBus to your Broker instance. Because Windows lacks fork() support, when you call broker.start(), the entire state of broker must be pickled and recreated in the child process that multiprocessing spawns to execute broker.run. It works on Linux because Linux supports fork; it doesn't need to pickle anything in this case - the child process contains the complete state of the parent as soon as it is forked.

    There are two ways to sole this problem. The first, and more difficult, way, is to make your broker instance picklable. To do that, you need to make MyBus picklable. The error you're getting right now refers to the logger attribute on MyBus, which is not picklable. That one is easy to fix; just add __getstate__/__setstate__ methods to MyBus, which are used to control how the object is pickled/unpickled. If we remove the logger when we pickle, and recreate it when we unpickle, we'll work around the issue:

    class MyBus(wspbus.Bus):
        ... 
        def __getstate__(self):
            self_dict = self.__dict__
            del self_dict['logger']
            return self_dict
    
        def __setstate__(self, d):
            self.__dict__.update(d)
            self.open_logger()
    

    This works, but then we hit another pickling error:

    Traceback (most recent call last):
      File "async2.py", line 121, in <module>
        broker.start()
      File "C:\python34\lib\multiprocessing\process.py", line 105, in start
        self._popen = self._Popen(self)
      File "C:\python34\lib\multiprocessing\context.py", line 212, in _Popen
        return _default_context.get_context().Process._Popen(process_obj)
      File "C:\python34\lib\multiprocessing\context.py", line 313, in _Popen
        return Popen(process_obj)
      File "C:\python34\lib\multiprocessing\popen_spawn_win32.py", line 66, in __init__
        reduction.dump(process_obj, to_child)
      File "C:\python34\lib\multiprocessing\reduction.py", line 60, in dump
        ForkingPickler(file, protocol).dump(obj)
    _pickle.PicklingError: Can't pickle <class 'cherrypy.process.wspbus._StateEnum.State'>: attribute lookup State on cherrypy.process.wspbus failed
    

    Turns outcherrypy.process.wspbus._StateEnum.State, which is an attribute on the wspbus.Bus class inherited by MyBus, is a nested class, and nested classes can't be pickled:

    class _StateEnum(object):
        class State(object):
            name = None
            def __repr__(self):
                return "states.%s" % self.name
    

    The State object (surprise) is used to track the Bus instance's state. Since we're doing the pickling before we start up the bus, we could just remove the state attribute from the object when we pickle, and set it to States.STOPPED when we unpickle.

    class MyBus(wspbus.Bus):
        def __init__(self, name=""):
            wspbus.Bus.__init__(self)
            self.open_logger(name)
            self.subscribe("log", self._log)
    
        def __getstate__(self):
            self_dict = self.__dict__
            del self_dict['logger']
            del self_dict['state']
            return self_dict
    
        def __setstate__(self, d):
            self.__dict__.update(d)
            self.open_logger()
            self.state = wspbus.states.STOPPED  # Initialize to STOPPED
    

    With these changes, the code works as expected! The only limitation is that it's only safe to pickle MyBus if the bus hasn't started yet, which is fine for your usecase.

    Again, this is the hard way. The easy way is to just remove the need to pickle the MyBus instance altogether. You can just create the MyBus instance in the child process, rather than the parent:

    class Broker(Process):
        def __init__(self, queue):
            Process.__init__(self)
            self.queue = queue
    
    ...
        def run(self):
            self.bus = MyBus(Broker.__name__)  # Create the instance here, in the child
            self.bus.subscribe("main", self.check)
            self.bus.start()
            self.bus.block(interval=0.01)
    

    As long as you don't need to access broker.bus in the parent, this is the simpler option.

    0 讨论(0)
提交回复
热议问题