Python how to parallelize loops

后端 未结 2 922
有刺的猬
有刺的猬 2021-01-27 10:11

I am very new to multi-threading and multi-processing and trying to make for loop parallel. I searched similar questions, and created code based on multiprocessing module.

相关标签:
2条回答
  • 2021-01-27 10:52

    As @Alex Hall mentioned, remove iteration from fun1. Also, wait till all pool's workers are finished.

    PEP8 note: import timeit, multiprocessing is bad practice, split it to two lines.

    import multiprocessing
    import timeit
    
    
    start_time = timeit.default_timer()
    
    d1 = dict( (i,tuple([i*0.1,i*0.2,i*0.3])) for i in range(500000) )
    d2 = {}
    
    def fun1(gn):
        x,y,z = d1[gn]
        d2.update({gn: ((x+y+z)/3)})
    
    
    if __name__ == '__main__':
        gen1 = [x for x in d1.keys()]
    
        # serial processing
        for gn in gen1:
            fun1(gn)
    
        # paralel processing
        p = multiprocessing.Pool(3)
        p.map(fun1, gen1)
        p.close()
        p.join()
    
        print('Script finished')
        stop_time = timeit.default_timer()
        print(stop_time - start_time)
    
    0 讨论(0)
  • 2021-01-27 11:00

    p.map does the looping for you, so remove the for i in gn:.

    That is, p.map applies fun1 to each element of gen1, so gn is one of those elements.

    0 讨论(0)
提交回复
热议问题