可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效，请关闭广告屏蔽插件后再试):

问题:

I have a function, which I am trying to speed up using the @jit decorator from Numba module. For me it is essential to speed this up as much as possible, because my main code calls upon this function for millions of times. Here is my function:

from numba import jit, types import Sweep    #My own module, works fine   @jit(types.Tuple((types.complex128[:], types.float64[:]))(types.complex128[:], types.complex128[:], types.float64[:], types.float64[:], types.float64)) def MultiModeSL(Ef, Ef2, Nf, u, tijd ):   dEdt= np.zeros(nrModes, dtype=np.complex128)   dNdt0= np.zeros(nrMoments, dtype=np.complex128)   Efcon = np.conjugate(Ef)    for j in range(nrModes):     for n in range(nrMoments):       dEdt +=  0.5 * CMx[:,j,n,0] * dg * (1+ A*1j) * Nf[n] * Ef[j] * np.exp( 1j* (Sweep.omega[j]-Sweep.omega) *tijd)       for k in range(nrModes):         if n==0:           dNdt0 += g* CMx[j, k, 0,:] * Efcon[j] * Ef[k] * np.exp( 1j* (Sweep.omega[k]-Sweep.omega[j]) *tijd)          dNdt0 += dg*(1+A*1j) * CMx[j,k,n,:] * Nf[n] * Efcon[j] * Ef[k] * np.exp( 1j* (Sweep.omega[k]-Sweep.omega[j]) *tijd)     dEdt += - 0.5*(pd-g)*Ef +     fbr*Ef2  + Kinj*EAinj*(1 + np.exp(1j*(u+Vmzm)) )    dNdt = Sweep.Jn - Nf*ed - dNdt0.real   return dEdt, dNdt

The function works perfectly well, without the Jit decorator. However, when I run it with the @jit, I get this error:

numba.errors.LoweringError: Failed at object (object mode frontend) Failed at object (object mode backend) dEdt.1 File "Functions.py", line 82 [1] During: lowering "$237 = call $236(Ef, Ef2, Efcon, Nf, dEdt.1, dNdt0, tijd, u)" at /home/humblebee/MEGA/GUI RC/General_Formula/Functions.py (82)

Line 82 corresponds to the For loop with j as iterator.

Can you help me out?

EDIT: Based on Peter's suggestion and combining it with Einsum, I was able to remove the loops. This made my function 3 times faster. Here is the new code:

def MultiModeSL(Ef, Ef2, Nf, u, tijd ):   dEdt= np.zeros(nrModes, dtype=np.complex128)   dNdt0= np.zeros(nrMoments, dtype=np.complex128)   Efcon = np.conjugate(Ef)   dEdt = 0.5*  np.einsum("k, jkm, mk, kj -> j",  dg*(1+A*1j), CMx[:, :, :, 0],  (Ef[:] * Nf[:, None] ),  np.exp( 1j* (OMEGA[:, None]-OMEGA) *tijd))   dEdt += - 0.5*(pd-g)*Ef + fbr*Ef2  + Kinj*EAinj*(1 + np.exp(1j*(u+Vmzm)) )    dNdt = - np.einsum("j, jkm, jk, kj ", g, CMx[:,:,:,0], (Ef*Efcon[:,None]),  np.exp( 1j* (OMEGA[:, None]-OMEGA) *tijd))   dNdt += -np.einsum("j, j, jknm, kjm, kj",dg, (1+A*1j), CMx, (Nf[:]*Efcon[:,None]*Ef[:,None,None]), np.exp( 1j* (OMEGA[:, None]-OMEGA) *tijd)  )   dNdt += JN - Nf*ed   return dNdt

Can you suggest more techniques to speed this up?

回答1:

I can't see from your code why this isn't vectorizable. Vectorizing can speed up this kind of Python code by around 100x. Not sure of how it does relative to jit.

It looks like you could, for instance, take your dEdt out of the loop, and compute it in one step with something like :

dEdt = 0.5 * (Cmx[:, :, :, 0] * dg * (1+A*1j) * Nf[:] * Ef[:, None] * np.exp( 1j* (Sweep.omega[None, :, None, None]-Sweep.omega) *tijd)).sum(axis=2).sum(axis=1) - 0.5*(pd-g)*Ef + fbr*Ef2  + Kinj*EAinj*(1 + np.exp(1j*(u+Vmzm)) )

(Though I don't really know what the dimensionality of your Sweet.omega is).

回答2:

There may be other issues, but one is that referencing an array in a module namespace seems to currently be unsupported (simple repro below). Try importing omega as a name.

In [14]: %%file Sweep.py     ...: import numpy as np     ...: constant_val = 0.5     ...: constant_arr = np.array([0, 1.5, 2.]) Overwriting Sweep.py  In [15]: Sweep.constant_val Out[15]: 0.5  In [16]: Sweep.constant_arr Out[16]: array([ 0. ,  1.5,  2. ])  In [17]: @njit     ...: def f(value):     ...:     return value + Sweep.constant_val     ...:   In [18]: f(100) Out[18]: 100.5  In [19]: @njit     ...: def f(value):     ...:     return value + Sweep.constant_arr[0]  In [20]: f(100) LoweringError: Failed at nopython (nopython mode backend) 'NoneType' object has no attribute 'module' File "<ipython-input-19-0a259ade6b9e>", line 3 [1] During: lowering "$0.3 = getattr(value=$0.2, attr=constant_arr)" at <ipython-input-19-0a259ade6b9e> (3)

文章来源: How to Solve Numba Lowering error?

标签

Sweep