I have the following for loops. Can I make them faster by running them in parallel, using multiprocessing and functions from joblib?
def DFT_matrix(N):