What is the rationale for the SAXPY routine of the BLAS specification (http://www.netlib.org/blas/#_level_1) to perform the update (1): Y = A * X + Y, rather than the new assign