I'm trying to create an efficient SIMD version of a dot product to implement 2D convolution on the i16 type for an FIR filter.
#[cf