I am working on a C++ library that needs to perform many matrix operations (dot product, transpose, matrix addition) and arithmetic operations (element-wise add, sub on matric