I am trying to replace masking layer (mask for time steps) by simply adding the sample weights as 0 or 1. Tensor flow doc for losses mentions that it will simply scale the losse