Assuming we got tensors: img and bounding boxes: bbox of shape:
img
bbox
[B, C, H, W], [B, xmin, ymin, xmax, ymax], respectively