I have the following code where the user can press p to pause the video, draw a bounding box around the object to be tracked, and then press Enter (carriage return)
p