The whole model has a hierarchical structure (e.g. 2 levels). That is, the model on the first stage takes the input and gives an output, then based on this output to choose