I have implemented an ensemble consisting of 3-layer MLPs with the following architecture:
super(MLP, self).__init__() self.linear1 = torch.nn.Linear(