_channels, height, width)`. Hidden-states of the model at the output of each layer plus the optional initial embedding outputs. Nr