t takes the same input as the transform and returns the labels. By default, it looks for a "labels" key in the input when the input is a dict or a tuple whose second element is a dict. This heuristic should work well with many datasets, including the built-in torchvision datasets.
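
As a rough illustration of the lookup described above, the following pure-Python sketch mimics the default heuristic and shows what a custom getter might look like. It is not the library's implementation: the function names `default_labels_heuristic` and `custom_labels_getter`, the `"category_ids"` key, and the exact-match key comparison are all assumptions made for this example.

```python
def default_labels_heuristic(inputs):
    """Hypothetical sketch of the default lookup (not the library's code)."""
    if isinstance(inputs, dict):
        # Case 1: the input itself is a dict.
        candidate = inputs
    elif (
        isinstance(inputs, tuple)
        and len(inputs) >= 2
        and isinstance(inputs[1], dict)
    ):
        # Case 2: the input is a tuple whose second element is a dict,
        # e.g. an (image, target) pair.
        candidate = inputs[1]
    else:
        raise ValueError("Input is neither a dict nor a tuple with a dict in second position.")

    # Assumption: exact match on "labels"; the real heuristic may be more lenient.
    if "labels" not in candidate:
        raise ValueError('Could not find a "labels" key in the input.')
    return candidate["labels"]


def custom_labels_getter(inputs):
    """Hypothetical custom getter for a dataset that stores labels under another key."""
    _image, target = inputs
    return target["category_ids"]


# An (image, target) sample; the image is a placeholder string here.
sample = ("image-placeholder", {"boxes": [[0, 0, 10, 10]], "labels": [3]})
found = default_labels_heuristic(sample)

# A sample from a hypothetical dataset with a different label key.
other_sample = ("image-placeholder", {"boxes": [[0, 0, 5, 5]], "category_ids": [7]})
custom_found = custom_labels_getter(other_sample)
```

When the labels live under a different key or in a different structure, as in `other_sample`, the default lookup would fail and a callable of the second kind is the natural fallback.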