instead of a plain tuple. training (`bool`, *optional*, defaults to `False`): Whether or not to use the model in training mode (some modules like dropout modules have different behaviors between training and evaluation). NzTYou cannot specify both decoder_input_ids and decoder_inputs_embeds at the same timer