ith_blocks" parameter represents the number of GRU levels.

args:
    base_encoder (nn.Module): The base encoder. Its weights can be shared with the
        feature_encoder's base_encoder because both have the same architecture.
    out_with_blocks (List[bool]): Its length is the number of GRU levels (and so the
        number of outputs). If an element is True, the output layer at that position
        gets an additional block.
    output_dim (int): The output dimension at each level (default: 256)
    block (Callable[..., nn.Module]): The type of basic block used for the downsampling
        and output layers (default: ResidualBlock)
    r&