that requires CUDA on a single XPU. These tests are skipped when there are no XPU available or number of xPUs is more than one. r<