drm/amdgpu: move GTT to shmem after eviction for hibernation
authorSamuel Zhang <guoqing.zhang@amd.com>
Thu, 10 Jul 2025 06:23:10 +0000 (14:23 +0800)
committerMario Limonciello <mario.limonciello@amd.com>
Thu, 10 Jul 2025 15:49:45 +0000 (10:49 -0500)
When hibernate with data center dGPUs, huge number of VRAM BOs evicted
to GTT and takes too much system memory. This will cause hibernation
fail due to insufficient memory for creating the hibernation image.

Move GTT BOs to shmem in KMD, then shmem to swap disk in kernel
hibernation code to make room for hibernation image.

Signed-off-by: Samuel Zhang <guoqing.zhang@amd.com>
Reviewed-by: Christian König <christian.koenig@amd.com>
Link: https://lore.kernel.org/r/20250710062313.3226149-3-guoqing.zhang@amd.com
Signed-off-by: Mario Limonciello <mario.limonciello@amd.com>
drivers/gpu/drm/amd/amdgpu/amdgpu_device.c

index d282c07..723ab95 100644 (file)
@@ -5029,8 +5029,16 @@ static int amdgpu_device_evict_resources(struct amdgpu_device *adev)
                return 0;
 
        ret = amdgpu_ttm_evict_resources(adev, TTM_PL_VRAM);
-       if (ret)
+       if (ret) {
                dev_warn(adev->dev, "evicting device resources failed\n");
+               return ret;
+       }
+
+       if (adev->in_s4) {
+               ret = ttm_device_prepare_hibernation(&adev->mman.bdev);
+               if (ret)
+                       dev_err(adev->dev, "prepare hibernation failed, %d\n", ret);
+       }
        return ret;
 }