drm/amdgpu: Clear reset flags from ras context
authorLijo Lazar <lijo.lazar@amd.com>
Wed, 4 Jun 2025 04:06:09 +0000 (09:36 +0530)
committerAlex Deucher <alexander.deucher@amd.com>
Wed, 18 Jun 2025 16:19:18 +0000 (12:19 -0400)
Once RAS errors are cleared with appropriate recovery mechanism, clear
reset flags also from RAS context. Otherwise, stale flag values could
affect the subsequent RAS reset handling on the device.

Signed-off-by: Lijo Lazar <lijo.lazar@amd.com>
Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c

index de09449..e91d4d3 100644 (file)
@@ -4414,8 +4414,10 @@ void amdgpu_ras_clear_err_state(struct amdgpu_device *adev)
        struct amdgpu_ras *ras;
 
        ras = amdgpu_ras_get_context(adev);
-       if (ras)
+       if (ras) {
                ras->ras_err_state = 0;
+               ras->gpu_reset_flags = 0;
+       }
 }
 
 void amdgpu_ras_set_err_poison(struct amdgpu_device *adev,