drm/amdgpu: Implement concurrent asic reset for XGMI.
authorAndrey Grodzovsky <andrey.grodzovsky@amd.com>
Thu, 29 Nov 2018 20:14:27 +0000 (15:14 -0500)
committerAlex Deucher <alexander.deucher@amd.com>
Mon, 3 Dec 2018 16:15:14 +0000 (11:15 -0500)
commitd4535e2c018bba71b49edeb5e396183920f5d341
tree894857dc0eff45db769fcaa8b0dbcd0d793d072f
parenta82400b57abb6aff068bb3b21d1cccd63acbb863
drm/amdgpu: Implement concurrent asic reset for XGMI.

Use per hive wq to concurrently send reset commands to all nodes
in the hive.

v2:
Switch to system_highpri_wq after dropping dedicated queue.
Fix non XGMI code path KASAN error.
Stop  the hive reset for each node loop if there
is a reset failure on any of the nodes.

Signed-off-by: Andrey Grodzovsky <andrey.grodzovsky@amd.com>
Acked-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
drivers/gpu/drm/amd/amdgpu/amdgpu.h
drivers/gpu/drm/amd/amdgpu/amdgpu_device.c