drm/xe: Take job list lock in xe_sched_add_pending_job
authorMatthew Brost <matthew.brost@intel.com>
Thu, 3 Oct 2024 00:16:56 +0000 (17:16 -0700)
committerLucas De Marchi <lucas.demarchi@intel.com>
Wed, 16 Oct 2024 14:00:22 +0000 (09:00 -0500)
A fragile micro optimization in xe_sched_add_pending_job relied on both
the GPU scheduler being stopped and fence signaling stopped to safely
add a job to the pending list without the job list lock in
xe_sched_add_pending_job. Remove this optimization and just take the job
list lock.

Fixes: 7ddb9403dd74 ("drm/xe: Sample ctx timestamp to determine if jobs have timed out")
Signed-off-by: Matthew Brost <matthew.brost@intel.com>
Reviewed-by: Matthew Auld <matthew.auld@intel.com>
Link: https://patchwork.freedesktop.org/patch/msgid/20241003001657.3517883-2-matthew.brost@intel.com
(cherry picked from commit 90521df5fc43980e4575bd8c5b1cb62afe1a9f5f)
Signed-off-by: Lucas De Marchi <lucas.demarchi@intel.com>
drivers/gpu/drm/xe/xe_gpu_scheduler.h

index 5ad5629..64b2ae6 100644 (file)
@@ -63,7 +63,9 @@ xe_sched_invalidate_job(struct xe_sched_job *job, int threshold)
 static inline void xe_sched_add_pending_job(struct xe_gpu_scheduler *sched,
                                            struct xe_sched_job *job)
 {
+       spin_lock(&sched->base.job_list_lock);
        list_add(&job->drm.list, &sched->base.pending_list);
+       spin_unlock(&sched->base.job_list_lock);
 }
 
 static inline