How should a highly multithreaded architecture, like a GPU, pick which threads to issue? Cache-Conscious Wavefront Scheduling Use feedback from the memory system
Thread Scheduler Cache System Better hit rate than optimal replacement with other schedulers Fix Your Replacement Policy! Feedback! Access Thread 0 Access Thread 1 Thread 2 Thread 3 Access Thread 0 Access Thread 0 63% performance improvement!
Timothy G. Rogers 1, Mike O’Connor 2, Tor M. Aamodt 1 1 The University of British Columbia 2 AMD Research Cache-Conscious Wavefront Scheduling Today 3:30pm Right Here in the Columbia Ballroom