Low-power Task Scheduling for GPU Energy Reduction
Li Tang, Yiji Zhang
Introduction
▫DVFS (dynamic voltage and frequency scaling) implementation
▫Building a GPU linear regression power model
DVFS implementation
▫Dynamic voltage and frequency scaling (DVFS): a method that varies the amount of energy supplied to a task by scaling the operating voltage and frequency.
▫Power and energy consumption
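To make the voltage/frequency trade-off concrete, here is a minimal sketch that assumes the textbook CMOS dynamic power approximation P = C * V^2 * f and a task that needs a fixed number of clock cycles. The capacitance, cycle count, and operating points are hypothetical values chosen for illustration, not measurements from this work, and static (leakage) power is ignored.

```python
def dynamic_power(capacitance, voltage, frequency):
    """Approximate CMOS dynamic power in watts: P = C * V^2 * f."""
    return capacitance * voltage ** 2 * frequency


def task_energy(cycles, capacitance, voltage, frequency):
    """Energy in joules for a task that needs a fixed number of clock cycles."""
    runtime = cycles / frequency  # seconds
    return dynamic_power(capacitance, voltage, frequency) * runtime


# Hypothetical values, for illustration only.
C_EFF = 1e-9   # effective switched capacitance in farads
CYCLES = 1e9   # clock cycles the task needs

high = task_energy(CYCLES, C_EFF, voltage=1.2, frequency=1.0e9)
low = task_energy(CYCLES, C_EFF, voltage=0.9, frequency=0.5e9)
print(f"high V/f: {high:.2f} J, low V/f: {low:.2f} J")
```

Under this simplified model the energy reduces to C * V^2 * cycles, so the saving comes from the lower voltage, while the lower frequency only lengthens the runtime.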
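GPU architecture and linear regression power model
▫GPU architecture: on-chip, device memory
▫GPU linear power model: $P_{total} = \sum_{i} P_i^{max} \, u_i + P_0$
▫$P_{total}$: total power
▫$P_i^{max}$: maximum power of the i-th component
▫$u_i$: usage rate of the i-th component
▫$P_0$: intercept power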
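The linear power model expresses total power as the sum over components of (maximum power of the i-th component times its usage rate) plus an intercept power. The following is a minimal sketch of fitting such a model by ordinary least squares in plain Python. The function names, the two-component setup, and the sample data are hypothetical; the fitting procedure actually used in this work is not described on the slide.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def fit_power_model(usage_rows, measured_power):
    """Least-squares fit of P = sum_i w_i * u_i + p0; returns (weights, intercept)."""
    rows = [list(u) + [1.0] for u in usage_rows]  # trailing 1.0 fits the intercept
    k = len(rows[0])
    # Normal equations: (X^T X) beta = X^T y
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * y for r, y in zip(rows, measured_power)) for i in range(k)]
    beta = solve_linear_system(xtx, xty)
    return beta[:-1], beta[-1]


def predict_power(weights, intercept, usage):
    """Evaluate the fitted linear model for one set of usage rates."""
    return sum(w * u for w, u in zip(weights, usage)) + intercept


# Synthetic samples: (on-chip usage, device-memory usage) -> watts.
usage = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (0.5, 0.5), (1.0, 1.0)]
power = [40.0, 100.0, 70.0, 85.0, 130.0]
weights, intercept = fit_power_model(usage, power)
```

In this sketch the fitted weights play the role of the per-component maximum powers and the fitted constant plays the role of the intercept power.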
Energy measurement
▫NI USB-6216 DAQ + two FLUKE 80i-110s current clamps
Sampling rate:
▫10 readings per millisecond
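At 10 readings per millisecond, consecutive samples are 0.1 ms apart, so energy can be estimated by integrating power over the sample stream. The sketch below assumes each clamp yields a list of current readings in amperes on a supply rail of known, constant voltage. The function name, the 12 V rail, and the sample values are hypothetical, and the actual post-processing used in this work is not described on the slide.

```python
SAMPLES_PER_MS = 10
DT = 1e-3 / SAMPLES_PER_MS  # seconds between consecutive readings (0.1 ms)


def energy_joules(current_samples, rail_voltage):
    """Integrate P = V * I over time with the trapezoidal rule."""
    powers = [rail_voltage * i for i in current_samples]
    return sum((a + b) / 2 * DT for a, b in zip(powers, powers[1:]))


# Hypothetical trace: a constant 5 A draw on a 12 V rail for 0.1 s
# (1001 samples span 1000 intervals of 0.1 ms each).
trace = [5.0] * 1001
print(f"{energy_joules(trace, 12.0):.3f} J")
```

With two clamps, one way to combine them is to compute the energy per channel and add the results.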
Preliminary results
WAXPY function:
▫W[i] = alpha*X[i] + beta*Y[i] (i: thread number)
Kernel launch:
▫WAXPY >>
Vector size and type:
▫1,000,000 float
Thread*Block:
▫1*1, 1*4, 1*16, 1*64, 4*64, 16*64, 64*16
WAXPY Time =0.071
WAXPY GPU Power >
WAXPY GPU Energy
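The WAXPY operation and the thread/block configurations can be emulated on the CPU as a sanity check. This is a sketch only: it assumes that when there are fewer threads than vector elements, each thread strides through the vector (a grid-stride loop), which the slide does not state. The function names and the reduced vector size are hypothetical.

```python
def waxpy_reference(alpha, x, beta, y):
    """Element-wise W[i] = alpha * X[i] + beta * Y[i]."""
    return [alpha * xi + beta * yi for xi, yi in zip(x, y)]


def waxpy_grid_stride(alpha, x, beta, y, threads_per_block, blocks):
    """Emulate the kernel sequentially: each (block, thread) pair starts at its
    global index and advances by the total thread count (assumed scheme)."""
    n = len(x)
    w = [0.0] * n
    total_threads = threads_per_block * blocks
    for block in range(blocks):
        for thread in range(threads_per_block):
            i = block * threads_per_block + thread
            while i < n:
                w[i] = alpha * x[i] + beta * y[i]
                i += total_threads
    return w


# Thread*Block configurations listed on the slide.
CONFIGS = [(1, 1), (1, 4), (1, 16), (1, 64), (4, 64), (16, 64), (64, 16)]

# Small vector for a quick check (the slide uses 1,000,000 floats).
x = [float(i) for i in range(1000)]
y = [float(2 * i) for i in range(1000)]
expected = waxpy_reference(2.0, x, 3.0, y)
results = [waxpy_grid_stride(2.0, x, 3.0, y, t, b) == expected for t, b in CONFIGS]
```

Every configuration should produce the same output vector; only the time, power, and energy of the run are expected to differ on the GPU.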