Version-6: Software Pipelining


1 Version-6: Software Pipelining
V-2: 4-times unrolling

     1  L.D    F0,0(R1)
     2  ADD.D  F4,F0,F2
     3  S.D    0(R1),F4
     4  L.D    F0,-8(R1)
     5  ADD.D  F4,F0,F2
     6  S.D    -8(R1),F4
     7  L.D    F0,-16(R1)
     8  ADD.D  F4,F0,F2
     9  S.D    -16(R1),F4
    10  L.D    F0,-24(R1)
    11  ADD.D  F4,F0,F2
    12  S.D    -24(R1),F4
    13  SUBI   R1,R1,#32
    14  BNEZ   R1,LOOP

Instruction fragments of the software-pipelined version, in the order they appear on the slide (a leading number refers to the unrolled listing above):

        L.D    F0,0(R1)
        ADD.D  F4,F0,F2
        L.D    F0,-8(R1)
        SUBI   R1,R1,#16

Only -8 per iteration! => need an adjustment in the next offsets to reach -24.

        S.D    16(R1),F4     ; Store X[i]
     3  S.D    0(R1),F4      ; Store X[i]
    13  SUBI   R1,R1,#8
     5  ADD.D  F4,F0,F2      ; Add X[i-1]
        BNEZ   R1,LOOP
     7  L.D    F0,-16(R1)    ; Load X[i-2]
        L.D    F0,8(R1)      ; Load X[i-2]

        S.D    -8(R1),F4
        S.D    16(R1),F4
        ADD.D  F4,F0,F2
        S.D    -16(R1),F4
        ADD.D  F4,F0,F2
        S.D    8(R1),F4

The unrolled version consumes several (possibly all) registers when its instructions are reordered, so let's look at software pipelining: with 3 instructions per iteration, 3 iterations can be pipelined.

5 cycles per iteration and per element.

Roberto Giorgi, Universita' degli Studi di Siena, C219LEZ08-SL di 29
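The idea on the slide can be sketched in a few lines of Python. This is only one possible reading of the fragments: it assumes the loop computes X[i] = X[i] + s (with s held in F2), that the first three instructions form a start-up sequence, that the loop body stores X[i], adds for X[i-1], and loads X[i-2], and that the trailing S.D / ADD.D / S.D sequence drains the pipeline. The function names and the variables f0, f4, and s are illustrative stand-ins for the registers, not part of the original material.

```python
def add_scalar_reference(x, s):
    """Plain loop: every element gets s added to it."""
    return [v + s for v in x]


def add_scalar_software_pipelined(x, s):
    """Same computation, arranged as start-up code, steady-state loop, and drain.

    Each pass of the steady-state loop mixes work from three different
    iterations of the original loop: store for X[i], add for X[i-1],
    load for X[i-2]. f0 and f4 play the roles of registers F0 and F4.
    """
    x = list(x)
    n = len(x)
    if n < 2:
        # Too few elements to fill the pipeline; use the plain loop (assumption).
        return [v + s for v in x]

    i = n - 1            # walk from the last element down, like R1 on the slide

    # Start-up: fill the pipeline.
    f0 = x[i]            # load X[i]
    f4 = f0 + s          # add for X[i]
    f0 = x[i - 1]        # load X[i-1]

    # Steady state: one store, one add, one load per pass.
    while i >= 2:
        x[i] = f4        # store X[i]
        f4 = f0 + s      # add for X[i-1]
        f0 = x[i - 2]    # load X[i-2]
        i -= 1

    # Drain: finish the two iterations still in flight (i == 1 here).
    x[i] = f4            # store X[1]
    f4 = f0 + s          # add for X[0]
    x[i - 1] = f4        # store X[0]
    return x


if __name__ == "__main__":
    data = [1.0, 2.0, 3.0, 4.0, 5.0]
    assert add_scalar_software_pipelined(data, 10.0) == add_scalar_reference(data, 10.0)
```

Note that only two temporaries (f0 and f4) are live at any time, which mirrors the slide's remark that the reordered unrolled version needs many more registers. The sketch checks functional equivalence only; it says nothing about cycle counts.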

