Presentation is loading. Please wait.

Presentation is loading. Please wait.

1 © 2012 Elsevier, Inc. All rights reserved. Chapter 16.

Similar presentations


Presentation on theme: "1 © 2012 Elsevier, Inc. All rights reserved. Chapter 16."— Presentation transcript:

1 1 © 2012 Elsevier, Inc. All rights reserved. Chapter 16

2 2 © 2012 Elsevier, Inc. All rights reserved. Figure 16.1 A complete Thrust program that sorts data on a GPU.

3 3 © 2012 Elsevier, Inc. All rights reserved. Figure 16.2 Thrust is an abstraction layer on top of CUDA C/C++.

4 4 © 2012 Elsevier, Inc. All rights reserved. Figure 16.3 Thrust interoperates smoothly with CUDA C/C++: (a) interfacing Thrust to CUDA, and (b) interfacing CUDA to Thrust.

5 5 © 2012 Elsevier, Inc. All rights reserved. Figure 16.4 SAXPY implementations in (a) CUDA C and (b) Thrust.

6 6 © 2012 Elsevier, Inc. All rights reserved. Figure 16.5 Sorting integers on the GeForce GTX480: Thrust’s dynamic sorting optimizations improve performance by a considerable margin in common use cases where keys are less than 32 bits.

7 7 © 2012 Elsevier, Inc. All rights reserved. Figure 16.6 snrm2 has low arithmetic intensity and therefore benefits greatly from fusion.

8 8 © 2012 Elsevier, Inc. All rights reserved. Figure 16.7 Data layouts for 3D float vectors: (a) AoS and (b) SoA.

9 9 © 2012 Elsevier, Inc. All rights reserved. Figure 16.8 The make_zip_iterator facilitates processing of data in structure of arrays format.

10 10 © 2012 Elsevier, Inc. All rights reserved. Figure 16.9 Implicit ranges improve performance by conserving memory bandwidth.


Download ppt "1 © 2012 Elsevier, Inc. All rights reserved. Chapter 16."

Similar presentations


Ads by Google