Key to Scalable Parallelism - Regularity and Locality GPU Computing Forum
Eight Algorithm Optimizations Techniques (so far) Scatter to Gather transformation Privatization Work granularity coarsening Data tiling/reuse Data layout and traversal ordering Input data binning Input compaction Input extraction and regularization http://courses.engr.illinois.edu/ece598/hk/ Currently a graduate-level practical algorithm course GPU Computing Forum
“Orthogonal” to Traditional Parallel Algorithms for Teaching Tiling Privatization Regularization Compaction Binning Data Layout Granularity Coarsening Scatter to Gather MRI- Gridding ✓ CutCP Histo Stencil LBM BFS DMM MRI-Q SpMV SAD Tpacf FFT GPU Computing Forum