GPU-based Hierarchical Computations for View Independent Visibility Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Sixth Indian Conference on Computer.

Slides:



Advertisements
Similar presentations
1GR2-00 GR2 Advanced Computer Graphics AGR Lecture 18 Image-based Rendering Final Review of Rendering What We Did Not Cover Learning More...
Advertisements

Computer graphics & visualization. Image Synthesis – WS 07/08 Dr. Jens Krüger – Computer Graphics and Visualization Group Our GPU Photon Tracing Pipeline.
Computer Graphics An Introduction. What’s this course all about? 05/10/2014 Lecture 1 2 We will cover… Graphics programming and algorithms Graphics data.
Week 10 - Monday.  What did we talk about last time?  Global illumination  Shadows  Projection shadows  Soft shadows.
Presented by Konstantinos Georgiadis. Abstract This method extends the Hierarchical Radiosity approach for environments whose geometry and surface attributes.
Y A S O O B A L I b o r n o n 1 9 t h F e b r u a r y i n K a n p u r d i s t r i c t o f U t t a r P r a d e s h. H e s t a r t e d s i n g i.
Visibility Culling. Back face culling View-frustrum culling Detail culling Occlusion culling.
Computational Geometry The systematic study of algorithms and data structures for geometric objects, with a focus on exact algorithms that are asymptotically.
Modeling the Interaction of Light Between Diffuse Surfaces Cindy M. Goral, Keenth E. Torrance, Donald P. Greenberg and Bennett Battaile Presented by: Chris.
Scalability with many lights II (row-column sampling, visibity clustering) Miloš Hašan.
Rapid Visualization of Large Point-Based Surfaces Tamy Boubekeur Florent Duguet Christophe Schlick Presented by Xavier Granier.
Albert Mas Ignacio Martín Gustavo Patow Fast Inverse Reflector Design FIRD Graphics Group of Girona Institut d’Informàtica i Aplicacions Universitat de.
Paper Presentation - An Efficient GPU-based Approach for Interactive Global Illumination- Rui Wang, Rui Wang, Kun Zhou, Minghao Pan, Hujun Bao Presenter.
18.1 Si31_2001 SI31 Advanced Computer Graphics AGR Lecture 18 Image-based Rendering Light Maps What We Did Not Cover Learning More...
3D Graphics Processor Architecture Victor Moya. PhD Project Research on architecture improvements for future Graphic Processor Units (GPUs). Research.
Matrix Row-Column Sampling for the Many-Light Problem Miloš Hašan (Cornell University) Fabio Pellacini (Dartmouth College) Kavita Bala (Cornell University)
M. Lastra, R. García, Dpt. Lenguajes y Sistemas Informáticos E.T.S.I. Informática - University of Granada [jrevelle, mlastral, ruben, ugr.es.
COMP236: Computer Graphics Spring’2000 Dinesh Manocha.
GPGPU overview. Graphics Processing Unit (GPU) GPU is the chip in computer video cards, PS3, Xbox, etc – Designed to realize the 3D graphics pipeline.
An Introduction to Ray Tracing CS /27/1998 Vic Baker.
Finish Adaptive Space Carving Anselmo A. Montenegro †, Marcelo Gattass ‡, Paulo Carvalho † and Luiz Velho † †
Visibility Queries Using Graphics Hardware Presented by Jinzhu Gao.
Computational Geometry The systematic study of algorithms and data structures for geometric objects, with a focus on exact algorithms that are asymptotically.
KinectFusion : Real-Time Dense Surface Mapping and Tracking IEEE International Symposium on Mixed and Augmented Reality 2011 Science and Technology Proceedings.
Advanced Computer Graphics March 06, Grading Programming assignments Paper study and reports (flipped classroom) Final project No written exams.
The Visibility Problem In many environments, most of the primitives (triangles) are not visible most of the time –Architectural walkthroughs, Urban environments.
Accidental pinhole and pinspeck cameras: revealing the scene outside the picture A. Torralba and W. T. Freeman Proceedings of 25 IEEE Conference on Computer.
David Luebke10/9/2015 CS 551 / 645: Introductory Computer Graphics David Luebke
In this video, you will learn how to view one or more parts of an expression as a single entity, by using the associative property to analyze the formula.
Interactively Modeling with Photogrammetry Pierre Poulin Mathieu Ouimet Marie-Claude Frasson Dép. Informatique et recherche opérationnelle Université de.
Real-Time Rendering SPEEDING UP RENDERING Lecture 04 Marina Gavrilova.
1 Shadows (2) ©Anthony Steed Overview n Shadows – Umbra Recap n Penumbra Analytical v. Sampling n Analytical Aspect graphs Discontinuity meshing.
1 Advanced Scene Management. 2 This is a game-type-oriented issue Bounding Volume Hierarchies (BVHs) Binary space partition trees (BSP Trees) “Quake”
November 13, 2014Computer Vision Lecture 17: Object Recognition I 1 Today we will move on to… Object Recognition.
Collaborative Visual Computing Lab Department of Computer Science University of Cape Town Graphics Topics in VR By Shaun Nirenstein.
1 Real-time visualization of large detailed volumes on GPU Cyril Crassin, Fabrice Neyret, Sylvain Lefebvre INRIA Rhône-Alpes / Grenoble Universities Interactive.
Real-time Graphics for VR Chapter 23. What is it about? In this part of the course we will look at how to render images given the constrains of VR: –we.
고급 컴퓨터 그래픽스 중앙대학교 컴퓨터공학부 손 봉 수. Course Overview Level : CSE graduate course No required text. We will use lecture notes and on-line materials This course.
1 Implicit Visibility and Antiradiance for Interactive Global Illumination Carsten Dachsbacher 1, Marc Stamminger 2, George Drettakis 1, Frédo Durand 3.
Introduction to Radiosity Geometry Group Discussion Session Jiajian (John) Chen 9/10/2007.
Efficient Streaming of 3D Scenes with Complex Geometry and Complex Lighting Romain Pacanowski and M. Raynaud X. Granier P. Reuter C. Schlick P. Poulin.
고급 컴퓨터 그래픽스 중앙대학교 컴퓨터공학부 손 봉 수. Course Overview Level : CSE graduate course No required text. We will use lecture notes and on-line materials This course.
Subject Name: Computer Graphics Subject Code: Textbook: “Computer Graphics”, C Version By Hearn and Baker Credits: 6 1.
Ray Tracing Fall, Introduction Simple idea  Forward Mapping  Natural phenomenon infinite number of rays from light source to object to viewer.
Honours Graphics 2008 Session 5. Today’s focus Rasterization Visibility determination Coarse / fine visibility determination.
Overview About the company What is it? What it does? How does it affect us.
Fast Global Illumination Including Specular Effects Xavier Granier 1 George Drettakis 1 Bruce J. Walter 2 1 iMAGIS -GRAVIR/IMAG-INRIA iMAGIS is a joint.
Controlling Memory Consumption of Hierarchical Radiosity with Clustering iMAGIS -GRAVIR/IMAG-INRIA iMAGIS is a joint project of CNRS/INRIA/UJF/INPG Xavier.
Global Illumination (2) Radiosity (2). The Radiosity Equation The "radiosity equation" describes the amount of energy which can be emitted from a surface,
David Luebke3/12/2016 Advanced Computer Graphics Lecture 3: More Ray Tracing David Luebke
1 Advanced Scene Management. 2 This is a game-type-oriented issue Bounding Volume Hierarchies (BVHs) Binary space partition trees (BSP Trees) “Quake”
3D Object Representations 2009, Fall. Introduction What is CG?  Imaging : Representing 2D images  Modeling : Representing 3D objects  Rendering : Constructing.
Quirin Meyer 1, Christian Eisenacher 1, Marc Stamminger 1, Carsten Dachsbacher 2 1 Erlangen-Nuremberg University, Department Informatik, Computer Graphics.
3D Max Reality 3D Max Reality provides a single click, fast, photo- realistic, cloud rendering. Now you can test it before buying. Register an account.
Chapter 1 An overview on Computer Graphics
고급 컴퓨터 그래픽스 (Advanced Computer Graphics)
CSC418 Computer Graphics Back Faces Visibility Algorithms.
Real-Time Ray Tracing Stefan Popov.
3D Object Representations
Interactive Computer Graphics
Introduction to Computer Graphics
Computational Geometry
CSCE 441: Computer Graphics Ray Tracing
Overview of Modeling 김성남.
Radiosity Sung-Eui Yoon (윤성의) CS580: Course URL:
Introduction to spagetti and meatballs
Hierarchical clustering analysis of 7785 genes (genes with a log-ratio variation in the 25th centile and >5% missing data were excluded) (A) A heat map.
Introduction to Computer Graphics
Real-time Global Illumination with precomputed probe
Presentation transcript:

GPU-based Hierarchical Computations for View Independent Visibility Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Sixth Indian Conference on Computer Vision, Graphics & Image Processing, prekshu, sharat}

Problem Statement ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Parallel Fast computation of view-independent visibility in a complex scene represented as a point model, using a GPU

Problem Statement ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Parallel Fast computation of view-independent visibility in a complex scene represented as a point model, using a GPU

Parallel Fast computation of view-independent visibility in a complex scene represented as a point model, using a GPU Problem Statement ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran

Problem Statement ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Parallel, fast computation of view-independent visibility in a complex scene represented as a point model, using a GPU

Problem Statement ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Parallel, fast computation of view-independent visibility in a complex scene represented as a point model, using a GPU

Problem Statement ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Parallel, fast computation of view-independent visibility in a complex scene represented as a point model, using a GPU

Problem Statement ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Parallel, fast computation of view-independent visibility in a complex scene represented as a point model, using a GPU

Problem Statement ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Parallel, fast computation of view-independent visibility in a complex scene represented as a point model, using a GPU

Problem Statement ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Parallel, fast computation of view-independent visibility in a complex scene represented as a point model, using a GPU

Problem Statement ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Parallel, fast computation of view-independent visibility in a complex scene represented as a point model, using a GPU

Result Video ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran VIDEO !

Challenges ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran O(N 3 ) Time Complexity N 2 point-pairs N occluders considered for every pair No surface Information Polygonal ModelPoint Model AABB [GKSD 07], Visibility Map for Global Illumination in Point Clouds, GRAPHITE 2007

Building the Octree Hierarchy ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran

ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Building the Octree Hierarchy

ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Building the Octree Hierarchy

ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Building the Octree Hierarchy

ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Building the Octree Hierarchy

ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Visibility between Point Clusters

ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Visibility between Point Clusters

ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Visibility between Point Clusters

ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Discussion: Visibility between Point Clusters O(M 2 log M) Time Complexity (for Octree with M leaves) M 2 log M << N 3 M << N … but still not fast enough !

ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Discussion: Visibility between Point Clusters O(M 2 log M) Time Complexity (for Octree with M leaves) M 2 log M << N 3 M << N … but still not fast enough !

ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Contributions O(M 2 log M) Time Complexity (for Octree with M leaves) M 2 log M << N 3 M << N … but still not fast enough ! Parallel, Fast computation of view-independent visibility in a complex scene represented as a point model, using a GPU Problem Statement & Contributions: Achieved upto 19x speed-up

ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran O(M 2 log M) Time Complexity (for Octree with M leaves) M 2 log M << N 3 M << N … but still not fast enough ! Achieved upto 19x speed-up Visibility Problem is highly parallel Problem Statement & Contributions: Contributions Parallel, Fast computation of view-independent visibility in a complex scene represented as a point model, using a GPU

ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran O(M 2 log M) Time Complexity (for Octree with M leaves) M 2 log M << N 3 M << N … but still not fast enough ! Achieved upto 19x speed-up No compromise on quality Visibility problem is highly parallel Problem Statement & Contributions: Contributions Parallel, Fast computation of view-independent visibility in a complex scene represented as a point model, using a GPU

ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran O(M 2 log M) Time Complexity (for Octree with M leaves) M 2 log M << N 3 M << N … but still not fast enough ! Achieved upto 19x speed-up Visibility problem is highly parallel No Dynamic Memory Allocation and Recursion No compromise on quality Problem Statement & Contributions: Contributions Parallel, Fast computation of view-independent visibility in a complex scene represented as a point model, using a GPU

ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran O(M 2 log M) Time Complexity (for Octree with M leaves) M 2 log M << N 3 M << N … but still not fast enough ! Achieved upto 19x speed-up Visibility problem is highly parallel No compromise on quality No Dynamic Memory Allocation and Recursion Problem Statement & Contributions: Contributions Parallel, Fast computation of view-independent visibility in a complex scene represented as a point model, using a GPU

Application to Global Illumination ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Parallel, Fast computa t ion of view-independent visibility in a complex scene represented as a point model, using a GPU

Application to Global Illumination ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Point modeled Cornell Room and the Stanford Bunny Parallel, Fast computa t ion of view-independent visibility in a complex scene represented as a point model, using a GPU

Application to Global Illumination ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Can be extended to the Digital Heritage Project

Plan ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Hierarchical Visibility Map (V-map) Construction on CPU Parallel V-map Construction on GPU Strategy 1 – Multiple threads per node Strategy 2 – One thread per node Strategy 3 – Multiple threads per node-pair Results Conclusion

Plan ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Hierarchical Visibility Map (V-map) Construction on CPU Parallel V-map Construction on GPU Strategy 1 – Multiple threads per node Strategy 2 – One thread per node Strategy 3 – Multiple threads per node-pair Results Conclusion

What is a V-map ? ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran V-map for a tree is a collection of visibility links for every node in the tree

What is a V-map ? ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran V-map for a tree is a collection of visibility links for every node in the tree The visibility link for any node N is a set L of nodes Node N

What is a V-map ? ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran V-map for a tree is a collection of visibility links for every node in the tree The visibility link for any node N is a set L of nodes Every point in any node L is guaranteed to be visible from every point in N Node N V1

What is a V-map ? ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran V-map for a tree is a collection of visibility links for every node in the tree The visibility link for any node N is a set L of nodes Every point in any node L is guaranteed to be visible from every point in N Node N V1 V2

What is a V-map ? ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran V-map for a tree is a collection of visibility links for every node in the tree The visibility link for any node N is a set L of nodes Every point in any node L is guaranteed to be visible from every point in N Node N V1 V2 V3

What is a V-map ? ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Level 0

What is a V-map ? ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran With respect to at any level, Level 0 Level 1

What is a V-map ? ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran With respect to at any level, Level 0 Level 1 Level 2

What is a V-map ? ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran With respect to at any level, Level 0 Level 1 Level 2 Level 3

CPU based V-map Construction ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Level L-2 Level L-1 Level L Level L-3 Level 0

CPU based V-map Construction ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Level L-2 Level L-1 Level L Level L-3 Level 0

CPU based V-map Construction ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Level L-2 Level L-1 Level L Level L-3 Level 0

CPU based V-map Construction ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Level L-2 Level L-1 Level L Level L-3 Level 0

CPU based V-map Construction ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Level L-2 Level L-1 Level L Level L-3 Level 0

CPU based V-map Construction ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Level L-2 Level L-1 Level L Level L-3 Level 0

CPU based V-map Construction ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Level L-2 Level L-1 Level L Level L-3 Level 0

CPU based V-map Construction ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Level L-2 Level L-1 Level L Level L-3 Level 0

CPU based V-map Construction ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Level L-2 Level L-1 Level L Level L-3 Level 0

CPU based V-map Construction ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Level L-2 Level L-1 Level L Level L-3 Level 0

Plan ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Hierarchical Visibility Map (V-map) Construction on CPU Parallel V-map Construction on GPU Strategy 1 – Multiple threads per node Strategy 2 – One thread per node Strategy 3 – Multiple threads per node-pair Results Conclusion

Plan ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Hierarchical Visibility Map (V-map) Construction on CPU Parallel V-map Construction on GPU Strategy 1 – Multiple threads per node Strategy 2 – One thread per node Strategy 3 – Multiple threads per node-pair Results Conclusion

Strategy 1: Multiple Threads per Node ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran A N I0I0 I1I1 I2I2 I3I3 Interaction List

Computation by a Single Thread ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Level L-2 Level L-1 Level L Level L-3 Level 0

Strategy 1: Multiple Threads per Node ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran A N I0I0 I1I1 I2I2 I3I3 Interaction List Degree of parallelism limited by the size of the nodes interaction list Serious Limitation: No support for Recursion and Dynamic memory allocation Discussion:

Plan ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Hierarchical Visibility Map (V-map) Construction on CPU Parallel V-map Construction on GPU Strategy 1 – Multiple threads per node Strategy 2 – One thread per node Strategy 3 – Multiple threads per node-pair Results Conclusion

Strategy 2: One Thread per Node ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Degree of parallelism limited by number of nodes per level Serious Limitation: No support Recursion and Dynamic memory allocation Discussion: A I0I0 I1I1 I2I2 I3I3 N I0I0 I1I1 Level L Level L Interaction List Performance increases as we go down the octree

Plan ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Hierarchical Visibility Map (V-map) Construction on CPU Parallel V-map Construction on GPU Strategy 1 – Multiple threads per node Strategy 2 – One thread per node Strategy 3 – Multiple threads per node-pair Results Conclusion

Strategy 3: Multiple Threads per Node-Pair ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Level L-2 Level L-1 Level L Level L-3 Level 0

Strategy 3: Multiple Threads per Node-Pair ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Level L-2 Level L-1 Level L Level L-3 Level 0

Strategy 3: Multiple Threads per Node-Pair ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Level L-2 Level L-1 Level L Level L-3 Level 0

Strategy 3: Multiple Threads per Node-Pair ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Level L-2 Level L-1 Level L Level L-3 Level 0

Strategy 3: Multiple Threads per Node-Pair ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Level L-2 Level L-1 Level L Level L-3 Level 0

Strategy 3: Multiple Threads per Node-Pair ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Level L-2 Level L-1 Level L Level L-3 Level 0

Strategy 3: Multiple Threads per Node-Pair ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Level L-2 Level L-1 Level L Level L-3 Level 0

Strategy 3: Multiple Threads per Node-Pair ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Level L-2 Level L-1 Level L Level L-3 Level 0

Strategy 3: Multiple Threads per Node-Pair ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Level L-2 Level L-1 Level L Level L-3 Level 0

Strategy 3: Multiple Threads per Node-Pair ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Level L-2 Level L-1 Level L Level L-3 Level 0

CPU v/s GPU ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran CPU AlgorithmGPU Algorithm

CPU v/s GPU ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran CPU AlgorithmGPU Algorithm

Strategy 3: Multiple Threads per Node-Pair ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Degree of parallelism is high as hundreds/thousands of threads run concurrently most of the time Recursion and Dynamic memory allocation happens on the CPU Performance is best at octree levels near to root Achieves upto 19x speed-up Discussion: No change in Quality

Plan ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Hierarchical Visibility Map (V-map) Construction on CPU Parallel V-map Construction on GPU Strategy 1 – Multiple threads per node Strategy 2 – One thread per node Strategy 3 – Multiple threads per node-pair Results Conclusion

Results: Visibility Validation ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran

Results: Visibility Validation ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran

Results: Timings ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Bunny in the Cornell Room (1,35,000 points) Max Speedup : 19X

Results: Timings ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Buddha in the Cornell Room (2,30,000 points) Max Speedup : 14X

Results: Timings ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Ganesha & Satyawati in the Cornell Room (3,50,000 points) Max Speedup : 17X

Results: Application to Global Illumination ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran

Results: Application to Global Illumination ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran

Results: Quality Comparison ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran

Results: Quality Comparison ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran

Discussion: Implementation Details ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Optimal Thread and Block Size Each block (16 x 16) contained 256 threads Every thread-block Grid contained no less than 64 blocks Asynchronous Computations Loop Unrolling

Plan ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Hierarchical Visibility Map (V-map) Construction on CPU Parallel V-map Construction on GPU Strategy 1 – Multiple threads per node Strategy 2 – One thread per node Strategy 3 – Multiple threads per node-pair Results Conclusion

Final Note ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran P arallel implementation of the view-independent mutual point-pair visibility problem on the modern day GPU (G80/G92 architecture) was presented Several strategies to achieve the desired parallelism were introduced and the most suitable one chosen Speed-ups upto 19X were reported while maintaining the quality of results Blocks of 256 threads achieved optimal GPU performance Good octree heights essential for high speed-ups and accurate visibility results By viewing this V-map construction as a pre-processing step, photo-realistic global illumination rendering of complex point models have been shown Parallel V-map construction solution can be thought as an intuition to solve other problems involving computing relationships amongst data at various levels in a hierarchy

Thank You ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Fractal: Mandel Zoom - Satellite Antenna, Mandelbrot set

Problem Statement ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Fast computation of view-independent visibility in a complex scene represented as a point model, using a GPU

Problem Statement ICVGIP, 2008Rhushabh Goradia, Prekshu Ajmera, Sharat Chandran Fast computation of view-independent visibility in a complex scene represented as a point model, using a GPU