Tone Mapping on GPUs Cliff Woolley University of Virginia Slides courtesy Nolan Goodnight.

Slides:

Advertisements

Similar presentations

Accelerating Real-Time Shading with Reverse Reprojection Caching Diego Nehab 1 Pedro V. Sander 2 Jason Lawrence 3 Natalya Tatarchuk 4 John R. Isidoro 4.

Advertisements

COMPUTER GRAPHICS SOFTWARE.

COMPUTER GRAPHICS CS 482 – FALL 2014 NOVEMBER 10, 2014 GRAPHICS HARDWARE GRAPHICS PROCESSING UNITS PARALLELISM.

Normal Map Compression with ATI 3Dc™ Jonathan Zarge ATI Research Inc.

Understanding the graphics pipeline Lecture 2 Original Slides by: Suresh Venkatasubramanian Updates by Joseph Kider.

Status – Week 257 Victor Moya. Summary GPU interface. GPU interface. GPU state. GPU state. API/Driver State. API/Driver State. Driver/CPU Proxy. Driver/CPU.

RealityEngine Graphics Kurt Akeley Silicon Graphics Computer Systems.

Visualization and graphics research group CIPIC May 25, 2004Realistic Image Synthesis1 Tone Mapping Presented by Lok Hwa.

WSCG 2007 Hardware Independent Clipmapping A. Seoane, J. Taibo, L. Hernández, R. López, A. Jaspe VideaLAB – University of A Coruña (Spain)

Fast GPU Histogram Analysis for Scene Post- Processing Andy Luedke Halo Development Team Microsoft Game Studios.

Hardware-Accelerated Adaptive EWA Volume Splatting Wei Chen ZJU Liu Ren CMU Matthias Zwicker MIT Hanspeter Pfister MERL.

Real-time Dynamic HDR Based Lighting in a Static Environment Marcus Hennix Daniel Johansson Gunnar Johansson Martin Wassborn.

2009/04/07 Yun-Yang Ma.  Overview  What is CUDA ◦ Architecture ◦ Programming Model ◦ Memory Model  H.264 Motion Estimation on CUDA ◦ Method ◦ Experimental.

Computer Graphics Hardware Acceleration for Embedded Level Systems Brian Murray

GPU-Based Frequency Domain Volume Rendering Ivan Viola, Armin Kanitsar, and Meister Eduard Gröller Institute of Computer Graphics and Algorithms Vienna.

Photographic Tone Reproduction for Digital Images Erik Reinhard Utah Mike Stark Peter Shirley Jim Ferwerda Cornell.

IN4151 Introduction 3D graphics 1 Introduction to 3D computer graphics part 2 Viewing pipeline Multi-processor implementation GPU architecture GPU algorithms.

GPU Simulator Victor Moya. Summary Rendering pipeline for 3D graphics. Rendering pipeline for 3D graphics. Graphic Processors. Graphic Processors. GPU.

High Dynamic Range Emeka Ezekwe M11 Christopher Thayer M12 Shabnam Aggarwal M13 Charles Fan M14 Manager: Matthew Russo 6/26/

Hardware-Based Nonlinear Filtering and Segmentation using High-Level Shading Languages I. Viola, A. Kanitsar, M. E. Gröller Institute of Computer Graphics.

1 Photographic Tone Reproduction for Digital Images Brandon Lloyd COMP238 October 2002.

Evolution of the Programmable Graphics Pipeline Patrick Cozzi University of Pennsylvania CIS Spring 2011.

The programmable pipeline Lecture 10 Slide Courtesy to Dr. Suresh Venkatasubramanian.

Status – Week 260 Victor Moya. Summary shSim. shSim. GPU design. GPU design. Future Work. Future Work. Rumors and News. Rumors and News. Imagine. Imagine.

GPU Graphics Processing Unit. Graphics Pipeline Scene Transformations Lighting & Shading ViewingTransformations Rasterization GPUs evolved as hardware.

High Performance in Broad Reach Games Chas. Boyd

Ray Tracing and Photon Mapping on GPUs Tim PurcellStanford / NVIDIA.

A Multigrid Solver for Boundary Value Problems Using Programmable Graphics Hardware Nolan Goodnight Cliff Woolley Gregory Lewin David Luebke Greg Humphreys.

A Multigrid Solver for Boundary Value Problems Using Programmable Graphics Hardware Nolan Goodnight Cliff Woolley Gregory Lewin David Luebke Greg Humphreys.

1 Introduction to Computer Graphics with WebGL Ed Angel Professor Emeritus of Computer Science Founding Director, Arts, Research, Technology and Science.

GPU Programming Robert Hero Quick Overview (The Old Way) Graphics cards process Triangles Graphics cards process Triangles Quads.

Enhancing GPU for Scientific Computing Some thoughts.

Spatial Tone Mapping in High Dynamic Range Imaging Zhaoshi Zheng.

Interactive Time-Dependent Tone Mapping Using Programmable Graphics Hardware Nolan GoodnightGreg HumphreysCliff WoolleyRui Wang University of Virginia.

Cg Programming Mapping Computational Concepts to GPUs.

NVIDIA Fermi Architecture Patrick Cozzi University of Pennsylvania CIS Spring 2011.

Stream Processing Main References: “Comparing Reyes and OpenGL on a Stream Architecture”, 2002 “Polygon Rendering on a Stream Architecture”, 2000 Department.

Shadow Mapping Chun-Fa Chang National Taiwan Normal University.

A Closer Look At GPUs By Kayvon Fatahalian and Mike Houston Presented by Richard Stocker.

MULTIMEDIA INPUT / OUTPUT TECHNOLOGIES

Hardware-accelerated Rendering of Antialiased Shadows With Shadow Maps Stefan Brabec and Hans-Peter Seidel Max-Planck-Institut für Informatik Saarbrücken,

Accelerated Stereoscopic Rendering using GPU François de Sorbier - Université Paris-Est France February 2008 WSCG'2008.

M. Jędrzejewski, K.Marasek, Warsaw ICCVG, Multimedia Chair Computation of room acoustics using programable video hardware Marcin Jędrzejewski.

- Laboratoire d'InfoRmatique en Image et Systèmes d'information

A SEMINAR ON 1 CONTENT 2  The Stream Programming Model  The Stream Programming Model-II  Advantage of Stream Processor  Imagine’s.

Havok FX Physics on NVIDIA GPUs. Copyright © NVIDIA Corporation 2004 What is Effects Physics? Physics-based effects on a massive scale 10,000s of objects.

Computer Graphics 3 Lecture 6: Other Hardware-Based Extensions Benjamin Mora 1 University of Wales Swansea Dr. Benjamin Mora.

Maths & Technologies for Games Advanced Graphics: Scene Post-Processing CO3303 Week

CSE 690: GPGPU Lecture 8: Image Processing PDE Solvers Klaus Mueller Computer Science, Stony Brook University.

CSCI 440.  So far we have learned how to  build shapes  create movement  change views  add simple lights  But, our objects still look very cartoonish.

From Turing Machine to Global Illumination Chun-Fa Chang National Taiwan Normal University.

COMPUTER GRAPHICS CS 482 – FALL 2015 SEPTEMBER 29, 2015 RENDERING RASTERIZATION RAY CASTING PROGRAMMABLE SHADERS.

Ray Tracing using Programmable Graphics Hardware

Mapping Computational Concepts to GPUs Mark Harris NVIDIA.

Shadows David Luebke University of Virginia. Shadows An important visual cue, traditionally hard to do in real-time rendering Outline: –Notation –Planar.

GPGPU: Parallel Reduction and Scan Joseph Kider University of Pennsylvania CIS Fall 2011 Credit: Patrick Cozzi, Mark Harris Suresh Venkatensuramenan.

Dynamic Geometry Displacement Jens Krüger Technische Universität München.

Radiance Cache Splatting: A GPU-Friendly Global Illumination Algorithm P. Gautron J. Křivánek K. Bouatouch S. Pattanaik.

3/12/2013Computer Engg, IIT(BHU)1 CUDA-3. GPGPU ● General Purpose computation using GPU in applications other than 3D graphics – GPU accelerates critical.

Contextual Snapshots: Enriched Visualization with Interactive Spatial Annotations Peter Mindek 1, Stefan Bruckner 2,1 and M. Eduard Gröller 1 1 Institute.

COMP 175 | COMPUTER GRAPHICS Remco Chang1/XX13 – GLSL Lecture 13: OpenGL Shading Language (GLSL) COMP 175: Computer Graphics April 12, 2016.

CPSC 6040 Computer Graphics Images

Graphics Processing Unit

From Turing Machine to Global Illumination

The Graphics Rendering Pipeline

Chapters VIII Image Texturing

Static Image Filtering on Commodity Graphics Processors

GPGPU: Parallel Reduction and Scan

RADEON™ 9700 Architecture and 3D Performance

University of Virginia

Presentation transcript:

Tone Mapping on GPUs Cliff Woolley University of Virginia Slides courtesy Nolan Goodnight

HDR and Tone Mapping Clamped to [0,1]Compressed

Advances in graphics hardware –Physically-based rendering on the GPU (Purcell et al, 2003) –High dynamic range texture mapping (Debevec et al, 2001)

System Overview Interactive tone mapping system for an OpenGL application tone mapping system application LDR image Frame buffer Display callback HDR image

Interface to the application –tmInitialize(); // Initialize the system –tmEnable(); // Retarget GL calls Draw geometry tmCompress(); // Compress output –tmDisable(); // Restore app context tone mapping system application

Choosing a tone mapping operator Photographic Tone Reproduction for High Contrast Images (Reinhard et al, 2002) –Global operator is a simple transfer function scaled luminance 0 1

Choosing a tone mapping operator Local operator –Digital analog to ‘burning’ and ‘dodging’ local area luminance Center-surround

Global operator is simple and fast to compute Only one global computation We can dynamically choose the number of zones Why use this tone mapping operator?

Variable number of zones: 3

Variable number of zones: 4

Variable number of zones: 5

Variable number of zones: 6

Variable number of zones: 7

Variable number of zones: 8

System block diagram

Implementation Target architecture –ATI Radeon 9800 (R350) Data storage –Floating-point off-screen buffers (pbuffers) –Multiple rendering surfaces (GL_AUXi)

Implementation Algorithms –ARB fragment and vertex assembly –Generate fragments with image-sized quads Data representation –Vector vs. scalar organization

Global operator block diagram

Implementation: global operator –Simple luminance transform –Store luminance and log luminance in separate channels HDR image Luminance Log luminance Mipmap reduction LDR image Single pbuffer luminance log luminance

Implementation: global operator Single rendering surface log luminance channel log average luminance HDR image Luminance Log luminance Mipmap reduction LDR image Single pbuffer

Implementation: global operator operator shader texture 0 texture 1 texture 2 HDR image Luminance Log luminance Mipmap reduction LDR image

Local operator block diagram

Implementation: GPU-based convolutions Transform n-vector product into multiple 4-vector products filter luminance + + …………

Vectorizing the luminance –Output 4 pixels at the same time –Useful for expensive algorithms –Requires a conversion back to scalar form. Stacked domain

A simple method for luminance vectorization: Vectorizing the luminance R G B A luminance

A simple method for luminance vectorization: Vectorizing the luminance R G B A luminance

A simple method for luminance vectorization: Vectorizing the luminance R G B A luminance

A simple method for luminance vectorization: Vectorizing the luminance R G B A luminance

A simple method for luminance vectorization: Preserves spatial locality Vectorizing the luminance R G B A luminance

filter image Example:1 x n inner product stacked image GPU-based convolutions

filter image stacked image GPU-based convolutions Pass 1

filter image stacked image GPU-based convolutions Pass 1Pass 2 +

filter image stacked image GPU-based convolutions Pass 1Pass 2Pass 3 ++

GPU-based convolutions Compute multiple 4-vector products per pass –Less shader and texture switching stacked image ++ Single render pass

GPU-based convolutions Compute multiple 4-vector products per pass –Less shader and texture switching stacked image ++ Single render pass

GPU-based convolutions Compute multiple 4-vector products per pass –Less shader and texture switching stacked image ++ Single render pass

GPU-based convolutions Compute multiple 4-vector products per pass –Less shader and texture switching stacked image ++ Single render pass

GPU-based convolutions Compute multiple 4-vector products per pass –Less shader and texture switching stacked image ++ Single render pass

GPU-based convolutions Advantages : –Handles large kernels –Efficient memory access –No transform back to scalar values 21 x 21 kernel ~ 10 ms 41 x 41 kernel ~ 16 ms 11 x 11 kernel ~ 6 ms 512 X 512 image:

System block diagram

Calculating adaptation zones luminance 0 Buffer 0Buffer 1 FRONT BACK 1 filtered

Calculating adaptation zones luminance 2 Buffer 0Buffer 1 FRONT BACK 1 filtered

Calculating adaptation zones luminance 2 Buffer 0 FRONT BACK 3 Buffer 1 filtered

Calculating adaptation zones luminance 4 FRONT BACK 3 Buffer 0Buffer 1 filtered

Image size Frames per second 16 bit floats 32 bit floats Performance: global operator

Performance: local operator Number of zones 16 bit floats 32 bit floats Frames per second

Performance comparison: CPU vs. GPU

Results: Accuracy Comparison with CPU: 512 x 512 image ImageRMS % error Scaled luminance0.022 % Convolution (5 x 5)0.026 % Convolution (49 x 49)0.032 % Final image1.051 %

False-color zone images CPUGPU

Compressed: 2 zonesClamped [0,1] Images generated at ~30Hz

Compressed: 2 zonesClamped [0,1]

Compressed: 2 zonesClamped [0,1] Images generated at ~30Hz

Compressed: 2 zonesClamped [0,1] Images generated at ~30Hz

Compressed: 2 zonesClamped [0,1] Images generated at ~30Hz

Compressed: 2 zonesClamped [0,1] Images generated at ~30Hz

Conclusion and Future Work Summary –System for interactively compressing HDR output from an OpenGL application –Complex tone mapping operator on the GPU Future Work –Other tone mapping operators –Further optimizations –Non-invasive implementation