Stereoscopic Video Overlay with Deformable Registration Balazs Vagvolgyi Prof. Gregory Hager CISST ERC Dr. David Yuh, M.D. Department of Surgery Johns.

Slides:

Advertisements

Similar presentations

Results/Conclusions: In computer graphics, AR is achieved by the alignment of the virtual camera with the actual camera and the virtual object with the.

Advertisements

For Internal Use Only. © CT T IN EM. All rights reserved. 3D Reconstruction Using Aerial Images A Dense Structure from Motion pipeline Ramakrishna Vedantam.

Structured Light principles Figure from M. Levoy, Stanford Computer Graphics Lab.

Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image Where does the.

MASKS © 2004 Invitation to 3D vision Lecture 7 Step-by-Step Model Buidling.

Kernel-based tracking and video patch replacement Igor Guskov

Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,

Last Time Pinhole camera model, projection

Copyright  Philipp Slusallek Cs fall IBR: Model-based Methods Philipp Slusallek.

Direct Methods for Visual Scene Reconstruction Paper by Richard Szeliski & Sing Bing Kang Presented by Kristin Branson November 7, 2002.

Stereoscopic Light Stripe Scanning: Interference Rejection, Error Minimization and Calibration By: Geoffrey Taylor Lindsay Kleeman Presented by: Ali Agha.

Stereo & Iterative Graph-Cuts Alex Rav-Acha Vision Course Hebrew University.

Multi-view stereo Many slides adapted from S. Seitz.

High-Quality Video View Interpolation

The plan for today Camera matrix

Lecture 10: Stereo and Graph Cuts

Stereo Computation using Iterative Graph-Cuts

Lecture 11: Stereo and optical flow CS6670: Computer Vision Noah Snavely.

CSCE 641 Computer Graphics: Image-based Modeling (Cont.) Jinxiang Chai.

CSE473/573 – Stereo Correspondence

CSCE 641 Computer Graphics: Image-based Modeling (Cont.) Jinxiang Chai.

Multiple View Geometry : Computational Photography Alexei Efros, CMU, Fall 2006 © Martin Quinn …with a lot of slides stolen from Steve Seitz and.

Introduction 3D scene flow is the 3D motion field of points in the world. Structure is the depth of the scene. Motivation of our work: Numerous applications.

3-D Scene u u’u’ Study the mathematical relations between corresponding image points. “Corresponding” means originated from the same 3D point. Objective.

Research & Innovation 1 An Industry Perspective on VVG Research Oliver Grau BBC Research & Innovation VVG SUMMER SCHOOL '07.

IGT Meeting – CADDLab – November, 2005 Image-Guided Surgery Applications Julien Jomier The University of North Carolina at Chapel Hill.

CSC 589 Lecture 22 Image Alignment and least square methods Bei Xiao American University April 13.

A Brief Overview of Computer Vision Jinxiang Chai.

Geometric and Radiometric Camera Calibration Shape From Stereo requires geometric knowledge of: –Cameras’ extrinsic parameters, i.e. the geometric relationship.

Real-Time High Resolution Photogrammetry John Morris, Georgy Gimel’farb and Patrice Delmas CITR, Tamaki Campus, University of Auckland.

Automatic Registration of Color Images to 3D Geometry Computer Graphics International 2009 Yunzhen Li and Kok-Lim Low School of Computing National University.

KinectFusion : Real-Time Dense Surface Mapping and Tracking IEEE International Symposium on Mixed and Augmented Reality 2011 Science and Technology Proceedings.

Video Overlay Advanced Computer Integrated Surgery ( ) Jeff Hsin, Cyrus Moon, Anand Viswanathan.

Visual Perception PhD Program in Information Technologies Description: Obtention of 3D Information. Study of the problem of triangulation, camera calibration.

ECE532 Final Project Demo Disparity Map Generation on a FPGA Using Stereoscopic Cameras ECE532 Final Project Demo Team 3 – Alim, Muhammad, Yu Ting.

Introduction EE 520: Image Analysis & Computer Vision.

CVPR Workshop on RTV4HCI 7/2/2004, Washington D.C. Gesture Recognition Using 3D Appearance and Motion Features Guangqi Ye, Jason J. Corso, Gregory D. Hager.

Stereo Many slides adapted from Steve Seitz.

CS 4487/6587 Algorithms for Image Analysis

Cmput412 3D vision and sensing 3D modeling from images can be complex 90 horizon 3D measurements from images can be wrong.

A General-Purpose Platform for 3-D Reconstruction from Sequence of Images Ahmed Eid, Sherif Rashad, and Aly Farag Computer Vision and Image Processing.

A Fast and Accurate Tracking Algorithm of the Left Ventricle in 3D Echocardiography A Fast and Accurate Tracking Algorithm of the Left Ventricle in 3D.

Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image image 1image.

Spatio-Temporal Free-Form Registration of Cardiac MR Image Sequences Antonios Perperidis s /02/2006.

CSE 185 Introduction to Computer Vision Stereo. Taken at the same time or sequential in time stereo vision structure from motion optical flow Multiple.

Lecture 16: Stereo CS4670 / 5670: Computer Vision Noah Snavely Single image stereogram, by Niklas EenNiklas Een.

Digital Image Processing

Solving for Stereo Correspondence Many slides drawn from Lana Lazebnik, UIUC.

Course14 Dynamic Vision. Biological vision can cope with changing world Moving and changing objects Change illumination Change View-point.

Visual Odometry David Nister, CVPR 2004

Paper presentation topics 2. More on feature detection and descriptors 3. Shape and Matching 4. Indexing and Retrieval 5. More on 3D reconstruction 1.

1Ellen L. Walker 3D Vision Why? The world is 3D Not all useful information is readily available in 2D Why so hard? “Inverse problem”: one image = many.

Image-Based 3-D Spinal Navigation Using Intra-Operative Fluoroscopic Registration R. Grzeszczuk, S. Chin, M. Murphy, R. Fahrig, H. Abbasi, D. Kim, J.R.

Project 2 due today Project 3 out today Announcements TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAAAA.

John Morris Stereo Vision (continued) Iolanthe returns to the Waitemata Harbour.

Image-Based Rendering Geometry and light interaction may be difficult and expensive to model –Think of how hard radiosity is –Imagine the complexity of.

John Morris These slides were adapted from a set of lectures written by Mircea Nicolescu, University of Nevada at Reno Stereo Vision Iolanthe in the Bay.

Photoconsistency constraint C2 q C1 p l = 2 l = 3 Depth labels If this 3D point is visible in both cameras, pixels p and q should have similar intensities.

MASKS © 2004 Invitation to 3D vision. MASKS © 2004 Invitation to 3D vision Lecture 1 Overview and Introduction.

1 Review and Summary We have covered a LOT of material, spending more time and more detail on 2D image segmentation and analysis, but hopefully giving.

A 2D/3D correspondence building method for reconstruction of a 3D bone surface model Longwei Fang

Energy minimization Another global approach to improve quality of correspondences Assumption: disparities vary (mostly) smoothly Minimize energy function:

Stereo CS4670 / 5670: Computer Vision Noah Snavely Single image stereogram, by Niklas EenNiklas Een.

Data Integration during Robotic Ultrasound-guided Surgery

Advanced Computer Graphics

Processing visual information for Computer Vision

Multimodal Registration Using Stereo Imaging and Contact Sensing

Optical Coherence Tomography

Stereo vision Many slides adapted from Steve Seitz.

Presentation transcript:

Stereoscopic Video Overlay with Deformable Registration Balazs Vagvolgyi Prof. Gregory Hager CISST ERC Dr. David Yuh, M.D. Department of Surgery Johns Hopkins University

The CASA Project Today’s Surgical Assistant: A Simple Information Channel

The CASA Project Stereo surface tracking Stereo tool tracking Virtual fixtures with da Vinci Robot Task graph execution system HMM-based Intent Recognition Information Fusion with da Vinci Display Ultrasound Capabilities of a Context-Aware Surgical Assistant (CASA) Tissue Classification Preoperative Imagery

The CASA Project Stereo surface tracking Stereo tool tracking Information Fusion with da Vinci Display Developing a Context-Aware Surgical Assistant (CASA) Preoperative Imagery

Information Overlay Problem setting: –Given pre-operative scan data from a suitable imaging modality –Video sequence from a stereo endoscope Add value –Overlay underlying anatomy on the stereo video stream (x-ray vision) –Include annotations or other information tied to imagery [[ add kidney picture ]] Key Problem: Nonrigid registration of organ surface to data

Inputs: What Do We Know? 1.Pre-operative 3D model - most probably volumetric - only a portion of it will be visible on the endoscope - anatomy will be deformed during the surgical procedure 2.Camera system properties can be measured - optical & stereo calibration - local brightness/contrast/color response 3.Stereo image stream - 3D surface can be reconstructed - texture information 4.A guesstimate of model–endoscope 3D relationship - We can guess where to start searching [i.e. patient position]

Outputs: What Do We Generate? 1.Position of 3D model registered to stereo image 2.Model deformed to the current shape of anatomy 3.Rendering a synthetic 3D view on the stereo stream 4.Everything done real-time Original ImageStereo DataDeformed Mesh

2D3D All this in a flow chart Stereo image pre-processing Building and optimizing disparity map Deformable Registration to 3D surface 3D texture tracking Recognizing deformations optical parameters stereo video stream Image overlay disparity 3D data image data parameters 3D model

Classical Stereo Vision: The Problem Blocks of each image are compared using SAD Optimization for each block independently on entire depth range +Very fast implementation (GPU) ¬Lousy results Small Vision System from Videre Design (w/o structured light):

Input images downsized to several scale levels (½, ¼, …) Each scale processed with the same algorithm –Propagate coarse search results to the finer scale +Quality of disparity map is better +Even faster than single scale computation ¬Requires structured light Solution #1: Lighting and Multi-Scale SVL implementation (using structured light):

Solve a (spatially) global optimization with regularization –O(D) = min SAD(D) + Smooth(D) GLOBAL optimum found in polynomial time Solution #2: Dynamic Programming

1.Defining the recursive cost function 2.Memoization 3.Finding lowest cost path, which is the disparity map (D M in red) SmoothnessError Solution #2: Dynamic Programming

Dynamic Programming on Images Minor issue: previous approach applies to scanline Approximate DP applied to entire image - 3D disparity space (D): - Cost function (C): - Memoization (P):

Dynamic Programming: Results

Dynamic Programming: In Vivo Results Stereo recordings from the da Vinci robot Focal length of ~ 700 pixels ~5mm baseline Distance to surface of 55mm to 154mm. Raw Disparity Map Textured 3D Model

Surface to 3D Model Registration Inputs: –point cloud from the stereo surface modeler –point cloud generated from a model or volume image Outputs: - transformation to register the 3D model to the 3D surface

Results: Rigid Registration Complete system (stereo plus registration) operates at 5 frames/second Current algorithm uses IPC with modifications to account for occlusions due to viewpoint (z-buffer)

From Rigid to Deformable Calculate residual errors in z direction Define a spring-mass system Perform local gradient descent

Deformable Registration Results Final registration error of < 1mm except for the area where the tool enters the image

Coming in CASA The Language of Surgery Tool Tracking Tissue Surface Classification

Thank you!

Telemanipulation with Integrated Laparoscopic Ultrasound for Hepatic Surgery Ultrasound probe examining artificial lesion in porcine liver with registered 2D ultrasound overlay Registered 3D ultrasound volume swept w/autonomous robot motion Needle insertion demonstrates alignment Collaboration between JHU and Intuitive Surgical, Inc.