David Nguyen Professor John Canny MultiView A Spatially Faithful Video-Conferencing System
Talk Outline Motivation Prior Work Our Approach Experiment Discussion Conclusions
MultiView is the first practical video conferencing system that preserves many non-verbal cues in group-to-group meetings by improving spatial faithfulness
Spatial Faithfulness describes how attention cues (e.g., gaze and gesture) are preserved or distorted across a virtual space.
Motivation You’re Fired!
Spatial Distortions Collapsed Viewer Effect With only one camera, remote participants take on a shared and incorrect perspective warping nonverbal cues
Mona Lisa Effect
Spatial Faithfulness Different levels of Spatial Faithfulness –Mutual – Correctly perceive that attention is or is not being directed at you when it actually is or is not –Partial – Correctly perceive the general direction of attention when it is away from you –Full – Correctly perceive the specific object of attention Donald Dave increasing
Talk Outline Motivation Prior Work Our Approach Experiment Discussion Conclusions
HYDRA (Sellen et al., 1992) Group Sites Mutual Sp.Fa. Full Sp.Fa. Partial Sp.Fa.
GAZE-2 (Vertegaal et al., 2003) Group Sites Mutual Sp.Fa. Full Sp.Fa. Partial Sp.Fa.
MAJIC (Okada et al., 1994) Group Sites Mutual Sp.Fa. Full Sp.Fa. Partial Sp.Fa.
Talk Outline Motivation Prior Work Our Approach Experiment Discussion Conclusions
Unique and Correct Views Multiple Viewpoint Display “Virtually Here” Preserving a geometric relationships between virtual and actual positions provides full spatial faithfulness
Gaze Parallax
Camera Placement vs. Gaze Parallax Ideally, cameras should be placed at the position of the eyes –Occlusion (either the camera or the image) Cameras can be up to 5 o before gaze parallax is perceived (Chen, 2002) [5°-7°]
MultiView Directional Display Big, Bright, High Resolution Display Each view is provided by a projector The projected image is reflected directly back in the direction of the projector The image can be seen at varying heights only behind the projector
Construction Retroreflective Layer Reflects image back in direction of source Vertical Diffuser Diffuses image vertically to accommodate varying viewing heights Antireflective Sheet Reduces distracting glares due to glossy surface and front projection setup
Brightness vs. Viewing Angle 0O0O α -10 O
MultiView Cameras MultiView Screen Projectors
Cost Small fixed cost Variable costs increase linearly Alternatives increase quadratically Fixed Costs (MultiView Screen) ItemCost Retroreflective Sheet$50 Lenticular Lens$50 Anti-Glare Layer$600 Total Fixed Cost$700 Variable Costs ItemCost/Unit Camera$100 Projectors$1000
Easy to Join
Talk Outline Motivation Prior Work Our Approach Experiment Discussion Conclusions
Experimental Setup 3O3O
23 participants: UC Berkeley graduate and undergraduate students Each Paid $10 Experiments –Exp 1: 10 Trials (230 Total) –Exp 2: 10 Trials (230 Total) –Exp 3: 30 Trials (690 Total)
Experiment 1 & 2: Mona Lisa Effect Each researcher was asked to look or point at one of the five targets Each participant was asked to circle which target each researcher appeared to be looking/pointing at Repeated for 10 trials
Experiment 1: Accuracy of Gaze Perception Confusion Matrix –Each column represents actual target stimulus –Each Row represents perceived target –Ideally, this would be a diagonal matrix. Participants were accurate in determining target of gaze –90% were at most one target off
Experiment 2: Accuracy of Gesture Perception Confusion Matrix –Each column represents actual target stimulus –Each Row represents perceived target –Ideally, this would be a diagonal matrix. Participants were accurate in determining target of gaze –94% were at most one target off
Experiment 3: Eye Contact Researchers and Participants were paired off. Researchers were asked to look at the eyes, at the cameras, below the eyes, to the left of the eyes or to the right of the eyes. Each participant was asked if they felt like the partner was looking at them in the eyes.
Mutual Gaze No significant difference in positive responses between different gaze positions “At Cam” yielded <100% positive response Participants noted a strong sense of eye contact in the context of conversation Yes/No “Are they looking at me?”
Virtual Viewing Distance Viewers sat 12’ from screen –projector throw distance Image needed to be scaled down to 66% –Scene:Screen size Resulting in a virtual viewing distance of 18’
Future Work New MultiView design iteration –8’ Screen –Short throw projectors for closer virtual distance (from 18’ to 8’) –True Life Size Images
Future Work: Shared Workspace Object Virtual Object Projector Glass Table
Future Work Using a new digital pipeline, we can control for aspects such as video quality and latency for low level experiments New series of higher level experiments using new screen design –Trust formation –Turn Taking –Speech Patterns
Conclusion Our perceptual experiments show that attention target cues can be accurately determined using our system ? Standard video conference systems have spatial distortions which can adversely effect communication Our solution, MultiView, is the first practical video conferencing system to support spatially faithful group-to-group meetings
Additional Slides
David Nguyen Professor John Canny MultiView A Spatially Faithful Video-Conferencing System
Position Dependant Views Participants are facing the right camera View From Left View From Center View From Right
Current Technologies
Functions of Gaze (Kendon, 1967) Monitoring –“Is my partner listening to me?” –“Does my partner want to take over?” –Planning, Current Control, Checking Regulating –Floor Control (Turn Taking) –Forestall/Demand a Response Expressing – feelings or attitudes –Power, interest, point-granting, attention, agreement
First Impressions: Rule (Mehrabian, 1971) In forming first impressions… –7% is based on what you actually say –38% is based on the way you say it –55% is based on your appearance Suggests that a lot is at risk in forming first impressions if visual nonverbal cues are not appropriately preserved
Pop Quiz: First Impressions Rule (Mehrabian, 1971) In forming first impressions… – is based on content – is based on voice – is based on appearance Fill in the blank with [7%, 38%, or 55%] 7% 38% 55%
The Video Tunnel Variants Include… Gaze-2 (Vertegaal et al., 1999, 2003) GA Display (Monk and Gale, 2002) The Teleprompter (Kahn, 1994) Clearboard (Ishii and Kobayashi, 1992) Reciprocal Video Tunnel (Buxton and Moran, 1990)
Computer Vision/Graphics Approach (Gemmel et al., 2000)
Human Accuity (Chen, 2002)
Next Design Iteration 8ft screen for life size images Short throw projector for closer viewing distances 9’6”) essentially halving virtual distance Ghosting –Multiple elements and uncontrollable diffusion causes ghosting effects
Error by Viewing Position GAZE Positionμσ All GESTURE PositionΜσ All In standard video conferencing systems, Mona Lisa Effect would cause greater error as viewing position deviates from “virtual camera position” Finding: Viewing position had no significant effect on error MultiView does not exhibit the Mona Lisa Effect
3 Site Setup
Vision: More Motion Freedom Three evolving technologies –Synthesize viewpoints from multiple cameras –Shrinking Projector Size and Power Consumption –Position Tracking Virtual Camera
Prior Work Hydra (Sellen et al., 1992) MAJIC (Okada et al., 1994) GAZE-2 (Vertigaal et al., 2003)
Feature Comparison
MultiView (Nguyen and Canny, 2005) A Spatially Faithful Video Conferencing System
Trust Formation (Bos et al., 2002) Trust development was delayed in audio/video Defections were more likely with video/audio than FTF communication. Little difference between video and audio
Errors By Stimulus Direction Edge Effects Extension of Chen’s Snap To Contact Theory –Includes other sources attention (not just gaze) –Includes third party viewers viewing two others in discussion Mean Error By Gesture Direction Direction Mean Error By Gaze Direction Direction
Acknowledgments