MACHINE VISION GROUP Head-tracking virtual 3-D display for mobile devices Miguel Bordallo López*, Jari Hannuksela*, Olli Silvén* and Lixin Fan**, * University.

Slides:



Advertisements
Similar presentations
COMPUTER GRAPHICS CS 482 – FALL 2014 NOVEMBER 10, 2014 GRAPHICS HARDWARE GRAPHICS PROCESSING UNITS PARALLELISM.
Advertisements

Hand Gesture for Taking Self Portrait Shaowei Chu and Jiro Tanaka University of Tsukuba Japan 12th July 15 minutes talk.
By : Adham Suwan Mohammed Zaza Ahmed Mafarjeh. Achieving Security through Kinect using Skeleton Analysis (ASKSA)
Real-Time Human Pose Recognition in Parts from Single Depth Images Presented by: Mohammad A. Gowayyed.
ENERGY-PROPORTIONAL IMAGE SENSING FOR Robert LiKamWa Bodhi Priyantha Matthai Philipose Victor Bahl Lin Zhong CONTINUOUS MOBILE VISION
Virtual Dart: An Augmented Reality Game on Mobile Device Supervisor: Professor Michael R. Lyu Prepared by: Lai Chung Sum Siu Ho Tung.
Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,
Microsense Webcast Streaming Solutions
Real-time Embedded Face Recognition for Smart Home Fei Zuo, Student Member, IEEE, Peter H. N. de With, Senior Member, IEEE.
Video Object Tracking and Replacement for Post TV Production LYU0303 Final Year Project Spring 2004.
Ensemble Tracking Shai Avidan IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE February 2007.
Real-time and Retrospective Analysis of Video Streams and Still Image Collections using MPEG-7 Ganesh Gopalan, College of Oceanic and Atmospheric Sciences,
CS335 Principles of Multimedia Systems Multimedia and Human Computer Interfaces Hao Jiang Computer Science Department Boston College Nov. 20, 2007.
Robust Real-Time Object Detection Paul Viola & Michael Jones.
Viola and Jones Object Detector Ruxandra Paun EE/CS/CNS Presentation
UNIVERSITY OF MURCIA (SPAIN) ARTIFICIAL PERCEPTION AND PATTERN RECOGNITION GROUP REFINING FACE TRACKING WITH INTEGRAL PROJECTIONS Ginés García Mateos Dept.
Ch 1 Intro to Graphics page 1CS 367 First Day Agenda Best course you have ever had (survey) Info Cards Name, , Nickname C / C++ experience, EOS experience.
Real-Time Face Detection and Tracking Using Multiple Cameras RIT Computer Engineering Senior Design Project John RuppertJustin HnatowJared Holsopple This.
MACHINE VISION GROUP Multimodal sensing-based camera applications Miguel Bordallo 1, Jari Hannuksela 1, Olli Silvén 1 and Markku Vehviläinen 2 1 University.
Nvidia Tegra 2 The world's first mobile super chip.
Human tracking and counting using the KINECT range sensor based on Adaboost and Kalman Filter ISVC 2013.
3D Stereo Reconstruction using iPhone Devices Final Presentation 24/12/ Performed By: Ron Slossberg Omer Shaked Supervised By: Aaron Wetzler.
MACHINE VISION GROUP Graphics hardware accelerated panorama builder for mobile phones Miguel Bordallo López*, Jari Hannuksela*, Olli Silvén* and Markku.
MACHINE VISION GROUP GPGPU-based surface inspection from structured white light Miguel Bordallo 1, Karri Niemelä 2, Olli Silvén 1 1 Center for Machine.
Knowledge Systems Lab JN 9/10/2002 Computer Vision: Gesture Recognition from Images Joshua R. New Knowledge Systems Laboratory Jacksonville State University.
Shinta Kisriani.  INTRODUCTION  THEORY LITERATURE  METHOD DESIGN  ANALYSIS & RESULT  CONCLUSION  FUTURE WORK.
Computer Graphics Graphics Hardware
PortableVision-based HCI A Hand Mouse System on Portable Devices 連矩鋒 (Burt C.F. Lien) Computer Science and Information Engineering Department National.
Introduction to Computer Vision Olac Fuentes Computer Science Department University of Texas at El Paso El Paso, TX, U.S.A.
GENERAL PRESENTATION SUBMITTED BY:- Neeraj Dhiman.
ST01 - Introduction 1 Introduction Lecturer: Smilen Dimitrov Sensors Technology – MED4.
Lectio Praecursoria Miguel Bordallo López
© 2009 Nokia V1-OpenCLEnbeddedProfilePresentation.ppt / / JyrkiLeskelä 1 OpenCL Embedded Profile Presentation for Multicore Expo 16 March 2009.
Interactive Time-Dependent Tone Mapping Using Programmable Graphics Hardware Nolan GoodnightGreg HumphreysCliff WoolleyRui Wang University of Virginia.
Detecting Pedestrians Using Patterns of Motion and Appearance Paul Viola Microsoft Research Irfan Ullah Dept. of Info. and Comm. Engr. Myongji University.
Portable Vision-Based HCI A Real-Time Hand Mouse System on Portable Devices 連矩鋒 (Burt C.F. Lien) Department of Computer Science and Information Engineering.
WSCG2008, Plzen, 04-07, Febrary 2008 Comparative Evaluation of Random Forest and Fern classifiers for Real-Time Feature Matching I. Barandiaran 1, C.Cottez.
Real-Time Cyber Physical Systems Application on MobilityFirst Winlab Summer Internship 2015 Karthikeyan Ganesan, Wuyang Zhang, Zihong Zheng.
Accelerating image recognition on mobile devices using GPGPU
Mobile Middleware for Energy-Awareness Wei Li
1 Webcam Mouse Using Face and Eye Tracking in Various Illumination Environments Yuan-Pin Lin et al. Proceedings of the 2005 IEEE Y.S. Lee.
DIEGO AGUIRRE COMPUTER VISION INTRODUCTION 1. QUESTION What is Computer Vision? 2.
Tone Mapping on GPUs Cliff Woolley University of Virginia Slides courtesy Nolan Goodnight.
ECE 8443 – Pattern Recognition EE 3512 – Signals: Continuous and Discrete Objectives: Spectrograms Revisited Feature Extraction Filter Bank Analysis EEG.
Designing for energy-efficient vision-based interactivity on mobile devices Miguel Bordallo Center for Machine Vision Research.
Tiled Forward Shading Johan Medeström. Project Goals Render a scene with lots of lights Learn more OpenGL and shading techniques Learn more about OpenCL/Compute.
Real-Time Cyber Physical Systems Application on MobilityFirst Winlab Summer Internship 2015 Karthikeyan Ganesan, Wuyang Zhang, Zihong Zheng Shantanu Ghosh,
User Attention Tracking in Large Display Face Tracking and Pose Estimation Yuxiao Hu Media Computing Group Microsoft Research, Asia.
Immersive Rendering. General Idea ► Head pose determines eye position  Why not track the eyes? ► Eye position determines perspective point ► Eye properties.
GENDER AND AGE RECOGNITION FOR VIDEO ANALYTICS SOLUTION PRESENTED BY: SUBHASH REDDY JOLAPURAM.
Visual Odometry David Nister, CVPR 2004
MACHINE VISION GROUP MOBILE FEATURE-CLOUD PANORAMA CONSTRUCTION FOR IMAGE RECOGNITION APPLICATIONS Miguel Bordallo, Jari Hannuksela, Olli silvén Machine.
Augmented Reality and 3D modelling Done by Stafford Joemat Supervised by Mr James Connan.
MULTI TOUCH. Introduction Multi-touch is a human-computer interaction technique. Consists of a touch screen as well as software that recognizes multiple.
Motion tracking TEAM D, Project 11: Laura Gui - Timisoara Calin Garboni - Timisoara Peter Horvath - Szeged Peter Kovacs - Debrecen.
Dynamic Framerate and Resolution Scaling on Mobile Devices Kent W. Nixon, Xiang Chen, Yiran Chen University of Pittsburgh January 29, 2016.
Hand Gestures Based Applications
- Introduction - Graphics Pipeline
Real-Time Soft Shadows with Adaptive Light Source Sampling
Computer Graphics.
Video-based human motion recognition using 3D mocap data
Dingding Liu* Yingen Xiong† Linda Shapiro* Kari Pulli†
Chapter I Introduction
Computer Graphics Graphics Hardware
WELCOME.
Research Institute for Future Media Computing
Midway Design Review Team 1: MirrAR
LiGaze Ultra-Low Power Gaze Tracking for Virtual Reality
PRELIMINARY DESIGN REVIEW
OpenGL-Rendering Pipeline
Presentation transcript:

MACHINE VISION GROUP Head-tracking virtual 3-D display for mobile devices Miguel Bordallo López*, Jari Hannuksela*, Olli Silvén* and Lixin Fan**, * University of Oulu, Finland ** Nokia Research Center, Tampere, Finland

MACHINE VISION GROUP Contents Introduction Head-tracking 3D virtual display Interaction design Face-tracking for mobile devices Mobile device’s constrains Field of view Energy efficiency Implementation Latency considerations Performance Summary

MACHINE VISION GROUP Introduction 3D virtual displays Calculate the relative position of the user respect to the screen Calculate the angle of the user’s point of view Render an image according to the point of view Result is a Virtual Window: - Shows realistic 3D objects - Based on parallax effect * Video from Johnny Lee (Wiimote head tracking project) The position information is used to render the 3D UI/content as if the user watched it from different angles. The technology enable users to watch the content from different angles and become more immersed.

MACHINE VISION GROUP Introduction Mobile 3D virtual displays Mobile head-coupled display can take advantage of the small size Movement of either user or device Mobile Devices have cameras and sensors integrated No need for external periferics Can increase UI functionalities New applications and concepts Realistic 3D objects can be rendered and perceived New interaction methods can be developed We know what the user looks at and we can use that information

MACHINE VISION GROUP Demo

MACHINE VISION GROUP Head-tracking mobile virtual 3D display A simple use case

MACHINE VISION GROUP Interaction design

MACHINE VISION GROUP Head-coupled displays require robust and fast face-tracking Based on multiscale LBP, Cascade classifier and AdaBoost Excellent results in face recognition and authentication, face detection, facial expression recognition, gender classification Introduction Mobile face-tracking

MACHINE VISION GROUP Introduction Evaluating the distance to the screen Essential to compute de relative angle Ground truth determined With Kinect Two methods evaluated: Face size obtained with face tracking Flickering between frames No extra computations needed Good accuracy Motion estimation library: Harris corners + BLUE Computes changes of scale between frames Presents about 10% more accuracy Less flickering between frames Needs extra computations: Introduces latency, decreases framerate Worse input sequence for tracking More differences between frames

MACHINE VISION GROUP Mobile constrains Field of view Front Camera is on the device’s corner and not pointing to the user: Reduced field of view (<45dg) Assymmetric FoV Even more reduced effective FoV Considerable minimum distance to the screen User often outside of the point of view Tracking sometimes lost Need to show viewfinder on the screen

MACHINE VISION GROUP Mobile constrains Field of view Implemented solution: Wide angle lens Dramatically increases the effective field of view (<160dg) Requires calibrated lens Requires de-warping routine Implemented with lookup tables Problems when several faces are on the field of view

MACHINE VISION GROUP Mobile constrains Energy efficiency Practical challenge of camera-based UI is to have an always active camera Lower framerate -> High UI starting latencies Higher framerate -> Small energy-efficiency Application processor (even in mobile) is power hungry Specific processors closer to the sensors are needed Current devices include HW-codecs and GPUs: Better energy efficiency due to small EPI Mobile GPU already programable: OpenGL ES OpenCL Embeded Porfile

MACHINE VISION GROUP Energy efficiency GPU-accelerated face-tracking Computational and energy costs per VGA frame of feature extraction GPU can be treated as an independent entity Can be use concurrently with CPU Use of GPU for feature extraction (format conversion + multiscaling + LBP) Mobile GPUs still not very efficient for certain tasks

MACHINE VISION GROUP Implementation Demo platform: N900 (Qt + Gstreamer + openGL ES) Based on face-tracking external library Implementation details: Input image resolution : 320x240 Frame rate: fps. Base latency: ms. Accepted field of view: < 45dg hori. & < 35dg vert. User’s distance range: cm.

MACHINE VISION GROUP Implementation Simple block diagram

MACHINE VISION GROUP Implementation Task distribution

MACHINE VISION GROUP Implementation Task distribution Camera moduleApplication Processor CPU Graphics Processor GPU Touchscreen Display

MACHINE VISION GROUP Implementation Task distribution Camera moduleApplication ProcessorGraphics ProcessorTouchscreen

MACHINE VISION GROUP Mobile constrains Latency User interface latency is a critical issue Latency > 100ms. Very disturbing Realistic 3D rendering even more sensitive Not realistic if it happened a while ago !!!

MACHINE VISION GROUP Mobile constrains Latency hiding A possible solution: Latency hiding Requires good knowledge of the system’s timing Predict the current position based on motion vector

MACHINE VISION GROUP Performance Demo platform: Nokia N900 ARM cortex A8, 600 MHz + PowerVR535 GPU Comparison platform: Nokia N9 ARM cortex A8, 1 GHz + PowerVR535 GPU

MACHINE VISION GROUP Remaining problems Face-tracking based 3D User Interfaces provide support for new concepts Face tracking can be offered as a platform level Current mobile platforms still present several shortcomings Energy efficiency compromises battery life Camera not designed for UI purposes Single camera implies difficult 3D context recognition

MACHINE VISION GROUP Thank you Any question?

MACHINE VISION GROUP LBP fragment shader implementation Access the image via texture lookup Fetch the selected picture pixel Fetch the neighbours values Compute binary vector Multiply by weighting factor Uses OpenGL ES interface Two versions: –Version 1: calculates LBP map in one grayscale channel –Version 2: calculates 4 LBP maps in RGBA channels

MACHINE VISION GROUP Preprocessing Create quad Divide texture & Convert to grayscale Render each piece in one channel

MACHINE VISION GROUP GPU assisted face analysis process