3D Computer Vision and Applications

Slides:



Advertisements
Similar presentations
SEMINAR ON VIRTUAL REALITY 25-Mar-17
Advertisements

Pose Estimation Using Four Corresponding Points M.L. Liu and K.H. Wong, "Pose Estimation using Four Corresponding Points", Pattern Recognition Letters,
Sukhum Sattaratnamai Advisor: Dr.Nattee Niparnan
Page 1 SIXTH SENSE TECHNOLOGY Presented by: KIRTI AGGARWAL 2K7-MRCE-CS-035.
Joint Eye Tracking and Head Pose Estimation for Gaze Estimation
3D M otion D etermination U sing µ IMU A nd V isual T racking 14 May 2010 Centre for Micro and Nano Systems The Chinese University of Hong Kong Supervised.
Mean transform, a tutorial KH Wong mean transform v.5a1.
Department of Electrical and Computer Engineering He Zhou Hui Zheng William Mai Xiang Guo Advisor: Professor Patrick Kelly ASLLENGE.
Virtual Dart: An Augmented Reality Game on Mobile Device Supervisor: Professor Michael R. Lyu Prepared by: Lai Chung Sum Siu Ho Tung.
FYP khw09xx1 Final Year Project titles by Prof. K.H. Wong
LYU0203 Smart Traveller with Visual Translator for OCR and Face Recognition Supervised by Prof. LYU, Rung Tsong Michael Prepared by: Wong Chi Hang Tsang.
Image Stitching and Panoramas
Overview and Mathematics Bjoern Griesbach
 Introduction  Devices  Technology – Hardware & Software  Architecture  Applications.
A Brief Overview of Computer Vision Jinxiang Chai.
Simple Calibration of Non-overlapping Cameras with a Mirror
FYPs supervised by KH Wong1 FYP projects Supervisor : K.H. Wong See Projects under my webpage or
INTRODUCTION TO COMPUTING
(CMSC5720-1) MSC projects by Prof K.H. Wong (21 July 2015) (shb907) MSC projects supervised by Prof.
SFM Kalman V5b21 3D computer vision Structure from motion using Kalman filter.
Brief Introduction to Geometry and Vision
Zhengyou Zhang Microsoft Research Digital Object Identifier: /MMUL Publication Year: 2012, Page(s): Professor: Yih-Ran Sheu Student.
KinectFusion : Real-Time Dense Surface Mapping and Tracking IEEE International Symposium on Mixed and Augmented Reality 2011 Science and Technology Proceedings.
Presentation by: K.G.P.Srikanth. CONTENTS  Introduction  Components  Working  Applications.
Flow Separation for Fast and Robust Stereo Odometry [ICRA 2009]
Submitted by:- Vinay kr. Gupta Computer Sci. & Engg. 4 th year.
Realtime 3D model construction with Microsoft Kinect and an NVIDIA Kepler laptop GPU Paul Caheny MSc in HPC 2011/2012 Project Preparation Presentation.
An Information Fusion Approach for Multiview Feature Tracking Esra Ataer-Cansizoglu and Margrit Betke ) Image and.
Virtual Reality Lecture2. Some VR Systems & Applications 고려대학교 그래픽스 연구실.
GENESIS OF VIRTUAL REALITY  The term ‘Virtual reality’ (VR) was initially coined by Jaron Lanier, founder of VPL Research (1989)..
Spatiotemporal Information Processing No.8 Information-oriented real space ~ Telexistence The role of Spatiotemporal Information Processing Kazuhiko HAMAMOTO.
Online Kinect Handwritten Digit Recognition Based on Dynamic Time Warping and Support Vector Machine Journal of Information & Computational Science, 2015.
Visual SLAM Visual SLAM SPL Seminar (Fri) Young Ki Baik Computer Vision Lab.
Fundamentals of Information Systems, Sixth Edition1 Natural Language Processing and Voice Recognition Processing that allows the computer to understand.
I Robot.
Lec 22: Stereo CS4670 / 5670: Computer Vision Kavita Bala.
Augmented Reality Authorized By: Miss.Trupti Pardeshi. NDMVP, Comp Dept. Augmented Reality 1/ 23.
CONTENT FOCUS FOCUS INTRODUCTION INTRODUCTION COMPONENTS COMPONENTS TYPES OF GESTURES TYPES OF GESTURES ADVANTAGES ADVANTAGES CHALLENGES CHALLENGES REFERENCE.
Visual Odometry David Nister, CVPR 2004
Visual Odometry for Ground Vehicle Applications David Nistér, Oleg Naroditsky, and James Bergen Sarnoff Corporation CN5300 Princeton, New Jersey
Chapter 1: Image processing and computer vision Introduction
  Computer vision is a field that includes methods for acquiring,prcessing, analyzing, and understanding images and, in general, high-dimensional data.
What is Multimedia Anyway? David Millard and Paul Lewis.
FYP titles By Prof. KH Wong FYP v6.31.
Virtual Reality - by Arpit Gupta. Introduction  Virtual reality is an artificial environment that is created with software and presented to the user.
3D computer vision techniques
렌즈왜곡 관련 논문 - 기반 논문: R.Y. Tsai, An Efficient and Accurate Camera Calibration Technique for 3D Machine Vision. Proceedings of IEEE Conference on Computer.
Introducing virtual REALITY
SIXTH SENSE TECHNOLOGY
IMPART: The Intelligent Mobility Partnership
Introduction to Virtual Environments & Virtual Reality
INTRODUCTION TO GEOGRAPHICAL INFORMATION SYSTEM
Jun Shimamura, Naokazu Yokoya, Haruo Takemura and Kazumasa Yamazawa
The design of smart glasses for VR applications The CU-GLASSES
CS4670 / 5670: Computer Vision Kavita Bala Lec 27: Stereo.
MSC projects for for CMSC5720(term1), CMSC5721(term2)
Calibration of Multiple Kinect Depth Sensors for Full Surface Model Reconstruction 2016 the first International Workshop on Pattern Recognition (IWPR 2016)
RGBD Camera Integration into CamC Computer Integrated Surgery II Spring, 2015 Han Xiao, under the auspices of Professor Nassir Navab, Bernhard Fuerst and.
Development of VR Glasses
A language assistant system for smart glasses
Atom-Based Embedded System Design at CUHK
To be supervised by Prof. KH Wong
Depth Perception in Medical Augmented Reality
Mean transform , a tutorial
Chapter 1: Image processing and computer vision Introduction
Multiple View Geometry for Robotics
MSC projects for for CMSC5720(term1), CMSC5721(term2)
Image processing and computer vision
Ubiquitous Computing By: Patrick Yienger.
Presentation transcript:

3D Computer Vision and Applications by Prof. K.H. Wong, Computer Science and Engineering Dept. CUHK http://www.cse.cuhk.edu.hk/~khwong/ http://www.cse.cuhk.edu.hk/~khwong/papers.html khwong@cse.cuhk.edu.hk Invited talk at : The Second International Workshop on Pattern Recognition (IWPR2017), Nanyang Executive Centre, Nanyang Technological University, Singapore, May 1-3, 2017 3D Comp. Vision and Applications v7a

Faculty of Engineering, The Chinese University of Hong Kong (CUHK) Electronic Engineering (since 1970) Computer Science & Engineering (since 1973) Information Engineering (since 1989) Systems Engineering & Engineering Management (since 1991) Mechanical and Automation Engineering (since 1994) 110 faculty members 2,200 undergraduates (15% non-local) 800 postgraduates 3D Comp. Vision and Applications v7a

The Chinese University of Hong Kong Department of Computer Science and Engineering 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a Abstract In this seminar, I will talk about various 3-D pose estimation and structure from motion techniques in engineering applications. First, I will discuss the general approaches of feature based pose estimation and structure from motion. Then I will introduce the techniques of Kalman filtering and trifocal tensor for real time pose tracking.  Issues of 3-D vision approaches for virtual reality, projector-camera systems and automatically driving will be elaborated. During the talk, I will also give video demonstrations of some interesting vision based systems we developed in recently years. Finally the opportunities and challenges of 3-D computer vison in the modern mobile era will be discussed. 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a Overview Motivation What is 3-D vision? How it is used? Some previous approaches Recent projects 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a Motivation 3D vision problems Obtain 3D information from 2D images Various applications Virtual reality Augmented reality Automatic driving Education and entertainment 3D Comp. Vision and Applications v7a

Motion of camera from world to camera coordinates Camera motion (rotation=Rc, translation=Tc) will cause change of pixel position (x,y), See p156[1] Yc Camera center Rc,Tc Xc Yw Zw Zc an_y an_z Xw World center Cameras v.3d 3D Comp. Vision and Applications v7a an_x

3D Comp. Vision and Applications v7a 3D to 2D projection Perspective model u=F*X/z v=F*Y/z Virtual Screen or CCD sensor World center Y v v F Z F Real Screen Or CCD sensor Thin lens or a pin hole 3D Comp. Vision and Applications v7a

3D computer vision main tasks: SFM structure from motion Input: only image feature points xt=1, xt=2, xt=3, Output : 3-D structure Model M, Motion (Rotation Rt, Translation Tt of the object ) 3D Model=Xj : where j=feature index =1,2…n features Time (i)  1 2 3 … R1,T1 R2,T2 http://www.youtube.com/watch?v=2KLFRILlOjc http://www.youtube.com/watch?v=RXpX9TJlpd0 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a 3D models from images Camera projection systems Some recent work Wearable devices Virtual tourism 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a Demo1: 3D reconstruction (see also http://www.cse.cuhk.edu.hk/khwong/demo/index.html) Grand Canyon Demo Flask Robot http://www.youtube.com/watch?v=2KLFRILlOjc 2-pass bundle adjustment Algorithm Loop until converge { Find pose based on a guessed model Find model based on a guessed pose } Michael Ming Yuen Chang and Kin Hong Wong, "Model reconstruction and pose acquisition using extended Lowe's method", IEEE Transactions on Multimedia, Volume: 7,   Issue: 2,  April 2005. 3D Comp. Vision and Applications v7a

Demo2: augmented reality (Click picture to see movie) Augmented reality demo http://www.youtube.com/watch?v=gnnQ_OEtj-Y http://www.youtube.com/watch?v=zPbgw-ydB9Y 3D Comp. Vision and Applications v7a

Demo3 Projector camera system (PROCAM) Click pictures to see movies CVPR A Projector-based Movable Hand-held Display System A Hand-held 3D Display System that facilities direct manipulation of 3D virtual objects https://www.youtube.com/watch?v=vVW9QXuKfoQ 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a Demo 4 Flexible projected surface http://www.youtube.com/watch?v=isqg8O9a4LE 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a Demo 5 3-D display without the use of spectacles. http://www.youtube.com/watch?v=oyxR_RT4NNc 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a Demo 6 Spherical projected surface for 3D viewing without spectacles. http://www.youtube.com/watch?v=yVDFcZZ8gDo 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a Demo 7 A KEYSTONE-FREE HAND-HELD MOBILE PROJECTION http://www.youtube.com/watch?v=mbl-BpTnbeA 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a Some theoretic work Kalman filter based SFM structure from motion Single camera Ying Kin YU, Kin Hong Wong and Michael Ming Yuen Chang, "Recursive 3D Model Reconstruction Based on Kalman Filtering", IEEE Transactions on Systems, Man and Cybernetics B, Vol.35, No.3, June 2005 Stereo camera Mohammad Ehab Ragab, Kin Hong Wong, Jun Zhou Chen, Michael Ming-Yuen Chang, "EKF Pose estimation: how many filters and cameras to use?", IEEE International Conference on Image Processing (ICIP08), San Diego, California, U.S.A, October 12–15, 2008 Ying Kin Yu, Kin Hong Wong, Michael Ming Yuen Chang and Siu Hang Or, "Recursive Camera Motion Estimation with Trifocal Tensor", IEEE Transactions on Systems, Man and Cybernetics B, Volume 36,  Issue 5,  Oct. 2006 Page(s):1081 - 1090. Trifocal tensor based, used in SLAM/Tracking tool box For feature points Ying Kin Yu, Kin Hong Wong, Siu Hang Or and Junzhou Chen, "Controlling Virtual Cameras Based on a Robust Model-free Pose Acquisition Technique", IEEE Transactions on Multimedia, No. 1, January 2009, pp. 184-190. For feature lines LEE, Kai Ki; YU Ying Kin; WONG Kin Hong and CHANG Ming Yuen Michael "Tracking 3-D motion from straight lines with trifocal tensors." Multimedia Systems 22.2 (2016): 181-195. For feature and point combined Kai Ki LEE, Ying Kin YU, Kin Hong WONG, “ Recovering camera motion from points and lines in stereo images: A recursive model-less approach using trifocal tensors",17th IEEE/ACIS (SNPD 2017), Japan, 26-26 June 2017 3D Comp. Vision and Applications v7a

Recent work: Trifocal tensor Kalman point/line approach Kai Ki LEE, Ying Kin YU, Kin Hong WONG, “ Recovering camera motion from points and lines in stereo images: A recursive model-less approach using trifocal tensors",17th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, (SNPD 2017), Kanazawa Kinrosha Plaza, Kanazawa, Ishikawa, Japan, 26-26 June 2017. 3D Comp. Vision and Applications v7a

Trifocal tensor vision is used in Automatic driving Y. K. Yu, K. H. Wong, M. M. Y. Chang, and S. H. Or, "Recursive camera-motion estimation with the trifocal tensor." IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) 36.5 (2006): 1081- 1090. Cited by the author of the Kitti dataset and used in the toolbox http://www.cvlibs.net/datasets/kitti/ Kitt, Bernd, Andreas Geiger, and Henning Lategahn. "Visual odometry based on stereo image sequences with ransac-based outlier rejection scheme." Intelligent Vehicles Symposium (IV), 2010 IEEE. IEEE, 2010. 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a Recent work Wearable glasses and beyond Fro optical character recognition OCR Virtual tourism Calibration of Multiple Kinect Depth Sensors for Full Surface Model Reconstruction Robot Avatar: a virtual tourism robot for people with disabilities An efficient 3-D environment scanning method 3D Comp. Vision and Applications v7a

Recent work 1 Wearable glasses Can see through the display Overlay text/images with the surroundings 3D Comp. Vision and Applications v7a

The design of smart glasses for VR applications The CU-GLASSES KH Wong 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a Introduction Glass frame(no lens) Be able to overlay images or text to our normal view Simple, low cost and easy to build Can duplicate for the 2nd eye See through glass Close up Lens CCD display For text or graphics The CU-GLASSES 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a The idea Top down view Eye See through glass CCD display Close up Lens 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a Tests Video link https://youtu.be/i2lpo0DHaWA 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a Translation Real time translate what you see into other languages Education Overlay more information to students when reading books 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a Virtual tourism idea If you are wearing a head mount device such as the  OCULUS (https://www.oculus.com/en-us/), you can feel you are in anywhere you like in the virtual world. To extend it to the real world, we propose to build a robot that carries a set of cameras to capture the images in 3-D at a remote place (e.g. at Tokyo). The 3-D data will be  sent back to the user and displayed through the OCULUS  (e.g. at Hong Kong). There may be extended features, for example, the system can provide tourist or shopping information to users. This system enables users to visit other countries with little cost. Even disabled people can travel far away without leaving their homes. 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a Recent work 2 Calibration of Multiple Kinect Depth Sensors for Full Surface Model Reconstruction 2016 the first International Workshop on Pattern Recognition (IWPR 2016) Tokyo, Japan during May 11-13, 2016 Kwan Pang Tsuia, Kin Hong Wongb*, Changling Wanga, Ho Chuen Kamb, Hing Tuen Yaub, and Ying Kin Yu aDepartment of Mechanical Engineering, The Chinese University of Hong Kong, Hong Kong bDepartment of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong 3D Comp. Vision and Applications v7a

3D Scanning Sensor Types Laser Very Accurate Expensive ( up to US$50,000) Bulky LED Accurate Mid-range price ( > US$ 2000) Infra-red Microsoft Kinect, Structure Sensor Cheap ( US$100~200) Less accurate 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a Theory for method 2 Calibrate the pose between Kinect J and K first. Then K and L etc. Kinect M Kinect L Result Move the checker board and take samples Kinect J Kinect K 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a Recent work 3 Robot Avatar: a virtual tourism robot for people with disabilities Chong Wing Cheung, Tai Ip Tsang, Kin Hong Wong* The 2nd International Conference on Virtual Reality ICVR16, May 20-22, 2016, Chengdu, China. Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong *corresponding author (khwong@cse.cuhk.edu.hk) Download link https://appsrv.cse.cuhk.edu.hk/~khwong/www2/conference/2016/ICVR2016/ICVR20 16.html 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a Introduction To create a robot agent aimed to help the disabled person experiencing the physical environment Generic Robot that used in Virtual Tourism or Bomb detonation, etc. Using Cutting Edge Virtual Reality (VR) Technology Chose Oculus Rift owing to the features, cost and programmability Story-telling +1 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a System overview Story-telling +1 3D Comp. Vision and Applications v7a

The robot with a stereo camera pair Head Mount Device HMD Hand gesture recognition 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a Demos Utube : https://youtu.be/CfkP2Coajpk icvr16_avatar_video.MP4 3D Comp. Vision and Applications v7a

Face tracking/following using a wearable vision systems Eye ball tracking Face folllowing robot Warble extended eye

Recent work 4 An efficient 3-D environment scanning method 24th International Conference in Central Europe on  Computer Graphics, Visualization and Computer Vision 2016 (WSCG 2016) Venue : May 30 - June 3, 2016 Kin Hong Wonga, Ho Chuen Kamaa, Ying Kin Yua, Sheung Lai Loa, Kwan Pang Tsuib, and Hing Tuen Yaua aDepartment of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong bDepartment of Mechanical Engineering, The Chinese University of Hong Kong, Hong Kong 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a The idea 3D Comp. Vision and Applications v7a

3D reconstruction for each Kinect position (the standard method) 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a 3D environment captured using two rotating back-to-back Kinects and Lidar 3D Comp. Vision and Applications v7a

Back to Back Kinect calibration – Overview Methods Linear method Non-linear method: rotation averaging 3D Comp. Vision and Applications v7a

The mirror technique to solve the problem Pose computation between the dual-face checkerboard and the Kinect Need to find out the relative pose (rotation=Rb, translation=tb) between the dual-face checkerboard and the Kinect because they cannot be aligned perfectly Step 1 : Pose estimation through a mirror Places the mirror at n different positions Take picture of checkerboard via mirror using Kinect RGB camera Rotations Ri=1,2,..n obtained via the mirrored checkerboard needed to be converted to the corresponding improper rotations Ȓ i=1,2,..n using the formulas (equation 5) found in [LKL+15] 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a Recent work 5 Ho Yin Fung, Kin Hong Wong, Ying Kin Yu, Kwan Pang Tsui, and Ho Chuen Kam,"Face pose tracking using the four-point pose algorithm", The Second International Workshop on Pattern Recognition (IWPR2017), Nanyang Executive Centre, Nanyang Technological University, Singapore, May 1-3, 2017. 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a Recent work 6 Kin Hong Wong, Ying Kin Yu, Pang Kwan Tsui, Yin Fung Ho, and Ho Chuen Kam,"Robust and efficient pose tracking using perspective-four-point algorithm and Kalman filter", 2017 International Conference on Mechanical, System and Control Engineering (ICMSC 2017), St. Petersburg, Russia, during May 19-21, 2017 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a Recent work 8 Zhe Zhang, Kin Hong Wong, Zhiliang Zeng, Lei Zhu,"A neural network approach to visual tracking", The 15th IAPR Conference on Machine Vision Applications (MVA 2017),  Nagoya University (Toyoda Auditorium), Nagoya, Japan, 8-12, May 2017, FCN tracker 3D Comp. Vision and Applications v7a

3D Comp. Vision and Applications v7a End Q&A 3D Comp. Vision and Applications v7a