Presentation is loading. Please wait.

Presentation is loading. Please wait.

Po-Hsiang Chen Advisor: Sheng-Jyh Wang 2/13/2012.

Similar presentations


Presentation on theme: "Po-Hsiang Chen Advisor: Sheng-Jyh Wang 2/13/2012."— Presentation transcript:

1 Po-Hsiang Chen Advisor: Sheng-Jyh Wang 2/13/2012

2 Shotton, J., A. Fitzgibbon, et al. (2011). "Real-Time Human Pose Recognition in Parts from Single Depth Images." Microsoft Research Cambridge & Xbox Incubation CVPR 2011 Best Paper Freedman, B., A. Shpunt, et al. (2008). Depth mapping using projected patterns, US 2010/0118123A1 PrimeSense Patent 2/13/20122

3 3 What is Kinect? Kinect Architecture From IR to depth image History of Structured Light PrimeSense Invented Structured Light From depth image to joint positions Body Part Interference Joint Proposals Experiments and Results Conclusion References

4 2/13/20124 What is Kinect? Kinect Architecture From IR to depth image History of Structured Light PrimeSense Invented Structured Light From depth image to joint positions Body Part Interference Joint Proposals Experiments and Results Conclusion References

5 2/13/20125 Motion sensing input device by Microsoft Depth camera tech. developed by PrimeSense Invented in 2005 Software tech. developed by Rare First announced at E3 2009 as “Project Natal” Windows SDK Releases http://www.microsoft.com /en-us/kinectforwindows/ discover/features.aspx

6 2/13/20126

7 7 What is Kinect? Kinect Architecture From IR to depth image History of Structured Light PrimeSense Invented Structured Light From depth image to joint positions Body Part Interference Joint Proposals Experiments and Results Conclusion References

8 2/13/20128 Depth Image Body Parts Joint Position IR Structured Light Random Decision Forest Mean Shift

9 2/13/20129 What is Kinect? Kinect Architecture From IR to depth image History of Structured Light PrimeSense Invented Structured Light From depth image to joint positions Body Part Interference Joint Proposals Experiments and Results Conclusion References

10 2/13/201210

11 2/13/201211 Main Problem To recover shape from multiple views, need CORRESPONDENCES between the images Matching/Correspondence problem is hard Occlusions, Texture, Colors.. Etc. Solution: Structured light Idea: Simplify matching Strategy: Use illumination to create your own correspondences

12 2/13/201212 Basic Principle Use a projector to create unambiguous correspondences Light projection If we project a single point, matching is unique

13 2/13/201213 Line projection ( Line Scan ) For calibrated cameras, the epipolar geometry is known Project a line instead of a single point

14 2/13/201214 Project Multiple Stripes or Grids Which stripe matches which? Correspondence Again

15 2/13/201215 Answer 1: Assume Surface Continuity Ordering Constraint

16 2/13/201216 Answer 2: Coloured stripes (De Bruijn) Difficult to use for coloured surfaces

17 2/13/201217 Answer 2: Coloured dots (M-array) Difficult to use for coloured surfaces

18 2/13/201218 Answer 3: Pattern dots (M-array) Difficult for industrial manufacturing

19 2/13/201219 Answer 4: Time-coded light patterns (Time multiplexing) Use a sequence of binary patterns → (log N) images Each stripe has a unique binary illumination code

20 2/13/201220 All of the above are categorized as Discrete Methods There are a lot more Continuous Structured Light Methods such as Phase shifting and etc. Salvi, J., S. Fernandez, et al. (2010). "A state of the art in structured light patterns for surface profilometry." Pattern Recognition 43(8): 2666-2680

21 2/13/201221 All of the above are human designed patterns. Random Speckle Structured light using randomly generated patterns May obtain denser depth information by solving correspondence problem

22 2/13/201222 A Projector is just an inverse of a camera One projector and one camera is enough for triangulation Need Calibration

23 2/13/201223 US 2010/0118123 Projector-Camera system Already calibrated structure δZ results in δX in 32

24 2/13/201224 US 2010/0118123 Structured Light-1 Pseudo-random distribution Local: Random Global: Gray level decreases Can make a rough estimate in a low resolution image

25 2/13/201225 US 2010/0118123 Structured Light-2 Quasi-periodic pattern Five-fold symmetry Results in distinct peaks in freq. domain Contain no unit cell repeats over spatial domain Use to reduce noise and ambient light in environment

26 2/13/201226

27 2/13/201227 US 2010/0290698

28 2/13/201228 US 2010/0290698 Uses a special (“astigmatic”) lens with different focal length in x- and y- directions Orientation of the circle indicates depth

29 2/13/201229 What is Kinect? Kinect Architecture From IR to depth image History of Structured Light PrimeSense Invented Structured Light From depth image to joint positions Body Part Interference Joint Proposals Experiments and Results Conclusion References

30 2/13/201230 Shotton, J., A. Fitzgibbon, et al. (2011). "Real-Time Human Pose Recognition in Parts from Single Depth Images." Microsoft Research Cambridge & Xbox Incubation Treat body segmentation as a per-pixel classification task ( No pairwise term or CRF is used ) Algorithms runs 5ms per frame on Xbox GPU Novelty: Intermediate body parts representation

31 2/13/201231 Body part labeling 31 body parts Distinct parts for left and right allow classifier to disambiguate the left and right sides of the body

32 2/13/201232 Depth image features dI(x) is the depth at pixel x in image I θ=(u,v) describe offsets u and v Each feature need only read at most 3 image pixels and perform at most 5 arithmetic operations

33 2/13/201233 Fast and effective multi-class classifier Each split node consists of a feature fθ and a threshold τ At the leaf node in tree t, given a learned Final classification

34 2/13/201234 Multiple classifiers work together Committees E.g. Averaging the predictions of a set of individual models E.g. Majority votes Boosting Classifiers trained in sequence E.g. AdaBoost Decision Tree Binary selection corresponding to the traversal of a tree

35 2/13/201235 Three major aspect A splitting criterion A stop-splitting rule A rule to assign each leaf to a specific class Decision Forests A Decision Tree Committee

36 2/13/201236 Fast and effective multi-class classifier Each split node consists of a feature fθ and a threshold τ At the leaf node in tree t, given a learned Final classification How to train?

37 2/13/201237 Training Each tree train on different images Each image pick 2000 example pixels Algorithm

38 2/13/201238 Algorithm(cont.) Shannon entropy given Z on Y

39 2/13/201239 Algorithm(cont.) Training takes a lot of efforts 3 trees with depth 20 from 1 million images takes about a day on a 1000 core cluster Where are those training data?

40 2/13/201240 Depth imaging Simplify the task of background subtraction Most important: easy to synthesize!!! Take Real Images Learning Synthesize Parameters Generate Lots of training data

41 2/13/201241 Depth Image Body Parts Joint Position IR Structured Light Random Decision Forest Mean Shift

42 2/13/201242 From the previous section, Use Mean Shift with a weighted Gaussian kernel

43 2/13/201243 Kernel density estimator Discrete points -> Continuous function Calculate the gradient at initial point and shift Iterate till stop

44 2/13/201244 What is Kinect? Kinect Architecture From IR to depth image History of Structured Light PrimeSense Invented Structured Light From depth image to joint positions Body Part Interference Joint Proposals Experiments and Results Conclusion References

45 2/13/201245 Synthetic Real

46 2/13/201246 Failure

47 2/13/201247 Training parameters vs. classification accuracy

48 2/13/201248 Comparisons

49 2/13/201249 What is Kinect? Kinect Architecture From IR to depth image History of Structured Light PrimeSense Invented Structured Light From depth image to joint positions Body Part Interference Joint Proposals Experiments and Results Conclusion References

50 2/13/201250 Depth images may contain enough information to solve human pose problems Depth images are color and texture invariant, which simplifies a lot of the corresponding problem A deep combining model with sufficient training data can become a good classifier even with simple features Buy a Kinect for LAB

51 2/13/201251 What is Kinect? Kinect Architecture From IR to depth image History of Structured Light PrimeSense Invented Structured Light From depth image to joint positions Body Part Interference Joint Proposals Experiments and Results Conclusion References

52 Shotton, J., A. Fitzgibbon, et al. (2011). "Real-Time Human Pose Recognition in Parts from Single Depth Images." Microsoft Research Cambridge & Xbox Incubation Freedman, B., A. Shpunt, et al. (2008). Depth mapping using projected patterns, US 2010/0118123A1 Freedman, B., A. Shpunt, et al. (2008). Distance-Varying Illumination and Imaging Techniques for Depth Mapping, US 2010/0290698A1 2/13/201252

53 2/13/201253 Salvi, J., S. Fernandez, et al. (2010). "A state of the art in structured light patterns for surface profilometry." Pattern Recognition 43(8): 2666-2680. Albitar, I., P. Graebling, et al. (2007). “Robust structured light coding for 3D reconstruction,” IEEE. Scharstein, D. and R. Szeliski (2003). “High-accuracy stereo depth maps using structured light,” IEEE. Breiman, L. (2001). "Random forests." Machine learning 45(1): 5-32. Amit, Y. and D. Geman (1997). "Shape quantization and recognition with randomized trees." Neural computation 9(7): 1545-1588.

54 2/13/201254 John MacCormick, “How does the Kinect work? ” users.dickinson.edu/~jmac/selected-talks/kinect.pdf “Structured Light”, www.igp.ethz.ch/photogrammetry/.../MV-SS2011- structured.pdf www.igp.ethz.ch/photogrammetry/.../MV-SS2011- structured.pdf http://en.wikipedia.org/wiki/Kinect http://en.wikipedia.org/wiki/Structured-light_3D_scanner http://en.wikipedia.org/wiki/Triangulation http://dms.irb.hr/tutorial/tut_dtrees.php http://www.anandtech.com/show/4057/microsoft-kinect- the-anandtech-review/2 http://www.anandtech.com/show/4057/microsoft-kinect- the-anandtech-review/2 Chen, Y. S. and B. T. Chen (2003). "Measuring of a three- dimensional surface by use of a spatial distance computation." Applied optics 42(11): 1958-1972.


Download ppt "Po-Hsiang Chen Advisor: Sheng-Jyh Wang 2/13/2012."

Similar presentations


Ads by Google