1 Fast Computational Methods for Visually Guided Robots. Maryam Mahdaviani, Nando de Freitas, Bob Fraser and Firas Hamze, Department of Computer Science, University of British Columbia, CANADA. We apply semi-supervised and active learning algorithms (Zhu et al., 2003) for interactive object recognition in visually guided robots. These algorithms cost O(M³), but we will show that the cost can be reduced to O(M). We will also reduce the storage requirement from O(M²) to O(M).

2 Object recognition with semi-supervised data and simple color features

3 Aibo is able to identify objects in different settings. Aibo can learn and classify several objects at the same time.

4 Semi-supervised Learning. We have: input data points x_i and a small number of labels y_l. We want: a full labeling of the data. Nearby points x_i and x_j are connected in a similarity graph with edge weights w_ij.

5 Semi-supervised Learning Leads to a Linear System of Equations. Differentiating the error function and equating it to zero gives the solution in terms of a linear system of equations (Zhu et al., 2003): in their notation, the harmonic solution for the unlabeled points is f_u = (D_uu − W_uu)⁻¹ W_ul f_l, where W is the adjacency matrix and D is its diagonal degree matrix.
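As a concrete baseline, the harmonic solution can be computed with a dense solver; this is exactly the O(M³) step the rest of the poster speeds up. A minimal sketch, with an assumed Gaussian similarity (the function name and `sigma` are illustrative, not the poster's code):

```python
import numpy as np

def harmonic_labels(X, y_l, labeled, sigma=1.0):
    """Harmonic-function semi-supervised labeling (Zhu et al., 2003 style).

    X: (M, d) points; y_l: labels (0/1) for the labeled subset, in index
    order; labeled: boolean mask of length M.  Returns soft labels in [0, 1].
    """
    # Gaussian adjacency W (assumed similarity) and degree matrix D
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
    W = np.exp(-d2 / (2.0 * sigma**2))
    np.fill_diagonal(W, 0.0)
    D = np.diag(W.sum(axis=1))
    u = ~labeled
    L = D - W                               # graph Laplacian
    f = np.empty(len(X))
    f[labeled] = y_l
    # Solve (D_uu - W_uu) f_u = W_ul f_l  -- the O(M^3) dense solve
    f[u] = np.linalg.solve(L[np.ix_(u, u)], W[np.ix_(u, labeled)] @ y_l)
    return f
```

With two well-separated clusters and one labeled point in each, the unlabeled points inherit the label of their own cluster.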

6 The big computational bottleneck is M³. How large is M? 1955: M = 20; 1965: M = 200; 1980: M = 2,000; 1995: M = 20,000; 2005: M = 200,000. Solving the linear system of equations costs O(M³), where M is the (large) number of unlabeled points. Over these 50 years M has increased by a factor of 10⁴, so the O(M³) cost has increased by a factor of 10¹², matching the 10¹²-fold increase in computer speed over the same period: the O(M³) bottleneck swallows the entire hardware gain.

7 From O(M³) to O(M²): Krylov Iterative Methods (MINRES). Using iterative methods (Krylov-subspace methods are well known to work best here), the cost can be reduced to O(M²) times the number of iterations. The expensive step in each iteration is the matrix-vector multiplication with the kernel matrix W. This product can be written as two O(M²) Gaussian kernel estimates, and these kernel estimates can be evaluated in O(M) operations using the Fast Gauss Transform.
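A minimal SciPy sketch of this idea: hand MINRES a LinearOperator whose matvec performs the kernel sums, so the M×M matrix is never formed or stored. The diagonal regularizer `lam` is an assumption added here to keep this toy system nonsingular (in the poster's system the labeled-point terms play that role), and the kernel sum is still done naively in O(M²); the Fast Gauss Transform of the next slide is what replaces it.

```python
import numpy as np
from scipy.sparse.linalg import LinearOperator, minres

rng = np.random.default_rng(0)
M, sigma, lam = 300, 1.0, 1.0
X = rng.normal(size=(M, 2))
d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
W = np.exp(-d2 / (2.0 * sigma**2))
np.fill_diagonal(W, 0.0)
deg = W.sum(axis=1)

def matvec(v):
    # (D - W + lam*I) v.  The product W @ v is the Gaussian kernel sum
    # that the Fast Gauss Transform evaluates in O(M) instead of O(M^2).
    # lam is an assumed diagonal regularizer for this self-contained demo.
    return deg * v - W @ v + lam * v

A = LinearOperator((M, M), matvec=matvec, dtype=float)
b = rng.normal(size=M)
f, info = minres(A, b)      # symmetric system; matrix never formed
```

MINRES is a natural fit because the regularized Laplacian is symmetric; each iteration costs one matvec.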

8 From O(M²) to O(M): The Fast Gauss Transform (L. Greengard and V. Rokhlin, 1987). Intuition: nearby sources are grouped together and their combined contribution is summarized by a short series expansion. The storage requirement is also reduced from O(M²) to O(M)!
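A single-cluster, one-dimensional sketch of the core FGT idea (the full algorithm tiles space into boxes so the clustering assumption holds locally; function names and parameters here are illustrative): collect p moments of the sources about a center in O(Mp), then evaluate a truncated Hermite expansion at the targets in O(Np), instead of the O(MN) nested loop.

```python
import math
import numpy as np

def gauss_sum_naive(x, y, q, delta):
    # Direct O(M*N) evaluation: G(y_j) = sum_i q_i exp(-(y_j - x_i)^2 / delta)
    return np.exp(-((y[:, None] - x[None, :]) ** 2) / delta) @ q

def gauss_sum_hermite(x, y, q, delta, p=12):
    # Single-center truncated Hermite expansion (the core FGT idea).
    # Accurate when |x_i - c| / sqrt(delta) is small; the real FGT makes
    # this hold locally by splitting space into boxes.
    c = x.mean()
    s = (x - c) / math.sqrt(delta)          # scaled source offsets
    t = (y - c) / math.sqrt(delta)          # scaled target offsets
    # Moments A_n = (1/n!) sum_i q_i s_i^n   -- O(M*p)
    A = [np.sum(q * s**n) / math.factorial(n) for n in range(p)]
    # Hermite functions h_n(t) = H_n(t) exp(-t^2) by recurrence -- O(N*p)
    h_prev = np.exp(-t**2)                  # h_0(t)
    h_curr = 2.0 * t * h_prev               # h_1(t)
    G = A[0] * h_prev + A[1] * h_curr
    for n in range(1, p - 1):
        h_prev, h_curr = h_curr, 2.0 * t * h_curr - 2.0 * n * h_prev
        G = G + A[n + 1] * h_curr
    return G
```

For clustered sources the truncated expansion matches the direct sum to near machine precision with small p.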

9 Training in Real Time

10 Predicting Pixel Labels. Once we have labels for the M points in our training data, we use a classical kernel discriminant to label the N test pixels. The cost is O(NM)! By applying the Fast Gauss Transform, the cost can be reduced to O(N + M).
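The poster does not spell out the exact discriminant, so the sketch below assumes a Nadaraya-Watson-style kernel classifier, one classical choice: both the numerator and denominator are Gaussian kernel sums, O(NM) evaluated directly and O(N + M) with the Fast Gauss Transform.

```python
import numpy as np

def kernel_discriminant(X_train, f_train, X_test, sigma=1.0):
    """Nadaraya-Watson-style kernel discriminant (an assumed form).

    f_train: soft labels in [0, 1] for the M training points.
    Returns soft labels for the N test points; threshold at 0.5.
    """
    d2 = ((X_test[:, None, :] - X_train[None, :, :]) ** 2).sum(-1)
    K = np.exp(-d2 / (2.0 * sigma**2))
    num = K @ f_train        # sum_i f_i K(x, x_i)  -- one kernel sum
    den = K.sum(axis=1)      # sum_i K(x, x_i)      -- another kernel sum
    return num / den
```

Test pixels near either training cluster pick up that cluster's label.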

11 Predicting Pixel Labels in Real Time

12 Active Learning. Labeling data is an expensive process, so we use active learning to choose automatically which pixels should be labeled. Active learning calls the semi-supervised learning subroutine at each iteration.

13 Active learning: asking the right questions. Aibo recognizes the ball without a problem, but because the orange ring is close to the ball in colour space, Aibo gets confused and prompts the user for labels. We want the robot to ask the right questions: it prompts the user for the labels that will improve its performance the most.
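The outer loop can be sketched generically. The poster does not specify its query-selection criterion, so the sketch below assumes plain uncertainty sampling (query the point whose soft label is closest to 0.5); `oracle` stands in for the user the robot prompts, and `soft_label_ssl` is a deliberately simple stand-in for the harmonic solver.

```python
import numpy as np

def soft_label_ssl(X, y_l, labeled, sigma=1.0):
    # Stand-in semi-supervised solver: kernel-weighted average of the
    # labeled points (a simple placeholder for the harmonic solution).
    d2 = ((X[:, None, :] - X[labeled][None, :, :]) ** 2).sum(-1)
    K = np.exp(-d2 / (2.0 * sigma**2))
    return K @ y_l / K.sum(axis=1)

def active_learning_loop(X, oracle, n_queries, solve_ssl, init_idx):
    """Active learning: each round reruns the SSL subroutine, then asks
    the oracle to label the most ambiguous point (assumed criterion)."""
    labeled = np.zeros(len(X), dtype=bool)
    y = np.zeros(len(X))
    for i in init_idx:
        labeled[i] = True
        y[i] = oracle(i)
    for _ in range(n_queries):
        f = solve_ssl(X, y[labeled], labeled)       # SSL call per iteration
        cand = np.where(~labeled)[0]
        q = cand[np.argmin(np.abs(f[cand] - 0.5))]  # closest to 0.5
        labeled[q] = True
        y[q] = oracle(q)
    return solve_ssl(X, y[labeled], labeled), labeled
```

After a few queries the soft labels separate the two classes with only a handful of labeled points.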

14 We reduced the computational cost from O(M³) to O(M) and the storage requirement from O(M²) to O(M). We are currently using more sophisticated features (SIFT) and dual KD-tree recursion methods to deal with high dimensions. These methods can also be applied to other problems such as SLAM, segmentation, ranking and Gaussian processes. Thank you! Questions?

15 One solution: the Power Method. It costs O(N²) per iteration, but it might take too many iterations to converge.
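Reading this slide's "power method" as the fixed-point label-propagation iteration f ← Pf with labels clamped (an assumption; the slide shows no details), a sketch makes both points concrete: each sweep is one dense O(M²) matvec, and the number of sweeps depends on the spectral gap, which can be small.

```python
import numpy as np

rng = np.random.default_rng(1)
X = np.vstack([rng.normal(size=(100, 2)) * 0.3,
               rng.normal(size=(100, 2)) * 0.3 + 4.0])
M = len(X)
d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
W = np.exp(-d2 / 2.0)
np.fill_diagonal(W, 0.0)
P = W / W.sum(axis=1, keepdims=True)        # row-stochastic D^{-1} W

labeled = np.zeros(M, dtype=bool)
labeled[[0, 1, 100, 101]] = True
y = np.array([0.0, 0.0, 1.0, 1.0])

# Power-method-style iteration: repeatedly apply f <- P f with the
# labeled entries clamped.  One sweep = one dense matvec, O(M^2);
# the sweep count depends on the spectral gap and can be large.
f = np.full(M, 0.5)
f[labeled] = y
for it in range(5000):
    f_new = P @ f
    f_new[labeled] = y
    done = np.max(np.abs(f_new - f)) < 1e-7
    f = f_new
    if done:
        break
```

On this toy problem the iteration does converge to a harmonic-style labeling, but only after many O(M²) sweeps, which is the drawback the slide notes.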

16 The Fast Gauss Transform: Reduction of Complexity. Straightforward (nested-loop) evaluation of the Gaussian kernel sums costs O(MN).

17 The Fast Gauss Transform: Reduction of Complexity. Truncating the expansion at order p ≪ N reduces the cost to O(p(M + N)).


19 Training Set Time Comparison (computational time in seconds)

M      Naïve     MINRES    MINRES-FGT
60     0.521703  0.119514  0.312126
120    4.23732   0.250050  0.518589
240    78.7864   0.729464  0.791181
480    501.46    2.56246   1.165930
960    --------  63.9487   2.02537
1920   --------  497.59    3.97674

20 Test Set Time Comparison (computational time in seconds)

N      Naïve      FGT
260    0.0036083  0.035944
520    0.128507   0.113086
1040   0.458446   0.178275
2080   1.69306    0.321210
4160   6.62728    0.682747
8320   20.56953   0.858313

21 Krylov Subspace Methods: the MINRES Algorithm. The cost can be reduced to O(M²) times the number of iterations.

