Urban Scene Analysis James Elder & Patrick Denis York University.

Slides:



Advertisements
Similar presentations
Practical Camera Auto-Calibration Based on Object Appearance and Motion for Traffic Scene Visual Surveillance Zhaoxiang Zhang, Min Li, Kaiqi Huang and.
Advertisements

Vanishing Point Detection and Tracking
Image Registration  Mapping of Evolution. Registration Goals Assume the correspondences are known Find such f() and g() such that the images are best.
Marked Point Processes for Crowd Counting
Learning to estimate human pose with data driven belief propagation Gang Hua, Ming-Hsuan Yang, Ying Wu CVPR 05.
Computer vision: models, learning and inference
Project 35 Visual Surveillance of Urban Scenes. PROJECT 35: VISUAL SURVEILLANCE OF URBAN SCENES Principal Investigators David Clausi, Waterloo Geoffrey.
Robust Moving Object Detection & Categorization using self- improving classifiers Omar Javed, Saad Ali & Mubarak Shah.
A Bayesian Formulation For 3d Articulated Upper Body Segmentation And Tracking From Dense Disparity Maps Navin Goel Dr Ara V Nefian Dr George Bebis.
Automatic Image Alignment (feature-based) : Computational Photography Alexei Efros, CMU, Fall 2005 with a lot of slides stolen from Steve Seitz and.
Automatic Photo Popup Derek Hoiem Alexei A. Efros Martial Hebert Carnegie Mellon University.
May 2004Stereo1 Introduction to Computer Vision CS / ECE 181B Tuesday, May 11, 2004  Multiple view geometry and stereo  Handout #6 available (check with.
Pattern Recognition. Introduction. Definitions.. Recognition process. Recognition process relates input signal to the stored concepts about the object.
Automatic Image Alignment (feature-based) : Computational Photography Alexei Efros, CMU, Fall 2006 with a lot of slides stolen from Steve Seitz and.
Multiple Object Class Detection with a Generative Model K. Mikolajczyk, B. Leibe and B. Schiele Carolina Galleguillos.
Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.
Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques
Computer vision: models, learning and inference Chapter 6 Learning and Inference in Vision.
A Brief Overview of Computer Vision Jinxiang Chai.
Methods in Medical Image Analysis Statistics of Pattern Recognition: Classification and Clustering Some content provided by Milos Hauskrecht, University.
Final Exam Review CS485/685 Computer Vision Prof. Bebis.
Gwangju Institute of Science and Technology Intelligent Design and Graphics Laboratory Multi-scale tensor voting for feature extraction from unstructured.
Localisation & Navigation
SVCL Automatic detection of object based Region-of-Interest for image compression Sunhyoung Han.
Brief Introduction to Geometry and Vision
A General Framework for Tracking Multiple People from a Moving Camera
3D SLAM for Omni-directional Camera
International Conference on Computer Vision and Graphics, ICCVG ‘2002 Algorithm for Fusion of 3D Scene by Subgraph Isomorphism with Procrustes Analysis.
Dynamic 3D Scene Analysis from a Moving Vehicle Young Ki Baik (CV Lab.) (Wed)
Forward-Scan Sonar Tomographic Reconstruction PHD Filter Multiple Target Tracking Bayesian Multiple Target Tracking in Forward Scan Sonar.
Automatic Image Annotation by Using Concept-Sensitive Salient Objects for Image Content Representation Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu.
Features-based Object Recognition P. Moreels, P. Perona California Institute of Technology.
Using Inactivity to Detect Unusual behavior Presenter : Siang Wang Advisor : Dr. Yen - Ting Chen Date : Motion and video Computing, WMVC.
Kevin Cherry Robert Firth Manohar Karki. Accurate detection of moving objects within scenes with dynamic background, in scenarios where the camera is.
Vision-based human motion analysis: An overview Computer Vision and Image Understanding(2007)
Fields of Experts: A Framework for Learning Image Priors (Mon) Young Ki Baik, Computer Vision Lab.
Vehicle Segmentation and Tracking From a Low-Angle Off-Axis Camera Neeraj K. Kanhere Committee members Dr. Stanley Birchfield Dr. Robert Schalkoff Dr.
Expectation-Maximization (EM) Case Studies
 Present by 陳群元.  Introduction  Previous work  Predicting motion patterns  Spatio-temporal transition distribution  Discerning pedestrians  Experimental.
A Dynamic Conditional Random Field Model for Object Segmentation in Image Sequences Duke University Machine Learning Group Presented by Qiuhua Liu March.
Determining 3D Structure and Motion of Man-made Objects from Corners.
Learning video saliency from human gaze using candidate selection CVPR2013 Poster.
Discriminative Training and Machine Learning Approaches Machine Learning Lab, Dept. of CSIE, NCKU Chih-Pin Liao.
Uncalibrated reconstruction Calibration with a rig Uncalibrated epipolar geometry Ambiguities in image formation Stratified reconstruction Autocalibration.
PROBABILISTIC DETECTION AND GROUPING OF HIGHWAY LANE MARKS James H. Elder York University Eduardo Corral York University.
DISCRIMINATIVELY TRAINED DENSE SURFACE NORMAL ESTIMATION ANDREW SHARP.
Announcements Final is Thursday, March 18, 10:30-12:20 –MGH 287 Sample final out today.
Automatic 3D modelling of Architecture Anthony Dick 1 Phil Torr 2 Roberto Cipolla 1 1 Department of Engineering 2 Microsoft Research, University of Cambridge.
1 Kernel Machines A relatively new learning methodology (1992) derived from statistical learning theory. Became famous when it gave accuracy comparable.
Motion and optical flow
Processing visual information for Computer Vision
Computer vision: models, learning and inference
3D Single Image Scene Reconstruction For Video Surveillance Systems
Marked Point Processes for Crowd Counting
Attentive People Finding
A Forest of Sensors: Using adaptive tracking to classify and monitor activities in a site Eric Grimson AI Lab, Massachusetts Institute of Technology
Adversarially Tuned Scene Generation
Vehicle Segmentation and Tracking in the Presence of Occlusions
Context-Aware Modeling and Recognition of Activities in Video
Eric Grimson, Chris Stauffer,
Marked Point Processes for Crowd Counting
The Brightness Constraint
Markov Random Fields for Edge Classification
PRAKASH CHOCKALINGAM, NALIN PRADEEP, AND STAN BIRCHFIELD
Filtering Things to take away from this lecture An image as a function
Video Compass Jana Kosecka and Wei Zhang George Mason University
Mixture Models with Adaptive Spatial Priors
Filtering An image as a function Digital vs. continuous images
Presentation transcript:

Urban Scene Analysis James Elder & Patrick Denis York University

2 Phase IV Objectives Single-View 3D Reconstruction Scene Dynamics Scene Segmentation and Labelling

Single-View Reconstruction

4

5 Ultimate Goal Our ultimate goal is to automate this process!

6 Immediate Goal Automatic estimation of the three vanishing points corresponding to the “Manhattan directions”.

7 Manhattan Frame Geometry An edge is aligned to a vanishing point if the interpretation plane normal is orthogonal to the vanishing point vector in the Gaussian Sphere (i.e. dot product is 0)

8 Mixture Model Each edge E ij in the image is generated by one of four possible kinds of scene structure: –m 1-3 : a line in one of the three Manhattan directions –m 4 : non-Manhattan structure The observable properties of each edge E ij are: –position –angle The likelihoods of these observations are co-determined by: –The causal process (m 1-4 ) –The rotation Ψ of the Manhattan frame relative to the camera mimi mimi mimi mimi E 11 E 12 E 22 E 21 Ψ Image

9 Mixture Model Our goal is to estimate the Manhattan frame Ψ from the observable data E ij. mimi mimi mimi mimi E 11 E 12 E 22 E 21 Ψ Image

10 E-M Algorithm M Step –Given estimates of the mixture probabilities for each edge, update our estimate of the Manhattan coordinate frame

11

12 Design Criteria Accuracy Speed

13 Design Decisions Features –Dense gradient map –Sparse sub-pixel localized edges Measurement Space –Image –Gauss Sphere Search Method –Coarse-to-Fine (Coughlan & Yuille 2001) –Quasi-Newton –EM –Quasi-EM

14

15

16

17

18

19 Speed Method Time (sec) MW Edge-Based Coarse-to-Fine MW Params Edge-Based Newton MW Params Edge-Based Newton Edge-Based EM Edge-Based Quasi-EM Edge-Based Quasi-EM GS

20

21 Single-View Reconstruction Potential Research Objectives for Phase IV –Recover connected Manhattan cuboids Connected, labelled line segments Connected, labelled rectangular facets –Estimate scale factor From pedestrian, vehicle traffic From building features whose size is approximately known (e.g., doors) –Integrate with other data sources Existing 3D models on coarser scale 3D models from cameras with overlapping fields of view

22 Projects: Pre-Attentive and Attentive Sensing FOVEAL IMAGE WIDE-FIELD IMAGE PAN TILT

23 Statistical Integration of Weak Cues Motion Region Log Likelihood Ratio Foreground Region Log Likelihood Ratio Skin Region Log Likelihood Ratio Joint Region Log Likelihood Ratio

24 Attentive Feedback Loop random sampler posterior prior non-max suppression gaze command likelihood gaze control Attentive sensor high-resolution face detection confirmed face location motion kernel motion kernel mean body indicator spatial prior

25 Wide-Field Person Detection

26 Attentive High-Res Video Surveillance

27 Attentive Snapshots

28 Automatically Confirmed High-Resolution Faces

29 Pose-Invariant Face Recognition (with Simon Prince, UCL)

30 Projects: 3D Facial Estimation and Modelling

31 Scene Dynamics Potential Research Objectives for Phase IV –Person re-identification –Individuation (counting) in crowds

32 Using Prior Knowledge: Example

33 Experimental Results

34 Experimental Results SS vs. MSRC vs. MSEJ vs. MS Mean relative errors Relative Error

35 Scene Segmentation Potential Research Objectives for Phase IV –Application to urban scenes Scene layout –Ground plane –Buildings –Vegetation –Sky Material recognition Integrated text recognition