Video Surveillance E6998 -007 Senior/Feris/Tian 1 Behavior Analysis Rogerio Feris IBM TJ Watson Research Center

Slides:



Advertisements
Similar presentations
Numbers Treasure Hunt Following each question, click on the answer. If correct, the next page will load with a graphic first – these can be used to check.
Advertisements

AKC Rally Signs These are copies of the 2008 AKC Rally signs, as re-drawn by Chuck Shultz. Use them to print your own signs. Be prepared to use a LOT of.
Simplifications of Context-Free Grammars
Variations of the Turing Machine
Zhongxing Telecom Pakistan (Pvt.) Ltd
AP STUDY SESSION 2.
1
1 Vorlesung Informatik 2 Algorithmen und Datenstrukturen (Parallel Algorithms) Robin Pomplun.
Kapitel 14 Recognition – p. 1 Recognition Scene understanding / visual object categorization Pose clustering Object recognition by local features Image.
Video Surveillance E Senior/Feris/Tian 1 Emerging Topics in Video Surveillance Rogerio Feris IBM TJ Watson Research Center
© 2008 Pearson Addison Wesley. All rights reserved Chapter Seven Costs.
Copyright © 2003 Pearson Education, Inc. Slide 1 Computer Systems Organization & Architecture Chapters 8-12 John D. Carpinelli.
Copyright © 2011, Elsevier Inc. All rights reserved. Chapter 6 Author: Julia Richards and R. Scott Hawley.
Author: Julia Richards and R. Scott Hawley
Myra Shields Training Manager Introduction to OvidSP.
STATISTICS Joint and Conditional Distributions
Properties Use, share, or modify this drill on mathematic properties. There is too much material for a single class, so you’ll have to select for your.
Objectives: Generate and describe sequences. Vocabulary:
UNITED NATIONS Shipment Details Report – January 2006.
1 Hyades Command Routing Message flow and data translation.
1 RA I Sub-Regional Training Seminar on CLIMAT&CLIMAT TEMP Reporting Casablanca, Morocco, 20 – 22 December 2005 Status of observing programmes in RA I.
Process a Customer Chapter 2. Process a Customer 2-2 Objectives Understand what defines a Customer Learn how to check for an existing Customer Learn how.
Custom Statutory Programs Chapter 3. Customary Statutory Programs and Titles 3-2 Objectives Add Local Statutory Programs Create Customer Application For.
1 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt BlendsDigraphsShort.
Chapter 6 File Systems 6.1 Files 6.2 Directories
1 Hierarchical Part-Based Human Body Pose Estimation * Ramanan Navaratnam * Arasanathan Thayananthan Prof. Phil Torr * Prof. Roberto Cipolla * University.
Programming Language Concepts
1 Click here to End Presentation Software: Installation and Updates Internet Download CD release NACIS Updates.
Chapter 7: Steady-State Errors 1 ©2000, John Wiley & Sons, Inc. Nise/Control Systems Engineering, 3/e Chapter 7 Steady-State Errors.
Break Time Remaining 10:00.
Turing Machines.
Table 12.1: Cash Flows to a Cash and Carry Trading Strategy.
PP Test Review Sections 6-1 to 6-6
Bright Futures Guidelines Priorities and Screening Tables
EIS Bridge Tool and Staging Tables September 1, 2009 Instructor: Way Poteat Slide: 1.
Bellwork Do the following problem on a ½ sheet of paper and turn in.
Thomas Jellema & Wouter Van Gool 1 Question. 2Answer.
Exarte Bezoek aan de Mediacampus Bachelor in de grafische en digitale media April 2014.
Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.
1 RA III - Regional Training Seminar on CLIMAT&CLIMAT TEMP Reporting Buenos Aires, Argentina, 25 – 27 October 2006 Status of observing programmes in RA.
Basel-ICU-Journal Challenge18/20/ Basel-ICU-Journal Challenge8/20/2014.
1..
CONTROL VISION Set-up. Step 1 Step 2 Step 3 Step 5 Step 4.
Adding Up In Chunks.
1 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt Synthetic.
Artificial Intelligence
1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.
Model and Relationships 6 M 1 M M M M M M M M M M M M M M M M
1 hi at no doifpi me be go we of at be do go hi if me no of pi we Inorder Traversal Inorder traversal. n Visit the left subtree. n Visit the node. n Visit.
Analyzing Genes and Genomes
DTU Informatics Introduction to Medical Image Analysis Rasmus R. Paulsen DTU Informatics TexPoint fonts.
. Lecture #8: - Parameter Estimation for HMM with Hidden States: the Baum Welch Training - Viterbi Training - Extensions of HMM Background Readings: Chapters.
1 Let’s Recapitulate. 2 Regular Languages DFAs NFAs Regular Expressions Regular Grammars.
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
Essential Cell Biology
Converting a Fraction to %
ANSC644 Bioinformatics-Database Mining 1 ANSC644 Bioinformatics §Carl J. Schmidt §051 Townsend Hall §
Clock will move after 1 minute
PSSA Preparation.
Essential Cell Biology
Immunobiology: The Immune System in Health & Disease Sixth Edition
Physics for Scientists & Engineers, 3rd Edition
Energy Generation in Mitochondria and Chlorplasts
Murach’s OS/390 and z/OS JCLChapter 16, Slide 1 © 2002, Mike Murach & Associates, Inc.
1 Decidability continued…. 2 Theorem: For a recursively enumerable language it is undecidable to determine whether is finite Proof: We will reduce the.
Actions in video Monday, April 25 Kristen Grauman UT-Austin.
ICCV 2003UC Berkeley Computer Vision Group Recognizing Action at a Distance A.A. Efros, A.C. Berg, G. Mori, J. Malik UC Berkeley.
Recognizing Action at a Distance A.A. Efros, A.C. Berg, G. Mori, J. Malik UC Berkeley.
Flow Based Action Recognition Papers to discuss: The Representation and Recognition of Action Using Temporal Templates (Bobbick & Davis 2001) Recognizing.
Presentation transcript:

Video Surveillance E Senior/Feris/Tian 1 Behavior Analysis Rogerio Feris IBM TJ Watson Research Center

Video Surveillance E Senior/Feris/Tian 2 Outline Motivation Action Recognition Template-Based Approaches State-Space Approaches Detecting Suspicious Behavior

Video Surveillance E Senior/Feris/Tian 3 Motivation Action Recognition in Surveillance Video Detecting people fighting Falling person detection

Video Surveillance E Senior/Feris/Tian 4 Motivation Detecting suspicious behavior [Boiman and Irani, 2005] Fence Climbing

Video Surveillance E Senior/Feris/Tian 5 Find all locations where objects enter or exit (green) Find all normal routes between these locations- average path and observed deviations. Motivation

Video Surveillance E Senior/Feris/Tian 6 Tracks anomalies (not matching trained routes) Motivation

Video Surveillance E Senior/Feris/Tian 7 Motivation Long-term reasoning / object interaction [Ivanov and Bobick, 2000] Car/person interactions (e.g., car picking up a person)

Video Surveillance E Senior/Feris/Tian 8 Challenges Strong appearance variation in semantically similar events (e.g., people performing actions with different clothing Viewpoint Variation Duration of the action / frame rate Action segmentation – determining beginning and end of the action

Video Surveillance E Senior/Feris/Tian 9 Outline Motivation Action Recognition Template-Based Approaches State-Space Approaches Detecting Suspicious Behavior

Video Surveillance E Senior/Feris/Tian 10 Action Recognition – Template-Based Motion History Image (MHI): Scalar-valued image where brighter pixels correspond to more recently moving pixels Temporal Templates [Bobick and Davis, 1996] Binary image indicating regions of motion

Video Surveillance E Senior/Feris/Tian 11 Action Recognition – Template-Based Motion History Image (MHI): Scalar-valued image where brighter pixels correspond to more recently moving pixels Temporal Templates [Bobick and Davis, 1996]

Video Surveillance E Senior/Feris/Tian 12 Action Recognition – Template-Based At the current frame, statistical descriptors based on moments (translation and scale invariant) are extracted from the current MHI and matched against stored exemplars for classification Three actions: sitting, arm waving, and crouching. View-based approach to handle camera view changes. Problems with ambiguities, occlusions, poor motion segmentation Temporal Templates [Bobick and Davis, 1996]

Video Surveillance E Senior/Feris/Tian 13 Action Recognition – Template-Based Recognizing Action at a Distance [Efros et al, ICCV03] 3-pixel man Blob tracking vast surveillance literature 300-pixel man Limb tracking e.g. Yacoob & Black, Rao & Shah, etc.

Video Surveillance E Senior/Feris/Tian 14 The 30-Pixel Man Action Recognition – Template-Based Recognizing Action at a Distance [Efros et al, ICCV03]

Video Surveillance E Senior/Feris/Tian 15 Action Recognition – Template-Based Appearance versus Motion Recognizing Action at a Distance [Efros et al, ICCV03]

Video Surveillance E Senior/Feris/Tian 16 Tracking Simple correlation-based tracker User-initialized Figure-centric Representation

Video Surveillance E Senior/Feris/Tian 17 input sequence Explain novel motion sequence by matching to previously seen video clips For each frame, match based on some temporal extent Challenge: how to compare motions? motion analysis run walk left swing walk right jog database Action Recognition – Template-Based Recognizing Action at a Distance [Efros et al, ICCV03]

Video Surveillance E Senior/Feris/Tian 18 Spatial Motion Descriptor Image frame Optical flow blurred

Video Surveillance E Senior/Feris/Tian 19 t … … … … Sequence A Sequence B Temporal extent E B frame-to-frame similarity matrix A motion-to-motion similarity matrix A B I matrix E E blurry I E E Two person running sequences - periodic behavior

Video Surveillance E Senior/Feris/Tian 20 Action Recognition – Template-Based Recognizing Action at a Distance [Efros et al, ICCV03] Classification is done for each frame. The spatial-temporal descriptor centered at the current frame is matched against the database of actions (previously stored spatial-temporal descriptors). For each frame of the probe sequence, the maximum score in the corresponding row of the motion-to-motion similarity matrix (between probe and one sequence of the database) will indicate the best match to the spatial-temporal descriptor centered at this frame. K-nearest neighbors is used to determine the action. Good results were demonstrated in sequences related to tennis, soccer, and dancing.

Video Surveillance E Senior/Feris/Tian 21 2D Skeleton Transfer The database is annotated with 2D joint positions After matching, data is transfered to novel sequence Input sequence: Transferred 2D skeletons: Action Recognition – Template-Based Recognizing Action at a Distance [Efros et al, ICCV03]

Video Surveillance E Senior/Feris/Tian 22 Actor Replacement Show Video GregWordCup.avi Action Recognition – Template-Based Recognizing Action at a Distance [Efros et al, ICCV03]

Video Surveillance E Senior/Feris/Tian 23 Action Recognition – Template-Based Proposed for image similarity. Action detection is a particular application Local Self-Similarities [Shechtman and Irani, CVPR07] How to measure similarity in these images?

Video Surveillance E Senior/Feris/Tian 24 Action Recognition – Template-Based Local Self-Similarities [Shechtman and Irani, CVPR07]

Video Surveillance E Senior/Feris/Tian 25 Action Recognition – Template-Based Local Self-Similarities [Shechtman and Irani, CVPR07] The descriptor implicitly handles the similarity between people wearing different clothes. Also, the spatial-temporal log-polar binning allows for better matching under different action durations / frame rate.

Video Surveillance E Senior/Feris/Tian 26 Action Recognition – Template-Based Complex actions performed by different people wearing different clothes with different backgrounds, are detected with no prior learning, based on a single example clip. Local Self-Similarities [Shechtman and Irani, CVPR07]

Video Surveillance E Senior/Feris/Tian 27 Action Recognition – Template-Based Spatial-Temporal Bag of Words [Niebles et al, CVPR06]

Video Surveillance E Senior/Feris/Tian 28 Outline Motivation Action Recognition Template-Based Approaches State-Space Approaches Detecting Suspicious Behavior

Video Surveillance E Senior/Feris/Tian 29 Action Recognition – State-Space Hidden Markov Models [Rabiner, 1989]

Video Surveillance E Senior/Feris/Tian 30 Action Recognition – State-Space Hidden Markov Models [Rabiner, 1989]

Video Surveillance E Senior/Feris/Tian 31 Action Recognition – State-Space Hidden Markov Models [Rabiner, 1989] Three Basic Problems: Forward-Backward Algorithm

Video Surveillance E Senior/Feris/Tian 32 Action Recognition – State-Space Hidden Markov Models [Rabiner, 1989] Three Basic Problems: Viterbi Algorithm

Video Surveillance E Senior/Feris/Tian 33 Action Recognition – State-Space Hidden Markov Models [Rabiner, 1989] Three Basic Problems: Baum-Welch Algorithm

Video Surveillance E Senior/Feris/Tian 34 Action Recognition – State-Space Hidden Markov Models [Rabiner, 1989] Action Recognizer: Learn an HMM model for each action in the database (e.g., HMM for running, HMM for fighting, etc.) – Baum-Welch algorithm Given an action sequence, compare it with all HMMs in the database and select the one which best explains the probe sequence – Forward-Backward algorithm

Video Surveillance E Senior/Feris/Tian 35 Action Recognition – State-Space [Yamato et al, 1992] - First application of HMMs for gesture recognition (for recognizing tennis strokes) From there on HMMs have been extensively applied in many gesture recognition problems (Sign Language Recognition, Head Gesture, etc.) Many variations have been proposed (see e.g., coupled HMMs). More recently, Conditional Random Fields (CRFs) have proven to be very successful to model human motion [ Sminchisescu et al, ICCV 2005]

Video Surveillance E Senior/Feris/Tian 36 Action Recognition – State-Space Modeling Interactions with Stochastic Grammars [Ivanov and Bobick, 2000] Recognize actions with larger temporal range Two-Stage Approach: Detection of low-level discrete events (e.g., using HMMs or tracking) Action Recognition using Stochastic Grammars

Video Surveillance E Senior/Feris/Tian 37 Action Recognition – State-Space Modeling Interactions with Stochastic Grammars [Ivanov and Bobick, 2000] Background: Earley Parsing for Context-free Grammars See description in wikipedia Three main steps: Prediction, Scanning, Completion

Video Surveillance E Senior/Feris/Tian 38 Earley Parsing Example

Video Surveillance E Senior/Feris/Tian 39 Action Recognition – State-Space Modeling Interactions with Stochastic Grammars [Ivanov and Bobick, 2000] Probabilistic Earley Parsing Production rules are augmented with probabilities Parse tree with highest probability is generated [Stolcke, Bayesian Learning of Probabilistic Language Models,1994]

Video Surveillance E Senior/Feris/Tian 40 Action Recognition – State-Space Modeling Interactions with Stochastic Grammars [Ivanov and Bobick, 2000] Car/Person Interaction Low-level discrete event detection Track moving blobs Generate events: {person,car}+{enter,found,exit,lost,stopped}

Video Surveillance E Senior/Feris/Tian 41 Modeling Interactions with Stochastic Grammars [Ivanov and Bobick, 2000]

Video Surveillance E Senior/Feris/Tian 42 Outline Motivation Action Recognition Template-Based Approaches State-Space Approaches Detecting Suspicious Behavior

Video Surveillance E Senior/Feris/Tian 43 Suspicious Behavior Problem: given a few regular examples, compute the likelihood of a new observation Detecting Irregularities [Boiman and Irani, ICCV 2005] DatabaseQuery Construct the likelihood using chuncks of data from the examples. Large matching chunks imply large likelihood.

Video Surveillance E Senior/Feris/Tian 44 Suspicious Behavior Problem: given a few regular examples, compute the likelihood of a new observation Detecting Irregularities [Boiman and Irani, ICCV 2005] Database Construct the likelihood using chuncks of data from the examples. Large matching chunks imply large likelihood. Query

Video Surveillance E Senior/Feris/Tian 45 Suspicious Behavior Detecting Irregularities [Boiman and Irani, ICCV 2005]

Video Surveillance E Senior/Feris/Tian 46 Suspicious Behavior [Zhong et al, Detecting Unusual Activity in Video, CVPR04] See Also: [Stauffer and Grimson, Learning patterns of activity using real-time tracking, 2000] [Lei Chen et al, Robust and fast similarity search for moving object trajectories, 2005] Motion Trajectory Behavior: