Recognition of Human Behaviors with Video Understanding M. Thonnat, F. Bremond and B. Boulay Projet ORION INRIA Sophia Antipolis, France 08/07/2003 Inria/STMicroelectronics.


1 Recognition of Human Behaviors with Video Understanding
M. Thonnat, F. Bremond and B. Boulay
Projet ORION, INRIA Sophia Antipolis, France
08/07/2003 Inria/STMicroelectronics

2 ORION team: Research directions
Intelligent Reusable Systems for Cognitive Vision
- Intelligent: explicit knowledge, reasoning and learning capabilities
- Reusable Systems: different levels of reuse
- Cognitive Vision: image understanding beyond structural vision (semantics)

3 ORION team: Research directions
- Reusable Systems:
  - Program supervision approach
  - Software platform for KBS design
- Cognitive Vision: image understanding
  - Object categorization
  - Video understanding
---> Multidisciplinary team: artificial intelligence, software engineering, computer vision

4 Video Understanding: Main achievements
- a model of the complete process of video understanding (object detection, tracking, scenario recognition)
- a formalism for expressing all high-level concepts (persons, events, scenarios) based on constraints
- a formalism for describing the semantics and 3D geometry of the observed empty scene
- a reusable real-time platform for video understanding

5 Video Understanding: Main issues
- Inferring symbolic concepts from numeric percepts
  - uncertainty management of noisy data
  - formalization of the expertise
- Independence of the models/methods versus:
  - sensors and low-level preprocessing
  - dedicated applications
  - several spatio-temporal scales

6 Video Understanding: Applications
- Strong impact in video understanding for visual surveillance (metro station, bank agencies)
  - 4 European projects (PASSWORDS, AVS-PV, AVS-RTPW, ADVISOR)
  - 2 contracts with French companies (metro, bank)
  - 1 transfer activity with Bull (Dyade action Telescope)
- Video understanding for video communication (Mediaspace)

7 Video understanding for metro surveillance
Objective: to automate the recognition of specific human behaviors from video sequences
---> Interpretation of the videos from pixels to alarms
Diagram labels: Cameras; People detection and tracking; Behavior recognition; Interface for alarm management; ALARM
Example behaviors: « Blocking an exit », « Fighting », « Fraud », « Overcrowding »

8 Video Understanding for metro surveillance
Context: European project ADVISOR (Annotated Digital Video for Intelligent Surveillance and Optimised Retrieval)
- Intelligent video surveillance system for metros
- Problem: 1000 cameras but few human operators
- Automatic real-time selection of the cameras viewing abnormal behaviors
- Automatic annotation of recognised behaviors in a video database using XML
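
The XML annotation step could look like the following minimal sketch. The slides do not show the ADVISOR annotation schema, so the element and attribute names (`behavior`, `camera`, `start`, `end`) and the helper `annotate_behavior` are hypothetical and only illustrate the idea of storing a recognised behavior with its camera and time span.

```python
import xml.etree.ElementTree as ET


def annotate_behavior(camera_id, behavior, start_s, end_s):
    """Build one XML element describing a recognised behavior.

    Element and attribute names are hypothetical, not the ADVISOR schema.
    """
    node = ET.Element("behavior", {"name": behavior, "camera": camera_id})
    ET.SubElement(node, "start").text = f"{start_s:.1f}"  # seconds from sequence start
    ET.SubElement(node, "end").text = f"{end_s:.1f}"
    return node


# Example: a "blocking" behavior seen by camera C10 between 12.0 s and 47.5 s.
annotation = annotate_behavior("C10", "blocking", 12.0, 47.5)
xml_text = ET.tostring(annotation, encoding="unicode")
```

Such elements could then be appended to a per-sequence document and queried later for retrieval.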

9 Video Understanding Platform
Architecture diagram (labels only):
- Motion Detector and F2F Tracker (repeated three times)
- Individual Tracking, Group Tracking, Crowd Tracking
- Mobile objects
- Multi-cameras Combination
- Behaviors Recognition: States, Events, Scenarios
- Scene Models (3D): Scene objects, zones, calibration matrices
- Scenario Models
- Alarms, Annotation

10 Video Understanding: 3D Scene Model
Definition: a priori knowledge of the observed empty scene
- Cameras: 3D position of the sensor, calibration matrix, field of view, ...
- 3D geometry of physical objects (bench, trash, door, walls) and interesting zones (entrance zone) with position, shape and volume
- Semantic information: type (object, zone), characteristics (yellow, fragile) and function (seat)
Role:
- to keep the interpretation independent from the sensors and the sites: many sensors, one 3D referential
- to provide additional knowledge for behavior recognition
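
A minimal sketch of such a scene model, assuming zones are stored as polygons on the ground plane of the common 3D referential. The class names (`Zone`, `SceneObject`, `SceneModel`), the coordinates and the method `zones_at` are hypothetical; cameras and calibration matrices are left out for brevity.

```python
from dataclasses import dataclass, field


@dataclass
class Zone:
    name: str       # e.g. "entrance_zone"
    polygon: list   # ground-plane vertices [(x, y), ...], in metres (assumed unit)

    def contains(self, x, y):
        """Ray-casting point-in-polygon test on the ground plane."""
        inside = False
        n = len(self.polygon)
        for i in range(n):
            x1, y1 = self.polygon[i]
            x2, y2 = self.polygon[(i + 1) % n]
            # Only edges that straddle the horizontal line through y can be crossed.
            if (y1 > y) != (y2 > y):
                x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
                if x < x_cross:
                    inside = not inside
        return inside


@dataclass
class SceneObject:
    name: str                  # e.g. "bench"
    position: tuple            # (x, y, z) in the scene referential
    function: str = ""         # semantic function, e.g. "seat"
    characteristics: list = field(default_factory=list)  # e.g. ["yellow"]


@dataclass
class SceneModel:
    zones: list = field(default_factory=list)
    objects: list = field(default_factory=list)

    def zones_at(self, x, y):
        """Names of all zones containing the ground position (x, y)."""
        return [z.name for z in self.zones if z.contains(x, y)]


scene = SceneModel(
    zones=[Zone("entrance_zone", [(0, 0), (4, 0), (4, 3), (0, 3)])],
    objects=[SceneObject("bench", (6.0, 1.0, 0.0), "seat", ["yellow"])],
)
```

Because positions are expressed in one shared referential, a query such as `scene.zones_at(1, 1)` does not depend on which camera produced the detection.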

11 Video Understanding: Scene Model
Barcelona Metro Station Sagrada Família mezzanine (cameras C10, C11 and C12)

12 Video Understanding
States, Events and Scenarios:
- State: a spatio-temporal property involving one or several actors on a time interval. Ex: « close », « walking », « seated »
- Event: a significant change of states. Ex: « enters », « stands up », « leaves »
- Scenario: a long-term, symbolic, application-dependent activity. Ex: « fighting », « vandalism »

13 Video Understanding
Several types of States:
- posture ∈ {lying, crouching, standing}
- direction ∈ {towards the right, towards the left, leaving, arriving}
- speed ∈ {stopped, walking, running}
- distance/object ∈ {close, far}
- distance/person ∈ {close, far}
- posture/object ∈ {seated, any}
Several types of Events:
- 1 person: falls down, crouches down, stands up, goes right side, goes left side, goes away, arrives, stops, starts running
- 1 person & 1 zone: leaves, enters
- 1 person & 1 equipment: moves close to, sits on, moves away from
- 2 persons: moves close to, moves away from
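
The link between states and events can be illustrated with a small sketch: a numeric speed is mapped to the symbolic speed state, and an event is emitted when that state changes. The thresholds (`walk_min`, `run_min`) and the function names are assumptions for illustration; the slides do not give the actual values or routines.

```python
def speed_state(speed_m_s, walk_min=0.2, run_min=2.5):
    """Map a numeric speed to a symbolic state (thresholds are assumed)."""
    if speed_m_s < walk_min:
        return "stopped"
    if speed_m_s < run_min:
        return "walking"
    return "running"


def speed_events(speeds):
    """Return (frame_index, event) pairs for the changes of the speed state.

    Only the two speed-related events listed for one person are produced:
    "stops" and "starts running". Changes into "walking" produce no event here.
    """
    events = []
    previous = None
    for index, speed in enumerate(speeds):
        state = speed_state(speed)
        if previous is not None and state != previous:
            if state == "stopped":
                events.append((index, "stops"))
            elif state == "running":
                events.append((index, "starts running"))
        previous = state
    return events


# One speed value per frame for a tracked person.
detected = speed_events([1.0, 1.2, 0.0, 0.1, 3.0])
```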

14 Scenario Recognition
We use several formalisms to recognise states, events and scenarios:
- specific routines
- classification
- finite state automaton
- propagation of temporal constraints

15 Scenario Recognition: Automaton
Recognition of the behaviour « A Group of people blocks an Exit »
Diagram labels: Exit zone; Mobile objects Detection; Group Tracking; Blocking
The operator of the scenario “A Group of people blocks an Exit” is based on a finite state automaton.
Automaton states: INIT; Grp x is tracked; Grp x is inside a ZOI; Grp x is stopped in the ZOI > 30 sec; « Blocking »
Automaton transitions: Enter_ZOI, Exit_ZOI, Stops, Start_walking, Start_running
26/02/2003 IDSS
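
A minimal sketch of a finite state automaton for this scenario. The state and transition names come from the slide, but the exact wiring of the transitions is not recoverable from the transcript, so the edges below are an assumption. The class name `BlockingAutomaton`, the short state identifiers and the separate `on_tick` method are hypothetical, and the INIT state is skipped by assuming the group is already tracked.

```python
BLOCKING_DELAY_S = 30.0  # "stopped in the ZOI > 30 sec" on the slide


class BlockingAutomaton:
    """Tracks one group; the transition wiring is an assumed reading of the slide."""

    def __init__(self):
        self.state = "TRACKED"      # assumption: the group is already tracked
        self.stopped_since = None   # time at which the group stopped in the ZOI

    def on_event(self, event, t):
        """Apply a symbolic event received at time t (seconds)."""
        if event == "Enter_ZOI" and self.state == "TRACKED":
            self.state = "INSIDE_ZOI"
        elif event == "Exit_ZOI":
            self.state = "TRACKED"
            self.stopped_since = None
        elif event == "Stops" and self.state == "INSIDE_ZOI":
            self.state = "STOPPED_IN_ZOI"
            self.stopped_since = t
        elif event in ("Start_walking", "Start_running") and self.state in (
            "STOPPED_IN_ZOI",
            "BLOCKING",
        ):
            self.state = "INSIDE_ZOI"
            self.stopped_since = None
        return self.state

    def on_tick(self, t):
        """Check the 30-second condition at time t, independently of events."""
        if (
            self.state == "STOPPED_IN_ZOI"
            and t - self.stopped_since > BLOCKING_DELAY_S
        ):
            self.state = "BLOCKING"
        return self.state
```

In use, `on_event` would be fed by the group tracker and `on_tick` called once per processed frame; an alarm would be raised when the state becomes `"BLOCKING"`.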

16 Scenario Recognition: Temporal constraints
A temporal scenario is made up of three parts:
- Characters: people, physical objects and interesting zones relevant to the scenario.
- Constraints: a set of constraints on the characteristics of the actors and on the states, events and sub-scenarios involving the actors.
- Production: generation of a scenario instance which can be part of more complex scenarios.
Use of symbolic, logical, spatial and temporal constraints, including Allen’s interval algebra operators.
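
For reference, Allen's interval algebra defines thirteen mutually exclusive relations between two time intervals. The sketch below computes which one holds, assuming each interval is a `(start, end)` pair with `start < end`. The function name `allen_relation` and the relation labels are illustrative, not the platform's operators.

```python
def allen_relation(a, b):
    """Return the Allen relation of interval a with respect to interval b.

    Both intervals are (start, end) pairs and are assumed to satisfy start < end.
    """
    (a0, a1), (b0, b1) = a, b
    if a1 < b0:
        return "before"
    if b1 < a0:
        return "after"
    if a1 == b0:
        return "meets"
    if b1 == a0:
        return "met_by"
    if a0 == b0 and a1 == b1:
        return "equals"
    if a0 == b0:
        return "starts" if a1 < b1 else "started_by"
    if a1 == b1:
        return "finishes" if a0 > b0 else "finished_by"
    if b0 < a0 and a1 < b1:
        return "during"
    if a0 < b0 and b1 < a1:
        return "contains"
    # Remaining case: the intervals partially overlap.
    return "overlaps" if a0 < b0 else "overlapped_by"
```

A temporal constraint such as `(s1 before s2)` then amounts to checking that `allen_relation(s1, s2) == "before"`.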

17 Scenario Recognition: Temporal constraints
Vandalism scenario description:
Scenario(vandalism_against_ticket_machine,
  Characters((p : Person), (eq : Equipment, Name = "Ticket_Machine") )
  Constraints((exist ((event s1: p move_close_to eq)
                      (state s2: p stay_at eq)
                      (event s3: p move_away_from eq)
                      (event s4: p move_close_to eq)
                      (state s5: p stay_at eq) )
                     ((s1 != s4) (s2 != s5)
                      (s1 before s2) (s2 before s3)
                      (s3 before s4) (s4 before s5) ) ) )
  Production( (sc : Scenario)
              ( (Name of sc := "vandalism_against_ticket_machine")
                (StartTime of sc := StartTime of s1)
                (EndTime of sc := EndTime of s5) ) ) )
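
To make the description concrete, here is a minimal sketch of how the five sub-instances and their `before` constraints could be checked against a list of already recognised states and events. It is not the platform's constraint propagation algorithm: it is a simple greedy search, and it assumes that every instance in the input already concerns the same person `p` and the same equipment `eq`. The names `Instance`, `PATTERN` and `recognise_vandalism` are hypothetical.

```python
from collections import namedtuple

# One recognised state or event with its time interval (seconds).
Instance = namedtuple("Instance", "name start end")

# s1 .. s5 of the scenario description, in their required temporal order.
PATTERN = ["move_close_to", "stay_at", "move_away_from", "move_close_to", "stay_at"]


def before(a, b):
    """Allen's 'before': a ends strictly before b starts."""
    return a.end < b.start


def recognise_vandalism(instances):
    """Return a scenario instance (dict) if the pattern is found, else None.

    Picking the earliest-ending candidate at each step is enough to decide
    whether a chain s1 before s2 before ... before s5 exists. With non-empty
    intervals, the strict ordering also implies s1 != s4 and s2 != s5.
    """
    chain = []
    for wanted in PATTERN:
        candidates = [
            inst
            for inst in instances
            if inst.name == wanted and (not chain or before(chain[-1], inst))
        ]
        if not candidates:
            return None
        chain.append(min(candidates, key=lambda inst: inst.end))
    # Production part: the scenario spans from the start of s1 to the end of s5.
    return {
        "name": "vandalism_against_ticket_machine",
        "start": chain[0].start,
        "end": chain[-1].end,
    }


observed = [
    Instance("move_close_to", 0, 2),
    Instance("stay_at", 3, 10),
    Instance("move_away_from", 11, 13),
    Instance("move_close_to", 20, 22),
    Instance("stay_at", 23, 40),
]
scenario = recognise_vandalism(observed)
```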

18 Results
Vandalism in metro (Nuremberg)

19 Results
Examples: Brussels and Barcelona Metros
Image panels: Jumping over barrier; Blocking (exit zone); Overcrowding; Fighting
Behavior types shown: individual behavior, group behavior, crowd behavior

20 Results
- Recognition of five behaviors: “Blocking”, “Fighting”, “Jumping over barrier”, “Vandalism” and “Overcrowding”
- Tested on 50 metro sequences (10 hours)
- True positive per sequence: 70% (“Fighting”) to 95% (“Blocking”)
- False positive per sequence: 5% (“Fighting”, “Jumping over barrier”) to 0% (others)

21 Video Understanding for bank surveillance
Multi-camera tracking

22 Conclusion
Hypotheses:
- fixed cameras
- 3D model of the empty scene
- predefined behavior models
Results:
- Behavior understanding for Individuals, Groups of people or Crowd
- an operational language for video understanding (live evaluation at Barcelona Metro)
- a real-time platform (5 to 25 frames/s)

23 Conclusion
Future work:
- Learning techniques to compute an optimal set of parameters and to dynamically configure the platform
- New application domains: home automation (domotics), health, teaching, …
- Recognition of scenarios based on the recognition of postures

