Recognition of Human Behaviors with Video Understanding M. Thonnat, F. Bremond and B. Boulay Projet ORION INRIA Sophia Antipolis, France 08/07/2003 Inria/STMicroelectronics.

Slides:

Advertisements

Similar presentations

GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.

Advertisements

Computing and Communications Technologies: Quo Vadis ? Athens, 20 May 2001 Horst Forster European Commission ATH0.

HealthCare Monitoring: GERHOME Project Monique Thonnat, Francois Bremond & Nadia Zouba PULSAR, INRIA Date.

1 Early Pest Detection in Greenhouses Vincent Martin, Sabine Moisan INRIA Sophia Antipolis Méditerranée, Pulsar project-team, France.

ETISEO Project Corpus data - Video sequences contents - Silogic provider.

By: Ryan Wendel.  It is an ongoing analysis in which videos are analyzed frame by frame  Most of the video recognition is pulled from 3-D graphic engines.

Automated Shot Boundary Detection in VIRS DJ Park Computer Science Department The University of Iowa.

Towards a Video Camera Network for Early Pest Detection in Greenhouses

New Kind of Logic The first step to approch this questions consists of a new definition of logic operators able to explain the richness of the events happened.

Visual Event Detection & Recognition Filiz Bunyak Ersoy, Ph.D. student Smart Engineering Systems Lab.

PULSAR Perception Understanding Learning Systems for Activity Recognition Theme: Cognitive Systems Cog C Multimedia data: interpretation and man-machine.

Object-Oriented Analysis and Design

O NTOLOGICAL R EPRESENTATION OF C ONTEXT K NOWLEDGE FOR V ISUAL D ATA F USION Juan Gómez Romero Miguel A. Patricio Jesús García José M. Molina Applied.

KAIST CS780 Topics in Interactive Computer Graphics : Crowd Simulation A Task Definition Language for Virtual Agents WSCG’03 Spyros Vosinakis, Themis Panayiotopoulos.

Nice, 17/18 December 2001 Autonomous mapping of natural fields using Random Closed Set Models Stefan Rolfes, Maria Joao Rendas

Knowledge Acquisitioning. Definition The transfer and transformation of potential problem solving expertise from some knowledge source to a program.

SeCoGIS - Bercellona, October 20, 2008 An ontology-based approach for the semantic modeling and reasoning on trajectories Miriam Baglioni Jose de Macedo.

Vision Computing An Introduction. Visual Perception Sight is our most impressive sense. It gives us, without conscious effort, detailed information about.

ORION Project-team Monique THONNAT INRIA Sophia Antipolis Creation: July 1995 Multidisciplinary team: artificial intelligence, software engineering, computer.

Vigilant Real-time storage and intelligent retrieval of visual surveillance data Dr Graeme A. Jones.

Trip Report for The IASTED International Conference on Internet and Multimedia Systems and Applications (EuroIMSA 2006) February 13-15, 2006 Innsbruck,

Learning to classify the visual dynamics of a scene Nicoletta Noceti Università degli Studi di Genova Corso di Dottorato.

Microsoft Research Innovation at the Frontiers of Computing Andrew Herbert Chairman, Microsoft Research, EMEA 31st March 2011.

An approach to Intelligent Information Fusion in Sensor Saturated Urban Environments Charalampos Doulaverakis Centre for Research and Technology Hellas.

Knowledge representation

Trends in Computer Vision Automatic Video Surveillance.

Probabilistic Context Free Grammars for Representing Action Song Mao November 14, 2000.

© 2007 Tom Beckman Features:  Are autonomous software entities that act as a user’s assistant to perform discrete tasks, simplifying or completely automating.

Situated Design of Virtual Worlds Using Rational Agents Mary Lou Maher and Ning Gu Key Centre of Design Computing and Cognition University of Sydney.

Orion Image Understanding for Object Recognition Monique Thonnat INRIA Sophia Antipolis.

Benjamin Gamble. What is Time?  Can mean many different things to a computer Dynamic Equation Variable System State 2.

CSCE 5013 Computer Vision Fall 2011 Prof. John Gauch

Towards Cognitive Vision: Knowledge and Reasoning for Image Analysis and Understanding Monique THONNAT Orion team INRIA Sophia Antipolis FRANCE.

Towards real-time camera based logos detection Mathieu Delalandre Laboratory of Computer Science, RFAI group, Tours city, France Osaka Prefecture Partnership.

ETISEO Benoît GEORIS, François BREMOND and Monique THONNAT ORION Team, INRIA Sophia Antipolis, France Nice, May th 2005.

1 Scene Understanding perception, multi-sensor fusion, spatio-temporal reasoning and activity recognition. Francois BREMOND PULSAR project-team, INRIA.

Project on Visual Monitoring of Human Behavior and Recognition of Human Behavior in Metro Stations with Video Understanding M. Thonnat Projet ORION INRIA.

A Common Ground for Virtual Humans: Using an Ontology in a Natural Language Oriented Virtual Human Architecture Arno Hartholt (ICT), Thomas Russ (ISI),

 Designed to monitor the movement of people in given area.  Used video cameras to transmit a signal to a specific place on a limited set of monitors.

Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.

Workshop on Semantic Knowledge in Computer Vision, ICCV 2005 Symbol Grounding for Semantic Image Interpretation: From Image Data to Semantics Céline Hudelot,

Cognitive Computer Vision Kingsley Sage and Hilary Buxton Prepared under ECVision Specific Action 8-3

1 1. Representing and Parameterizing Agent Behaviors Jan Allbeck and Norm Badler 연세대학교 컴퓨터과학과 로봇 공학 특강 학기 유 지 오.

A Cognitive Vision Platform for Semantic Image Understanding Monique THONNAT and Celine HUDELOT Orion team INRIA Sophia Antipolis FRANCE.

AVITRACK Project FP INRIA WP1 - Apron Activity Model WP3 - Scene Tracking WP4 - Scene Understanding Brussels, January 17th 2006.

Raquel A. Romano 1 Scientific Computing Seminar May 12, 2004 Projective Geometry for Computer Vision Projective Geometry for Computer Vision Raquel A.

Chapter 4 Decision Support System & Artificial Intelligence.

Activity Monitoring October 19-20, 1999 DARPADARPA Bob Bolles, Brian Burns, Marty Fischler, Ravi Gopalan, Marsha Jo Hannah, Dave Scott SRI International.

Jack Pinches INFO410 & INFO350 S INFORMATION SCIENCE Computer Vision I.

VISION for Security Monique THONNAT ORION INRIA Sophia Antipolis.

1 Introduction to Cognitive systems Perception, indexing and communication for images and video Marc Berthod.

Tracking Groups of People for Video Surveillance Xinzhen(Elaine) Wang Advisor: Dr.Longin Latecki.

Slide no 1 Cognitive Systems in FP6 scope and focus Colette Maloney DG Information Society.

Object Oriented Analysis Unified Modeling Language By Mary Biddle.

  Computer vision is a field that includes methods for acquiring,prcessing, analyzing, and understanding images and, in general, high-dimensional data.

By Akhilesh K. Sinha Nishant Singh Supervised by Prof. Amitabha Mukerjee Video Surveillance of Basketball Matches and Goal Detection Indian Institute of.

GraphiCon 2008 | 1 Trajectory classification based on Hidden Markov Models Jozef Mlích and Petr Chmelař Brno University of Technology, Faculty of Information.

Visual Information Processing. Human Perception V.S. Machine Perception  Human perception: pictorial information improvement for human interpretation.

Research and Future Perspectives on Intelligent Video Surveillance Systems Monique THONNAT Senior Scientist Head of Orion research team INRIA Sophia Antipolis.

REAL-TIME DETECTOR FOR UNUSUAL BEHAVIOR

Processing visual information for Computer Vision

Role of Data Quality in GIS Decision Support Tools

Learning Fast and Slow John E. Laird

A Forest of Sensors: Using adaptive tracking to classify and monitor activities in a site Eric Grimson AI Lab, Massachusetts Institute of Technology

Scene Understanding Francois BREMOND

Video-based human motion recognition using 3D mocap data

Course Instructor: knza ch

Eric Grimson, Chris Stauffer,

Knowledge-based event recognition from salient regions of activity

Presentation transcript:

Recognition of Human Behaviors with Video Understanding M. Thonnat, F. Bremond and B. Boulay Projet ORION INRIA Sophia Antipolis, France 08/07/2003 Inria/STMicroelectronics

2 Intelligent Reusable Systems for Cognitive Vision Intelligent: explicit knowledge, reasoning and learning capabilities Reusable Systems: different levels of reuse Cognitive Vision: image understanding beyond structural vision (semantics) ORION team: Research directions

08/07/2003 Inria/STMicroelectronics 3 Reusable Systems: Program supervision approach Software platform for KBS design Cognitive Vision: image understanding Object categorization Video understanding  Multidisciplinary team: artificial intelligence, software engineering, computer vision ORION team: Research directions

08/07/2003 Inria/STMicroelectronics 4 a model of the complete process of video understanding (object detection, tracking, scenario recognition) a formalism for expressing all high level concepts (persons, events, scenarios) based on constraints a formalism for describing semantics and 3D geometry of the observed empty scene a reusable real time platform for video understanding Video Understanding: Main achievements

08/07/2003 Inria/STMicroelectronics 5 Inferring symbolic concepts from numeric percepts uncertainty management of noisy data formalization of the expertise Independence of the models/methods versus: sensors and low level preprocessing dedicated applications several spatio-temporal scales Video Understanding: Main issues

08/07/2003 Inria/STMicroelectronics 6 Strong impact in video understanding for visual surveillance (metro station, bank agencies) 4 European projects (PASSWORDS, AVS-PV, AVS-RTPW, ADVISOR) 2 contracts with French companies (metro, bank) 1 transfer activity with Bull (Dyade action Telescope) Video understanding for video communication (Mediaspace) Video Understanding: Applications

08/07/2003 Inria/STMicroelectronics 7 Objective: to automate the recognition of specific human behaviors from video sequences ---> Interpretation of the videos from pixels to alarms Video understanding for metro surveillance « Blocking an exit » «Fighting » « Fraud » « Overcrowding» ALARM Interface for alarm management Behavior recognition People detection and tracking Cameras

08/07/2003 Inria/STMicroelectronics 8 Video Understanding for metro surveillance Context: European project ADVISOR: Annotated Digital Video for Intelligent Surveillance and Optimised Retrieval Intelligent system of video surveillance in metros Problem : 1000 cameras but few human operators Automatic selection in real time of the cameras viewing abnormal behaviours Automatic annotation of recognised behaviors in a video data base using XML

08/07/2003 Inria/STMicroelectronics 9 Scene Models (3D) - Scene objects - zones - calibration matrices - Scene objects - zones - calibration matrices Alarms Multi-cameras Combination Behaviors Recognition - States - Events - Scenarios Individual Tracking Group Tracking Crowd Tracking - Motion Detector - F2F Tracker - Motion Detector - F2F Tracker - Motion Detector - F2F Tracker Mobile objects Annotation Scenario Models Video Understanding Platform

08/07/2003 Inria/STMicroelectronics 10 Definition : a priori knowledge of the observed empty scene Cameras: 3D position of the sensor, calibration matrix field of view,... 3D Geometry of physical objects (bench, trash, door, walls) and interesting zones (entrance zone) with position, shape and volume n Semantic information : type (object, zone), characteristics (yellow, fragile) and its function (seat) Role: to keep the interpretation independent from the sensors and the sites : many sensors, one 3D referential to provide additional knowledge for behavior recognition Video Understanding: 3D Scene Model

08/07/2003 Inria/STMicroelectronics 11 Barcelona Metro Station Sagrada Famiglia mezzanine (cameras C10, C11 and C12) Video Understanding: Scene Model

08/07/2003 Inria/STMicroelectronics 12 States, Events and Scenarios : State: a spatio-temporal property involving one or several actors on a time interval Ex : « close», « walking», « seated» Event: a significant change of states Ex : « enters», « stands up», « leaves » Scenario: a long term symbolic application dependent activity Ex : « fighting», « vandalism» Video Understanding

08/07/2003 Inria/STMicroelectronics 13 Several types of States : posture  {lying, crouching, standing} direction  {towards the right, towards the left, leaving, arriving} speed  {stopped,walking, running} distance/object  {close, far} distance/person  {close, far} posture/object  {seated, any} Several types of Events : 1 person : falls down, crouches down, stands up, goes right side, goes left side, goes away, arrives, stops, starts running 1 person & 1 zone : leaves, enters 1 person & 1 equipment : moves close to,sits on, moves away from 2 persons : moves close to, moves away from Video Understanding

08/07/2003 Inria/STMicroelectronics 14 We use several formalisms to recognise states, events and scenarios : specific routines classification finite state automaton propagation of temporal constraints Scenario Recognition

Exit zone Mobile objects Detection Group Tracking Recognition of the behaviour « A Group of people blocks an Exit» Blocking The operator of the scenario “A Group of people blocks an Exit” is based on a Finite state automaton Scenario Recognition : Automaton INIT Grp x is tracked Grp x is inside a ZOI Grp X is stopped in the ZOI > 30 sec Enter_ZOI Exit_ZOI « Blocking » Stops Start_walking Start_running 15 26/02/2003 IDSS

08/07/2003 Inria/STMicroelectronics 16 A temporal scenario is constituted by three parts : Characters : people, physical objects and interesting zones relative to the scenario. Constraints : a set of constraints on the characteristics of the actors and on the states, events and sub-scenarios involving the actors. Production : generation of a scenario instance which can be part of more complex scenarios. Use of symbolic, logical, spatial and temporal constraints including Allen’s interval algebra operators. Scenario Recognition : Temporal constraints

08/07/2003 Inria/STMicroelectronics 17 Vandalism scenario description : Scenario(vandalism_against_ticket_machine, Characters((p : Person), (eq : Equipment, Name = “Ticket_Machine”) ) Constraints((exist ((event s1: p move_close_to eq) (state s2: p stay_at eq) (event s3: p move_away_from eq) (event s4: p move_close_to eq) (state s5: p stay_at eq) ) ((s1 != s4) (s2 != s5) (s1 before s2) (s2 before s3) (s3 before s4) (s4 before s5) ) ) ) Production( (sc : Scenario) ( (Name of sc := "vandalism_against_ticket_machine") (StartTime of sc := StartTime of s1) (EndTime of sc := EndTime of s5) ) ) ) Scenario Recognition : Temporal constraints

08/07/2003 Inria/STMicroelectronics 18 Results Vandalism in metro (Nuremberg)

Examples : Brussels and Barcelona Metros Exit zone Jumping over barrier Blocking Overcrowding Fighting Group behavior Crowd behavior Individual behavior Group behavior Results 19 26/02/2003 IDSS

08/07/2003 Inria/STMicroelectronics 20 Recognition of five behaviors: “Blocking”, “Fighting”, “Jumping over barrier”, “Vandalism” and “Overcrowding” Tested on 50 metro sequences (10 hours) True positive per sequence: 70% (“Fighting”) to 95% (“Blocking”) False positive per sequence: 5% (“Fighting”, “Jumping over barrier”) to 0% (others) Results

08/07/2003 Inria/STMicroelectronics 21 Suivi multi-cameras Video Understanding for bank surveillance

08/07/2003 Inria/STMicroelectronics 22 Conclusion Hypotheses: fixed cameras 3D model of the empty scene predefined behavior models Results: Behavior understanding for Individuals, Groups of people or Crowd an operational language for video understanding (live evaluation at Barcelona Metro) a real-time platform (5 to 25 frames/s)

08/07/2003 Inria/STMicroelectronics 23 Future work: Learning techniques to compute optimal set of parameters and to dynamically configure the platform New application domains : Domotic, Health, Teaching,… Recognition of scenarios based on the recognition of postures Conclusion