CS 664, Session 19 1 General architecture. CS 664, Session 19 2 Minimal Subscene Working definition: The smallest set of objects, actors and actions in.

Slides:



Advertisements
Similar presentations
ATTENTION: SELECTIVITY AND RESOURCE ALLOCATION SELECTIVITY –what events “capture” attention? –how complete is selectivity? AROUSAL AND ALERTNESS –does.
Advertisements

Detail to attention: Exploiting Visual Tasks for Selective Rendering Kirsten Cater 1, Alan Chalmers 1 and Greg Ward 2 1 University of Bristol, UK 2 Anyhere.
My Group’s Current Research on Image Understanding.
黃文中 Preview 2 3 The Saliency Map is a topographically arranged map that represents visual saliency of a corresponding visual scene. 4.
Last week... why object recognition is difficult, the template model the feature recognition model, word recognition as a case study Today... Recognition.
Object Recognition Using Attention Tammy Avraham and Assoc.Prof Michael Lindenbaum Computer Science Dept., Technion Haifa, Israel Projects for implementing.
Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.
CS 561, Sessions 27 1 Towards intelligent machines Thanks to CSCI561, we now know how to… - Search (and play games) - Build a knowledge base using FOL.
Visual Attention More information in visual field than we can process at a given moment Solutions Shifts of Visual Attention related to eye movements Some.
Intelligence without Reason
1 The Kadir Operator Saliency, Scale and Image Description Timor Kadir and Michael Brady University of Oxford.
Laurent Itti: CS599 – Computational Architectures in Biological Vision, USC. Lecture 12: Visual Attention 1 Computational Architectures in Biological Vision,
Visual Information Retrieval Chapter 1 Introduction Alberto Del Bimbo Dipartimento di Sistemi e Informatica Universita di Firenze Firenze, Italy.
ICCS-NTUA Contributions to E-teams of MUSCLE WP6 and WP10 Prof. Petros Maragos National Technical University of Athens School of Electrical and Computer.
A Self-Organizing Approach to Background Subtraction for Visual Surveillance Applications Lucia Maddalena and Alfredo Petrosino, Senior Member, IEEE.
CS 664, Session 20 1 Minimal Subscene Working definition: The smallest set of objects, actors and actions in a dynamic visual scene that are relevant to.
Michigan State University 1 “Saliency-Based Visual Attention” “Computational Modeling of Visual Attention”, Itti, Koch, (Nature Reviews – Neuroscience.
From attention to goal-oriented scene understanding Laurent Itti – University of Southern California.
Michael Arbib & Laurent Itti: CS664 – USC, spring Lecture 6: Object Recognition 1 CS664, USC, Spring 2002 Lecture 6. Object Recognition Reading Assignments:
Laurent Itti: CS564 – Brain theory and artificial intelligence. Scene Perception 1 Brain theory and artificial intelligence Lecture 23. Scene Perception.
December 2, 2014Computer Vision Lecture 21: Image Understanding 1 Today’s topic is.. Image Understanding.
A structured learning framework for content- based image indexing and visual Query (Joo-Hwee, Jesse S. Jin) Presentation By: Salman Ahmad (270279)
Major Functional Areas Primary motor: voluntary movement Primary somatosensory: tactile, pain, pressure, position, temp., mvt. Motor association: coordination.
Michael Arbib & Laurent Itti: CS664 – Spring Lecture 5: Visual Attention (bottom-up) 1 CS 664, USC Spring 2002 Lecture 5. Visual Attention (bottom-up)
Laurent Itti: CS599 – Computational Architectures in Biological Vision, USC. Lecture 13: Scene Perception 1 Computational Architectures in Biological Vision,
Overview of upcoming lectures 1/15: General introduction to vision, brain and the “minimal subscene” 1/17: Overview and discussion of projects 1/22: Visual.
Intelligent Agents: an Overview. 2 Definitions Rational behavior: to achieve a goal minimizing the cost and maximizing the satisfaction. Rational agent:
Laurent Itti: CS599 – Computational Architectures in Biological Vision, USC Lecture 13: Scene Perception 1 Computational Architectures in Biological.
Artificial Intelligence Research Centre Program Systems Institute Russian Academy of Science Pereslavl-Zalessky Russia.
Computer Science Department, Duke UniversityPhD Defense TalkMay 4, 2005 Fast Extraction of Feature Salience Maps for Rapid Video Data Analysis Nikos P.
Human-centered design Human factors & interactive digital media design.
A Model of Saliency-Based Visual Attention for Rapid Scene Analysis Laurent Itti, Christof Koch, and Ernst Niebur IEEE PAMI, 1998.
The free-energy principle: a rough guide to the brain? K Friston Summarized by Joon Shik Kim (Thu) Computational Models of Intelligence.
Manipulating Attention in Computer Games Matthias Bernhard, Le Zhang, Michael Wimmer Institute of Computer Graphics and Algorithms Vienna University of.
AN IMPLEMENTATION OF A REGULAR EXPRESSION PARSER
Visual Attention Derek Hoiem March 14, 2007 Misc Reading Group.
Computer Vision Lab Seoul National University Keyframe-Based Real-Time Camera Tracking Young Ki BAIK Vision seminar : Mar Computer Vision Lab.
Lecture 3 - Race against Time 1 Three points for today Sensory memory (SM) contains highly transient information about the dynamic sensory array. Stabilizing.
Natural Tasking of Robots Based on Human Interaction Cues Brian Scassellati, Bryan Adams, Aaron Edsinger, Matthew Marjanovic MIT Artificial Intelligence.
C. Lawrence Zitnick Microsoft Research, Redmond Devi Parikh Virginia Tech Bringing Semantics Into Focus Using Visual.
Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,
The design of EFL multi-media materials in assisting listening By Guan Lin.
Human Activity Recognition at Mid and Near Range Ram Nevatia University of Southern California Based on work of several collaborators: F. Lv, P. Natarajan,
U SER I NTERFACE L ABORATORY Situation Awareness a state of knowledge, from the processes used to achieve that state (situation assessment) not encompass.
Laurent Itti: CS599 – Computational Architectures in Biological Vision, USC Lecture 12: Visual Attention 1 Computational Architectures in Biological.
3:01 PM Three points for today Sensory memory (SM) contains highly transient information about the dynamic sensory array. Stabilizing the contents of SM.
Spatio-temporal saliency model to predict eye movements in video free viewing Gipsa-lab, Grenoble Département Images et Signal CNRS, UMR 5216 S. Marat,
A Model of Saliency-Based Visual Attention for Rapid Scene Analysis
How conscious experience and working memory interact Bernard J. Baars and Stan Franklin Soft Computing Laboratory 김 희 택 TRENDS in Cognitive Sciences vol.
Stochastic Grammars: Overview Representation: Stochastic grammar Representation: Stochastic grammar Terminals: object interactions Terminals: object interactions.
Data Mining for Surveillance Applications Suspicious Event Detection Dr. Bhavani Thuraisingham.
Ahissar, Hochstein (1997) Nature Task difficulty and specificity of perceptual learning 1 st third 2nd third Final session Task difficulty Stimulus-to-mask.
Biologically Inspired Vision-based Indoor Localization Zhihao Li, Ming Yang
A Neurodynamical Cortical Model of Visual Attention and Invariant Object Recognition Gustavo Deco Edmund T. Rolls Vision Research, 2004.
Data Mining for Surveillance Applications Suspicious Event Detection
Convolutional Neural Network
Visual Information Retrieval
Today: Classic & AI Control Wednesday: Image Processing/Vision
الذاكرة طويلة المدى Long-Term Memory
Text Detection in Images and Video
Data Mining for Surveillance Applications Suspicious Event Detection
The Kadir Operator Saliency, Scale and Image Description Timor Kadir and Michael Brady University of Oxford.
The Kadir Operator Saliency, Scale and Image Description Timor Kadir and Michael Brady University of Oxford.
Knowledge-based event recognition from salient regions of activity
The Kadir Operator Saliency, Scale and Image Description Timor Kadir and Michael Brady University of Oxford.
Data Mining for Surveillance Applications Suspicious Event Detection
The Kadir Operator Saliency, Scale and Image Description Timor Kadir and Michael Brady University of Oxford.
Remembering and Learning
Human-object interaction
The Kadir Operator Saliency, Scale and Image Description Timor Kadir and Michael Brady University of Oxford.
Presentation transcript:

CS 664, Session 19 1 General architecture

CS 664, Session 19 2 Minimal Subscene Working definition: The smallest set of objects, actors and actions in a dynamic visual scene that are relevant to present behavior For now we will assume: Bottom-up: objects/actors/actions must be visible Top-down: relevance to present behavior explicitly specified, e.g., by specifying a question or task Knowledge base: the system may supplement explicit knowledge with long-term acquired knowledge

CS 664, Session 19 3 Motivation: Humans 1) Free examination 2) estimate material circumstances of family 3) give ages of the people 4) surmise what family has been doing before arrival of “unexpected visitor” 5) remember clothes worn by the people 6) remember position of people and objects 7) estimate how long the “unexpected visitor” has been away from family Yarbus, 1967

CS 664, Session 19 4 “Beobot”

CS 664, Session 19 5 Visual Attention see

CS 664, Session 19 6 Object Recognition Riesenhuber & Poggio, Nat Neurosci, 1999 (MIT)

CS 664, Session 19 7 Action Recognition Oztop & Arbib, 2001

CS 664, Session 19 8 Start: -Issue question -Parse question -Extract keywords -Expand to related concepts, using ontology/KB -Fill initial “task list”

CS 664, Session 19 9 Task list Working list of currently relevant objects/actors/actions -Initially empty -Question/task specification provides initial filling-in -As the scene is scanned and objects/actors/actions are recognized, contents of task list are updated

CS 664, Session “Where:” attention, saliency map and task map Input: video stream Low-level vision: massively parallel extraction of simple visual features from video input Saliency map: localizes conspicuous (potentially interesting) objects irrespectively of why they are salient Task map: acts as spatial filter to saliency map; only locations in the current minimal subscene can easily pass through. Other locations need to be exceptionally salient to pass through.

CS 664, Session “What” memory Relates concepts to visual properties Bridge between visual and semantic knowledge

CS 664, Session General architecture

CS 664, Session Examples / experiments Examine video clips For each scene, please write down: Most salient object Most salient action Minimal subscene Who is doing what to whom

CS 664, Session Scene 001

CS 664, Session Scene 001 – Attentional Trajectory

CS 664, Session Scene 002

CS 664, Session Scene 002 – Attentional Trajectory

CS 664, Session Scene 003

CS 664, Session Scene 003 – Attentional Trajectory

CS 664, Session Scene 004

CS 664, Session Scene 004 – Attentional Trajectory

CS 664, Session Scene 005

CS 664, Session Scene 005 – Attentional Trajectory

CS 664, Session Scene 006

CS 664, Session Scene 006 – Attentional Trajectory

CS 664, Session Scene 007

CS 664, Session Scene 007 – Attentional Trajectory

CS 664, Session Scene 008

CS 664, Session Scene 008 – Attentional Trajectory

CS 664, Session Scene 009

CS 664, Session Scene 009 – Attentional Trajectory

CS 664, Session Scene 010

CS 664, Session Scene 010 – Attentional Trajectory

CS 664, Session Scene 011

CS 664, Session Scene 011 – Attentional Trajectory

CS 664, Session Scene 012

CS 664, Session Scene 012 – Attentional Trajectory

CS 664, Session Scene 013

CS 664, Session Scene 013 – Attentional Trajectory

CS 664, Session Scene 014

CS 664, Session Scene 014 – Attentional Trajectory

CS 664, Session Scene 015

CS 664, Session Scene 015 – Attentional Trajectory

CS 664, Session Scene 016

CS 664, Session Scene 016 – Attentional Trajectory

CS 664, Session Scene 017

CS 664, Session Scene 017 – Attentional Trajectory