Towards a stochastic theory of saccadic eye movement GURU Oct 1, 2001 Jonathan Nelson.

Slides:



Advertisements
Similar presentations
Introduction to Eye Tracking
Advertisements

Interpretation of Geometric Shapes Miquel Prats Steve Garner The Design Group The Open University UK ETRA – Eye Tracking Research & Applications
Lecture 18: Temporal-Difference Learning
Observational Gait Analysis Nerrolyn Ford PhD. The observational gait analysis process Reliability/Validity What is done in practice? Visual search strategies.
Electrical & Computer Engineering Dept. University of Patras, Patras, Greece Evangelos Skodras Nikolaos Fakotakis.
Chapter 6: Visual Attention. Scanning a Scene Visual scanning – looking from place to place –Fixation –Saccadic eye movement Overt attention involves.
 INTRODUCTION  STEPS OF GESTURE RECOGNITION  TRACKING TECHNOLOGIES  SPEECH WITH GESTURE  APPLICATIONS.
Cognitive Computer Vision
Hidden Markov Models Ellen Walker Bioinformatics Hiram College, 2008.
Hidden Markov Models in NLP
The Use of Eye Tracking Technology in the Evaluation of e-Learning: A Feasibility Study Dr Peter Eachus University of Salford.
HMM-BASED PATTERN DETECTION. Outline  Markov Process  Hidden Markov Models Elements Basic Problems Evaluation Optimization Training Implementation 2-D.
I. Face Perception II. Visual Imagery. Is Face Recognition Special? Arguments have been made for both functional and neuroanatomical specialization for.
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
Bayesian Reinforcement Learning with Gaussian Processes Huanren Zhang Electrical and Computer Engineering Purdue University.
Machine Vision and Dig. Image Analysis 1 Prof. Heikki Kälviäinen C50A6100 Lectures 12: Object Recognition Professor Heikki Kälviäinen Machine Vision and.
Computer Vision for Interactive Computer Graphics Mrudang Rawal.
1 Hidden Markov Model Instructor : Saeed Shiry  CHAPTER 13 ETHEM ALPAYDIN © The MIT Press, 2004.
1 MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING By Kaan Tariman M.S. in Computer Science CSCI 8810 Course Project.
Who am I and what am I doing here? Allan Tucker A brief introduction to my research
Baysian Approaches Kun Guo, PhD Reader in Cognitive Neuroscience School of Psychology University of Lincoln Quantitative Methods 2011.
Visual scan path analysis using eye tracking data
Rapid Serial Visual Presentation (RSVP) Task (abbreviated sequence) Simulates saccadic vision Used to gauge speed of visual object recognition Thorpe et.
Cognitive Computer Vision 3R400 Kingsley Sage Room 5C16, Pevensey III
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
Computer Vision Systems for the Blind and Visually Disabled. STATS 19 SEM Talk 3. Alan Yuille. UCLA. Dept. Statistics and Psychology.
Introduction To know how perceptual and attentional processes and properties of words guide the eyes through a sentence, the following issues are particularly.
Methods in Medical Image Analysis Statistics of Pattern Recognition: Classification and Clustering Some content provided by Milos Hauskrecht, University.
Studying Visual Attention with the Visual Search Paradigm Marc Pomplun Department of Computer Science University of Massachusetts at Boston
© H. Hajimirsadeghi, School of ECE, University of Tehran Conceptual Imitation Learning Based on Functional Effects of Action Hossein Hajimirsadeghi School.
Graphical models for part of speech tagging
Movement studies 2011 Slides adapted from 2010 produced by SP University of Hertfordshire MS /12.
A Human Eye Retinal Cone Synthesizer Michael F. Deering.
Project title : Automated Detection of Sign Language Patterns Faculty: Sudeep Sarkar, Barbara Loeding, Students: Sunita Nayak, Alan Yang Department of.
K. J. O’Hara AMRS: Behavior Recognition and Opponent Modeling Oct Behavior Recognition and Opponent Modeling in Autonomous Multi-Robot Systems.
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
Hierarchical Temporal Memory as a Means for Image Recognition by Wesley Bruning CHEM/CSE 597D Final Project Presentation December 10, 2008.
Encoding Specificity Memory is improved when information available at encoding is also available at retrieval.
A Viable Implementation of a Comparison Algorithm for Regions of Interest John P. Heminghous Computer Science Clemson University
ALIP: Automatic Linguistic Indexing of Pictures Jia Li The Pennsylvania State University.
Studying Memory Encoding with fMRI Event-related vs. Blocked Designs Aneta Kielar.
Eye-Based Interaction in Graphical Systems: Theory & Practice Part III Potential Gaze-Contingent Applications.
Hidden Markov Models in Keystroke Dynamics Md Liakat Ali, John V. Monaco, and Charles C. Tappert Seidenberg School of CSIS, Pace University, White Plains,
22CS 338: Graphical User Interfaces. Dario Salvucci, Drexel University. Lecture 10: Advanced Input.
Analysis of Eye Movement Data Frank M. Marchak, Ph.D. Veridical Research and Design Corporation
1 Pattern Recognition Pattern recognition is: 1. A research area in which patterns in data are found, recognized, discovered, …whatever. 2. A catchall.
Analysis of Topic Dynamics in Web Search Xuehua Shen (University of Illinois) Susan Dumais (Microsoft Research) Eric Horvitz (Microsoft Research) WWW 2005.
Cognitive Modeling / University of Groningen / / Artificial Intelligence |RENSSELAER| Cognitive Science CogWorks Laboratories › Christian P. Janssen ›
Curiosity-Driven Exploration with Planning Trajectories Tyler Streeter PhD Student, Human Computer Interaction Iowa State University
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Mining Logs Files for Data-Driven System Management Advisor.
CS Statistical Machine learning Lecture 24
The geometry of the system consisting of the hyperbolic mirror and the CCD camera is shown to the right. The points on the mirror surface can be expressed.
Visual Search Deficits in Williams Buren Syndrome Montfoort, I., Frens, M.A., Lagers- Van Haselen, G.C., & van der Geest, J.N.
Inferring High-Level Behavior from Low-Level Sensors Donald J. Patterson, Lin Liao, Dieter Fox, and Henry Kautz.
From Genomics to Geology: Hidden Markov Models for Seismic Data Analysis Samuel Brown February 5, 2009.
Computer Science Readings: Reinforcement Learning Presentation by: Arif OZGELEN.
 Present by 陳群元.  Introduction  Previous work  Predicting motion patterns  Spatio-temporal transition distribution  Discerning pedestrians  Experimental.
Probabilistic Automaton Ashish Srivastava Harshil Pathak.
Eye Movements and Working Memory Marc Pomplun Department of Computer Science University of Massachusetts at Boston Homepage:
Towards an express-diagnostics for level of processing and hazard perception Boris M. Velichkovsky et al. Transportation Research Part F 5 (2002)
Hidden Markov Models. A Hidden Markov Model consists of 1.A sequence of states {X t |t  T } = {X 1, X 2,..., X T }, and 2.A sequence of observations.
The effect of inter-letter spacing on reading Yu-Chi Tai, John Hayes, James Sheedy College of Optometry Pacific University, Forest Grove, Oregon ABSTRACT.
Feedforward Eye-Tracking for Training Histological Visual Searches Andrew T. Duchowski COMPUTER SCIENCE, CLEMSON UNIVERSITY Abstract.
San Diego May 22, 2013 Giovanni Saponaro Giampiero Salvi
The Effect of Inter-letter Spacing on Reading Yu-Chi Tai, PhD James E
PETRA 2014 An Interactive Learning and Adaptation Framework for Socially Assistive Robotics: An Interactive Reinforcement Learning Approach Konstantinos.
Enhancing User identification during Reading by Applying Content-Based Text Analysis to Eye- Movement Patterns Akram Bayat Amir Hossein Bayat Marc.
MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING
MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING
Presentation transcript:

Towards a stochastic theory of saccadic eye movement GURU Oct 1, 2001 Jonathan Nelson

Thanks especially to Wendy Ark Gary Cottrel Carrie Joyce Javier Movellan Marty Sereno Ruth Williams

Outline Function of eye movement in vision Traditional analyses of eye movement Three papers using stochastic processes to model eye movement Research towards a better generative model

Why model eye movement? for cognitive science and economics: how do a person’s knowledge and goals influence their information-gathering in perception? (applications also to entertainment and interface design) for medical diagnosis: how does a physician/technician read an x-ray? Can we automate or assist the physician? for understanding and treating pathological conditions: autism, williams’ syndrome. Can training appropriate eye movement help autistics in social situations? for artificial perception: are the visual data that humans acquire also informative for active, sequential artificial vision systems?

Saccadic eye movement Eccentricity and image resolution Saccades and fixations Relationship to task, applications

Traditional analyses Large number of papers Inspired by behavioral statistics: lose dynamic character, or dichotomize/polychotomize time Yet unexplained qualitative results: “highly individual scanpaths” “high individual consistency from trial to trial”

Towards a stochastic theory Regions of interest in an image Timescale

Regions of interest:

Timescale Discrete or continuous? Every point sampled by the eye tracker? Every nth millisecond? Every fixation?

Timescale Nearby fixations grouped together Saccade points rejected

What kind of SP? Discrete state, usually with regions of interest that contain current point of gaze as state Discrete time, with points of meaningful fixations forming the time step Time-homogenous Markov chain

White et al.: mammograms How do experts read mammograms? Can experts’ patterns in reading mammograms suggest means of computer aided diagnosis?

from

White et al.’s modeling One model per subject, per image A sequential clustering algorithm to determine regions of interest Exclude transitions to same state, and fixations outside of regions of interest “Not enough data”

Joyce: face perception Goal: augment or corroborate eyewitness testimony, using eye movement EEG GSR Task: view a face recognize it as not novel, after a delay

Task: “Study this face”

Task: “Have you seen this face before?”

Joyce’s regions of interest 10 ROI: hair right eye nose etc. Normalized faces Nearest neighbor

Joyce’s model Markov States: each of 10 regions of interest Time: from entering region till leaving region is one discrete step

Joyce’s results More entropy in saccade sequences when viewing a novel face Matches qualitative findings

Eye-typing: Salvucci Goal: have users type a pre-selected word, by looking at an on-screen keyboard Models used: saccade or fixation HMM letter HMM grammar

Saccade-fixation HMM If previous time is fixation, current time is most likely fixation If low velocity, current time most likely fixation “Standard HMM parameter estimation” (Rabiner, 1989) Viterbi to optimize

Fixations and centroids

“Centroid submodel HMM” States: peaked bivariate distribution for (x, y) diffuse bivariate distribution for (x, y)

Grammar for words Which word gives the highest likelihood of the centroid data?

Results 92% accuracy with 1000 word vocabulary too computationally intensive for realistic-size (e.g. 50,000 words) vocabulary, for next few years … a nice proof of concept (previous systems required >750ms between fixations, and 4 degrees between targets) (for simpler multiple-letter models, use a string-edit distance algorithm to find the nearest vocabulary word)

Future work More temporal dependence: try 3rd+ order model Can a sophisticated string-edit distance algorithm correct for bias in limited sampling?

Future work Systematic exploration of time index Improvement of saccade-fixation HMM Distribution of fixations within regions What is the nature of individual differences? Relationship of knowledge and goals to task

Sources Cited White, KP; Hutson, TL; Hutchinson, TE (1997). Modeling human eye behavior during mammographic scanning: preliminary results. IEEE Transactions on Systems, Man and Cybernetics, A, 27, Salvucci, DD (1999). Inferring intent in eye-based interfaces: tracing eye movements with process models. Proceedings of the 1999 Computer Human Interaction Conference. Joyce, CA (2000). Saving Faces: Using Eye Movement, ERP, and SCR Measures of Face Processing and Recognition to Investigate Eyewitness Identification. Ph.D. dissertation, University of California, San Diego.