Miguel P. Eckstein, Kathryn Koehler, Lauren E. Welbourne, Emre Akbas 

Slides:



Advertisements
Similar presentations
From: Early interference of context congruence on object processing in rapid visual categorization of natural scenes Journal of Vision. 2008;8(13):11.
Advertisements

Backward Masking and Unmasking Across Saccadic Eye Movements
A Source for Feature-Based Attention in the Prefrontal Cortex
Visual Development: Learning Not to See
Ji Dai, Daniel I. Brooks, David L. Sheinberg  Current Biology 
Avi J.H. Chanales, Ashima Oza, Serra E. Favila, Brice A. Kuhl 
Volume 23, Issue 18, Pages (September 2013)
Kinesthetic information disambiguates visual motion signals
Volume 26, Issue 18, Pages (September 2016)
Clayton P. Mosher, Prisca E. Zimmerman, Katalin M. Gothard 
Volume 27, Issue 20, Pages e3 (October 2017)
Theta-Coupled Periodic Replay in Working Memory
Cerebellar rTMS disrupts predictive language processing
Bennett Drew Ferris, Jonathan Green, Gaby Maimon  Current Biology 
Miguel P. Eckstein, Kathryn Koehler, Lauren E. Welbourne, Emre Akbas 
Differential Impact of Behavioral Relevance on Quantity Coding in Primate Frontal and Parietal Neurons  Pooja Viswanathan, Andreas Nieder  Current Biology 
Face Pareidolia in the Rhesus Monkey
A Role for the Superior Colliculus in Decision Criteria
Volume 27, Issue 19, Pages e2 (October 2017)
Inactivation of Medial Frontal Cortex Changes Risk Preference
High Resilience of Seed Dispersal Webs Highlighted by the Experimental Removal of the Dominant Disperser  Sérgio Timóteo, Jaime Albino Ramos, Ian Phillip.
Volume 25, Issue 14, Pages (July 2015)
Adaptive Training Diminishes Distractibility in Aging across Species
Cultural Confusions Show that Facial Expressions Are Not Universal
Children, but Not Chimpanzees, Prefer to Collaborate
Chimeric Synergy in Natural Social Groups of a Cooperative Microbe
Consequences of the Oculomotor Cycle for the Dynamics of Perception
Volume 27, Issue 23, Pages e3 (December 2017)
Avi J.H. Chanales, Ashima Oza, Serra E. Favila, Brice A. Kuhl 
The Occipital Place Area Is Causally Involved in Representing Environmental Boundaries during Navigation  Joshua B. Julian, Jack Ryan, Roy H. Hamilton,
BOLD fMRI Correlation Reflects Frequency-Specific Neuronal Correlation
Working memory without consciousness
Ryota Kanai, Tom Feilden, Colin Firth, Geraint Rees  Current Biology 
Mosquitoes Use Vision to Associate Odor Plumes with Thermal Targets
Serial, Covert Shifts of Attention during Visual Search Are Reflected by the Frontal Eye Fields and Correlated with Population Oscillations  Timothy J.
Consequences of the Oculomotor Cycle for the Dynamics of Perception
Visual Sensitivity Can Scale with Illusory Size Changes
Jeremy M. Wolfe, Michael J. Van Wert  Current Biology 
Direct Two-Dimensional Access to the Spatial Location of Covert Attention in Macaque Prefrontal Cortex  Elaine Astrand, Claire Wardak, Pierre Baraduc,
Daniel Hanus, Josep Call  Current Biology 
Caudate Microstimulation Increases Value of Specific Choices
Visual Development: Learning Not to See
Robust Selectivity to Two-Object Images in Human Visual Cortex
Volume 72, Issue 6, Pages (December 2011)
Dissociable Effects of Salience on Attention and Goal-Directed Action
Volume 27, Issue 3, Pages (February 2017)
Volume 27, Issue 3, Pages (February 2017)
Arielle Tambini, Nicholas Ketz, Lila Davachi  Neuron 
Attention Samples Stimuli Rhythmically
Encoding of Stimulus Probability in Macaque Inferior Temporal Cortex
Akinori Mitani, Mingyuan Dong, Takaki Komiyama  Current Biology 
Category Selectivity in the Ventral Visual Pathway Confers Robustness to Clutter and Diverted Attention  Leila Reddy, Nancy Kanwisher  Current Biology 
Volume 16, Issue 20, Pages (October 2006)
Ian C. Fiebelkorn, Yuri B. Saalmann, Sabine Kastner  Current Biology 
Daniela Vallentin, Andreas Nieder  Current Biology 
Sung Jun Joo, Geoffrey M. Boynton, Scott O. Murray  Current Biology 
Sound Facilitates Visual Learning
Volume 21, Issue 7, Pages (April 2011)
Color Constancy for an Unseen Surface
Impaired Associative Learning with Food Rewards in Obese Women
Attention-Dependent Representation of a Size Illusion in Human V1
Texture density adaptation and visual number revisited
Li Zhaoping, Nathalie Guyader  Current Biology 
Will Travel for Food: Spatial Discounting in Two New World Monkeys
Visual Crowding at a Distance during Predictive Remapping
Visual Crowding Is Correlated with Awareness
The Sound of Actions in Apraxia
Spatiotemporal Neural Pattern Similarity Supports Episodic Memory
Head-Eye Coordination at a Microscopic Scale
Prefrontal Control of Visual Distraction
Presentation transcript:

Humans, but Not Deep Neural Networks, Often Miss Giant Targets in Scenes  Miguel P. Eckstein, Kathryn Koehler, Lauren E. Welbourne, Emre Akbas  Current Biology  Volume 27, Issue 18, Pages 2827-2832.e3 (September 2017) DOI: 10.1016/j.cub.2017.07.068 Copyright © 2017 Elsevier Ltd Terms and Conditions

Figure 1 Visual Search for Objects in Scenes (A) Sample image of a bathroom (28.2 × 22.6 degree visual angle) and toothbrush at a likely location (on the sink). (B) Toothbrush at an unexpected location (on the floor rug). (C) Mis-scaled toothbrush with a size inconsistent with the scene but placed at the expected location (sink). (D) Entire image is rescaled so that the toothbrush is consistent in size with scene but has the same physical size as in the mis-scaled toothbrush image in (C). (E) Timeline of a single experimental trial. Current Biology 2017 27, 2827-2832.e3DOI: (10.1016/j.cub.2017.07.068) Copyright © 2017 Elsevier Ltd Terms and Conditions

Figure 2 Target Detection by Humans and Deep Neural Networks (A) Hit rates for images with target objects consistent (normal) or inconsistent (mis-scaled) in size with the scene, and the control condition. ∗∗∗p < 0.001. The correct rejection rate for target absent images is also shown, for reference. (B) Hit rate for normal and mis-scaled target conditions across blocks of seven trials. (C) Distance in degrees of closest fixation to the target for missed target trials in normal and mis-scaled target conditions. n.s., not statistically significant. (D) Proportion of trials in which the target was missed even if it was foveated for the normal and mis-scaled conditions. ∗∗∗p < 0.001. (E) Hit rate for humans detecting targets consistent (normal) and inconsistent (mis-scaled) in spatial scale with the scenes for a subset of 12 images, on which the deep neural networks were tested. Also plotted are the target probabilities associated with this subset of images (as well as 31 additional scenes) for normal and mis-scaled targets, shown for three different deep neural networks: R-FCN [21], YOLO [22], and Faster R-CNN [23]. All error bars are SEM. Current Biology 2017 27, 2827-2832.e3DOI: (10.1016/j.cub.2017.07.068) Copyright © 2017 Elsevier Ltd Terms and Conditions

Figure 3 Deep Neural Network False Positives that Are Inconsistent in Size with the Scene (A) Top row: sample images where the target is absent but other objects in the image produce high target probabilities for Faster R-CNN; these objects would be inconsistent in size (mis-scaled) with the scene if they were the target. Bottom row: comparison scenes where Faster R-CNN correctly identifies target objects, resulting in probabilities that are comparable or lower than the mis-scaled object false positives shown in the top row. The target object names are listed above the images and apply to both rows. Above each image, the target object probabilities are shown in green, and the false alarm or hit rates from humans are shown in orange. Image credits: Ronnie Macdonald (https://www.flickr.com/photos/ronmacphotos/), Brian Dryden, Aimee Wanner/TTN, dhgate.com, atlantacloset.com, Monkey Business Images (dreamstime.com), and Fotoamator (iStockPhoto.com). (B) Hit rate and false alarm rate for humans (orange) and associated target probabilities for Faster R-CNN (green) for all of the real-world images that were tested (including the samples shown in A). Error bars are SEM. Humans can discount distractors that elicit large probabilities in Faster R-CNN because they are inconsistent with the expected size of the searched target relative to the scene. Current Biology 2017 27, 2827-2832.e3DOI: (10.1016/j.cub.2017.07.068) Copyright © 2017 Elsevier Ltd Terms and Conditions