Presentation is loading. Please wait.

Presentation is loading. Please wait.

Above and below the object level

Similar presentations


Presentation on theme: "Above and below the object level"— Presentation transcript:

1 Above and below the object level

2

3 Object level: Single object recognition (ImageNet)

4 Below the object level: recognizing local configurations
The human visual system makes highly effective use of limited information: it can recognize not only objects, but severely reduced sub-configurations in terms of size or resolution. ‘Configurations’ rather than well-defined parts.

5 Minimizing variability
Motivation: interested because they are useful for the interpretation of complex scenes Will see more, but the basic reason is that the reduce variability Generalization was much better with the reduced images Useful for the interpretation of complex scenes

6 Searching for Minimal Images
Eg 40 to 35 Over 15,000 Mechanical Turk subjects ‘Atoms of Recognition’ PNAS 2016

7 Pairs Parent – MIRC, Child – ‘sub-MIRC’

8 0.79 0.0

9 0.88 0.14

10

11

12 Average 16 per class After removing highly overlapping ones Jaccard > 0.5

13 Cover Average 16.9 / class Highly redundant
Each MIRC is non-redundant: each feature is important Highly redundant, each MIRC is non-redundant Number of visual elements is the same at different scales

14 Testing computational models

15

16 DNN do not reach human level recognition on minimal images
Recognition of minimal images does not emerge by training any of the existing models tested. The large gap at the minimal level is not reproduced The accuracy of recognition is lower than human’s Representations used by existing models do not capture differences that human recognition is sensitive to

17 Minimal Images: Internal Interpretation
Humans can interpret detailed sub-structures within the ‘atomic’ MIRC. Cannot be done by current feed-forward models Another basic aspect that people can do and models cannot:

18 Internal Interpretation
Another limitation of current models / A related task that people can do Also tells us about features and representations that humans use Examples of internal interpretations, produced automatically by a model, validated by Turk These structures do not appear in the same way in the false detections. False detection: using Felzenszwalb Internal interpretations, perceived by humans Cannot be produced by existing feed-forward models

19 Current recognition models are feed-forward only
This is likely to limit their ability to providing interpretation of find details A model that can produce interpretation of MIRCs The model uses a top-down stage (back to V1) Ben-Yossef et al, CogSci 2015 A model for full local image interpretation

20 Above the object level

21 Image Captioning

22 Automatic caption: A brown horse standing next to a building

23 Automatic caption: a man is standing in front woman in white shirt.

24 Stealing Human description: ‘Two women sitting at restaurant table while a man in a black shirt takes one of their purses off the chair, while they are not looking’

25 Components: Man, Woman, Purse, Chair
Properties: Young, dark-hair, red, Relations: Man grabs purse, Purse hanging of chair, Woman sitting on chair

26 This is what vision produces, and we identify the event as ‘stealing’
Purse Hanging-of Chair Red Grabbing Sitting Man Woman Dark Not-looking We also connect each component to the image or in some working memory This is what vision produces, and we identify the event as ‘stealing’

27 Stealing Grabbing Owner Not-looking Person A Object X Person B
Object X and 2 people: we also need to know that person B is unaware of the grabbing Abstract and cognitive (shared with non-sighted), broad generalization

28 The ‘ownership’ is added to the structure,
Purse Hanging-of Chair Red Owns Grabbing Sitting Man Woman Dark Not-looking The ‘ownership’ is added to the structure, This cognitive addition contributes to ‘stealing’

29


Download ppt "Above and below the object level"

Similar presentations


Ads by Google