Zeroshot Learning 2015.4.2 Mun Jonghwan.

Zeroshot Learning Mun Jonghwan

Zero-shot Learning Traindata : Which image shows a cat?

Zero-shot Learning Traindata : Which image shows a giraffe?

Zero-shot Learning Which image shows a giraffe? Description
has long neck? Is black? Is spot? lives in plain?

External information Attribute Word vector Hierarchy Co-occurrence
- C.H. Lampert, Attribute-based classification for zero-shot visual object classification, TPAMI13[1] D. Parikh, Relative attributes, ICCV2011[3] Z. Akata, Label embedding for attribute-based classification, CVPR13[2] Word vector - A. Frome, Devise: A deep visual-semantic embedding model, NIPS13[3] - Z. Akata, Evaluation of output embedding for fine-grained image classification, CVPR15[4] Hierarchy - Usually used as side information Co-occurrence - T. Mensink, Costa: Co-occurrence statistics for zero-shot classification, CVPR14

Direct Attribute Prediction (DAP)[1]
Learn attribute classifier from related classes Use attribute-to-class mapping for prediction Label Attribute Image 𝑝 𝑎 𝑚 = 𝑎 𝑚 𝑧 𝑥 = 𝑝 𝑎 𝑚 𝑥 𝑖𝑓 𝑎 𝑚 𝑧 =1 1−𝑝 𝑎 𝑚 𝑥 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒 𝑧 ∗ = argmax 𝑧 𝑚 𝑝( 𝑎 𝑚 𝑧 |𝑥)

Direct Attribute Prediction (DAP)[1]
1. Vocabulary of attributes and class decriptions - giraffe has properties X and Y but not Z 2. Train classifier for each attribute X, Y, Z - From visual examples of related classes 3. Make image attributes predictions 4. Combine into decision: this image is not giraffe 𝑃 𝑋 𝑖𝑚𝑔 =0.8 𝑃 𝑌 𝑖𝑚𝑔 =0.3 𝑃 𝑍 𝑖𝑚𝑔 =0.7 ⇒

Relative Attribute[2] Problem : Binary attributes are very crude
If mouse = small, then cat ≠ small If elephant = large, then cat ≠ large 𝑂 𝑚 : 𝑤 𝑚 𝑡 𝑥 𝑖 > 𝑤 𝑚 𝑡 𝑥 𝑗 𝑂 𝑚 : 𝑤 𝑚 𝑡 𝑥 𝑖 = 𝑤 𝑚 𝑡 𝑥 𝑗

Relative Attribute[2] S Clive Smiling J H Age Age: Scarlett Hugh Jared
1 2 ( 𝜇 𝐻 𝑠 + 𝜇 𝑆 𝑠 ) Clive Age: Scarlett Hugh Jared Miley Smiling: Smiling Age S J H 1 2 ( 𝜇 𝐽 𝑠 + 𝑑 𝑚 ) Infer image category using max-likelihood

Attribute Label Embedding (ALE)[2]
Embedding to attribute space Search for the class with the highest compatibility

Word Vector[3] Use CNN feature
Embedding vector is collected automatically from text corpora Embedding to word vector space

Word Vector[3] Semantically similar classes are close
country capital Semantically similar classes are close Word relationship is represented as displacement - 𝐾𝑖𝑛𝑔 – 𝑀𝑎𝑛 + 𝑤𝑜𝑚𝑒𝑛 = 𝑄𝑢𝑒𝑒𝑛

Survey result[4]

Relative information from word vector
Tiger : bobcat = strong : ? bobcat : tiger = small : ?

Relative information from word vector
Some ranking information Attribute Attribute embedding

Thank you

Zeroshot Learning 2015.4.2 Mun Jonghwan.

Similar presentations

Presentation on theme: "Zeroshot Learning 2015.4.2 Mun Jonghwan."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Zeroshot Learning 2015.4.2 Mun Jonghwan.

Similar presentations

Presentation on theme: "Zeroshot Learning 2015.4.2 Mun Jonghwan."— Presentation transcript:

Similar presentations

About project

Feedback