Zero shot learning Presented by: YuYing Chou

Slides:



Advertisements
Similar presentations
Face Recognition: A Convolutional Neural Network Approach
Advertisements

Tiled Convolutional Neural Networks TICA Speedup Results on the CIFAR-10 dataset Motivation Pretraining with Topographic ICA References [1] Y. LeCun, L.
PANDA: Pose Aligned Networks for Deep Attribute Modeling Ning Zhang1;2, Manohar Paluri1, Marc’Aurelio Ranzato1, Trevor Darrell2, Lubomir Bourdev1 1: Facebook.
A General Distributed Deep Learning Platform
Video Tracking Using Learned Hierarchical Features
Yang, Luyu.  Postal service for sorting mails by the postal code written on the envelop  Bank system for processing checks by reading the amount of.
Semantic Embedding Space for Zero ­ Shot Action Recognition Xun XuTimothy HospedalesShaogang GongAuthors: Computer Vision Group Queen Mary University of.
Students: Meera & Si Mentor: Afshin Dehghan WEEK 4: DEEP TRACKING.
ZEBRAS BY MARLENA WRABLEY. Young zebras are next to their mothers for 1 or 2 years. Female zebras pair when are 2 years old. Zebras are pregnant for 1.
Learning Hierarchical Features for Scene Labeling
Wildlife Census via LSH-based animal tracking APOORV PATWARDHAN 1.
Facial Smile Detection Based on Deep Learning Features Authors: Kaihao Zhang, Yongzhen Huang, Hong Wu and Liang Wang Center for Research on Intelligent.
Feature selection using Deep Neural Networks March 18, 2016 CSI 991 Kevin Ham.
Face Recognition based on 2D-PCA and CNN
Generative Adversarial Nets ML Reading Group Xiao Lin Jul
Tofik AliPartha Pratim Roy Department of Computer Science and Engineering Indian Institute of Technology Roorkee CVIP-WM 2017 Paper ID 172 Word Spotting.
CNN architectures Mostly linear structure
Big data classification using neural network
LSUN Semantic Segmentation Extended PSPNet
Convolutional Neural Network
Automatic Lung Cancer Diagnosis from CT Scans (Week 2)
Convolutional Neural Fabrics by Shreyas Saxena, Jakob Verbeek
Perceptual Loss Deep Feature Interpolation for Image Content Changes
References [1] - Y. LeCun, L. Bottou, Y. Bengio and P. Haffner, Gradient-Based Learning Applied to Document Recognition, Proceedings of the IEEE, 86(11): ,
Adversaries.
Combining CNN with RNN for scene labeling (segmentation)
CNN Demo LIU Pengpeng.
Poisoning Attacks with Back-Gradient Optimization
Structured Predictions with Deep Learning
Mean Euclidean Distance Error (mm)
Deepak Kumar1, Chetan Kumar1, Ming Shao2
Dipartimento di Ingegneria «Enzo Ferrari»
State-of-the-art face recognition systems
A Convolutional Neural Network Cascade For Face Detection
Bird-species Recognition Using Convolutional Neural Network
Distributed Representation of Words, Sentences and Paragraphs
CNNs and compressive sensing Theoretical analysis
Outline Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, “Gradient-based learning applied to document recognition,” Proceedings of the IEEE, vol. 86, no.
NormFace:
Jia-Bin Huang Virginia Tech ECE 6554 Advanced Computer Vision
Towards Understanding the Invertibility of Convolutional Neural Networks Anna C. Gilbert1, Yi Zhang1, Kibok Lee1, Yuting Zhang1, Honglak Lee1,2 1University.
ECE 599/692 – Deep Learning Lecture 1 - Introduction
Progressive Cross-media Correlation Learning
Object Detection + Deep Learning
Pattern Recognition & Machine Learning
Outline Background Motivation Proposed Model Experimental Results
Forward and Backward Max Pooling
Attack and defense on learning-based security system
Zhedong Zheng, Liang Zheng and Yi Yang
Zeroshot Learning Mun Jonghwan.
Neural Network Pipeline CONTACT & ACKNOWLEDGEMENTS
Adversarial Learning for Security System
Face Recognition: A Convolutional Neural Network Approach
Course Recap and What’s Next?
Meta Learning (Part 2): Gradient Descent as LSTM
Automatic Handwriting Generation
Human-object interaction
Keshav Balasubramanian
Image Processing and Multi-domain Translation
Multi-Modal Multi-Scale Deep Learning for Large-Scale Image Annotation
Visual Grounding 专题报告 Lejian Ren 4.23.
End-to-End Facial Alignment and Recognition
Deep learning: Recurrent Neural Networks CV192
Fig. 2. Examples showing the ability of deep learning to generate realistic fake images. (a) Representative test images from the trained network for generating.
Text-to-speech (TTS) Traditional approaches (before 2016) Neural TTS
Bidirectional LSTM-CRF Models for Sequence Tagging
SDSEN: Self-Refining Deep Symmetry Enhanced Network
A-CCNN: ADAPTIVE CCNN FOR DENSITY ESTIMATION AND CROWD COUNTING
CVPR 2019 Poster.
Presentation transcript:

Zero shot learning Presented by: YuYing Chou email: d07922014@csie.ntu.edu.tw phone: 0928372603 Advisor: Tyng-Luh Liu, Hsuan-Tien Lin

What is zero shot learning Teach computer to recognize something they have not seen. How to make it become possible? https://applealmond.com/posts/28378

How human start to recognize something Imagine Guess with hint

what does alien look like

The zero shot task is similar … Baby tries to learn something new in the world. 我們要教machine 如何辨認斑馬

Give the machine hints C. H. Lampert, H. Nickisch, and S. Harmeling. "Learning To Detect Unseen Object Classes by Between-Class Attribute Transfer". In CVPR, 2009

Give the machine a hint The attributes are the hints given to the computer. C. H. Lampert, H. Nickisch, and S. Harmeling. "Learning To Detect Unseen Object Classes by Between-Class Attribute Transfer". In CVPR, 2009

Zebra wiki Zebras (/ˈziːbrə/ ZEE-brə, UK also /ˈzɛbrə/ ZEB-rə)[1] are several species of African equids (horse family) united by their distinctive black and white striped coats. Their stripes come in different patterns, unique to each individual. They are generally social animals that live in small harems to large herds. Unlike their closest relatives, horses and donkeys, zebras have never been truly domesticated. There are three species of zebras: the plains zebra, the mountain zebra and the Grévy's zebra. The plains zebra and the mountain zebra belong to the subgenus Hippotigris, but Grévy's zebra is the sole species of subgenus Dolichohippus. The latter resembles an ass, to which zebras are closely related, while the former two look more horse-like. All three belong to the genus Equus, along with other living equids. Frome, Andrea, Greg S. Corrado, Jon Shlens, Samy Bengio, Jeff Dean, and Tomas Mikolov. "Devise: A deep visual-semantic embedding model." In Advances in neural information processing systems, pp. 2121-2129. 2013.

Old models

From https://becominghuman.ai/back-propagation-in-convolutional-neural-networks-intuition-and-code-714ef1c38199 https://en.wikipedia.org/wiki/Generalised_logistic_function

The drawbacks of old models Zerba is similar to horses and has stripes Bias like horse spot

transductive learning Unseen data Unseen data Unsupervised learning

Fu, Yanwei, Timothy M. Hospedales, Tao Xiang, and Shaogang Gong Fu, Yanwei, Timothy M. Hospedales, Tao Xiang, and Shaogang Gong. "Transductive multi-view zero-shot learning." IEEE transactions on pattern analysis and machine intelligence 37, no. 11 (2015): 2332-2345.

Generative model to generate fake unseen image decoder

Generate Unseen image decoder

Use unseen image to train model (MLP: Multi-Level Perceptron)

𝐿𝑜𝑠𝑠= 𝑖=1 𝑁 [ 𝑓 𝑖 𝑎 − 𝑓 𝑖 𝑝 2 2 − 𝑓 𝑖 𝑎 − 𝑓 𝑖 𝑛 2 2 ] 𝑓 𝑖 𝑎 : a sample of image features ( 𝑓 𝑖 𝑎 , 𝑓 𝑖 𝑝 ) : a positive pair ( 𝑓 𝑖 𝑎 , 𝑓 𝑖 𝑛 ) : a negative pair Triplet pair 𝐿𝑜𝑠𝑠= 𝑖=1 𝑁 [ 𝑓 𝑖 𝑎 − 𝑓 𝑖 𝑝 2 2 − 𝑓 𝑖 𝑎 − 𝑓 𝑖 𝑛 2 2 ] 𝑓 𝑖 𝑎 𝑓 𝑖 𝑛 𝑓 𝑖 𝑝 𝑓 𝑖 𝑎 𝑓 𝑖 𝑛 𝑓 𝑖 𝑝 After learning

*method 1. CVAE (Conditional Variational Autoencoder) 2. MLP 3. Triplet loss

*steps 1. Generate seen classes features with CNN 2. Train CVAE with seen classes image features 3. Generate unseen classes image from CVAE 4. Train MLP with seen classes features and generated unseen classes image features

Caltech-UCSD Birds-200-2011

[2]Annadani, Yashas and Biswas, Soma Preserving Semantic Relations for Zero-Shot Learning ,arXiv preprint arXiv:1803.03049, 2018 [3] Chen, Long, Hanwang Zhang, Jun Xiao, Wei Liu, and Shih-Fu Chang. ”Zero-Shot Visual Recognition using Semantics-Preserving Adversarial Embedding Network.” arXiv preprint arXiv:1712.01928 (2017). [4] Arora, Gundeep, Vinay Kumar Verma, Ashish Mishra, and Piyush Rai. ”Generalized Zero-Shot Learning via Synthesized Examples.” arXiv preprint arXiv:1712.03878 (2017). [6] Song, Jie, Chengchao Shen, Yezhou Yang, Yang Liu, and Mingli Song. ”Transductive Unbiased Embedding for Zero-Shot Learning.” arXiv preprint arXiv:1803.11320 (2018).

*result The generative model very likely improves the ZSL task, but it will sacrifice the accuracy of training (seen) classes.