Zero shot learning Presented by: YuYing Chou

Zero shot learning Presented by: YuYing Chou email: d07922014@csie.ntu.edu.tw phone: 0928372603 Advisor: Tyng-Luh Liu, Hsuan-Tien Lin

What is zero-shot learning? Teaching a computer to recognize something it has never seen. How can this be made possible? https://applealmond.com/posts/28378

How humans start to recognize something: imagine, then guess with hints.

What does an alien look like?

The zero-shot task is similar: a baby tries to learn something new in the world. We want to teach the machine how to recognize a zebra.

Give the machine hints C. H. Lampert, H. Nickisch, and S. Harmeling. "Learning To Detect Unseen Object Classes by Between-Class Attribute Transfer". In CVPR, 2009

Give the machine a hint The attributes are the hints given to the computer. C. H. Lampert, H. Nickisch, and S. Harmeling. "Learning To Detect Unseen Object Classes by Between-Class Attribute Transfer". In CVPR, 2009
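To make "attributes as hints" concrete, here is a minimal sketch. It assumes a hypothetical attribute table and invented attribute scores, and it uses simple nearest-match lookup as a stand-in; it is not the probabilistic method of the cited paper. The point is only that a class with no training images (zebra) can still be named if its attribute description is known.

```python
# Hypothetical class-attribute table: (has_stripes, horse_like, black_and_white).
# The values are made up for illustration.
CLASS_ATTRIBUTES = {
    "horse": (0.0, 1.0, 0.0),
    "panda": (0.0, 0.0, 1.0),
    "zebra": (1.0, 1.0, 1.0),  # unseen class: described only by its attributes
}


def squared_distance(u, v):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def predict_class(predicted_attributes, class_attributes=CLASS_ATTRIBUTES):
    """Return the class whose attribute vector is closest to the prediction."""
    return min(
        class_attributes,
        key=lambda name: squared_distance(predicted_attributes, class_attributes[name]),
    )


# Attribute scores that an attribute predictor (trained on seen classes only)
# might output for a zebra photo; these numbers are invented.
scores = (0.9, 0.8, 0.7)
print(predict_class(scores))  # → zebra
```

The attribute predictor itself is left out here; in practice it would be trained on images of the seen classes only.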

Zebra wiki: Zebras (/ˈziːbrə/ ZEE-brə, UK also /ˈzɛbrə/ ZEB-rə) are several species of African equids (horse family) united by their distinctive black and white striped coats. Their stripes come in different patterns, unique to each individual. They are generally social animals that live in small harems to large herds. Unlike their closest relatives, horses and donkeys, zebras have never been truly domesticated. There are three species of zebras: the plains zebra, the mountain zebra and the Grévy's zebra. The plains zebra and the mountain zebra belong to the subgenus Hippotigris, but Grévy's zebra is the sole species of subgenus Dolichohippus. The latter resembles an ass, to which zebras are closely related, while the former two look more horse-like. All three belong to the genus Equus, along with other living equids. Frome, Andrea, Greg S. Corrado, Jon Shlens, Samy Bengio, Jeff Dean, and Tomas Mikolov. "Devise: A deep visual-semantic embedding model." In Advances in neural information processing systems, pp. 2121-2129. 2013.

Old models

From https://becominghuman.ai/back-propagation-in-convolutional-neural-networks-intuition-and-code-714ef1c38199 https://en.wikipedia.org/wiki/Generalised_logistic_function

The drawbacks of old models: a zebra is similar to a horse and has stripes, so the prediction is biased (e.g. toward horse or spot).

Transductive learning: unseen data, unsupervised learning

Fu, Yanwei, Timothy M. Hospedales, Tao Xiang, and Shaogang Gong. "Transductive multi-view zero-shot learning." IEEE transactions on pattern analysis and machine intelligence 37, no. 11 (2015): 2332-2345.

Generative model: a decoder generates fake unseen images

Generate unseen images with the decoder

Use the generated unseen images to train the model (MLP: Multi-Layer Perceptron)

$$\mathrm{Loss} = \sum_{i=1}^{N} \left[ \lVert f_i^a - f_i^p \rVert_2^2 - \lVert f_i^a - f_i^n \rVert_2^2 \right]$$

$f_i^a$: a sample of image features; $(f_i^a, f_i^p)$: a positive pair; $(f_i^a, f_i^n)$: a negative pair.

[Figure: a triplet pair $f_i^a$, $f_i^p$, $f_i^n$, shown again after learning]
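The loss above can be checked with a few lines of plain Python. This is a sketch, not the presenter's code: the function and argument names are hypothetical, and the optional `margin` argument implements the common hinge variant, which is an assumption and does not appear on the slide.

```python
def squared_l2(u, v):
    """Squared L2 distance between two equal-length feature vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def triplet_loss(anchors, positives, negatives, margin=None):
    """Sum over triplets of ||f_a - f_p||^2 - ||f_a - f_n||^2.

    With margin=None this is the formula as written on the slide.
    With a number, each term becomes max(0, term + margin), a common
    hinge variant that is an assumption and not taken from the slides.
    """
    total = 0.0
    for a, p, n in zip(anchors, positives, negatives):
        diff = squared_l2(a, p) - squared_l2(a, n)
        total += diff if margin is None else max(0.0, diff + margin)
    return total


# Two toy triplets with 2-D features (invented numbers).
anchors = [(0.0, 0.0), (1.0, 1.0)]
positives = [(0.0, 1.0), (1.0, 2.0)]
negatives = [(2.0, 0.0), (1.0, 4.0)]

print(triplet_loss(anchors, positives, negatives))              # → -11.0
print(triplet_loss(anchors, positives, negatives, margin=1.0))  # → 0.0
```

A lower value means each anchor is closer to its positive than to its negative, which is the "after learning" arrangement.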

*method
1. CVAE (Conditional Variational Autoencoder)
2. MLP
3. Triplet loss

*steps
1. Extract seen-class image features with a CNN
2. Train the CVAE with the seen-class image features
3. Generate unseen-class image features from the CVAE
4. Train the MLP with the seen-class features and the generated unseen-class image features
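The data flow of these four steps can be sketched with toy stand-ins. Everything here is an assumption made for illustration: the attribute table is invented, the "CNN" just returns noisy copies of the class attributes, the "CVAE" is replaced by a Gaussian sampler conditioned on the attributes, and the "MLP" is replaced by a nearest-centroid classifier. It shows only how generated unseen-class features enter training, not how the real models work.

```python
import random

random.seed(0)  # fixed seed so the toy run is repeatable

# Made-up class attributes: (has_stripes, horse_like).
ATTRIBUTES = {"horse": (0.0, 1.0), "tiger": (1.0, 0.0), "zebra": (1.0, 1.0)}
SEEN, UNSEEN = ["horse", "tiger"], ["zebra"]


def extract_features(label, n):
    """Step 1 stand-in for a CNN: noisy 2-D features around the class attributes."""
    return [tuple(a + random.gauss(0, 0.1) for a in ATTRIBUTES[label]) for _ in range(n)]


def fit_generator(seen_features):
    """Step 2 stand-in for CVAE training: estimate the feature noise level."""
    residuals = [
        f - a
        for label, feats in seen_features.items()
        for feat in feats
        for f, a in zip(feat, ATTRIBUTES[label])
    ]
    return (sum(r * r for r in residuals) / len(residuals)) ** 0.5


def generate_features(label, sigma, n):
    """Step 3 stand-in for CVAE decoding: sample features conditioned on attributes."""
    return [tuple(a + random.gauss(0, sigma) for a in ATTRIBUTES[label]) for _ in range(n)]


def fit_classifier(features_by_class):
    """Step 4 stand-in for the MLP: one centroid per class."""
    return {
        label: tuple(sum(col) / len(col) for col in zip(*feats))
        for label, feats in features_by_class.items()
    }


def classify(centroids, feature):
    """Assign a feature vector to the class with the nearest centroid."""
    return min(centroids, key=lambda c: sum((x - y) ** 2 for x, y in zip(feature, centroids[c])))


seen = {label: extract_features(label, 50) for label in SEEN}                 # step 1
sigma = fit_generator(seen)                                                   # step 2
fake_unseen = {label: generate_features(label, sigma, 50) for label in UNSEEN}  # step 3
centroids = fit_classifier({**seen, **fake_unseen})                           # step 4
print(classify(centroids, (0.95, 1.05)))
```

Because the classifier is trained on real seen-class features together with generated unseen-class features, it has a decision region for zebra even though no real zebra features were used.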

Caltech-UCSD Birds-200-2011

[2] Annadani, Yashas, and Soma Biswas. "Preserving Semantic Relations for Zero-Shot Learning." arXiv preprint arXiv:1803.03049 (2018).
[3] Chen, Long, Hanwang Zhang, Jun Xiao, Wei Liu, and Shih-Fu Chang. "Zero-Shot Visual Recognition using Semantics-Preserving Adversarial Embedding Network." arXiv preprint arXiv:1712.01928 (2017).
[4] Arora, Gundeep, Vinay Kumar Verma, Ashish Mishra, and Piyush Rai. "Generalized Zero-Shot Learning via Synthesized Examples." arXiv preprint arXiv:1712.03878 (2017).
[6] Song, Jie, Chengchao Shen, Yezhou Yang, Yang Liu, and Mingli Song. "Transductive Unbiased Embedding for Zero-Shot Learning." arXiv preprint arXiv:1803.11320 (2018).

*result The generative model very likely improves accuracy on unseen classes in the ZSL task, but at the cost of accuracy on the training (seen) classes.