Learning Compositional Visual Concepts with Mutual Consistency

Slides:



Advertisements
Similar presentations
Image Processing David Kauchak cs160 Fall 2009 Empirical Evaluation of Dissimilarity Measures for Color and Texture Jan Puzicha, Joachim M. Buhmann, Yossi.
Advertisements

On the Relationship between Visual Attributes and Convolutional Networks Paper ID - 52.
Hierarchical Region-Based Segmentation by Ratio-Contour Jun Wang April 28, 2004 Course Project of CSCE 790.
Introduction to Image Quality Assessment
Color a* b* Brightness L* Texture Original Image Features Feature combination E D 22 Boundary Processing Textons A B C A B C 22 Region Processing.
Abstract Overall Algorithm Target Matching Error Checking: By comparing what we transform from Kinect Camera coordinate to robot coordinate with what we.
Student: Kylie Gorman Mentor: Yang Zhang COLOR-ATTRIBUTES- RELATED IMAGE RETRIEVAL.
Tinghui Zhou1, Yong Jae Lee2, Stella X. Yu1,3, Alexei A. Efros1
55:148 Digital Image Processing Chapter 11 3D Vision, Geometry Topics: Basics of projective geometry Points and hyperplanes in projective space Homography.
Speaker: Chi-Yu Hsu Advisor: Prof. Jian-Jung Ding Leveraging Stereopsis for Saliency Analysis, CVPR 2012.
1 Efficient packet classification using TCAMs Authors: Derek Pao, Yiu Keung Li and Peng Zhou Publisher: Computer Networks 2006 Present: Chen-Yu Lin Date:
Functional maps: A Flexible Representation of Maps Between Shapes SIGGRAPH 2012 Maks Ovsjanikov 1 Mirela Ben-Chen 2 Justin Solomon 2 Adrian Butscher 2.
Chapter Ten: Mixed Methods Procedures
A Statistical Approach to Speed Up Ranking/Re-Ranking Hong-Ming Chen Advisor: Professor Shih-Fu Chang.
K. Selçuk Candan, Maria Luisa Sapino Xiaolan Wang, Rosaria Rossini
Learning Color Names from Real-World Images Joost van de Weijer, Cordelia Schmid, Jakob Verbeek Lear Team, INRIA Grenoble, France
Semantic Embedding Space for Zero ­ Shot Action Recognition Xun XuTimothy HospedalesShaogang GongAuthors: Computer Vision Group Queen Mary University of.
CONCEPTS AND TECHNIQUES FOR RECORD LINKAGE, ENTITY RESOLUTION, AND DUPLICATE DETECTION BY PETER CHRISTEN PRESENTED BY JOSEPH PARK Data Matching.
CS Spring 2014 CS 414 – Multimedia Systems Design Lecture 10 – Compression Basics and JPEG Compression (Part 4) Klara Nahrstedt Spring 2014.
Deep Visual Analogy-Making
CHAPTER 4 THE VISUALIZATION PIPELINE. CONTENTS The focus is on presenting the structure of a complete visualization application, both from a conceptual.
Course14 Dynamic Vision. Biological vision can cope with changing world Moving and changing objects Change illumination Change View-point.
Bassem Makni SML 16 Click to add text 1 Deep Learning of RDF rules Semantic Machine Learning.
Conditional Generative Adversarial Networks
Automatic Lung Cancer Diagnosis from CT Scans (Week 2)
Textual Video Prediction Week 2
Using XML Transformations for Enterprise Architectures
ICCV Hierarchical Part Matching for Fine-Grained Image Classification
Unsupervised Conditional Generation
CS6890 Deep Learning Weizhen Cai
Authors: Jun-Yan Zhu*, Taesun Park*, Phillip Isola, Alexei A. Efros
Thesis Advisor : Prof C.V. Jawahar
Presenter: Hajar Emami
Adversarially Tuned Scene Generation
Enhanced-alignment Measure for Binary Foreground Map Evaluation
Efficient Image Classification on Vertically Decomposed Data
Computer Vision James Hays
Paper Presentation Aryeh Zapinsky
Multiple Instance Learning: applications to computer vision
Geometric Hashing: An Overview
Brief Review of Recognition + Context
Deep Visual-Semantic Alignments for Generating Image Descriptions
JPEG Still Image Data Compression Standard
Image to Image Translation using GANs
GAN Applications.
Papers 15/08.
Transformations Day 2: Composition.
Iterative Crowd Counting
Composition of Transformations
Zhedong Zheng, Liang Zheng and Yi Yang
CS 2770: Computer Vision Generative Adversarial Networks
Course Recap and What’s Next?
Neural Modular Networks
Warm up Rotate P(-4, -4) 180 Rotate Q(-1, -3) 90 CCW
Sequence to Sequence Video to Text
Image Processing and Multi-domain Translation
Hsiao-Yu Chiang Xiaoya Li Ganyu “Bruce” Xu Pinshuo Ye Zhanyuan Zhang
Background Task Fashion image inpainting Some conceptions
Review and Importance CS 111.
Xilai Li, Tianfu Wu, and Xi Song CVPR 2019 Presented by Dingquan Li
CVPR 2019 Tutorial on Map Synchronization
End-to-End Speech-Driven Facial Animation with Temporal GANs
GIF2Video: Color Dequantization and Temporal Interpolation of GIF images Yang Wang, Haibin Huang, Chuan Wang, Tong He, Jue Wang, Minh Hoai. Stony Brook.
Sign Language Recognition With Unsupervised Feature Learning
Visual Grounding.
CRCV REU 2019 Aaron Honculada.
SDSEN: Self-Refining Deep Symmetry Enhanced Network
CVPR 2019 Poster.
CVPR2019 Jiahe Li SiamRPN introduces the region proposal network after the Siamese network and performs joint classification and regression.
Shengcong Chen, Changxing Ding, Minfeng Liu 2018
Presentation transcript:

Learning Compositional Visual Concepts with Mutual Consistency Yunye Gong, Srikrishna Karanam, Ziyan Wu, Kuan-Chuan Peng, Jan Ernst, and Peter C. Doerschuk CVPR 2018 (spotlight) Concept GAN Presented by: Anand Bhattad (Vision Lunch, April 24 2018)

Image Translation (Pix to Pix and Cycle GAN) Pix2Pix Cycle GAN Source: Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial

Cycle GAN

Source: Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial

New Unseen Samples from Translations? Colored Bags Edge Bags Colored Shoes Edge Shoes

Overview: Cycle GAN G1 F1 F2 G2 F2 G2 G1 F1 Cycle GAN: Learn and Compose concepts separately G1 F1 G2 F2 G1 F1 G2 F2

Losses: Adversarial G1 F1 G2 F2

Losses: Cycle Consistency

Can we generate sketch shoes by combining them? F1 F2 G2 F2 G2 G1 F1

Proposed Model: Concept GAN Cycle GAN: Learn and Compose concepts separately Concept GAN: Learn and Compose concepts simultaneously G1 F1 F2 F2 G2 F2 G2 G1 F1

Losses: Cycle Consistency

Losses: Cycle Consistency (Dist=4) G1 G1 F1 F1 F2 G2 F2 G2

Losses: Cycle Consistency (Dist=4) G1 G1 F1 F1 F2 G2 F2 G2

Losses: Cycle Consistency (Dist=4) G1 G1 F1 F1 F2 G2 F2 G2

Loss: Commutative G1 G1 F1 F1 F2 G2 F2 G2

Loss: Commutative G1 F1 F2 G2 F2 G2 G1 F1

Loss: Commutative G1 F1 F2 G2 F2 G2 G1 F1

Concept GAN

Comparing Results

Qualitative Results

Qualitative Results

Quantitative Results (Classification) Test Set: edge Shoes only Test Set: with eyeglasses, with bangs 18 60 (Accuracy if L_x removed)

Ablation Study

Re-ID

Generalizing to n(=3) Concepts

Learning from 2 attributes in 2 experiments Experiment1: (smile, eyeglasses) Experiment2: (bangs, eyeglasses) (smile, eyeglasses, bangs)

Yaxing Wang, Joost van de Weijer, Luis Herranz CVPR 2018 Mix and match networks: encoder-decoder alignment for zero-pair image translation Yaxing Wang, Joost van de Weijer, Luis Herranz CVPR 2018

Task Mix and match networks do not require retraining on unseen transformations, in contrast to unpaired translation alternatives (e.g. CycleGAN) leverages depth and semantic segmentation, ignoring the available RGB data and the corresponding pairs

Overview

Losses

Depth –to- Semantic segmentation

Depth-to-Semantic segmentation