Learning Compositional Visual Concepts with Mutual Consistency (Concept GAN)
Yunye Gong, Srikrishna Karanam, Ziyan Wu, Kuan-Chuan Peng, Jan Ernst, and Peter C. Doerschuk
CVPR 2018 (spotlight)
Presented by: Anand Bhattad (Vision Lunch, April 24, 2018)
Image Translation (Pix2Pix and Cycle GAN)
[Figure: example Pix2Pix and Cycle GAN translations]
Source: Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks (Zhu et al., ICCV 2017)
Cycle GAN
Source: Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks (Zhu et al., ICCV 2017)
New Unseen Samples from Translations?
[Figure: the four domains: colored bags, edge bags, colored shoes, edge shoes]
Overview: Cycle GAN
Cycle GAN: learn and compose concepts separately
[Diagram: two independent Cycle GANs, G1/F1 for concept 1 and G2/F2 for concept 2]
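To make the "learn and compose concepts separately" baseline concrete, here is a minimal PyTorch sketch. It is my own illustration, not the paper's code, and the tiny generator architecture is a placeholder: two generator pairs are trained as ordinary Cycle GANs on their own concept and only chained at test time.

```python
# Minimal sketch (not the paper's code): two independently trained Cycle GAN
# generator pairs, chained only at test time.
import torch
import torch.nn as nn

class TinyGenerator(nn.Module):
    """Placeholder image-to-image generator (stand-in for a real ResNet generator)."""
    def __init__(self, channels=3):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(channels, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, channels, 3, padding=1), nn.Tanh(),
        )
    def forward(self, x):
        return self.net(x)

# Concept 1 (e.g. color <-> edge): G1 applies it, F1 removes it.
G1, F1 = TinyGenerator(), TinyGenerator()
# Concept 2 (e.g. bag <-> shoe): G2 applies it, F2 removes it.
G2, F2 = TinyGenerator(), TinyGenerator()

# Naive composition of separately trained concepts:
x = torch.randn(1, 3, 64, 64)   # an image with neither concept applied
x_both = G2(G1(x))              # hope this lands in the unseen joint domain
```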
Losses: Adversarial
[Diagram: adversarial losses for the generator pairs G1/F1 and G2/F2]
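For reference, the standard Cycle GAN adversarial term for one generator pair, written in the slide's notation; the discriminator name D_1 for the "concept applied" domain is my own placeholder, and symmetric terms are used for F1 and for the G2/F2 pair.

```latex
\mathcal{L}_{\mathrm{adv}}(G_1, D_1)
= \mathbb{E}_{y \sim p_{\mathrm{data}}(y)}\big[\log D_1(y)\big]
+ \mathbb{E}_{x \sim p_{\mathrm{data}}(x)}\big[\log\big(1 - D_1(G_1(x))\big)\big]
```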
Losses: Cycle Consistency
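The corresponding standard Cycle GAN cycle-consistency term for the G1/F1 pair (the G2/F2 pair gets the analogous term):

```latex
\mathcal{L}_{\mathrm{cyc}}(G_1, F_1)
= \mathbb{E}_{x}\big[\lVert F_1(G_1(x)) - x \rVert_1\big]
+ \mathbb{E}_{y}\big[\lVert G_1(F_1(y)) - y \rVert_1\big]
```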
Can we generate sketch shoes by combining them?
[Diagram: naively chaining the separately trained generators G1/F1 and G2/F2]
Proposed Model: Concept GAN
Cycle GAN: learn and compose concepts separately
Concept GAN: learn and compose concepts simultaneously
[Diagram: the four domains coupled by the shared generator pairs G1/F1 and G2/F2]
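The way I read the diagram (a hedged summary, not a quote from the paper): images live in four corner domains indexed by whether each concept is present, and each concept-specific generator is reused on both parallel edges of the square, regardless of the other concept's state.

```latex
\text{Domains: } X_{00},\ X_{10},\ X_{01},\ X_{11}
  \quad\text{(subscripts: concept 1 present, concept 2 present)} \\
G_1 : X_{0j} \to X_{1j}, \quad F_1 : X_{1j} \to X_{0j}, \qquad
G_2 : X_{i0} \to X_{i1}, \quad F_2 : X_{i1} \to X_{i0}, \qquad i, j \in \{0, 1\}
```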
Losses: Cycle Consistency
Losses: Cycle Consistency (Dist=4)
[Diagram, built up over three slides: length-4 cycles visiting all four domains via G1, G2, F1, F2]
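One representative length-4 cycle in the notation above, written out as I read the diagram (the full set of penalized cycles is my guess): start from an image with neither concept, apply concept 1, then concept 2, then remove concept 1 and concept 2, and require that we return to the starting image.

```latex
\mathcal{L}_{\mathrm{cyc}}^{(4)}
= \mathbb{E}_{x \sim X_{00}}\Big[\big\lVert F_2\big(F_1\big(G_2(G_1(x))\big)\big) - x \big\rVert_1\Big]
\;+\; \text{(analogous terms for cycles starting at the other corners)}
```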
Loss: Commutative
[Diagram, built up over three slides: applying the two concepts in either order, G2 ∘ G1 vs. G1 ∘ G2]
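A sketch of the commutative constraint as I read the diagram: applying the two concepts in either order should produce the same image in the joint domain.

```latex
\mathcal{L}_{\mathrm{comm}}
= \mathbb{E}_{x \sim X_{00}}\Big[\big\lVert G_2(G_1(x)) - G_1(G_2(x)) \big\rVert_1\Big]
```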
Concept GAN
Comparing Results
Qualitative Results
Qualitative Results
Quantitative Results (Classification)
Test sets: edge shoes only; faces with eyeglasses and with bangs
[Table: classification accuracy on generated samples; the slide notes accuracies of 18 and 60 when the loss term L_x is removed]
Ablation Study
Re-ID (Person Re-Identification)
Generalizing to n(=3) Concepts
Learning from 2 attributes in 2 experiments
Experiment 1: (smile, eyeglasses)
Experiment 2: (bangs, eyeglasses)
Combined: (smile, eyeglasses, bangs)
Mix and Match Networks: Encoder-Decoder Alignment for Zero-Pair Image Translation
Yaxing Wang, Joost van de Weijer, Luis Herranz
CVPR 2018
Task
Mix and match networks do not require retraining for unseen transformations. In contrast, an unpaired translation alternative (e.g. Cycle GAN) would translate directly between depth and semantic segmentation, ignoring the available RGB data and the corresponding (RGB, depth) and (RGB, segmentation) pairs.
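A minimal sketch of the zero-pair idea as I understand it (my own illustration; module names, channel counts, and shapes are placeholders, not the paper's code): encoders and decoders are trained only on the pairs that exist (RGB-depth and RGB-segmentation), and an unseen translation such as depth to segmentation is obtained at test time by mixing the depth encoder with the segmentation decoder through the shared latent space.

```python
# Minimal sketch (not the paper's code) of mix-and-match composition.
import torch
import torch.nn as nn

def encoder(in_ch, latent_ch=64):
    """Placeholder modality-specific encoder into a shared latent space."""
    return nn.Sequential(nn.Conv2d(in_ch, latent_ch, 3, padding=1), nn.ReLU())

def decoder(out_ch, latent_ch=64):
    """Placeholder modality-specific decoder from the shared latent space."""
    return nn.Sequential(nn.Conv2d(latent_ch, out_ch, 3, padding=1))

enc_rgb, enc_depth, enc_seg = encoder(3), encoder(1), encoder(40)
dec_rgb, dec_depth, dec_seg = decoder(3), decoder(1), decoder(40)

# Training (not shown) only ever uses the pairs that exist:
#   RGB <-> depth   and   RGB <-> semantic segmentation.
# Zero-pair translation at test time: mix the depth encoder with the
# segmentation decoder, even though no (depth, segmentation) pair was seen.
depth = torch.randn(1, 1, 64, 64)
seg_logits = dec_seg(enc_depth(depth))   # unseen depth -> segmentation
```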
Overview
Losses
Depth-to-Semantic Segmentation
Depth-to-Semantic segmentation