Recent Advances in Generative Adversarial Networks (GAN)

Slides:



Advertisements
Similar presentations
On the Relationship between Visual Attributes and Convolutional Networks Paper ID - 52.
Advertisements

Comp 5013 Deep Learning Architectures Daniel L. Silver March,
Deep Residual Learning for Image Recognition
Deep Learning Overview Sources: workshop-tutorial-final.pdf
Recent developments in object detection
Big data classification using neural network
CS 4501: Introduction to Computer Vision Object Localization, Detection, Semantic Segmentation Connelly Barnes Some slides from Fei-Fei Li / Andrej Karpathy.
Generative Adversarial Network (GAN)
Learning to Compare Image Patches via Convolutional Neural Networks
Deep Neural Net Scenery Generation
Object Detection based on Segment Masks
Compact Bilinear Pooling
Convolutional Neural Fabrics by Shreyas Saxena, Jakob Verbeek
Generative Adversarial Networks
RIVER SEGMENTATION FOR FLOOD MONITORING
CNN Demo LIU Pengpeng.
Deep learning and applications to Natural language processing
Structured Predictions with Deep Learning
Training Techniques for Deep Neural Networks
Generative adversarial networks (GANs) for edge detection
Unsupervised Conditional Generation
CS6890 Deep Learning Weizhen Cai
Authors: Jun-Yan Zhu*, Taesun Park*, Phillip Isola, Alexei A. Efros
Presenter: Hajar Emami
Deep Learning and Newtonian Physics
Adversarially Tuned Scene Generation
Low Dose CT Image Denoising Using WGAN and Perceptual Loss
Master’s Thesis defense Ming Du Advisor: Dr. Yi Shang
Neural Photo Editing Andrew Brock.
Computer Vision James Hays
Two-Stream Convolutional Networks for Action Recognition in Videos
Deep learning Introduction Classes of Deep Learning Networks
Very Deep Convolutional Networks for Large-Scale Image Recognition
Generative Adversarial Network
[Figure taken from googleblog
Non-Stationary Texture Synthesis by Adversarial Expansion
Object Detection Creation from Scratch Samsung R&D Institute Ukraine
Image to Image Translation using GANs
GAN Applications.
Papers 15/08.
Outline Background Motivation Proposed Model Experimental Results
Neural Speech Synthesis with Transformer Network
Y2Seq2Seq: Cross-Modal Representation Learning for 3D Shape and Text by Joint Reconstruction and Prediction of View and Word Sequences 1, Zhizhong.
Lip movement Synthesis from Text
Coding neural networks: A gentle Introduction to keras
Hein H. Aung Matt Anderson, Advisor Introduction
View Inter-Prediction GAN: Unsupervised Representation Learning for 3D Shapes by Learning Global Shape Memories to Support Local View Predictions 1,2 1.
Advances in Deep Audio and Audio-Visual Processing
边缘检测年度进展概述 Ming-Ming Cheng Media Computing Lab, Nankai University
Course Recap and What’s Next?
An introduction to: Deep Learning aka or related to Deep Neural Networks Deep Structural Learning Deep Belief Networks etc,
CMS 165 Lecture 18 Summing up...
Ch 14. Generative adversarial networks (GANs) for edge detection
Chuan Wang1, Haibin Huang1, Xiaoguang Han2, Jue Wang1
Human-object interaction
Image Processing and Multi-domain Translation
VERY DEEP CONVOLUTIONAL NETWORKS FOR LARGE-SCALE IMAGE RECOGNITION
Hsiao-Yu Chiang Xiaoya Li Ganyu “Bruce” Xu Pinshuo Ye Zhanyuan Zhang
Object Detection Implementations
Background Task Fashion image inpainting Some conceptions
Learning Deconvolution Network for Semantic Segmentation
Weak-supervision based Multi-Object Tracking
End-to-End Facial Alignment and Recognition
Deep screen image crop and enhance
Week 7 Presentation Ngoc Ta Aidean Sharghi
Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks
Deep screen image crop and enhance
Learning to Detect Human-Object Interactions with Knowledge
Directional Occlusion with Neural Network
Shengcong Chen, Changxing Ding, Minfeng Liu 2018
Presentation transcript:

Recent Advances in Generative Adversarial Networks (GAN) ELEG 5491 Introduction to Deep Learning Tutorial Xihui Liu

Outline Recap of generative adversarial networks Projection discriminator Self-attention GAN BigGAN StyleGAN EdgeConnect S3GAN SPADE

Recap

cGANs with Projection Discriminator Takeru Miyato, Masanori Koyama

Discriminator Structure

Results & Category morphing

Self-Attention Generative Adversarial Networks Han Zhang, Ian Goodfellow, Dimitris Metaxas, Augustus Odena

Self-Attention Structure

Techniques to stabilize GAN training Spectral normalization for both generator and discriminator Imbalanced learning rate for generator and discriminator updates

Visualize self-attention

Large Scale GAN Training for High Fidelity Natural Image Synthesis Andrew Brock, Jeff Donahue, Karen Simonyan https://www.youtube.com/watch?v=ZKQp28OqwNQ

Model Structure

Techniques Increase batch size by a factor of 8 ---> IS increases by 46% Increased channel width ---> IS further increases by 21% Increased depth ---> BigGAN-deep Shared class embedding for conditional Batch-norm. Skip connections to multiple layers of generator. Truncation trick -- trading off the variety and fidelity. Orthogonal regularization Extensive experiments

Results

Results

Easy Classes and Difficult Classes

Tero Karras, Samuli Laine, Timo Aila A Style-Based Generator Architecture for Generative Adversarial Networks Tero Karras, Samuli Laine, Timo Aila https://www.youtube.com/watch?v=-cOYwZ2XcAc&t=21s

Network Architecture Adaptive IN

Building up the Model

Results -- faces

Results -- faces

Results -- bedrooms

Results -- cars

A Style-Based Generator Architecture for Generative Adversarial Networks

Semantic Image Synthesis with Spatially-Adaptive Normalization Taesung Park, Ming-Yu Liu, Ting-Chun Wang, Jun-Yan Zhu

Image Synthesis from Semantic Segmentation Masks

Previous approaches: pix2pixHD Directly feed the semantic layout as input to the deep network, which is processed through stacks of convolution, normalization, and nonlinearity layers. However, this is suboptimal as the normalization layers tend to “wash away” semantic information in input semantic segmentation masks.

Conditional Normalization Layers Proven effective for recent generative adversarial networks such as Self-attention GAN, BigGAN, StyleGAN, etc.

SPADE: spatially-adaptive denormalization

SPADE Generator

SPADE Generator Better preserve semantic information against common normalization layers. The conventional normalization layers tend to “wash away” semantic information in input semantic segmentation masks.

Details Multi-modal synthesis with an image encoder and a KL-Divergence loss. Discriminator is the same as pix2pixHD. Datasets COCO-Stuff ADE20K ADE20K-outdoor Cityscapes Flickr Landscapes

Results

EdgeConnect: Generative Image Inpainting with Adversarial Edge Learning Kamyar Nazeri, Eric Ng, Tony Joseph, Faisal Z. Qureshi, Mehran Ebrahimi

Network Structure Edge generator Image completion network

Results

Results

Youngjoo Jo, Jongyoul Park https://www.youtube.com/watch?v=dvzlvHNxdfI SC-FEGAN: Face Editing Generative Adversarial Network with User's Sketch and Color Youngjoo Jo, Jongyoul Park https://www.youtube.com/watch?v=dvzlvHNxdfI

Results

High-Fidelity Image Generation With Fewer Labels Mario Lucic, Michael Tschannen, Marvin Ritter, Xiaohua Zhai, Olivier Bachem, Sylvain Gelly

High-Fidelity Image Generation With Fewer Labels

High-Fidelity Image Generation With Fewer Labels