Recent Advances in Generative Adversarial Networks (GANs) ELEG 5491 Introduction to Deep Learning Tutorial Xihui Liu
Outline Recap of generative adversarial networks Projection discriminator Self-attention GAN BigGAN StyleGAN SPADE EdgeConnect SC-FEGAN S3GAN
Recap
cGANs with Projection Discriminator Takeru Miyato, Masanori Koyama
Discriminator Structure
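The projection discriminator conditions on the class label through an inner product between a class embedding and the image features, rather than concatenating the label to the input or hidden layers. A minimal PyTorch sketch, assuming a placeholder backbone `phi` and illustrative feature dimensions (not the paper's exact network):

```python
import torch
import torch.nn as nn

class ProjectionDiscriminator(nn.Module):
    def __init__(self, feat_dim=128, num_classes=1000):
        super().__init__()
        # phi(x): any image feature extractor; here a stub producing feat_dim features
        self.phi = nn.Sequential(nn.Conv2d(3, feat_dim, 4, 2, 1), nn.ReLU(),
                                 nn.AdaptiveAvgPool2d(1), nn.Flatten())
        self.psi = nn.Linear(feat_dim, 1)                  # unconditional score psi(phi(x))
        self.embed = nn.Embedding(num_classes, feat_dim)   # class embedding v_y

    def forward(self, x, y):
        h = self.phi(x)                                    # (B, feat_dim)
        # D(x, y) = psi(phi(x)) + <v_y, phi(x)>  -- the "projection" term
        return self.psi(h) + (self.embed(y) * h).sum(dim=1, keepdim=True)
```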
Results & Category morphing
Self-Attention Generative Adversarial Networks Han Zhang, Ian Goodfellow, Dimitris Metaxas, Augustus Odena
Self-Attention Structure
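A minimal sketch of the SAGAN self-attention block in PyTorch; the C/8 channel reduction for the query/key branches follows the paper, but the layer names and the zero-initialized scale gamma are implementation choices:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SelfAttention(nn.Module):
    def __init__(self, channels):
        super().__init__()
        self.f = nn.Conv2d(channels, channels // 8, 1)  # query
        self.g = nn.Conv2d(channels, channels // 8, 1)  # key
        self.h = nn.Conv2d(channels, channels, 1)       # value
        self.gamma = nn.Parameter(torch.zeros(1))       # starts at 0: attention is blended in gradually

    def forward(self, x):
        B, C, H, W = x.shape
        q = self.f(x).flatten(2)                        # (B, C/8, N), N = H*W
        k = self.g(x).flatten(2)                        # (B, C/8, N)
        v = self.h(x).flatten(2)                        # (B, C,   N)
        attn = F.softmax(torch.bmm(q.transpose(1, 2), k), dim=-1)     # (B, N, N) attention map
        out = torch.bmm(v, attn.transpose(1, 2)).view(B, C, H, W)     # attend over all positions
        return self.gamma * out + x                     # residual connection
```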
Techniques to stabilize GAN training
Spectral normalization for both generator and discriminator
Imbalanced learning rates for the generator and discriminator updates (TTUR); see the sketch below
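An illustrative sketch of both stabilization techniques in PyTorch: spectral normalization wrapped around every conv/linear layer, plus the imbalanced (two time-scale) learning rates. The 1e-4 / 4e-4 values follow the common SAGAN setting; the generator and discriminator classes are placeholders:

```python
import torch.nn as nn
from torch.nn.utils import spectral_norm
from torch.optim import Adam

def apply_spectral_norm(module):
    """Recursively wrap all Conv2d/Linear layers with spectral normalization."""
    for name, child in module.named_children():
        if isinstance(child, (nn.Conv2d, nn.Linear)):
            setattr(module, name, spectral_norm(child))
        else:
            apply_spectral_norm(child)
    return module

# generator, discriminator = ...                 # any G/D networks
# apply_spectral_norm(generator)                 # SAGAN applies SN to both G and D
# apply_spectral_norm(discriminator)
# opt_g = Adam(generator.parameters(),     lr=1e-4, betas=(0.0, 0.9))  # slower generator updates
# opt_d = Adam(discriminator.parameters(), lr=4e-4, betas=(0.0, 0.9))  # faster discriminator updates
```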
Visualize self-attention
Large Scale GAN Training for High Fidelity Natural Image Synthesis Andrew Brock, Jeff Donahue, Karen Simonyan https://www.youtube.com/watch?v=ZKQp28OqwNQ
Model Structure
Techniques
Increase batch size by a factor of 8 ---> IS increases by 46%
Increase channel width ---> IS further increases by 21%
Increase depth ---> BigGAN-deep
Shared class embedding for conditional batch normalization
Skip connections (skip-z) from the noise vector to multiple layers of the generator
Truncation trick -- trading off variety against fidelity (see the sketch below)
Orthogonal regularization
Extensive experiments
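A small sketch of the truncation trick in PyTorch: latent entries that fall outside a threshold are resampled, so smaller thresholds trade variety for fidelity. The threshold value and the generator call are illustrative only:

```python
import torch

def truncated_noise(batch_size, dim, threshold=0.5):
    """Sample z from a standard normal, resampling entries until all lie within the threshold."""
    z = torch.randn(batch_size, dim)
    while True:
        mask = z.abs() > threshold                      # entries outside the truncation range
        if not mask.any():
            return z
        z[mask] = torch.randn(int(mask.sum()))          # resample only the out-of-range entries

# z = truncated_noise(16, 128, threshold=0.5)   # smaller threshold -> higher fidelity, less variety
# images = generator(z, class_labels)           # placeholder for a BigGAN-style conditional generator
```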
Results
Results
Easy Classes and Difficult Classes
A Style-Based Generator Architecture for Generative Adversarial Networks Tero Karras, Samuli Laine, Timo Aila https://www.youtube.com/watch?v=-cOYwZ2XcAc&t=21s
Network Architecture Adaptive Instance Normalization (AdaIN)
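A minimal sketch of AdaIN as used in the StyleGAN generator, in PyTorch: a learned affine map turns the style vector w into per-channel scale and bias that modulate instance-normalized features. Dimensions and layer names are illustrative:

```python
import torch
import torch.nn as nn

class AdaIN(nn.Module):
    def __init__(self, channels, style_dim):
        super().__init__()
        self.norm = nn.InstanceNorm2d(channels, affine=False)  # normalize per channel, per sample
        self.affine = nn.Linear(style_dim, channels * 2)        # w -> (scale, bias)

    def forward(self, x, w):
        scale, bias = self.affine(w).chunk(2, dim=1)            # each (B, C)
        scale = scale.unsqueeze(-1).unsqueeze(-1)
        bias = bias.unsqueeze(-1).unsqueeze(-1)
        return (1 + scale) * self.norm(x) + bias                # style modulates the normalized features
```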
Building up the Model
Results -- faces
Results -- faces
Results -- bedrooms
Results -- cars
A Style-Based Generator Architecture for Generative Adversarial Networks
Semantic Image Synthesis with Spatially-Adaptive Normalization Taesung Park, Ming-Yu Liu, Ting-Chun Wang, Jun-Yan Zhu
Image Synthesis from Semantic Segmentation Masks
Previous approaches: pix2pixHD The semantic layout is directly fed as input to the deep network and processed through stacks of convolution, normalization, and nonlinearity layers. However, this is suboptimal, as the normalization layers tend to “wash away” the semantic information in the input segmentation masks.
Conditional Normalization Layers Proven effective for recent generative adversarial networks such as Self-attention GAN, BigGAN, StyleGAN, etc.
SPADE: spatially-adaptive denormalization
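A sketch of one SPADE layer in PyTorch: parameter-free batch normalization followed by spatially-varying scale and bias predicted from the (resized) segmentation map. The hidden width of 128 and 3x3 kernels follow the paper's default, but other details are simplified:

```python
import torch.nn as nn
import torch.nn.functional as F

class SPADE(nn.Module):
    def __init__(self, channels, label_channels, hidden=128):
        super().__init__()
        self.norm = nn.BatchNorm2d(channels, affine=False)      # normalization without learned affine
        self.shared = nn.Sequential(nn.Conv2d(label_channels, hidden, 3, padding=1), nn.ReLU())
        self.gamma = nn.Conv2d(hidden, channels, 3, padding=1)   # spatial scale map
        self.beta = nn.Conv2d(hidden, channels, 3, padding=1)    # spatial bias map

    def forward(self, x, segmap):
        # resize the one-hot segmentation map to the current feature resolution
        segmap = F.interpolate(segmap, size=x.shape[2:], mode='nearest')
        h = self.shared(segmap)
        return self.norm(x) * (1 + self.gamma(h)) + self.beta(h)
```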
SPADE Generator
SPADE Generator Better preserves semantic information than conventional normalization layers, which tend to “wash away” semantic information in input semantic segmentation masks.
Details
Multi-modal synthesis with an image encoder and a KL-divergence loss.
The discriminator is the same as in pix2pixHD.
Datasets: COCO-Stuff, ADE20K, ADE20K-outdoor, Cityscapes, Flickr Landscapes
Results
EdgeConnect: Generative Image Inpainting with Adversarial Edge Learning Kamyar Nazeri, Eric Ng, Tony Joseph, Faisal Z. Qureshi, Mehran Ebrahimi
Network Structure Two stages: an edge generator followed by an image completion network (see the sketch below)
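A high-level sketch of the two-stage inference in PyTorch. The paper uses Canny edges; a simple gradient-magnitude proxy stands in here so the sketch stays self-contained, and `edge_generator` / `completion_network` are placeholders for the two trained networks:

```python
import torch
import torch.nn.functional as F

def inpaint(image, mask, edge_generator, completion_network):
    gray = image.mean(dim=1, keepdim=True)                       # grayscale proxy of the RGB input
    # crude edge map via finite differences (the paper uses Canny edge detection)
    dx = F.pad(gray[..., :, 1:] - gray[..., :, :-1], (0, 1))
    dy = F.pad(gray[..., 1:, :] - gray[..., :-1, :], (0, 0, 0, 1))
    edges = (dx.pow(2) + dy.pow(2)).sqrt()
    # Stage 1: hallucinate edges inside the missing (mask == 1) region
    pred_edges = edge_generator(torch.cat([gray * (1 - mask), edges * (1 - mask), mask], dim=1))
    full_edges = edges * (1 - mask) + pred_edges * mask
    # Stage 2: complete the image conditioned on the composited edge map
    output = completion_network(torch.cat([image * (1 - mask), full_edges], dim=1))
    return image * (1 - mask) + output * mask                    # keep known pixels, fill the hole
```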
Results
Results
SC-FEGAN: Face Editing Generative Adversarial Network with User's Sketch and Color Youngjoo Jo, Jongyoul Park https://www.youtube.com/watch?v=dvzlvHNxdfI
Results
High-Fidelity Image Generation With Fewer Labels (S3GAN) Mario Lucic, Michael Tschannen, Marvin Ritter, Xiaohua Zhai, Olivier Bachem, Sylvain Gelly
High-Fidelity Image Generation With Fewer Labels
High-Fidelity Image Generation With Fewer Labels