Single Image Super-Resolution

Slides:



Advertisements
Similar presentations
Fast Direct Super-Resolution by Simple Functions
Advertisements

Deep Residual Learning for Image Recognition
SUPER RESOLUTION USING NEURAL NETS Hila Levi & Eran Amar Weizmann Ins
When deep learning meets object detection: Introduction to two technologies: SSD and YOLO Wenchi Ma.
Deeply-Recursive Convolutional Network for Image Super-Resolution
Recent developments in object detection
Generative Adversarial Nets
What Convnets Make for Image Captioning?
Object Detection based on Segment Masks
Convolutional Neural Fabrics by Shreyas Saxena, Jakob Verbeek
Textual Video Prediction Week 2
Perceptual Loss Deep Feature Interpolation for Image Content Changes
Compositional Human Pose Regression
Chaoyun Zhang, Xi Ouyang, and Paul Patras
Synthesis of X-ray Projections via Deep Learning
Super-Resolution Image Reconstruction
Super-resolution Image Reconstruction
Presenter: Hajar Emami
Efficient Deep Model for Monocular Road Segmentation
Machine Learning: The Connectionist
Deep Residual Learning for Image Recognition
Presenter: Hajar Emami
Textual Video Prediction
Low Dose CT Image Denoising Using WGAN and Perceptual Loss
Deep Learning Convoluted Neural Networks Part 2 11/13/
Master’s Thesis defense Ming Du Advisor: Dr. Yi Shang
Neural Photo Editing Andrew Brock.
Layer-wise Performance Bottleneck Analysis of Deep Neural Networks
Visual Question Generation
Introduction to Neural Networks
Image Classification.
Learning to See in the Dark
A Comparative Study of Convolutional Neural Network Models with Rosenblatt’s Brain Model Abu Kamruzzaman, Atik Khatri , Milind Ikke, Damiano Mastrandrea,
Two-Stream Convolutional Networks for Action Recognition in Videos
RGB-D Image for Scene Recognition by Jiaqi Guo
Deep Learning Hierarchical Representations for Image Steganalysis
Very Deep Convolutional Networks for Large-Scale Image Recognition
Deep Visual-Semantic Alignments for Generating Image Descriptions
Generative Adversarial Network
Image to Image Translation using GANs
Lecture: Deep Convolutional Neural Networks
Visualizing and Understanding Convolutional Networks
Tuning CNN: Tips & Tricks
Lip movement Synthesis from Text
View Inter-Prediction GAN: Unsupervised Representation Learning for 3D Shapes by Learning Global Shape Memories to Support Local View Predictions 1,2 1.
Speakers: Luo Mai, Jingqing Zhang
Zhedong Zheng, Liang Zheng and Yi Yang
Textual Video Prediction
Beyond Face Rotation: Global and Local Perception GAN for Photorealistic and Identity Preserving Frontal View Synthesis Speaker: ZHAO Jian
TPGAN overview.
Abnormally Detection
Deep Object Co-Segmentation
Natalie Lang Tomer Malach
A New Large-Scale Sentinel-2 Benchmark Archive to Drive Deep Learning Studies in Remote Sensing Gencer Sumbul1, Marcela Charfuelan3, Begüm Demir1, Volker.
VERY DEEP CONVOLUTIONAL NETWORKS FOR LARGE-SCALE IMAGE RECOGNITION
Deep screen image crop and enhance
Deep screen image crop and enhance
Xilai Li, Tianfu Wu, and Xi Song CVPR 2019 Presented by Dingquan Li
CRCV REU 2019 Kara Schatz.
Week 3 Volodymyr Bobyr.
Deep screen image crop and enhance
End-to-End Speech-Driven Facial Animation with Temporal GANs
Deep screen image crop and enhance
Recent Developments on Super-Resolution
CRCV REU 2019 Aaron Honculada.
Deep screen image crop and enhance
Deep screen image crop and enhance
Directional Occlusion with Neural Network
CVPR2019 Jiahe Li SiamRPN introduces the region proposal network after the Siamese network and performs joint classification and regression.
Deep CNN for breast cancer histology Image Analysis
Presentation transcript:

Single Image Super-Resolution Adam Vest

Problem Overview Challenging problem → difficult to infer the values for multiple pixels from a single pixel Significant advances in the last two years due to deep learning methods First deep learning systems used ConvNets, recent state-of-the-art uses GANs Goal is to find some mapping f(ILR) = ISR where ISR is as close as possible to IHR

The First Wave: ConvNets Convolutional networks made great progress in SISR Several variants over the years: SRCNN DRCN ESPCN General take-aways: learning upscaling leads to dramatic increases in performance, deeper networks performed better Suffers from blurry image outputs/cannot capture image textures well due to reliance on L1/L2 losses SRResNet is the current state-of-the-art ConvNet for SISR

SRGAN First model to use adversarial learning for super-resolution 16 block deep ResNet structure + 2 PixelShuffle blocks (Transpose Conv.) Utilized a content loss (MSE or VGG) and an adversarial loss (BCE) to produce realistic images Pretty good PSNR, very good Mean-Opinion-Score

Project Goal To use adversarial learning to improve on the current state-of-the-art in SISR Build on the SRGAN baseline by exploring different choices for network architectures, the merits of different loss functions, etc. Implement an existing/design a new adversarial model that has not yet been used for SISR

Where Are We Now? Downloaded ImageNet and other datasets Setup on the cluster Modular implementations of all SRResNet and SRGAN variants Trained SRResNet on ImageNet, currently training SRGAN Implemented evaluation suite for BSD100, Urban100, Set5, and Set14 datasets Tested SRResNet implementation on evaluation suite

SRResNet Implementation Results LR SR HR

SRResNet Implementation Results Cont. PSNR Results for 4x Upscaling Dataset Ours (RGB) Theirs (Y-Channel) Set5 29.90 32.05 Set14 26.42 28.49 BSD100 26.15 27.58 Urban100 24.31 Not Reported

Where Are We Going? SRGAN + Wasserstein discriminator SRGAN + PatchGAN discriminator? Do super-resolution on Y-channel of the image only Investigate other adversarial models/network structures Investigate different evaluation metrics → SSIM Could lead to different choices for loss functions

Thank you! Questions?