Deep screen image crop and enhance

Slides:

Advertisements

Similar presentations

Logan Lebanoff Mentor: Haroon Idrees

Advertisements

Deep Residual Learning for Image Recognition

Lecture 4b Data augmentation for CNN training

Lecture 3a Analysis of training of NN

SUPER RESOLUTION USING NEURAL NETS Hila Levi & Eran Amar Weizmann Ins

Deeply-Recursive Convolutional Network for Image Super-Resolution

Deep Residual Learning for Image Recognition

Generative Adversarial Nets

Deep Learning for Dual-Energy X-Ray

Learning to Compare Image Patches via Convolutional Neural Networks

Analysis of Sparse Convolutional Neural Networks

Deep Residual Networks

Environment Generation with GANs

Summary of “Efficient Deep Learning for Stereo Matching”

Object Detection based on Segment Masks

Data Mining, Neural Network and Genetic Programming

Computer Science and Engineering, Seoul National University

DeepCount Mark Lenson.

The Problem: Classification

Textual Video Prediction Week 2

Inception and Residual Architecture in Deep Convolutional Networks

Project 7: Modeling Social Network Structures and their Dynamic Evolutions with User- Generated Data from IoT REU Student: Emma Ambrosini Graduate mentors:

Hierarchical Deep Convolutional Neural Network

Synthesis of X-ray Projections via Deep Learning

Super-resolution Image Reconstruction

Single Image Super-Resolution

Efficient Deep Model for Monocular Road Segmentation

Presenter: Hajar Emami

Textual Video Prediction

Low Dose CT Image Denoising Using WGAN and Perceptual Loss

Deep Learning Convoluted Neural Networks Part 2 11/13/

Layer-wise Performance Bottleneck Analysis of Deep Neural Networks

Bird-species Recognition Using Convolutional Neural Network

Face Recognition with Deep Learning Method

Image Classification.

SBNet: Sparse Blocks Network for Fast Inference

A Comparative Study of Convolutional Neural Network Models with Rosenblatt’s Brain Model Abu Kamruzzaman, Atik Khatri , Milind Ikke, Damiano Mastrandrea,

Deep CNN of JPEG 2000 電信所R 林俊廷.

Counting in Dense Crowds using Deep Learning

By: Behrouz Rostami, Zeyun Yu Electrical Engineering Department

Road Traffic Sign Recognition

Very Deep Convolutional Networks for Large-Scale Image Recognition

Basics of Deep Learning No Math Required

Generative Adversarial Network

A Proposal Defense On Deep Residual Network For Face Recognition Presented By SAGAR MISHRA MECE

Image to Image Translation using GANs

GAN Applications.

Use 3D Convolutional Neural Network to Inspect Solder Ball Defects

Lip movement Synthesis from Text

Object Tracking: Comparison of

Analysis of Trained CNN (Receptive Field & Weights of Network)

Mihir Patel and Nikhil Sardana

Objective: to find and verify inverses of functions.

Deep Object Co-Segmentation

Natalie Lang Tomer Malach

VERY DEEP CONVOLUTIONAL NETWORKS FOR LARGE-SCALE IMAGE RECOGNITION

Weeks 1 and 2 Aaron Ott.

Deep screen image crop and enhance

End-to-End Facial Alignment and Recognition

Appearance Transformer (AT)

Deep screen image crop and enhance

Self-Supervised Cross-View Action Synthesis

End-to-End Speech-Driven Facial Animation with Temporal GANs

Self-Supervised Cross-View Action Synthesis

Deep screen image crop and enhance

CRCV REU 2019 Aaron Honculada.

Deep screen image crop and enhance

Deep screen image crop and enhance

Directional Occlusion with Neural Network

Shengcong Chen, Changxing Ding, Minfeng Liu 2018

Presentation transcript:

Deep screen image crop and enhance Week 3 (Aaron Ott, Amir Mazaheri)

Problem We have taken a photo of an image, and we want the original image. This can be broken into 2 parts: Image Detector/Cropper Image Enhancer

Cropper Uses a frozen VGG-19 model to get feature map Applies convolutions, normalizations, and activations Final dense layer creates 6-number affine transformation STN takes input image and applies affine transformation

Enhancer Pretrained EDSR (trained on DIV2K) Modified form of Resnet https://github.com/krasserm/super-resolution Pretrained EDSR (trained on DIV2K) Modified form of Resnet Uses modified residual block, which excludes batch normalization and final ReLU layer 16 Residual blocks Subpixel Conv2D layers for upscaling the image Scales the image 4x Lim, Son, Kim, Nah, Lee. “Enhanced Deep Residual Networks for Single Image Super-Resolution”. 10 July 2017

Combined Cropper and Enhancer Trained with 2 outputs and 2 Loss Functions: - Trained Cropper on VGG + Cosine Proximity - Trained Enhancer on VGG + MSE

Results Cropper & Enhancer Metric\Model Cropper Cropper & Enhancer PSNR 11.1903 16.2060 SSIM 0.4254 0.4909 MSE 0.0796 0.0281 Input Cropper Actual

Shortcomings of PSNR, SSIM, and MSE Metrics: PSNR: 18.3130 SSIM: 0.5358 MSE: 0.0164 Input Output Loss Functions VGG + Cosine Proximity MSE Actual

Building the GAN: Discriminator Used Discriminator from https://github.com/krasserm/super-resolution Skips Batch Normalization in first Discrimination Block Pairs of each level of number of feature maps Final Dense layers, with a single value output Discrimination Block

Only trained on 15 epochs, starting with existing weights Current GAN Output Input Output Actual Metric\Model Cropper Cropper & Enhancer GAN PSNR 11.1903 16.2060 16.2404 SSIM 0.4254 0.4909 0.4899 MSE 0.0796 0.0281 0.0277

What’s Next Short Term (next week) Optimize GAN and get it training properly Try new enhancers Synthetically create dataset Long Term (to the end of the summer) Develop network to work on harder datasets Connect model to solve existing issues: identification/classification