Deep Automatic Portrait Matting

Presentation transcript:

Deep Automatic Portrait Matting. Xiaoyong Shen, Xin Tao, Hongyun Gao, Chao Zhou, Jiaya Jia. The Chinese University of Hong Kong.
Good morning, everyone. I am Xiaoyong Shen from the Chinese University of Hong Kong. In this spotlight presentation, we present deep automatic portrait matting.

Portrait Matting: Input Image, Alpha Matte, Stylization, Cartoon, Color Transform, Depth-of-field, Portrait Background Edit.
Portrait matting extracts the foreground and plays an important role in portrait post-processing. For example, we need to extract the foreground to apply stylization, to add a depth-of-field effect that makes the person stand out, or to edit the background.

Matting Problem: $I = \alpha F + (1 - \alpha) B$, where $I$ is the image, $\alpha$ the alpha matte, $F$ the foreground, and $B$ the background. Ill-posed problem: seven unknowns should be estimated for each pixel.
Simply put, image matting decomposes an image into a background and a foreground, which are linearly combined by an alpha matte. Matting is clearly an ill-posed problem, since each pixel has seven unknowns (three color channels each for $F$ and $B$, plus $\alpha$) but only three observed color values.
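
As a minimal illustration of why the compositing equation is ill-posed, the sketch below blends a single RGB pixel and shows two different (alpha, F, B) triples that produce exactly the same observed color. The function name `composite` and the pixel values are made up for this example and are not from the talk.

```python
def composite(alpha, fg, bg):
    """Blend one RGB pixel: I = alpha * F + (1 - alpha) * B, channel by channel."""
    return tuple(alpha * f + (1.0 - alpha) * b for f, b in zip(fg, bg))


# Explanation 1: a half-transparent red foreground over a blue background.
pixel_a = composite(0.5, (1.0, 0.0, 0.0), (0.0, 0.0, 1.0))

# Explanation 2: a fully opaque purple foreground over a green background.
pixel_b = composite(1.0, (0.5, 0.0, 0.5), (0.0, 1.0, 0.0))

# Both explanations yield the same observed pixel, so I alone cannot
# determine alpha, F, and B.
same_observation = pixel_a == pixel_b

# Per pixel: 3 (F) + 3 (B) + 1 (alpha) unknowns versus 3 observed values.
unknowns_per_pixel = 3 + 3 + 1
knowns_per_pixel = 3
```

The observation alone cannot separate the two explanations, which is why extra constraints such as strokes or a trimap are needed.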

Image Matting: user interactions are needed. Input, Strokes, Trimap.
To make the matting problem tractable, matting approaches need the user to specify foreground and background regions. Generally, these are marked with strokes or a trimap.

Issues: user-specified strokes or trimaps rarely meet the algorithm's requirements.
Unfortunately, it is very hard for user-specified strokes or trimaps to meet the algorithm's requirements. For example, even with such complex strokes, the closed-form matting result is still not satisfactory. Similarly, it is very difficult to label a trimap that achieves a good matting result.

Tedious interaction is involved to produce these trimaps.
To further illustrate the difficulties, we take trimap specification as an example. For these two input portraits, to achieve such matting results, the user needs to label trimaps with complex structures like these. Without doubt, tedious interaction is required even from professional users.

Deep Automatic Portrait Matting: end-to-end CNNs.
To address the tedious interaction issue, we propose deep automatic portrait matting with well-defined convolutional neural networks.

Deep Automatic Matting
Our CNN is an end-to-end network that takes a portrait as input and directly predicts the matte.

Trimap Labeling. Input: RGB image. Output: trimap representation. Network: FCN [Long et al. 2015].
Our network has two main parts. The first part is the trimap labeling process, which takes a color image as input and outputs the trimap representation using an FCN structure.

Image Matting Layer. Input: trimap representation. Output: alpha matte. Newly designed layers.
The second part is the image matting layer, which takes the previously predicted trimap representation as input and estimates the final alpha matte. It is our newly designed CNN layer with newly defined functions.

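Image Matting Layer
Feed-forward:
$$\min \; \lambda A^{T} B A + \lambda (A - \mathbf{1})^{T} F (A - \mathbf{1}) + A^{T} L A$$
Backward:
$$\frac{\partial f}{\partial B} = -\lambda D^{-1} \, \mathrm{diag}(D^{-1} F)$$
$$\frac{\partial f}{\partial F} = \frac{\partial f}{\partial B} + D^{-1}$$
$$\frac{\partial f}{\partial \lambda} = -\lambda D^{-1} \, \mathrm{diag}(F + B) \, D^{-1} F$$
Here we show its feed-forward and backward formulas.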
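
To make the feed-forward objective concrete: setting its gradient with respect to A to zero gives the linear system (λB + λF + L) A = λF·1. The sketch below solves that system on a toy five-pixel 1D "image". Several things here are assumptions rather than statements from the talk: B and F are treated as diagonal 0/1 indicators taken from a hard trimap, a chain-graph Laplacian stands in for the matting Laplacian L, and the system matrix is assumed to be the D that appears in the backward formulas (the slide does not define D). The helper names are hypothetical.

```python
def solve(matrix, rhs):
    """Solve matrix @ x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    # Work on an augmented copy so the inputs are left untouched.
    aug = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):  # back substitution
        s = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - s) / aug[i][i]
    return x


def chain_laplacian(n):
    """Graph Laplacian of a 1D chain of n pixels (stand-in for the matting Laplacian)."""
    lap = [[0.0] * n for _ in range(n)]
    for i in range(n - 1):
        lap[i][i] += 1.0
        lap[i + 1][i + 1] += 1.0
        lap[i][i + 1] -= 1.0
        lap[i + 1][i] -= 1.0
    return lap


def matting_forward(fg, bg, lam):
    """Minimise lam*A^T B A + lam*(A-1)^T F (A-1) + A^T L A for diagonal B and F."""
    n = len(fg)
    system = chain_laplacian(n)                # start from L
    for i in range(n):
        system[i][i] += lam * (fg[i] + bg[i])  # add lam*B + lam*F on the diagonal
    rhs = [lam * f for f in fg]                # lam * F * 1
    return solve(system, rhs)


# Toy trimap: pixel 0 is foreground, pixel 4 is background, pixels 1-3 unknown.
fg = [1.0, 0.0, 0.0, 0.0, 0.0]
bg = [0.0, 0.0, 0.0, 0.0, 1.0]
alpha = matting_forward(fg, bg, lam=100.0)
```

With a large λ the labeled pixels are pulled close to 1 and 0, and the Laplacian term spreads alpha smoothly across the unknown pixels in between.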

Learning: Data Collection. We create a dataset of 2,000 portraits for training and testing: 1,700 for training and 300 for testing. Large variations in age, gender, pose, hairstyle, background, camera type, etc. The matting ground truth is estimated from trimaps carefully labeled by humans.
To make network training and testing possible, we collected a dataset of 2,000 portraits with large variations. For each portrait, we obtain the matting ground truth by closed-form matting under a trimap carefully labeled by a human. Here we show some examples from our dataset.

Data Examples
The portrait images.

Labeled Mattes
And our labeled matting ground truth.

Experiments. Running time: training takes 20k iterations, one day on a Titan X GPU; testing takes 0.6s for a 600×800 color image. Comparisons: automatic segmentation-to-trimap approaches; direct trimap labeling methods.
Our method is efficient in both training and testing: it needs one day for training and 0.6s for testing. To evaluate its performance, we compare our approach with automatic segmentation-to-trimap approaches and with direct trimap labeling methods.
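
The talk does not say how the segmentation-to-trimap baselines convert a binary segmentation into a trimap. One common way, assumed here purely for illustration, is to mark as unknown every pixel whose neighborhood contains both foreground and background labels, which is equivalent to taking the band between an eroded and a dilated mask. The function `make_trimap` and its `radius` parameter are hypothetical names.

```python
def make_trimap(mask, radius=1):
    """Turn a binary mask into a trimap.

    Output labels: 1.0 = foreground, 0.0 = background, 0.5 = unknown.
    A pixel is unknown when its (2*radius+1) x (2*radius+1) window,
    clipped at the image border, contains both mask labels.
    """
    height, width = len(mask), len(mask[0])
    trimap = []
    for y in range(height):
        row = []
        for x in range(width):
            window = [
                mask[j][i]
                for j in range(max(0, y - radius), min(height, y + radius + 1))
                for i in range(max(0, x - radius), min(width, x + radius + 1))
            ]
            if all(window):
                row.append(1.0)   # surrounded by foreground only
            elif not any(window):
                row.append(0.0)   # surrounded by background only
            else:
                row.append(0.5)   # near the segmentation boundary
        trimap.append(row)
    return trimap


# A one-row mask with the boundary between columns 2 and 3.
mask = [[0, 0, 0, 1, 1, 1]]
trimap = make_trimap(mask, radius=1)
```

A larger `radius` widens the unknown band, which gives the matting step more freedom around hair but also more pixels to estimate.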

Evaluation

Methods                     Grad. Error (×10⁻³)   Conn. Error (×10⁻⁴)
Graph-cut Trimap            4.93                  7.73
AutoTrimap                  4.61                  7.63
Trimap by FCN               4.14                  7.61
Trimap by DeepLab           3.91                  7.52
Trimap by CRFasRNN          3.56                  7.39
Ours without Shape Mask     3.11                  6.99
Ours                        3.03                  6.90

Here we show the comparisons. Ours achieves the best performance.

Input
For this input portrait with complex hair,

Graph-cut Trimap
the matting result from a trimap based on graph-cut segmentation fails to estimate the correct matte in the hair areas.

FCN Trimap
Directly labeling the trimap using FCN also fails to extract the hair.

Ours
Ours achieves a satisfactory result with clear matte details.

Input
Here is another example.

Graph-cut Trimap
The trimap estimated by graph-cut cannot produce a correct matting result.

FCN Trimap
The matting result from the FCN-labeled trimap is better.

Ours
Ours achieves a high-quality result with sufficient hair detail.

More Results
For this example with complex hair and lighting, our automatic matting result is also highly accurate.

More Results
Here is another portrait. Our approach extracts the hair structures well.

Conclusion. We proposed deep automatic portrait matting: an end-to-end matting CNN framework; a novel matting layer; a matting dataset with 2,000 portraits. Future work: video portrait matting; person matting; general object matting.
In conclusion, we proposed a deep automatic portrait matting approach under an end-to-end matting CNN framework with novel matting layers. A matting dataset of 2,000 portraits was created for training and testing. In the future, we are interested in extending our framework to video, person, and general object matting.

Thanks
Thank you very much, and welcome to our poster.