Introduction Face detection and alignment are essential to many applications such as face recognition, facial expression recognition, age identification,

Slides:

Advertisements

Similar presentations

EE462 MLCV Lecture 5-6 Object Detection – Boosting Tae-Kyun Kim.

Advertisements

Lecture 6: Classification & Localization

Cos 429: Face Detection (Part 2) Viola-Jones and AdaBoost Guest Instructor: Andras Ferencz (Your Regular Instructor: Fei-Fei Li) Thanks to Fei-Fei Li,

EE462 MLCV Lecture 5-6 Object Detection – Boosting Tae-Kyun Kim.

The Viola/Jones Face Detector (2001)

Robust Real-time Object Detection by Paul Viola and Michael Jones ICCV 2001 Workshop on Statistical and Computation Theories of Vision Presentation by.

4EyesFace-Realtime face detection, tracking, alignment and recognition Changbo Hu, Rogerio Feris and Matthew Turk.

Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.

COS 429 PS5: Finding Nemo. Exemplar -SVM Still a rigid template,but train a separate SVM for each positive instance For each category it can has exemplar.

1 Accurate Object Detection with Joint Classification- Regression Random Forests Presenter ByungIn Yoo CS688/WST665.

On the Object Proposal Presented by Yao Lu

FACE DETECTION AND RECOGNITION By: Paranjith Singh Lohiya Ravi Babu Lavu.

Face Alignment Using Cascaded Boosted Regression Active Shape Models

Detection, Segmentation and Fine-grained Localization

A New Fingertip Detection and Tracking Algorithm and Its Application on Writing-in-the-air System The th International Congress on Image and Signal.

BING: Binarized Normed Gradients for Objectness Estimation at 300fps

VIP: Finding Important People in Images Clint Solomon Mathialagan Andrew C. Gallagher Dhruv Batra CVPR

FACE DETECTION : AMIT BHAMARE. WHAT IS FACE DETECTION ? Face detection is computer based technology which detect the face in digital image. Trivial task.

Object Recognition as Ranking Holistic Figure-Ground Hypotheses Fuxin Li and Joao Carreira and Cristian Sminchisescu 1.

Rich feature hierarchies for accurate object detection and semantic segmentation 2014 IEEE Conference on Computer Vision and Pattern Recognition Ross Girshick,

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition arXiv: v4 [cs.CV(CVPR)] 23 Apr 2015 Kaiming He, Xiangyu Zhang, Shaoqing.

Evaluation of Gender Classification Methods with Automatically Detected and Aligned Faces Speaker: Po-Kai Shen Advisor: Tsai-Rong Chang Date: 2010/6/14.

When deep learning meets object detection: Introduction to two technologies: SSD and YOLO Wenchi Ma.

Recent developments in object detection

CS 4501: Introduction to Computer Vision Object Localization, Detection, Semantic Segmentation Connelly Barnes Some slides from Fei-Fei Li / Andrej Karpathy.

A Discriminative Feature Learning Approach for Deep Face Recognition

CNN: Single-label to Multi-label

Lung cancer cell identification based on artificial neural network ensembles 指導老師: 李麗華教授報告者: 廖偉丞.

2. Skin - color filtering.

Deeply learned face representations are sparse, selective, and robust

Object Detection based on Segment Masks

ISBI Camelyon16 Challenge Prague, April 13, 2016

Deep Compositional Cross-modal Learning to Rank via Local-Global Alignment Xinyang Jiang, Fei Wu, Xi Li, Zhou Zhao, Weiming Lu, Siliang Tang, Yueting.

Understanding and Predicting Image Memorability at a Large Scale

Depth estimation and Plane detection

Hybrid Features based Gender Classification

Huazhong University of Science and Technology

Object Localization Goal: detect the location of an object within an image Fully supervised: Training data labeled with object category and ground truth.

Mean Euclidean Distance Error (mm)

CS 698 | Current Topics in Data Science

J. Zhu, A. Ahmed and E.P. Xing Carnegie Mellon University ICML 2009

R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.

Adversarially Tuned Scene Generation

Disguised Face Identification (DFI) with Facial KeyPoints using Spatial Fusion Convolutional Network Nathan Sun CIS601.

A Convolutional Neural Network Cascade For Face Detection

Deep Face Recognition Omkar M. Parkhi Andrea Vedaldi Andrew Zisserman

Figure 4. Testing minimal configurations with existing models for spatiotemporal recognition. (A-B) A binary classifier is trained to separate a positive.

Walter J. Scheirer, Samuel E. Anthony, Ken Nakayama & David D. Cox

Counting in Dense Crowds using Deep Learning

Object Detection + Deep Learning

On-going research on Object Detection *Some modification after seminar

CornerNet: Detecting Objects as Paired Keypoints

Object Detection Creation from Scratch Samsung R&D Institute Ukraine

Faster R-CNN By Anthony Martinez.

Outline Background Motivation Proposed Model Experimental Results

Object Tracking: Comparison of

RCNN, Fast-RCNN, Faster-RCNN

MULTI-VIEW VISUAL SPEECH RECOGNITION BASED ON MULTI TASK LEARNING HouJeung Han, Sunghun Kang and Chang D. Yoo Dept. of Electrical Engineering, KAIST, Republic.

Deep Object Co-Segmentation

Motivation It can effectively mine multi-modal knowledge with structured textural and visual relationships from web automatically. We propose BC-DNN method.

Motivation State-of-the-art two-stage instance segmentation methods depend heavily on feature localization to produce masks.

Object Detection Implementations

Multi-UAV to UAV Tracking

End-to-End Facial Alignment and Recognition

Volodymyr Bobyr Supervised by Aayushjungbahadur Rana

Learning to Cluster Faces on an Affinity Graph

Point Set Representation for Object Detection and Beyond

Learning to Navigate for Fine-grained Classification

Presentation transcript:

Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks

Introduction Face detection and alignment are essential to many applications such as face recognition, facial expression recognition, age identification, and so on. Automatically output the position for each face in the given image.

Cascaded framework that includes three-stage multi-task deep convolutional networks. Fig1: flowchart

Given an image, we initially resize it to different scales to build an image pyramid

Stage1:Proposal Network(P-net) Obtain the candidate facial windows and their bounding box regression vectors.

Stage1:Proposal Network(P-net) Face classification

Stage1:Proposal Network(P-net) Bounding box regression

Stage1:Proposal Network(P-net) Facial landmark location

P-net detection result Candidates are calibrated based on the estimated bounding box regression vectors. Employ non-maximum suppression (NMS) to merge highly overlapped candidates.

Stage2:Refinement Network(R-net)

R-net detection result All candidates are fed to another CNN(R-Net),which further rejects a large number of false candidates, performs calibration with bounding box regression, and conducts NMS.

Stage3:Output Network (O-Net) This stage is similar to the second stage, but in this stage we aim to identify face regions with more supervision. The network will output five facial landmarks’ positions.

O-net detection result The network will output five facial landmarks’ positions.

Flowchart

Training Face classification: cross-entropy loss

Training Bounding box regression: Euclidean loss

Training Facial landmark localization: Euclidean loss

Training Data

Experiments Training Data Negatives: Regions that the Intersection-over-Union (IoU) ratio less than 0.4 to any ground-truth faces Positives: IoU above 0.65 to a ground truth face Part faces: IoU between 0.4 and 0.65 to a ground truth face Landmark faces: faces labeled 5 landmarks’ positions

Experiments Training Data Negatives and positives are used for face classification tasks Positives and part faces are used for bounding box regression landmark faces are used for facial landmark localization

Experiments

Oscar

Result

Conclusion Framework adopts a cascaded structure with three stages of carefully designed deep convolutional networks that predict face and landmark location in a coarse-to-fine manner. Given the cascade structure, proposed method can achieve very fast speed in joint face detection and alignment.

Reference [1] Zhang, K., Zhang, Z., Li, Z., & Qiao, Y. (2016). Joint face detection and alignment using multitask cascaded convolutional networks. IEEE Signal Processing Letters, 23(10), 1499- 1503.