Introduction Face detection and alignment are essential to many applications such as face recognition, facial expression recognition, age identification,

Slides:



Advertisements
Similar presentations
EE462 MLCV Lecture 5-6 Object Detection – Boosting Tae-Kyun Kim.
Advertisements

Lecture 6: Classification & Localization
Cos 429: Face Detection (Part 2) Viola-Jones and AdaBoost Guest Instructor: Andras Ferencz (Your Regular Instructor: Fei-Fei Li) Thanks to Fei-Fei Li,
EE462 MLCV Lecture 5-6 Object Detection – Boosting Tae-Kyun Kim.
The Viola/Jones Face Detector (2001)
Robust Real-time Object Detection by Paul Viola and Michael Jones ICCV 2001 Workshop on Statistical and Computation Theories of Vision Presentation by.
4EyesFace-Realtime face detection, tracking, alignment and recognition Changbo Hu, Rogerio Feris and Matthew Turk.
Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.
COS 429 PS5: Finding Nemo. Exemplar -SVM Still a rigid template,but train a separate SVM for each positive instance For each category it can has exemplar.
1 Accurate Object Detection with Joint Classification- Regression Random Forests Presenter ByungIn Yoo CS688/WST665.
On the Object Proposal Presented by Yao Lu
FACE DETECTION AND RECOGNITION By: Paranjith Singh Lohiya Ravi Babu Lavu.
Face Alignment Using Cascaded Boosted Regression Active Shape Models
Detection, Segmentation and Fine-grained Localization
A New Fingertip Detection and Tracking Algorithm and Its Application on Writing-in-the-air System The th International Congress on Image and Signal.
BING: Binarized Normed Gradients for Objectness Estimation at 300fps
VIP: Finding Important People in Images Clint Solomon Mathialagan Andrew C. Gallagher Dhruv Batra CVPR
FACE DETECTION : AMIT BHAMARE. WHAT IS FACE DETECTION ? Face detection is computer based technology which detect the face in digital image. Trivial task.
Object Recognition as Ranking Holistic Figure-Ground Hypotheses Fuxin Li and Joao Carreira and Cristian Sminchisescu 1.
Rich feature hierarchies for accurate object detection and semantic segmentation 2014 IEEE Conference on Computer Vision and Pattern Recognition Ross Girshick,
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition arXiv: v4 [cs.CV(CVPR)] 23 Apr 2015 Kaiming He, Xiangyu Zhang, Shaoqing.
Evaluation of Gender Classification Methods with Automatically Detected and Aligned Faces Speaker: Po-Kai Shen Advisor: Tsai-Rong Chang Date: 2010/6/14.
When deep learning meets object detection: Introduction to two technologies: SSD and YOLO Wenchi Ma.
Recent developments in object detection
CS 4501: Introduction to Computer Vision Object Localization, Detection, Semantic Segmentation Connelly Barnes Some slides from Fei-Fei Li / Andrej Karpathy.
A Discriminative Feature Learning Approach for Deep Face Recognition
CNN: Single-label to Multi-label
Lung cancer cell identification based on artificial neural network ensembles 指導老師: 李麗華 教授 報告者: 廖偉丞.
2. Skin - color filtering.
Deeply learned face representations are sparse, selective, and robust
Object Detection based on Segment Masks
ISBI Camelyon16 Challenge Prague, April 13, 2016
Deep Compositional Cross-modal Learning to Rank via Local-Global Alignment Xinyang Jiang, Fei Wu, Xi Li, Zhou Zhao, Weiming Lu, Siliang Tang, Yueting.
Understanding and Predicting Image Memorability at a Large Scale
Depth estimation and Plane detection
Hybrid Features based Gender Classification
Huazhong University of Science and Technology
Object Localization Goal: detect the location of an object within an image Fully supervised: Training data labeled with object category and ground truth.
Mean Euclidean Distance Error (mm)
CS 698 | Current Topics in Data Science
J. Zhu, A. Ahmed and E.P. Xing Carnegie Mellon University ICML 2009
R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.
Adversarially Tuned Scene Generation
Disguised Face Identification (DFI) with Facial KeyPoints using Spatial Fusion Convolutional Network Nathan Sun CIS601.
A Convolutional Neural Network Cascade For Face Detection
Deep Face Recognition Omkar M. Parkhi Andrea Vedaldi Andrew Zisserman
Figure 4. Testing minimal configurations with existing models for spatiotemporal recognition. (A-B) A binary classifier is trained to separate a positive.
Walter J. Scheirer, Samuel E. Anthony, Ken Nakayama & David D. Cox
Counting in Dense Crowds using Deep Learning
Object Detection + Deep Learning
On-going research on Object Detection *Some modification after seminar
CornerNet: Detecting Objects as Paired Keypoints
Object Detection Creation from Scratch Samsung R&D Institute Ukraine
Faster R-CNN By Anthony Martinez.
Papers 15/08.
Outline Background Motivation Proposed Model Experimental Results
Object Tracking: Comparison of
RCNN, Fast-RCNN, Faster-RCNN
MULTI-VIEW VISUAL SPEECH RECOGNITION BASED ON MULTI TASK LEARNING HouJeung Han, Sunghun Kang and Chang D. Yoo Dept. of Electrical Engineering, KAIST, Republic.
Deep Object Co-Segmentation
Motivation It can effectively mine multi-modal knowledge with structured textural and visual relationships from web automatically. We propose BC-DNN method.
Motivation State-of-the-art two-stage instance segmentation methods depend heavily on feature localization to produce masks.
Object Detection Implementations
Multi-UAV to UAV Tracking
End-to-End Facial Alignment and Recognition
Volodymyr Bobyr Supervised by Aayushjungbahadur Rana
Learning to Cluster Faces on an Affinity Graph
Jiahe Li
Point Set Representation for Object Detection and Beyond
Learning to Navigate for Fine-grained Classification
Presentation transcript:

Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks

Introduction Face detection and alignment are essential to many applications such as face recognition, facial expression recognition, age identification, and so on. Automatically output the position for each face in the given image.

Cascaded framework that includes three-stage multi-task deep convolutional networks. Fig1: flowchart

Given an image, we initially resize it to different scales to build an image pyramid

Stage1:Proposal Network(P-net) Obtain the candidate facial windows and their bounding box regression vectors.

Stage1:Proposal Network(P-net) Face classification

Stage1:Proposal Network(P-net) Bounding box regression

Stage1:Proposal Network(P-net) Facial landmark location

P-net detection result Candidates are calibrated based on the estimated bounding box regression vectors. Employ non-maximum suppression (NMS) to merge highly overlapped candidates.

Stage2:Refinement Network(R-net)

R-net detection result All candidates are fed to another CNN(R-Net),which further rejects a large number of false candidates, performs calibration with bounding box regression, and conducts NMS.

Stage3:Output Network (O-Net) This stage is similar to the second stage, but in this stage we aim to identify face regions with more supervision. The network will output five facial landmarks’ positions.

O-net detection result The network will output five facial landmarks’ positions.

Flowchart

Training Face classification: cross-entropy loss

Training Bounding box regression: Euclidean loss

Training Facial landmark localization: Euclidean loss

Training Data

Experiments Training Data Negatives: Regions that the Intersection-over-Union (IoU) ratio less than 0.4 to any ground-truth faces Positives: IoU above 0.65 to a ground truth face Part faces: IoU between 0.4 and 0.65 to a ground truth face Landmark faces: faces labeled 5 landmarks’ positions

Experiments Training Data Negatives and positives are used for face classification tasks Positives and part faces are used for bounding box regression landmark faces are used for facial landmark localization

Experiments

Oscar

Result

Conclusion Framework adopts a cascaded structure with three stages of carefully designed deep convolutional networks that predict face and landmark location in a coarse-to-fine manner. Given the cascade structure, proposed method can achieve very fast speed in joint face detection and alignment.

Reference [1] Zhang, K., Zhang, Z., Li, Z., & Qiao, Y. (2016). Joint face detection and alignment using multitask cascaded convolutional networks. IEEE Signal Processing Letters, 23(10), 1499- 1503.