Lip Feature Extraction Using Red Exclusion Trent W. Lewis and David M.W. Powers Flinders University of SA VIP2000.

Slides:



Advertisements
Similar presentations
Matthias Wimmer, Bernd Radig, Michael Beetz Chair for Image Understanding Computer Science TU München, Germany A Person and Context.
Advertisements

Advanced Image Processing Student Seminar: Lipreading Method using color extraction method and eigenspace technique ( Yasuyuki Nakata and Moritoshi Ando.
Content-based retrieval of audio Francois Thibault MUMT 614B McGill University.
CM613 Multimedia storage and retrieval Lecture: Video Compression Slide 1 CM613 Multimedia storage and retrieval Content-based image retrieval D.Miller.
F ACE TRACKING EE 7700 Name: Jing Chen Shaoming Chen.
Automatic Feature Extraction for Multi-view 3D Face Recognition
3D Face Modeling Michaël De Smet.
Facial feature localization Presented by: Harvest Jang Spring 2002.
Face Recognition and Biometric Systems
Color Image Processing
December 2006 Cairo University Faculty of Computers and Information HMM Based Speech Synthesis Presented by Ossama Abdel-Hamid Mohamed.
1 of 25 1 of 22 Blind-Spot Experiment Draw an image similar to that below on a piece of paper (the dot and cross are about 6 inches apart) Close your right.
Medical Imaging Mohammad Dawood Department of Computer Science University of Münster Germany.
Real-time Embedded Face Recognition for Smart Home Fei Zuo, Student Member, IEEE, Peter H. N. de With, Senior Member, IEEE.
Vision Computing An Introduction. Visual Perception Sight is our most impressive sense. It gives us, without conscious effort, detailed information about.
LYU 0102 : XML for Interoperable Digital Video Library Recent years, rapid increase in the usage of multimedia information, Recent years, rapid increase.
Evaluation of Image Pre-processing Techniques for Eigenface Based Face Recognition Thomas Heseltine york.ac.uk/~tomh
Fig. 2 – Test results Personal Memory Assistant Facial Recognition System The facial identification system is divided into the following two components:
A Self-Organizing Approach to Background Subtraction for Visual Surveillance Applications Lucia Maddalena and Alfredo Petrosino, Senior Member, IEEE.
CRICOS No J † e-Health Research Centre/ CSIRO ICT Centre * Speech, Audio, Image and Video Research Laboratory Comparing Audio and Visual Information.
Face Recognition: A Comparison of Appearance-Based Approaches
Lecture 3a: Feature detection and matching CS6670: Computer Vision Noah Snavely.
UNIVERSITY OF MURCIA (SPAIN) ARTIFICIAL PERCEPTION AND PATTERN RECOGNITION GROUP REFINING FACE TRACKING WITH INTEGRAL PROJECTIONS Ginés García Mateos Dept.
Smart Traveller with Visual Translator. What is Smart Traveller? Mobile Device which is convenience for a traveller to carry Mobile Device which is convenience.
Visual Speech Recognition Using Hidden Markov Models Kofi A. Boakye CS280 Course Project.
Oral Defense by Sunny Tang 15 Aug 2003
Foundations of Computer Vision Rapid object / face detection using a Boosted Cascade of Simple features Presented by Christos Stoilas Rapid object / face.
Human Visual System 4c8 Handout 2. Image and Video Compression We have already seen the need for compression We can use what we have learned in information.
Neighborhood Operations
Multimedia Databases (MMDB)
Multimodal Interaction Dr. Mike Spann
GEOG2021 Environmental Remote Sensing
EE 492 ENGINEERING PROJECT LIP TRACKING Yusuf Ziya Işık & Ashat Turlibayev Yusuf Ziya Işık & Ashat Turlibayev Advisor: Prof. Dr. Bülent Sankur Advisor:
Digital Image Processing & Analysis Spring Definitions Image Processing Image Analysis (Image Understanding) Computer Vision Low Level Processes:
Remote Sensing and Image Processing: 2 Dr. Hassan J. Eghbali.
Multimodal Information Analysis for Emotion Recognition
AdeptSight Image Processing Tools Lee Haney January 21, 2010.
Learning to perceive how hand-written digits were drawn Geoffrey Hinton Canadian Institute for Advanced Research and University of Toronto.
Medical Image Analysis Dr. Mohammad Dawood Department of Computer Science University of Münster Germany.
Advanced Multimedia Image Content Analysis Tamara Berg.
1 Artificial Intelligence: Vision Stages of analysis Low level vision Surfaces and distance Object Matching.
Corner Detection & Color Segmentation CSE350/ Sep 03.
` Tracking the Eyes using a Webcam Presented by: Kwesi Ackon Kwesi Ackon Supervisor: Mr. J. Connan.
Autonomous Robots Vision © Manfred Huber 2014.
1 Machine Vision. 2 VISION the most powerful sense.
CSSE463: Image Recognition Day 6 Yesterday: Local, global, and point operators use different context, but all Yesterday: Local, global, and point operators.
UCD Electronic and Electrical Engineering Robust Multi-modal Person Identification with Tolerance of Facial Expression Niall Fox Dr Richard Reilly University.
Lecture 10: Harris Corner Detector CS4670/5670: Computer Vision Kavita Bala.
Alexandrina Rogozan Adaptive Fusion of Acoustic and Visual Sources for Automatic Speech Recognition UNIVERSITE du MAINE
Effect of Hough Forests Parameters on Face Detection Performance: An Empirical Analysis M. Hassaballah, Mourad Ahmed and H.A. Alshazly Department of Mathematics,
CONTENTS:  Introduction.  Face recognition task.  Image preprocessing.  Template Extraction and Normalization.  Template Correlation with image database.
Over the recent years, computer vision has started to play a significant role in the Human Computer Interaction (HCI). With efficient object tracking.
OCR Reading.
Color Image Processing
Color Image Processing
Deep Learning Amin Sobhani.
The Colour of Light: Additive colour theory.
Traffic Sign Recognition Using Discriminative Local Features Andrzej Ruta, Yongmin Li, Xiaohui Liu School of Information Systems, Computing and Mathematics.
Graphics – Day 1.
Color Image Processing
Computer Vision Lecture 4: Color
Histogram Histogram is a graph that shows frequency of anything. Histograms usually have bars that represent frequency of occuring of data. Histogram has.
4.2 Data Input-Output Representation
Color Image Processing
Color Image Processing
CSSE463: Image Recognition Day 6
CSSE463: Image Recognition Day 6
EE 492 ENGINEERING PROJECT
CSSE463: Image Recognition Day 6
CSSE463: Image Recognition Day 6
Presentation transcript:

Lip Feature Extraction Using Red Exclusion Trent W. Lewis and David M.W. Powers Flinders University of SA VIP2000

01/12/2000Lip Feature Extraction Using Red Exlucsion 2 Overview Context Lip Feature Extraction –Related Work (greyscale, horizontal edges, red and hue colour spaces) –Red Exclusion AVSR: Results and Issues Summary

01/12/2000Lip Feature Extraction Using Red Exlucsion 3 Context Audio Speech Recognition (ASR) Psycholinguistic Research Audio Visual Speech Recognition (AVSR)

01/12/2000Lip Feature Extraction Using Red Exlucsion 4 Context - ASR Up to 99% word accuracy However, –limited context –limited vocabulary –trained on individual –close microphone, cannot handle noise

01/12/2000Lip Feature Extraction Using Red Exlucsion 5 Context - Psycholinguistic McGurk Effect –A[ba] + V[ga]  [da] Viseme –visual phonemes –form complementary sets Demonstrates vision can assist the perception of speech  AVSR

01/12/2000Lip Feature Extraction Using Red Exlucsion 6 Context - AVSR Acoustic Features Visual Features –width –height –oral cavity Integration –Early –Late

01/12/2000Lip Feature Extraction Using Red Exlucsion 7 Lip Feature Extraction Pixel-BasedModel

01/12/2000Lip Feature Extraction Using Red Exlucsion 8 Lip Feature Extraction Pixel-BasedModel –raw pixels or minimal processing –retain linguistically relevant data –large amounts of data,  time –shift and lighting variant –normalisation and PCA

01/12/2000Lip Feature Extraction Using Red Exlucsion 9 Lip Feature Extraction Pixel-BasedModel –reduced input to set of hand-crafted features –width, height, average intensity, etc. –less features,  time –model fitting,  time –lose linguistically relevant features

01/12/2000Lip Feature Extraction Using Red Exlucsion 10 Lip Feature Extraction Pixel-BasedModel –feature extraction –Steps preprocess to enhance contrast locate mouth edges identify corners, height, and other key features train recogntion engine Our Approach

01/12/2000Lip Feature Extraction Using Red Exlucsion 11 Lip Feature Extraction Database 123

01/12/2000Lip Feature Extraction Using Red Exlucsion 12 Lip Feature Extraction Preprocessing Techniques –Grey-scale –Horizontal Edges –Red Analysis –Hue, Saturation, and Value (HSV) –Red Exclusion

01/12/2000Lip Feature Extraction Using Red Exlucsion 13 Lip Feature Extraction Grey-scale –vertical position of mouth minimum row sum –threshold minimum row average of min and max of row –search for above threshold pixels

01/12/2000Lip Feature Extraction Using Red Exlucsion 14 Lip Feature Extraction Grey-scale

01/12/2000Lip Feature Extraction Using Red Exlucsion 15 Lip Feature Extraction Grey-scale

01/12/2000Lip Feature Extraction Using Red Exlucsion 16 Lip Feature Extraction Horizontal Edges –high horizontal edge content –3x3, DY Prewitt operator

01/12/2000Lip Feature Extraction Using Red Exlucsion 17 Lip Feature Extraction Horizontal Edges “Found” CornersBinary Image

01/12/2000Lip Feature Extraction Using Red Exlucsion 18 Lip Feature Extraction Red Analysis –overcome bearded subjects –used for face location

01/12/2000Lip Feature Extraction Using Red Exlucsion 19 Lip Feature Extraction Red Analysis

01/12/2000Lip Feature Extraction Using Red Exlucsion 20 Lip Feature Extraction HSV –disentangles illumination from colour –  Illumination >  Hue

01/12/2000Lip Feature Extraction Using Red Exlucsion 21 Lip Feature Extraction HSV

01/12/2000Lip Feature Extraction Using Red Exlucsion 22 Lip Feature Extraction Red Exclusion –needed extraction method for AVSR –similar to Red Analysis face predominantly red variations occur in the blue and green colours

01/12/2000Lip Feature Extraction Using Red Exlucsion 23 Lip Feature Extraction Red Exclusion

01/12/2000Lip Feature Extraction Using Red Exlucsion 24 Lip Feature Extraction Corners found using Red Exclusion

01/12/2000Lip Feature Extraction Using Red Exlucsion 25 Lip Feature Extraction Comparison

01/12/2000Lip Feature Extraction Using Red Exlucsion 26 AVSR: Results and Issues Application for red exclusion Used in finding lip features –Width –Height –Key pixels

01/12/2000Lip Feature Extraction Using Red Exlucsion 27 AVSR: Results and Issues Visual Speech Recognition Static (%)Dynamic (%) Voicing Viseme Phoneme

01/12/2000Lip Feature Extraction Using Red Exlucsion 28 AVSR: Results and Issues AVSR - Integration Early Static Early Dynamic Late Voice/Vis Late Error Voice/Vis Phoneme

01/12/2000Lip Feature Extraction Using Red Exlucsion 29 Summary Vision can help ASR –AVSR Needed good extraction technique –Red Exclusion AVSR is difficult when both signals degraded

01/12/2000Lip Feature Extraction Using Red Exlucsion 30 Questions? Trent W. Lewis BSc (Cognitive Science) Flinders University