Signature with Text-Dependent and Text-Independent Speech for Robust Identity Verification B. Ly-Van*, R. Blouet**, S. Renouard** S. Garcia-Salicetti*,

Slides:



Advertisements
Similar presentations
Some activities on Biometrics at ENST/CNRS-LTCI
Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
ECG Signal processing (2)
An Introduction of Support Vector Machine
Fusion of HMM’s Likelihood and Viterbi Path for On-line Signature Verification Bao Ly Van - Sonia Garcia Salicetti - Bernadette Dorizzi Institut National.
An Overview of Machine Learning
GMM-Based Multimodal Biometric Verification Yannis Stylianou Yannis Pantazis Felipe Calderero Pedro Larroy François Severin Sascha Schimke Rolando Bonal.
Face Recognition & Biometric Systems, 2005/2006 Face recognition process.
Feature Vector Selection and Use With Hidden Markov Models to Identify Frequency-Modulated Bioacoustic Signals Amidst Noise T. Scott Brandes IEEE Transactions.
Speaker Recognition G. CHOLLET, G. GRAVIER,
Lecture 20 Object recognition I
Instructor: Dr. G. Bebis Reza Amayeh Fall 2005
Dynamic Face Recognition Committee Machine Presented by Sunny Tang.
1 Cours parole du 9 Mars 2005 enseignants: Dr. Dijana Petrovska-Delacrétaz et Gérard Chollet Reconnaissance du locuteur 1.Introduction, Historique, Domaines.
Data mining and statistical learning - lecture 13 Separating hyperplane.
Comparison and Combination of Ear and Face Images in Appearance-Based Biometrics IEEE Trans on PAMI, VOL. 25, NO.9, 2003 Kyong Chang, Kevin W. Bowyer,
Pattern Recognition. Introduction. Definitions.. Recognition process. Recognition process relates input signal to the stored concepts about the object.
ICASSP'06 1 S. Y. Kung 1 and M. W. Mak 2 1 Dept. of Electrical Engineering, Princeton University 2 Dept. of Electronic and Information Engineering, The.
1 New Technique for Improving Speech Intelligibility for the Hearing Impaired Miriam Furst-Yust School of Electrical Engineering Tel Aviv University.
A PRESENTATION BY SHAMALEE DESHPANDE
Oral Defense by Sunny Tang 15 Aug 2003
Authors: Anastasis Kounoudes, Anixi Antonakoudi, Vasilis Kekatos
Toshiba Update 04/09/2006 Data-Driven Prosody and Voice Quality Generation for Emotional Speech Zeynep Inanoglu & Steve Young Machine Intelligence Lab.
HMM-BASED PSEUDO-CLEAN SPEECH SYNTHESIS FOR SPLICE ALGORITHM Jun Du, Yu Hu, Li-Rong Dai, Ren-Hua Wang Wen-Yi Chu Department of Computer Science & Information.
1 Robust HMM classification schemes for speaker recognition using integral decode Marie Roch Florida International University.
Institute of Information Science, Academia Sinica, Taiwan Speaker Verification via Kernel Methods Speaker : Yi-Hsiang Chao Advisor : Hsin-Min Wang.
An Introduction to Biometric Identity Verification
June 28th, 2004 BioSecure, SecurePhone 1 Automatic Speaker Verification : Technologies, Evaluations and Possible Future Gérard CHOLLET CNRS-LTCI, GET-ENST.
Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.
COMMON EVALUATION FINAL PROJECT Vira Oleksyuk ECE 8110: Introduction to machine Learning and Pattern Recognition.
International Conference on Intelligent and Advanced Systems 2007 Chee-Ming Ting Sh-Hussain Salleh Tian-Swee Tan A. K. Ariff. Jain-De,Lee.
LOG-ENERGY DYNAMIC RANGE NORMALIZATON FOR ROBUST SPEECH RECOGNITION Weizhong Zhu and Douglas O’Shaughnessy INRS-EMT, University of Quebec Montreal, Quebec,
Signature with Text-Dependent and Text-Independent Speech for Robust Identity Verification B. Ly-Van*, R. Blouet**, S. Renouard** S. Garcia-Salicetti*,
Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.
Dijana Petrovska-Delacrétaz 1 Asmaa el Hannani 1 Gérard Chollet 2 1: DIVA Group, University of Fribourg 2: GET-ENST, CNRS-LTCI,
Page 1 NOLISP, Paris, May 23rd 2007 Audio-Visual Audio-Visual Subspaces Audio Visual Reduced Audiovisual Subspace Principal Component & Linear Discriminant.
Feature Vector Selection and Use With Hidden Markov Models to Identify Frequency-Modulated Bioacoustic Signals Amidst Noise T. Scott Brandes IEEE Transactions.
Speaker Verification Speaker verification uses voice as a biometric to determine the authenticity of a user. Speaker verification systems consist of two.
A Baseline System for Speaker Recognition C. Mokbel, H. Greige, R. Zantout, H. Abi Akl A. Ghaoui, J. Chalhoub, R. Bayeh University Of Balamand - ELISA.
Singer similarity / identification Francois Thibault MUMT 614B McGill University.
Speaker Identification by Combining MFCC and Phase Information Longbiao Wang (Nagaoka University of Technologyh, Japan) Seiichi Nakagawa (Toyohashi University.
Handwritten Signature Verification
University of Texas at Austin Machine Learning Group Department of Computer Sciences University of Texas at Austin Support Vector Machines.
Singer Similarity Doug Van Nort MUMT 611. Goal Determine Singer / Vocalist based on extracted features of audio signal Classify audio files based on singer.
A DYNAMIC APPROACH TO THE SELECTION OF HIGH ORDER N-GRAMS IN PHONOTACTIC LANGUAGE RECOGNITION Mikel Penagarikano, Amparo Varona, Luis Javier Rodriguez-
Support-Vector Networks C Cortes and V Vapnik (Tue) Computational Models of Intelligence Joon Shik Kim.
Learning Kernel Classifiers 1. Introduction Summarized by In-Hee Lee.
Speaker Verification System Middle Term Presentation Performed by: Barak Benita & Daniel Adler Instructor: Erez Sabag.
Machine Learning: A Brief Introduction Fu Chang Institute of Information Science Academia Sinica ext. 1819
FieldTraining Seminar on Field Training Seminar on “Hand-Geometry Based Person Authentication System ” By By Ullesh Chavadi M Ullesh Chavadi M.
SUPERVISED AND UNSUPERVISED LEARNING Presentation by Ege Saygıner CENG 784.
Tree and Forest Classification and Regression Tree Bagging of trees Boosting trees Random Forest.
Mustafa Gokce Baydogan, George Runger and Eugene Tuv INFORMS Annual Meeting 2011, Charlotte A Bag-of-Features Framework for Time Series Classification.
A Brief Introduction to Support Vector Machine (SVM) Most slides were from Prof. A. W. Moore, School of Computer Science, Carnegie Mellon University.
Multimodal Biometric Security 1.
BIOMETRICS VOICE RECOGNITION. Meaning Bios : LifeMetron : Measure Bios : LifeMetron : Measure Biometrics are used to identify the input sample when compared.
Face Detection EE368 Final Project Group 14 Ping Hsin Lee
Traffic Sign Recognition Using Discriminative Local Features Andrzej Ruta, Yongmin Li, Xiaohui Liu School of Information Systems, Computing and Mathematics.
Multimodal Biometric Security
Semantic Video Classification
Mixture of SVMs for Face Class Modeling
Sharat.S.Chikkerur S.Anand Mantravadi Rajeev.K.Srinivasan
Support Vector Machines Introduction to Data Mining, 2nd Edition by
Sfax University, Tunisia
PRAKASH CHOCKALINGAM, NALIN PRADEEP, AND STAN BIRCHFIELD
Ala’a Spaih Abeer Abu-Hantash Directed by Dr.Allam Mousa
Multimodal Caricatural Mirror
A maximum likelihood estimation and training on the fly approach
A Novel Smoke Detection Method Using Support Vector Machine
Auditory Morphing Weyni Clacken
Presentation transcript:

Signature with Text-Dependent and Text-Independent Speech for Robust Identity Verification B. Ly-Van*, R. Blouet**, S. Renouard** S. Garcia-Salicetti*, B. Dorizzi*, G. Chollet** * INT, dept EPH, 9 rue Charles Fourier, EVRY France; **ENST, Lab. CNRS-LTCI, 46 rue Barrault, Paris s: {Bao.Ly_van, Sonia.Salicetti, {Blouet, Renouard,

Overview Introduction: Why Speech and Signature? BIOMET database: brief description –Signature data –Speech data Writer verification Speaker verification systems Fusion systems Results and Conclusions

The BIOMET Database 5 modalities: hand shape, fingerprints, on- line signatures, talking faces 131 people: 50% male, 50% female Data from 68 people for fusion Time variability: two sessions spaced of 5 months –S. Garcia-Salicetti, C. Beumier, G. Chollet, B. Dorizzi, J. Leroux-Les Jardins, J. Lunter, Y. Ni, D. Petrovska-Delacretaz, "BIOMET: a Multimodal Person Authentication Database Including Face, Voice, Fingerprint, Hand and Signature Modalities", 4th International Conference on Audio and Video-Based Biometric Person Authentication, 2003.

Signatures capture Captured on a digitizer : 200 Hz –WACOM Intuos2 A6 5 parameters: –Coordinates –Axial pressure –Azimuth and Altitude 15 genuine per person 12 forgeries per person Altitude (0°-90°) 90° 270° 0° Azimuth (0°-359°) 180°

Signatures modeling Preprocessing (filtering) Feature extraction: 12 parameters Modeling signature: continuous HMM –2 states, 3 gaussians per state –Bagging techniques: 10 models to build an «aggregated» model (average score) –Training: 10 signatures of one session Normalized score: |S i (O) - S i * |

Speech Two verification systems: –Data: volontary degraded Text-dependent: only 4 digits sequence among 10 digits (5 templates per speaker) Text-independent: sentences extracted from the original data: –client model: trained on digits (15 seconds) and tested on sentences –world model: trained on data from people –Methods: Text-dependent: DTW (Dynamic Time Warping) Text-independent: GMM (Gaussian Mixture Model)

Text-dependent (DTW) DTW computes the spectral distance between two template patterns Template speech signal Sample speech signal DTW Score

Front-end GMM MODELING WORLD GMM MODEL Front-end GMM model adaptation TARGET GMM MODEL Text-independent (GMM)

Baseline GMM method HYPOTH. TARGET GMM MOD. Front-end WORLD GMM MODEL Test Speech LLR SCORE =

Fusion systems Additive Tree Classifier (ATC) –Boosting techniques on Binary Trees –CART algorithm Support Vector Machine (SVM) –Linear kernel Input: –Normalized signature score –Text-dependent LLR score –Text-independent LLR score

 Goal: finding an optimal partition R = {R k } 1  k  K of the score space S=(s 1, s 2, s 3 ) according to an Information Theory criterion  a sub-optimal solution, based on CART:  Best partition : R* = arg min R C(R)  Score estimation based on P(client|R k ) and P(world|R k ) at each node of a given tree  Use of RealAdaboost to build 50 trees per client and to obtain a robust estimation of P(client|R k ) and P(world|R k ) Tree-based Approach for score fusion

 A score S=(s 1, s 2, s 3 ) is presented to the system composed of 50 trees :  each tree gives as output a score, based on the affected region R k  the LLR score is computed with P(client|R k ) and P(world|R k )  an average score is then computed with the 50 scores Verification based on ATC

SVM principles X (X)(X) Input space Feature space Separating hyperplans H, with the optimal hyperplan H o HoHo H Class(X)

Fusion experiments The 68 people database: splitted in 2 equal parts –34 people: Fusion Learning Base (and threshold estimation for unimodal systems with the criterion min TE) –34 people: Fusion Test Base (and test of unimodal systems) Per person: –5 genuine bimodal values –12 impostor bimodal values

Fusion Performances

Conclusions Equivalent results of ATC and SVM: –role of Boosting (ATC) Fusion increases performance by a factor 2 relatively to the best unimodal system (in clear or noisy environments) Other methods to create noisy environments should be tested (not gaussian white noise but real one !) Fusion performances should also be studied only on the 2 speech verification systems, since no noise was introduced in the signature modality