Study on Deep Learning in Speaker Recognition Lantian Li CSLT / RIIT Tsinghua University May 26, 2016.

Slides:



Advertisements
Similar presentations
Overview of biometric technology. Contents: 1. What is Biometrics The term “biometrics” is derived from the Greek words “bio” (life) and “metrics” (to.
Advertisements

BIOMETRIC VOTING SYSTEM
Speaker Recognition Sharat.S.Chikkerur Center for Unified Biometrics and Sensors
U of HCOSC 6397 – Lecture 1 #1 U of HCOSC 6397 Lecture 1: Introduction to Biometrics Prof. Ioannis Pavlidis.
BTC - 1 Biometrics Technology Centre (BTC) Biometrics Solution for Authentication Prof. David Zhang Director Biometrics Technology Centre (UGC/CRC) Department.
Section – Biometrics 1. Biometrics Biometric refers to any measure used to uniquely identify a person based on biological or physiological traits.
What is Biometrics?  Biometrics is referred as an authentication system that measures the physiological and individual characteristics of a human being.
By: Monika Achury and Shuchita Singh
FIT3105 Biometric based authentication and identity management
Introduction to Biometrics Dr. Pushkin Kachroo. New Field Face recognition from computer vision Speaker recognition from signal processing Finger prints.
GUIDE TO BIOMETRICS CHAPTER I & II September 7 th 2005 Presentation by Tamer Uz.
Biometrics Austen Hayes and Cody Powell. Overview  What is Biometrics?  Types of Biometric Recognition  Applications of Biometric Systems  Types of.
Biometrics and Authentication Shivani Kirubanandan.
A PRESENTATION BY SHAMALEE DESHPANDE
B IOMETRICS Akash Mudubagilu Arindam Gupta. O VERVIEW What is Biometrics? Why Biometrics? General Biometric System Different types of Biometrics Uses.
Marjie Rodrigues
Security-Authentication
Biometrics: Ear Recognition
Karthiknathan Srinivasan Sanchit Aggarwal
Zachary Olson and Yukari Hagio CIS 4360 Computer Security November 19, 2008.
By Alvaro E. Escobar 1 Biometrics Agenda I. Video II. Biometric Overview III. Biometric Technologies IV. Accuracy Metrics V. BioPrivacy Concerns.
1 7-Speech Recognition (Cont’d) HMM Calculating Approaches Neural Components Three Basic HMM Problems Viterbi Algorithm State Duration Modeling Training.
Speaker Recognition By Afshan Hina.
VOICE USER INTERFACE BY R.SELVI K.PRIYA I-MCA. INTRODUCTION voice portal can be defined as “speech  enabled access to Web based information”. In other.
June 28th, 2004 BioSecure, SecurePhone 1 Automatic Speaker Verification : Technologies, Evaluations and Possible Future Gérard CHOLLET CNRS-LTCI, GET-ENST.
N ew Security Approaches Biometric Technologies are Coming of Age ANIL KUMAR GUPTA & SUMIT KUMAR CHOUDHARY.
B IOMETRIC STANDARDS A N OVERVIEW OF BIOMETRICS AND IDENTITY MANAGEMENT Supervisor : Ahmed Abu Mosameh Prepared by samaher el nbahen UNIVERSITY.
Introduction to Biometrics Charles Tappert Seidenberg School of CSIS, Pace University.
BIOMETRICS By: Lucas Clay and Tim Myers. WHAT IS IT?  Biometrics are a method of uniquely identifying a person based on physical or behavioral traits.
Voice Recognition All Talk No Walk.
BIOMETRICS.
Signature with Text-Dependent and Text-Independent Speech for Robust Identity Verification B. Ly-Van*, R. Blouet**, S. Renouard** S. Garcia-Salicetti*,
Signature with Text-Dependent and Text-Independent Speech for Robust Identity Verification B. Ly-Van*, R. Blouet**, S. Renouard** S. Garcia-Salicetti*,
Identification and Biometrics By Jay Eichler. Introduction What is biometrics? What is biometrics? Types of biometrics Types of biometrics Controversy.
Biometrics Authentication Technology
Biometrics and Retina Scan Technology Lum OSMANI Alex CHERVENKOV Course: Information Security April 2008.
Introduction to Biometrics Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #5 Issues on Designing Biometric Systems September 7, 2005.
Biometric Technologies
Biometrics Group 3 Tina, Joel, Mark, Jerrod. Biometrics Defined Automated methods or recognizing a person based on a physiological and behavioral characteristics.
INTRODUCTION TO BIOMATRICS ACCESS CONTROL SYSTEM Prepared by: Jagruti Shrimali Guided by : Prof. Chirag Patel.
L. F. Coppenrath & Associates PASSWORD BIOPASSWORD ® Biometric Keystroke Dynamics Technology Overview.
Biometric Devices Biometric devices use secure identification and authentication in order for someone to use the device. These devices use automated.
Biometric Technology Jordan Seaton CIS 320 WKU. Essay Questions Which industries would find biometric technology the most useful? Why?
Biometrics By Rachel Borazio. What is biometrics? Biometrics is a security measure used to identify physical features of people to allow access to a system.
FieldTraining Seminar on Field Training Seminar on “Hand-Geometry Based Person Authentication System ” By By Ullesh Chavadi M Ullesh Chavadi M.
What does it mean to us?.  History  Biometrics Defined  Modern Day Applications  Spoofing  Future of Biometrics.
Biometrics Dr. Nermin Hamza
Shital ghule..  INTRODUCTION: This paper proposes an ATM security model that would combine a physical access card,a pin and electronic facial recognition.
Michael Carlino. ROADMAP -Biometrics Definition -Different types -Future -Advantages -Disadvantages -Common Biometric Report -Current Issues.
Speaker Recognition UNIT -6. Introduction  Speaker recognition is the process of automatically recognizing who is speaking on the basis of information.
BIOMETRICS VOICE RECOGNITION. Meaning Bios : LifeMetron : Measure Bios : LifeMetron : Measure Biometrics are used to identify the input sample when compared.
Introduction to Pattern Recognition
A Seminar Report On Face Recognition Technology
BLIND AUTHENTICATION: A SECURE CRYPTO-BIOMETRIC VERIFICATION PROTOCOL
Authentication.
FACE RECOGNITION TECHNOLOGY
Biometrics Reg: AMP/HNDIT/F/F/E/2013/067.
Sharat.S.Chikkerur S.Anand Mantravadi Rajeev.K.Srinivasan
Seminar Presentation on Biometrics
Biometric technology.
Sfax University, Tunisia
Asst. Prof. Arvind Selwal, CUJ,Jammu
موضوع پروژه : تشخیص هویت بر اساس اثر انگشت
Presentation Outlines
A SEMINAR REPORT ON BIOMETRICS
Decision Making Based on Cohort Scores for
Faculty of Science IT Department Lecturer: Raz Dara MA.
Deep Factorization for Speech Signal
BY: Michael Etse and Maverick Fermill
Presentation transcript:

Study on Deep Learning in Speaker Recognition Lantian Li CSLT / RIIT Tsinghua University May 26, 2016

Biometric Recognition Speaker Recognition Deep Learning in Speaker Recognition Prospects

Biometric refers to metrics related to human characteristics [wiki]. Term "biometrics" is derived from Greek words bio (life) and metric (to measure). Biometric recognition is a kind of automated technologies for measuring and analyzing an individual's physiological or behavioral characteristics, and can be used to verify or identify an individual.

 Fingerprint  Face  Palmprint  Iris  Retina Scan  DNA  Signatures  Gait/gesture  Keystroke  Voiceprint Physiological Characteristic Behavioral Characteristic

Language Recognition What language was spoken? Accent Recognition Where is he/she from? Speech Recognition What was spoken? Gender Recognition Male or Female? Emotion Recognition Positive? Negative? Happy? Sad? Speaker Recognition Who spoke?

Speaker recognition is the identification of a person from characteristics of voices (voice biometrics). It is also called voiceprint recognition. [wiki]

Speaker Identification  Determining which identity in a specified speaker set is speaking during a given speech segment. Speaker Verification  Determining whether a claimed identity is speaking during a speech segment. It is a binary decision task. Speaker Detection  Determining whether a specified target speaker is speaking during a given speech segment. Speaker Tracking ( Speaker Diarization = Who Spoke When )  Performing speaker detection as a function of time, giving the timing index of the specified speaker.

Advantages of Speaker recognition  Speech signal more accessible  The use more acceptable by users  Remote authentication more convenient Application scenarios  Access control (e.g. voice lock)  Transaction authentication (e.g. remote payment)  Forensic analysis (e.g. police criminal detection)

Time Feature Model speech waveform spectrogrm LPC, LPCC, PLAR MFCC PLP phone information, deep spk-vector Template matching DTW, VQ, HMM GMM-UBM, GMM-SVM JFA, i-vector Deep learning Keep pace with times Work together to develop Small and clean speech data Big and practical speech data

Machine learning is a subfield of compute science that evolved from the study of pattern recognition and computational learning theory in artificial intelligence [wiki]. Deep learning is a branch of machine learning methods based on learning representations of data. Neural networks is a beautiful biologically-inspired programming paradigm which enables a computer to learn from observational data.

Lantian Li, Dong Wang, Zhiyong Zhang, Thomas Fang Zheng, “Deep Speaker Vectors for Semi Text-independent Speaker Verification”, arXiv: , 2015.

Segment pooling and dynamic time warping (DTW) Lantian Li, Yiye Lin, Zhiyong Zhang, Dong Wang, “Improved Deep Speaker Feature Learning for Text-Dependent Speaker Recognition”, APSIPA ASC 2015, pp IEEE, 2015.

Max-margin metric learning  Metric learning: to learn a projection M.  Distance metric:  Goal: to discriminate true speakers and imposters. Lantian Li, Dong Wang, Chao Xing, Thomas Fang Zheng, “Max-margin Metric Learning for Speaker Recognition”, arXiv: , 2015.

Multi-task Recurrent Model for Speech and Speaker Recognition Zhiyuan Tang +, Lantian Li +, Dong Wang, “Multi-task Recurrent Model for Speech and Speaker Recognition”, arXiv: , 2016.

Sequence model --> Deep speaker embedding Encoder-decoder + attention model

Thank you