Page 1 NOLISP, Paris, May 23rd 2007 Audio-Visual Audio-Visual Subspaces Audio Visual Reduced Audiovisual Subspace Principal Component & Linear Discriminant.

NOLISP, Paris, May 23rd 2007 Audio-Visual Audio-Visual Subspaces Audio Visual Reduced Audiovisual Subspace Principal Component & Linear Discriminant Analysis x Correlated Audio & Visual Subspaces Co-inertia & Canonical Correlation Analysis

NOLISP, Paris, May 23rd 2007 Voice conversion techniques Definition: Process of making one person’s voice « source » sounds like another person’s voice target source target Voice conversion My name is John

NOLISP, Paris, May 23rd 2007 Principle of ALISP Dictionary of representative segments Spectral analysis Prosodic analysis Selection of segmental units Segment index Prosodic parameters Input speech Concatenative synthesis HNM Output speech CODER

NOLISP, Paris, May 23rd 2007 details of Encoding speech Spectral analysis Prosodic analysis HMM Recognition Dictionary of HMM models of ALISP classes Synth unit A 1 … Synth unit A 8 HMM A Representative units of the class Selection by DTW Prosodic encoding Index of ALISP class Index of synth. unit Pitch, energy, duration

NOLISP, Paris, May 23rd 2007 Details of decoding Output speech Synth unit A 1 … Synth unit A 8 ALISP Index Synth unit index within class Prosodic parameters Loading Synth unit Concatenative synthesis

NOLISP, Paris, May 23rd 2007 Principle of Alisp conversion Learning step: one hour of target voice - Parametric analysis: MFCC - Segmentation based on temporal decompostion and vector quantization - Stochastic modelling based on HMM - Creation of representative units Conversion step - Parametric analysis: MFCC - HMM recognition - Selection of representative segment  DTW Synthesis step - Concatenation of representative - HNM synthesis

NOLISP, Paris, May 23rd 2007 Voice conversion using ALISP results Score distributionDET curve EER before forgery: 16 % (1729 impostors, 1320 clients) EER after forgery : 26 % (1729 impostors, 1320 clients)

NOLISP, Paris, May 23rd 2007 Voice conversion using ALISP results BREF databaseNIST database Source Result Target SourceTarget Result female male

Page 1 NOLISP, Paris, May 23rd 2007 Audio-Visual Audio-Visual Subspaces Audio Visual Reduced Audiovisual Subspace Principal Component & Linear Discriminant.

Similar presentations

Presentation on theme: "Page 1 NOLISP, Paris, May 23rd 2007 Audio-Visual Audio-Visual Subspaces Audio Visual Reduced Audiovisual Subspace Principal Component & Linear Discriminant."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Page 1 NOLISP, Paris, May 23rd 2007 Audio-Visual Audio-Visual Subspaces Audio Visual Reduced Audiovisual Subspace Principal Component & Linear Discriminant.

Similar presentations

Presentation on theme: "Page 1 NOLISP, Paris, May 23rd 2007 Audio-Visual Audio-Visual Subspaces Audio Visual Reduced Audiovisual Subspace Principal Component & Linear Discriminant."— Presentation transcript:

Similar presentations

About project

Feedback