Pitch and Amplitude Perturbation (Jitter and Shimmer)


Basic idea: Phonated speech is called quasiperiodic, with quasi being Latin for “sort of” or “more-or-less.” Voiced speech is never perfectly periodic. Random cycle-to-cycle variability in f0 (or, equivalently, in t0, the fundamental period) is called pitch perturbation or jitter. (It is also, and more properly, known as fundamental frequency perturbation, although that term is seldom used.) Random cycle-to-cycle variability in the amplitude of glottal pulses is called amplitude perturbation or shimmer.

The jitter concept is quite simple. This signal looks perfectly periodic, but it isn’t. There are small, more-or-less random differences in the fundamental period from one cycle to the next.

How Jitter Is Measured: Mean Jitter. Mean jitter = (sum of the absolute differences between adjacent periods) / (number of differences) = 0.3 ms / 3 = 0.1 ms. In English: MeanJ = SumOfAbsDiffs / ndiffs
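The mean jitter calculation above can be sketched in a few lines of Python. This is a minimal illustration, not the method of any particular analysis package. The function name `mean_jitter` and the four period values are hypothetical; the periods were chosen only so that the three absolute differences sum to 0.3 ms, as in the slide, and are not the slide's own measurements.

```python
def mean_jitter(periods):
    """Mean absolute difference between adjacent periods.

    The result is in the same units as the input (ms here).
    """
    if len(periods) < 2:
        raise ValueError("need at least two periods to form a difference")
    # One absolute difference per pair of adjacent periods
    abs_diffs = [abs(b - a) for a, b in zip(periods, periods[1:])]
    return sum(abs_diffs) / len(abs_diffs)


# Hypothetical fundamental periods in ms (assumed values, not from the slide)
periods_ms = [7.95, 8.05, 7.95, 8.05]
print(f"mean jitter = {mean_jitter(periods_ms):.3f} ms")
```

Note that four periods give three differences, which is why the divisor is 3 rather than 4.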

Percent Jitter: It is far more common to represent jitter as a percentage of the average period: Percent Jitter = MeanJ / MeanPeriod x 100. Our example from the previous slide: MeanJ = 0.1 ms; MeanPeriod = 8 ms. Percent Jitter = 0.1 / 8 x 100 = 0.0125 x 100 = 1.25%
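As a rough sketch, percent jitter is the same calculation with one extra division by the mean period. The function name `percent_jitter` and the period values are assumptions made for illustration; the values were picked so that the mean jitter is 0.1 ms and the mean period is 8 ms, matching the worked example.

```python
def percent_jitter(periods):
    """Mean jitter expressed as a percentage of the mean period."""
    if len(periods) < 2:
        raise ValueError("need at least two periods to form a difference")
    abs_diffs = [abs(b - a) for a, b in zip(periods, periods[1:])]
    mean_j = sum(abs_diffs) / len(abs_diffs)   # MeanJ
    mean_period = sum(periods) / len(periods)  # MeanPeriod
    return 100.0 * mean_j / mean_period


# Hypothetical periods in ms: MeanJ is about 0.1 ms, MeanPeriod is about 8 ms
periods_ms = [7.95, 8.05, 7.95, 8.05]
print(f"percent jitter = {percent_jitter(periods_ms):.2f}%")
```

Because the result is a ratio, it does not depend on the time unit, so periods in seconds or in samples would give the same percentage.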

What We Know About Jitter
1. Jitter values in non-dysphonic voices are quite small, about 0.5%.
2. There is a good deal of research to show that jitter values in dysphonic voices are significantly larger.
3. Jitter is thought to be associated with a sensation of roughness in the voice.
4. For that reason, it has been proposed as an objective correlate of either roughness or overall dysphonia.

Shimmer: More-or-less random cycle-to-cycle variation in voice amplitude (vocal intensity) rather than in f0.

What We Know About Shimmer
1. Shimmer values in non-dysphonic voices are quite small, usually less than about 0.7 dB.
2. As with jitter, shimmer values in dysphonic voices are significantly larger.
3. Shimmer is thought by some to be associated with a sensation of roughness in the voice, but this is not settled. You’ll be able to decide this on your own in a few minutes.

Synthetic Continuum Varying in Jitter: 0.0%, 0.2%, 0.4%, 0.6%, 0.8%, 1.0%, 1.5%, 2.0%, 2.5%, 3.0%, 4.0%, 5.0%, 6.0%

Shimmer calculation: There are calculations for shimmer that are analogous to the ones we saw earlier for jitter – the average absolute difference in amplitude between adjacent periods. (There is no percent shimmer, for reasons we won’t worry about.)
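One way to sketch the shimmer calculation is shown below. The slide does not spell out the formula, so this uses a common convention as an assumption: each adjacent pair of peak amplitudes is turned into a level difference of 20·log10(A[i+1] / A[i]) dB, and the absolute values are averaged. The function name `shimmer_db` and the amplitude values are hypothetical.

```python
import math


def shimmer_db(amplitudes):
    """Mean absolute level difference (dB) between adjacent cycles.

    Assumes linear, strictly positive peak amplitudes, one per glottal cycle.
    """
    if len(amplitudes) < 2:
        raise ValueError("need at least two amplitudes to form a difference")
    if any(a <= 0 for a in amplitudes):
        raise ValueError("amplitudes must be positive")
    # Level difference in dB for each adjacent pair of cycles
    diffs_db = [abs(20.0 * math.log10(b / a))
                for a, b in zip(amplitudes, amplitudes[1:])]
    return sum(diffs_db) / len(diffs_db)


# Hypothetical peak amplitudes in arbitrary linear units (assumed values)
amps = [1.00, 1.05, 0.98, 1.03, 1.00]
print(f"shimmer = {shimmer_db(amps):.2f} dB")
```

Working with amplitude ratios means the result is already relative to the overall level, so scaling every amplitude by the same factor leaves it unchanged.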

Synthetic Continuum Varying in Shimmer: 0.00 dB, 0.20 dB, 0.40 dB, 0.60 dB, 0.80 dB, 1.00 dB, 1.20 dB, 1.40 dB, 1.60 dB, 1.80 dB, 2.00 dB, 2.25 dB, 2.50 dB, 2.75 dB, 3.00 dB

Jitter Versus Shimmer: Do pitch perturbation and amplitude perturbation produce the same kinds of sound qualities? Compare the jitter (6%) and shimmer (3 dB) examples. Judge for yourself, but these sound quite different to my ear. Jitter continuum: clear to very rough. Shimmer continuum: clear to unnaturally crackly, somewhat like static.

One More Problem: Measurement Accuracy. Jitter and shimmer measurements require that the starting and ending times of every pitch pulse be measured with a lot of precision. This is way too time-consuming to do by hand, so a computer algorithm is needed. Measuring t0 is not too hard for highly periodic voices. But what voices are the most interesting? Highly periodic voices, or dysphonic voices that show imperfect periodicity?

This is the problem: The voices that are most interesting are exactly the ones that are the hardest to measure. Also, the quantities being measured (e.g., jitter) are quite small – only a few percent even in dysphonic voices – so even a few errors can make a big difference. There are commercial systems available for measuring perturbation. They’re well designed, but they will sometimes make mistakes. This doesn’t mean that you shouldn’t use them, but when the computer gives you a measurement that doesn’t agree with what your ear tells you, your ear could easily be right.
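To see why even a few tracking errors matter, here is a small hypothetical simulation. It builds 100 periods that alternate between 7.98 ms and 8.02 ms (about 0.5% jitter), then imitates a single missed pitch pulse by merging two adjacent periods into one that is twice as long. All names and values are assumptions for illustration, not measurements from a real voice or the behavior of any commercial system.

```python
def percent_jitter(periods):
    """Mean absolute adjacent-period difference as a percentage of the mean period."""
    if len(periods) < 2:
        raise ValueError("need at least two periods to form a difference")
    abs_diffs = [abs(b - a) for a, b in zip(periods, periods[1:])]
    mean_j = sum(abs_diffs) / len(abs_diffs)
    mean_period = sum(periods) / len(periods)
    return 100.0 * mean_j / mean_period


# Hypothetical "true" periods in ms: a low-jitter, non-dysphonic-like voice
clean = [7.98 if i % 2 == 0 else 8.02 for i in range(100)]

# One tracking error: the pulse between periods 50 and 51 is missed,
# so the two periods are reported as a single double-length period
with_error = clean[:50] + [clean[50] + clean[51]] + clean[52:]

print(f"no errors:        {percent_jitter(clean):.2f}%")
print(f"one missed pulse: {percent_jitter(with_error):.2f}%")
```

In this sketch a single error in 100 cycles makes the reported jitter several times larger than the true value, which is enough to move a normal-looking voice into the range reported for dysphonic voices.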