Pitch and Amplitude Perturbation (Jitter and Shimmer)

Slides:



Advertisements
Similar presentations
Elasticity   Elasticity measures the degree of one variable’s dependence on another variable, or the “sensitivity” of one variable to a change in another.
Advertisements

Digital Signal Processing
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
Voice Quality October 14, 2014 Practicalities Course Project report #2 is due! Also: I have new guidelines to hand out. The mid-term is on Tuesday after.
1 Hypothesis Testing Chapter 8 of Howell How do we know when we can generalize our research findings? External validity must be good must have statistical.
Randomized Algorithms Randomized Algorithms CS648 Lecture 3 Two fundamental problems Balls into bins Randomized Quick Sort Random Variable and Expected.
Fundamental Frequency & Jitter Lab 2. Fundamental Frequency Pitch is the perceptual correlate of F 0 Perception is not equivalent to measurement: –Pitch=
Assessment of Vocal Noise via Bi-directional Long-term Linear Prediction of Running Speech F. Bettens *, F. Grenez *, J. Schoentgen *,** * Université Libre.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Anatomy of the vocal mechanism
Automatic Lip- Synchronization Using Linear Prediction of Speech Christopher Kohnert SK Semwal University of Colorado, Colorado Springs.
Copyright © 2010 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 5): Outliers Fall, 2008.
Laryngeal Physiology.
Instrumental Assessment SPPA 6400 Voice Disorders: Tasko.
Theory testing Part of what differentiates science from non-science is the process of theory testing. When a theory has been articulated carefully, it.
Voice Quality Feburary 11, 2013 Practicalities Course project reports to hand in! And the next set of guidelines to hand out… Also: the mid-term is on.
Automatic Pitch Tracking September 18, 2014 The Digitization of Pitch The blue line represents the fundamental frequency (F0) of the speaker’s voice.
Automatic Pitch Tracking January 16, 2013 The Plan for Today One announcement: Starting on Monday of next week, we’ll meet in Craigie Hall D 428 We’ll.
storing data in k-space what the Fourier transform does spatial encoding k-space examples we will review:  How K-Space Works This is covered in the What.
 Consider the following "experiment" where you construct a catapult which launches a dart at a target.  The object is to hit the bulls eye. Topic 1.2.
Basic Statistics Concepts Marketing Logistics. Basic Statistics Concepts Including: histograms, means, normal distributions, standard deviations.
Copyright © 2010 Pearson Education, Inc. Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Male Cheerleaders and their Voices. Background Information: What Vocal Folds Look Like.
Accuracy Precision % Error. Variable is a factor that affects the outcome of an experiment. 3 Types of variables Experimental/ Independent Variable The.
Copyright © 2009 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
Views Lesson 7.
Pitch Tracking + Prosody January 17, 2012 The Plan for Today One announcement: On Thursday, we’ll meet in the Craigie Hall D 428 We’ll be working on.
SH 565- Instrumentation in Communicative Disorders Spring ‘02.
Uncertainty and Error in Measurement (IB text - Ch 11) (If reviewing this slide in the senior year, there is also uncertainty information in the AP text.
Developing a Research Question
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 18 Sampling Distribution Models.
Sampling Distribution Models Chapter 18. Toss a penny 20 times and record the number of heads. Calculate the proportion of heads & mark it on the dot.
Pitch Determination by Wavelet Transformation Santhosh Bellikoth ECE Speech Processing Instructor: Dr Kepuska.
Feature Engineering Studio September 30, Quick Note Please me for appointments rather than just showing up at my office – I’m always glad.
1 Psych 5500/6500 Measures of Variability Fall, 2008.
Error, Accuracy, Deviation, and Precision in Lab data.
By: Maisha Loveday 8C Maths Reflection: Binomial Expansion.
Listen and learn!. * “READ THE BOOKS. I don't understand why some kids think they can take a test on a book they have never read. That is actually crazy,
Plenary 1. What’s important about the Math we Teach? A Focus on Big Ideas Marian Small
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
Tests of Significance We use test to determine whether a “prediction” is “true” or “false”. More precisely, a test of significance gets at the question.
Introduction Sample surveys involve chance error. Here we will study how to find the likely size of the chance error in a percentage, for simple random.
Demand Management and Forecasting Chapter 11 Portions Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Here is a kind of question that you can get on Verbal Reasoning. They might give you three groups of numbers like this: (4 [6] 2) (3 [7] 4) (5 [12] ?)
 The earlier you use Cued Speech the better. A baby or young deaf child will learn by imitation as hearing babies do – they are not specifically taught.
HOW WE TRANSMIT SOUNDS? Media and communication 김경은 김다솜 고우.
Pitch and Amplitude Perturbation (Jitter and Shimmer) Basic idea: Phonated speech is called quasiperiodic, with quasi being Latin for “sort of” or “more-or-less.
Making yourself understood is not all about accent.
Correlation and Regression
Question So, I’ve done my factor analysis.
Changes in Vocal Intensity
Problems With Assistance Module 1 – Problem 2
Part III – Gathering Data
Multimedia Systems and Applications
ANOTHER way to integrate (oh joy!!)
Extemp – Your first Tournament
Lesson 1: Summarizing and Interpreting Data
ACT English Test - Economy
“last minute” strategies
Uncertainty and Error
Problems With Assistance Module 1 – Problem 2
Introduction In today’s lesson we will look at: why Python?
Summary (Week 1) Categorical vs. Quantitative Variables
Summary (Week 1) Categorical vs. Quantitative Variables
Sampling.
Problems With Assistance Module 1 – Problem 2
CHAPTER 16: Inference in Practice
Changes in Vocal Intensity
Presentation transcript:

Pitch and Amplitude Perturbation (Jitter and Shimmer) Basic idea: Phonated speech is called quasiperiodic, with quasi being Latin for “sort of” or “more-or-less. Naturally spoken voiced speech is never perfectly periodic. Random cycle-to-cycle variability in f0 (or, equivalently, t0 – the fundamental period) is called pitch perturbation or jitter. Random cycle-to-cycle variability in the amplitude of glottal pulses is called amplitude perturbation or shimmer.

The jitter concept is quite simple The jitter concept is quite simple. This signal might look perfectly periodic, but it isn’t. There are small, more-or-less random differences in the fund. period from one cycle to the next.

How Jitter is Measured: Mean Jitter Mean Jitter = sum of (abs) period diffs / number of diffs = 0.2 + 0.1 + 0.0 / 3 = 0.3 / 3 = 0.1 ms In English: MeanJ = SumOfAbsDiffs / ndiffs

Percent Jitter = MeanJ/MeanPeriod x 100 It is far more common to represent jitter as a percentage of the average period: Percent Jitter = MeanJ/MeanPeriod x 100 Our example from the previous slide: MeanJ = 0.1 ms; MeanPeriod = 8 ms Percent Jitter = 0.1/8 * 100 = 0.125/8.0 x 100 = 1.25%

What We Know About Jitter Jitter values in non-dysphonic voices are quite small – about 0.5%. There is a good deal of research to show that jitter values in dysphonic voices are significantly larger. Jitter is thought to be associated with a sensation of roughness in the voice. For that reason, it has been proposed as an objective correlate of either roughness or overall dysphonia.

Shimmer: More-or-less random cycle-to-cycle variation in voice amplitude (vocal intensity) rather than f0.

What We Know About Shimmer Shimmer values in non-dysphonic voices are quite small – usually less ~0.7 dB. As with jitter, shimmer values in dysphonic voices are significantly larger. Shimmer is thought by some to be associated with a sensation of roughness in the voice – but this is not settled. You’ll be able to decide this on your own in a few minutes.

Synthetic Continuum Varying in Jitter 0.0% 2.0% 0.2% 2.5% 0.4% 3.0% 0.6% 4.0% 0.8% 5.0% 1.0% 6.0% 1.5%

Shimmer calculation: There are calculations for shimmer that are analogous to the ones we saw earlier for jitter – the average absolute difference in amplitude between adjacent periods. (There is no percent shimmer, for reasons we won’t worry about.)

Synthetic Continuum Varying in Shimmer 0.00 dB 1.60 dB 0.20 dB 1.80 dB 0.40 dB 2.00 dB 0.60 dB 2.25 dB 0.80 dB 2.50 dB 1.00 dB 2.75 dB 1.20 dB 3.00 dB 1.40 dB

Jitter Versus Shimmer Do pitch perturbation and amplitude perturbation produce the same kinds of sound qualities? Jitter (6%) Shimmer (3 dB) Judge for yourself, but these sound quite different to my ear. Jitter continuum: Clear to very rough, Shimmer continuum: Clear to unnaturally crackly, somewhat like static.

One More Problem: Measurement Accuracy Jitter and shimmer measurements require that the starting and ending times of every pitch pulse be measured with a lot of precision. This is waaaay too time consuming to do by hand, so a computer algorithm is needed. Measuring the t0 is not too hard for highly periodic voices. But what voices are the most interesting? Highly periodic voices or dysphonic voices that show imperfect periodicity?

This is the problem: The voices that are most interesting are exactly the ones that are the hardest to measure. Also, the quantities being measured (e.g., jitter) are quite small – only a few percent even in dysphonic voices – so even a few errors can make a big difference. There are commercial systems available for measuring perturbation. They’re well designed, but they will sometimes make mistakes. This doesn’t mean that you shouldn’t use them, but when the computer gives you a measurement that doesn’t agree with what your ear tells you, your ear could easily be right.