Pitch Detection from Waveform and Spectrogram

Slides:



Advertisements
Similar presentations
Building an ASR using HTK CS4706
Advertisements

Supplemental Figure 1 A No. at risk T T T
Atomic Physics. 4.6 (a) Cathode rays Core Describe the production and detection of cathode rays Describe their deflection in electric fields State that.
Harmonic Series and Spectrograms 220 Hz (A3) Why do they sound different? Instrument 1 Instrument 2Sine Wave.
SOME SIMPLE MANIPULATIONS OF SOUND USING DIGITAL SIGNAL PROCESSING Richard M. Stern demo August 31, 2004 Department of Electrical and Computer.
On-line Learning with Passive-Aggressive Algorithms Joseph Keshet The Hebrew University Learning Seminar,2004.
Tools for Speech Analysis Julia Hirschberg CS4706 Thanks to Jean-Philippe Goldman, Fadi Biadsy.
4/25/2001ECE566 Philip Felber1 Speech Recognition A report of an Isolated Word experiment. By Philip Felber Illinois Institute of Technology April 25,
Application of G-Sensors Final project for extra project (will arrange a presentation in the last class)
PH 105 Dr. Cecilia Vogel Lecture 13. OUTLINE  Timbre and graphs:  Time graph  Spectrum graph  Spectrogram  Envelope  scales  units  interval factors.
SPPA 403 Speech Science1 Unit 3 outline The Vocal Tract (VT) Source-Filter Theory of Speech Production Capturing Speech Dynamics The Vowels The Diphthongs.
PH 105 Dr. Cecilia Vogel Lecture 12. OUTLINE  Timbre review  Spectrum  Fourier Synthesis  harmonics and periodicity  Fourier Analysis  Timbre and.
Human Psychoacoustics shows ‘tuning’ for frequencies of speech If a tree falls in the forest and no one is there to hear it, will it make a sound?
Parabolas Date: ____________.
Structure of Spoken Language
Mixer. Performance Parameters Gain (convolution analysis) Port-to-Port Feedthrough (Noise) (Linearity)
CS 551/651: Structure of Spoken Language Lecture 1: Visualization of the Speech Signal, Introductory Phonetics John-Paul Hosom Fall 2010.
Speech Recognition with Hidden Markov Models Winter 2011
1 CS 552/652 Hidden Markov Models for Speech Recognition Spring, 2006 Oregon Health & Science University OGI School of Science & Engineering John-Paul.
Speech Signal Processing
Harmonic Series and Spectrograms
1 Phonetics and Phonemics. 2 Phonetics and Phonemics : Phonetics The principle goal of Phonetics is to provide an exact description of every known speech.
Daniel May Department of Electrical and Computer Engineering Mississippi State University Analysis of Correlation Dimension Across Phones.
Quantitative and qualitative differences in understanding sentences interrupted with noise by young normal-hearing and elderly hearing-impaired listeners.
Introduction to the development of 2D Windows 8 games using Direct X and C++ Markus Jost CEO, Lead Programmer, Codebox GmbH
Why does a violin sound different from a horn? Several kinds of audible information Pitch Timbre Attack Decay Vibrato.
Complex Auditory Stimuli
Audio processing methods on marine mammal vocalizations Xanadu Halkias Laboratory for the Recognition and Organization of Speech and Audio
Statistical NLP Spring 2011
Acoustics Research Institute S_TOOLS -STx Supplement Speech Formant Tracking and Fundamental Frequency Extraction Default Parameter Setting.
Bootstrap TDNN for Classification of Voiced Stop Consonants (B,D,G) CAIP, Rutgers University Oct.13, 2006 Jun Hou, Lawrence Rabiner and Sorin Dusan.
Experimentation Duration is the most significant feature with around 40% correlation. Experimentation Duration is the most significant feature with around.
FFF FFF i v e o r m s o f a c t o r i n g 1.Greatest Common Factor (GCF) Ex 1 10x 2 y 3 z - 8x 4 y 2 2x 2 y 2 (5yz - 4x 2 ) Ex 2 15a 2 b 5 + 5ab 2 -
CS : Speech, NLP and the Web/Topics in AI Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture-25: Vowels cntd and a “grand” assignment.
ASSESSING SEARCH TERM STRENGTH IN SPOKEN TERM DETECTION Amir Harati and Joseph Picone Institute for Signal and Information Processing, Temple University.
EOC Practice #17 SPI EOC Practice #17 Determine the equation of a line and/or graph a linear equation.
CSE 551/651: Structure of Spoken Language Lecture 1: Visualization of the Speech Signal, Introductory Phonetics John-Paul Hosom Fall 2005.
Statistical NLP Spring 2010
The Doppler Effect THE LAST LECTURE.
Structure of Spoken Language
Time-Frequency Spectrum
Structure of Spoken Language
Structure of Spoken Language
Structure of Spoken Language
© 2016 Pearson Education, Inc., Hoboken, NJ. All rights reserved.
Speed Formula - Waves.
Speech Processing August 10, /10/2018.
Jennifer J. Venditti Postdoctoral Research Associate
2 Terms 3 Terms 4 Terms Always look for a GCF! Always look for a GCF!
Rasterizing Lines 2 Lecture 33 Wed, Nov 14, 2007.
Figure Hz sine wave to be sampled.
Component: E Local quakes Broad frequency Two tremor bursts
Homework Questions.
Spoken Language Processing:Summing Up
Phonetics and Phonemics
Associative Property of addition Commutative Property of addition
Software Engineering Lecture #7
Sound shadow effect Depends on the size of the obstructing object and the wavelength of the sound. If comparable: Then sound shadow occurs. I:\users\mnshriv\3032.
CS 188: Artificial Intelligence Spring 2006
C.2.10 Sample Questions.
C.2.8 Sample Questions.
C.2.8 Sample Questions.
Midshipman grunt trains.
Time series of diving depth, the three axes of head acceleration (heave, surge and sway), spectrogram of the three axes of head acceleration, body acceleration.
Sample traces of Tth, abdominal pumping rate, raw chamber pressure and at different PO2. Sample traces of Tth, abdominal pumping rate, raw chamber pressure.
3-D Graphing.
Phonetics and Phonemics
A pulse species.
D calls recorded on the tag hydrophone and tri-axial accelerometer.
Logarithms.
Presentation transcript:

Pitch Detection from Waveform and Spectrogram Lecture 3 Supplement Pitch Detection from Waveform and Spectrogram

Period=360/3 = 120 samples; Pitch=fs/period =16000/120 = 133 Hz Period=225/3 = 75 samples; Pitch=fs/period =10000/75 = 133 Hz Period=195/3 = 65 samples; Pitch=fs/period = 8000/65 = 129 Hz Period=82/2 = 41 samples; Pitch=fs/period = 6000/41 = 146 Hz

Period=365/7 = 52 samples; Pitch=fs/period =16000/52 = 308 Hz Period=275/8 = 34.4 samples; Pitch=fs/period =10000/34.4 = 291 Hz Period=205/7 = 29.3 samples; Pitch=fs/period =8000/29.3 = 273 Hz Period=141/7 = 20.1 samples; Pitch=fs/period =6000/20.1 = 298 Hz

AY N OY S L AY JH EH IH-M P F DH-AX AX Pitch=100 Hz Pitch=125 Hz