Speech Analysis TA: Chuan-Hsun Wu


Speech Analysis (HW 2-2)
  TA: Chuan-Hsun Wu

Goal
  In this homework you analyze speech from its spectrogram and try to distinguish different initials/finals on the spectrogram.

Before you start, you should know…
  …about the initials/finals (聲母/韻母) in Mandarin Chinese.
  …about Right-Context-Dependent Initial Final (RCDIF) labels.
    Ex: t_a stands for ㄊ followed by a final starting with an ㄚ-like sound, e.g. ㄊㄞ = t_a ai
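The RCDIF naming convention can be illustrated with a small Python sketch. It assumes, based only on the t_a example above, that an underscore separates the initial from its right context and that labels without an underscore (finals, "sil", "sp") carry no context. The function name split_rcdif is hypothetical; the real label inventory is the one in the syllable table.

```python
def split_rcdif(label):
    """Split an RCDIF label such as "t_a" into (initial, right_context).

    Assumption: the underscore separates the initial from the first
    sound of the following final. Labels without an underscore
    (finals, "sil", "sp") are returned with right_context = None.
    """
    if "_" not in label:
        return label, None
    initial, right_context = label.split("_", 1)
    return initial, right_context


# ㄊㄞ is labeled as the two units "t_a" and "ai".
for unit in ["t_a", "ai", "sil"]:
    print(unit, "->", split_rcdif(unit))
```

The right context is kept separate so that labels of the same initial (for example all units starting with "t") can be grouped when you collect screenshots.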

Before you start, you should know…
  …about some classes of consonants:
    Plosive/Stop (爆破音/塞音): ㄅㄆㄉㄊㄍㄎ
    Fricative (擦音): ㄈㄏㄒㄕㄙ
    Affricate (塞擦音): ㄐㄑㄓㄔㄗㄘ
    Nasal (鼻音): ㄇㄋ
  …about some classes of vowels:
    Monophthong (單母音): 一ㄨㄩㄚㄛㄜㄦ
    Diphthong (雙母音): ㄞㄟㄠㄡ……
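For quick reference while labeling, the classes listed on this slide can be put into a lookup table. This is only a sketch: the dictionary copies the symbols exactly as they appear above, the diphthong list is marked as incomplete on the slide, and the names PHONETIC_CLASSES and classify are made up for illustration.

```python
# Symbols copied from the slide; the diphthong entry is incomplete
# there ("……"), so this table is not exhaustive.
PHONETIC_CLASSES = {
    "plosive": "ㄅㄆㄉㄊㄍㄎ",
    "fricative": "ㄈㄏㄒㄕㄙ",
    "affricate": "ㄐㄑㄓㄔㄗㄘ",
    "nasal": "ㄇㄋ",
    "monophthong": "一ㄨㄩㄚㄛㄜㄦ",
    "diphthong": "ㄞㄟㄠㄡ",
}


def classify(symbol):
    """Return the class name for a symbol, or None if it is not listed."""
    for class_name, symbols in PHONETIC_CLASSES.items():
        if symbol in symbols:
            return class_name
    return None


print(classify("ㄅ"))  # a plosive according to the slide
print(classify("ㄞ"))  # a diphthong according to the slide
```

Symbols that the slide does not list return None, so the phonetic class table from the course website remains the authoritative source.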

Before you start, you should know…
  …how labeling works:
    “sil” marks silence
    “sp” marks a short pause
    fricative/affricate initials do not contain voiced parts
    plosive initials contain the closure or aspiration period

Some files you need
  Phonetic class table: http://speech.ee.ntu.edu.tw/courses/DSP2013Autumn/homework/DSP_HW2-2/phonetic_class.pdf
  Syllable table: http://speech.ee.ntu.edu.tw/courses/DSP2013Autumn/homework/DSP_HW2-2/syllable.txt
  Data: http://speech.ee.ntu.edu.tw/courses/DSP2013Autumn/homework/DSP_HW2-2/
  Download NTU__xyyyy.zip according to your student ID from the data webpage above.

Tools for labeling
  Wavesurfer
  MATLAB
  Praat (we will use this one for HW 2-2)

Praat Download http://www.fon.hum.uva.nl/praat/

Praat

Praat Read from file (.wav file from data)

Praat View & Edit

Praat

Pitch

Intensity

Formant

Reminder
  Intensity is the total power over all frequency components. Two acoustic signals may contain the same amount of power but in quite different frequency components.
  A formant is an acoustic resonance, measured as a peak in the frequency spectrum.
  You should not trust the formant detection output for unvoiced initials.
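The first point of the reminder can be checked numerically. The sketch below uses only the Python standard library and builds two pure tones; the frequencies (200 Hz and 3000 Hz), the 16 kHz sample rate, and the helper names are assumptions chosen for illustration, not values from the homework data. Both tones have essentially the same mean power even though their energy sits at very different frequencies.

```python
import math


def sine(freq_hz, duration_s=1.0, sample_rate=16000, amplitude=1.0):
    """Generate a sine tone as a list of samples."""
    n = int(duration_s * sample_rate)
    return [
        amplitude * math.sin(2 * math.pi * freq_hz * i / sample_rate)
        for i in range(n)
    ]


def mean_power(samples):
    """Mean squared amplitude of the signal."""
    return sum(x * x for x in samples) / len(samples)


low_tone = sine(200)    # energy concentrated at a low frequency
high_tone = sine(3000)  # energy concentrated at a high frequency

# Both values are close to 0.5 (amplitude squared divided by 2).
print(mean_power(low_tone), mean_power(high_tone))
```

This is why an intensity curve alone cannot separate, say, a fricative from a vowel of similar loudness; the spectrogram is needed to see where the energy lies.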

Praat Annotate to TextGrid

Praat
  Create one interval tier named RCDIF
  No point tiers

Praat View & Edit with both objects selected

Praat

Labeling
  Click on the spectrogram where you want a boundary
  Add the boundary by clicking the small circle
  Remove a boundary by choosing “Boundary/Remove”
  Drag your boundaries to make them more accurate
  Click between two boundaries and type in your label (according to the “Syllable table”)
  Listen to a labeled interval by clicking the number (interval time) below it

Praat

Praat
  After you finish labeling, save your TextGrid object as a short text file
  The file should be “.TextGrid”, not “.Collection”
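To double-check a saved file outside Praat, a short-format TextGrid can be read with a few lines of Python. The sketch below assumes the usual layout of Praat's short text format (two header lines, overall start and end time, `<exists>`, the tier count, then class, name, times, interval count and the intervals of each tier). The sample content, the times in it, and the function names are made up for illustration. It handles interval tiers only and does not deal with text encoding, which depends on Praat's text writing settings.

```python
SAMPLE = '''File type = "ooTextFile"
Object class = "TextGrid"

0
2.3
<exists>
1
"IntervalTier"
"RCDIF"
0
2.3
3
0
0.8
"sil"
0.8
1.0
"t_a"
1.0
2.3
"ai"
'''


def unquote(field):
    """Strip the surrounding quotes; Praat doubles quotes inside strings."""
    return field[1:-1].replace('""', '"')


def parse_short_textgrid(text):
    """Return {tier_name: [(start, end, label), ...]} for interval tiers."""
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    if lines[:2] != ['File type = "ooTextFile"', 'Object class = "TextGrid"']:
        raise ValueError("not a TextGrid text file")
    pos = 4  # skip the two header lines and the overall start/end times
    if lines[pos] != "<exists>":
        raise ValueError("unexpected layout (not the short text format?)")
    n_tiers = int(lines[pos + 1])
    pos += 2
    tiers = {}
    for _ in range(n_tiers):
        tier_class, name = unquote(lines[pos]), unquote(lines[pos + 1])
        if tier_class != "IntervalTier":
            raise ValueError("only interval tiers are handled in this sketch")
        n_intervals = int(lines[pos + 4])  # after class, name, start, end
        pos += 5
        intervals = []
        for _ in range(n_intervals):
            start, end = float(lines[pos]), float(lines[pos + 1])
            intervals.append((start, end, unquote(lines[pos + 2])))
            pos += 3
        tiers[name] = intervals
    return tiers


for start, end, label in parse_short_textgrid(SAMPLE)["RCDIF"]:
    print(f"{start:.2f}-{end:.2f}  {label}")
```

A file saved as “.Collection” starts with a different object class line, so this parser rejects it with a ValueError, which is a quick way to catch that mistake before submitting.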

Part 1 (20%)
  Download NTU__xyyyy.zip according to your student ID at http://speech.ee.ntu.edu.tw/courses/DSP2013Autumn/homework/DSP_HW2-2/.
  You must submit at least 5 fully labeled TextGrid files (along with their wave files).
  These 5 files should contain the initial/final labels you use in Part 2.

Part 2 (30%)
  Choose at least 2 initials from each of the 4 classes (Plosive, Fricative, Affricate, Nasal).
  For each of these 8 initials, create a table that contains at least 2 screenshots of its label (please show intensity and formant).
  An example of a table for the plosives ㄅ & ㄆ is on the next two pages.

Part 2 (30%)
  Example table. Phonetic class: Plosive, initial: b (ㄅ)

Part 2 (30%)
  Example table. Phonetic class: Plosive, initial: p (ㄆ)

Useful tips
  In Praat, zoom in and out, or show the whole sound or only the selected part, by clicking the buttons at the lower-left corner of the spectrogram.
  In your chosen directory:
    “NTU_XXXXX_phn2file” lists all files containing each phone
    “NTU_XXXXX_file2phn” lists all phones contained in each file

Part 3 (50%)
  (20%) What consistent spectrogram patterns do you observe within each phonetic class (Plosive, Fricative, Affricate, Nasal)?
  (10%) Is the boundary between a neighboring initial and final clear? What is the benefit of using a “right-context dependent” initial model (ex: sh_a) instead of a pure initial model (ex: sh) to model initials?

Part 3 (50%)
  (10%) How does the pronunciation of ㄅ differ from that of ㄆ? How can you tell ㄅ and ㄆ apart on the spectrogram? (You may also want to compare ㄉ with ㄊ and ㄍ with ㄎ.)
  (10%) Take a look at the spectrograms of finals. Are there any simple rules for discriminating initials from finals given only the spectrogram?

Bonus (10%) The following is a speech analysis plot for a Chinese word composed of 3 characters. Each character is composed of an initial and a final. Guess what the word is and describe your reasoning. (Score: reasoning 8%, correct answer 2%) If you cannot figure out the word, you can guess the phonetic class or initial/finals. For example, your answer can be “l_i, i, sic_a, au” or “plosive, diphthong, plosive, monophthong”.

Hint: it’s a Taiwanese politician.

Submission Requirements
  5 TextGrid files (each along with its wave file)
    the “.TextGrid” and “.wav” filenames should be the same
  1 report (in PDF format)
    the filename should be hw2-2_bXXXXXXXX.pdf (your student ID)
  Compress the above 11 files into 1 zip file and upload it to ceiba
  20% of the final score will be taken off for each day of late submission
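The file requirements above lend themselves to a quick self-check before zipping. The following Python sketch works on a plain list of file names; the function name check_submission and the example names NTU_0001 to NTU_0005 are hypothetical, and the report check only looks at the "hw2-2_" prefix and ".pdf" suffix because the exact student ID format is not spelled out here.

```python
from pathlib import Path


def check_submission(filenames):
    """Return a list of problems found in a list of submission file names."""
    paths = [Path(name) for name in filenames]
    grids = {p.stem for p in paths if p.suffix == ".TextGrid"}
    waves = {p.stem for p in paths if p.suffix == ".wav"}
    reports = [p.name for p in paths if p.suffix == ".pdf"]

    problems = []
    if len(grids) < 5:
        problems.append(f"only {len(grids)} TextGrid file(s), 5 required")
    for stem in sorted(grids - waves):
        problems.append(f"{stem}.TextGrid has no matching .wav file")
    for stem in sorted(waves - grids):
        problems.append(f"{stem}.wav has no matching .TextGrid file")
    if len(reports) != 1 or not reports[0].startswith("hw2-2_"):
        problems.append("expected exactly one report named hw2-2_<student ID>.pdf")
    return problems


# Hypothetical file names, for illustration only.
files = [f"NTU_000{i}.{ext}" for i in range(1, 6) for ext in ("TextGrid", "wav")]
files.append("hw2-2_bXXXXXXXX.pdf")
print(check_submission(files))       # no problems expected for this list
print(check_submission(files[:-2]))  # drops one .wav file and the report
```

An empty list means the 11 files look consistent; the zip file itself can then be created with any archiver.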

If you have any problem…
  …look up the Praat introduction website. It should solve your technical problems. http://www.fon.hum.uva.nl/praat/manual/Intro.html
  …contact the TA by email: 吳全勳 (Chuan-Hsun Wu) r02922002@ntu.edu.tw

Homework #2
  You can submit either of the following:
    HW 2-1 (HMM Training and Testing)
    HW 2-2 (Speech Analysis)
  You can also submit both; the higher grade of the two will count as your final score for HW2.
  Deadline: To be discussed

Thank You Questions?