Tools for Speech Analysis Julia Hirschberg CS4995/6998 Thanks to Jean-Philippe Goldman, Fadi Biadsy.

Slides:



Advertisements
Similar presentations
Support.ebsco.com Using the Search History Feature Tutorial.
Advertisements

Acoustic/Prosodic Features
ZoomText, Step-by-Step IU Adaptive Technology Center.
Voiceprint System Development Design, implement, test unique voiceprint biometric system Research Day Presentation, May 3 rd 2013 Rahul Raj (Team Lead),
Synthesizing naturally produced tokens Melissa Baese-Berk SoundLab 12 April 2009.
CPIT 102 CPIT 102 CHAPTER 1 COLLABORATING on DOCUMENTS.
Excel Tutorial 6 Managing Multiple Worksheets and Workbooks
Tools for Speech Analysis Julia Hirschberg CS4995/6998 Thanks to Jean-Philippe Goldman, Fadi Biadsy.
Basic Spectrogram Lab 8. Spectrograms §Spectrograph: Produces visible patterns of acoustic energy called spectrograms §Spectrographic Analysis: l Acoustic.
Xkl: A Tool For Speech Analysis Eric Truslow Adviser: Helen Hanson.
Created by Amanda Shultz About Section 1 Section 2 Section 3 Links.
AN INTRODUCTION TO PRAAT Tina John M.A. Institute of Phonetics and digital Speech Processing - University Kiel Institute of Phonetics and Speech Processing.
Tools for Speech Analysis Julia Hirschberg CS4706 Thanks to Jean-Philippe Goldman, Fadi Biadsy.
Looking at Spectrogram in Praat cs4706, Jan 30 Fadi Biadsy.
Tools for Speech Analysis 2 How do we choose? What kind of data? Which task?
1 of 5 This document is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS DOCUMENT. © 2007 Microsoft Corporation.
1 Computing for Todays Lecture 22 Yumei Huo Fall 2006.
Praat Fadi Biadsy.
Speech tools Jean-Philippe Goldman Two questions What kind of data ? Which task ?
Phonetics October 1-3, 2008 Phonetics 1.Experimental Phonetics a. Production b. Perception 2. Surveys/Interviews and Phonetics.
Create a Narrated Story with PowerPoint. Basics Enter Text. (Click in the text box and start typing. If a text box is not visible, go to Insert > Text.
Basic and advanced Praat Scripting
Lecture 6 Desktop Publishing III – Presentation Software Introduction to Information Technology With thanks to Dr. A. Zhang, Dr. Haipeng Guo, and Dr. David.
1 FAQ on video editing. 2 1.Is it possible if I look for some video clips (e.g. firework, speech of Obama) from other sources?  Yes, but you need to.
COMPREHENSIVE Excel Tutorial 8 Developing an Excel Application.
Lesson 17 Enhancing Presentations with Multimedia Effects
David Halpin Carilion Clinic Carrie Cao Virginia Western Community College.
Administriva l Questions? l Next Thursday – Cinematography from Lonny – Critiques of hw1 films CS 5964 L03- 1.
Tutorial 5 Making a Document Interactive. XP Objectives Explore the different button states Add a button from the Button library Create a button Learn.
Tutorial 5 Making a Document Interactive. XP Objectives Explore the different button states Add a button from the Buttons library Edit a button instance.
What does it mean “to study”?
Lesson 1 What is Camtasia?. Lesson 2 Editing Objectives After completing the lesson, the student will be able to: Edit a basic recording Camtasia file.
INTRODUCTION TO PHOTOSHOP. TOPICS TO BE DISCUSSED……….  Introduction Introduction  Features Features  Work Area Work Area.
Hands-on tutorial: Using Praat for analysing a speech corpus Mietta Lennes Palmse, Estonia Department of Speech Sciences University of Helsinki.
OCR Nationals: Unit 22 – Creating Sound using ICT A03 – Create an audio clip Sound Editing & Effects.
Speech analysis with Praat Paul Trilsbeek DoBeS training course June 2007.
Insight from Interactions Business Analyzer 1 INSIGHT FROM INTERACTIONS Business Analyzer Overview.
Praat LING115 November 4, Getting started Basic phonetic analyses with Praat –Creating sound objects Recording, reading from a file, creating from.
Adobe AuditionProject 4 guide © 2012 Adobe Systems IncorporatedOverview of Adobe Audition workspace1 Adobe Audition is an audio application designed for.
FIRST COURSE Word Tutorial 6 Using Mail Merge. Objectives Learn about the mail merge process Use the Mail Merge task pane Select a main document Create.
XP Tutorial 8 New Perspectives on Microsoft Windows XP 1 Microsoft Windows XP Object Linking and Embedding Tutorial 8.
Speech Analysis TA : 林賢進 HW /10/28 1. Goal This homework is aimed to analyze speech from spectrogram, and try to distinguish different initials/
Administriva l James will run a hands on tutorial in WEB 130 today at 3:30 and again at 2:00 and 3:30 on Thursday. l Can everyone that wants to attend,
M. D. Slaughter1 Microsoft Office PowerPoint 2003 Tutorial Lai 687 Instructional Design M. D. Slaughter.
Photo Story. How to use Photo Story Photo Story 3 can be located in the Accessories folder on school computers. You will need to have your pictures already.
CPSC 203 Introduction to Computers T97 By Jie (Jeff) Gao.
The first thing you need to do is log in. This is what the “Log In Screen” looks like. Remember to get teacher permission and login information prior.
XP New Perspectives on Microsoft Office Access 2003, Second Edition- Tutorial 6 1 Microsoft Office Access 2003 Tutorial 6 – Creating Custom Forms.
HW2-2 Speech Analysis TA: 林賢進
1 Instructions for using this template Delete this slide when you have completed your edits. Make a copy of this tutorial and use the copy to do your edits.
COMPREHENSIVE Excel Tutorial 12 Expanding Excel with Visual Basic for Applications.
JING SCREEN CAPTURE Anne Perorazio Information Resources Specialist UM Health Sciences Libraries
XP Creating Web Pages with Microsoft Office
Praat: doing phonetics by computer Introductory tutorial Kyuchul Yoon Division of English Kyungnam University.
Excel Tutorial 8 Developing an Excel Application
How to annotate your Audacity project
Fluency in Oral Interaction Workshop (FLOW)
An Introduction to : a closer look at analysing vowels
* the sampling rate and filter bandwidth were set to 500Hz and 1-10 Hz, respectively. * an additional 60Hz notch filter was employed to avoid the power-line.
Word Tutorial 6 Using Mail Merge
Speech Analysis TA:Chuan-Hsun Wu
Why Study Spoken Language?
Meanings of Intonational Contours
JMP User Group Meeting JSL Scripting101
N. Capp, E. Krome, I. Obeid and J. Picone
Why Study Spoken Language?
3.1 Basic Concept of Directory and Sub-directory
Learning How to Create an Online Interactive Poster using
Tools for Speech Analysis
Looking at Spectrogram in Praat cs4706, Jan 30
Presentation transcript:

Tools for Speech Analysis Julia Hirschberg CS4995/6998 Thanks to Jean-Philippe Goldman, Fadi Biadsy

2 Goals Create stimuli for a perception experiment Record subjects for a production experiment Analyze data from an experiment or from a natural speech corpus Extract features from speech data for a machine learning experiment

3 Many Software Options Goldwave(audio editor) Esps Xwaves(routines + visual.) Praat(speech analysis) Wavesurfer(speech editor) Transcriber(annotation tool) Matlab(general purpose soft) OGI speech tools(routines + app. dev.) …winpitch, pitchworks, phonedit, cooledit…..

4 Links (Matlab) (phonedit) (PitchWorks) (WinPitch) m.html (CoolEdit > Audition) m.html

5 Our Choice: PraatPraat Developed by Paul Boersma and David Weenink at the Institute of Phonetic Sciences, University of Amsterdam General purpose speech tool : editing, segmentation and labeling, prosodic manipulation, many tutorials, large user community, yahoo groupyahoo group

6

7 File Management Recording files and saving them –New menu Opening files –Read menu Long and short sound files Other file types –Write menu –Exercise: Record a file with your own name, play it to check, call it ‘ ’, save it to list, write it to a.wav file on disk, remove it from the objects list, read it back in

8 Editing Options from Objects Window Select and edit your name file Spectrum: –Show a spectral slice –Show a spectrogram Pitch: –Show pitch –Check the settings, change the range –Get pitch information: get pitch, get min/max pitch Intensity: –Get intensity information: similar to pitch functions –Check the settings Formant: Display

9 Modifying the Data Changing the pitch contour of your name file: Go to To manipulation Edit the new object Pitch  Stylize pitch (2st) Modify pitch by dragging points up and down Modify duration: –Add points in duration tier –Drag points up and down To save: File -> Publish resynthesis

10 Annotation: Textgrids From objects window, w/ sound file selected –Annotate  To textgrid –Point vs. interval tiers Add a point tier and an interval tier and insert some labels NB: remember to select the interval or point first in the waveform or spectrogram before trying to insert a label

11 Scripting From history: –Praat  new Praatscript  Edit  Paste history –NB: you can run all or part of the script Writing scripts Modifying existing scripts: –Tutorials, scripts, resources, user groups, searchTutorials, scripts, resources, user groups

Sample Praat Script # This script will create a new text-grid for a wav file form Make a text-grid for a.wav file comment Source Directory? sentence Directory C:\Documents and Settings\julila\My Documents\ comment File name? sentence Filename comment Tier Name? sentence Tier endform Read from file... 'directory$‘ ‘filename$' stem$ = left$(filename$,length(filename$)-4) select Sound 'stem$' To TextGrid... 'tier$' 'tier$‘ # tier names, which tiers are point tiers Write to text file... 'directory$'\'stem$'.TextGrid Remove

Task 1 Read in the file ‘mummy.wav’ Edit it Zoom in on the most prominent Select the entire contour –Display the pitch and intensity contours –What is the minimum pitch? Maximum? Mean? –What is the minimum intensity? Maximum? Mean?

Task2 Record a statement Record a yes-no question Record a wh-question What are the similarities? Differences?

Task3 Record something in a very loud voice, to produce clipping, and see what the waveform looks like – how do you identify clipping? Avoid it?

Task4 Record a file using falling intonation Modify it to produce a rising intonational contour Edit the new contour to –Raise the pitch range (select the contour and use ‘shift pitch frequencies’ –Stylize the pitch contour

Task 5: Pitch contour cloning Replace the pitch contour in beach.wav with the contour in speech.wav –Create a manipulation object for beach.wav –Extract the pitch tier –Create a manipulation object for speech.wav –Select the pitch tier of beach.wav and the manipulation object for speech.wav and click ‘Replace pitch tier’ –Select the manipulation object for speech.wav and click ‘Get resynthesis…’ –Compare the original file and the new file

Task6 Record “My mama moonlights in Memphis” –As angry speech –As sad speech –As happy speech For each token answer the following: –What is the mean pitch? Maximum? –What is the mean intensity? Maximum? –What is the duration? –Do you see any differences in the F0 contour?

Task 7 Edit mummy.wav Convert  Change Gender Filter  filter (pass) Hann band –Find a pass band that masks the words but retains the intonation –Find a pass band that masks the intonation but retains the words

20 Help Online help, FAQ, manual Links from Additional tutorials, scripts, resources, user groupsAdditional tutorials, scripts, resources, user groups

21 Next Class Report on your results from the seven tasks