A Corpus Search Methodology for Focus Realization Jonathan Howell and Mats Rooth Linguistics and CIS Cornell University.

Slides:



Advertisements
Similar presentations
Presented by Erin Palmer. Speech processing is widely used today Can you think of some examples? Phone dialog systems (bank, Amtrak) Computers dictation.
Advertisements

SPEECH RECOGNITION 2 DAY 15 – SEPT 30, 2013 Brain & Language LING NSCI Harry Howard Tulane University.
Adopting the Process Approach to Teaching Listening Dr. Jian Kang Loar Defense Language Institute October 15, 2011.
Frequency, Pitch, Tone and Length October 15, 2012 Thanks to Chilin Shih for making some of these lecture materials available.
Prosodic Signalling of (Un)Expected Information in South Swedish Gilbert Ambrazaitis Linguistics and Phonetics Centre for Languages and Literature.
Tone, Accent and Stress February 14, 2014 Practicalities Production Exercise #2 is due at 5 pm today! For Monday after the break: Yoruba tone transcription.
General Problems  Foreign language speakers of a target language cause a great difficulty to native speakers because the sounds they produce seems very.
Itay Ben-Lulu & Uri Goldfeld Instructor : Dr. Yizhar Lavner Spring /9/2004.
Dr. O. Dakkak & Dr. N. Ghneim: HIAST M. Abu-Zleikha & S. Al-Moubyed: IT fac., Damascus U. Prosodic Feature Introduction and Emotion Incorporation in an.
SPEECH RECOGNITION Kunal Shalia and Dima Smirnov.
AN INTRODUCTION TO PRAAT Tina John M.A. Institute of Phonetics and digital Speech Processing - University Kiel Institute of Phonetics and Speech Processing.
Introduction to Speech Synthesis ● Key terms and definitions ● Key processes in sythetic speech production ● Text-To-Phones ● Phones to Synthesizer parameters.
Automatic Prosody Labeling Final Presentation Andrew Rosenberg ELEN Speech and Audio Processing and Recognition 4/27/05.
Natural Language Processing AI - Weeks 19 & 20 Natural Language Processing Lee McCluskey, room 2/07
Pavel Skrelin (Saint-Petersburg State University) Some Principles and Methods of Measuring Fo and Tempo.
Chapter three Phonology
Focus Contrast in Web Harvested Data Mats Rooth Linguistics and CIS Cornell University based on joint research with Jonathan Howell.
A PRESENTATION BY SHAMALEE DESHPANDE
Phonology Katie Burns Title III Resource Teacher.
Text-To-Speech System for Marathi Miss. Deepa V. Kadam Indian Institute of Technology, Bombay.
Building the Design Studio of the Future Aaron Adler Jacob Eisenstein Michael Oltmans Lisa Guttentag Randall Davis October 23, 2004.
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
14: THE TEACHING OF GRAMMAR  Should grammar be taught?  When? How? Why?  Grammar teaching: Any strategies conducted in order to help learners understand,
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
Some Voice Enable Component Group member: CHUAH SIONG YANG LIM CHUN HEAN Advisor: Professor MICHEAL Project Purpose: For the developers,
ISSUES IN SPEECH RECOGNITION Shraddha Sharma
Phonetics and Phonology
Whither Linguistic Interpretation of Acoustic Pronunciation Variation Annika Hämäläinen, Yan Han, Lou Boves & Louis ten Bosch.
Supervisor: Dr. Eddie Jones Electronic Engineering Department Final Year Project 2008/09 Development of a Speaker Recognition/Verification System for Security.
Microphone Integration – Can Improve ARS Accuracy? Tom Houy
1 Computational Linguistics Ling 200 Spring 2006.
Intonation in Communication Skill: Recent Research Discourse, both in theoretical linguistics and in foreign language pedagogy,has focused on describing.
Advanced Spoken English Phonology session 2 Stress & Weak Forms 1.
SPEECH PERCEPTION DAY 16 – OCT 2, 2013 Brain & Language LING NSCI Harry Howard Tulane University.
Intelligibility of voiced and voiceless consonants produced by Lebanese Arabic speakers with respect to vowel length Romy Ghanem.
Evaluating prosody prediction in synthesis with respect to Modern Greek prenuclear accents Elisabeth Chorianopoulou MSc in Speech and Language Processing.
LATERALIZATION OF PHONOLOGY 2 DAY 23 – OCT 21, 2013 Brain & Language LING NSCI Harry Howard Tulane University.
Speaker Recognition by Habib ur Rehman Abdul Basit CENTER FOR ADVANCED STUDIES IN ENGINERING Digital Signal Processing ( Term Project )
Frequency, Pitch, Tone and Length October 16, 2013 Thanks to Chilin Shih for making some of these lecture materials available.
Ideas for 100K Word Data Set for Human and Machine Learning Lori Levin Alon Lavie Jaime Carbonell Language Technologies Institute Carnegie Mellon University.
Introduction to Computational Linguistics
Artificial Intelligence 2004 Speech & Natural Language Processing Speech Recognition acoustic signal as input conversion into written words Natural.
Recognizing Discourse Structure: Speech Discourse & Dialogue CMSC October 11, 2006.
A Fully Annotated Corpus of Russian Speech
Auckland 2012Kilgarriff: NLP and Corpus Processing1 The contribution of NLP: corpus processing.
Tone, Accent and Quantity October 19, 2015 Thanks to Chilin Shih for making some of these lecture materials available.
Performance Comparison of Speaker and Emotion Recognition
Automatic Speech Recognition A summary of contributions from multiple disciplines Mark D. Skowronski Computational Neuro-Engineering Lab Electrical and.
Phonetics, part III: Suprasegmentals October 19, 2012.
Development of an Intelligent Translation Memory MorphoLogic SZAK Publishers Balázs Kis
Levels of Linguistic Analysis
Nuclear Accent Shape and the Perception of Syllable Pitch Rachael-Anne Knight LAGB 16 April 2003.
Lecture 1 Phonetics – the study of speech sounds
Machine Learning in Practice Lecture 9 Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
1 7-Speech Recognition Speech Recognition Concepts Speech Recognition Approaches Recognition Theories Bayse Rule Simple Language Model P(A|W) Network Types.
Pitch Tracking + Prosody January 19, 2012 Homework! For Tuesday: introductory course project report Background information on your consultant and the.
Welcome to All S. Course Code: EL 120 Course Name English Phonetics and Linguistics Lecture 1 Introducing the Course (p.2-8) Unit 1: Introducing Phonetics.
2014 Development of a Text-to-Speech Synthesis System for Yorùbá Language Olúòkun Adédayọ̀ Tolulope Department of Computer Science.
Unit One Basic Concepts: Syllables, Stress & Rhythm.
Text Linguistics. Definition of linguistics Linguistics can be defined as the scientific or systematic study of language. It is a science in the sense.
Music Matching Speaker : 黃茂政 指導教授 : 陳嘉琳 博士.
3.0 Map of Subject Areas.
Studying Intonation Julia Hirschberg CS /21/2018.
What is Linguistics? The scientific study of human language
Turn-taking and Disfluencies
S. M. Joshi College, Hadapsar, Pune-28.
Command Me Specification
Artificial Intelligence 2004 Speech & Natural Language Processing
Huawei CBG AI Challenges
Presentation transcript:

A Corpus Search Methodology for Focus Realization Jonathan Howell and Mats Rooth Linguistics and CIS Cornell University

Goals Study phonetic realization of focus in cases where formal-semantic theories make clear predictions. Natural data from podcasts, radio, etc. Find data using speech search engine based on speech recognition (Everyzing) Automate all of the workflow Today: preliminary data from pilot

he stayed longer than I did -er [[ he he stayed x long] 2 than [ I F stayed x long ]~2] [ y stayed x-long ] antecedent clause [ speaker stayed x-long ] scope of focus

… I should have liked that song a lot more than I did. [more x[[should w[ I like that song x well in w]] than [I like that song x well in w 0 ]]]

I understand even less than I did before even less [[ I prs understand x much] 2 than [I understood x much before F ] ]~2]

Focus in comparative clauses Coherent syntactic-semantic theory about where focus should go Possibilities are constrained, because the main clause is usually the antecedent for focus interpretation in the comparative clause On a theoretical basis, we often think we know the correct grammatical analysis of sentences people use

Result Hundreds of elements of a minimal pair varying position for focus Speech files for short and 10-second intervals spanning than I did Everyzing html contains time offsets for beginnings words. These are converted by program into a Praat representation. Alingments are not good enough to use without correction.

Classification Listen to sound snippet to determine if there is an actual token of “than I did”. True in 56% of cases in a sample of 179 tokens.

Classify correct tokens into three grammatical-semantic classes scomparing than- and main clauses, reference varies in the position of “I”. This licenses focus on the subject “I”. [ he looked younger than I did. ] 21/40 tokens

d Comparing than- and main clauses, reference is constant in the position of “I”, but varies in the possible-world or temporal index of did, and not in any following position. Depending on details of the representation of modality and time, this could license a focus on “did”. 5/40 tokens

f comparing than- and main clauses, reference in the position of I is constant, but varies in some position following did, often a temporal phrase. I actually look younger now than I did 5 years ago 13/40 tokens

Mark vowel intervals in I and did with hand work. Pitch in vowel region and duration of vowel region contribute positively to the area under the pitch curve (definite integral of pitch). Number of glottal pulses in the vowel region.

NLP vs. Acoustic Phonetics Classification based on signal NLP classifier based on correct sentence (or speech recognition output), using parsing and machine learning on text features

Multiple focus Issues marking of multiple foci with different scopes, and prominence of focus relative to accents not marking focus. You made a very small amount more than I did. Now I make much F more than you F do.