8/23/20141 Accenting and Information Status Julia Hirschberg CS 4706.

Slides:



Advertisements
Similar presentations
Writing constructed response items
Advertisements

EER to Relation Models Mapping
Introduction to TCP/IP and OSI model
1/13/20141 What is SimNet? LEARNING & ASSESSMENT MODULES FOR… Office 2010 | Windows Vista & IE7,8,9 | Windows XP, Vista & 7 | Computer Concepts In a simulated.
Monday, January 13, Knowledge of Adult Learners Lesson 2 Rose DeJarnette.
Monday, January 13, Instructor Development Lesson 9.
Monday, January 13, Instructor Development Lesson 6 Instructor Resources.
Rev Monday, January 13, Foundations, Technology, Skills Tools.
1/15/ A car starts from rest and travel 10s with uniform acceleration 5m/s 2. Find out its final velocity. Here, u=0 m/s t=10s a= 5m/s 2 v=? We.
an innovative solution by central control
National Seminar on Developing a Program for the Implementation of the 2008 SNA and Supporting Statistics in Turkey Arzu TOKDEMİR 10 September 2013 Ankara.
The AESKOPP System and the Steps of Selling
Welcome Welcome to the next session in the professional development program focused around the 9-12 Mathematics Standards. 3/1/20141Geometry.
SAP-Customizing SAP-Customizing.

The 5S numbers game..
A DIY Accessible Hardware Solution for Students with Disabilities | Ryan Vernon BC College and Institute.
Sampling of Coal Dr kalyan sen Director, Central Fuel Research Institute, Dhanbad, 2003.
S2 Training Guide OVERVIEW 1. … DESKTOP 3/31/2017.
The basics for simulations
Pengukuran penyakit dalam populasi 3/2014 DISIAPKAN OLEH PROF. DR.DRH.PRATIWI, TS. MS DRH.ROSITAWATI, I. MP 2/22/2014PTS-RST-PKH
6/2/20141 A Short Tour of our School. 6/2/20142 Hazelwood is a part of the Edmonds School District th Street SW Lynnwood, WA (425)
Grade-3 Pine View School Mrs. Seider’s class
Analysis of Union Budget 2012 CENVAT Credit CA. Atul Kumar Gupta.
Referring Expressions: Definition Referring expressions are words or phrases, the semantic interpretation of which is a discourse entity (also called referent)
6/12/20141 Money and Banking Chapter Outline The Functions of Money The Functions of Money The Components of Money Supply The Components of Money.
Please, select a question: How does a personal account work? How to apply to a job offer? How to send a spontaneous application? How to recover your password?
8/25/ Click on Member Billing Icon 2 8/25/2014 Recent Updates to the Program 3.
Accessing spoken words: the importance of word onsets
Project Quality Management
1 Small group teaching. 10/10/ What is a small group: Small groups are not determined by number, but by certain characteristics: – Active student.
11 Problem Based Learning. 10/10/ The original problem-based curriculum at McMaster University, featuring small learning groups with a faculty.
1 Small group teaching2. 10/10/ Summary of techniques for effective facilitation in group discussion 1. Brainstorming: Everyone should be encouraged.
10/11/20141 MART Managers’ Conference G. George Wallin, PhD, MBA Vice President/Chief Operating Officer Sherburne TeleSystems, Inc.
08/01/ Final Conference The SONETOR platform- Functionalities and services Catherine Christodoulopoulou CTI.
10/12/20141 Part /12/20142 Week 16, 17, 18 and 19 The Atmosphere Modules 3, 4, 7 and 8 in AP The study of the Air and the Atmosphere and how we.
Session Agenda  What is WebCRD?  The four ways to place an order  Placing an order from an application  Uploading a document  Placing a Catalog order.
10/12/20141Chem-160. Covalent Bonds 10/12/20142Chem-160.
12/10/ Guy Platten Caledonian Maritime Assets Ltd (CMAL) 12/10/20142 Hybrid Ferries – the opportunity for Scotland.
10/22/20141 GDP and Economic Growth Chapter /22/20142 Outline Gross Domestic Product Gross Domestic Product Economic Growth Economic Growth.
MarcEdit "A Closer Look at Productivity Tools” NETSL 2014 Apr. 11, pm.
Bayesian networks - Time-series models - Apache Spark & Scala
Human Speech Recognition Julia Hirschberg CS4706 (thanks to John-Paul Hosum for some slides)
“Effect of Genre, Speaker, and Word Class on the Realization of Given and New Information” Julia Agustín Gravano & Julia Hirschberg {agus,
Eye Movements and Spoken Language Comprehension: effects of visual context on syntactic ambiguity resolution Spivey et al. (2002) Psych 526 Eun-Kyung Lee.
Teaching Listening Zhang Lu.
“Downstepped contours in the given/new distinction” Agustín Gravano Spoken Language Processing Group Columbia University, New York On the Role of Prosody.
Using prosody to avoid ambiguity: Effects of speaker awareness and referential context Snedeker and Trueswell (2003) Psych 526 Eun-Kyung Lee.
Accent and reference resolution (Dahan, Tanenhaus & Chambers, in press) Experiment 1: –Tested hypothesis that de-accented noun is interpreted as referring.
1 Discourse, coherence and anaphora resolution Lecture 16.
1 Spoken Dialogue Systems Dialogue and Conversational Agents (Part IV) Chapter 19: Draft of May 18, 2005 Speech and Language Processing: An Introduction.
Accenting, Givenness, and Syntactic Role By E.G. Bard and M.P. Aylett Presented by David Vespe.
Information Status Varieties of Information Status –Contrast John wanted a poodle but Becky preferred a corgi. –Topic/comment The corgi they bought turned.
LANGUAGE LEARNING STRATEGIES
Stages of Second Language Acquisition
The time-course of prediction in incremental sentence processing: Evidence from anticipatory eye movements Yuki Kamide, Gerry T.M. Altman, and Sarah L.
Common Ground Linguistic referents are established w/in a “domain of interpretation”, which includes context –One component of context = Common Ground.
Teaching Listening.
Acoustic Cues to Emotional Speech Julia Hirschberg (joint work with Jennifer Venditti and Jackson Liscombe) Columbia University 26 June 2003.
Using Technology to Teach Listening Skills
Teaching Listening Why teach listening?
Information Status.
Studying Intonation Julia Hirschberg CS /21/2018.
Accenting and Information Status
Accenting and Information Status
Information Structure and Prosody
Accenting and Given/New Status
“Downstepped contours in the given/new distinction”
Agustín Gravano & Julia Hirschberg {agus,
Discourse Structure in Generation
Presentation transcript:

8/23/20141 Accenting and Information Status Julia Hirschberg CS 4706

8/23/20142 Information Status Topic/comment, theme/rheme The orangutan we wanted to buy escaped from the pet store. Focus of attention I only bought candy for that orangutan. Given/new I only bought candy for that orangutan. I would never buy an ape drugs! All commonly signaled in human speech by intonation

8/23/20143 Today: Acent and Given/New Motivation in speech technology Models of Given/New Experiments on Given/New and pitch accent Possible models of intonation wrt given/new entities How might we identify given/new information automatically? How should we produce given/new information appropriately? Why is this important?

8/23/20144 A Simple Definition Given: Recoverable from some form of context or, what a Speaker believes to be in a Hearer’s consciousness New: Not recoverable from context or, what a Speaker believes is not in a Hearer’s consciousness

8/23/20145 Role in Speech Technologies TTS: Natural production –Given information is often deaccented –New information is usually accented ASR: Improved recognition –Given information may already have been recognized earlier –New information may be important cue to topic shift Summarization: Improved precision –Given information less likely to be included in a summary; new information more likely

8/23/20146 Spoken Dialogue Systems: Grounding –Critical for system to convey what is given and what is new to facilitate Hearer comprehension

8/23/20147 Prince ’81: A More Complex Model Speaker (S) and Hearer (H), in a discourse, construct a discourse model –Includes discourse entities, attributes, and links between entities –Discourse entities: individuals, classes, exemplars, substances, concepts (NPs) Entities when first introduced are new –Brand-new (H must create a new entity) My dog bit a rhinoceros this morning.

8/23/20148 –Unused (H already knows of this entity) The sun came out this morning. Evoked entities are old, or ‘given’ -- already in the discourse –Explicitly evoked (in text or speech) The rhinoceros was wearing suspenders. Rather unusual for a rhino. –Situationally evoked Watch out for the snake! Inferables are also old, or ‘given’ I bought a new car. The gear shift is a bit tricky.

8/23/20149 Prince ’92: A Still More Complex Model Hearer-centric information status: –Given: what S believes H has in his/her consciousness –New: what S believes H does not have in his/her consciousness But discourse entities may also be given and new wrt the current discourse –Discourse-old: already evoked in the discourse –Discourse-new: not evoked

8/23/ The stars are very bright tonight (Hearer- given; Discourse-new) When I see stars this bright, I think of my vacations in the mountains. (Hearer-given; Discourse-given) My friend Buddy and I would sneak out late at night. (Hearer-new; Discourse-new) I said, “My friend BUDDY…” (Hearer-new; Discourse-given)

8/23/ Given/New and Pitch Accent New information is often accented and given information is often deaccented (Halliday ‘67, Brown ‘83, Terken ‘84) –But there are many exceptions: a simple TTS rule: accent ‘new’ and deaccent ‘given’ will make 25-30% errors –How can we reduce these errors, to produce human-like intonation?

8/23/ Brown ‘83: Accent Status and Subclasses of Given/New Speech elicitation in laboratory –12 Scottish-English undergrads –A describes a diagram for B to draw, which B cannot see Draw a black triangle. Draw a circle in the middle. Draw a blue triangle next to the black one with a line from the top angle to the bottom. Analysis: based on Prince ‘81 categories with modifications

8/23/ –Brand-new (a triangle), given:inferrable (middle, angle), given:contextually evoked (the page), given:‘textually’ evoked (divided into current topic vs. earlier mention) –Accent status of all entity-referring NPs Results: –Brand-new information accented (87%) Note: new entity/old expression issue –Given: contextually evoked information deaccented (98%) –Given: ’textually’ evoked deaccented (current topic 100%; earlier: 96%) –Given: inferable information accented (79%)

8/23/ Boston Directions Corpus (Hirschberg & Nakatani ’96)Hirschberg & Nakatani ’96 Experimental Design 12 speakers: 4 used Spontaneous and read versions of 9 direction- giving tasks (monologues) Corpus: 50m read; 67m spon Labeling –Prosodic: ToBI intonational labeling –Given/new (Prince ’92), grammatical function, p.o.s.,…

8/23/ d1: dsp1: step 1: enter and get token first enter the Harvard Square T stop and buy a token d2: dsp2: inbound on red line then proceed to get on the inbound um Red Line uh subway Boston Directions Corpus: Describe how to get to MIT from Harvard

8/23/ dp3 dsp3: take subway from hs, to cs to ks and take the subway from Harvard Square to Central Square and then to Kendall Square dp4: dsp4: get off T. then get off the T

8/23/ Hearer and Discourse Given/New Labeling first enter the Harvard Square T stop and buy a token then proceed to get on the inbound um Red Line uh subway and take the subway from Harvard Square to Central Square and then to Kendall Square then get off the T

8/23/ Hearer and Discourse Given/New Labeling first enter and buy then proceed to get on <HI/DN the inbound um Red Line uh subway> and take from to and then to then get off

8/23/ Does Given/New Status Predict Deaccenting? NPaHGHIHNDGDN Deaccented37.1%53.9%26.2%43.3%38.8% Total HG: Hearer Given HI: Hearer Inferable HN: Hearer New DG: Discourse Given DN: Discourse New 39.4% of (H or D) Given items deaccented… 36.9% of (H or D) New Items are deaccented…

8/23/ And….Bard’99: Givenness, deaccenting and intelligibility Speech elicited in laboratory –Glasgow Scottish-English Map Task Each has a slightly different map A traces a route described by B Analysis –Compare repeated mentions of same items (i.e. given items) wrt accent status Within dialogue Across dialogue Findings

8/23/ –Deaccenting rare in repeated mentions (within 15% and across 6% dialogues) –But repeated mentions were `less intelligible’ Caveats: –Were they really identifying ‘deaccenting’ (the absence of a pitch accent)? –Were mentions within speaker or across speaker? –Some more questions to ask….

8/23/ What else is going on? Given/new and grammatical function Hypothesis: how discourse entities are evoked in a discourse influences accent status E.g., How might grammatical function and surface position interact with the accentuation of ‘given’ items? Cases: –X has not been mentioned in the prior context –X has been mentioned, with the same grammatical function/surface position –X has been mentioned but with a different grammatical function/surface position

8/23/ Experimental Design Major problem: –How to elicit ‘spontaneous’ productions while varying desired phenomena systematically? –Key: simple variations and actions can capitalize upon natural tendency to associate grammatical functions with particular thematic roles for a given set of verbs

8/23/ Triangle Cylinder Diamond Rectangle Octagon

8/23/ Triangle Cylinder Diamond Rectangle Octagon Context 1

8/23/ Triangle Cylinder Diamond Rectangle Octagon Context 2

8/23/ Triangle Cylinder Diamond Rectangle Octagon Context 3

8/23/ Triangle Cylinder Diamond Rectangle Octagon Target(A)

8/23/ Triangle Cylinder Diamond Rectangle Octagon Target(B)

8/23/ Materials 9 objects in visual display 3 event types: –X covers Y (subject, object) –X pushes Y against Z (subject, object, pp-object) –X touches Y (subject, object) 75 scenarios of 4 sequences of actions each –3 context turns (all containing the same given item) –1 target turn (always containing the same given item) –3x3 design (given item is subj, direct object or pp-obj in context and same or not in target) with 5 scenarios per cell –2 controls: all new, all given objects (15 scenarios each) –Presented in random order

8/23/ Experimental Conditions 10 native speakers of standard American English Subject and experimenter in soundproof booth Subject told to describe scenes to confederate outside the booth, visible but with providing no feedback 10 practice scenarios ~20 minutes per subject

8/23/ Prosodic Analysis Target turns excised and analyzed by two judges independently for location of pitch accents for each referring expression: accented (2), unsure (1), deaccented (0)  accentedness score from 0-4 (81% agreement for 0 and 2 scores) Accent scores calculated by adding the two judges scores for each item (range: 0-4)

8/23/ Grammatical Role/Surface Position Accenting ‘Score’ CONTEXTTARGET GIVENSubjD-objPp-obj Subj D-obj Pp-obj NEW

8/23/ Findings Items that differ from context to target in grammatical function or surface position tend to be accented Items that share grammatical function and surface position tend to be deaccented Subjects were accented more than objects, even if previously mentioned in same role Direct objects and pp-objects differ more from subjects than from each other

8/23/ Given/New Isn’t Just About Discourse Entities Consider e.g. Subj  D.O. variation The TRIANGLE touches the CYLINDER. The triangle touches the DIAMOND. The triangle touches the OCTAGON. The RECTANGLE touches the TRIANGLE. An entity may be ‘given’ or ‘new’ wrt the role it plays in the discourse

8/23/ How can we determine automatically whether a discourse entity is given or new? A rule-based approach: –Stem the content words in the discourse –Select a window within which incoming items with the same stem as a previous entity and within this window will be labeled ‘given’ Other items are ‘new’ Is this hearer-based? Discourse-based? How well does it predict pitch accent? –65-75% accurate (precision) depending on genre, domain

8/23/ What else can we do? Instead of just accenting new and deaccenting given items –Keep track of given ‘type’ (evoked or inferable) –Keep track of grammatical function of discourse entities when introduced (subject, direct object, pp-object) What else? downstepped contours and given/new

8/23/ How important is it to accent given/new items appropriately? Are listeners sensitive to intonational correlates of information status? Evidence that ‘appropriate’ accentuation facilitates comprehension: –Birch & Clifton (1995): appropriateness speeds makes-sense judgments in Q&A pairs –Bock & Mazzella (1983): comprehension time of denial-counterassertion pairs –Davidson (2001): phoneme-monitoring in denial- counterassertion pairs –Terken & Nooteboom (1987), Nooteboom & Kruyt (1987): verification latencies of target words

8/23/ Intonational cues in on-line processing Dahan et al. (2002): Accentuation effects referential interpretation even at very early stages of processing Used eye-tracking to monitor listeners’ fixations on pictured entities as they heard instructions to manipulate these entities on computer screen Examined moment-by-moment recognition of accented vs. unaccented words which share a primary-stressed initial syllable (e.g. candy/candle).

8/23/ Dahan et al. (2002) Example discourse: “Put the CANDLE below the triangle. Now put the CAN | DLE above the square.” utt 1utt 2 Now put the [kæn]... = candle = candy = candle pred. fixation

8/23/ CANDLE-CAN|DLE condition candle candy  Competition from “candy” upon hearing accented [kæn].  Accentuation is used by listeners to process discourse representations on-line, as a word is unfolding. [From Dahan et al. (2002)]

8/23/ Accent, Given/New, and Grammatical Function Some more evidence about grammatical function…. Dahan et al ‘02 also examine conditions in which the antecedent and target did not share grammatical role: “Put the necklace below the candle. Now put the CANDLE above the square.” –NO competitor effects (i.e. no looks to “candy” upon hearing accented [kæn])

8/23/ “Put the necklace below the CANDLE. Now put the CANDLE above the square.” Prediction: “candle” is given  competition from “candy” upon hearing accented [kæn]. [From Dahan et al. (2002)] No competitor effects in this condition! Here, accent serves to cue shift in “focus”. [See also Terken & Hirschberg (1994)]

8/23/ Grammatical Role or Syntactic parallelism? Dahan et al.’s and Terken & Hirschberg’s data confound grammatical role with syntactic parallelism Venditti et al ‘02,’03: syntactic parallelism (NOT just persistence of grammatical role) affects interpretation of nuclear-accented pronouns John hit Bill and then HE... hit George. (N2 pref)... ran away. (less N2 pref) N2 pref when syntatically parallel clauses

8/23/ Next Class Back end synthesis and TTS evaluation