Semantic annotation of a dialog corpus Silvie Cinková Institute of Formal and Applied Linguistics Charles University in Prague, Czech Republic COMPANIONS.

Presentation transcript:

Semantic annotation of a dialog corpus
Silvie Cinková
Institute of Formal and Applied Linguistics, Charles University in Prague, Czech Republic
COMPANIONS: European Commission Sixth Framework Programme, Information Society Technologies, Integrated Project IST-34434

Data for machine learning
audio-synchronized transcription
linguistic annotation
  – Charles University (Czech Republic)
  – Napier University (Edinburgh, UK)
  – University of Sheffield (UK)
  – Oxford University (UK)

Functional Generative Description
formal language description
Prague structuralism + computational linguistics, since the 1960s
stratifies language
  – phonology
  – morphology
  – surface syntax
  – underlying syntax (tectogrammatics)
transition between syntax and semantics
a "poor man's interlingua"

Dependency vs. constituency syntax
[Figure: tree diagram with constituency labels VP, NP, PP, NP over the words "Is that Jess on the left?"]

Tectogrammatical representation
"Underlying syntax"
linguistic meaning
syntactic and semantic relations
parent-child node(s)
valency
ellipsis restoration
coreference across sentence boundaries
information structure (TFA)
synonymous function → identical representation

Tectogrammatical representation
Is that Jess on the left?

Tectogrammatical representation
ellipsis restoration
coreference
Yes it is, laughing.

Current... and challenges
written
  – Prague Dependency Treebank: Czech newspapers, 800 k words, manually annotated, LDC 2006
  – Wall Street Journal: in progress, 15% so far
  monolog, reporting, standard language
spoken
  – dialogs: real-time interaction, clause fragments, exophora, deixis (syntax deviations)

Non-sentential utterances (NSU)
phrases (NP, PP, ADVP, ADJP)
  – Me.
  – At 5 o'clock.
  – Blue.
interjections
  – Mhm.
  – Oh, no!
interjections attached to phrases
  – No, Billy.
  – Oh, sure.
subordinate clause without main clause
  – If he goes with me.
  – Skiing.
phrase combinations in coordination or apposition
  – With Mary in the morning or shopping at Tesco.
  – Or without.

Utterance-response pair
"Who's that?" "Peggy."
utterance: U
response: NSU
[Figure labels: UPred, UMods, functors (semantic labels)]

Utterance-response pair
Who's that? [Peggy.]
Peggy. ("That is Peggy.")

Coreferential predicate
Two students?
Shopping with Mary.

Predicate with interjections
No, Billy.
Yes.
Mhm.

NSUMods versus UMods
attribute: response_type
values:
  – overrules
  – bridging
  – wh-path
  – other
form: reference (arrow) to antecedent node
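The attribute described above can be pictured as a small data structure. The following is only a sketch under stated assumptions: the class and field names (`Node`, `NSUMod`, `antecedent_id`) are hypothetical and not taken from the annotation scheme, and the functor label "PAT" is used purely as an illustrative value. Only the attribute name `response_type` and its four values come from the slide.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class ResponseType(Enum):
    # The four values listed on the slide.
    OVERRULES = "overrules"
    BRIDGING = "bridging"
    WH_PATH = "wh-path"
    OTHER = "other"


@dataclass
class Node:
    # Hypothetical minimal node: an identifier, a word form and a functor.
    node_id: str
    form: str
    functor: str  # semantic label; the values used below are illustrative


@dataclass
class NSUMod(Node):
    # A modifier of the NSU carries the response type and a reference
    # (the "arrow") to its antecedent node in the utterance U.
    response_type: Optional[ResponseType] = None
    antecedent_id: Optional[str] = None


# "Who's that?" / "Peggy.": the answer points back at the wh-word.
who = Node(node_id="u1-who", form="who", functor="PAT")
peggy = NSUMod(
    node_id="nsu1-peggy",
    form="Peggy",
    functor="PAT",
    response_type=ResponseType.WH_PATH,
    antecedent_id=who.node_id,
)
```

Storing the antecedent as an identifier rather than a nested object mirrors the idea of an arrow between two nodes that live in different trees.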

Non-conflicting modifier addition
Yes [I brought the book].
[It will be] probably not [worth getting].

Overruling
I'm at a little place called Ellenthorpe.
Hellenthorpe.

Overruling by an identical modifier
A: There are only two people in the class.
B: Two people?

Bridging
There are only two people in the class.
Two students?

Bridging
A: You lift the crane out, so this part will come up.
B: The end?

Pronominal anaphora vs. overruling
A: Peter should introduce Paul to Mary.
B: Rather her to him.

Wh-path
A: "Who's that?"
B: "Peggy."

Wh-path - different functor
matches up to the annotator
we expect regular alternation patterns
Where would you like to go tomorrow?
Shopping with Mary.

Other
A: He entered the largest room.
B: Room 128?
A: I don't know the number.

Summary
U-NSU pairs
NSU inherits the predicate of U (coreference)
NSU inherits all modifiers of U
NSU's own modifiers overrule the inherited ones
  – overrule
  – bridging
  – wh-path
  – other
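The inheritance rules in the summary can be sketched as a small function. This is a simplified illustration, not the annotation tool itself: modifiers are flattened to plain strings keyed by hypothetical node identifiers, and `resolve_nsu` is an assumed name. An NSU modifier that points at an antecedent replaces it; one without an antecedent is simply added (the non-conflicting case).

```python
from typing import Dict, List, Optional, Tuple


def resolve_nsu(
    u_pred: str,
    u_mods: Dict[str, str],
    nsu_mods: List[Tuple[str, Optional[str]]],
) -> Tuple[str, List[str]]:
    """Reconstruct the full reading of an NSU from its utterance U.

    u_mods maps a (hypothetical) node id to the modifier's form.
    nsu_mods lists (form, antecedent id or None) for the NSU's own modifiers.
    """
    resolved = dict(u_mods)  # the NSU inherits all modifiers of U
    added: List[str] = []
    for form, antecedent_id in nsu_mods:
        if antecedent_id in resolved:
            # The NSU's own modifier overrules the inherited one.
            resolved[antecedent_id] = form
        else:
            # No antecedent: non-conflicting modifier addition.
            added.append(form)
    # The predicate of U is inherited unchanged (coreference).
    return u_pred, list(resolved.values()) + added


# A: "There are only two people in the class."  B: "Two students?"
pred, mods = resolve_nsu(
    "be",
    {"m1": "only two people", "m2": "in the class"},
    [("two students", "m1")],
)
```

Here `pred` stays "be" and the modifier "only two people" is replaced by "two students", while "in the class" is carried over from U.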

References
Raquel Fernández, Jonathan Ginzburg, and Shalom Lappin (2007): Classifying Non-Sentential Utterances in Dialogue: A Machine Learning Approach. Computational Linguistics, Volume 33, Nr. 3. MIT Press for the Association for Computational Linguistics.
Eva Hajičová (ed.) (1995): Text-And-Inference-Based Approach to Question Answering. Prague, 1995.