Natural Language Generation Martin Hassel KTH NADA Royal Institute of Technology 100 44 Stockholm +46-8-790 66 34

Slides:



Advertisements
Similar presentations
Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 4 Shieber 1993; van Deemter 2002.
Advertisements

Why study grammar? Knowledge of grammar facilitates language learning
December 2003CSA3050: Natural Language Generation 1 What is Natural Language Generation? When is NLG an Appropriate Technology? NLG System Architectures.
Stat-JR: eBooks Richard Parker. Quick overview To recap… Stat-JR uses templates to perform specific functions on datasets, e.g.: – 1LevelMod fits 1-level.
Discourse Martin Hassel KTH NADA Royal Institute of Technology Stockholm
INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING NLP-AI IIIT-Hyderabad CIIL, Mysore ICON DECEMBER, 2003.
How Language Use Varies
Natural Language Generation: Discourse Planning
Chapter 20: Natural Language Generation Presented by: Anastasia Gorbunova LING538: Computational Linguistics, Fall 2006 Speech and Language Processing.
What is NLG? NLG "is the process of deliberately constructing a natural language text in order to meet specified communicative goals". [McDonald 1992]
Natural Language Generation Research Presentation Presenter Shamima Mithun.
A Pipelined Architecture Document Planning Microplanning Surface Realisation Document Plan Text Specification.
The Unified Software Development Process - Workflows Ivar Jacobson, Grady Booch, James Rumbaugh Addison Wesley, 1999.
Evaluation of NLP Systems Martin Hassel KTH NADA Royal Institute of Technology Stockholm
A Flexible Workbench for Document Analysis and Text Mining NLDB’2004, Salford, June Gulla, Brasethvik and Kaada A Flexible Workbench for Document.
Natural Language Generation Martin Hassel KTH CSC Royal Institute of Technology Stockholm
Natural Language Generation Ling 571 Fei Xia Week 8: 11/17/05.
Systems Analysis and Design in a Changing World, 6th Edition
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 6 Slide 1 Software Requirements 2.
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
Artificial Intelligence Research Centre Program Systems Institute Russian Academy of Science Pereslavl-Zalessky Russia.
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
Chapter 6: The Traditional Approach to Requirements
Introduction to Natural Language Generation
Model Performance Indicators.
9/8/20151 Natural Language Processing Lecture Notes 1.
SOFTWARE ENGINEERING BIT-8 APRIL, 16,2008 Introduction to UML.
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
Natural Language Generation An Overview
Lecture 12: 22/6/1435 Natural language processing Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
CSI 3120, Grammars, page 1 Language description methods Major topics in this part of the course: –Syntax and semantics –Grammars –Axiomatic semantics (next.
Natural Language Processing Introduction. 2 Natural Language Processing We’re going to study what goes into getting computers to perform useful and interesting.
Organizing Your Information
Teaching Productive Skills Which ones are they? Writing… and… Speaking They have similarities and Differences.
Chapter 14. Writing Definitions, Descriptions, and Instructions © 2013 by Bedford/St. Martin's1 What are definitions, descriptions, and instructions? A.
Chapter 15 Natural Language Processing (cont)
SEMANTIC ANALYSIS WAES3303
An Intelligent Analyzer and Understander of English Yorick Wilks 1975, ACM.
Selected Topics in Software Engineering - Distributed Software Development.
L8 - March 28, 2006copyright Thomas Pole , all rights reserved 1 Lecture 8: Software Asset Management and Text Ch. 5: Software Factories, (Review)
1 6 Systems Analysis and Design in a Changing World, 2 nd Edition, Satzinger, Jackson, & Burd Chapter 6 The Traditional Approach to Requirements.
7 Systems Analysis and Design in a Changing World, Fifth Edition.
NLP ? Natural Language is one of fundamental aspects of human behaviors. One of the final aim of human-computer communication. Provide easy interaction.
1 Cohesion + Coherence Lecture 9 MODULE 2 Meaning and discourse in English.
BioRAT: Extracting Biological Information from Full-length Papers David P.A. Corney, Bernard F. Buxton, William B. Langdon and David T. Jones Bioinformatics.
Culture , Language and Communication
Artificial Intelligence Research Center Pereslavl-Zalessky, Russia Program Systems Institute, RAS.
Introduction to Computational Linguistics
CPSC 871 John D. McGregor Module 3 Session 1 Architecture.
Tool for Ontology Paraphrasing, Querying and Visualization on the Semantic Web Project By Senthil Kumar K III MCA (SS)‏
Jan 2004CSA3050: NLG21 CSA3050: Natural Language Generation 2 Surface Realisation Systemic Grammar Functional Unification Grammar see J&M Chapter 20.3.
OBJECT ORIENTED AND FUNCTION ORIENTED DESIGN 1 Chapter 6.
Work your way up the pyramid of Six Writing Traits as writing develops. Do not leave one behind as you move on to the next – all must be evident for good.
For Friday Finish chapter 23 Homework –Chapter 23, exercise 15.
NLP. Introduction to NLP (U)nderstanding and (G)eneration Language Computer (U) Language (G)
NATURAL LANGUAGE PROCESSING
Discourse & Natural Language Generation Martin Hassel KTH NADA Royal Institute of Technology Stockholm
1 Sobah Abbas Petersen Adjunct Associate Professor TDT4252 Modelling of Information Systems Advanced Course Lecture 4: Introduction to.
 An important first quality of any good thesis is that it should stem from real problems in the field. Therefore, a researcher should emphasize the reasons.
System and the axis of Choice  Systems are list of choices which are available in the grammar of a language.  It could be a list of things b/w which.
SYNTAX.
WP4 Models and Contents Quality Assessment
Collecting Written Data
Assessing Grammar Module 5 Activity 5.
Objectives Importance of Requirement Engineering
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Assessing Grammar Module 5 Activity 5.
An Introduction to Software Architecture
Design Yaodong Bi.
Presentation transcript:

Natural Language Generation Martin Hassel KTH NADA Royal Institute of Technology Stockholm

2Martin Hassel What Is Natural Language Generation? A process of constructing a natural language output from non linguistic inputs that maps meaning to text.

3Martin Hassel Related Simple Text Generation Canned text Ouputs predefined text Template filling Outputs predefined text with predefined variable words/phrases

4Martin Hassel Areas of Use NLG techniques can be used to: generate textual weather forecasts from representations of graphical weather maps summarize statistical data extracted from a database or spreadsheet explain medical info in a patient-friendly way describe a chain of reasoning carried out by an expert system paraphrase information in a diagram for inexperienced users

5Martin Hassel Goals of a NLG System To supply text that is: correct and relevant information non redundant suiting the needs of the user in an understandable form in a correct form

6Martin Hassel Choices for NLG Content selection Lexical selection Sentence structure Aggregation Referring expressions Orthographic realisation Discourse structure

7Martin Hassel Example Architecture Discourse Specification Knowledge Base Communicative Goal Natural Language Output Discourse Planner Surface Realizer

8Martin Hassel Discourse Planner 1. Text shemata Use consistent patterns of discourse structure 2. Rhetorical Relations Uses the Rhetorical Structure Theory Used for varied generation tasks

9Martin Hassel Discourse Planner – Text Schemata Augmented Transition Network (ATN) Uses a set of internal states and transitions Produces texts with a rigid structure i.e. recipes, directions, technical instructions

10Martin Hassel Discourse Planner – Augmented Transition Network

11Martin Hassel Discourse Planner – Augmented Transition Network Example output: Save the document: First, choose the save option from the file menu. This causes the system to display the Save-As dialog box. Next, choose the destination folder and type the filename. Finally, press the save button. This causes the system to save the document.

12Martin Hassel Discourse Planner – Rhetorical Relations Rhetorical Structure Theory (Mann & Thompson 1988) Nucleus Multi-nuclear Satellite

13Martin Hassel Discourse Planner – Rhetorical Relations 23 rhetorical relations, among these: Elaboration Contrast Condition Purpose Sequence Result

14Martin Hassel Discourse Planner – Rhetorical Relations Rethorical annotation:

15Martin Hassel Discourse Planner – Rhetorical Relations The resulting RST tree corpus: (SATELLITE(SPAN|4||19|)(REL2PAR ELABORATION- ADDITIONAL) (SATELLITE(SPAN|4||7|)(REL2PAR CIRCUMSTANCE) (NUCLEUS(LEAF|4|)(REL2PAR CONTRAST) (TEXT _!THE PACKAGE WAS TERMED EXCESSIVE BY THE BUSH |ADMINISTRATION,_!|)) (NUCLEUS(SPAN|5||7|)(REL2PAR CONTRAST) (NUCLEUS(LEAF|5|)(REL2PAR SPAN) (TEXT _!BUT IT ALSO PROVOKED A STRUGGLE WITH INFLUENTIAL CALIFORNIA LAWMAKERS_!))

16Martin Hassel Surface Realisation 1. Systemic Grammar Represents sentences as collections of functions Using functional categorization 2. Functional Unification Grammar Builds generation grammar as a feature structure Using functional categorization

17Martin Hassel Surface Realisation – Systemic Grammar Emphasises the functional organisation of language Surface forms are viewed as the consequences of selecting a set of abstract functional features Choices correspond to minimal grammatical alternatives The interpolation of an intermediate abstract representation allows the specification of the text to accumulate gradually

18Martin Hassel Surface Realisation – Systemic Grammar Uses multiple layers Mood, Transivity, Theme Meta-functions Interpersonal meta-function (mood) Ideational meta-function (transivity) Textual meta-function (theme) Grammar represented as a system network Directed, acyclic and/or graph

19Martin Hassel Surface Realisation – Systemic Grammar Present-Participle Past-Participle Infinitive Polar Wh- Imperative Indicative Declarative Interrogative Major Minor Mood … Bound Relative

20Martin Hassel Surface Realisation – Systemic Grammar Clause Choices Major indicative declarative: The cat is on the mat. Major indicative declarative relative: [He didn’t see the cat] that chased the rat. Major indicative declarative bound: [It only hurts] when I laugh. Major indicative interrogative polar: Has anybody seen my seagull? Major imperative: Don’t be ridiculous. Minor present-participle: [You’ll enjoy] having more free time.

21Martin Hassel Surface Realisation – Functional Unification Grammar Basic idea: Input specification in the form of a FUNCTIONAL DESCRIPTION, a recursive matrix The grammar is a large functional description with alternations representing choice points Realisation is achieved by unifying the input FD with the grammar FD

22Martin Hassel Surface Realisation – Functional Unification Grammar ((cat clause) (process ((type composite) (relation possessive) (lex ‘hand’))) (participants ((agent ((cat pers_pro) (gender feminine))) ((affected  ((cat np) (lex ‘editor’))) ((possessor  )) ((possessed ((cat np) (lex ‘draft’))))) She hands the draft to the editor.

23Martin Hassel Microplanners 1 Lexical selection Syntactic realization Morphological realization Orthographic realization

24Martin Hassel Microplanners 2 Referring Expression Generation Referring expression generation is concerned with how we describe domain entities in such a way that the hearer will know what we are talking about Major issue is avoiding ambiguity Fluency pulls in the opposite direction

25Martin Hassel Kinds of Referring Expressions Proper names Aberdeen, Scotland Aberdeen Definite Descriptions the train that leaves at 10am the next train Pronouns she it

26Martin Hassel Microplanners 3 Aggregation Some possibilities: Simple conjunction Ellipsis Set introduction

27Martin Hassel Aggregation Without aggregation: The next train is the Caledonian Express. It leaves at 10AM. With Simple Conjunction : The next train is the Caledonian Express, and it leaves at 10AM. Without aggregation: It leaves at 10AM. It arrives at 12.30PM. With ellipsis: It leaves at 10AM, and Ø arrives at 12.30PM.

28Martin Hassel Aggregation Without aggregation: It has a snack bar. It has a restaurant car. With set introduction : It has {a snack bar, a restaurant car}. It has a snack bar and a restaurant car. Caution! Need to avoid changing the meaning: John bought a TV and Bill bought a TV. John and Bill bought a TV.

29Martin Hassel Further Reading Siggen Allen 1995: Natural Language Understanding cl/Allen95/al1995co.htm