Natural Language Generation 74.793 Research Presentation Presenter Shamima Mithun.

Presentation transcript:

Natural Language Generation Research Presentation Presenter Shamima Mithun

Overview
- Introduction
- What is Natural Language Generation (NLG)?
- Usages of Natural Language Generation
- When are NLG systems appropriate?
- Applications of NLG
- Example NLG System
- Architectures for NLG
- How to Evaluate NLG Systems
- Conclusions
- Demo on ILEX System

What is NLG
- “Natural language generation is the process of deliberately constructing a natural language text in order to meet specified communicative goals.” [McDonald 1992], from [Dale and Reiter 1999]
- “Natural Language Generation (NLG) is the process of constructing natural language outputs from non-linguistic inputs.” [Jurafsky and Martin 2000]

What is NLG (contd.)
Non-linguistic input → NLG system → output text
- Goal: produce understandable and appropriate texts in English or other human languages
- Input: some underlying non-linguistic representation of information, e.g. meteorological maps, airline/railway schedule databases
- Output: documents, reports, explanations, help messages, and other kinds of texts
- Knowledge sources required: knowledge of the language and of the domain
[Dale and Reiter 1999]

When Are NLG Systems Appropriate?
- Text vs. graphics: which medium is better?
- Computer generation vs. human authoring: is the necessary source data available? Is automation economically justified?
- NLG vs. simple string concatenation: how much variation occurs in the output texts?
[Reiter and Dale 1999]

Applications of NLG
- Automated document production: weather forecasts, summarizing statistical data, answering questions, etc.
- Information presentation: medical records, weather forecasts, etc.
- Entertainment: jokes, stories, poetry, etc.
- Teaching
- Dialog systems [Rambow et al., 2001]

Applications of NLG (contd.)
Two types of NLG systems:
- The system produces a document without human help: summaries of statistical data, weather forecasts, etc.
- The system helps human authors to create documents: customer-service letters, patent claims, technical documents, job descriptions, etc.
[Reiter and Dale 2000]

NLG System: FoG
Reiter and Dale describe the FoG system as follows:
- Function: produces textual weather reports in English or French
- Input: graphical weather depiction
- User: Environment Canada (Canadian Weather Service)
- Developer: CoGenTex
- Status: fielded, in operational use since 1992
[Reiter and Dale 1999]

NLG System: FoG
[Figure: example FoG input and output, from Reiter and Dale 1999]

Architectures for NLG

NLG System Architectures
[Figure: NLG architecture, from Jurafsky and Martin 2000]
[Figure: pipeline from Reiter and Dale 1997: Goal → Text Planner → Text Plan → Sentence Planner → Sentence Plan → Linguistic Realiser → Surface Text]
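The three stages named on this slide (text planner, sentence planner, linguistic realiser) can be pictured as functions chained in a pipeline. The following is only a minimal sketch under stated assumptions: the weather record, the message types `SKY` and `HIGH`, and all function names are hypothetical and are not taken from FoG or from the cited books.

```python
from typing import Dict, List

# Hypothetical input record; field names and values are assumptions for this sketch.
WEATHER = {"region": "Aberdeen", "sky": "cloudy", "high_c": 12}


def text_planner(goal: str, data: Dict) -> List[Dict]:
    """Goal -> text plan: decide which messages to convey and in what order."""
    if goal != "forecast":
        raise ValueError(f"unsupported goal: {goal}")
    return [
        {"type": "SKY", "region": data["region"], "value": data["sky"]},
        {"type": "HIGH", "region": data["region"], "value": data["high_c"]},
    ]


def sentence_planner(text_plan: List[Dict]) -> List[Dict]:
    """Text plan -> sentence plans: pick words and a subject/verb/complement frame."""
    plans = []
    for message in text_plan:
        if message["type"] == "SKY":
            plans.append({"subject": message["region"], "verb": "will be",
                          "complement": message["value"]})
        elif message["type"] == "HIGH":
            plans.append({"subject": "the high", "verb": "will be",
                          "complement": f'{message["value"]} degrees Celsius'})
    return plans


def linguistic_realiser(sentence_plans: List[Dict]) -> str:
    """Sentence plans -> surface text: order the words, capitalize, punctuate."""
    sentences = []
    for plan in sentence_plans:
        words = f'{plan["subject"]} {plan["verb"]} {plan["complement"]}'
        sentences.append(words[0].upper() + words[1:] + ".")
    return " ".join(sentences)


def generate(goal: str, data: Dict) -> str:
    """Run the three stages in sequence."""
    return linguistic_realiser(sentence_planner(text_planner(goal, data)))


print(generate("forecast", WEATHER))
```

Each stage consumes only the output of the previous one, which is what lets the stages be developed and replaced independently.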

Discourse Planner
- This component starts with a communicative goal and makes choices of:
  - Content selection
  - Discourse plan
  - Lexical selection
  - Micro planning (aggregation, referring expressions)
- It selects the content from the knowledge base and then structures that content appropriately
- The resulting discourse plan will specify all the choices made for the entire communication

Content Selection
Content selection is the process of deciding what information should be communicated in the text.
- It creates a set of MESSAGES from the underlying data sources
- The message-creation process and the form and content of the messages created are highly application-dependent
- Generally, messages are expressed in some formal language (e.g., Sentence Planning Language) with the notion of ENTITIES, CONCEPTS and RELATIONS in the domain

Content Selection (contd.)
For example: specific trains, places and times as entities; the property of being the next train as a concept; and departure as a relation between trains and times.

Message-id: msg01
Relation: IDENTITY
Arguments:
  arg1: NEXT-TRAIN
  arg2: CALEDONIAN-EXPRESS
"The next train is the Caledonian Express."

Message-id: msg02
Relation: DEPARTURE
Arguments:
  departure-entity: CALEDONIAN-EXPRESS
  departure-location: ABERDEEN
  departure-time: 1000
"The Caledonian Express leaves Aberdeen at 10 am."
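As a rough illustration of message creation, the sketch below builds the two messages shown on this slide from a small timetable. The `Message` class, the `TIMETABLE` rows (including the second, invented train) and the `select_content` function are assumptions made for this example; only the message fields mirror the slide.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class Message:
    """One message in the slide's format: an id, a relation, and named arguments."""
    message_id: str
    relation: str
    arguments: Dict[str, str]


# Hypothetical data source: (train, station, departure time as HHMM).
# EXAMPLE-TRAIN-2 is invented purely so that there is something to choose between.
TIMETABLE: List[Tuple[str, str, int]] = [
    ("CALEDONIAN-EXPRESS", "ABERDEEN", 1000),
    ("EXAMPLE-TRAIN-2", "ABERDEEN", 1130),
]


def select_content(timetable: List[Tuple[str, str, int]], now: int) -> List[Message]:
    """Decide what to say: identify the next train and state its departure."""
    upcoming = [row for row in timetable if row[2] >= now]
    if not upcoming:
        return []  # nothing to communicate
    train, station, time = min(upcoming, key=lambda row: row[2])
    return [
        Message("msg01", "IDENTITY", {"arg1": "NEXT-TRAIN", "arg2": train}),
        Message("msg02", "DEPARTURE", {
            "departure-entity": train,
            "departure-location": station,
            "departure-time": f"{time:04d}",
        }),
    ]


for message in select_content(TIMETABLE, now=900):
    print(message)
```

The messages carry no words yet; turning them into sentences is left to the later stages.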

Discourse Plan
- Discourse planning is the task of structuring the messages produced by the content selection process
- Two predominant mechanisms for building discourse structures:
  - Text schemata
  - Rhetorical relations

Text Schemata
[Figure: Knowledge base representation for saving a file as a simple procedural hierarchy]
[Figure: A schema for expressing procedures]
From [Jurafsky and Martin 2000]

Drawbacks of Text Schemata
- Impractical when the text being generated requires more structural variety and richness of expression. For example, we may want to express certain segments of the text in a different manner or in a different order.
- No higher-level structure relating the sentences to one another. For example, if we have already explained a process in some detail, we might not want to explain it again.

Rhetorical Relation
- Rhetorical Structure Theory (RST) is a descriptive theory of text organization based on the relationships that hold between parts of the text.
- Example:
  i) I love to collect classic automobiles. My favorite car is my 1899 Duryea.
  ii) I love to collect classic automobiles. My favorite car is my 2001 Toyota.
  In (i) the second sentence elaborates on the first; in (ii) it does not, because a 2001 Toyota is not a classic automobile.
- RST designates a central segment of text, called the nucleus, and a more peripheral segment, called the satellite.
- RST relations are defined in terms of the constraints placed on the nucleus, on the satellite, and on the combination of both.

Rhetorical Relation (contd.)

Name: Expand Purpose
Effect: (COMPETENT hearer (DO-ACTION ?action))
Constraints: (AND (c-get-all-substeps ?action ?sub-actions) (NOT (singular-list? ?sub-actions)))
Nucleus: (COMPETENT hearer (DO-SEQUENCE ?sub-actions))
Satellites: (((RST_PURPOSE (INFORM s hearer (DO ?action)))

Name: Expand Sub-Actions
Effect: (COMPETENT hearer (DO-SEQUENCE ?actions))
Constraints: NIL
Nucleus: (foreach ?actions (RST-SEQUENCE (COMPETENT hearer (DO-ACTION ?actions))))
Satellites: NIL

From [Jurafsky and Martin 2000]
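The two plan operators above can be read as a recursive expansion: an action with several sub-steps becomes a PURPOSE relation whose satellite states the action and whose nucleus is a SEQUENCE of the expanded sub-steps. The sketch below is a loose Python rendering of that idea, not the planner from the cited book; the `SUBSTEPS` table and its step names are hypothetical.

```python
from typing import Dict, List, Union

# Hypothetical procedural hierarchy: action -> ordered sub-steps.
SUBSTEPS: Dict[str, List[str]] = {
    "save-document": [
        "choose-save-option",
        "select-folder",
        "type-filename",
        "click-save-button",
    ],
}

Plan = Union[tuple, dict]


def expand(action: str) -> Plan:
    """Expand a goal (make the hearer competent to do `action`) into a plan tree."""
    sub_actions = SUBSTEPS.get(action, [])
    # Constraint of "Expand Purpose": sub-steps exist and the list is not singular.
    if len(sub_actions) > 1:
        return {
            "relation": "PURPOSE",
            # Satellite: tell the hearer what the whole sequence is for.
            "satellite": ("INFORM", action),
            # Nucleus ("Expand Sub-Actions"): a sequence of expanded sub-steps.
            "nucleus": {
                "relation": "SEQUENCE",
                "nuclei": [expand(step) for step in sub_actions],
            },
        }
    # In this sketch, anything else is treated as a primitive instruction.
    return ("DO-ACTION", action)


plan = expand("save-document")
print(plan["satellite"])
for step in plan["nucleus"]["nuclei"]:
    print("  ", step)
```

Because the result is a tree rather than a flat list, later stages can see which material is central (nucleus) and which is supporting (satellite).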

Lexical Selection
Lexical selection is the process of deciding which words and phrases should be used in order to transform the underlying messages into a readable text. Handling lexical selection requires that the generation system deal with two issues:
- It must be able to choose the appropriate lexical item when more than one alternative exists.
- It must be able to choose the appropriate grammatical form for the expression of the concept.
[Jurafsky and Martin 2000]
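The two issues named on this slide, choosing among alternative lexical items and choosing a grammatical form, can be pictured as two lookups. This is a deliberately tiny sketch: the `LEXICON` table, the register labels and the word choices are all assumptions made for illustration.

```python
from typing import Dict

# Hypothetical lexicon: concept -> register -> grammatical form -> wording.
LEXICON: Dict[str, Dict[str, Dict[str, str]]] = {
    "DEPARTURE": {
        "neutral": {"verb": "leaves", "noun": "leaving"},
        "formal": {"verb": "departs from", "noun": "departure from"},
    },
}


def choose_lexeme(concept: str, register: str = "neutral", form: str = "verb") -> str:
    """Pick one wording for a concept.

    `register` selects among alternative lexical items (issue 1);
    `form` selects the grammatical form of the chosen item (issue 2).
    """
    return LEXICON[concept][register][form]


print("The Caledonian Express", choose_lexeme("DEPARTURE"), "Aberdeen at 10 am.")
print("The Caledonian Express", choose_lexeme("DEPARTURE", register="formal"), "Aberdeen at 10 am.")
```

A real system would base the choice on context (audience, what has already been said) rather than on a fixed argument.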

Micro Planning
- Aggregation: the process of grouping messages together into sentences; not always necessary.
  Example: "You’ve just compiled and run a simple C program."
  Without aggregation: "You’ve just compiled a simple C program. You’ve just run a simple C program."
- Referring expressions: selecting words and phrases to identify entities (e.g. Caledonian Express or it or this train), generating deictic expressions.
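Both micro-planning tasks can be sketched in a few lines. The example below merges adjacent clauses that share a subject and an object, and picks a pronoun once an entity has already been mentioned. The clause representation and the function names `aggregate` and `refer` are assumptions for this sketch; real aggregation and reference rules are considerably richer.

```python
from typing import List, Set, Tuple

Clause = Tuple[str, str, str]  # (subject, verb, object)

# The slide's un-aggregated example, split into subject / verb / object.
CLAUSES: List[Clause] = [
    ("You've just", "compiled", "a simple C program"),
    ("You've just", "run", "a simple C program"),
]


def aggregate(clauses: List[Clause]) -> List[str]:
    """Merge adjacent clauses with the same subject and object by conjoining verbs."""
    merged: List[Tuple[str, List[str], str]] = []
    for subject, verb, obj in clauses:
        if merged and merged[-1][0] == subject and merged[-1][2] == obj:
            merged[-1][1].append(verb)  # same frame: add the verb to the group
        else:
            merged.append((subject, [verb], obj))
    return [f"{s} {' and '.join(verbs)} {o}." for s, verbs, o in merged]


def refer(entity: str, full_name: str, mentioned: Set[str]) -> str:
    """Use the full name on first mention and the pronoun 'it' afterwards."""
    if entity in mentioned:
        return "it"
    mentioned.add(entity)
    return full_name


print(aggregate(CLAUSES))

seen: Set[str] = set()
print(refer("CALEDONIAN-EXPRESS", "the Caledonian Express", seen))
print(refer("CALEDONIAN-EXPRESS", "the Caledonian Express", seen))
```

Clauses that differ in subject or object are left as separate sentences, which matches the slide's note that aggregation is not always necessary.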

Surface Realization
- This component receives the fully specified discourse plan and generates individual sentences as constrained by its lexicon and grammar
- If the plan specifies multiple-sentence output, the surface realizer is called multiple times
- There is no general consensus as to the level at which the input to the surface realizer should be specified
- Approach to surface realization: Functional Unification Grammar

Functional Unification Grammar
- Uses unification to manipulate and reason about feature structures
- The available grammar is unified with an input specification, which is represented with the same kind of feature structure
- The unification process takes the features specified in the input and unifies them with those in the grammar, producing a full feature structure which can then be linearized to form sentence output

Functional Unification Grammar (contd.)
- Sample output: "The system will save the document."
- Propositional content specification: a saving action done by a system entity to a document entity
- Specification of the grammatical form: a future tense assertion and lexical items (“save”, “system”, and “document”)

From [Jurafsky and Martin 2000]

Functional Unification Grammar (contd.)
Input (functional description):
  CAT      S
  ACTOR    [HEAD [LEX SYSTEM]]
  PROCESS  [HEAD [LEX SAVE]
            TENSE FUTURE]
  GOAL     [HEAD [LEX DOCUMENT]]
From [Jurafsky and Martin 2000]

From [Jurafsky and Martin 2000]
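To make the unify-then-linearize idea concrete, here is a toy sketch that represents feature structures as nested Python dictionaries. It is not FUF and not the grammar from the cited book: the `GRAMMAR_S` entry, its `PATTERN` feature and the `linearize` rules are assumptions invented for this example, and the lexical items are written in lower case so they can be printed directly.

```python
from typing import Any, Optional

# Input functional description, following the slide (lexical items lower-cased).
INPUT_FD = {
    "CAT": "S",
    "ACTOR": {"HEAD": {"LEX": "system"}},
    "PROCESS": {"HEAD": {"LEX": "save"}, "TENSE": "FUTURE"},
    "GOAL": {"HEAD": {"LEX": "document"}},
}

# Hypothetical one-rule grammar: a sentence is ACTOR, PROCESS, GOAL in that order.
GRAMMAR_S = {
    "CAT": "S",
    "ACTOR": {"CAT": "NP", "DET": "the"},
    "PROCESS": {"CAT": "VP"},
    "GOAL": {"CAT": "NP", "DET": "the"},
    "PATTERN": ("ACTOR", "PROCESS", "GOAL"),
}


def unify(a: Any, b: Any) -> Optional[Any]:
    """Unify two feature structures; return None if they conflict.

    Dictionaries are merged feature by feature; atomic values must be equal.
    (Atomic values are assumed never to be None in this sketch.)
    """
    if isinstance(a, dict) and isinstance(b, dict):
        result = dict(a)
        for feature, b_value in b.items():
            if feature in result:
                merged = unify(result[feature], b_value)
                if merged is None:
                    return None  # conflicting values: unification fails
                result[feature] = merged
            else:
                result[feature] = b_value
        return result
    return a if a == b else None


def linearize(fd: dict) -> str:
    """Read the words off a fully specified structure in PATTERN order."""
    words = []
    for role in fd["PATTERN"]:
        constituent = fd[role]
        if constituent["CAT"] == "NP":
            words += [constituent["DET"], constituent["HEAD"]["LEX"]]
        else:  # VP: realize future tense with the auxiliary "will"
            if constituent.get("TENSE") == "FUTURE":
                words.append("will")
            words.append(constituent["HEAD"]["LEX"])
    sentence = " ".join(words)
    return sentence[0].upper() + sentence[1:]


full_fd = unify(GRAMMAR_S, INPUT_FD)
print(linearize(full_fd))
```

The input supplies content (who does what, and the tense) while the grammar supplies form (categories, determiners, word order); unification is what combines the two into one structure.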

Reusable Surface Realization Packages
- FUF is a reusable package for generating English sentences from a grammar
  - The package is based on functional unification
  - If the grammar and the input are specified, the system constructs syntactically correct sentence output
- Drafter [Power et al., 1998] is a system to support the production of software documentation in English and French
  - It uses FUF for surface realization
  - It uses Rhetorical Structure Theory (RST) based planning for discourse planning

Evaluating Generation Systems
- In early work, the quality of an NLG system was assessed by the system builders themselves. If the system gave correct output, it was judged a success.
- Current practice:
  - Convene a panel of experts to judge the output of the generator in comparison with text produced by human authors
  - Judge how effective the generated text is at achieving its goal
[Jurafsky and Martin 2000]

Conclusions
- Many NLG applications are being investigated, but not all are successful. A few systems are in operational use, e.g., FoG.
- The evaluation of NLG systems is currently receiving much attention.
- In the late 1980s and early 1990s the trend was to construct reusable NLG systems, e.g., FUF. Now the trend is to port systems to other languages and platforms.

References
- Jurafsky D., and Martin J.H., “Speech and Language Processing, An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition”. Prentice Hall.
- Reiter E., and Dale R., “Building Applied Natural Language Generation”. Cambridge University Press.
- Reiter E., and Dale R., “Building Natural Language Generation Systems”. Cambridge University Press.
- Bateman J., and Zock M., “The B-to-Z of Natural Language Generation: an almost complete list.” Oxford Handbook of Computational Linguistics.
- Rambow O., Bangalore S., and Walker M., Natural Language Generation in Dialog Systems.
- Reiter E., and Dale R., Building Natural Language Generation System.
- Power R., Scott D., and Evans R., What You See Is What You Meant: direct knowledge editing with natural language feedback.
- Elhadad M., FUF: the Universal Unifier User Manual Version 5.2 FUF:

Demo on ILEX System

Thanks