David McDonald BBN Technologies Flexibility counts more than precision.

Slides:



Advertisements
Similar presentations
Reader Response Journal
Advertisements

Texts and Digital Objects What seems to have changed.
Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 4 Shieber 1993; van Deemter 2002.
報告者:會計四 簡思佳 The process of converting a program written in a high-level language into a machine-executable form. language implementation Ex.C++
December 2003CSA3050: Natural Language Generation 1 What is Natural Language Generation? When is NLG an Appropriate Technology? NLG System Architectures.
ADGEN USC/ISI ADGEN: Advanced Generation for Question Answering Kevin Knight and Daniel Marcu USC/Information Sciences Institute.
Why?.  Discourse structure is a linguistic method.  The other linguistic methods are:  Lexis  Grammar  Pragmatics  Phonology  Graphology  Linguistic.
Natural Language Generation: Discourse Planning
Chapter 20: Natural Language Generation Presented by: Anastasia Gorbunova LING538: Computational Linguistics, Fall 2006 Speech and Language Processing.
The Unreasonable Effectiveness of Data Alon Halevy, Peter Norvig, and Fernando Pereira Kristine Monteith May 1, 2009 CS 652.
Search Engines and Information Retrieval
Start Let’s a r i o t s ur hing eading.
Subject Access in the Digital Age Presented by Carol Bradsher.
Information Retrieval in Practice
Properties of Text CS336 Lecture 3:. 2 Information Retrieval Searching unstructured documents Typically text –Newspaper articles –Web pages Other documents.
Natural Language Generation Ling 571 Fei Xia Week 8: 11/17/05.
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
E Marketing E Newsletter and E-Surveys Are They For You???
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
Luc Audrain Hachette Livre Head of digitalization
Processing of large document collections Part 3 (Evaluation of text classifiers, applications of text categorization) Helena Ahonen-Myka Spring 2005.
Search Engines and Information Retrieval Chapter 1.
Intellectual Property Rights and the Web: Common Myths Presentation by Jean Mistele.
Colin Clark, Fluid Project Technical Lead, Adaptive Technology Resource Centre, University of Toronto Bridging the Gap: Design & Development in Sakai.
1 Natural Language Processing Gholamreza Ghassem-Sani Fall 1383.
CERN – European Organization for Nuclear Research Administrative Information Services Demonstration of 3 HTMLDB-based applications 1 IT-AIS-HRMarch 17.
CMPD 434 MULTIMEDIA AUTHORING Chapter 06 Multimedia Authoring Process IV.
Personal Knowledge Management - SIIA 19 April 2006 Screenshots from Live Demo + Backup Greg Lloyd – President & Co-Founder Traction Software Inc. Providence,
The Information School at the University of Washington INFO 440: Information System Design Fall 2003 * Bob Boiko * MSIM Associate Chair XML Transforms.
© 2008 SpeechCycle, Inc. More Than Call Steering Managing Dynamic Contextual Complexity Phillip Hunter VP, Application Design/Development SpeechTek NYC.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
©2003 Paula Matuszek CSC 9010: Text Mining Applications Document Summarization Dr. Paula Matuszek (610)
A Cascaded Finite-State Parser for German Michael Schiehlen Institut für Maschinelle Sprachverarbeitung Universität Stuttgart
Accounting Costs, Profit, Contribution and break Even Analysis.
Artificial Intelligence: Natural Language
Introduction to Dialogue Systems. User Input System Output ?
Computational linguistics A brief overview. Computational Linguistics might be considered as a synonym of automatic processing of natural language, since.
Dirk Van CompernolleAtranos Workshop, Leuven 12 April 2002 Automatic Transcription of Natural Speech - A Broader Perspective – Dirk Van Compernolle ESAT.
CSC USI Class Meeting 10 November 9, 2010.
Number Sense Disambiguation Stuart Moore Supervised by: Anna Korhonen (Computer Lab)‏ Sabine Buchholz (Toshiba CRL)‏
Jette Viethen 20 April 2007NLGeval07 Automatic Evaluation of Referring Expression Generation is Possible.
Information Retrieval
An Evaluation Competition? Eight Reasons to be Cautious Donia Scott Open University & Johanna Moore University of Edinburgh.
Going Online Michael, Michelle, Phillip, Uyen.. What is “Going Online?” Online: Describes a system which is connected (generally electrically) to a larger.
Academic vs Media Discourse week 3 B. Mitsikopoulou.
BY DR. HAMZA ABDULGHANI MBBS,DPHC,ABFM,FRCGP (UK), Diploma MedED(UK) Associate Professor DEPT. OF MEDICAL EDUCATION COLLEGE OF MEDICINE June 2012 Writing.
Natural Language Generation with Tree Conditional Random Fields Wei Lu, Hwee Tou Ng, Wee Sun Lee Singapore-MIT Alliance National University of Singapore.
HiST The foundation for a dialogic grammar Ivar Tormod Berg Ørstavik.
CS10: The Beauty and Joy of Computing Artificial Intelligence Anna Rafferty (Slides adapted from Dan Garcia) 3 April 2013.
Speech Processing 1 Introduction Waldemar Skoberla phone: fax: WWW:
NATURAL LANGUAGE PROCESSING
Jacob (Jack) Gryn - Presented November 28, Semi-Structured Data and XML.
Getting Your Content in the Penn State Student Portal Presented By James Leous, Program Manager James Vuccolo, Lead Research Programmer.
Virtual LSP workshop 21st century skills in focus Budapest Business School College of Commerce, Catering and Tourism Language Department Asztalos Réka.
1 Personalizing Search via Automated Analysis of Interests and Activities Jaime Teevan, MIT Susan T. Dumais, Microsoft Eric Horvitz, Microsoft SIGIR 2005.
Tasneem Ghnaimat. Language Model An abstract representation of a (natural) language. An approximation to real language Assume we have a set of sentences,
Information Retrieval in Practice
Non-fiction and Media Higher Tier.
Explanatory Essay.
ENG1120K Review Class.
PRESENTED BY: PEAR A BHUIYAN
Developing graphic design skills
Artificial Intelligence for Speech Recognition
Machine Learning Ali Ghodsi Department of Statistics
Walking Through Grade 10 English
Communicative Competence (Canale and Swain, 1980)
Communicative Competence (Canale and Swain, 1980)
Inclusive Communication Hub
Communicating and Adapting Language task
Understand the Written Communication rubric
Presentation transcript:

David McDonald BBN Technologies Flexibility counts more than precision

April 20th, 2007ST&CE NLG workshop2 $$$ Any serious effort depends on funding (e.g. RAGS) Participating in an unfunded shared task isn’t reasonable (for me) Contributing resources and discussions on a Wiki is fine –Mumble-86: TAG-based surface realizer But BBN is a law firm: –We have to bill hours

April 20th, 2007ST&CE NLG workshop3 The price of entry A generator’s output should be as good as a person’s –Always grammatical –Fits the discourse context. –Fluent and cohesive Pronouns, reduced NPs, conjunction reduction, rich repertoire of clause combinations, etc. Real sources –Doing real work for real (commercial) systems

April 20th, 2007ST&CE NLG workshop4 Variation is essential Even read text will vary (bedtime stories) People rarely phrase their content the same way every time. A generator that didn’t vary its output would be unnatural. –Synthetic characters for game-based training (NPCs) have to be realistic

April 20th, 2007ST&CE NLG workshop5 These have the same “content” “Apple profits surge on iPod sales” »Clickable title next to graphic on first page “Apple reports a 78% jump in quarterly profits thanks to strong Christmas sales of its iPod digital music player” »Summary below the graphic “Apple has reported a 78% surge in profits for the three months to 30 December, boosted by strong Christmas sales of its iPod digital music player” »Same content in the full article BBC web page: 1/17/07

April 20th, 2007ST&CE NLG workshop6 Shared data: real texts & contexts Start with actual texts that are judged to have the same ‘content’ –Varying in purpose, level of detail, randomly, … Parse the texts back far enough that the same source could produce all the variations Every player can use their favorite representation It’s a natural task in its own right

April 20th, 2007ST&CE NLG workshop7 “Killer app” Regenerating content in different styles for different purposes –Initiatives in ‘deep’ NLU: “Machine Reading”, “Learning by Reading” E.g. Rough-cut intelligence analyst’s reports –Read the raw texts of hundreds of news sources, military movement messages, field reports, … –Provide tailored summaries (hypertexts) –Gets a handle on the intelligence overload

April 20th, 2007ST&CE NLG workshop8 Questions? Comments?

April 20th, 2007ST&CE NLG workshop9 backup

April 20th, 2007ST&CE NLG workshop10 Flexible natural language generation using linguistic templates Provide easy authoring of fluent prose while keeping simplicity of variable substitution seen in XSLT string templates. Explicit linguistic form enables optimizations, provides structure needed for synthetic speech. Automatically build the templates by parsing ex. with reversible grammar. Schematic variation given the context. “X was in the room” Parser x = past/be/in-room(x) R R (Alice) & R (Bob) NLG Alice and Bob were in the room. Alice was in the room and so was Bob. Alice was in the room and Bob was there too. Alice and Bob were both there. Where were Alice and Bob? In the room.