1 Deryle Lonsdale, Jeremiah McGhee, Nathan Glenn, and Tory Anderson.

Slides:



Advertisements
Similar presentations
School of something FACULTY OF OTHER School of Computing FACULTY OF ENGINEERING Chunking: Shallow Parsing Eric Atwell, Language Research Group.
Advertisements

CILC2011 A framework for structured knowledge extraction and representation from natural language via deep sentence analysis Stefania Costantini Niva Florio.
1 Update on Learning By Observation Learning from Positive Examples Only Tolga Konik University of Michigan.
Non Monotonic Reasoning for Disambiguation and Parsing Arthi Murugesan.
LING 581: Advanced Computational Linguistics Lecture Notes January 19th.
Shallow Parsing CS 4705 Julia Hirschberg 1. Shallow or Partial Parsing Sometimes we don’t need a complete parse tree –Information extraction –Question.
Generation Miriam Butt January The Two Sides of Generation 1) Natural Language Generation (NLG) Systems which take information from some database.
By Neng-Fa Zhou Compiler Construction CIS 707 Prof. Neng-Fa Zhou
Introduction to treebanks Session 1: 7/08/
NaLIX: A Generic Natural Language Search Environment for XML Data Presented by: Erik Mathisen 02/12/2008.
Are Linguists Dinosaurs? 1.Statistical language processors seem to be doing away with the need for linguists. –Why do we need linguists when a machine.
Soar NL-Soar update Deryle Lonsdale BYU Linguistics
1/13 Parsing III Probabilistic Parsing and Conclusions.
January 12, Compiler Design Hongwei Xi Comp. Sci. Dept. Boston University.
Introduction to CL Session 1: 7/08/2011. What is computational linguistics? Processing natural language text by computers  for practical applications.
January 19, Compiler Design Hongwei Xi Comp. Sci. Dept. Boston University.
Bootstrapping an Ontology-based Information Extraction System Alexander Maedche, Günter Neumann, Steffen Staab (presented by D. Lonsdale) CS 652 – June.
©TheMcGraw-Hill Companies, Inc. Permission required for reproduction or display. COMPSCI 125 Introduction to Computer Science I.
Resources Primary resources – Lexicons, structured vocabularies – Grammars (in widest sense) – Corpora – Treebanks Secondary resources – Designed for a.
Comments on Guillaume Pitel: “Using bilingual LSA for FrameNet annotation of French text from generic resources” Gerd Fliedner Computational Linguistics.
Soar Progress on NL-Soar, and Introducing XNL-Soar Deryle Lonsdale, Jamison Cooper-Leavitt, and Warren Casbeer ( and the rest of the BYU NL-Soar.
A New Soar Debugger In Java Douglas Pearson
Programming. Software is made by programmers Computers need all kinds of software, from operating systems to applications People learn how to tell the.
1 Deryle Lonsdale, Jeremiah McGhee, Nathan Glenn, and Tory Anderson.
Visualization By: Simon Luangsisombath. Canonical Visualization  Architectural modeling notations are ways to organize information  Canonical notation.
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
Principles of Programming Chapter 1: Introduction  In this chapter you will learn about:  Overview of Computer Component  Overview of Programming 
ELN – Natural Language Processing Giuseppe Attardi
Invitation to Computer Science 5th Edition
9/8/20151 Natural Language Processing Lecture Notes 1.
Operational Capability: An underlying simplification of a data encoding standard has been developing over the past decade and is being demonstrated in.
April 2005CSA2050:NLTK1 CSA2050: Introduction to Computational Linguistics NLTK.
BTANT 129 w5 Introduction to corpus linguistics. BTANT 129 w5 Corpus The old school concept – A collection of texts especially if complete and self-contained:
Interference in Short-Term Memory The Magical Number Two (or Three) in Sentence Processing ` (Sat.) / Chan-hoon Park Hypernetwork Models of Learning.
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
Some Thoughts on HPC in Natural Language Engineering Steven Bird University of Melbourne & University of Pennsylvania.
Chapter 1. Introduction.
Chapter 1 Introduction Dr. Frank Lee. 1.1 Why Study Compiler? To write more efficient code in a high-level language To provide solid foundation in parsing.
Chapter 10: Compilers and Language Translation Invitation to Computer Science, Java Version, Third Edition.
ATLAS Demystified: A Practical Introduction Christophe Laprun, Jonathan Fiscus, John Garofolo, Sylvain Pajot National Institute of Standards and Technology.
A Web Application for Customized Corpus Delivery Nancy Ide, Keith Suderman, Brian Simms Department of Computer Science Vassar College USA.
The Software Development Life Cycle. Software Development SDLC The Software Development Life-Cycle Sometimes called the program development lifecycle.
Soar and Construction Grammar Peter Lindes, Deryle Lonsdale, David Embley Brigham Young University 2014 Soar Workshop © 2014 Peter Lindes 6/19/2014PL 2014.
Ontology-based Information Extraction with a Cognitive Agent Peter Lindes 1, Deryle Lonsdale, David Embley Brigham Young University AAAI Now at.
1 CSI 5180: Topics in AI: Natural Language Processing, A Statistical Approach Instructor: Nathalie Japkowicz Objectives of.
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
Future Memory Research in Soar Nate Derbinsky University of Michigan.
Introduction to Dialogue Systems. User Input System Output ?
Towards the better software metrics tool motivation and the first experiences Gordana Rakić Zoran Budimac.
Natural Language Programming David Vadas The University of Sydney Supervisor: James Curran.
1 Compiler Construction (CS-636) Muhammad Bilal Bashir UIIT, Rawalpindi.
© NCSR, Frascati, July 18-19, 2002 WP1: Plan for the remainder (1) Ontology Ontology  Use of PROTÉGÉ to generate ontology and lexicons for the 1 st domain.
Compiler Construction (CS-636)
07/10/2007 VDCT Status Update EPICS Collaboration, October 2007 Knoxville, Tennessee
CS460/IT632 Natural Language Processing/Language Technology for the Web Lecture 1 (03/01/06) Prof. Pushpak Bhattacharyya IIT Bombay Introduction to Natural.
Supertagging CMSC Natural Language Processing January 31, 2006.
Generality and Openness in Enabling Methodologies for Morphology and Text Processing Anssi Yli-Jyrä Department of General Linguistics, University of Helsinki.
CPSC 422, Lecture 27Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 27 Nov, 16, 2015.
OntoSoar: Soar Finds Facts in Text Peter Lindes, Deryle Lonsdale, David Embley Brigham Young University 33 rd Soar Workshop, June 2013 pl 6/6/201333rd.
Concepts and Realization of a Diagram Editor Generator Based on Hypergraph Transformation Author: Mark Minas Presenter: Song Gu.
Syntax.
ICS312 Introduction to Compilers Set 23. What is a Compiler? A compiler is software (a program) that translates a high-level programming language to machine.
A Simple English-to-Punjabi Translation System By : Shailendra Singh.
Website Source Code Free Download.
Cognitive Language Processing for Rosie
Algorithm and Ambiguity
Cognitive Language Comprehension in Rosie
Computational Linguistics: New Vistas
Knowledge Representation for Natural Language Understanding
Presentation transcript:

1 Deryle Lonsdale, Jeremiah McGhee, Nathan Glenn, and Tory Anderson

 Language involves interesting cognition  Language/task integrations compelling  Lots of theories, microtheories  Linguists aren’t always grounded  Gives students experience  Carrying forward prior work  Little CL/NLP interaction with modeling

 NL-Soar: language modeling system  Natural language parsing: syntax  Problematic and difficult structures  Semantics (lexical conceptual structure)  Generation (concepts  text)  Discourse (discourse recipes)  Language translation (English  French)  Task integrations  WordNet

 NL-Soar versions  Soar 6: mid/late 1990’s  Soar 7: early 2000’s  Soar 8: late 2000’s  Extensive chunking  Extremely deep stacks  Many different operators  Soar 8: ran in Soar 7 mode

 XNL-Soar: upgrade, re-do NL-Soar  More current syntax (Minimalist Program)  Soar 9 architecture (finally!)  Soar 9.3

 Reduction of total # productions by an order of magnitude  C interface  Tcl /C interface  Java interface (JAWS interface)  Several intermediate versions ( )  SMem version too

 X-windows code  Low-level, tightly integrated with agent  CLIG: computational linguistics interactive grapher  Nice primitives, good export capability  Not well maintained/disseminated, Tcl  GraphViz  Very versatile  Various output formats

 Several thousand productions to a few hundred  Updated syntax  Soar architectural improvements  Easier system installation  Eclipse, SVN  Students writing code

 Ambiguity is pervasive in natural language; need to:  Avoid it when possible (strategies)  Detect it when present  Disambiguate it when necessary  Two types of ambiguity  PP attachment  Present participle / gerund

 I saw a man with a moustache. (N attach)  I saw a man with my own eyes. (V attach)  I saw a man with a telescope. (N/V attach)  Training: corpus of WSJ PP attachments  Test: annotated Brown Corpus sentences  6 different strategies  SMem boosts, generalizes

?

 My favorite cousin was singing. (participle)  My favorite activity was singing. (gerund)  His business is advertising. (both possible)  Exemplars mined from Penn treebank  SMem helps, need more semantics

 Snips: undo erroneous hypotheses  Semantics  Semsnips: semantic snips  Finish semantic graphing  French  Japanese

 No learning/chunking  Walled off from other tasks  Learning how to code Soar  Getting used to Soar 9

 Translation  Discourse  Generation  Chunking?

COAL  Still not public-ready  Some more linguistics to implement  No chunking yet  No activation, decay NUGGETS  Up-to-date after years  More easily deployable  SMem and EpMem  More streamlined tools  Evaluation with real- language use cases  Hundreds of sentences parsed