Semantics: Semantic Grammar & Information Extraction CMSC 35100 Natural Language Processing May 13, 2003.

Slides:



Advertisements
Similar presentations
CS Morphological Parsing CS Parsing Taking a surface input and analyzing its components and underlying structure Morphological parsing:
Advertisements

Natural Language Processing Lecture 2: Semantics.
Semantics (Representing Meaning)
CMSC 723: Intro to Computational Linguistics November 24, 2004 Lecture 12: Lexical Semantics Bonnie Dorr Christof Monz.
Semi-Supervised Learning & Summary Advanced Statistical Methods in NLP Ling 572 March 8, 2012.
Statistical NLP: Lecture 3
Chapter 17. Lexical Semantics From: Chapter 17 of An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, by.
NLP and Speech 2004 Feature Structures Feature Structures and Unification.
Computational Semantics Ling 571 Deep Processing Techniques for NLP February 2, 2011.
Semantic Analysis Ling571 Deep Processing Techniques for NLP February 14, 2011.
CS 4705 Relations Between Words. Today Word Clustering Words and Meaning Lexical Relations WordNet Clustering for word sense discovery.
1/27 Semantics Going beyond syntax. 2/27 Semantics Relationship between surface form and meaning What is meaning? Lexical semantics Syntax and semantics.
CS 4705 Lecture Lexical Semantics. What is lexical semantics? Meaning of Words Lexical Relations WordNet Thematic Roles Selectional Restrictions Conceptual.
Basi di dati distribuite Prof. M.T. PAZIENZA a.a
November 20, 2003 Chapter 16 Lexical Semantics. Words have structured meanings Lexeme – a pairing of a form with a sense Orthographic form – the way the.
PSY 369: Psycholinguistics Some basic linguistic theory part3.
Natural Language Query Interface Mostafa Karkache & Bryce Wenninger.
تمرين شماره 1 درس NLP سيلابس درس NLP در دانشگاه هاي ديگر ___________________________ راحله مکي استاد درس: دکتر عبدالله زاده پاييز 85.
Lecture Lexical Semantics CS 4705.
CS 4705 Lexical Semantics. Today Words and Meaning Lexical Relations WordNet Thematic Roles Selectional Restrictions Conceptual Dependency.
Meaning and Language Part 1.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
March 1, 2009 Dr. Muhammed Al-Mulhem 1 ICS 482 Natural Language Processing INTRODUCTION Muhammed Al-Mulhem March 1, 2009.
Artificial Intelligence Research Centre Program Systems Institute Russian Academy of Science Pereslavl-Zalessky Russia.
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
Corpus-Based Approaches to Word Sense Disambiguation
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
Lecture 12: 22/6/1435 Natural language processing Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
“How much context do you need?” An experiment about context size in Interactive Cross-language Question Answering B. Navarro, L. Moreno-Monteagudo, E.
Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher Laura Po and Sonia Bergamaschi DII, University of Modena and Reggio Emilia, Italy.
Interpreting Dictionary Definitions Dan Tecuci May 2002.
IV. SYNTAX. 1.1 What is syntax? Syntax is the study of how sentences are structured, or in other words, it tries to state what words can be combined with.
10. Parsing with Context-free Grammars -Speech and Language Processing- 발표자 : 정영임 발표일 :
Semantic Analysis CMSC Natural Language Processing May 8, 2003.
Lexical Semantics Chapter 16
Natural Language Processing Artificial Intelligence CMSC February 28, 2002.
Lexical Semantics & Word Sense Disambiguation CMSC Natural Language Processing May 15, 2003.
WordNet: Connecting words and concepts Christiane Fellbaum Cognitive Science Laboratory Princeton University.
11 Chapter 19 Lexical Semantics. 2 Lexical Ambiguity Most words in natural languages have multiple possible meanings. –“pen” (noun) The dog is in the.
Collocations and Information Management Applications Gregor Erbach Saarland University Saarbrücken.
For Wednesday Read chapter 23 Homework: –Chapter 22, exercises 1,4, 7, and 14.
Topic #1: Introduction EE 456 – Compiling Techniques Prof. Carl Sable Fall 2003.
Rules, Movement, Ambiguity
Word Relations Slides adapted from Dan Jurafsky, Jim Martin and Chris Manning.
The meaning of Language Chapter 5 Semantics and Pragmatics Week10 Nov.19 th -23 rd.
Collocations and Terminology Vasileios Hatzivassiloglou University of Texas at Dallas.
Word Meaning and Similarity
CS 4705 Lecture 17 Semantic Analysis: Robust Semantics.
◦ Process of describing the structure of phrases and sentences Chapter 8 - Phrases and sentences: grammar1.
Group I: Native American Languages Laura Buszard-WelcherEmily Kidder Phil Cash CashPaul Kroeber Gene DealRuth Rouvier Brian FitzsimmonsLori Levin Julia.
NATURAL LANGUAGE PROCESSING
23.3 Information Extraction More complicated than an IR (Information Retrieval) system. Requires a limited notion of syntax and semantics.
SEMANTICS Chapter 10 Ms. Abrar Mujaddidi. What is semantics?  Semantics is the study of the conventional meaning conveyed by the use of words, phrases.
16 April 2011 Alan, Edison, etc, Saturday.. Knowledge, Planning and Robotics 1.Knowledge 2.Types of knowledge 3.Representation of knowledge 4.Planning.
Eugene Nida
Question Classification Ling573 NLP Systems and Applications April 25, 2013.
Lexicons, Concept Networks, and Ontologies
Lecture – VIII Monojit Choudhury RS, CSE, IIT Kharagpur
Statistical NLP: Lecture 3
ece 627 intelligent web: ontology and beyond
Semantics (Representing Meaning)
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Robust Semantics, Information Extraction, and Information Retrieval
Word Relations Slides adapted from Dan Jurafsky, Jim Martin and Chris Manning.
CSCI 5832 Natural Language Processing
CSC 594 Topics in AI – Applied Natural Language Processing
Word Relations Slides adapted from Dan Jurafsky, Jim Martin and Chris Manning.
Linguistic Essentials
Semantics Going beyond syntax.
Relations Between Words
Presentation transcript:

Semantics: Semantic Grammar & Information Extraction CMSC Natural Language Processing May 13, 2003

Potential Projects –Develop a morphological transducer for a portion of a language of your choice –Improve the output of a word-for-word translation procedure – say, by improved translation selection, fluency smoothing or other technique –Implement a keyword spotting system for a small vocabulary –Build a speech recognizer for a restricted domain, e.g. stock quotes –Improve the durational modeling of a base ASR system –Incorporate phrase boundary pause modeling in ASR –Build a commercial/music/report detector for audio –Describe and critique a set of at least three contemporary approaches to machine translation, information retrieval, or other area – Implement a finite-state style information extraction approach for a small domain of your choice

Roadmap Domain & Syntax-driven Semantics –Semantic Grammar Example Pros & Cons –Information Extraction Template filling –Cascaded grammar automata Goals & Limitations

Approach II: Semantic Grammars Issue: – Grammatical overkill Constituents with little (no) contribution to meaning Constituents so general that semantics are vacuous –Mismatch of locality Components scattered around tree Solution: Semantic Grammars –Developed for dialogue systems Tied to domain Exclude unnecessary elements

Semantic Grammar Example What do I have on Thursday? –CalQ -> What Aux UserP have {on} DateP Cal action:=find; CalOwner:= head UserP; Date:=head DateP; –UserP-> Pron Head:=Head Pron –Pron-> I Head:= USER –DateP -> Dayof Week Head:= sem DayofWeek

Semantic Grammar Pros & Cons Useful with ellipsis & anaphora –Restrict input by semantic class: e.g. DataP Issues: –Limited reuse Tied to application domain –Simple rules may overgenerate

Information Extraction Requirements: –Information specified within simple, fixed template Semantic analysis = Slot filling –Small proportion of text relevant Example: Message Understanding (MUC) –Joint venture info, terrorist reports –Template: –Relationship, Entities, Joint Venture Co, Activity, Amount »Activity: Company, Product, Start date Possible hierarchical template structure

Cascaded Automata & IE Typical approach: –Cascade of automata build up representation Match syntax & produce semantics Phases: –Multi-word unit recognizer: expressions, names, numbers »E.g. San Francisco Symphony Orchestra –Basic phrases: noun groups, verb groups, –Complex phrases: conjunctions –Semantic patterns: e.g. joint venture co »Often domain specific –Merging: Combine references to same entity

Information Extraction Issues Domain specificity: –Only extract small subset of available info –Rules tuned to vocabulary, grammar, domain Some systems use learning components –Generalize constructions, word use

Lexical Semantics & Word Sense Disambiguation CMSC Natural Language Processing May 13, 2003

Roadmap Lexical Semantics –Motivation: Word sense disambiguation –Meaning at the word level –Issues Ambiguity Meaning Meaning structure –Relations to other words –Subword meaning composition –WordNet: Lexical ontology

What is a plant? There are more kinds of plants and animals in the rainforests than anywhere else on Earth. Over half of the millions of known species of plants and animals live in the rainforest. Many are found nowhere else. There are even plants and animals in the rainforest that we have not yet discovered. The Paulus company was founded in Since those days the product range has been the subject of constant expansions and is brought up continuously to correspond with the state of the art. We’re engineering, manufacturing, and commissioning world-wide ready-to-run plants packed with our comprehensive know-how.

Lexical Semantics Focus on word meanings: –Relations of meaning among words Similarities & differences of meaning in sim context –Internal meaning structure of words Basic internal units combine for meaning Lexeme: meaning entry in lexicon –Orthographic form, phonological form, sense

Sources of Confusion Homonymy: –Words have same form but different meanings Generally same POS, but unrelated meaning E.g. bank (side of river) vs bank (financial institution) »Bank1 vs bank2 Homophones: same phonology, diff’t orthographic form »E.g. two, to, too Homographs: Same orthography, diff’t phonology Why? –Problem for applications: TTS, ASR transcription, IR

Sources of Confusion II Polysemy –Multiple RELATED senses E.g. bank: money, organ, blood,… –Big issue in lexicography # of senses, relations among senses, differentiation E.g. serve breakfast, serve Philadelphia, serve time

Relations between Words Synonymy: –“same meaning”: substitutability? –Issues: Polysemy – same as some sense Shades of meaning – other associations: –Price/fare Collocational constraints: e.g. babbling brook Register: social factors: e.g. politeness, formality Hyponomy: –Isa relations: More General (hypernym) vs more specific (hyponym) –E.g. dog vs golden retriever –Organize as ontology/taxonomy

WordNet Taxonomy Manually constructed lexical database –3 Tree-structured hierarchies Nouns, verbs, adjective+adverb Entries: synonym set, gloss, example use Relations between entries: –Synonymy: in synset –Hypo(per)nym: Isa tree Heavily used resource

Word-internal Structure Thematic roles: –Characterize verbs by their arguments E.g. transport: agent, theme, source, destination –They transported grain from the fields to the silo. Deep structure: passive / active: same roles Thematic hierarchy –E.g. agent > theme > source, dest Provide default surface positions –Tie to semantics (e.g. Levin): Interlinguas Cluster verb meanings by set of syntactic alternations Limitations: only NP,PP: other arguments predicates less

Selectional Restrictions Semantic constraints on filling of roles –E.g. Bill ate chicken Eat: Agent: animate; Theme: Edible –Associate with sense Most commonly of verb/event; possibly adj, noun… Specifying constraints: –Add a term to semantics, e.g. Isa(x,Ediblething) –Tie to position in WordNet All hyponyms inherit

Primitive Decompositions –Jackendoff(1990), Dorr(1999), McCawley (1968) Word meaning constructed from primitives –Fixed small set of basic primitives E.g. cause, go, become, kill=cause X to become Y –Augment with open-ended “manner” Y = not alive E.g. walk vs run Fixed primitives/Infinite descriptors