Semantics and Context in Natural Language Processing (NLP)
Ari Rappoport, The Hebrew University

Form vs. Meaning
Solid NLP progress with statistical learning
OK for classification, search, prediction
  – Prediction: language models in speech to text
But all this is just form (lexicon, syntax)
Language is for expressing meanings (semantics)
  – (i) Lexical, (ii) sentence, (iii) interaction

(i) Lexical Semantics
Relatively context independent
Sensory feature conjunction: house, hungry, guitar
  – Non-linguistic semantic machine learning: face identification in photographs
Categories: is-a: a smartphone is a (type of) product, iPhone is a (type of) smartphone
Configurations: part-of: engine:car, places
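
The is-a and part-of relations on this slide could be stored as simple lookup tables. The following is a minimal sketch, not something from the talk: the dictionaries `IS_A` and `PART_OF` and the function `is_a` are hypothetical names, and treating is-a as transitive (iPhone is a smartphone, so iPhone is a product) is an assumption added for illustration.

```python
# Toy relation tables built from the slide's examples (hypothetical structure).
IS_A = {"iphone": "smartphone", "smartphone": "product"}
PART_OF = {"engine": "car"}


def is_a(item, category):
    """Return True if `category` is reachable from `item` by following is-a links."""
    seen = set()
    current = item
    while current in IS_A and current not in seen:
        seen.add(current)  # guard against accidental cycles in the table
        current = IS_A[current]
        if current == category:
            return True
    return False


iphone_is_product = is_a("iphone", "product")
product_is_iphone = is_a("product", "iphone")
```

Here `iphone_is_product` is true and `product_is_iphone` is false, since the links are followed in one direction only.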

Generic Relationships
Medicine : Illness
  – Hunger : Thirst
  – Love : Treason
  – Law : Anarchy
  – Stimulant : Sensitivity
You use X in a way W, to do V to some Z, at a time T and a place P, for the purpose S, because of B, causing C, …

Flexible Patterns
X w v Y : countries such as France
  – Davidov & Rappoport (ACL, EMNLP, COLING, etc.)
Content words (C), high-frequency words (H)
Meta-patterns: CHHC, CHCHC, HHC, etc.
Fully unsupervised, general
Efficient hardware filtering, clustering
Categories, SAT exams, geographic maps, numerical questions, etc.
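
To make the meta-pattern idea concrete, here is a minimal sketch in which each token is labeled H (high-frequency word) or C (content word) and a window of tokens matches when its labels spell the meta-pattern. This is an illustration under assumptions, not the published algorithm: the toy corpus, the count threshold `h_min_count`, and the function names are all hypothetical, and a real system would derive its frequency thresholds from a large corpus.

```python
from collections import Counter


def label_tokens(tokens, counts, h_min_count=2):
    """Label each token 'H' if it is frequent in the corpus, else 'C'."""
    return "".join("H" if counts[t] >= h_min_count else "C" for t in tokens)


def find_meta_pattern(tokens, counts, meta="CHHC", h_min_count=2):
    """Return every token window whose H/C labels equal the meta-pattern."""
    labels = label_tokens(tokens, counts, h_min_count)
    n = len(meta)
    return [
        tokens[i:i + n]
        for i in range(len(tokens) - n + 1)
        if labels[i:i + n] == meta
    ]


# Toy corpus (assumption): "such" and "as" recur, so they are labeled H.
corpus = [
    "countries such as france",
    "fruits such as apples",
    "cities such as madrid",
]
sentences = [s.split() for s in corpus]
counts = Counter(t for s in sentences for t in s)

matches = [m for s in sentences for m in find_meta_pattern(s, counts)]
# The two C slots of each CHHC match give a candidate (category, member) pair.
pairs = [(m[0], m[-1]) for m in matches]
```

No pattern such as "X such as Y" is written by hand here; the H words that fill the middle slots are picked out by frequency alone, which is the sense in which the approach is unsupervised.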

Ambiguity
Relative context independence does not solve ambiguity
Apple: fruit, company
Madrid: Spain, New Mexico
Which one is more relevant?
Context must be taken into account
  – Language use is always contextual
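
One simple way to take context into account is to pick the sense whose cue words overlap most with the surrounding words. The sketch below is a simplified overlap heuristic in the spirit of Lesk-style disambiguation, not a method described in the talk. The sense inventory `SENSES`, its cue words, and the function `disambiguate` are hypothetical.

```python
# Hypothetical sense inventory: each sense is described by a few cue words.
SENSES = {
    "apple": {
        "fruit": {"eat", "juice", "tree", "pie"},
        "company": {"iphone", "stock", "store", "software"},
    },
    "madrid": {
        "spain": {"spain", "europe", "prado", "flight"},
        "new mexico": {"mexico", "santa", "fe", "turquoise"},
    },
}


def disambiguate(word, context_tokens):
    """Pick the sense whose cue words overlap most with the context.

    Ties (including zero overlap) fall back to the first listed sense.
    """
    senses = SENSES[word]
    context = {t.lower() for t in context_tokens}
    return max(senses, key=lambda s: len(senses[s] & context))


sense_a = disambiguate("apple", "the new apple iphone is in the store".split())
sense_b = disambiguate("apple", "she baked an apple pie".split())
sense_c = disambiguate("madrid", "we drove from Santa Fe to Madrid".split())
```

With these toy cue words, `sense_a` is "company", `sense_b` is "fruit", and `sense_c` is "new mexico".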

(ii) Sentence Semantics
The basic meaning expressed by (all) languages: argument structure scenes
Dynamic or static relations between participants; elaborators; connectors; embeddings:
  – John kicked the red ball
  – Paul and Anne walked slowly in the park
  – She remembered John's singing

Several Scenes
Linkers: cause, purpose, time, conditionality
  – He went there to buy fruits; Before they arrived, the party was very quiet; If X then Y
Ground: referring to the speech situation
  – In my opinion, machine learning is the greatest development in computer science since FFT [and neither was done by computer scientists]
Career, Peace, Democracy

Sentence Semantics in NLP
Mostly manual: FrameNet, PropBank
Unsupervised algorithms
  – Argument identification, Abend & Rappoport (ACL 2010)
Question Answering
  – Bag of words (lexical semantics)
Machine Translation
  – Rewriting of forms (alignment, candidates, target language model)

Extreme Semantic Application
Tweet Sentiment Analysis
  – Schwartz (Davidov, Tsur) & Rappoport 2010, 2013
Coarse semantics: 2 categories (40)
Short texts, no word lists; fixed patterns

(iii) Interaction Semantics
Understanding means having enough information to DO something
  – The brain's main cycle
Example: human-system interaction
Full context dependence
  – Relevance to your current situation

Interaction Examples
Searching Argo, did you mean
  – The plot? Reviews? Where and/or when to watch?
Chinese restaurant
  – The best in the country? In town? The nearest to you? The best deal?
There are hints:
  – Location (regular, irregular); time (lunch?)

Interaction Directions
Extending flexible patterns:
  – Include Text-Action H and C items (words, actions)
Action:
  – represented as User Interface operations
Shortcut: bag of words (lexical semantics) + current context. Ignore sentence semantics
Noise, failure (Siri, maps, …)
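
The "bag of words + current context" shortcut might look roughly like the sketch below: each candidate user-interface action is scored by how many query words it shares and how many context hints it matches. Everything here is an assumption made for illustration: the action names, the `keywords` and `when` fields, the context keys `meal_time` and `location`, and the equal weighting of lexical and contextual evidence.

```python
# Hypothetical candidate UI actions with keyword bags and preferred context.
ACTIONS = [
    {
        "name": "show_nearest_restaurants",
        "keywords": {"chinese", "restaurant"},
        "when": {"meal_time": True, "location": "regular"},
    },
    {
        "name": "show_best_in_country",
        "keywords": {"chinese", "restaurant"},
        "when": {"location": "irregular"},
    },
    {
        "name": "show_movie_reviews",
        "keywords": {"argo", "reviews"},
        "when": {},
    },
]


def score(action, query_tokens, context):
    """Lexical overlap with the query plus the number of matching context hints."""
    lexical = len(action["keywords"] & set(query_tokens))
    contextual = sum(1 for k, v in action["when"].items() if context.get(k) == v)
    return lexical + contextual


def choose_action(query, context):
    """Pick the best-scoring action; word order and syntax are ignored."""
    tokens = query.lower().split()
    return max(ACTIONS, key=lambda a: score(a, tokens, context))["name"]


at_lunch_near_home = choose_action(
    "Chinese restaurant", {"meal_time": True, "location": "regular"}
)
while_travelling = choose_action(
    "Chinese restaurant", {"meal_time": False, "location": "irregular"}
)
```

The same two-word query leads to different actions depending on the location and time hints, and no sentence-level analysis is involved.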

Summary
Lexical, sentence, and interaction semantics
Applications are possible using all levels
As relevance to life grows, so do requirements from algorithms
Both sentence and interaction semantics necessary for future smart applications
Current focus: sentence semantics