Kees van Deemter (AC, Dec 09) Vagueness Facilitates Search Kees van Deemter Computing Science University of Aberdeen.

Slides:



Advertisements
Similar presentations
A Semantic Model for Vague Quantifiers Combining Fuzzy Theory and Supervaluation Theory Ka Fat CHOW The Hong Kong Polytechnic University The title of.
Advertisements

P.Johnson 1 Chapter 4 Experimental Research Designs.
Commentary on Katalin Balog, In defense of the phenomenal concept strategy Assistant Prof. István Aranyosi, Bilkent University, Ankara, Turkey.
Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 4 Shieber 1993; van Deemter 2002.
Generation of Referring Expressions: the State of the Art SELLC Summer School, Harbin 2010 Kees van Deemter Computing Science University of Aberdeen.
Charting the Potential of Description Logic for the Generation of Referring Expression SELLC, Guangzhou, Dec Yuan Ren, Kees van Deemter and Jeff.
Generation of Referring Expressions: the State of the Art SELLC Winter School, Guangzhou 2010 Kees van Deemter Computing Science University of Aberdeen.
Van Deemter, WORD, May 2010 Not Exactly Vagueness as Original Sin? Kees van Deemter University of Aberdeen.
Consider... [[Tall(John) Tall(John)]] [[Tall(John)]] = undecided, therefore [[Tall(John) Tall(John)]] = undecided.
Kees van Deemter (SSE, Jan '10) Vagueness Facilitates Search Kees van Deemter Computing Science University of Aberdeen.
CS1512 Foundations of Computing Science 2 Week 3 (CSD week 32) Probability © J R W Hunter, 2006, K van Deemter 2007.
Vagueness: a problem for AI Kees van Deemter University of Aberdeen Scotland, UK.
Kees van Deemter, Dublin, Trinity College, May 2009 What utility can do for NLG: the case of vague language Kees van Deemter University of Aberdeen Scotland,
Brief Introduction to Logic. Outline Historical View Propositional Logic : Syntax Propositional Logic : Semantics Satisfiability Natural Deduction : Proofs.
Active Learning with Feedback on Both Features and Instances H. Raghavan, O. Madani and R. Jones Journal of Machine Learning Research 7 (2006) Presented.
CS4018 Formal Models of Computation weeks Computability and Complexity Kees van Deemter (partly based on lecture notes by Dirk Nikodem)
Presentation on Artificial Intelligence
The influence of domain priors on intervention strategy Neil Bramley.
Artificial Intelligence Chapter 13 The Propositional Calculus Biointelligence Lab School of Computer Sci. & Eng. Seoul National University.
1 Functions and Applications
Fuzzy Sets and Fuzzy Logic
Probabilistic Information Retrieval Part I: Survey Alexander Dekhtyar department of Computer Science University of Maryland.
Fuzzy Expert System  An expert might say, “ Though the power transformer is slightly overloaded, I can keep this load for a while”.  Another expert.
Is There a Discrete Analog of the Median Voter Theorem? James Bradley Calvin College Public Choice 2003.
CLASSICAL LOGIC and FUZZY LOGIC. CLASSICAL LOGIC In classical logic, a simple proposition P is a linguistic, or declarative, statement contained within.
On Status and Form of the Relevance Principle Anton Benz, ZAS Berlin Centre for General Linguistics, Typology and Universals Research.
CS344 Artificial Intelligence Prof. Pushpak Bhattacharya Class on 6 Mar 2007.
Kees van Deemter (Guangzhou WS, Dec '10) Why Be Vague? Kees van Deemter Computing Science University of Aberdeen.
AI TECHNIQUES Fuzzy Logic (Fuzzy System). Fuzzy Logic : An Idea.
Peter Gärdenfors Why must language be vague?. Philosophers since Leibniz have dreamt of a precise language Vagueness is a design feature of natural language.
Albert Gatt LIN1180 – Semantics Lecture 10. Part 1 (from last week) Theories of presupposition: the semantics- pragmatics interface.
CUSTOMER NEEDS ELICITATION FOR PRODUCT CUSTOMIZATION Yue Wang Advisor: Prof. Tseng Advanced Manufacturing Institute Hong Kong University of Science and.
Multilingual Text Retrieval Applications of Multilingual Text Retrieval W. Bruce Croft, John Broglio and Hideo Fujii Computer Science Department University.
Game-theoretic probability and its applications Glenn Shafer December 14, 2009 Subjective Bayes 2009 Warwick University.
Audit Tests: Risk, Confidence and Materiality Some basic statistics about Inherent Risk and Control Risk.
BCOR 1020 Business Statistics Lecture 20 – April 3, 2008.
Slides 00 1 Advanced Logics Part 1: Lambda Calculus and Type Theory Lecturer: Lim Yohanes Stefanus Part 2: Modal Logic Lecturer: Gregory Wheeler.
Ragionamento in condizioni di incertezza: Approccio fuzzy Paolo Radaelli Corso di Inelligenza Articifiale - Elementi.
Fuzzy Logic BY: ASHLEY REYNOLDS. Where Fuzzy Logic Falls in the Field of Mathematics  Mathematics  Mathematical Logic and Foundations  Fuzzy Logic.
Metaphor Analysis in Social Science: The problem Lynne Cameron and Rob Maslen.
FUZZY LOGIC Babu Appat. OVERVIEW What is Fuzzy Logic? Where did it begin? Fuzzy Logic vs. Neural Networks Fuzzy Logic in Control Systems Fuzzy Logic in.
N What do you expect from Science? n Science is a process, not a thing. n Science is a paradigm based on understanding nature (rather than passively observing.
Lecture 1 Note: Some slides and/or pictures are adapted from Lecture slides / Books of Dr Zafar Alvi. Text Book - Aritificial Intelligence Illuminated.
CORRELATION & REGRESSION
Fuzzy Logic. Lecture Outline Fuzzy Systems Fuzzy Sets Membership Functions Fuzzy Operators Fuzzy Set Characteristics Fuzziness and Probability.
NOTES The Normal Distribution. In earlier courses, you have explored data in the following ways: By plotting data (histogram, stemplot, bar graph, etc.)
Chapter 2 Developmental Psychology A description of the general approach to behavior by developmental psychologists.
N o, you don’t understand, I mean… Irini Nomikou supervisor: Dr. Floriana Grasso The one with the conductor and the girl on the train Cond: Did you pay.
INTEGRALS 5. INTEGRALS In Chapter 2, we used the tangent and velocity problems to introduce the derivative—the central idea in differential calculus.
Integrals  In Chapter 2, we used the tangent and velocity problems to introduce the derivative—the central idea in differential calculus.  In much the.
Copyright © 2012 Pearson Education. All rights reserved © 2010 Pearson Education Copyright © 2012 Pearson Education. All rights reserved. Chapter.
A Behavioral Science and Research Perspective. What Behavioral sciences do you think are involved to Organizational Behavior?
Discrete Random Variables
Uncertainty Management in Rule-based Expert Systems
Hypothesis Testing An understanding of the method of hypothesis testing is essential for understanding how both the natural and social sciences advance.
FUZZY LOGIC INFORMATION RETRIEVAL MODEL Ferddie Quiroz Canlas, ME-CoE.
Ngram models and the Sparcity problem. The task Find a probability distribution for the current word in a text (utterance, etc.), given what the last.
Theories and Hypotheses. Assumptions of science A true physical universe exists Order through cause and effect, the connections can be discovered Knowledge.
Corpus-based evaluation of Referring Expression Generation Albert Gatt Ielka van der Sluis Kees van Deemter Department of Computing Science University.
Jhu-hlt-2004 © n.j. belkin 1 Information Retrieval: A Quick Overview Nicholas J. Belkin
Optimal answers and their implicatures A game-theoretic approach Anton Benz April 18 th, 2006.
Textbook Basics of an Expert System: – “Expert systems: Design and Development,” by: John Durkin, 1994, Chapters 1-4. Uncertainty (Probability, Certainty.
Transient Unterdetermination and the Miracle Argument Paul Hoyningen-Huene Leibniz Universität Hannover Center for Philosophy and Ethics of Science (ZEWW)
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Fuzzy Logic Artificial Intelligence Chapter 9. Outline Crisp Logic Fuzzy Logic Fuzzy Logic Applications Conclusion “traditional logic”: {true,false}
Computational Intelligence: Methods and Applications Lecture 26 Density estimation, Expectation Maximization. Włodzisław Duch Dept. of Informatics, UMK.
Western Civilization to c CIV Professor Ed Lamoureux Day 2 January 22, 2016 Two approaches to analyzing cultural history Remember: I put these.
Chapter 7 The electronic theory of metal Objectives At the end of this Chapter, you should: 1. Understand the physical meaning of Fermi statistical distribution.
A Probabilistic Quantifier Fuzzification Mechanism: The Model and Its Evaluation for Information Retrieval Felix Díaz-Hemida, David E. Losada, Alberto.
Introduction to Machine Learning, its potential usage in network area,
Presentation transcript:

Kees van Deemter (AC, Dec 09) Vagueness Facilitates Search Kees van Deemter Computing Science University of Aberdeen

Kees van Deemter (AC, Dec 09) Two big problems with vagueness A predicate is vague (V) if it has borderline cases or degrees 1.The semantic problem: How to model the meaning of V expressions? –Classical models: 2-valued –Partial models: 3-valued –Degree models: many-valued (e.g. Fuzzy Logic, Probabilistic Logic) No agreement how to answer this question (Keefe & Smith 1997, Van Rooij (to appear))

Kees van Deemter (AC, Dec 09) The pragmatic problem: 2. Why is language vague? (Variant: When & Why use vague expressions?) B.Lipman (2000, 2006): Why have we tolerated a world-wide several-thousand- year efficiency loss?

Kees van Deemter (AC, Dec 09) The pragmatic problem: Why is language vague? Speaker... –hides information –only has uncertain or vague information –lack an objective metric –reduces processing cost –expresses an opinion –aids understanding or gist memory (Lipman 2000, 2006, Veltman 2000, de Jaegher 2003, Jäger 2003, van Rooij 2003, Peters et al. 2009)

Kees van Deemter (AC, Dec 09) This talk No verdict on these earlier answers van Deemter 2009 [JPL survey] Explore tentative new answer: V can oil the wheels of communication Starting point: its almost inconceivable that all speakers arrive at exactly the same concepts

Kees van Deemter (AC, Dec 09) Causes of semantic mismatches Perception varies per individual –Hilbert 1987 on colour terms: density of pigment on lens & retina; sensitivity of photo receptors Cultural issues. –Reiter et al on temporal expressions. Example: evening: Are the times of dinner and sunset relevant? R.Parikh (1994) recognised that mismatches exist …

Kees van Deemter (AC, Dec 09) Parikh proposed utility-oriented perspective on meaning –Utility as reduction in search effort showed communication doesnt always break down when words are understood (slightly) differently by different speakers

Kees van Deemter (AC, Dec 09)

Blue books (Bob) Blue books (Ann) Ann: Bring the blue book on topology Bob: Search [[blue]] Bob, then, if necessary, all other books (only10% probability!) 675

Kees van Deemter (AC, Dec 09) What Parikh did not do: show utility of V –Ann and Bob used crisp concepts! This talk: tall instead of blue. 2-dimensional [[tall 1 ]] [[tall 2 ]] or [[tall 2 ]] [[tall 1 ]] Focus on V

Kees van Deemter (AC, Dec 09) The story of the stolen diamond A diamond has been stolen from the Emperor and (…) the thief must have been one of the Emperors 1000 eunuchs. A witness sees a suspicious character sneaking away. He tries to catch him but fails, getting fatally injured (...). The scoundrel escapes. (…) The witness reports The thief is tall, then gives up the ghost. How can the Emperor capitalize on these momentous last words? (book, to appear)

Kees van Deemter (AC, Dec 09) The problem with dichotomies Suppose Emperor uses a dichotomy, e.g. Model A: [[tall]] Emperor = [[>180cm]] (e.g., 500 people) What if [[tall]] Witness = [[>175cm]]? If thief [[tall]] Witness - [[tall]] Emperor then Predicted search effort: 500+ ½(500)=750 Without witness utterance: ½(1000)=500 The witness utterance misled the Emperor 180cm thief 175cm A

Kees van Deemter (AC, Dec 09) Model A uses a crisp dichotomy between [[tall]] A and [[ tall]] A Contrast this with a partial model B, which has a truth value gap [[?tall?]] B

Kees van Deemter (AC, Dec 09) Model AModel B tall A tall B ?tall? B tall B 2-valued 3-valued 180cm 165cm

Kees van Deemter (AC, Dec 09) How does the Emperor classify the thief? Three types of situations s(X) = def expected search time given model X Type 1: thief [[tall]] A = [[tall]] B In this case, s(A)=s(B)

Kees van Deemter (AC, Dec 09) How does the emperor classify the thief? Type 2: thief [[?tall?]] B model A: search all of [[tall]] A in vain, then (on average) half of [[ tall]] A model B: search all of [[tall]] B in vain, then (on average) half of [[?tall?]] B B searches ½(card([[ tall]] B )) less! So, s(B)<s(A)

Kees van Deemter (AC, Dec 09) How does the emperor classify the thief? Type 3: thief [[ tall]] B model A: search all of [[tall]] A in vain, then (on average) half of [[ tall]] A model B: search all of [[tall]] B in vain, then all of [[?tall?]] B then (on average) half of [[ tall]] B Now A searches ½(card([[?tall?]] B )) less. So, s(A)<s(B)

Kees van Deemter (AC, Dec 09) Normally model B wins Type 2 is more probable than Type 3 Let S = xy ((x [[?tall?]] B & y [[ tall]] B ) p(tall(x)) > p(tall(y)) S is highly plausible! (Taller individuals are more likely to be called tall)

Kees van Deemter (AC, Dec 09) Normally model B wins S : xy ((x [[?tall?]] B & y [[ tall]] B ) p(tall(x)) > p(tall(y) ) S implies xy ((x [[?tall?]] B & y [[ tall]] B ) p(thief(x)) > p(thief(y) ) It pays to search [[?tall?]] B before [[ tall]] B A priori, s(B) < s(A)

Kees van Deemter (AC, Dec 09) Why does model B win? 3-valued models beat 2-valued models because they distinguish more finely Therefore, many-valued models should be even better! –All models that allow degrees or ranking (including e.g. Kennedy 2001)

Kees van Deemter (AC, Dec 09) Consider degree model C v(tall(x)) [0,1] (e.g., Fuzzy or Probabilistic Logic) xy ((v(tall(x)) > v((tall(y)) p(tall(x)) > p(tall(y)) height(x) probability of x being called tall

Kees van Deemter (AC, Dec 09) Consider degree model C Analogous to the Partial model C allows the Emperor to –rank the eunuchs, and –start searching the tallest ones, etc. Assuming >3 differences in height (i.e., more than 3 differences in v(tall(x)), this is even quicker than B: s(C) < s(B)

Kees van Deemter (AC, Dec 09) An example of model C v(tall(a1)=v(tall(a2))=0.9 v(tall(b1)=v(tall(b2))=0.7 v(tall(c1)=v(tall(c2))=0.5 v(tall(d1)=v(tall(d2))=0.3 v(tall(e1)=v(tall(e2))=0.1 Search {a1,a2} first, then {b1,b2}, etc. (5 levels) This is quicker than a partial model (3 levels) which is quicker than a classical model (2 levels)

Kees van Deemter (AC, Dec 09) This analysis suggests …

Kees van Deemter (AC, Dec 09) This analysis suggests …... that it helps the Emperor to understand tall as having borderline cases or degrees But, borderline cases and degrees are the hallmark of V It appears to follow that V has benefits for search

Kees van Deemter (AC, Dec 09) This analysis also suggests …... that degree models offer a better understanding of V than partial models –Compare the first big problem with V Radical interpretation: V concepts dont serve to narrow down search space, but to suggest an ordering of it

Kees van Deemter (AC, Dec 09) Objections …

Kees van Deemter (AC, Dec 09) Objections … Objection 1: Smart search would have been equally possible based on a dichotomous (i.e., classical) model.

Kees van Deemter (AC, Dec 09) Objections … Objection 1: Smart search would have been equally possible based on a dichotomous (i.e., classical) model. The idea: You can use a classical model, yet understand that other speakers use other classical models. Start searching individuals who are tall on most models.

Kees van Deemter (AC, Dec 09) Objections … Objection 1: Smart search would have been equally possible based on a dichotomous (i.e., classical) model. The idea: you can use a classical model yet understand that other speakers use other classical models. Start searching individuals who are tall on most models Response: Reasoning about different classical models is not a classical logic but a Partial Logic with supervaluations. Presupposes that tall is understood as vague.

Kees van Deemter (AC, Dec 09) Objections … Objection 2: Truth value gaps (or degrees) do not imply vagueness.

Kees van Deemter (AC, Dec 09) Objections … Objection 2: Truth value gaps (or degrees) do not imply vagueness. The idea: Truth value gaps that are crisp fail to model V. (Similar for degrees.) Higher-order V needs to be modelled.

Kees van Deemter (AC, Dec 09) Objections … Objection 2: A truth value gap (or degrees) does not imply vagueness. The idea: Truth value gaps that are crisp fail to model V. (Similar for degrees.) Higher-order V needs to be modelled. Response: few formal accounts of V pass this test

Kees van Deemter (AC, Dec 09) Objections … Objection 3: Why was the witness not more precise?

Kees van Deemter (AC, Dec 09) Objections … Objection 3: Why was the witness not more precise? The idea: Witness should have said the thief was 185cm tall, giving an estimate

Kees van Deemter (AC, Dec 09) Objections … Objection 3: Why was the witness not more precise? The idea: Witness should have said the thief was 185cm tall, giving an estimate Response: (1) Why are such estimates vague (approximately 185cm) rather than crisp (185cm =/- 0.5cm)? (2) This can be answered along the lines proposed.

Kees van Deemter (AC, Dec 09) Objections … Objection 4: This contradicts Lipmans theorem

Kees van Deemter (AC, Dec 09) Objections … Objection 4: This contradicts Lipmans theorem The idea: Lipman (2006) proved that every V predicate can be replaced by a crisp one that has higher utility.

Kees van Deemter (AC, Dec 09) Objections … Objection 4: This contradicts Lipmans theorem The idea: Lipman (2006) proved that every V predicate can be replaced by a crisp one that has higher utility. Response: Lipmans assumptions dont apply: (1) Theorem models V through probability distribution. (2) Theorem assumes that hearer knows what crisp model the speaker uses (e.g. >185cm).

Kees van Deemter (AC, Dec 09) Lipmans theorem assumes that hearer knows what crisp model the speaker uses Our starting point: in continuous domains, perfect alignment between speakers/hearers would be a miracle (pace epistemicist approaches to V !)

Kees van Deemter (AC, Dec 09) Not Exactly: in Praise of Vagueness. Oxford University Press, Jan Part 1: Vagueness in science and daily life. Part 2: Linguistic and logical models of vagueness. Part 3: Working models of V in Artificial Intelligence. (Related to his talk: pp ; )

Kees van Deemter (AC, Dec 09) Empirical tests of various hypotheses concerning the effects of vague expressions (aiding understanding or gist memory): E.Peters, N.Dieckmann, D.Västfjäll, C.Mertz, P.Slovic, and J.Hibbard (2009). Bringing meaning to numbers: the impact of evaluative categories on decisions. J. Experimental Psychology 15 (3):

Kees van Deemter (AC, Dec 09) Appendix Extra slides

Kees van Deemter (AC, Dec 09) Normally model B wins Let S abbreviate: p(t [[?tall?]] B | tall(t)) > p(t [[ tall]] B | tall(t)) For example, S is plausible if card([[?tall?]] B )=card([[ tall]] B )

Kees van Deemter (AC, Dec 09) Example (draft) Let [[tall]]={a,b,c,d} p(thief [[tall]]) = 1/2 [[?tall]]={e,f} p(thief [[?tall?]]) = 1/4 [[ tall]]={g,h,i,j} p(thief [[ tall]]) = 1/4 then s( )= 1/2*(2)+1/4*(4+1)+1/4*(6+2)= 4.25 s( )= 1/2*(2)+1/4*(4+2)+1/4*(8+1)= 4.75 s( )= 1/4*(2)+1/4*(4+1)+1/2*(6+2)= 5.75

Kees van Deemter (AC, Dec 09) Consider degree model C In C: v(tall(x)) [0,1] (e.g., Fuzzy or Probabilistic Logic) S = ab: a>b p(tall(x) | v(tall(x)=a)) > p(tall(x) | v(tall(x)=b)) height(x) probability of x being called tall

Kees van Deemter (AC, Dec 09) rich=earn more than 10 6 EUR (1) xy: ( x South & y North p(rich(x)) > p(rich(y)) ) (2) x: (rich(x) oil(x)) Therefore (3) xy: ( x South & y North p(oil(x)) > p(oil(y)) )

Kees van Deemter (AC, Dec 09) End