On Bayesian Measures 27 May 2005 V. Crup i Vincenzo Crupi Department of Cognitive and Education Sciences Laboratory of Cognitive Sciences University of.

Slides:



Advertisements
Similar presentations
Hume’s Problem of Induction 2 Seminar 2: Philosophy of the Sciences Wednesday, 14 September
Advertisements

The Logic of Intelligence Pei Wang Department of Computer and Information Sciences Temple University.
Weakening the Causal Faithfulness Assumption
1 Logic Logic in general is a subfield of philosophy and its development is credited to ancient Greeks. Symbolic or mathematical logic is used in AI. In.
Confirmation and the ravens paradox 1 Seminar 3: Philosophy of the Sciences Wednesday, 21 September
Bayesian Inference Chris Mathys Wellcome Trust Centre for Neuroimaging UCL SPM Course London, May 12, 2014 Thanks to Jean Daunizeau and Jérémie Mattout.
Week 11 Review: Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution.
Lecture XXIII.  In general there are two kinds of hypotheses: one concerns the form of the probability distribution (i.e. is the random variable normally.
Testing Theories: Three Reasons Why Data Might not Match the Theory.
Application of Ethical Reasoning
Stupid Bayesian Tricks Gregory Lopez, MA, PharmD SkeptiCamp 2009.
NOTE: CORRECTION TO SYLLABUS FOR ‘HUME ON CAUSATION’ WEEK 6 Mon May 2: Hume on inductive reasoning --Hume, Enquiry Concerning Human Understanding, section.
Copyright © Cengage Learning. All rights reserved. CHAPTER 5 SEQUENCES, MATHEMATICAL INDUCTION, AND RECURSION SEQUENCES, MATHEMATICAL INDUCTION, AND RECURSION.
5/15/2015Slide 1 SOLVING THE PROBLEM The one sample t-test compares two values for the population mean of a single variable. The two-sample test of a population.
Truth Trees Intermediate Logic.
James Hawthorne Workshop: Conditionals Counterfactuals and Causes In Uncertain Environments Düsseldorf 19/5/2011 – 22/5/
Chapter 4 Validity.
1. Introduction Consistency of learning processes To explain when a learning machine that minimizes empirical risk can achieve a small value of actual.
Models -1 Scientists often describe what they do as constructing models. Understanding scientific reasoning requires understanding something about models.
Discrete Structures Chapter 2 Part B Mathematical Induction
Two Broad Classes of Functions for Which a No Free Lunch Result Does Not Hold Matthew J. Streeter Genetic Programming, Inc. Mountain View, California
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Basic Business Statistics.
Stochastic Dominance Michael H. Birnbaum Decision Research Center California State University, Fullerton.
Introduction, Acquiring Knowledge, and the Scientific Method
Chapter 14 Inferential Data Analysis
Judgment and Decision Making in Information Systems Probability, Utility, and Game Theory Yuval Shahar, M.D., Ph.D.
Accounting Theory & Practice OVERVIEW OF ACCOUNTING THEORY
Basic Concepts of Research Basis of scientific method Making observations in systematic way Follow strict rules of evidence Critical thinking about evidence.
Copyright © Cengage Learning. All rights reserved. CHAPTER 11 ANALYSIS OF ALGORITHM EFFICIENCY ANALYSIS OF ALGORITHM EFFICIENCY.
COMP14112: Artificial Intelligence Fundamentals L ecture 3 - Foundations of Probabilistic Reasoning Lecturer: Xiao-Jun Zeng
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Business Statistics,
Hypothesis Testing.
Discrete Mathematics and Its Applications
Theory testing Part of what differentiates science from non-science is the process of theory testing. When a theory has been articulated carefully, it.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
Political Science 102 May 18 th Theories and hypotheses Evidence Correlation and Causal Relationships Doing comparative research Your Term Paper.
Quantitative Analysis: Statistical Testing using SPSS Geof Staniford Room Telephone:
STOCHASTIC DOMINANCE APPROACH TO PORTFOLIO OPTIMIZATION Nesrin Alptekin Anadolu University, TURKEY.
© 2007 The McGraw-Hill Companies, Inc. All Rights Reserved Slide 1 Research Methods In Psychology 2.
Testing Theories: Three Reasons Why Data Might not Match the Theory Psych 437.
Basics of Probability. A Bit Math A Probability Space is a triple, where  is the sample space: a non-empty set of possible outcomes; F is an algebra.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
The Integers. The Division Algorithms A high-school question: Compute 58/17. We can write 58 as 58 = 3 (17) + 7 This forms illustrates the answer: “3.
1 Reasoning Under Uncertainty Artificial Intelligence Chapter 9.
Ellsberg’s paradoxes: Problems for rank- dependent utility explanations Cherng-Horng Lan & Nigel Harvey Department of Psychology University College London.
Experimental Method. METHODS IN PSYCHOLOGY 1.Experimental Method 2.Observation Method 3.Clinical Method.
Estimating Component Availability by Dempster-Shafer Belief Networks Estimating Component Availability by Dempster-Shafer Belief Networks Lan Guo Lane.
Chapter 9 Three Tests of Significance Winston Jackson and Norine Verberg Methods: Doing Social Research, 4e.
1 Statistical Significance Testing. 2 The purpose of Statistical Significance Testing The purpose of Statistical Significance Testing is to answer the.
Uncertainty Management in Rule-based Expert Systems
Chapter 11 Statistical Techniques. Data Warehouse and Data Mining Chapter 11 2 Chapter Objectives  Understand when linear regression is an appropriate.
LECTURE 22 THE FINE-TUNING ARGUMENT FOR DESIGN. THE INITIAL COMPETITORS NATURALISTIC (SINGLE WORLD) HYPOTHESIS (NH 1 ): Reality consists of a single material,
Logic and Vocabulary of Hypothesis Tests Chapter 13.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Business Statistics,
NON-PARAMETRIC STATISTICS
DTC Quantitative Methods Bivariate Analysis: t-tests and Analysis of Variance (ANOVA) Thursday 14 th February 2013.
Title Sub-Title Open Writing it up! The content of the report/essay/article.
Foundations of Discrete Mathematics Chapter 1 By Dr. Dalia M. Gil, Ph.D.
Research Methods Research Fundamentals Business Research Methods.
Ethical Intuitions about Risks Sabine Roeser Philosophy Department, Faculty of Technology, Policy and Management, Delft University.
Philosophy of science What is a scientific theory? – Is a universal statement Applies to all events in all places and time – Explains the behaviour/happening.
The Pigeonhole Principle
Inconsistent Constraints
If You Aren’t Dong Arguments, You Aren’t Doing Evidence
Propositional Calculus: Boolean Algebra and Simplification
Using climate model robustness as an example of scientific confirmation Matt Newman CIRES.
Henry Prakken COMMA 2016 Berlin-Potsdam September 15th, 2016
The Nature of Scientific Knowledge
This Lecture Substitution model
Foundations of Discrete Mathematics
Presentation transcript:

On Bayesian Measures 27 May 2005 V. Crup i Vincenzo Crupi Department of Cognitive and Education Sciences Laboratory of Cognitive Sciences University of Trento 27 May 2005 On Bayesian Measures of Evidential Support: Normative and Descriptive Considerations

On Bayesian Measures 27 May 2005 V. Crup i Core Bayesianism… (CB) confirmation is represented by a function X depending only on probabilistic information about evidence (premise) e and hypothesis (conclusion) h and identifying confirmation with an increase in the probability of h provided by the piece of information e, so that: X(e,h)X(e,h) > 0 iff p(h|e) > p(h) = 0 iff p(h|e) = p(h) < 0 iff p(h|e) < p(h)

On Bayesian Measures 27 May 2005 V. Crup i …and the plurality of Bayesian measures

On Bayesian Measures 27 May 2005 V. Crup i if p(h) = 0,5 D R L continues

On Bayesian Measures 27 May 2005 V. Crup i symmetries and asymmetries: an example Eells, E. & Fitelson, B. “Symmetries and Asymmetries in Evidential Support”, Philosophical Studies, 107 (2002), pp “evidence symmetry”: X(e,h) = –X(¬e,h) main thesis: the most adequate Bayesian measure(s) may be selected by testing competitors against intuitively compelling symmetries and asymmetries e.g.: “does a piece of evidence e support a hypothesis h equally well as e’s negation (¬e) undermines the same hypothesis h?” (p. 129) ?

On Bayesian Measures 27 May 2005 V. Crup i symmetries and asymmetries: a unified account a symmetry s is a function mapping an argument (e,h) onto a different argument by one or more of the following steps: negate e negate h invert premise and conclusion

On Bayesian Measures 27 May 2005 V. Crup i continues negate e negate h invert premise and conclusion  “evidence symmetry” (ES): ES(e,h) = (¬e,h)

On Bayesian Measures 27 May 2005 V. Crup i negate e negate h invert premise and conclusion  “hypothesis symmetry” (HS): HS(e,h) = (e,¬h) continues

On Bayesian Measures 27 May 2005 V. Crup i negate e negate h invert premise and conclusion  “inverse symmetry” (IS) such that: IS(e,h) = (h,e) continues

On Bayesian Measures 27 May 2005 V. Crup i negate e negate h invert premise and conclusion  “total symmetry” (TS): TS(e,h) = (¬e,¬h) continues

On Bayesian Measures 27 May 2005 V. Crup i negate e negate h invert premise and conclusion  “inverse evidence symmetry” (IES): IES(e,h) = (h,¬e) continues

On Bayesian Measures 27 May 2005 V. Crup i (e,h)(e,h) evidence (ES): (¬e,h) hypothesis (HS): (e,¬h) total (TS): (¬e,¬h) inverse (IS): (h,e) inverse evidence (IES): (h,¬e) inverse hypothesis (IHS): (¬h,e) inverse total (ITS): (¬h,¬e) convergent a symmetry s is: convergent iff (e,h) and s(e,h) have the same direction (both confirmations or both disconfirmations) divergent iff (e,h) and s(e,h) have opposite directions a convergent symmetry s holds iff X(e,h) = X[s(e,h)] a divergent symmetry s holds iff X(e,h) = –X[s(e,h)] continues

On Bayesian Measures 27 May 2005 V. Crup i Eells & Fitelson: an adequate measure of evidential support should violate “evidence symmetry” for at least some choice of e and h e.g.: X(Jack,face) >> –X(not-Jack,face) X(ace, face) << –X(not-ace,face) “the extremeness of logical implication [refutation] of (or conferring probability 1 [0] on) [the hypothesis] is not what is crucial to the examples for the purposes of evaluating […] (ES)” (p. 134) continues E & F suggest to extrapolate from extreme to non-extreme cases by simply assuming that the relevant premises describe reports of “very reliable, but fallible, assistants” (ibid.)

On Bayesian Measures 27 May 2005 V. Crup i V(e,h)V(e,h) = +1 iff e implies h = –1 iff e refutes h (i.e., implies not-h) = 0 otherwise principle of extrapolation (from the deductive to the inductive domain): (PE) any symmetry s holds for an adequate measure of evidential support iff it demonstrably holds for V continues

On Bayesian Measures 27 May 2005 V. Crup i (PE) implies that (IS) should not hold generally  counterexample: X(Jack,face) >> X(face,Jack) BUT (PE) also implies that (IS) should hold generally for pairs of disconfirmatory arguments X(e,h) – = X(h,e) – because e refutes h iff h refutes e (PE) implies that (ES) should not hold generally  counterexamples involving deductive arguments (PE) implies that (HS) should hold generally X(e,h) = –X(e,¬h) because e implies h iff e refutes ¬h continues

On Bayesian Measures 27 May 2005 V. Crup i the whole set of consequences of (PE): continues

On Bayesian Measures 27 May 2005 V. Crup i continues any Bayesian measure of evidential support fulfils the consequences of (PE) concerning (HS) and (IS) iff it fulfils all the consequences of (PE) concerning the other symmetries theorem:

On Bayesian Measures 27 May 2005 V. Crup i the whole set of consequences of PE: continues

On Bayesian Measures 27 May 2005 V. Crup i the need for yet another Bayesian measure of confirmation Eells & Fitelson (2002) suggest that measures D and L are to be preferred to other measures because both: fulfil (HS) violate (ES) violate (TS) violate (IS) [but E&F disregard the disconfirmation case…] however:

On Bayesian Measures 27 May 2005 V. Crup i continues  none of the currently available Bayesian measures of evidential support satisfies both (CB) and (PE) is it possible to define a measure of evidential support satisfying both (CB) and (PE) (and therefore the whole set of desirable symmetries and asymmetries)? existence proof:

On Bayesian Measures 27 May 2005 V. Crup i continues 0,5 p(h|e)p(h|e) 0,2 p(h|e)p(h|e) 0,8 p(h|e)p(h|e) measure Z

On Bayesian Measures 27 May 2005 V. Crup i continues Th 1. Z satisfies (HS) Proof: Z(e,h) + = 2/π arcsin{1 – [p(¬h|e)/p(¬h)]} = –2/π arcsin{[p(¬h|e)/p(¬h)] – 1} = – Z(e,¬h) – since h = ¬(¬h), this is equivalent to (HS) Th 2. Z satisfies (IS) for pairs of disconfirmatory arguments Proof: Z(e,h) – = 2/π arcsin{[p(h|e)/p(h)] – 1} = 2/π arcsin{[p(e|h)/p(e)] – 1} (by Bayes theorem) = Z(h,e) – Th 3. Z violates (IS) for some pair of confirmatory arguments Proof: suppose that: p(h & e) = 49/10 p(h & ¬e) = 41/10 p(¬h & e) = 1/100 p(¬h & ¬e) = 9/100 then: Z(e,h) + = 2/π arcsin(4/5) > 2/π arcsin(4/45) = Z(h,e) +

On Bayesian Measures 27 May 2005 V. Crup i empirical comparison of competing measures are the most normatively justified confirmation measures among the most psychologically descriptive? Tentori, K., Crupi, V., Bonini, N. & Osherson, D., “Comparison of Confirmation Measures”, 2005

On Bayesian Measures 27 May 2005 V. Crup i participants, materials and procedure: 26 students (Milan, Trento; mean age 24) 2 urns: (A) 30 black balls + 10 white balls (B) 15 black balls + 25 white balls random selection of one urn (outcome hidden) ten random extractions without replacement after each extraction:

On Bayesian Measures 27 May 2005 V. Crup i continues results : average correlations between judged evidential impact and confirmation measures (evidential impact computed from objective probabilities) each number is the average of 26 correlations (one per participant) for each correlation n = 10 p(A[B]|e) denotes p(A|e) or p(B|e) as appropriate. * = reliably greater than the average for p(A[B]|e) by paired t-test (p < 0,02)

On Bayesian Measures 27 May 2005 V. Crup i continues comparison of Z with other confirmation measures (confirmation computed from objective probabilities) each cell reports a paired t-test between the correlations obtained with the confirmation measures in the associated row and column for each t-test, n = 26 (corresponding to the 26 participants) the correlations each involve 10 observations the last row of each cell shows the number of participants (out of 26) for whom Z predicted better than the rival measure at the top of the column

On Bayesian Measures 27 May 2005 V. Crup i conclusive remarks limitations: a strictly probabilistic setting the results suggest a remarkable convergence between the normative and the descriptive dimension in the study of evidential support (compare with probability judgments!…) most symmetries and asymmetries were not involved in the experiment (e.g., IS); so IF it turned out that the consequences of (PE) are in fact reflected in na ï ve subjects’ judgments of evidential impact, THEN: – none of the available Bayesian measures could fully account for human inferential processes (not even in purely probabilistic settings) – Z (or similar measures) could have an even greater advantage against competing alternatives than the one detected in our study suggestions for further studies: – more direct comparisons – direct test of various symmetries and asymmetries – extension to non-probabilistic settings

On Bayesian Measures 27 May 2005 V. Crup i thank you