Oct 6, 2014 Lirong Xia Judgment aggregation acknowledgment: Ulle Endriss, University of Amsterdam.

Slides:



Advertisements
Similar presentations
Chapter 7 Hypothesis Testing
Advertisements

Announcement Paper presentation schedule online Second stage open
Oct 9, 2014 Lirong Xia Hypothesis testing and statistical decision theory.
Nov 7, 2013 Lirong Xia Hypothesis testing and statistical decision theory.
Statistics.  Statistically significant– When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha.
Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
Oct 10, 2013 Lirong Xia Judgment aggregation acknowledgment: Ulle Endriss, University of Amsterdam.
Chapter 10 Section 2 Hypothesis Tests for a Population Mean
Testing Hypotheses About Proportions Chapter 20. Hypotheses Hypotheses are working models that we adopt temporarily. Our starting hypothesis is called.
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Means Chapter 13.
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Hypothesis Tests for Means The context “Statistical significance” Hypothesis tests and confidence intervals The steps Hypothesis Test statistic Distribution.
Chapter 3 Hypothesis Testing. Curriculum Object Specified the problem based the form of hypothesis Student can arrange for hypothesis step Analyze a problem.
Inferences About Process Quality
Chapter 9 Hypothesis Testing.
BCOR 1020 Business Statistics Lecture 20 – April 3, 2008.
BCOR 1020 Business Statistics
Probability Population:
Introduction to Hypothesis Testing
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Overview of Statistical Hypothesis Testing: The z-Test
Hypothesis Testing. Method Deductive Reasoning –Reduce to 1 specific conclusion Either “reject” OR “don’t reject” the null hypothesis –From 4 ingredients.
Hypothesis Testing. Distribution of Estimator To see the impact of the sample on estimates, try different samples Plot histogram of answers –Is it “normal”
Chapter 20: Testing Hypotheses about Proportions
Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.
Tuesday, September 10, 2013 Introduction to hypothesis testing.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 20 Testing Hypotheses About Proportions.
14. Introduction to inference
STAT 5372: Experimental Statistics Wayne Woodward Office: Office: 143 Heroy Phone: Phone: (214) URL: URL: faculty.smu.edu/waynew.
Copyright © 2006 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Chapter 9 Comparing More than Two Means. Review of Simulation-Based Tests  One proportion:  We created a null distribution by flipping a coin, rolling.
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
CHAPTER 16: Inference in Practice. Chapter 16 Concepts 2  Conditions for Inference in Practice  Cautions About Confidence Intervals  Cautions About.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Significance Tests: THE BASICS Could it happen by chance alone?
Learning Objectives In this chapter you will learn about the t-test and its distribution t-test for related samples t-test for independent samples hypothesis.
Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population.
CHAPTER 17: Tests of Significance: The Basics
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 20 Testing Hypotheses About Proportions.
1 ConceptsDescriptionHypothesis TheoryLawsModel organizesurprise validate formalize The Scientific Method.
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
Introduction to Inferece BPS chapter 14 © 2010 W.H. Freeman and Company.
10.1: Confidence Intervals Falls under the topic of “Inference.” Inference means we are attempting to answer the question, “How good is our answer?” Mathematically:
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Statistics 101 Chapter 10 Section 2. How to run a significance test Step 1: Identify the population of interest and the parameter you want to draw conclusions.
Chapter 20 Testing Hypothesis about proportions
Ch 10 – Intro To Inference 10.1: Estimating with Confidence 10.2 Tests of Significance 10.3 Making Sense of Statistical Significance 10.4 Inference as.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Overview.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall 9-1 σ σ.
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Chapter 9: Hypothesis Tests Based on a Single Sample 1.
1 URBDP 591 A Lecture 12: Statistical Inference Objectives Sampling Distribution Principles of Hypothesis Testing Statistical Significance.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Math 3680 Lecture #13 Hypothesis Testing: The z Test.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
A significance test or hypothesis test is a procedure for comparing our data with a hypothesis whose truth we want to assess. The hypothesis is usually.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Hypothesis Testing.
1 Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Slide 20-1 Copyright © 2004 Pearson Education, Inc.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
Statistics 20 Testing Hypothesis and Proportions.
Hypothesis testing and statistical decision theory
Fair division Lirong Xia Oct 7, 2013.
Presentation transcript:

Oct 6, 2014 Lirong Xia Judgment aggregation acknowledgment: Ulle Endriss, University of Amsterdam

Two parts –presentation: about 1 hour –discussion: 30 min Meet with me twice before your presentation –1 st : discuss content covered in your presentation –2 st : go over the slides or notes Prepare reading questions for discussion –technical questions –high-level discussions: importance, pros, cons 1 Your paper presentation(s)

2 Last class: Fair division Indivisible goods –house allocation: serial dictatorship –housing market: Top trading cycles (TTC) Divisible goods (cake cutting) – n = 2 : cut-and-choose –discrete and continuous procedures that satisfies proportionality –hard to design a procedure that satisfies envy- freeness

3 Judgment aggregation: the doctrinal paradox Action pAction qLiable? (p ∧ q) Judge 1YYY Judge 2YNN Judge 3NYN MajorityYYN p: valid contract q: the contract has been breached Why paradoxical? –issue-by-issue aggregation leads to an illogical conclusion

An agenda A is a finite nonempty set of propositional logic formulas closed under complementation ([φ ∈ A ] ⇒ [~φ ∈ A ]) – A = { p, q, ~p, ~q, p ∧ q } – A = { p, ~p, p ∧ q, ~p ∨ ~q} A judgment set J on an agenda A is a subset of A (the formulas that an agent thinks is true, in other words, accepts). J is –complete, if for all φ ∈ A, φ ∈ J or ~φ ∈ J –consistent, if J is satisfiable –S( A ) is the set of all complete and consistent judgment sets Each agent (judge) reports a judgment set – D = ( J 1,…, J n ) is called a profile An judgment aggregation (JA) procedure F is a function (S( A )) n → {0,1} A 4 Formal framework

Most previous work took the axiomatic point of view Seems truth is better for many applications –ongoing work 5 Do we want democracy or truth?

Majority rule –F(φ) = 1 if and only if the majority of agents accept φ Quota rules –F(φ) = 1 if and only if at least k % of agents accept φ Premise-based rules –apply majority rule on “premises”, and then use logic reasoning to decide the rest Conclusion-based rules –ignore the premises and use majority rule on “conclusions” Distance-based rules –choose a judgment set that minimizes distance to the profile 6 Some JA procedures

A judgment procedure F satisfies –unanimity, if [for all j, φ ∈ J j ] ⇒ [φ ∈ F( D )] –anonymity, if the names of the agents do not matter –independence, if the decision for φ only depends on agents’ opinion on φ –neutrality, [for all j, φ ∈ J j ⇔ ψ ∈ J j ] ⇒ [φ ∈ F( D ) ⇔ ψ ∈ F( D )] –systematicity, if for all D, D ’, φ, ψ [for all j, φ ∈ J j ⇔ ψ ∈ J j ’ ] ⇒ [φ ∈ F( D ) ⇔ ψ ∈ F( D’ )] =independence + neutrality –majority rule satisfies all of these! 7 Axiomatic properties

Agenda A = { p, ~p, q, ~q, p ∧ q, ~p ∨ ~q} Profile D – J 1 ={p, q, p ∧ q} – J 2 ={p, ~q, ~p ∨ ~q} – J 3 ={~p, q, ~p ∨ ~q} JA Procedure F: majority F( D ) = {p, q, ~p ∨ ~q} 8 Example: Doctrinal paradox Action pAction qLiable? (p ∧ q) Judge 1YYY Judge 2YNN Judge 3NYN MajorityYYN

Theorem. When n >1, no JA procedure satisfies the following conditions –is defined on an agenda containing {p, q, p ∧ q} –satisfies anonymity, neutrality, and independence –always selects a judgment set that is complete and consistent 9 Impossibility theorem

Anonymity + systematicity ⇒ decision on φ only depends on number of agents who accept φ When n is even –half approve p half disapprove p When n is odd –( n -1)/2 approve p and q –( n -3)/2 approve ~p and ~q –1 approves p –1 approves q –# p = #q = # ~(p ∧ q) approve all these violates consistency approve none violates consistency 10 Proof

Anonymity –dictatorship Neutrality –premise-based approaches Independence –distance-based approach 11 Avoiding the impossibility

A = A p + A c – A p =premises – A c =conclusions Use the majority rule on the premises, then use logic inference for the conclusions Theorem. If –the premises are all literals –the conclusions only use literals in the premises –the number of agents is odd then the premise-based approach is anonymous, consistent, and complete 12 Premise-based approaches pq(p ∧ q) Judge 1YYY Judge 2YNN Judge 3NYN MajorityYYLogic reasoning Y

Given a distance function –d: {0,1} A × {0,1} A →R The distance-based approach chooses argmin J ∈ S( A ) Σ J ’ ∈ D d( J, J ’) Satisfies completeness and consistency Violates neutrality and independence –c.f. Kemeny 13 Distance-based approaches

Doctrinal paradox Axiomatic properties of JA procedures Impossibility theorem Premise-based approaches Distance-based approaches 14 Recap

15 Hypothesis testing (definitions)

The average GRE quantitative score of –RPI graduate students vs. –national average: 558(139) Method 1: compute the average score of all RPI graduate students and compare to national average End of class 16 An example

Two heuristic algorithms: which one runs faster in general? Method 1: compare them on all instances Method 2: compare them on a few “randomly” generated instances 17 Another example

You have a random variable X –you know the shape of X: normal the standard deviation of X: 1 –you don’t know the mean of X After observing one sample of X (with value x ), what can you say when comparing the mean to 0? –what if you see 10? –what if you see 2? –what if you see 1? 18 Simplified problem: one sample location test

Method 1 –if x >1.645 then say the mean is strictly positive Method 2 –if x < then say the mean is strictly negative Method 3 –if x 1.96 then say the mean is non- zero How should we evaluate these methods? 19 Some quick answers

Given a statistical model –parameter space: Θ –sample space: S –Pr( s | θ ) H 1 : the alternative hypothesis –H 1 ⊆ Θ –the set of parameters you think contain the ground truth H 0 : the null hypothesis –H 0 ⊆ Θ –H 0 ∩H 1 = ∅ –the set of parameters you want to test (and ideally reject) Output of the test –reject the null: suppose the ground truth is in H 0, it is unlikely that we see what we observe in the data –retain the null: we don’t have enough evidence to reject the null 20 The null and alternative hypothesis (Neyman-Pearson framework)

Combination 1 (one-sided, right tail) –H 1 : mean>0 –H 0 : mean=0 (why not mean<0?) Combination 2 (one-sided, left tail) –H 1 : mean<0 –H 0 : mean=0 Combination 3 (two-sided) –H 1 : mean≠0 –H 0 : mean=0 A hypothesis test is a mapping f : S{reject, retain} 21 One sample location test

H 1 : mean>0 H 0 : mean=0 Parameterized by a number 0< α <1 –is called the level of significance Let x α be such that Pr(X> x α |H 0 )= α – x α is called the critical value Output reject, if – x>x α, or Pr(X> x |H 0 )< α Pr(X> x |H 0 ) is called the p-value Output retain, if – x≤x α, or p-value≥ α 22 One-sided Z-test 0 xαxα α

Popular values of α : –5%: x α = std (somewhat confident) –1%: x α = 2.33 std (very confident) α is the probability that given mean=0, a randomly generated data will leads to “reject” –Type I error 23 Interpreting level of significance 0 xαxα α

H 1 : mean≠0 H 0 : mean=0 Parameterized by a number 0< α <1 Let x α be such that 2Pr(X> x α |H 0 )= α Output reject, if – x>x α, or x<x α Output retain, if – -x α ≤x≤x α 24 Two-sided Z-test 0 xαxα α -x α

One/two-sided Z test: hypothesis tests for one sample location test (for different H 1 ’s) Outputs either to “reject” or “retain” the null hypothesis And defined a lot of seemingly fancy terms on the way –null/alternative hypothesis –level of significance –critical value –p-value –Type I error 25 What we have learned so far…

Isn’t point estimation H 0 never true? –the “chance” for the mean to be exactly 0 is negligible –fine, but what made you believe so? What the heck are you doing by using different H 1 ? –the description of the tests does not depend on the selection of H 1 –if we reject H 0 using one-sided test (mean>0), shouldn’t we already be able to say mean≠0? Why need two-sided test? What the heck are you doing by saying “reject” and “retain” –Can’t you just predict whether the ground truth is in H 0 or H 1 ? 26 Questions that haunted me when I first learned these

Evaluation of hypothesis testing methods Statistical decision theory 27 Next class