1 1 The Scientist Game Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics.

Slides:



Advertisements
Similar presentations
The Scientist Game Scott S. Emerson, M.D., Ph.D.
Advertisements

Lecture 3 Outline: Thurs, Sept 11 Chapters Probability model for 2-group randomized experiment Randomization test p-value Probability model for.
Hypothesis Testing making decisions using sample data.
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Inference Sampling distributions Hypothesis testing.
Significance and probability Type I and II errors Practical Psychology 1 Week 10.
STAT 135 LAB 14 TA: Dongmei Li. Hypothesis Testing Are the results of experimental data due to just random chance? Significance tests try to discover.
Nemours Biomedical Research Statistics March 19, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
Section 7.1 Hypothesis Testing: Hypothesis: Null Hypothesis (H 0 ): Alternative Hypothesis (H 1 ): a statistical analysis used to decide which of two competing.
QM Spring 2002 Business Statistics Introduction to Inference: Hypothesis Testing.
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Hypothesis Tests for Means The context “Statistical significance” Hypothesis tests and confidence intervals The steps Hypothesis Test statistic Distribution.
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
Horng-Chyi HorngStatistics II41 Inference on the Mean of a Population - Variance Known H 0 :  =  0 H 0 :  =  0 H 1 :    0, where  0 is a specified.
Chapter 9 Hypothesis Testing.
BCOR 1020 Business Statistics
Sample Size Determination Ziad Taib March 7, 2014.
Introduction to Testing a Hypothesis Testing a treatment Descriptive statistics cannot determine if differences are due to chance. A sampling error occurs.
The problem of sampling error in psychological research We previously noted that sampling error is problematic in psychological research because differences.
Inferential Statistics
Statistical Inference Dr. Mona Hassan Ahmed Prof. of Biostatistics HIPH, Alexandria University.
Testing Hypotheses.
1. Statistics: Learning from Samples about Populations Inference 1: Confidence Intervals What does the 95% CI really mean? Inference 2: Hypothesis Tests.
AM Recitation 2/10/11.
Overview of Statistical Hypothesis Testing: The z-Test
1 © Lecture note 3 Hypothesis Testing MAKE HYPOTHESIS ©
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
STAT 5372: Experimental Statistics Wayne Woodward Office: Office: 143 Heroy Phone: Phone: (214) URL: URL: faculty.smu.edu/waynew.
Statistical Inference Decision Making (Hypothesis Testing) Decision Making (Hypothesis Testing) A formal method for decision making in the presence of.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
The Argument for Using Statistics Weighing the Evidence Statistical Inference: An Overview Applying Statistical Inference: An Example Going Beyond Testing.
9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.
Instructor Resource Chapter 5 Copyright © Scott B. Patten, Permission granted for classroom use with Epidemiology for Canadian Students: Principles,
Scientific Inquiry & Skills
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
Agresti/Franklin Statistics, 1 of 122 Chapter 8 Statistical inference: Significance Tests About Hypotheses Learn …. To use an inferential method called.
1 The Scientist Game Chris Slaughter, DrPH (courtesy of Scott Emerson) Dept of Biostatistics Vanderbilt University © 2002, 2003, 2006, 2008 Scott S. Emerson,
1 Virtual COMSATS Inferential Statistics Lecture-16 Ossam Chohan Assistant Professor CIIT Abbottabad.
AP Statistics Chapter 20 Notes
1 ConceptsDescriptionHypothesis TheoryLawsModel organizesurprise validate formalize The Scientific Method.
Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.
Hypothesis Testing. The 2 nd type of formal statistical inference Our goal is to assess the evidence provided by data from a sample about some claim concerning.
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
Introduction to sample size and power calculations Afshin Ostovar Bushehr University of Medical Sciences.
S-012 Testing statistical hypotheses The CI approach The NHST approach.
Statistical Inference Statistical Inference involves estimating a population parameter (mean) from a sample that is taken from the population. Inference.
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
1 9 Tests of Hypotheses for a Single Sample. © John Wiley & Sons, Inc. Applied Statistics and Probability for Engineers, by Montgomery and Runger. 9-1.
Statistical Inference An introduction. Big picture Use a random sample to learn something about a larger population.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Overview.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
1 URBDP 591 A Lecture 12: Statistical Inference Objectives Sampling Distribution Principles of Hypothesis Testing Statistical Significance.
Statistical Analysis II Lan Kong Associate Professor Division of Biostatistics and Bioinformatics Department of Public Health Sciences December 15, 2015.
INTRODUCTION TO CLINICAL RESEARCH Introduction to Statistical Inference Karen Bandeen-Roche, Ph.D. July 12, 2010.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Introduction to Testing a Hypothesis Testing a treatment Descriptive statistics cannot determine if differences are due to chance. Sampling error means.
Stat 31, Section 1, Last Time Distribution of Sample Means –Expected Value  same –Variance  less, Law of Averages, I –Dist’n  Normal, Law of Averages,
Psych 230 Psychological Measurement and Statistics Pedro Wolf October 21, 2009.
Chapter 9 Introduction to the t Statistic
Statistical Core Didactic
Hypothesis Testing: Hypotheses
Statistical Inference
Statistical inference
An overview to this point
CHAPTER 10 Comparing Two Populations or Groups
AP STATISTICS LESSON 10 – 4 (DAY 2)
Statistical inference
Presentation transcript:

1 1 The Scientist Game Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington © 2002, 2003 Scott S. Emerson, M.D., Ph.D.

2 Overview A simplified universe –One dimensional universe observed over time –Each position in the universe has an object –Goal is to discover any rules that might determine which objects are in a given location at a particular time A simplified universe –One dimensional universe observed over time –Each position in the universe has an object –Goal is to discover any rules that might determine which objects are in a given location at a particular time

3 Objects Objects in the universe have only three characteristics, each with only two levels Color: White or Orange Size: BIG or small Letter: A or B A a B b A a B b Objects in the universe have only three characteristics, each with only two levels Color: White or Orange Size: BIG or small Letter: A or B A a B b A a B b

4 Universal Laws The level of each characteristic (color, size, letter) for the object at any position in the universe is either completely determined by the prior sequence of that characteristic for objects at that position, OR is completely random (anything is permissible) (No patterns involving probabilities less than 1) (Adjacent positions have no effect) The level of each characteristic (color, size, letter) for the object at any position in the universe is either completely determined by the prior sequence of that characteristic for objects at that position, OR is completely random (anything is permissible) (No patterns involving probabilities less than 1) (Adjacent positions have no effect)

5 Examples of Universal Laws Color only: b a A b a a a B A a A A The next object in the sequence must be white, but any size or letter will do: a A b B Color only: b a A b a a a B A a A A The next object in the sequence must be white, but any size or letter will do: a A b B

6 Examples of Universal Laws Size and letter: A a B b A a B b A a B b The next object in the sequence must be a big A, but any color will do A A Size and letter: A a B b A a B b A a B b The next object in the sequence must be a big A, but any color will do A A

7 Examples of Universal Laws Size only: B a B b A b B a B b A a The next object in the sequence must be big, but any color or letter will do: A B A B Size only: B a B b A b B a B b A a The next object in the sequence must be big, but any color or letter will do: A B A B

8 Examples of Universal Laws No discernible pattern (in available data): A a b a A b B a B B A a If there is truly no deterministic pattern, then any object may appear next: a A b B a A b B No discernible pattern (in available data): A a b a A b B a B B A a If there is truly no deterministic pattern, then any object may appear next: a A b B a A b B

9 Scientific Task Goal is therefore to decide for some position –whether a rule governs the level of each characteristic, and –if so, what that rule is (pattern to the sequence) Goal is therefore to decide for some position –whether a rule governs the level of each characteristic, and –if so, what that rule is (pattern to the sequence)

10 Hypothesis Generation Initially we have observational data gathered over time –Amount of available information varies from position to position –We want to identify some position that is the most likely to be governed by some deterministic rule Initially we have observational data gathered over time –Amount of available information varies from position to position –We want to identify some position that is the most likely to be governed by some deterministic rule

11 Observational Data Time Pstn … 114. b a A b a a B a A A a b ? ? 115. A a B b A a b b A a a A ? ? 116. B b A a a b ? ? 117. b B A b b b a ? ? 118. A b B A B b A B B a B B ? ? 119. B b B b A a A a B A b ? ? 120. B B b ? ? 121. B A a B b b a b A ? ? … Time Pstn … 114. b a A b a a B a A A a b ? ? 115. A a B b A a b b A a a A ? ? 116. B b A a a b ? ? 117. b B A b b b a ? ? 118. A b B A B b A B B a B B ? ? 119. B b B b A a A a B A b ? ? 120. B B b ? ? 121. B A a B b b a b A ? ? …

12 Observational Data Time Pstn … 118. A b B A B b A B B a B B ? ? … Time Pstn … 118. A b B A B b A B B a B B ? ? …

13 Next Step? Further observation? –Might take too long –Won’t really establish cause and effect Further observation? –Might take too long –Won’t really establish cause and effect

14 Experimentation You can try to put an object in the position –If it cannot come next, it disintegrates and you can try another –If it can come next, it stays and you can try a different object to follow it You can try to put an object in the position –If it cannot come next, it disintegrates and you can try another –If it can come next, it stays and you can try a different object to follow it

15 Experimental Goal You need to devise a series of experiments to discover –whether a deterministic rule governs the sequence of objects at position 118, and –if there is such a rule, what it is You need to devise a series of experiments to discover –whether a deterministic rule governs the sequence of objects at position 118, and –if there is such a rule, what it is

16 Real World Problem: –You must buy objects to experiment with (apply for a grant) Question: –What object should you try next in the sequence in order to determine the rule? Problem: –You must buy objects to experiment with (apply for a grant) Question: –What object should you try next in the sequence in order to determine the rule?

17 Possible Experiments Time Pstn A b B A B b A B B a B B ? ? Possible Experiments a A b B a A b B Which experiment do you do first? Time Pstn A b B A B b A B B a B B ? ? Possible Experiments a A b B a A b B Which experiment do you do first?

18 Results of Observation Time Pstn A b B A B b A B B a B B ? ? We identified position 118 which had some regular patterns –Color cycle of length 2: (orange, white) –Size cycle of length 4: (big, little, big, big) –Letter cycle of length 3: (A, B, B) Time Pstn A b B A B b A B B a B B ? ? We identified position 118 which had some regular patterns –Color cycle of length 2: (orange, white) –Size cycle of length 4: (big, little, big, big) –Letter cycle of length 3: (A, B, B)

19 Define Hypotheses Deterministic pattern versus random chance for each characteristic –Recognize that some or all observed patterns might be coincidence Chance observation of a pattern for a single characteristic (e.g., color) with sample size 12 (assuming each level equally likely) –1 out of 1024 for a cycle of length 2 –1 out of 512 for a cycle of length 3 –1 out of 256 for a cycle of length 4 Deterministic pattern versus random chance for each characteristic –Recognize that some or all observed patterns might be coincidence Chance observation of a pattern for a single characteristic (e.g., color) with sample size 12 (assuming each level equally likely) –1 out of 1024 for a cycle of length 2 –1 out of 512 for a cycle of length 3 –1 out of 256 for a cycle of length 4

20 Possible Hypotheses Assuming sufficient data to see any rule 118. A b B A B b A B B a B B ? ? Hypotheses Color, Size, and Letter Color, Size Color, Letter Size, Letter Color Size Letter All coincidence Assuming sufficient data to see any rule 118. A b B A B b A B B a B B ? ? Hypotheses Color, Size, and Letter Color, Size Color, Letter Size, Letter Color Size Letter All coincidence

21 Most Popular First Choice 118. A b B A B b A B B a B B ? ? Possible Experiments Hypotheses A Color, Size, and Letter Color, Size Color, Letter Size, Letter Color Size Letter All coincidence 118. A b B A B b A B B a B B ? ? Possible Experiments Hypotheses A Color, Size, and Letter Color, Size Color, Letter Size, Letter Color Size Letter All coincidence

22 Most Popular First Choice 118. A b B A B b A B B a B B ? ? Possible Experiments Hypotheses A Color, Size, and Letter + Color, Size Color, Letter Size, Letter Color Size Letter All coincidence 118. A b B A B b A B B a B B ? ? Possible Experiments Hypotheses A Color, Size, and Letter + Color, Size Color, Letter Size, Letter Color Size Letter All coincidence

23 Most Popular First Choice 118. A b B A B b A B B a B B ? ? Possible Experiments Hypotheses A Color, Size, and Letter + Color, Size + Color, Letter Size, Letter Color Size Letter All coincidence 118. A b B A B b A B B a B B ? ? Possible Experiments Hypotheses A Color, Size, and Letter + Color, Size + Color, Letter Size, Letter Color Size Letter All coincidence

24 Most Popular First Choice 118. A b B A B b A B B a B B ? ? Possible Experiments Hypotheses A Color, Size, and Letter + Color, Size + Color, Letter + Size, Letter Color Size Letter All coincidence 118. A b B A B b A B B a B B ? ? Possible Experiments Hypotheses A Color, Size, and Letter + Color, Size + Color, Letter + Size, Letter Color Size Letter All coincidence

25 Most Popular First Choice 118. A b B A B b A B B a B B ? ? Possible Experiments Hypotheses A Color, Size, and Letter + Color, Size + Color, Letter + Size, Letter + Color Size Letter All coincidence 118. A b B A B b A B B a B B ? ? Possible Experiments Hypotheses A Color, Size, and Letter + Color, Size + Color, Letter + Size, Letter + Color Size Letter All coincidence

26 Most Popular First Choice 118. A b B A B b A B B a B B ? ? Possible Experiments Hypotheses A Color, Size, and Letter + Color, Size + Color, Letter + Size, Letter + Color + Size Letter All coincidence 118. A b B A B b A B B a B B ? ? Possible Experiments Hypotheses A Color, Size, and Letter + Color, Size + Color, Letter + Size, Letter + Color + Size Letter All coincidence

27 Most Popular First Choice 118. A b B A B b A B B a B B ? ? Possible Experiments Hypotheses A Color, Size, and Letter + Color, Size + Color, Letter + Size, Letter + Color + Size + Letter All coincidence 118. A b B A B b A B B a B B ? ? Possible Experiments Hypotheses A Color, Size, and Letter + Color, Size + Color, Letter + Size, Letter + Color + Size + Letter All coincidence

28 Most Popular First Choice 118. A b B A B b A B B a B B ? ? Possible Experiments Hypotheses A Color, Size, and Letter + Color, Size + Color, Letter + Size, Letter + Color + Size + Letter + All coincidence 118. A b B A B b A B B a B B ? ? Possible Experiments Hypotheses A Color, Size, and Letter + Color, Size + Color, Letter + Size, Letter + Color + Size + Letter + All coincidence

29 Most Popular First Choice A noninformative experiment 118. A b B A B b A B B a B B ? ? Possible Experiments Hypotheses A Color, Size, and Letter + Color, Size + Color, Letter + Size, Letter + Color + Size + Letter + All coincidence + A noninformative experiment 118. A b B A B b A B B a B B ? ? Possible Experiments Hypotheses A Color, Size, and Letter + Color, Size + Color, Letter + Size, Letter + Color + Size + Letter + All coincidence +

30 Next Worse Choice If all hypotheses equally likely, a 7-1 split 118. A b B A B b A B B a B B ? ? Possible Experiments Hypotheses A b Color, Size, and Letter + - Color, Size + - Color, Letter + - Size, Letter + - Color + - Size + - Letter + - All coincidence + + If all hypotheses equally likely, a 7-1 split 118. A b B A B b A B B a B B ? ? Possible Experiments Hypotheses A b Color, Size, and Letter + - Color, Size + - Color, Letter + - Size, Letter + - Color + - Size + - Letter + - All coincidence + +

31 Other Suboptimal Experiments If all hypotheses equally likely, a 6-2 split 118. A b B A B b A B B a B B ? ? Possible Experiments Hypotheses A b b B a Color, Size, and Letter Color, Size Color, Letter Size, Letter Color Size Letter All coincidence If all hypotheses equally likely, a 6-2 split 118. A b B A B b A B B a B B ? ? Possible Experiments Hypotheses A b b B a Color, Size, and Letter Color, Size Color, Letter Size, Letter Color Size Letter All coincidence

32 Optimal Experiments Based on a binary search 118. A b B A B b A B B a B B ? ? Possible Experiments Hypotheses A b b B a B a A Color, Size, and Letter Color, Size Color, Letter Size, Letter Color Size Letter All coincidence Based on a binary search 118. A b B A B b A B B a B B ? ? Possible Experiments Hypotheses A b b B a B a A Color, Size, and Letter Color, Size Color, Letter Size, Letter Color Size Letter All coincidence

33 What If Data Insufficient? Suppose deterministic cycle length > A b B A B b A B B a B B ? ? Possible Experiments Hypotheses A b b B a B a A Color, Size, and Letter Color, Size Color, Letter Size, Letter Color Size Letter All coincidence Cycle length > 12 ? ? ? ? ? ? ? ? Suppose deterministic cycle length > A b B A B b A B B a B B ? ? Possible Experiments Hypotheses A b b B a B a A Color, Size, and Letter Color, Size Color, Letter Size, Letter Color Size Letter All coincidence Cycle length > 12 ? ? ? ? ? ? ? ?

34 Moral The goal of the experiment should be to “decide which” not “prove that” A well designed experiment discriminates between hypotheses –The hypotheses should be the most important, viable hypotheses The goal of the experiment should be to “decide which” not “prove that” A well designed experiment discriminates between hypotheses –The hypotheses should be the most important, viable hypotheses

35 Moral All other things being equal, an experiment should be equally informative for all possible outcomes –In the presence of a binary outcome, use a binary search (using prior probability of being true) –But may need to consider simplicity of experiments, time, cost All other things being equal, an experiment should be equally informative for all possible outcomes –In the presence of a binary outcome, use a binary search (using prior probability of being true) –But may need to consider simplicity of experiments, time, cost

36 In the Presence of Variability We use statistics to quantify the precision of our inference –We will describe our confidence/belief in our conclusions using frequentist or Bayesian probability statements We use statistics to quantify the precision of our inference –We will describe our confidence/belief in our conclusions using frequentist or Bayesian probability statements

37 Statistical Experimental Design I believe a scientific approach to the use of statistics is to –Decide a level of confidence used to construct frequentist confidence intervals or Bayesian credible intervals –Ensure adequate statistical precision (sample size) to discriminate between relevant scientific hypotheses The intervals should not contain two hypotheses that were to be discriminated between I believe a scientific approach to the use of statistics is to –Decide a level of confidence used to construct frequentist confidence intervals or Bayesian credible intervals –Ensure adequate statistical precision (sample size) to discriminate between relevant scientific hypotheses The intervals should not contain two hypotheses that were to be discriminated between

38 Impact on Statistical Power I choose equal one-sided type I and type II errors –E.g., 97.5% power to detect the alternative in a one-sided level hypothesis test In this way, at the end of the study, the 95% CI will not contain both the null and alternative hypotheses –I will have discriminated between the hypotheses with high confidence I choose equal one-sided type I and type II errors –E.g., 97.5% power to detect the alternative in a one-sided level hypothesis test In this way, at the end of the study, the 95% CI will not contain both the null and alternative hypotheses –I will have discriminated between the hypotheses with high confidence