MPS/MSc in StatisticsAdaptive & Bayesian - Lect 41 Lecture 4 Sample size reviews 4.1A general approach to sample size reviews 4.2Binary data 4.3Normally.

Slides:



Advertisements
Similar presentations
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Advertisements

Chapter 7. Statistical Estimation and Sampling Distributions
The Simple Linear Regression Model: Specification and Estimation
Using Statistics in Research Psych 231: Research Methods in Psychology.
Using Statistics in Research Psych 231: Research Methods in Psychology.
Hypothesis testing Some general concepts: Null hypothesisH 0 A statement we “wish” to refute Alternative hypotesisH 1 The whole or part of the complement.
10-1 Introduction 10-2 Inference for a Difference in Means of Two Normal Distributions, Variances Known Figure 10-1 Two independent populations.
Point estimation, interval estimation
Using Statistics in Research Psych 231: Research Methods in Psychology.
Chapter 10 Sampling and Sampling Distributions
Evaluating Hypotheses
Hypothesis Testing for Population Means and Proportions
Statistical Background
Experimental Evaluation
Inferences About Process Quality
1 Inference About a Population Variance Sometimes we are interested in making inference about the variability of processes. Examples: –Investors use variance.
Lehrstuhl für Informatik 2 Gabriella Kókai: Maschine Learning 1 Evaluating Hypotheses.
5-3 Inference on the Means of Two Populations, Variances Unknown
Sample Size Determination Ziad Taib March 7, 2014.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Analysis of Variance. ANOVA Probably the most popular analysis in psychology Why? Ease of implementation Allows for analysis of several groups at once.
Chapter 7 Estimation: Single Population
T-test Mechanics. Z-score If we know the population mean and standard deviation, for any value of X we can compute a z-score Z-score tells us how far.
Estimation Basic Concepts & Estimation of Proportions
1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2005 Thomson/South-Western Chapter 13, Part A Analysis of Variance and Experimental Design n Introduction to Analysis of Variance n Analysis.
F OUNDATIONS OF S TATISTICAL I NFERENCE. D EFINITIONS Statistical inference is the process of reaching conclusions about characteristics of an entire.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
1 G Lect 6b G Lecture 6b Generalizing from tests of quantitative variables to tests of categorical variables Testing a hypothesis about a.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Statistics Psych 231: Research Methods in Psychology.
1 Chapter 13 Analysis of Variance. 2 Chapter Outline  An introduction to experimental design and analysis of variance  Analysis of Variance and the.
1 CS 391L: Machine Learning: Experimental Evaluation Raymond J. Mooney University of Texas at Austin.
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
Chapter 5 Parameter estimation. What is sample inference? Distinguish between managerial & financial accounting. Understand how managers can use accounting.
Sections 7-1 and 7-2 Review and Preview and Estimating a Population Proportion.
Jeopardy Hypothesis Testing t-test Basics t for Indep. Samples Related Samples t— Didn’t cover— Skip for now Ancient History $100 $200$200 $300 $500 $400.
RDPStatistical Methods in Scientific Research - Lecture 41 Lecture 4 Sample size determination 4.1 Criteria for sample size determination 4.2 Finding the.
Chapter 2 Statistical Background. 2.3 Random Variables and Probability Distributions A variable X is said to be a random variable (rv) if for every real.
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
MPS/MSc in StatisticsAdaptive & Bayesian - Lect 51 Lecture 5 Adaptive designs 5.1Introduction 5.2Fisher’s combination method 5.3The inverse normal method.
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal
MPS/MSc in StatisticsAdaptive & Bayesian - Lect 21 Lecture 2 Two-stage designs for normally distributed data 2.1 A typical fixed-sample design 2.2 A two-stage.
MPS/MSc in StatisticsAdaptive & Bayesian - Lect 71 Lecture 7 Bayesian methods: a refresher 7.1 Principles of the Bayesian approach 7.2 The beta distribution.
MPS/MSc in StatisticsAdaptive & Bayesian - Lect 91 Lecture 9 Bayesian approaches for quantitative responses 9.1 Proof-of-concept studies outside oncology.
Statistics Sampling Distributions and Point Estimation of Parameters Contents, figures, and exercises come from the textbook: Applied Statistics and Probability.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Statistical Significance Hypothesis Testing.
Significance Tests for Regression Analysis. A. Testing the Significance of Regression Models The first important significance test is for the regression.
Week 21 Order Statistics The order statistics of a set of random variables X 1, X 2,…, X n are the same random variables arranged in increasing order.
ANalysis Of VAriance can be used to test for the equality of three or more population means. H 0 :  1  =  2  =  3  = ... =  k H a : Not all population.
Chapter 8 Estimation ©. Estimator and Estimate estimator estimate An estimator of a population parameter is a random variable that depends on the sample.
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
1/54 Statistics Analysis of Variance. 2/54 Statistics in practice Introduction to Analysis of Variance Analysis of Variance: Testing for the Equality.
Hypothesis Tests. An Hypothesis is a guess about a situation that can be tested, and the test outcome can be either true or false. –The Null Hypothesis.
MPS/MSc in StatisticsAdaptive & Bayesian - Lect 31 Lecture 3 Sequential designs 3.1 A multi-stage (or group sequential) design 3.2 A general setting for.
Week 21 Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
Hypothesis Testing. Steps for Hypothesis Testing Fig Draw Marketing Research Conclusion Formulate H 0 and H 1 Select Appropriate Test Choose Level.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Class Six Turn In: Chapter 15: 30, 32, 38, 44, 48, 50 Chapter 17: 28, 38, 44 For Class Seven: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 Read.
Inferential Statistics Psych 231: Research Methods in Psychology.
Chapter 4. Inference about Process Quality
Statistics Analysis of Variance.
Distribution functions
t distribution Suppose Z ~ N(0,1) independent of X ~ χ2(n). Then,
Comparing Three or More Means
CONCEPTS OF ESTIMATION
Presentation transcript:

MPS/MSc in StatisticsAdaptive & Bayesian - Lect 41 Lecture 4 Sample size reviews 4.1A general approach to sample size reviews 4.2Binary data 4.3Normally distributed data 4.4 Exact results: normally distributed responses

MPS/MSc in StatisticsAdaptive & Bayesian - Lect A general approach to sample size reviews Many sample size formulae depend on nuisance parameters, the values of which have to be guessed Part way through the trial we will have plenty of data on which to base a better guess So, do that, and recalculate the sample size Now use the new sample size, perhaps within the limits of minimum and maximum possible values Assess the effect of this procedure on type I error: usually it is very small

MPS/MSc in StatisticsAdaptive & Bayesian - Lect 43 This is an adaptive design with a single interim analysis which may lead only to a reassessment of sample size Its use is becoming widespread, and the regulatory authorities are generally well-disposed towards it

MPS/MSc in StatisticsAdaptive & Bayesian - Lect Sample size review for binary data Treatments: Experimental (E) and Control (C) Success probabilities: p E and p C Hypotheses: H 0 : p E = p C H 1 : p E > p C Type I error:  (one-sided) Power: 1 – , when p E = p ER and p C = p CR Sample sizes: n E and n C, where n E + n C = n Allocation ratio: (1:1), that is n E = n C p CR is the anticipated value of p C, and an improvement from that value to p E = p ER on E would be clinically worthwhile

MPS/MSc in StatisticsAdaptive & Bayesian - Lect 45 Probability difference approach Initial sample size calculation Put  = p E  p C, and set power at  =  R = p ER  p CR Two popular formulae for n are: (Machin et al., 1997) and (4.2)

MPS/MSc in StatisticsAdaptive & Bayesian - Lect 46 To use these formulae: use previous data and experience to guess p CR consider what difference  R would be clinically important deduce p ER and Using these values, find the required sample size n Then, when data from about patients are available, a sample size review can be conducted

MPS/MSc in StatisticsAdaptive & Bayesian - Lect 47 At the sample size review we do not change  R :  this remains the clinically important difference To recompute n based on (4.1), identify the control patients and find an estimate, of p C as the success rate on C so far Replace p CR by, p ER by and To recompute n based on (4.2), we need not break the blinding: just estimate as the overall success rate in the trial as a whole (over E and C) The preservation of blindness makes the second option more attractive

MPS/MSc in StatisticsAdaptive & Bayesian - Lect 48 Log-odds ratio approach Initial sample size calculation Put and set power at  =  R computed from the values p ER and p CR The resulting sample size formula is: (4.3) This formula can be updated at a sample size review in the same way as (4.2), without breaking the blind

MPS/MSc in StatisticsAdaptive & Bayesian - Lect 49 Example  = 0.025, 1 –  = 0.90: z 1  = 1.96, z 1  = p CR = 0.3, p ER = 0.5, = 0.4 (4.1) prob diff:  R = 0.2  n = 248 (4.2) prob diff:  R = 0.2  n = 252 (4.3) log-odds ratio:  R =  n = 244

MPS/MSc in StatisticsAdaptive & Bayesian - Lect 410 After 120 observations, the sample size is reviewed We find that = 0.2, rather than 0.4 Retaining probability difference:  R = 0.2, (4.2)  n = 168  sample size goes down   R = 0.2 consistent with p CR = 0.1, p ER = 0.3 Retaining  log-odds ratio:  R = 0.847, (4.3)  n = 366  sample size goes up   R = consistent with p CR = 0.134, p ER = 0.266

MPS/MSc in StatisticsAdaptive & Bayesian - Lect Normally distributed data Treatments: Experimental (E) and Control (C) Distributions: N(  E,  2 ) and N(  C,  2 ) Hypotheses: H 0 :  E =  C H 1 :  E >  C Type I error:  (one-sided) Power: 1 – , when  E =  ER and  C =  CR Sample sizes: n E and n C, where n E + n C = n Allocation ratio: (1:1), that is n E = n C Put  =  E   C and  R =  ER   CR Let denote the anticipated common variance

MPS/MSc in StatisticsAdaptive & Bayesian - Lect 412 The sample size is given by see Slide 2.8 The actual values of  ER and  CR have no effect on n other than through  R The anticipated variance is very influential, and is replaced by an estimate at the sample size review

MPS/MSc in StatisticsAdaptive & Bayesian - Lect 413 Estimating  2 (a)Use the conventional unbiased estimate based on the n observations available so far To use this requires breaking the blind, at least as far as separating the two treatment groups: their identities need not be revealed (4.5)

MPS/MSc in StatisticsAdaptive & Bayesian - Lect 414 (b)Avoid unblinding, using a simple adjustment (Gould, 1995) For each term in (4.5) Substitute in (4.5)

MPS/MSc in Statistics Adaptive & Bayesian - Lect 415 If desired difference is present: Can use estimate for sample size review without unblinding (4.6) : The estimate of total variance

MPS/MSc in StatisticsAdaptive & Bayesian - Lect 416 (c) Avoid unblinding, using an E-M algorithm (Gould and Shih, 1992; Gould, 1995) DO NOT USE THIS METHOD! See Friede and Kieser (2002)to see why you should not

MPS/MSc in StatisticsAdaptive & Bayesian - Lect 417 Example  = 0.025, 1 –  = 0.90: z 1  = 1.96, z 1  =  R = 0.5,  R = 1.0 (4.4)  n = 168 After 80 patients:

MPS/MSc in StatisticsAdaptive & Bayesian - Lect 418 Unblinded approach From (4.5) So that From (4.4), new sample size is

MPS/MSc in StatisticsAdaptive & Bayesian - Lect 419 Blinded approach From (4.6) From (4.4), new sample size is

MPS/MSc in StatisticsAdaptive & Bayesian - Lect 420 Consider the unblinded approach to sample size review An initial sample size is set When n 1 = patients have given responses,  2 is estimated by the usual pooled variance The sample size is then recalculated (as n 2 ) and that number of subjects is taken, provided that n 2 ≥ n 1 Finally, a t-test is performed 4.4 Exact results: normally distributed responses

MPS/MSc in StatisticsAdaptive & Bayesian - Lect 421 The final t-statistic is where the subscript 2 identifies values computed for the final sample Now

MPS/MSc in StatisticsAdaptive & Bayesian - Lect 422 With the subscript + relating to the extra patients recruited after the sample size review, it can be shown that Dividing by the true value of  2, the quantities on the rhs are distributed as independent  2 random variable on n E1 – 1, n E+  1 and 1 degrees of freedom respectively A similar result holds for the control group

MPS/MSc in StatisticsAdaptive & Bayesian - Lect 423 The final t-statistic, is a function of and these quantities are mutually independent From this result, the conditional distribution of t, given, can be deduced The revised sample size depends on the data only through

MPS/MSc in StatisticsAdaptive & Bayesian - Lect 424 The density of t is given by where W represents the random variable is a t density with degrees of freedom specified by : it is a step function in w is a  2 density with n 1  2 degrees of freedom In this way, for a given value of  2, the exact properties of the sample size review procedure can be determined

MPS/MSc in StatisticsAdaptive & Bayesian - Lect 425 Example  = 0.05, 1 –  = 0.90,  = 1,  R =  n = 200 A sample size review is conducted after 100 responses Suppose that the true value of  2 is 1.43 Then the true type I error rate will be The true power will be 0.920

MPS/MSc in StatisticsAdaptive & Bayesian - Lect 426 References General methods for sample size review: Wittes and Brittain (1990) Gould (1992) Birkett and Day (1994) Exact evaluations: Kieser and Friede (2000) Friede and Kieser (2006) Montague (2007)