**************** GCRC Research-Skills Workshop October 26, 2007 William D. Dupont Department of Biostatistics **************** How to do Power & Sample.

Slides:



Advertisements
Similar presentations
Hypothesis Testing. To define a statistical Test we 1.Choose a statistic (called the test statistic) 2.Divide the range of possible values for the test.
Advertisements

Regression and correlation methods
Lesson 10: Linear Regression and Correlation
Chapter 12 Simple Linear Regression
Sampling: Final and Initial Sample Size Determination
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
EPI 809/Spring Probability Distribution of Random Error.
Cox Model With Intermitten and Error-Prone Covariate Observation Yury Gubman PhD thesis in Statistics Supervisors: Prof. David Zucker, Prof. Orly Manor.
Université d’Ottawa / University of Ottawa 2001 Bio 4118 Applied Biostatistics L10.1 CorrelationCorrelation The underlying principle of correlation analysis.
Copyright © 2008 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Managerial Economics, 9e Managerial Economics Thomas Maurice.
Chapter 10 Simple Regression.
Point estimation, interval estimation
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
Chapter 8 Estimation: Single Population
Quantitative Business Analysis for Decision Making Simple Linear Regression.
Statistical Treatment of Data Significant Figures : number of digits know with certainty + the first in doubt. Rounding off: use the same number of significant.
Sample Size Determination
Correlation and Regression Analysis
Sample Size Determination Ziad Taib March 7, 2014.
EDUC 200C Section 4 – Review Melissa Kemmerle October 19, 2012.
Analysis of Complex Survey Data
AP Statistics Section 13.1 A. Which of two popular drugs, Lipitor or Pravachol, helps lower bad cholesterol more? 4000 people with heart disease were.
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Simple Linear Regression
What do you think about a doctor who uses the wrong treatment, either wilfully or through ignorance, or who uses the right treatment wrongly (such as by.
Essentials of survival analysis How to practice evidence based oncology European School of Oncology July 2004 Antwerp, Belgium Dr. Iztok Hozo Professor.
Simple Linear Regression Models
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
Estimation of Various Population Parameters Point Estimation and Confidence Intervals Dr. M. H. Rahbar Professor of Biostatistics Department of Epidemiology.
POTH 612A Quantitative Analysis Dr. Nancy Mayo. © Nancy E. Mayo A Framework for Asking Questions Population Exposure (Level 1) Comparison Level 2 OutcomeTimePECOT.
Hypothesis of Association: Correlation
AP Statistics Section 13.1 A. Which of two popular drugs, Lipitor or Pravachol, helps lower bad cholesterol more? 4000 people with heart disease were.
Analysis of Residuals Data = Fit + Residual. Residual means left over Vertical distance of Y i from the regression hyper-plane An error of “prediction”
Copyright © 2005 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Managerial Economics Thomas Maurice eighth edition Chapter 4.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
MBP1010 – Lecture 8: March 1, Odds Ratio/Relative Risk Logistic Regression Survival Analysis Reading: papers on OR and survival analysis (Resources)
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
Simple Linear Regression ANOVA for regression (10.2)
© Department of Statistics 2012 STATS 330 Lecture 20: Slide 1 Stats 330: Lecture 20.
June 11, 2008Stat Lecture 10 - Review1 Midterm review Chapters 1-5 Statistics Lecture 10.
Chapter 12 Continuous Random Variables and their Probability Distributions.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 4 First Part.
ECON 338/ENVR 305 CLICKER QUESTIONS Statistics – Question Set #8 (from Chapter 10)
Simple linear regression Tron Anders Moger
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Inference for regression - More details about simple linear regression IPS chapter 10.2 © 2006 W.H. Freeman and Company.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
How to do Power & Sample Size Calculations Part 1 **************** GCRC Research-Skills Workshop October 18, 2007 William D. Dupont Department of Biostatistics.
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
Week 21 Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Statistics and probability Dr. Khaled Ismael Almghari Phone No:
Class Six Turn In: Chapter 15: 30, 32, 38, 44, 48, 50 Chapter 17: 28, 38, 44 For Class Seven: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 Read.
Sampling and Sampling Distributions
Chapter 4: Basic Estimation Techniques
BIOST 513 Discussion Section - Week 10
Review 1. Describing variables.
How many study subjects are required ? (Estimation of Sample size) By Dr.Shaik Shaffi Ahamed Associate Professor Dept. of Family & Community Medicine.
Basic Estimation Techniques
How to use the PS sample size software for advanced applications
Statistics 103 Monday, July 10, 2017.
Basic Estimation Techniques
Practice Mid-Term Exam
Review of Hypothesis Testing
Descriptive and inferential statistics. Confidence interval
Comparing Populations
BUS173: Applied Statistics
Lecture11 review for final examination
Statistical Thinking and Applications
Sample Size and Power Part II
Presentation transcript:

**************** GCRC Research-Skills Workshop October 26, 2007 William D. Dupont Department of Biostatistics **************** How to do Power & Sample Size Calculations Part 2

Survival Analysis Follow-up time Fate at exit Statistic: Log-rank test Power: Schoenfeld & Richter, Biometrics 1982 Time Sample Size 0 Accrual time A Additional follow-up F

Hemorage-free survival in patients with previous lobular introcerebral hemmorrhage subdivided by apolipoprotein E genotype (O’Donnell et al. 2000).

Example: Hemorrhage-free survival and genotype Control group = patients with an e2 or e4 allele Pr[Type I error]  = 0.05 Power = 0.8 Relative Risk (control/experiment) R = 2 Medial survival time of controls m 1 = ?

Hemorage-free survival in patients with previous lobular intracerebral hemmorrhage subdivided by apolipoprotein E genotype (O’Donnell et al. 2000).

Example: Hemorrhage-free survival and genotype Control group = patients with an e2 or e4 allele Pr[Type I error]  = 0.05 Power = 0.8 Relative Risk (control/experiment) R = 2 Medial survival time of controls m 1 = 38 Accrual A = 12 months Additional Follow-up F = 24 months

What if we wanted to study a treatment of Homozygous e3/e3 patients?

= some time = probability of survival at time = If survival has an exponential distribution

= some time = probability of survival at time = If survival has an exponential distribution =

Median Survival Experimental treatment Control treatment Relative Risk (Hazard Ratio) for controls relative to experimental subjects Median survival for experimental subjects Median survival for controls = = =

For t tests power calculations for increased or decreased response relative to control response are symmetric. i.e. The power to detect is the same as the power to detect This is not true for Survival Analysis. The power to detect a two-fold increase in hazard does not equal the power to detect a 50% decrease in hazard.

Power to detect treatment 1 vs. control greater than treatment 1 vs.control Treatment 1 Treatment 2 Control Relative Risk Control vs. Treatment 1 2 Control vs. Treatment It is important to read the parameter definitions carefully.

Power Calculations for Linear Regression

s x = 4.75                   Estriol (mg/24 hr) Birthweight (g/100)     Rosner Table 11.1 Am J Obs Gyn 1963;85:1-9 = standard deviation of y variable = standard deviation of x variable

Estimated by s = root mean squared error (MSE) = standard deviation of the regression errors

r = 0 r = -1 0 < r < 1 r = 1 c) is estimated by {1.2} Correlation coefficient

Measures of association between continuous variables Correlation vs. linear regression

Independent Variable x Response Variable y Treatment   A B Both treatments have identical values of  x = 6,  y = 20  x = 2,  y = 5

x j  x j y j j th Regression Error Treatment Dose Level  Regression Line  x 1 Unit of Dosage 0 Patient Response

c) Slope parameter estimate  a.k.a.  ) is estimated by b = r s y /s x

r = 0 r = -1 0 < r < 1 r = 1 is estimated by b = r s y /s x

Studies can be either experimental or observational

Normally distributed unless investigator chooses level Treatment level Chosen by investigator

Estimating s directly is often difficult s = = If we can estimate s Warning: If the anticipated value of in your experiment is different from that found in the literature then your value of r will also be different. or

Independent Variable x Response Variable y

Relationship between BMI and exercise time n = 100 women willing to follow a diet exercise program for six months Interquartile range (IQR) of exercise time = 15 minutes (pilot data) = (IQR / 2) / z 0.25 = 7.5 / = = 4.0 kg/m 2 = standard deviation of BMI for women obtained by Kuskowska-Wolk et al. Int J Obes 1992;16:1-9 Would like to detect a true drop in BMI of = kg/m 2 per minute of exercise (1/2 hour of exercise per day induces a drop of 2 kg/m 2 over 6 months)

When     Interquartile Range = (IQR / 2) / z 0.25 In general

Impaired Antibody Response to Pneumococcal Vaccine after treatment for Hodgkin’s Disease Siber et al. N Engl J Med 1978;299: n = 17 patients treated with subtotal radiation. vaccinated 8 to 51 months later A linear regression of log antibody response against time from radiation to vaccination gave Suppose we wanted to determine the sample size for a new study with patients randomized to vaccination at 10, 30, or 50 weeks

Composing Slopes of Two Linear Regressin Lines Armitage and Berry (1994) gave age and pulmonary vital capacity for 28 cadmium industry workers with > 10 years of exposure 44 workers with no exposure

(unexposed) (exposed) pooled error variance How many workers do we need to detect with ratio of unexposed workers m = 44/28 = 1.57? Need 167 exposed workers and 167 x 1.57 = 262 unexposed workers

(unexposed) (exposed) pooled error variance How many workers do we need to detect with ratio of unexposed workers m = 44/28 = 1.57? Need 167 exposed workers and 167 x 1.57 = 262 unexposed workers