Experimental design and sample size determination
Karl W Broman
Department of Biostatistics, Johns Hopkins University

Note: This is a shortened version of a lecture that is part of a web-based course on “Enhancing Humane Science/Improving Animal Research” (organized by Alan Goldberg, Johns Hopkins Center for Alternatives to Animal Testing). Few details; mostly concepts.

Experimental design

Basic principles
1. Formulate question/goal in advance
2. Comparison/control
3. Replication
4. Randomization
5. Stratification (aka blocking)
6. Factorial experiments

Example
Question: Does salted drinking water affect blood pressure (BP) in mice?
Experiment:
1. Provide a mouse with water containing 1% NaCl.
2. Wait 14 days.
3. Measure BP.

Comparison/control
Good experiments are comparative.
– Compare BP in mice fed salt water to BP in mice fed plain water.
– Compare BP in strain A mice fed salt water to BP in strain B mice fed salt water.
Ideally, the experimental group is compared to concurrent controls (rather than to historical controls).

Replication

Why replicate?
Reduce the effect of uncontrolled variation (i.e., increase precision).
Quantify uncertainty.
A related point: An estimate is of no value without some statement of the uncertainty in the estimate.
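
A minimal sketch of why replication increases precision, assuming independent measurements with a common standard deviation: the standard error of a group mean shrinks like 1/sqrt(n). The SD of 10 mmHg and the function name are hypothetical, chosen only for illustration.

```python
import math

def standard_error(sd: float, n: int) -> float:
    """Standard error of the mean of n independent measurements with SD sd."""
    if n < 1:
        raise ValueError("n must be at least 1")
    return sd / math.sqrt(n)

# Hypothetical measurement SD in mmHg; not a value from the lecture.
sd_bp = 10.0
for n in (1, 4, 16):
    # Quadrupling the number of replicates halves the standard error.
    print(n, standard_error(sd_bp, n))
```

The same quantity is what lets you attach a statement of uncertainty to an estimate.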

Randomization
Experimental subjects (“units”) should be assigned to treatment groups at random.
At random does not mean haphazardly.
One needs to explicitly randomize using
– a computer, or
– coins, dice, or cards.
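
Explicit randomization by computer can be as simple as shuffling the list of subjects and dealing them out to the groups. The sketch below assumes 12 mice and two groups; the mouse IDs, group labels, and seed are hypothetical.

```python
import random

def randomize(units, groups, seed=None):
    """Randomly assign units to groups of (nearly) equal size."""
    rng = random.Random(seed)      # a fixed seed makes the assignment reproducible
    shuffled = list(units)
    rng.shuffle(shuffled)          # random order, not "whichever mouse you grab first"
    assignment = {g: [] for g in groups}
    for i, unit in enumerate(shuffled):
        # Deal the shuffled units out to the groups in turn.
        assignment[groups[i % len(groups)]].append(unit)
    return assignment

mice = [f"mouse{i:02d}" for i in range(1, 13)]   # hypothetical IDs
assignment = randomize(mice, ["salt water", "plain water"], seed=2024)
for group, members in assignment.items():
    print(group, sorted(members))
```

Recording the seed alongside the lab notes documents that the assignment was truly random.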

Why randomize?
Avoid bias.
– For example: the first six mice you grab may have intrinsically higher BP.
Control the role of chance.
– Randomization allows the later use of probability theory, and so gives a solid foundation for statistical analysis.

Stratification
Suppose that some BP measurements will be made in the morning and some in the afternoon.
If you anticipate a difference between morning and afternoon measurements:
– Ensure that within each period, there are equal numbers of subjects in each treatment group.
– Take account of the difference between periods in your analysis.
This is sometimes called “blocking”.

Example
20 male mice and 20 female mice.
Half to be treated; the other half left untreated.
Can only work with 4 mice per day.
Question: How to assign individuals to treatment groups and to days?

An extremely bad design

Randomized

A stratified design
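
One possible stratified design for the 20 male / 20 female example, sketched under assumptions: each of 10 days gets 2 males and 2 females, and within each sex on each day one animal is treated and one is left untreated, with the animals themselves placed at random. The slide's figure is not available, so this is not necessarily the layout it showed; animal IDs are hypothetical.

```python
import random

def stratified_design(n_days=10, seed=None):
    """Schedule 2 males and 2 females per day; one of each sex is treated."""
    rng = random.Random(seed)
    males = [f"M{i:02d}" for i in range(1, 2 * n_days + 1)]    # hypothetical IDs
    females = [f"F{i:02d}" for i in range(1, 2 * n_days + 1)]
    # Randomize within each stratum (sex): shuffling decides both the day
    # and the treatment each animal receives.
    rng.shuffle(males)
    rng.shuffle(females)
    schedule = []
    for day in range(n_days):
        for sex, animals in (("male", males), ("female", females)):
            first, second = animals[2 * day], animals[2 * day + 1]
            schedule.append({"day": day + 1, "sex": sex,
                             "animal": first, "treatment": "treated"})
            schedule.append({"day": day + 1, "sex": sex,
                             "animal": second, "treatment": "untreated"})
    return schedule

for row in stratified_design(seed=1)[:4]:
    print(row)
```

Because sex and day are balanced across treatments by construction, neither can be confounded with the treatment effect.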

Randomization and stratification
If you can (and want to), fix a variable.
– e.g., use only 8 week old male mice from a single strain.
If you don’t fix a variable, stratify it.
– e.g., use both 8 week and 12 week old male mice, and stratify with respect to age.
If you can neither fix nor stratify a variable, randomize it.

Factorial experiments
Suppose we are interested in the effect of both salt water and a high-fat diet on blood pressure.
Ideally: look at all 4 treatments in one experiment.
(Plain water or salt water) × (normal diet or high-fat diet)
Why?
– We can learn more.
– More efficient than doing all single-factor experiments.
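
The four treatments of this two-factor experiment are simply all combinations of the factor levels, which can be enumerated as a sketch like this (variable names are illustrative):

```python
from itertools import product

water = ["plain water", "salt water"]
diet = ["normal diet", "high-fat diet"]

# Every combination of one water level with one diet level: 2 x 2 = 4 treatments.
treatments = list(product(water, diet))
for w, d in treatments:
    print(f"{w} + {d}")
```

Each animal in such a design contributes information about both factors, which is where the efficiency comes from.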

Interactions

Other points
Blinding
– Measurements made by people can be influenced by unconscious biases.
– Ideally, dissections and measurements should be made without knowledge of the treatment applied.
Internal controls
– It can be useful to use the subjects themselves as their own controls (e.g., consider the response after vs. before treatment).
– Why? Increased precision.
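
A small sketch of why internal controls can increase precision, under the assumption that before and after measurements on the same subject have the same SD and a correlation rho: the variance of the after-minus-before difference is 2·sd²·(1 − rho), compared with 2·sd² for two unrelated subjects. The numbers below are hypothetical.

```python
def var_paired_difference(sd: float, rho: float) -> float:
    """Variance of (after - before) when both have SD sd and correlation rho."""
    if not -1.0 <= rho <= 1.0:
        raise ValueError("rho must be between -1 and 1")
    return 2 * sd**2 * (1 - rho)

sd_bp = 10.0                                   # hypothetical SD in mmHg
print(var_paired_difference(sd_bp, 0.0))       # uncorrelated: same as two separate subjects
print(var_paired_difference(sd_bp, 0.8))       # strongly correlated: much smaller variance
```

The gain depends entirely on how strongly the repeated measurements are correlated; with rho near zero there is none.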

Other points
Representativeness
– Are the subjects/tissues you are studying really representative of the population you want to study?
– Ideally, your study material is a random sample from the population of interest.

Summary
Characteristics of good experiments:
Unbiased
– Randomization
– Blinding
High precision
– Uniform material
– Replication
– Blocking
Simple
– Protect against mistakes
Wide range of applicability
– Deliberate variation
– Factorial designs
Able to estimate uncertainty
– Replication
– Randomization

Data presentation
[Figures: bad plot, good plot]

Data presentation
Good table:
Treatment  Mean (SEM)
A          11.2 (0.6)
B          13.4 (0.8)
C          14.7 (0.6)
Bad table:
Treatment  Mean (SEM)
A          (0.63)
B          13.49 (0.7913)
C          14.787 (0.6108)

Sample size determination

Fundamental formula

Listen to the IACUC
Too few animals → a total waste
Too many animals → a partial waste

Significance test
Compare the BP of 6 mice fed salt water to 6 mice fed plain water.
Δ = true difference in average BP (the treatment effect).
H0: Δ = 0 (i.e., no effect)
Test statistic, D.
If |D| > C, reject H0.
C chosen so that the chance you reject H0, if H0 is true, is 5%.
[Figure: distribution of D when Δ = 0]
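
As a sketch of how C might be chosen, assume D is the difference between the two group means and that measurements are normal with a known, common SD (a simplification; with an estimated SD a t-test would normally be used). Then D has standard error sd·sqrt(2/n) when there is no effect, and C is the matching two-sided normal quantile. The SD of 10 mmHg is hypothetical.

```python
import math
from statistics import NormalDist

def critical_value(sd: float, n_per_group: int, alpha: float = 0.05) -> float:
    """Cutoff C such that P(|D| > C) = alpha when the true difference is 0.

    Assumes D is a difference of two group means, each from n_per_group
    independent normal measurements with known standard deviation sd.
    """
    se = sd * math.sqrt(2 / n_per_group)          # standard error of D
    return NormalDist().inv_cdf(1 - alpha / 2) * se

# Hypothetical SD; 6 mice per group as in the example.
print(round(critical_value(sd=10.0, n_per_group=6), 2))
```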

Statistical power
Power = The chance that you reject H0 when H0 is false (i.e., you [correctly] conclude that there is a treatment effect when there really is a treatment effect).

Power depends on…
The structure of the experiment
The method for analyzing the data
The size of the true underlying effect
The variability in the measurements
The chosen significance level (α)
The sample size
Note: We usually try to determine the sample size to give a particular power (often 80%).
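
To make these dependencies concrete, here is a rough sketch of a power calculation for comparing two group means, assuming normal measurements with a known common SD and a two-sided test (a normal approximation, not necessarily the analysis used in the lecture). The function name, the SD of 10, and the example inputs are illustrative assumptions.

```python
import math
from statistics import NormalDist

def power(delta: float, sd: float, n_per_group: int, alpha: float = 0.05) -> float:
    """Approximate power of a two-sided two-sample z-test.

    delta is the true difference in means, sd the known common SD,
    n_per_group the number of subjects in each group.
    """
    z = NormalDist()
    se = sd * math.sqrt(2 / n_per_group)     # standard error of the difference in means
    c = z.inv_cdf(1 - alpha / 2)             # critical value on the standardized scale
    shift = delta / se                       # standardized true effect
    # Probability that the standardized statistic falls above c or below -c.
    return (1 - z.cdf(c - shift)) + z.cdf(-c - shift)

for n in (6, 12):
    for delta in (8.5, 12.5):
        print(n, delta, round(power(delta, sd=10.0, n_per_group=n), 2))
```

With no true effect the function returns the significance level itself; power rises with the sample size and the effect size, and falls as the SD grows or the significance level is made more stringent.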

Effect of sample size
[Figures: 6 per group; 12 per group]

Effect of the effect
[Figures: Δ = 8.5; Δ = 12.5]

Various effects
Desired power ↑ ⇒ sample size ↑
Stringency of statistical test ↑ ⇒ sample size ↑
Measurement variability ↑ ⇒ sample size ↑
Treatment effect ↑ ⇒ sample size ↓

Determining sample size
The things you need to know:
Structure of the experiment
Method for analysis
Chosen significance level, α (usually 5%)
Desired power (usually 80%)
Variability in the measurements
– if necessary, perform a pilot study
The smallest meaningful effect
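
For the simplest structure (two equal groups, comparison of means), these ingredients combine in a standard textbook normal approximation: n per group ≈ 2·sd²·(z for 1 − α/2 + z for the power)² / effect². The sketch below implements that approximation; it is not a formula taken from the lecture, and the SD and effect values are hypothetical. An exact calculation based on the t distribution gives slightly larger numbers.

```python
import math
from statistics import NormalDist

def sample_size_per_group(delta: float, sd: float,
                          alpha: float = 0.05, power: float = 0.80) -> int:
    """Approximate subjects per group for a two-sided two-sample comparison of means."""
    if delta <= 0 or sd <= 0:
        raise ValueError("delta and sd must be positive")
    z = NormalDist()
    z_alpha = z.inv_cdf(1 - alpha / 2)   # quantile for the significance level
    z_power = z.inv_cdf(power)           # quantile for the desired power
    n = 2 * (sd * (z_alpha + z_power) / delta) ** 2
    return math.ceil(n)                  # round up to whole animals

# Hypothetical inputs: SD of 10 mmHg, smallest meaningful effects of 10 and 5 mmHg.
print(sample_size_per_group(delta=10.0, sd=10.0))
print(sample_size_per_group(delta=5.0, sd=10.0))
```

Halving the smallest meaningful effect roughly quadruples the required sample size, which is why that input deserves careful thought.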

A formula
Censored

Reducing sample size
Reduce the number of treatment groups being compared.
Find a more precise measurement (e.g., average time to effect rather than proportion sick).
Decrease the variability in the measurements.
– Make subjects more homogeneous.
– Use stratification.
– Control for other variables (e.g., weight).
– Average multiple measurements on each subject.
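
As an illustration of the last point, assume a simple variance-components model in which each measurement is the subject's true value (SD between subjects) plus independent measurement error (SD within subject). Averaging m measurements per subject shrinks only the within-subject part. The function name and the numbers are hypothetical.

```python
import math

def sd_of_subject_mean(sd_between: float, sd_within: float, m: int) -> float:
    """SD of a subject's average over m repeated measurements."""
    if m < 1:
        raise ValueError("m must be at least 1")
    # Between-subject variance is untouched; measurement-error variance is divided by m.
    return math.sqrt(sd_between**2 + sd_within**2 / m)

for m in (1, 4, 16):
    print(m, round(sd_of_subject_mean(sd_between=3.0, sd_within=4.0, m=m), 2))
```

The returns diminish quickly: once measurement error is small relative to the variation between subjects, more measurements per subject help little and more subjects are needed instead.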

Final conclusions
Experiments should be designed.
Good design and good analysis can lead to reduced sample sizes.
Consult an expert on both the analysis and the design of your experiment.

Resources
ML Samuels, JA Witmer (2003) Statistics for the Life Sciences, 3rd edition. Prentice Hall.
– An excellent introductory text.
GW Oehlert (2000) A First Course in Design and Analysis of Experiments. WH Freeman & Co.
– Includes a more advanced treatment of experimental design.
Course: Statistics for Laboratory Scientists (Biostatistics, Johns Hopkins Bloomberg Sch. Pub. Health)
– Introductory statistics course, intended for experimental scientists.
– Greatly expands upon the topics presented here.