Presentation transcript:

Vanderbilt University Medical Center SRC Presentation Vincent Kokouvi Agboto Assistant Professor/Director of Biostatistics, Meharry Medical College Assistant Professor of Biostatistics, Vanderbilt University Medical Center

Introduction to Experimental Designs in Biological and Clinical Settings.

Overview 1. Introduction 2. Examples of Classical Designs 3. Optimal Experimental Design 4. Other Design Issues 5. Conclusion

1. Introduction Experiment: An investigation in which the investigator applies treatments to experimental units and then observes the effects of the treatments on those units by measuring one or more responses.

1. Introduction Treatment: Set of conditions applied to experimental units in an experiment. Experimental Unit: Physical entity to which a treatment is randomly assigned and independently applied.

1. Introduction Response variable: Characteristic of an experimental unit that is measured after treatment and analyzed to assess the effects of treatments on experimental units. Observational Unit: Unit on which a response variable is measured.

1. Introduction Experimental design procedure: Decisions made before data collection. Basic idea: Appropriate selection of the values of the control variables. Three fundamental concepts of experimental design: Randomization, Blocking, Replication (R. A. Fisher).

1. Introduction Important stages of experimental research: Background of the experiment; Choice of factors; Reduction of error; Choice of model; Design criterion and size of the design; Choice of an experimental design; Conduct of the experiment; Analysis of the data.

1. Introduction Classical (standard) designs. Optimal experimental design: The only alternative when the standard designs do not provide adequate answers.

2. Examples of Classical Designs Example 1: Soil moisture and gene expression in maize seedlings. Example 2: Drug and feed consumption and gene expression in rats. Example 3: Treatments and gene expression in dairy cattle.

Example 1 Experiment: Effect of three soil moisture levels on gene expression in maize seedlings. A total of 36 seedlings were grown in 12 pots, with 3 seedlings per pot. The three soil moisture levels (low, medium, high) were randomly assigned to the 12 pots. After three weeks, RNA was extracted from the above-ground tissues of each seedling. Each of the 36 RNA samples was hybridized to a microarray slide to measure gene expression.

Example 1 (continued) Treatment: The three moisture levels. Experimental unit: Moisture levels were randomly assigned to the pots, so the pots are the experimental units; a pot of 3 seedlings is one experimental unit. Observational unit: Gene expression was measured for each seedling, so the seedlings are the observational units. Response variable: Each probe on the microarray slide provides one response variable. This is the standard completely randomized design (CRD).

Example 2 Experiment: Gauge the effects of a drug and feed consumption on gene expression in rats. A total of 40 rats were housed in individual cages. Half of the rats received a calorie-restricted diet (R); the other half had access to full feeders, so their calorie intake was unrestricted (U). Within each diet group, four doses of an experimental drug (1, 2, 3, 4) were assigned to rats, with 5 rats per dose within each diet group.

Example 2 (continued) At the conclusion of the study, gene expression was measured for each rat using microarrays.

Example 2 (continued) Treatment factors: Diet and Drug. Factor Diet (R, U); Factor Drug (1, 2, 3, 4). Each combination of diet and drug is a treatment (R1, R2, R3, R4, U1, U2, U3, U4). Each rat is both an experimental unit and an observational unit. Response variable: Each probe on the microarray slide. This is a full factorial treatment design, because all possible combinations of diet and drug were considered.
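The factorial structure of Example 2 can be sketched in a few lines of Python. This is only an illustration: the rat IDs 1 to 40, the fixed seed, and the function name `assign_rats` are assumptions, and the slide does not say how the diet groups themselves were formed, so random allocation to diet is assumed here as well.

```python
import itertools
import random

DIETS = ["R", "U"]          # restricted, unrestricted
DOSES = [1, 2, 3, 4]        # four drug doses

# Full factorial: every diet is crossed with every dose (2 x 4 = 8 treatments).
TREATMENTS = [f"{diet}{dose}" for diet, dose in itertools.product(DIETS, DOSES)]


def assign_rats(n_rats=40, seed=0):
    """Randomly split rats into two diet groups, then give each dose
    to an equal number of rats within each diet group (hypothetical helper)."""
    rng = random.Random(seed)
    rats = list(range(1, n_rats + 1))
    rng.shuffle(rats)                      # random order = random allocation
    half = n_rats // 2
    groups = {"R": rats[:half], "U": rats[half:]}
    per_dose = half // len(DOSES)          # 5 rats per dose when n_rats = 40
    assignment = {}
    for diet, members in groups.items():
        for position, rat in enumerate(members):
            dose = DOSES[position // per_dose]
            assignment[rat] = f"{diet}{dose}"
    return assignment


if __name__ == "__main__":
    plan = assign_rats()
    for rat in sorted(plan):
        print(rat, plan[rat])
```

Because the rats are shuffled before the groups are cut, assigning doses to consecutive positions within a group is still a random assignment.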

Example 3 Experiment: Study the effects of 5 treatments (A, B, C, D, E) on gene expression in dairy cattle. A total of 25 GeneChips and a total of 25 cows, located on 5 farms with 5 cows on each farm are available for the experiment. Which of the following designs is better from a statistical standpoint?

Example 3 (continued) Design 1: To reduce variability within treatment groups, randomly assign the 5 treatments to the 5 farms, so that all 5 cows on any one farm receive the same treatment. Measure gene expression using one GeneChip for each cow. Design 2: Randomly assign the 5 treatments to the 5 cows within each farm, so that all 5 treatments are represented on each farm. Measure gene expression using one GeneChip for each cow.

Example 3 (continued)
          Design 1      Design 2
Farm 1:   B B B B B     A B E D C
Farm 2:   D D D D D     E D A C B
Farm 3:   A A A A A     C D E A B
Farm 4:   E E E E E     A B E C D
Farm 5:   C C C C C     C A D B E

Example 3 (continued) Observational units: Cows in both designs. Experimental units: Farms in Design 1 and cows in Design 2. Design 2 is a randomized complete block design (RCBD), with the group of 5 cows on a farm serving as a block of experimental units. Design 1 has no replication, because there is only 1 experimental unit for each treatment. Design 2 has 5 replications per treatment.
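The two randomization schemes of Example 3 differ only in where the shuffle happens, which a short Python sketch makes concrete. The function names `design_1` and `design_2` and the fixed seeds are illustrative assumptions, not part of the original slides.

```python
import random

TREATMENTS = ["A", "B", "C", "D", "E"]
FARMS = [1, 2, 3, 4, 5]
COWS_PER_FARM = 5


def design_1(seed=0):
    """One shuffle across farms: every cow on a farm gets the same treatment,
    so the farm is the experimental unit."""
    rng = random.Random(seed)
    order = TREATMENTS[:]
    rng.shuffle(order)
    return {farm: [trt] * COWS_PER_FARM for farm, trt in zip(FARMS, order)}


def design_2(seed=0):
    """Separate shuffle within each farm (block): every farm sees all five
    treatments, so the cow is the experimental unit (RCBD)."""
    rng = random.Random(seed)
    layout = {}
    for farm in FARMS:
        order = TREATMENTS[:]
        rng.shuffle(order)
        layout[farm] = order
    return layout


if __name__ == "__main__":
    for name, layout in (("Design 1", design_1()), ("Design 2", design_2())):
        print(name)
        for farm, cows in layout.items():
            print(f"  Farm {farm}: {' '.join(cows)}")
```

In `design_1` treatment and farm are completely confounded, while in `design_2` each treatment appears once on every farm.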

Example 3 (continued) Design 2 is by far the better design. We can compare treatments directly among cows that share the same environment. With Design 1, it is impossible to separate differences in expression due to treatment effects from differences in expression due to farm effects.

3. Optimal Experimental Design 3.1. Motivating Example 3.2. Comments on Orthogonal Designs 3.3. Some Examples of Non-Orthogonal Designs 3.4. Optimal Designs

3.1. Motivating Example Suppose that the yield is linearly related to temperature, whose range is [50, 150]: Y = a + bX. If we want to conduct experiments at two points, which of the following should we choose? Design 1: at 50 and 150. Design 2: at 70 and 130. Design 3: at 90 and 110.

3.1. Motivating Example What is the optimal design in this case? That is, which is the best of the three designs mentioned?

3.1. Motivating Example It is Design 1, because it gives the smallest confidence region for the parameters (D-optimality) and also gives the smallest maximum variance for the predicted responses (G-optimality).
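The claim about the three two-point designs can be checked numerically. The sketch below is a minimal illustration in plain Python for the straight-line model, so f(x) = (1, x) and X'X is a 2x2 matrix. The function names, and the 101-point grid used to approximate the maximum over [50, 150], are assumptions made for this example.

```python
# Straight-line model y = a + b*x, so f(x) = (1, x) and X'X is 2x2.

def information_matrix(points):
    """Return X'X = [[n, sum x], [sum x, sum x^2]] for the design points."""
    n = len(points)
    sx = sum(points)
    sxx = sum(x * x for x in points)
    return [[n, sx], [sx, sxx]]


def d_criterion(points):
    """Determinant |X'X|; larger means a smaller confidence region."""
    m = information_matrix(points)
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]


def prediction_variance(points, x):
    """d(x) = f'(x) (X'X)^{-1} f(x), using the closed-form 2x2 inverse."""
    m = information_matrix(points)
    det = d_criterion(points)
    return (m[1][1] - 2 * m[0][1] * x + m[0][0] * x * x) / det


def g_criterion(points, lo=50.0, hi=150.0, steps=101):
    """Largest d(x) over an evenly spaced grid on [lo, hi]; smaller is better."""
    grid = [lo + (hi - lo) * i / (steps - 1) for i in range(steps)]
    return max(prediction_variance(points, x) for x in grid)


DESIGNS = {
    "Design 1": [50, 150],
    "Design 2": [70, 130],
    "Design 3": [90, 110],
}

if __name__ == "__main__":
    for name, pts in DESIGNS.items():
        print(name, d_criterion(pts), round(g_criterion(pts), 3))
```

Design 1 has the largest determinant (10000 against 3600 and 400) and the smallest maximum prediction variance (1 against roughly 1.89 and 13), in line with the slide: spreading the two runs to the ends of the range is best for a straight-line fit.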

3.2. Comments on orthogonal Designs Pros (Many desirable properties) - Easy to calculate - Easy to interpret - Maximum Precision (in some sense) - Tabled designs widely available

3.2. Comments on Orthogonal Designs Cons: Not applicable in case of - Irregular design space - Mixture experiments - Sample size not a power of 2 - Mixed qualitative and quantitative factors - Fixed covariates - Nonlinear models

3.3. Some Examples of Non-Orthogonal Designs 16-run design with 8 two-level factors, with main effects and 6 interactions: BC, CH, BH, DE, EF, DF. 12-run mixed-level design with one three-level factor and 9 two-level factors.

3.4. Optimal Designs Optimal Experimental Design (OED): Standard alternative when classical designs are not applicable. Choice of a particular experimental design: Depends on the experimenter’s design criterion (an optimization problem). OED reduces the cost of experimentation by allowing statistical models to be estimated with fewer experimental runs; designs are evaluated using statistical criteria.

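3.4. Optimal Designs Model: $Y = X\beta + \varepsilon$ with $\varepsilon \sim N(0, \sigma^2 I)$, so that $Y_{n \times 1} \sim N(X\beta, \sigma^2 I)$. $X_{n \times p}$: design matrix; $\beta$: unknown $p \times 1$ parameter vector; $\sigma^2$: known. For a single run, $y(x_i) = f'(x_i)\beta + \varepsilon_i$, and $X = [f(x_1), \ldots, f(x_n)]'$.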

3.4. Optimal Designs Design $\xi$: Probability measure over a compact region $\mathcal{X}$ with $\xi(x_i) = w_i$, i.e. $\xi$ places weight $w_i$ on $x_i$. Problem: $n\,\xi(x_i)$ is not necessarily an integer.

3.4. Optimal Designs Approximate design: $\xi = \begin{Bmatrix} x_1 & x_2 & \cdots & x_n \\ w_1 & w_2 & \cdots & w_n \end{Bmatrix}$ with $\int_{\mathcal{X}} \xi(dx) = 1$ and $0 \le w_i \le 1$. Exact design: $n\,\xi(x_i)$ must be an integer.

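3.4. Optimal Designs Information matrix of $\xi$: $M(\xi) = \int_{\mathcal{X}} m(x)\,\xi(dx) = \int_{\mathcal{X}} f(x) f'(x)\,\xi(dx) = \sum_i w_i f(x_i) f'(x_i)$; for an exact $n$-run design, $n M(\xi) = X'X$. Optimality criterion: $\xi^* = \arg\max_{\xi} \Psi(M(\xi))$.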

3.5. Some Useful Criteria
D-Optimality: $\max |X'X|$
A-Optimality: $\min \{\operatorname{trace}\,(X'X)^{-1}\}$
G-Optimality: $\min \{\max_x d(x)\}$, where $d(x) = f'(x)(X'X)^{-1} f(x)$
V-Optimality: $\min \{\operatorname{average}_x\, d(x)\}$
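As a worked illustration, the four criteria can be evaluated by hand for the straight-line model of Section 3.1 with runs at 50 and 150, so that $f(x) = (1, x)'$. The average in the V-criterion is taken uniformly over $[50, 150]$, which is an assumption made here for the example; the slides do not specify the averaging region.

```latex
X = \begin{pmatrix} 1 & 50 \\ 1 & 150 \end{pmatrix}, \qquad
X'X = \begin{pmatrix} 2 & 200 \\ 200 & 25000 \end{pmatrix}, \qquad
|X'X| = 2 \cdot 25000 - 200^2 = 10000

(X'X)^{-1} = \frac{1}{10000}
\begin{pmatrix} 25000 & -200 \\ -200 & 2 \end{pmatrix}, \qquad
\operatorname{trace}\,(X'X)^{-1} = \frac{25002}{10000} = 2.5002

d(x) = f'(x)(X'X)^{-1} f(x)
     = \frac{2x^2 - 400x + 25000}{10000}
     = \frac{2(x - 100)^2 + 5000}{10000}

\max_{x \in [50,150]} d(x) = d(50) = d(150) = 1, \qquad
\frac{1}{100} \int_{50}^{150} d(x)\,dx
  = \frac{2 \cdot \tfrac{2500}{3} + 5000}{10000} = \frac{2}{3}
```

So this design has a D-value of 10000, an A-value of 2.5002, a G-value of 1, and a V-value of 2/3 under the stated averaging assumption.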

3.5. Some Useful Criteria D and A-Optimality: Estimation based criteria. G and V-Optimality: Prediction based criteria.

3.6. Algorithms for Optimal Designs The development of efficient computing methods and high-powered computer systems has led to great interest in algorithmic approaches. In general: It is difficult to find exact designs analytically. Finding exact designs means solving a large nonlinear mixed-integer programming problem. In practice: Find designs that are close to the best design (locally optimal), which motivates the introduction of exact design algorithms.

3.6. Algorithms for Optimal Designs Typical exact design algorithm steps: - Choose an initial feasible design - Modify the solution slightly, by exchanging a point in the design for a point in the design space.
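The two steps above can be sketched as a simple point-exchange search for a D-optimal design. This is a simplified illustration in the spirit of exchange algorithms, not a faithful implementation of any of the named algorithms. The quadratic model f(x) = (1, x, x^2), the candidate grid on [-1, 1], the evenly spaced starting design, and all function names are assumptions chosen for the example.

```python
def model_terms(x):
    """Regression terms f(x) for an assumed quadratic model."""
    return (1.0, x, x * x)


def d_value(design):
    """Determinant of the 3x3 matrix X'X for the given list of runs."""
    rows = [model_terms(x) for x in design]
    m = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    (a, b, c), (d, e, f), (g, h, k) = m
    return a * (e * k - f * h) - b * (d * k - f * g) + c * (d * h - e * g)


def initial_design(candidates, n_runs):
    """Initial feasible design: evenly spaced candidate points (n_runs >= 2)."""
    step = (len(candidates) - 1) / (n_runs - 1)
    return [candidates[round(i * step)] for i in range(n_runs)]


def exchange(candidates, n_runs, tol=1e-9):
    """Swap one design point at a time for a candidate point whenever the
    swap increases |X'X|; stop when a full pass finds no improvement."""
    design = initial_design(candidates, n_runs)
    best = d_value(design)
    improved = True
    while improved:
        improved = False
        for pos in range(n_runs):
            for cand in candidates:
                trial = design[:pos] + [cand] + design[pos + 1:]
                value = d_value(trial)
                if value > best + tol:
                    design, best = trial, value
                    improved = True
    return sorted(design), best


# Candidate set: 21 equally spaced points on the design space [-1, 1].
CANDIDATES = [k / 10 - 1.0 for k in range(21)]

if __name__ == "__main__":
    design, value = exchange(CANDIDATES, n_runs=6)
    print(design, value)
```

The search stops at a design that no single exchange can improve, which is a local optimum in the sense of the previous slide. For this model, placing two runs at each of -1, 0 and 1 gives |X'X| = 32, the best value possible for six runs, so the result of the search can be judged against that benchmark.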

3.6. Algorithms for Optimal Designs Fedorov algorithm (Fedorov, 1969). Modified Fedorov algorithm (Johnson and Nachtsheim, 1983). K-L exchange algorithm (Donev and Atkinson, 1988). Coordinate exchange algorithm (Meyer and Nachtsheim, 1995). Columnwise-pairwise (CP) algorithm (Wu and Li, 1999).

3.7. Software for the Computation of Optimal Designs SAS, JMP, Matlab, R, C++

4. Other Design Issues Supersaturated designs; Bayesian designs; Model-robust designs; Model discrimination designs.

5. Conclusion All problems are different. Statistical knowledge will help improve the design. Get the statistician (biostatistician) involved early in the process. Collaborate closely with people who know the background of the study. Even the most sophisticated statistical analysis cannot do much to save a study based on a “bad design”.

References
Agboto, V. “Bayesian approaches to model robust and model discrimination designs”. Unpublished Ph.D. dissertation, School of Statistics, University of Minnesota.
Agboto, V., Nachtsheim, C. & Li, W. “Screening designs for model discrimination”. Journal of Statistical Planning and Inference 140:3.
Atkinson, A. C. & Donev, A. N. (1992). “Optimal Experimental Designs”. Oxford Statistical Sciences Series: 8.
Chaloner, K. (1984). “Bayesian experimental design: A review”. Statistical Science 10.
Cook, R. D. & Nachtsheim, C. J. (1982). “A comparison of algorithms for constructing exact D-optimal designs”. Technometrics 22.
Li, W. & Wu, C. F. J. (1997). “Columnwise-pairwise algorithms with applications to the construction of supersaturated designs”. Technometrics 39.