Introduction to Augmented Designs

Slides:



Advertisements
Similar presentations
Randomized Complete Block and Repeated Measures (Each Subject Receives Each Treatment) Designs KNNL – Chapters 21,
Advertisements

Split-Plot Designs Usually used with factorial sets when the assignment of treatments at random can cause difficulties –large scale machinery required.
Split-Plot Designs Usually used with factorial sets when the assignment of treatments at random can cause difficulties large scale machinery required for.
A. The Basic Principle We consider the multivariate extension of multiple linear regression – modeling the relationship between m responses Y 1,…,Y m and.
Strip-Plot Designs Sometimes called split-block design
Combined Analysis of Experiments Basic Research –Researcher makes hypothesis and conducts a single experiment to test it –The hypothesis is modified and.
Combined Analysis of Experiments Basic Research –Researcher makes hypothesis and conducts a single experiment to test it –The hypothesis is modified and.
Types of Checks in Variety Trials One could be a long term check that is unchanged from year to year –serves to monitor experimental conditions from year.
Augmented Designs Mike Popelka & Jason Morales. What is an augmented design? A replicated check experiment augmented by unreplicated entries. Step 1:
Statistics in Science  Statistical Analysis & Design in Research Structure in the Experimental Material PGRM 10.
Multiple Comparisons in Factorial Experiments
Design and Analysis of Augmented Designs in Screening Trials
INDIAN AGRICULTURAL STATISTICS RESEARCH INSTITUTE, NEW DELHI SPBD RELEASE 1.0: SPBD RELEASE 1.0: A STATISTICAL PACKAGE FOR BLOCK DESIGNS.
Statistical Analysis Professor Lynne Stokes Department of Statistical Science Lecture #19 Analysis of Designs with Random Factor Levels.
1 Chapter 4 Experiments with Blocking Factors The Randomized Complete Block Design Nuisance factor: a design factor that probably has an effect.
STATISTICAL PACKAGE FOR AUGMENTED DESIGNS
1 Multifactor ANOVA. 2 What We Will Learn Two-factor ANOVA K ij =1 Two-factor ANOVA K ij =1 –Interaction –Tukey’s with multiple comparisons –Concept of.
Chapter 11 Analysis of Variance
Analysis of Variance. Experimental Design u Investigator controls one or more independent variables –Called treatment variables or factors –Contain two.
Chapter 28 Design of Experiments (DOE). Objectives Define basic design of experiments (DOE) terminology. Apply DOE principles. Plan, organize, and evaluate.
Handling Large Numbers of Entries Stratification of materials –group entries on the basis of certain traits - maturity, market class, color, etc. –then.
Incomplete Block Designs
Outline Single-factor ANOVA Two-factor ANOVA Three-factor ANOVA
TOPIC FOUR: INHERITANCE OF A SINGLE GENE Why can’t we all just get along and, say, call an inbred line in the F 6­ generation simply ‘an F 6 line’? Well.
Why conduct experiments?... To explore new technologies, new crops, and new areas of production To develop a basic understanding of the factors that control.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Analysis of Variance Statistics for Managers Using Microsoft.
Applications in Plant Breeding
Biostatistics-Lecture 9 Experimental designs Ruibin Xi Peking University School of Mathematical Sciences.
Experimental Design in Agriculture CROP 590, Winter, 2015
The Great Plains Canola Germplasm Evaluation System Michael J. Stamm, Kansas State University and Oklahoma State University Table NWCVT Great.
Control of Experimental Error Accuracy = without bias  average is on the bull’s-eye  achieved through randomization Precision = repeatability  measurements.
Module 7: Estimating Genetic Variances – Why estimate genetic variances? – Single factor mating designs PBG 650 Advanced Plant Breeding.
Topic 28: Unequal Replication in Two-Way ANOVA. Outline Two-way ANOVA with unequal numbers of observations in the cells –Data and model –Regression approach.
Fixed vs. Random Effects
 Combines linear regression and ANOVA  Can be used to compare g treatments, after controlling for quantitative factor believed to be related to response.
Fixed vs. Random Effects Fixed effect –we are interested in the effects of the treatments (or blocks) per se –if the experiment were repeated, the levels.
Experimental Design An Experimental Design is a plan for the assignment of the treatments to the plots in the experiment Designs differ primarily in the.
Breeding for Yield PLS 664 Spring Issues How do we build yield potential into a cross? How do we select for yield in the generations prior to yield.
More complicated ANOVA models: two-way and repeated measures Chapter 12 Zar Chapter 11 Sokal & Rohlf First, remember your ANOVA basics……….
23-1 Analysis of Covariance (Chapter 16) A procedure for comparing treatment means that incorporates information on a quantitative explanatory variable,
The Scientific Method Formulation of an H ypothesis P lanning an experiment to objectively test the hypothesis Careful observation and collection of D.
1 Experimental Statistics - week 2 Review: 2-sample t-tests paired t-tests Thursday: Meet in 15 Clements!! Bring Cody and Smith book.
Module 8: Estimating Genetic Variances Nested design GCA, SCA Diallel
Statistical Aspects of a Research Project Mohd Ridzwan Abd Halim Jabatan Sains Tanaman Universiti Putra Malaysia.
Randomized Block Design (Kirk, chapter 7) BUSI 6480 Lecture 6.
METs for evaluating experimental varieties. Response variable: Grain yield lowmoderate A B extreme Basics of Genotype x Environment interaction Context:
Repeated Measurements Analysis. Repeated Measures Analysis of Variance Situations in which biologists would make repeated measurements on same individual.
Chapter coverage Part A Part A –1: Practical tools –2: Consulting –3: Design Principles Part B (4-6) One-way ANOVA Part B (4-6) One-way ANOVA Part C (7-9)
The Completely Randomized Design (§8.3)
BUSI 6480 Lecture 8 Repeated Measures.
Control of Experimental Error Blocking - –A block is a group of homogeneous experimental units –Maximize the variation among blocks in order to minimize.
Topic 30: Random Effects. Outline One-way random effects model –Data –Model –Inference.
1 Always be contented, be grateful, be understanding and be compassionate.
Topic 26: Analysis of Covariance. Outline One-way analysis of covariance –Data –Model –Inference –Diagnostics and rememdies Multifactor analysis of covariance.
Strip-Plot Designs Sometimes called split-block design For experiments involving factors that are difficult to apply to small plots Three sizes of plots.
Topic 20: Single Factor Analysis of Variance. Outline Analysis of Variance –One set of treatments (i.e., single factor) Cell means model Factor effects.
Chapter 4 Analysis of Variance
1 Statistics 262: Intermediate Biostatistics Regression Models for longitudinal data: Mixed Models.
The Mixed Effects Model - Introduction In many situations, one of the factors of interest will have its levels chosen because they are of specific interest.
ANOVA Overview of Major Designs. Between or Within Subjects Between-subjects (completely randomized) designs –Subjects are nested within treatment conditions.
Topic 27: Strategies of Analysis. Outline Strategy for analysis of two-way studies –Interaction is not significant –Interaction is significant What if.
Biostatistics Case Studies Peter D. Christenson Biostatistician Session 3: Missing Data in Longitudinal Studies.
Engineering Statistics Design of Engineering Experiments.
Comparing Three or More Means
PRINCIPLES OF CROP PRODUCTION ABT-320 (3 CREDIT HOURS)
PRINCIPLES OF CROP PRODUCTION ABT-320 (3 CREDIT HOURS)
Joanna Romaniuk Quanticate, Warsaw, Poland
ANNUAL REPORT Richard Pratt Dept. of Horticulture and Crop Science
Randomized Complete Block and Repeated Measures (Each Subject Receives Each Treatment) Designs KNNL – Chapters 21,
Experimental Design All experiments consist of two basic structures:
Presentation transcript:

Introduction to Augmented Designs Applications in Plant Breeding Jennifer Kling Oregon State University

Outline – Augmented Designs Essential features When are they used in plant breeding? Design options Today - one-way control of heterogeneity Augmented Block Design - Example Randomization and Field Plan Analysis with SAS Interpretation of Results Overview of variations on the basic design Software and Further References

Augmented Designs - Essential Features Introduced by Federer (1956) Controls (check varieties) are replicated in a standard experimental design New treatments (genotypes) are not replicated, or have fewer replicates than the checks – they augment the standard design

When are they used in plant breeding? Early generations Seed is limited Land and other resources are limited Want to evaluate as many genotypes as possible Can re-evaluate selections in subsequent seasons Difficult to maintain homogeneous blocks when comparing so many genotypes Need a mechanism to adjust for field variation

When are they used in plant breeding? Adaptation to diverse environments is a primary goal Participatory Plant Breeding Farmers may prefer to grow a single replication when there are many genotypes to evaluate May not be able to accommodate all entries Farming Systems Research Want to evaluate promising genotypes in as many environments as possible

Before Augmented Designs Systematic use of checks Breeding sorghum for resistance to Striga at ICRISAT, 1983 Stage I Observation Nursery (unreplicated) Stage II Preliminary Screening (replicated) Stage III Advanced Screening susceptible check test plot Checkerboard

Augmented Designs - Advantages Unreplicated designs can make good use of scarce resources Evaluate more genotypes Test in more environments Fewer check plots are required than for designs with systematic repetition of a single check

Augmented Designs - Advantages Provide an estimate of standard error that can be used for comparisons Among the new genotypes Between new genotypes and check varieties Observations on new genotypes can be adjusted for field heterogeneity (blocking) Flexible – blocks can be of unequal size

Some Disadvantages Considerable resources are spent on production and processing of control plots Relatively few degrees of freedom for experimental error, which reduces the power to detect differences among treatments Unreplicated experiments are inherently imprecise, no matter how sophisticated the design

Design Options Choose a design that is appropriate for controlling the heterogeneity in the experimental area One-way blocking Randomized Complete Block Design Incomplete Block Designs (e.g. Lattice Design) Two-way blocking Latin Square (Complete Blocks) Youden Square (Incomplete Blocks) Row-Column Designs

Design Options Underlying design refers to assignment of checks to the experimental units All augmented designs are incomplete with respect to the new entries Can be replicated in different environments Need to consider relative efficiency compared to other possible designs

Augmented Block Design Example Control heterogeneity in one direction Simplest case: Checks occur once in every block New entries occur once in the experiment This is an ARCBD

Meadowfoam (Limnanthes alba) Native plant first produced as a crop in 1980 Seed oil with novel long-chain fatty acids light-colored and odor free exceptional oxidative stability Used in personal care products Potential uses fuel additive vehicle lubricants pharmaceutical products

Meadowfoam in Oregon Good rotation crop in the Willamette Valley Winter annual Plant and seed meal are high in glucosinolates Same equipment as grass seed

Pollinators Honeybees Blue Orchard Bees Blue Bottle Flies

The Germplasm Diverse breeding populations were inherited from a retired breeder (Gary Jolliff) Populations were regenerated in the greenhouse and selfed S2 lines were transplanted to the field and allowed to outcross Insufficient seed for replicated progeny trials Goal - form a broadbased pool for recurrent selection Screen S2-testcrosses Recombine selected S2 parents

The Experiment (2007-2008) New treatments: Check varieties: 50 S2-testcross families Check varieties: Ross (C1) Cycle 4 of an elite population (OMF58) Widely grown commercial variety OMF183 (C2) Cycle 5 from OMF58 Starlight (C3) released variety derived from this germplasm collection

The Design Block in one direction 3 checks (c=3) 6 blocks (r=6) Error degrees of freedom = (r-1)(c-1) = 10 50 new entries (n=50) Number of plots = n + r*c = 50 + 6*3 = 68 Plots per block = 68/6 = 11.3  12 (c=3, n=9) Last block has only 8 plots (c=3, n=5) Unequal numbers of new entries per block is OK

Statistical Model mean + blocks + checks + new entries + error New entries do not contribute to error. New entries are adjusted for block effects. Analysis is equivalent to a completely balanced RBD for the checks without the new entries

Field Plan Include a sufficient number of checks and replicates to provide a good estimate of experimental error and adequate power for detecting differences among varieties Arrange blocks along field gradient to maximize variation among blocks and minimize variation within blocks Assign each of the checks at random to each block Assign new entries at random to remaining plots On-line tool for randomization: http://www.iasri.res.in/design/Augmented%20Designs/home.htm

Designation of plots in blocks

Checks assigned to plots in each block

New entries included in final plan

Meadowfoam progeny trials

Data Collection Flowering dates, plant height, disease resistance, lodging, seed yield, 1000-seed weight, (oil content) For this example: 1000-seed weight (TSW) Relatively high heritability Positively correlated with oil content and oil yield

Fixed or Random? Generally assume that blocks are random Genotypes They represent a larger group of potential blocks We want to make inferences beyond the particular sample of blocks in the experiment Genotypes Checks are fixed effects – we are interested in making comparisons with these specific varieties New entries could be fixed or random May be fixed in this case – want to select the winners Most commonly random – particularly in genetic studies

SAS data input – genotypes fixed DATA anyname; INPUT Plot Entry Name$ Block TSW; datalines; Other options for data input Import Wizard Infile statements SAS libraries

Analysis #1 – new entries fixed PROC MIXED; TITLE 'Augmented Design using PROC MIXED – entries fixed'; CLASS block entry; MODEL TSW=entry; RANDOM block; /*compare all means using Tukey's test*/ LSMEANS entry/pdiff adjust=tukey; /*test for entries that exceed Starlight*/ LSMEANS entry/pdiff=CONTROLU(‘91') adjust=dunnett; ods output lsmeans=TSWadj diffs=TSWpdiff; RUN; Use export wizard to export TSWadj TSWpdiff

Results for Analysis #1 (fixed entries) The Mixed Procedure Covariance Parameter Estimates Cov Parm Estimate Block 0.1381 Residual 0.06981 Fit Statistics -2 Res Log Likelihood 17.7 AIC (smaller is better) 21.7 AICC (smaller is better) 22.7 BIC (smaller is better) 21.3 Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F Entry 52 10 7.51 0.0008

Output from Dunnett Test TSWpdiff.xls Standard error of a difference depends on the comparison check vs check new entry vs check new entries in the same block new entries in different blocks

Analysis #2 – new entries random DATA arcbd; set anyname; if(entry>50) then new=0; else new=1; if(new) then entryc=999; else entryc=entry; RUN; Wolfinger, R.D., W.T. Federer, and O. Cordero-Brana, 1997

Analysis #2 – ANOVA PROC GLM; TITLE 'Augmented Design using GLM – entries random'; CLASS block entry entryc; MODEL TSW = block entryc entry*new/solution; RANDOM block/test; RUN; The GLM Procedure Tests of Hypotheses for Mixed Model Analysis of Variance Dependent Variable: TSW Source DF Type III SS Mean Square F Value Pr > F Block 5 2.420228 0.484046 6.93 0.0048 entryc 2 0.239211 0.119606 1.71 0.2292 new*Entry 49 26.976052 0.550532 7.89 0.0007 Error: MS(Error) 10 0.698056 0.069806

Analysis #2 – new entries random PROC MIXED; TITLE 'Augmented Design using PROC MIXED – entries random'; CLASS block entry entryc; MODEL TSW=entryc/solution; RANDOM block entry*new/solution; LSMEANS entryc; ods output solutionr=eblups; RUN; Use export wizard to export eblups

Results for Analysis #2 (random entries) The Mixed Procedure Covariance Parameter Estimates Cov Parm Estimate Block 0.01418 new*Entry 0.3659 Residual 0.1748 Fit Statistics -2 Res Log Likelihood 136.5 AIC (smaller is better) 142.5 AICC (smaller is better) 142.9 BIC (smaller is better) 141.9 Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F entryc 3 10 0.89 0.4779

Estimated Best Linear Unbiased Predictors Solution for Fixed Effects Standard Effect entryc Estimate Error DF t Value Pr > |t| Intercept 10.2048 0.1150 5 88.72 <.0001 entryc 89 -0.3148 0.2000 10 -1.57 0.1466 entryc 90 -0.1431 0.2000 10 -0.72 0.4906 entryc 91 -0.03478 0.2000 10 -0.17 0.8654 entryc 999 0 . . . . eblups.xls

Variations – two-way control of heterogeneity Design Features/Applications Reference Modified Youden Square Two-way control of heterogeneity Federer and Raghavarao, 1975 Augmented Row-column Entries adjusted for adjacent checks; requires many checks Federer, Nair and Raghavarao, 1975 Modified Augmented (MAD Type 1) Systematic placement of controls; Requires square plots Lin and Poushinsky, 1983 Modified Augmented (MAD Type 2) Systematic placement of controls; Long, rectangular plots Lin and Poushinsky, 1985

More Variations Design Features/Applications Reference Factorial treatments Augmented Split Block Intercropping, stress adaptation Federer, 2005 Incomplete Blocks Augmented Lattice Square Accommodates larger number of checks Federer, 2002 - Lattice Williams and John, 2003 Augmented p-rep (partially replicated) Replicates of entries used to make block adjustments and estimate error Williams, Piepho, and Whitaker, 2011 Multiple locations RCBD, ICBD, row-column, etc. SAS code available Federer, Reynolds, and Crossa, 2001

Multiple Locations – Augmented or Lattice Design? Lattice Designs Sites can be complete replications Information from all plots are used to adjust means for field effects May have greater precision and power to detect differences among entries Augmented Designs Flexible arrangement of new entries in field plans Estimate of experimental error obtained from each location Assess site quality Evaluate GXE interactions More flexibility in combining information across sites

Software for Augmented Designs AGROBASE GenII by Agronomix, Inc. Randomize and analyze modified augmented type-2 designs Indian Agricultural Research Institute, New Delhi Statistical Package for Augmented Designs (SPAD) www.iasri.res.in/iasrisoftware/SPAD.ppt SAS macro called augment.sas web.iasri.res.in/sscnars/Macros/ABD/use_augment.sas.doc CIMMYT – SAS macro called UNREPLICATE Developed in 2000 – uses some older SAS syntax http://apps.cimmyt.org/english/wps/biometrics/

References Full list of references provided as supplement Federer, W.T. 1961. Augmented designs with one-way elimination of heterogeneity. Biometrics 17(3): 447-473 Wolfinger, R.D., W.T. Federer, and O. Cordero-Brana. 1997. SAS PROC GLM and PROC MIXED for recovering blocking and variety information in augmented designs. Agronomy Journal 89: 856-859

Acknowledgements USDA-CSREES special grant for meadowfoam research OMG Meadowfoam Oil Seed Growers Cooperative Paul C. Berger Professorship Endowment Crop and Soil Science Department, OSU Meadowfoam staff Mary Slabaugh Joanna Rosinska Trisha King Alicia Wilson

Questions?