Combined Analysis of Experiments Basic Research –Researcher makes hypothesis and conducts a single experiment to test it –The hypothesis is modified and.

Slides:



Advertisements
Similar presentations
Randomized Complete Block and Repeated Measures (Each Subject Receives Each Treatment) Designs KNNL – Chapters 21,
Advertisements

Combined Analysis of Experiments Basic Research –Researcher makes hypothesis and conducts a single experiment to test it –The hypothesis is modified and.
Types of Checks in Variety Trials One could be a long term check that is unchanged from year to year –serves to monitor experimental conditions from year.
Multiple Comparisons in Factorial Experiments
Latin Square Designs (§15.4)
STT 511-STT411: DESIGN OF EXPERIMENTS AND ANALYSIS OF VARIANCE Dr. Cuixian Chen Chapter 14: Nested and Split-Plot Designs Design & Analysis of Experiments.
Design of Experiments and Analysis of Variance
Sub - Sampling It may be necessary or convenient to measure a treatment response on subsamples of a plot –several soil cores within a plot –duplicate laboratory.
Replications needed to estimate a mean For a single population, you may want to determine the sample size needed to obtain a given level of precision Recall.
Design of Engineering Experiments - Experiments with Random Factors
Chapter 11 Analysis of Variance
The Statistical Analysis Partitions the total variation in the data into components associated with sources of variation –For a Completely Randomized Design.
Statistics for Managers Using Microsoft® Excel 5th Edition
Chapter 14 Conducting & Reading Research Baumgartner et al Chapter 14 Inferential Data Analysis.
Experimental Design Terminology  An Experimental Unit is the entity on which measurement or an observation is made. For example, subjects are experimental.
Power Analysis How confident are we that we can detect an important difference, if it exists? Power = 1 - , where ES = Effect Size Used to design an experiment.
Assumptions of the ANOVA The error terms are randomly, independently, and normally distributed, with a mean of zero and a common variance. –There should.
Assumptions of the ANOVA
Biostatistics-Lecture 9 Experimental designs Ruibin Xi Peking University School of Mathematical Sciences.
PBG 650 Advanced Plant Breeding
Experimental Design in Agriculture CROP 590, Winter, 2015
Module 7: Estimating Genetic Variances – Why estimate genetic variances? – Single factor mating designs PBG 650 Advanced Plant Breeding.
1 Experimental Statistics - week 7 Chapter 15: Factorial Models (15.5) Chapter 17: Random Effects Models.
Fixed vs. Random Effects
Experimental Design An Experimental Design is a plan for the assignment of the treatments to the plots in the experiment Designs differ primarily in the.
1 Design of Engineering Experiments Part 10 – Nested and Split-Plot Designs Text reference, Chapter 14, Pg. 525 These are multifactor experiments that.
23-1 Analysis of Covariance (Chapter 16) A procedure for comparing treatment means that incorporates information on a quantitative explanatory variable,
The Scientific Method Formulation of an H ypothesis P lanning an experiment to objectively test the hypothesis Careful observation and collection of D.
Module 8: Estimating Genetic Variances Nested design GCA, SCA Diallel
So far... We have been estimating differences caused by application of various treatments, and determining the probability that an observed difference.
Chapter 10 Analysis of Variance.
Psychology 301 Chapters & Differences Between Two Means Introduction to Analysis of Variance Multiple Comparisons.
Repeated Measurements Analysis. Repeated Measures Analysis of Variance Situations in which biologists would make repeated measurements on same individual.
Inferential Statistics
Genotype x Environment Interactions Analyses of Multiple Location Trials.
Chapter coverage Part A Part A –1: Practical tools –2: Consulting –3: Design Principles Part B (4-6) One-way ANOVA Part B (4-6) One-way ANOVA Part C (7-9)
The Completely Randomized Design (§8.3)
DOX 6E Montgomery1 Design of Engineering Experiments Part 9 – Experiments with Random Factors Text reference, Chapter 13, Pg. 484 Previous chapters have.
Planning rice breeding programs for impact Multi-environment trials: design and analysis.
Planning rice breeding programs for impact Models, means, variances, LSD’s and Heritability.
1 Analysis of Variance & One Factor Designs Y= DEPENDENT VARIABLE (“yield”) (“response variable”) (“quality indicator”) X = INDEPENDENT VARIABLE (A possibly.
Control of Experimental Error Blocking - –A block is a group of homogeneous experimental units –Maximize the variation among blocks in order to minimize.
Chapter 10: Analysis of Variance: Comparing More Than Two Means.
Strip-Plot Designs Sometimes called split-block design For experiments involving factors that are difficult to apply to small plots Three sizes of plots.
1 The Two-Factor Mixed Model Two factors, factorial experiment, factor A fixed, factor B random (Section 13-3, pg. 495) The model parameters are NID random.
Randomized block designs  Environmental sampling and analysis (Quinn & Keough, 2002)
1 Experimental Statistics - week 9 Chapter 17: Models with Random Effects Chapter 18: Repeated Measures.
PBG 650 Advanced Plant Breeding
Chapter 13 Design of Experiments. Introduction “Listening” or passive statistical tools: control charts. “Conversational” or active tools: Experimental.
1 Experimental Statistics Spring week 6 Chapter 15: Factorial Models (15.5)
Chapter 4 Analysis of Variance
Genotype x Environment Interactions Analyses of Multiple Location Trials.
The Mixed Effects Model - Introduction In many situations, one of the factors of interest will have its levels chosen because they are of specific interest.
Experimental Statistics - week 9
Genotype x Environment Interactions Analyses of Multiple Location Trials.
1 Experimental Statistics - week 8 Chapter 17: Mixed Models Chapter 18: Repeated Measures.
ENGR 610 Applied Statistics Fall Week 8 Marshall University CITE Jack Smith.
Chapter 11 Analysis of Variance
Factorial Experiments
ANOVA Econ201 HSTS212.
Two way ANOVA with replication
i) Two way ANOVA without replication
Comparing Three or More Means
Two way ANOVA with replication
Chapter 10: Analysis of Variance: Comparing More Than Two Means
Chapter 11 Analysis of Variance
Relationship between mean yield, coefficient of variation, mean square error and plot size in wheat field experiments Coefficient of variation: Relative.
Understanding Multi-Environment Trials
Randomized Complete Block and Repeated Measures (Each Subject Receives Each Treatment) Designs KNNL – Chapters 21,
STATISTICS INFORMED DECISIONS USING DATA
Presentation transcript:

Combined Analysis of Experiments Basic Research –Researcher makes hypothesis and conducts a single experiment to test it –The hypothesis is modified and another experiment is conducted –Combined analysis of experiments is seldom required –Experiments may be repeated to Provide greater precision (increased replication) Validate results from initial experiment Applied Research –Recommendations to producers must be based on multiple locations and seasons that represent target environments (soil types, weather patterns)

Multilocational trials Often called MET = multi-environment trials How do treatment effects change in response to differences in soil and weather throughout a region? –What is the range of responses that can be expected? Detect and quantify interactions of treatments and locations and interactions of treatments and seasons in the recommendation domain Combined estimates are valid only if locations are randomly chosen within target area –Experiments often carried out on experiment stations –Generally use sites that are most accessible or convenient –Can still analyze the data, but consider possible bias due to restricted site selection when making interpretations

Preliminary Analysis Complete ANOVA for each experiment –Do we have good data from each site? –Examine residual plots for validity of ANOVA assumptions, outliers Examine experimental errors from different locations for heterogeneity –Perform F Max test or Levene’s test for homogeneity of variance –If homogeneous, perform a combined analysis across sites –If heterogeneous, may need to use a transformation or break sites into homogeneous groups and analyze separately –Differences in means across sites are often greater than treatment effects –Does not prevent a combined analysis, but may contribute to error heterogeneity if there are associations between means and variances

MET Linear Model (for an RBD) Y ijk =  +  i +  j(i) +  k + (  ) ik +  ijk  = mean effect  i = i th location effect  j(i) = j th block effect within the i th location  k = k th treatment effect  ik = interaction of the k th treatment in the i th location  ijk = pooled error Environments = Locations = Sites Blocks are nested in locations –SS for blocks is pooled across locations

Treatment x Environment Interaction Obtain a preliminary estimate of interaction of treatment with environment or season Will we be able to make general recommendations about the treatments or should they be specific for each site? –Error degrees of freedom are pooled across sites, so it is relatively easy to detect interactions –Consider the relative magnitude of variation due to the treatments compared to the interaction MS –Are there rank changes in treatments across environments (crossover interactions)?

Treatments and locations are random SourcedfSSMSExpected MS Locationl-1SSLM1 Blocks in Loc.l(r-1)SSB(L)M2 Treatmentt-1SSTM3 Loc. X Treatment(l-1)(t-1)SSLTM4 Pooled Error l(r-1)(t-1)SSEM5 F for Locations = (M1+M5)/(M2+M4) Satterthwaite’s approximate df N1’ = (M1+M5) 2 /[(M1 2 /(l-1))+M5 2 /(l)(r-1)(t-1)] N2’ = (M2+M4) 2 /[(M2 2 /(l-1))+M4 2 /(l)(r-1)(t-1)] F for Treatments = M3/M4 F for Loc. x Treatments = M4/M5

Treatments and locations are fixed SourcedfSSMSExpected MS Locationl-1SSLM1 Blocks in Loc.l(r-1)SSB(L)M2 Treatmentt-1SSTM3 Loc. X Treatment(l-1)(t-1)SSLTM4 Pooled Error l(r-1)(t-1)SSEM5 F for Locations = M1/M2 F for Treatments = M3/M5 F for Loc. x Treatments = M4/M5 Fixed Locations constitute the entire population of environments OR represent specific environmental conditions (rainfall, elevation, etc.)

Treatments are fixed, Locations are random SourcedfSSMSExpected MS Locationl-1SSLM1 Blocks in Loc.l(r-1)SSB(L)M2 Treatmentt-1SSTM3 Loc. X Treatment(l-1)(t-1)SSLTM4 Pooled Error l(r-1)(t-1)SSEM5 F for Locations = M1/M2 F for Treatments = M3/M4 F for Loc. x Treatments = M4/M5 SAS uses slightly different rules for determining Expected MS  No direct test for Locations for this model

SAS Expected Mean Squares PROC GLM; Class Location Rep Variety; Model Yield = Location Rep(Location) Variety Location*Variety; Random Location Rep(Location) Location*Variety/Test; Source Type III Expected Mean Square Location Var(Error) + 3 Var(Location*Variety) + 7 Var(Rep(Location)) + 21 Var(Location) Dependent Variable: Yield Source DF Type III SS Mean Square F Value Pr > F Location Error Error: MS(Rep(Location)) + MS(Location*Variety) - MS(Error) Varieties fixed, Locations random

Treatments are fixed, Years are random SourcedfSSMSExpected MS Yearsl-1SSYM1 Blocks in Yearsl(r-1)SSB(Y)M2 Treatmentt-1SSTM3 Years X Treatment(l-1)(t-1)SSYTM4 Pooled Error l(r-1)(t-1)SSEM5 F for Years = M1/M2 F for Treatments = M3/M4 F for Years x Treatments = M4/M5

Locations and Years in the same trial Can analyze as a factorial Can determine the magnitude of the interactions between treatments and environments –TxY, TxL, TxYxL For a simpler interpretation, consider all year and location combinations as “sites” and use one of the models presented for multilocational trials Sourcedf Yearsy-1 Locationsl-1 Years x Locations(y-1)(l-1) Block(Years x Locations)yl(r-1)

Combined Lab or Greenhouse Study (CRD) SourcedfSSMSExpected MS Triall-1SSLM1 Treatmentt-1SSTM2 Trial x Treatment(l-1)(t-1)SSLTM3 Pooled Error lt(r-1)SSEM4 Assume Treatments are fixed, Trials are random A “trial” is a repetition of a replicated experiment If there are no interactions, consider pooling SSLT and SSE –Use a conservative P value to pool (e.g. >0.25 or >0.5) F for Trials = M1/M4 (SAS would say M1/M3) F for Treatments = M2/M3 F for Trials x Treatments = M3/M4

Preliminary ANOVA Assumptions for this example: –locations and blocks are random –Treatments are fixed If Loc. x Treatment interactions are significant, must be cautious in interpreting main effects combined across all locations SourcedfSSMSF Totallrt-1SSTot Locationl-1SSLM1M1/M2 Blocks in Loc.l(r-1)SSB(L)M2 Treatmentt-1SSTM3M3/M4 Loc. X Treatment(l-1)(t-1)SSLTM4M4/M5 Pooled Error l(r-1)(t-1)SSEM5

Genotype by Environment Interactions (GEI) When the relative performance of varieties differs from one location or year to another… –how do you make selections? –how do you make recommendations to farmers?

Genotype x Environment Interactions (GEI) How much does GEI contribute to variation among varieties or breeding lines? P = G + E + GE P is phenotype of an individual G is genotype E is environment GE is the interaction DeLacey et al., 1990 – summary of results from many crops and locations rule E: GE: G 20% of the observed variation among genotypes is due to interaction of genotype and environment

Stability Many approaches for examining GEI have been suggested since the 1960’s Characterization of GEI is closely related to the concept of stability. “Stability” has been interpreted in different ways. –Static – performance of a genotype does not change under different environmental conditions (relevant for disease resistance, quality factors) –Dynamic – genotype performance is affected by the environment, but its relative performance is consistent across environments. It responds to environmental factors in a predictable way.

Measures of stability CV of individual genotypes across locations Regression of genotypes on an environmental index –Eberhart and Russell, 1966 Ecovalence –Wricke, 1962 Superiority measure of cultivars –Lin and Binns, 1988 Many others…

Analysis of GEI – other approaches Rank sum index (nonparametric approach) Cluster analysis Factor analysis Principal component analysis AMMI Pattern analysis Analysis of crossovers Partial Least Squares Regression Factorial Regression