multivariate genotype - environment data

Presentation transcript:

Analysis of multivariate genotype-environment data using Non-linear Canonical Correlation Analysis. Hans Pinnschmidt, Danish Institute for Agricultural Sciences, Division of Crop Protection, Cereal Plant Pathology Group, Denmark. Archived at http://orgprints.org/8021

Background: BAROF WP1 data: multivariate measurements on 86 spring barley genotypes in 10 environments (2 years: 2002 & 2003; 3 sites: Flakkebjerg, Foulum, Jyndevad; 2 production systems: ecological & conventional). Objectives: multivariate characterisation of genotypes with emphasis on yield-related properties.

Factors: genotype (G1 ... Gi) and environment (E1 ... Ej). Variables: X1(i,j) ... Xm(i,j). Parameters: Xm(i)1 ... Xm(i)p and Xm(j)1 ... Xm(j)p.
Variables: yield; 1000 grain weight; grain protein content; culm length; date of emergence; growth duration; mildew severity; rust severity; scald severity; net blotch severity; disease diversity; weed cover; broken panicles & culms; lodging.
Parameters: raw data; mean/median/max./min.; rank/relative values; main effects; interaction slopes; raw data adjusted for E/G main effects/slopes (residuals); IPCA scores; SD/variance.
From these, derive information on general properties, specificity and stability/variability.
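The "main effects" and "data adjusted for E/G main effects (residuals)" parameters can be illustrated with a small sketch. It assumes a balanced genotype x environment table and the usual additive decomposition (marginal mean minus grand mean); the slides do not state how the adjustment was actually computed, and all names and numbers below are hypothetical.

```python
from statistics import mean

# Hypothetical yields (illustrative numbers only), keyed by (genotype, environment).
yield_data = {
    ("G1", "E1"): 5.0, ("G1", "E2"): 6.0,
    ("G2", "E1"): 4.0, ("G2", "E2"): 7.0,
}

genotypes = sorted({g for g, _ in yield_data})
environments = sorted({e for _, e in yield_data})

grand_mean = mean(yield_data.values())

# Main effect = marginal mean minus grand mean (assumes balanced data).
g_effect = {g: mean(yield_data[g, e] for e in environments) - grand_mean
            for g in genotypes}
e_effect = {e: mean(yield_data[g, e] for g in genotypes) - grand_mean
            for e in environments}

# Data adjusted for E main effects: differences between environments removed.
adj_for_e = {(g, e): x - e_effect[e] for (g, e), x in yield_data.items()}

# Data adjusted for G and E main effects: what remains is G x E interaction.
gxe = {(g, e): x - grand_mean - g_effect[g] - e_effect[e]
       for (g, e), x in yield_data.items()}

for key in sorted(gxe):
    print(key, "E-adjusted:", adj_for_e[key], "GxE residual:", gxe[key])
```

With balanced data the G x E residuals sum to zero within every genotype and every environment, which is a quick sanity check on the adjustment.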

Non-linear Canonical Correlation Analysis (NCCA): an optimal scaling procedure suited for handling multivariate data of any kind of scaling (numerical/quantitative, ordinal, nominal).

Non-linear Canonical Correlation Analysis (NCCA) data treatment: quantitative variables (vm) were converted into ordinal variables with n categories (v11 ... v1n, ..., vm1 ... vmn).
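A minimal sketch of such a conversion is given here. The slides do not say how the category boundaries were chosen, so rank-based, equal-frequency bins are an assumption; `to_ordinal` and the sample values are hypothetical.

```python
def to_ordinal(values, n_categories):
    """Map each value to a category 1..n_categories using equal-frequency bins.

    Ties are split by their order in the input, which is an arbitrary choice.
    """
    order = sorted(range(len(values)), key=lambda i: values[i])
    categories = [0] * len(values)
    for rank, idx in enumerate(order):
        categories[idx] = rank * n_categories // len(values) + 1
    return categories


# Hypothetical mildew severity values (%), one per observation.
mildew_severity = [0.5, 12.0, 3.2, 7.8, 1.1, 25.0]
mildew_ordinal = to_ordinal(mildew_severity, 3)
print(mildew_ordinal)  # [1, 3, 2, 2, 1, 3]
```

Equal-frequency bins keep every category populated, which matters when the categories later become columns of a frequency table.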

Non-linear Canonical Correlation Analysis (NCCA) is based on multivariate contingency tables containing frequency counts. [Table schematic: genotypes G1 ... Gi and environments E1 ... Ej against variable categories Vm1 ... Vmn]
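As a rough illustration of such frequency counts, the sketch below tabulates how often each genotype and each environment falls into each ordinal category of one variable. The exact table layout used in the analysis is not recoverable from the slide, so this layout and the data are assumptions.

```python
from collections import Counter

# Hypothetical ordinal mildew categories (1 = low ... 3 = high),
# one record per (genotype, environment, category).
observations = [
    ("G1", "E1", 1), ("G1", "E2", 1), ("G1", "E3", 2),
    ("G2", "E1", 3), ("G2", "E2", 2), ("G2", "E3", 3),
]

# Frequency counts: genotype x category and environment x category.
g_counts = Counter((g, cat) for g, _, cat in observations)
e_counts = Counter((e, cat) for _, e, cat in observations)

for g in ("G1", "G2"):
    print(g, [g_counts[g, cat] for cat in (1, 2, 3)])
```

A `Counter` returns 0 for combinations that never occur, so empty cells need no special handling.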

Non-linear Canonical Correlation Analysis (NCCA): main “dimensions” (principal components) are determined; “loadings” of variables (overall correlation) are computed; “category centroids” are quantified; “object scores” (principal component scores) are computed.
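The relation between object scores and category centroids can be sketched as follows. This is not an NCCA implementation: it assumes object scores are already available from a fitted analysis and takes a category centroid to be the mean object score of the objects in that category, as is common in optimal scaling software. All values are hypothetical.

```python
from collections import defaultdict

# Hypothetical object scores on two dimensions, one pair per object.
object_scores = [(-1.0, 0.2), (-0.8, -0.1), (0.1, 0.5), (0.9, -0.3), (0.8, -0.3)]
# Hypothetical ordinal yield category (1 = low ... 3 = high) of each object.
yield_category = [1, 1, 2, 3, 3]

# Accumulate per-category sums of scores and object counts.
sums = defaultdict(lambda: [0.0, 0.0, 0])
for (d1, d2), cat in zip(object_scores, yield_category):
    sums[cat][0] += d1
    sums[cat][1] += d2
    sums[cat][2] += 1

# Centroid = mean object score of the objects in the category (assumed definition).
centroids = {cat: (s1 / n, s2 / n) for cat, (s1, s2, n) in sums.items()}

for cat in sorted(centroids):
    print(cat, centroids[cat])
```

Plotting these centroids per variable is what lets labels such as "high yield" or "low mildew" be attached to regions of the dimension plots.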

Characterisation of environments based on data adjusted for G main effects (= residuals)

Flakkebjerg 2002: high rust & 1000 grain weight, late sowing. Foulum 2002 conventional & Jyndevad 2003 ecological: high mildew & lodging. Flakkebjerg 2003: high yield, net blotch & panicle breakage; low mildew & lodging. Jyndevad 2002 ecological: low yield, 1000 grain weight, weed infestation, protein content.

Characterisation of genotypes based on data adjusted for E main effects (= residuals)

[Plot axes: dimension 1 (sq. root) and dimension 5 (sq. root). Plot labels: high yield & 1000 grain weight; low protein content & lodging; high mildew; low net blotch & disease diversity; low yield & 1000 grain weight; low mildew]

Characterisation of genotypes & environments based on: raw data; data adjusted for E main effects; data adjusted for G & E main effects (G x E interaction)

[Plot labels: low yield, 1000 grain weight, weed infestation & net blotch; high mildew; high rust; late emergence; high yield, 1000 grain weight & net blotch; low mildew; low rust; short culms; early emergence]

[Plot labels: high yield & 1000 grain weight; low protein content & lodging; low net blotch & disease diversity; high mildew; low yield & 1000 grain weight; high protein content]

[Plot labels: little lodging; high panicle breakage; high yield & 1000 grain weight; low protein content; low yield & 1000 grain weight; high protein content; much lodging]

Conclusions & outlook: NCCA is an “intuitive” method that is well suited to “visualising” the main features in multivariate data of various scales. NCCA is useful for obtaining an overall orientation of G properties and E characteristics. Future work: refinements to obtain a better synopsis of the E-specific performance of G’s as related to their property profiles; include AMMI and clustering (biclassification) results in NCCA; organise data as environment-specific sets of variables.

Characterisation of genotype performance in individual environments based on: raw yield and disease data; disease main effects of G’s; environmental disease variability of G’s (= standard deviation of E-adjusted data)
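The last two quantities can be sketched numerically. The example assumes a balanced table, additive E main effects, and the sample standard deviation (n - 1 denominator); the slides specify none of these, and the genotype names and severity values are hypothetical.

```python
from statistics import mean, stdev

# Hypothetical mildew severity (%) per genotype in environments E1, E2, E3.
severity = {
    "G1": [2.0, 4.0, 6.0],
    "G2": [1.0, 9.0, 5.0],
    "G3": [3.0, 5.0, 4.0],
}
n_env = 3

grand_mean = mean(x for row in severity.values() for x in row)

# E main effect = environment mean minus grand mean (assumes balanced data).
e_effect = [mean(row[j] for row in severity.values()) - grand_mean
            for j in range(n_env)]

# E-adjusted data: subtract the main effect of each environment.
adjusted = {g: [x - e_effect[j] for j, x in enumerate(row)]
            for g, row in severity.items()}

# Disease main effect of each genotype.
g_main_effect = {g: mean(row) - grand_mean for g, row in adjusted.items()}

# Environmental disease variability: SD of the E-adjusted data per genotype.
g_variability = {g: stdev(row) for g, row in adjusted.items()}

for g in severity:
    print(g, round(g_main_effect[g], 3), round(g_variability[g], 3))
```

In this toy data G2 has both the highest disease main effect and the highest variability across environments, while G3 is the most stable.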