Genetic Architecture of Kernel Composition in the Nested Association Mapping (NAM) Population Sherry Flint-Garcia USDA-ARS Columbia, MO
Outline Development of NAM Population Kernel Composition Joint Linkage Mapping Genome-Wide Association Mapping
Linkage-Based QTL Mapping “Genome Scan” Identify genomic regions that contribute to variation and estimate QTL effects Genotype Phenotype Composite Interval Mapping F 2 population Parent 1 F1F1 Parent 2
Linkage (QTL) Mapping Genome scan Structured population High power Low resolution Analysis of 2 alleles Association Mapping Candidate gene testing Unstructured population Low power High resolution Analysis of many alleles Nested Association Mapping Structured families nested within an unstructured population High Power High Resolution Analysis of many alleles
NAM Founders P39 M37W CML277 B97 CML103 CML69 CML52 CML228 CML247 CML332 IL14H Ky21 Ki11 Ki3 MS71 Mo18W Oh7B M162W Tx303 Tzi8 CML333 NC358 NC300 HP301 OH43
NAM Development Current genetic map consists of: 4699 RILs 1106 SNP loci Average marker density - one marker every 1.3 cM Yu, et al. (2008) Genetics; McMullen, et al. (2009) Science Association Linkage
Kernel Composition in NAM Starch Amylose Amylopectin Fiber Oil Fatty Acid Profiles Protein Zeins Amino Acid Profiles
The Phenotypic Data 7 locations of NAM – 2006: MO, NY, NC, PR, FL2007: MO, NY Self pollinated seed samples NIR analysis for starch, protein, and oil content (% kernel - dry matter basis) Two sweet corn families excluded >6000 rows per location
Phenotypic Data Statistics Heritability Trait Correlations (23 Families) rProteinOil Starch Protein 0.32 H 2 Starch0.85 Protein0.83 Oil0.86
NAM Analysis in SAS Permutations for selection thresholds ~10 -5 Joint stepwise regression; Proc GLMSelect Family main effect & markers within families Final model; Proc GLM Estimate effects (P = 0.05) Genome Scan; Proc Mixed Maximum likelihood with background cofactors Epistasis; all (611,065) pair-wise combinations
NAM Kernel Quality Architecture TraitNR 2 (family) R 2 (QTL) R 2 (QTL+family) Starch Protein Oil Starch Protein Oil No Epistasis Observed at the NAM Level
B73 Additive Allelic Effects Starch Oil B73 Protein Sig. Alleles NMinMax (P = 0.05)(%)(%) Starch Protein Oil B73 % % % ^^
Validation Efforts Near Isogenic Lines (NILs) Genome Scan Association Analysis Candidate Genes Association Analysis Fine Mapping CandidateMarkerChr.Dist.Trait Floury1m Oil Opaque2 Modifier/Mucronatem Protein Brittle Endosperm1m Starch DGAT1-2m Oil Waxy1m Starch Jason Cook Estimated TraitAlleleMarkerChr.Dist. (cM)Effect (%) OilTx303m OilCML322m OilCML228m OilTx303m ProteinCML103m StarchTzi8m
Genetic vs. Physical Distance Joint Linkage Mapping - Oil Physical Distance (bp) Genetic Distance (cM) Joint Linkage Mapping - Oil
Genome Wide Association (GWAS) 1.6 Million HapMap v1 SNPs projected onto NAM Bootstrap (80%) sampling to test robustness Physical Distance (bp) GWAS - Oil BPP Joint Linkage Mapping - Oil
Chr. 6 Oil Candidate: DGAT1-2 Encodes acyl-CoA:diacylglycerol acyltransferase Fine mapped by Pioneer-Dupont Zheng, et al. (2008) Nature Genetics High parent = 19% oil High allele = 0.29% additive effect DGAT is the largest effect kernel quality QTL in NAM 4.4% 5.3% 3.6% 3.9% Phenylalanine insertion in the C-terminus of the protein
DGAT 1-2 (Chr6: 105,013, ,020,258) MarkerTraitPopulationAnalysis MethodBPPP-ValueEffect M1Oil282 Assn.MLM (Q+K)-1.2E M2Oil282 Assn.MLM (Q+K)-9.9E M3OilNAMGWAS - Bootstrap M4Oil282 Assn.MLM (Q+K)-4.3E M4StarchNAMGWAS - Bootstrap M5OilNAMGWAS - Bootstrap M5StarchNAMGWAS - Bootstrap NAM Population: 24 Total HapMap.v1 SNPs in DGAT Association Panel: 2 Total 55K SNPs in DGAT M1 M3 M5M4 M2: Phe Insertion
DGAT 1-2 (Chr6: 105,013, ,020,258) M1 M3 M5M4 M2: Phe Insertion ? = B73 Allele = Non-B73 Allele
What’s Next for NAM? NextGen sequencing of the 5000 NAM RILs Potentially Million SNPs Identify very precisely where recombination events are in the mapping population. This will VASTLY improve the mapping resolution of NAM and GWAS.
Conclusions Genetic Architecture of Kernel Quality Traits Governed by many QTL (N = 21-26) Many QTL in common with prior studies Effect sizes are small to moderate Allele series are common Genome Wide Association Studies (GWAS) Results confirm many QTL and candidate genes Resolution will improve with more markers on NAM RILs (define recombination events)
What Does This Mean To You? Identifying Functional Markers for MAS (Distantly) Linked markers not accurate Parent Selection = Allele Mining Valuable alleles are often masked. Selection for specific alleles is more accurate than selecting based on parental phenotype.
Acknowledgements NSF Maize Diversity Project Syngenta Joe Byrum & Kirk Noel
250 Races B47 (SS) PHZ51 (NSS) Allele Library 2500 lines GEM Allelic Diversity Project Genome Wide Association Analysis “mini-NAM” Allele Mining