Oat Molecular Toolbox: Toward Better Oats Nick Tinker, 2014-March-5 Agriculture and Agri-Food Canada.

Slides:



Advertisements
Similar presentations
Lecture 2 Strachan and Read Chapter 13
Advertisements

Cameron Peace, Washington State University
Single Nucleotide Polymorphism And Association Studies Stat 115 Dec 12, 2006.
Association Mapping as a Breeding Strategy
SNP Applications statwww.epfl.ch/davison/teaching/Microarrays/snp.ppt.
ASSOCIATION MAPPING WITH TASSEL Presenter: VG SHOBHANA PhD Student CPMB.
Molecular Marker Development and Utilization at Ottawa - Within AAFC Integrated Oat Improvement Project Presented by Steve Molnar VIII International Oat.
Genetic Basis of Agronomic Traits Connecting Phenotype to Genotype Yu and Buckler (2006); Zhu et al. (2008) Traditional F2 QTL MappingAssociation Mapping.
Bob Weaber, Ph.D. Cow-Calf Extension Specialist Assistant Professor Dept. of Animal Sciences and Industry
Ingredients for a successful genome-wide association studies: A statistical view Scott Weiss and Christoph Lange Channing Laboratory Pulmonary and Critical.
Plant of the day! Pebble plants, Lithops, dwarf xerophytes Aizoaceae
The IWGSC: Building the sequence-based foundation for accelerated wheat breeding Kellye A. Eversole IWGSC Executive Director & The IWGSC Cereals for Food,
CS177 Lecture 9 SNPs and Human Genetic Variation Tom Madej
Predicting the Function of Single Nucleotide Polymorphisms Corey Harada Advisor: Eleazar Eskin.
Biology and Bioinformatics Gabor T. Marth Department of Biology, Boston College BI820 – Seminar in Quantitative and Computational Problems.
Mining SNPs from EST Databases Picoult-Newberg et al. (1999)
We are developing a web database for plant comparative genomics, named Phytome, that, when complete, will integrate organismal phylogenies, genetic maps.
Something related to genetics? Dr. Lars Eijssen. Bioinformatics to understand studies in genomics – São Paulo – June Image:
Sequence Analysis. Today How to retrieve a DNA sequence? How to search for other related DNA sequences? How to search for its protein sequence? How to.
Genome Variations & GWAS
Host Genomics in WIHS  The WIHS GWAS data set  Concept Sheet  Data use agreement  Data transfer  Analytic support.
Towards utilization of genome sequence information for pigeonpea improvement By ICAR institutes, SAUs and ICRISAT.
Dorrie Main, Jing Yu, Sook Jung, Chun-Huai Cheng, Stephen Ficklin, Ping Zheng, Taein Lee, Richard Percy and Don Jones.
What is comparative genomics? Analyzing & comparing genetic material from different species to study evolution, gene function, and inherited disease Understand.
Natural Variation in Arabidopsis ecotypes. Using natural variation to understand diversity Correlation of phenotype with environment (selective pressure?)
Single Nucleotide Polymorphism
Molecular marker data and their impact on gene bank management Chris Richards NCGRP, Fort Collins, CO Curator Workshop, Atlanta Georgia.
Outline of Barley CAP Approach Breeding lines from 10 breeding programs are phenotyped (over 40 traits) in collaborative trials and individual breeder.
Oat Molecular Toolbox: Toward Better Oats Nick Tinker, 2014-March-5 Agriculture and Agri-Food Canada.
SNPs Daniel Fernandez Alejandro Quiroz Zárate. A SNP is defined as a single base change in a DNA sequence that occurs in a significant proportion (more.
National Taiwan University Department of Computer Science and Information Engineering Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
APPLICATION OF MOLECULAR MARKERS FOR CHARACTERIZATION OF LATVIAN CROP PLANTS Nils Rostoks University of Latvia Vienošanās Nr. 2009/0218/1DP/ /09/APIA/VIAA/099.
SNP Haplotypes as Diagnostic Markers Shrish Tiwari CCMB, Hyderabad.
Allele Survey - North American Oats Nick Tinker et al. Agriculture and Agri-Food Canada.
Gene Hunting: Linkage and Association
Development and Application of SNP markers in Genome of shrimp (Fenneropenaeus chinensis) Jianyong Zhang Marine Biology.
Announcements: Proposal resubmission deadline 4/23 (Thursday).
Agriculture and Agri-Food Canada Agriculture et Agroalimentaire Canada 2006 Molecular Mapping of the Dw6 Dwarfing Locus in Oats Abstract In crosses involving.
© 2010 by The Samuel Roberts Noble Foundation, Inc. 1 The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK, 73401, USA 2 National Center.
National Taiwan University Department of Computer Science and Information Engineering Pattern Identification in a Haplotype Block * Kun-Mao Chao Department.
Dr. Scott Sebastian, Research Fellow, Pioneer Hi-Bred International Plant Breeding Seminar at University of California Davis Accelerated Yield.
Oat Molecular Markers: Status and Opportunities Howard W. Rines USDA-ARS and Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul,
Experimental Design and Data Structure Supplement to Lecture 8 Fall
Quantitative Genetics
North Carolina State University Oat Maker Development Peter Maloney Jeanette Lyerly Paul Murphy Gina Brown-Guedira USDA-ARS Crop Science Department 840.
Gramene: Interactions with NSF Project on Molecular and Functional Diversity in the Maize Genome Maize PIs (Doebley, Buckler, Fulton, Gaut, Goodman, Holland,
February 20, 2002 UD, Newark, DE SNPs, Haplotypes, Alleles.
2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland.
Use of breeding populations to detect and use QTL Jean-Luc Jannink Iowa State University 2006 American Oat Workers Conference Fargo, ND24 July 2006.
CASE7——RAD-seq for Grape genetic map construction
Linkage Disequilibrium and Recent Studies of Haplotypes and SNPs
Molecular revolution. The first molecular markers: allozymes Allozymes Enzymes that diifer in amino acid sequence yet catalyze the same reaction -visible.
Computational Biology and Genomics at Boston College Biology Gabor T. Marth Department of Biology, Boston College
Analyzing DNA using Microarray and Next Generation Sequencing (1) Background SNP Array Basic design Applications: CNV, LOH, GWAS Deep sequencing Alignment.
Bioinf.scri.ac.uk/flapjack. Introduction Barley genotyping Visualizing the data Flapjack overview Performance issues Other features Future work.
Oregon Wolfe Barley Map 187 classical loci Oregon Wolfe Barley Map 187 classical DArT = 909 loci.
Using public resources to understand associations Dr Luke Jostins Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015.
Institute of Crop Sciences, CAAS
The Haplotype Blocks Problems Wu Ling-Yun
Comparative mapping of Brassica oleracea using sequence-based markers derived from other Brassica relatives and transcriptome sequences generated from.
What gains can we expect from Genetics?
Single Nucleotide Polymorphisms (SNPs
MOLECULAR MARKERS.
upstream vs. ORF binding and gene expression?
Introduction to bioinformatics lecture 11 SNP by Ms.Shumaila Azam
Genome-wide Association Studies
Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
“TaqMan genotyping Assay’’
Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
Presentation transcript:

Oat Molecular Toolbox: Toward Better Oats Nick Tinker, 2014-March-5 Agriculture and Agri-Food Canada

Collaborative Oat Research Enterprise * Mexico: Julio Huerta Eduardo Villa senior Mir Eduardo Espitia

What makes oats different, which differences make a better oat ? 3

CORE Concept Summary 4 Representative DNA Sequence Discover genetic differences among oat varieties Diverse germplasm Marker assays + gene database Mapping germplasmBreeder Germplasm Evaluate field & seed TraitsGenotype / Trait databaseConsensus map Analyse population structure Associate markers with traits Breeding assays + genomic selection

Trust what you can see, beware of abstraction 5 A shadow is an abstraction of a 3-dimension object. Our data has thousands of dimensions. We need data abstractions to make decisions from “big-data”

CORE Concept Summary 7 Representative DNA Sequence Discover genetic differences among oat varieties Diverse germplasm Marker assays + gene database Mapping germplasmBreeder Germplasm Evaluate field & seed TraitsGenotype / Trait databaseConsensus map Analyse population structure Associate markers with traits Breeding assays + genomic selection

This is what we were looking for A T Functional difference Single Nucleotide Polymorphism = SNP SNP = Marker = Gene Locus Allele

SNP assays: 9 GTACCATGATCGCTAACTGGCATGGCTTACGGCTTGAC (A) G (B) G (C) A (D) A (E) G A SNP is a SNP …. no matter how you find it ! “Old” non-sequence-based methods (AFLP, DArT) Discover by sequence / assay by design (Illumina Array) – 6000 SNPs Discover and assay by sequencing (GBS) > 50,000 SNPs

6K SNP array annotations 10 Estimated chromosome (relative to Oliver et al. 2013) Estimated map position (cM relative to Oliver et al. 2013) Flag=1 for framework marker (relative to Oliver et al. 2013) Best match to Brachypodium distachyon genome Best match to Oryza sativa genome Best match to Hordeum vulgare genome Best BLAST description from BLAST2GO Minimum E value from BLAST2GO Gene Ontology terms from BLAST2GO Enzyme Code from BLAST2GO Protein Accession (NCBI) from SNPmeta Short protein name of best match from SNPmeta Predicted SNP position in coding sequence from SNPmeta Predicted SNP position in codon from SNPmeta Predicted codon for allele 1 from SNPmeta Predicted codon for allele 2 from SNPmeta Prediction if amino acid change is silent, from SNPmeta Predicted amino acid for SNP allele 1 from SNPmeta Predicted amino acid for SNP allele 2 from SNPmeta 38 important annotations (consolidated from many more) SNP Locus Name SNP Discovery method (from Table 1) Reason for inclusion / in-silico predicted performance a Bead Type (1 a transition SNP, 2=transvesion SNP) Illumina design score Successful conversion to 6K BEAD assay Successful assay based on MAF>0 and H<=10 in 595 progeny SNP bases (A,T,G,C in format [A/T] ) SNP design sequence (SNP in square brackets) Full contig sequence (for SNPs called from an assembly) Comments made in GenomeStudio Genotyping Module Comments made in GenomeStudio Clustering Module Number of clusters formed in Clustering Module Illumina Gentrain score Non-missing calls across 1055 progeny (%) Number of AA (alleles 1) calls in 595 breeding lines Number of BB (alleles 2) calls in 595 breeding lines Percentage of AB calls in 595 breeding lines Minor Allele Frequency in 595 breeding lines

Developing functional gene assays….. 11 Eric Jackson et al. (unpublished)

CORE Concept Summary 12 Representative DNA Sequence Discover genetic differences among oat varieties Diverse germplasm Marker assays + gene database Mapping germplasmBreeder Germplasm Evaluate field & seed TraitsGenotype / Trait databaseConsensus map Analyse population structure Associate markers with traits Breeding assays + genomic selection

13

CORE Diversity Phenotypes 14

CORE Concept Summary 15 Representative DNA Sequence Discover genetic differences among oat varieties Diverse germplasm Marker assays + gene database Mapping germplasmBreeder Germplasm Evaluate field & seed TraitsGenotype / Trait databaseConsensus map Analyse population structure Associate markers with traits Breeding assays + genomic selection

High Density Hexaploid Oat Map 16 1C 2C 3C 4C 5C 6C 7C8A 9D 10D 11A 12D 13A14D 15A 16A 17A 18D 19A 20D 21D

The consensus map challenge Consensus map is an abstraction from many maps Smooth out errors in component maps Put all markers on one map Find ‘most popular order’ when real differences exist Why ? –Merge information from diverse studies –Plan experiments –Organize database –Predict optimum genotypes –Sequence genome, clone genes, perfect predictions 17

Building block populations (“component maps”) 18 PopulationAbbr.Pop. SizeMarker TypeContributed byReference GS-7 x BoyerGB766KBonman et al.Babiker et al. in press Provena x GS-7PGS986K, GBSBonman et al. Babiker et al. in press Provena x BoyerPB1396KBonman et al.Babiker et al. in press x Clintland 64IL41126KKolb et al.Foresman et al., in press x Clintlant 64IL KKolb et al.Foresman et al., in press Assiniboia x MN841801AM 161 6KMitchell-Fetch et al.Nanjappa et al. in press Otana x PI269616OP 98 6K, GBSCarson et al. Oliver et al., 2013 CDC SolFi x HiFiSH 53 6K, GBSBeattie et al Oliver et al., 2013 Dal x ExeterDE 145 6K,GBSTinker et al. Hizbai et al., 2012 Hurdal x Z-597HZ 53 6K,GBSBjørnstad et al. Oliver et al., 2013 Ogle x TAMO 301OT 53 6K, GBSJackson et al. Portyanko et al., 1995 Kanota x OgleKO 52 6K, GBSTinker et al. O'Donoughue et al., 1995

CORE Concept Summary 19 Representative DNA Sequence Discover genetic differences among oat varieties Diverse germplasm Marker assays + gene database Mapping germplasmBreeder Germplasm Evaluate field & seed TraitsGenotype / Trait databaseConsensus map Analyse population structure Associate markers with traits Breeding assays + genomic selection

Spring and Winter are definitely different: 20

Model-based analysis reveals structure of 17 different breeding programs / regions 21 NDSU Winn Ottawa NordTexasIdaho Model: Groups defined by characteristic gene frequencies K = 10 groups (10 colours) % membership in each group for 600 oat varieties

Why does structure matter ? 22 “Winter” alleles Texas varieties Northern Prairie Varieties SNP and GBS markers “Spring” alleles

CORE Concept Summary 23 Representative DNA Sequence Discover genetic differences among oat varieties Diverse germplasm Marker assays + gene database Mapping germplasmBreeder Germplasm Evaluate field & seed TraitsGenotype / Trait databaseConsensus map Analyse population structure Associate markers with traits Breeding assays + genomic selection

Genome Wide Association Mapping (GWAS) Concept is simple: which markers are correlated with a trait which varieties have the good alleles at those loci Hundreds of good predictions from CORE Specialists are refining these predictions Correlate with known disease resistance Genotype x Environment interaction Explore candidate genes 24

Genomic selection Give every marker a weight –Predictions can be made entirely from markers Advantages –Simple: one abstraction, one inference: “best breeding value” Drawbacks –Tends to improve within a good population –Not good at introducing new alleles –Artifacts can go un-noticed 25

Integrating multiple inferences 26

Conclusions CORE data is a rich foundation –Already supporting new oat science –Moving toward a “universal” public oat database –Now mobilizing to support molecular breeding Challenges: –Develop “comfort level” with big-data and abstractions –Build smart-tools into database (“automated abstractions”) –Commit to continue sharing (experience, data and germplasm) –Predict crosses, not just selections –Use tools to access wild relatives 27

Acknowledgements Coauthors: Jesse Poland Eric Jackson Shiaoman Chao Gerard Lazo Becky Oliver Tinker Lab: Charlene Wight Yung-Fun Huang Kyle Gardner Phil Couroux Jiro Hattori Biniam Hizbai Benazir Marquez CORE: Rick Jellen, Marty Carson, Howard Rines, Don Obert, Joe Lutz, Irene Shackelford, Abraham Korol, Aaron Beattie, Åsmund Bjørnstad, Mike Bonman, Jean-Luc Jannink, Mark Sorrells, Gina Brown-Guedira, Jennifer Mitchell Fetch, Steve Harrison, Catherine Howarth, Amir Ibrahim, Fred Kolb, Mike McMullen, Paul Murphy, Herb Ohm, Brian Rossnagel, Weikai Yan, Kelci Miclaus, Jordan Hiller, Jeff Maughan, Rachel Redman, Joe Anderson, Emir Islamovic …. And others