SNP/Tiling arrays for very high density marker based breeding and QTL candidate gene identification Justin Borevitz Ecology & Evolution University of Chicago.

Slides:



Advertisements
Similar presentations
Potato Mapping / QTLs Amir Moarefi VCR
Advertisements

Frary et al. Advanced Backcross QTL analysis of a Lycopersicon esculentum x L. pennellii cross and identification of possible orthologs in the Solanaceae.
Identification of markers linked to Selenium tolerance genes
Genetics of Adaptation: Arabidopsis thaliana as an ecological model Justin Borevitz Ecology & Evolution University of Chicago naturalvariation.org.
1.Generate mutants by mutagenesis of seeds Use a genetic background with lots of known polymorphisms compared to other genotypes. Availability of polymorphic.
A Genomic Survey of Polymorphism and Linkage Disequilibrium Imran Mohiuddin Magnus Nordborg, Ph.D. University of Southern California.
Genomic Approaches to the Genetics of Adaptation Justin Borevitz Ecology & Evolution University of Chicago
Toward the genomics of Adaptation to seasonal environments in Arabidopsis thaliana Justin Borevitz Ecology & Evolution University of Chicago
Genomics of Natural Variation in Arabidopsis thaliana Justin Borevitz Salk Institute naturalvariation.org.
Natural Variation in Arabidopsis thaliana Light Response: Genomic Approaches Justin Borevitz Salk Institute naturalvariation.org.
MicroArray Evolution: expression to mapping and back again Justin Borevitz Salk Institute naturalvariation.org.
High Resolution Patterns of Variation in the Arabidopsis Genome Justin Borevitz University of Chicago naturalvariation.org.
Light response QTL in Arabidopsis thaliana: LIGHT1 cloning Justin Borevitz Ecology & Evolution University of Chicago naturalvariation.org.
Microarrays for mapping and expression analysis: Toward the genetic determinants of light response adaptation in Arabidopsis and Aquilegia Justin Borevitz.
Ecological Genomics Underlying Plant Evolution Deer mouse burrowBirds/insects in a cotton woodFresh water and marine invasives Aquilegia, Arabidopsis,
Genomic Methods for Cloning QTL Justin Borevitz University of Chicago naturalvariation.org.
Composite/ LegumeCotton wood, OakArabidopsis lyrata Miriam grass Aquilegia, Arabidopsis, Mimulus? Indiana Dunes National Lakeshore Justin Borevitz Ecology.
Markers, mapping, and expression using arrays Justin Borevitz Salk Institute naturalvariation.org.
Arrays as tools for Natural Variation studies: Mapping, Haplotyping, and gene expression Justin Borevitz University of Chicago naturalvariation.org`
Genetics and Genomics of Light Response adaptation in Arabidopsis thaliana Justin Borevitz Ecology & Evolution University of Chicago
Toward the genetic basis of adaptation using arrays Justin Borevitz Ecology & Evolution University of Chicago
Identification and Genotyping of Single Feature Polymorphisms in Complex Genomes Justin Borevitz University of Chicago naturalvariation.org.
Natural Variation in Light Response using Whole Genome Tiling Arrays Justin Borevitz Ecology & Evolution University of Chicago
EcoSystems Biology EcoSystems Biology
Toward the genetic basis of adaptation using arrays Justin Borevitz Ecology & Evolution University of Chicago
Toward the Ecological Genomics Underlying Plant Adaptation Deer mouse burrowBirds/insects in a cotton woodFresh water and marine invasives Aquilegia, Arabidopsis,
Genomics tools to identify the molecular basis of complex traits Justin Borevitz Salk Institute naturalvariation.org.
Genetics and Genomics of Light Response adaptation in Arabidopsis thaliana Justin Borevitz Ecology & Evolution University of Chicago
Global dissection of cis and trans regulatory variations in Arabidopsis thaliana Xu Zhang Borevitz Lab.
Toward the genetic basis of adaptation: Arrays/Association Mapping Justin Borevitz Ecology & Evolution University of Chicago
High Density Oligo Arrays for Single Feature Polymorphism Genotyping and Mapping Justin Borevitz Ecology & Evolution University of Chicago
QTL mapping using Single Feature Polymorphisms Justin Borevitz Salk Institute naturalvariation.org.
Haplotype mapping with Single Feature Polymorphisms in Arabidopsis Justin Borevitz Ecology & Evolution University of Chicago
High Resolution Patterns of Variation in the Arabidopsis Genome Justin Borevitz Ecology & Evolution University of Chicago naturalvariation.org.
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
EXtreme Array Mapping and Haplotype analysis Using Arrays Justin Borevitz Salk Institute naturalvariation.org.
Genomic Systems underlying the genetics of adaptation in Arabidopsis thaliana Justin Borevitz Ecology & Evolution University of Chicago
Mechanisms of Sustainable re Development: Lessons from Plants Justin Borevitz Ecology & Evolution University of Chicago
Towards the Arabidopsis Haplotype Map using Arrays Justin Borevitz Salk Institute naturalvariation.org.
Studies of Genome Wide Molecular Variation in Arabidopsis thaliana using Arrays Justin Borevitz Salk Institute naturalvariation.org.
Toward the genetic basis of adaptation: Arrays/Association Mapping Justin Borevitz Ecology & Evolution University of Chicago
Whole genome transcriptome variation in Arabidopsis thaliana Xu Zhang Borevitz Lab Whole genome transcriptome variation in Arabidopsis thaliana Xu Zhang.
Tiling arrays for genetic, epigentic, and environmental variation in Arabidopsis thaliana Justin Borevitz Ecology & Evolution University of Chicago
Array Genotyping to Dissect Quantitative Trait Loci in Arabidopsis thaliana Justin Borevitz Ecology and Evolution University of Chicago naturalvariation.org.
Paola CASTAGNOLI Maria FOTI Microarrays. Applicazioni nella genomica funzionale e nel genotyping DIPARTIMENTO DI BIOTECNOLOGIE E BIOSCIENZE.
Recombinant DNA Technology Site directed mutagenesis Genetics vs. Reverse Genetics Gene expression in bacteria and viruses Gene expression in yeast Genetic.
Using mutants to clone genes Objectives 1. What is positional cloning? 2.What is insertional tagging? 3.How can one confirm that the gene cloned is the.
Natural Variation in Arabidopsis ecotypes. Using natural variation to understand diversity Correlation of phenotype with environment (selective pressure?)
Genomics BIT 220 Chapter 21.
Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology
Fig Chapter 12: Genomics. Genomics: the study of whole-genome structure, organization, and function Structural genomics: the physical genome; whole.
Development and Application of SNP markers in Genome of shrimp (Fenneropenaeus chinensis) Jianyong Zhang Marine Biology.
Genomics and Arabidopsis. What is ‘genomics’? Study of an organism’s entire genome –All the DNA encoded in the organism –Nucleus, mitochondria, chloroplasts.
10cM - Linkage Mapping Set v2 ABI Median intermarker distance: 4.7 Mb Mean intermarker distance: 5.6 Mb Mean genetic gap distance: 8.9 cM Average Heterozygosity.
High Resolution Patterns of Variation in the Arabidopsis Genome Justin Borevitz University of Chicago naturalvariation.org.
Ecological and Evolutionary Systems biology: Conceptual and molecular tools for analysis Justin Borevitz Ecology & Evolution University of Chicago
Toward the genetic basis of adaptation using arrays Justin Borevitz Ecology & Evolution University of Chicago
INTRODUCTION TO ASSOCIATION MAPPING
Chapter 5 The Content of the Genome 5.1 Introduction genome – The complete set of sequences in the genetic material of an organism. –It includes the.
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
MEME homework: probability of finding GAGTCA at a given position in the yeast genome, based on a background model of A = 0.3, T = 0.3, G = 0.2, C = 0.2.
February 20, 2002 UD, Newark, DE SNPs, Haplotypes, Alleles.
In The Name of GOD Genetic Polymorphism M.Dianatpour MLD,PHD.
Transcriptome What is it - genome wide transcript abundance How do you obtain it - Arrays + MPSS What do you do with it when you have it - ?
Goal: Establish Aquilegia as an evolutionary model system 70 taxa (diverse in floral morphology & ecology) recent & rapid radiation (little genetic variation.
Institute of Crop Sciences, CAAS
Different microarray applications Rita Holdhus Introduction to microarrays September 2010 microarray.no Aim of lecture: To get some basic knowledge about.
Map-based cloning of interesting genes
Peter John M.Phil, PhD Atta-ur-Rahman School of Applied Biosciences (ASAB) National University of Sciences & Technology (NUST)
Flowering-time QTL in crosses of Lz-0 with Ler and Col.
Presentation transcript:

SNP/Tiling arrays for very high density marker based breeding and QTL candidate gene identification Justin Borevitz Ecology & Evolution University of Chicago

Major Issues in Breeding Complex Traits High throughput Phenotyping –Physiological dissection of 1000s correlated traits –Biological Variation Multiple genes under major QTL –High Density markers –High throughput seedling screens –Linkage Drag Environmental Interaction (GxE) –Good for optimizing local varieties Epistasis (GxG) –Magnify minor QTL in local backgrounds Multi species ecological interactions –“extended phenotype”

Genomic Breeding Path QTL gene Confirmation Marker Identification Genotyping Genomics path Experimental Design Mapping population Phenotyping QTL Analysis Fine Mapping Candidate gene Polymorphisms gene expression loss of function QTL gene Confirmation Experimental Design Mapping population Phenotyping QTL Analysis Fine Mapping Borevitz and Chory, COPB 2003

Talk Outline Phenotyping in multiple environments –Seasonal Variation in the Lab Germplasm Diversity –Population structure, Haplotype Mapping set SNP/Tiling microarrays –Very High Density Markers –Mapping Extreme Bulk Segregant –Expression, splicing, and allelic variation Ecological context –Arabidopsis and Aquilegia Phenotyping in multiple environments –Seasonal Variation in the Lab Germplasm Diversity –Population structure, Haplotype Mapping set SNP/Tiling microarrays –Very High Density Markers –Mapping Extreme Bulk Segregant –Expression, splicing, and allelic variation Ecological context –Arabidopsis and Aquilegia

Begin with regions spanning the Native Geographic range Nordborg et al PLoS Biology 2005 Li et al PLoS ONE 2007 Tossa Del Mar Spain Lund Sweden

Seasons in the Growth Chamber Changing Day length Cycle Light Intensity Cycle Light Colors Cycle Temperature Sweden Spain Seasons in the Growth Chamber Changing Day length Cycle Light Intensity Cycle Light Colors Cycle Temperature Geneva Scientific/ Percival

Kurt Spokas Version 2.0a June 2006 USDA-ARS Website Midwest Area (Morris,MN)

Flowering time QTL, Kas/Col RILs Sweden 1 Col-gl1 Kas1 Sweden 2 Col-gl1 Kas1 Spain 1 Col-gl1 Kas1 Spain 2 Col-gl1 Kas1 Number of RILs Flowering time QTL, Kas/Col RILs FRI FLM

Kas/Col flowering time QTL GxE Chr4 FRI Chr1 FLM Chr4 FRI

Global and Local Population Structure Olivier Loudet

144 Non singleton SNPs >2000 accessions Global, Midwest, and UK common haplotypes Local Population Structure Megan Dunning, Yan Li

80 Major Haplotypes Diversity within and between populations

RNA DNA Universal Whole Genome Array Transcriptome Atlas Expression levels Tissues specificity Transcriptome Atlas Expression levels Tissues specificity Gene/Exon Discovery Gene model correction Non-coding/ micro-RNA Gene/Exon Discovery Gene model correction Non-coding/ micro-RNA Alternative Splicing Comparative Genome Hybridization (CGH) Insertion/Deletions Copy Number Polymorphisms Comparative Genome Hybridization (CGH) Insertion/Deletions Copy Number Polymorphisms Methylation Chromatin Immunoprecipitation ChIP chip Chromatin Immunoprecipitation ChIP chip Polymorphism SFPs Discovery/Genotyping Polymorphism SFPs Discovery/Genotyping Control for hybridization/genetic polymorphisms to understand TRUE expression variation RNA Immunoprecipitation RIP chip RNA Immunoprecipitation RIP chip Antisense transcription Allele Specific Expression

SNP SFP MMMMMM MMMMMM Chromosome (bp) conservation SNP ORFa start AAAAA Transcriptome Atlas ORFb deletion Improved Genome Annotation

Which arrays should be used? cDNA array Long oligo array BAC array

Which arrays should be used? Gene array Exon array Tiling array 35bp tile, 25mers 10bp gaps

Which arrays should be used? Tiling/SNP array k SNPs, 1.6M tiling probes SNP array Ressequencing array How about multiple species? Microbial communities? Pst,Psm,Psy,Psx, Agro, Xanthomonas, H parasitica, 15 virus,

Global Allele Specific Expression Zhang, X., Richards, E., Borevitz, J. Current Opinion in Plant Biology (2007) 65,000 SNPs Transcribed Accession Pairs 12,000 genes >= 1 SNP 6,000 >= 2 SNPs

Potential Deletions

Deltap0FALSECalledFDR % % % % % SFP detection on tiling arrays

Chip genotyping of a Recombinant Inbred Line 29kb interval

Map bibb 100 bibb mutant plants 100 wt mutant plants

bibb mapping ChipMap AS1 Bulk segregant Mapping using Chip hybridization bibb maps to Chromosome2 near ASYMETRIC LEAVES1

BIBB = ASYMETRIC LEAVES1 Sequenced AS1 coding region from bib-1 …found g -> a change that would introduce a stop codon in the MYB domain bibbas1-101 MYB bib-1 W49* as-101 Q107* as1 bibb AS1 (ASYMMETRIC LEAVES1) = MYB closely related to PHANTASTICA located at 64cM

Array Mapping Hazen et al Plant Physiology (2005) chr1 chr2 chr3 chr4 chr5

eXtreme Array Mapping 15 tallest RILs pooled vs 15 shortest RILs pooled

LOD eXtreme Array Mapping Allele frequencies determined by SFP genotyping. Thresholds set by simulations cM LOD Composite Interval Mapping RED2 QTL Chromosome 2 RED2 QTL 12cM Red light QTL RED2 from 100 Kas/ Col RILs (Wolyn et al Genetics 2004)

eXtreme Array Mapping BurC F2

XAM Lz x Col F2 QTL Lz x Ler F2 (Werner et al Genetics 2006)

X RED2 QTL mark1 mark2 Select recombinants by PCR >200 from >1250 plants High Low ~2Mb ~8cM >400 SFPs Col Kas Col het Col ~2 Kas hetCol het ~43 Kas Col Kashet Kas ~268 ~43~539 ~43 ~268~43 ~2 het ~539 Kas eXtreme Array Fine Mapping

Unite Genetic and Physical Map Shotgun genomic or 454 reads ESTs/ cDNAs/ BAC ends 1000s of contigs Genotype mapping population on arrays –Create very high density genetic map Known position of genes/contigs allow QTL candidatet gene identification –Control hybridization variation for gene expression

Potential Deletions >500 potential deletions 45 confirmed by Ler sequence 23 (of 114) transposons Disease Resistance (R) gene clusters Single R gene deletions Genes involved in Secondary metabolism Unknown genes

Potential Deletions Suggest Candidate Genes FLOWERING1 QTL Chr1 (bp) Flowering Time QTL caused by a natural deletion in FLM MAF1 FLM natural deletion (Werner et al PNAS 2005)

Fast Neutron deletions FKF1 80kb deletion CHR1cry2 10kb deletion CHR1 Het

Ecological and Evolutionary context Abiotic conditions –Light, temperature, humidity –Soil, water Biotic conditions –Pathogens and pollinators –Conspecifics, grasses, shrubs, trees Industrial Agriculture -> Sustainable EcoAgriculture Green, Super Hybrids!

Local adaptation under strong selection

Seasonal Variation Matt Horton Megan Dunning

Aquilegia (Columbines) Recent adaptive radiation, 350Mb genome

Genetics of Speciation along a Hybrid Zone

Aquilegia (Columbine) NSF Genome Complexity Microarray floral development –QTL candidates Physical Map (BAC tiling path) –Physical assignment of ESTs QTL for pollinator preference –~400 RILs, map abiotic stress –QTL fine mapping/ LD mapping Develop transformation techniques –VIGS Whole Genome Sequencing (JGI 2007) Scott Hodges (UCSB) Elena Kramer (Harvard) Magnus Nordborg (USC) Justin Borevitz (U Chicago) Jeff Tompkins (Clemson)

NaturalVariation.org USC Magnus Nordborg Paul Marjoram Max Planck Detlef Weigel Scripps Sam Hazen University of Michigan Sebastian Zoellner USC Magnus Nordborg Paul Marjoram Max Planck Detlef Weigel Scripps Sam Hazen University of Michigan Sebastian Zoellner University of Chicago Xu Zhang Yan Li Peter Roycewicz Evadne Smith Megan Dunning Joy Bergelson Michigan State Shinhan Shiu Purdue Ivan Baxter University of Chicago Xu Zhang Yan Li Peter Roycewicz Evadne Smith Megan Dunning Joy Bergelson Michigan State Shinhan Shiu Purdue Ivan Baxter