Identification and Genotyping of Single Feature Polymorphisms in Complex Genomes Justin Borevitz University of Chicago naturalvariation.org.

Talk Outline
Intro/QTL mapping
Single Feature Polymorphisms (SFPs)
–Potential deletions
Bulk Segregant Mapping
–eXtreme Array Mapping
Transcriptional profiling
–for QTL candidate genes

Quantitative Trait Loci EPI1 EPI2

Near-Isogenic Lines for LIGHT1 (Ler / Cvi)
[Figure: marker map (axes: Mb, Marker) with markers SNP377, SM184, SM50, SM35, SM106, G2395, SNP65, SM40, SEQ8298, TH1, MSAT7964, MAT7787, CER; RVE7 and GI marked; phenotype (mm) by line, line labels as extracted: #3, 81N-J17A-A/J, Ler]

What is Array Genotyping?
Affymetrix expression GeneChips contain 202,806 unique 25 bp oligonucleotides.
11 features per probe set for genes.
New arrays have even more.
Genomic DNA is randomly labeled with biotin; product ~50 bp.
3 independent biological replicates are compared to the reference strain Col.

Potential Deletions

Spatial Correction
Spatial artifacts
Improved reproducibility
Next: quantile normalization
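The slide names quantile normalization as the next step without showing how it was run. A minimal sketch of the general technique, assuming each array is a plain list of feature intensities and that ties are simply broken by position (the function name `quantile_normalize` and the toy values are illustrative and not from the talk):

```python
def quantile_normalize(arrays):
    """Force every array onto the same empirical intensity distribution.

    arrays: list of equal-length lists, one list of intensities per chip.
    Returns new lists in the original feature order.
    Ties are broken by position, which is a simplification.
    """
    n = len(arrays[0])
    if any(len(a) != n for a in arrays):
        raise ValueError("all arrays must have the same number of features")
    # Feature indices of each array, ordered from lowest to highest intensity.
    orders = [sorted(range(n), key=lambda i, a=a: a[i]) for a in arrays]
    # Reference distribution: mean of the k-th smallest value across arrays.
    reference = [
        sum(a[order[k]] for a, order in zip(arrays, orders)) / len(arrays)
        for k in range(n)
    ]
    normalized = []
    for order in orders:
        out = [0.0] * n
        for k, i in enumerate(order):
            out[i] = reference[k]  # k-th ranked feature gets the k-th reference value
        normalized.append(out)
    return normalized


chips = [[5.0, 2.0, 3.0], [4.0, 1.0, 6.0]]  # two toy chips, three features each
print(quantile_normalize(chips))  # [[5.5, 1.5, 3.5], [3.5, 1.5, 5.5]]
```

After this step every chip has the same set of intensity values, so remaining differences at a feature reflect its rank within each chip rather than overall chip brightness.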

False Discovery and Sensitivity (PM only)

SAM threshold: 5% FDR
[Table: GeneChip calls (SFPs, nonSFPs) against Cereon marker accuracy and sequence sensitivity (polymorphic, non-polymorphic); cell values not recovered]
False discovery rate: 3%
Test for independence of all factors: Chisq = , df = 1, p-value = 1.845e-40

SAM threshold: 18% FDR
[Table: GeneChip calls (SFPs, nonSFPs) against Cereon marker accuracy and sequence sensitivity (polymorphic, non-polymorphic); cell values not recovered]
False discovery rate: 13%
Test for independence of all factors: Chisq = , df = 1, p-value = 1.309e-59

[Percentages detached from the tables: 90% 80% 70%; 41% 53% 85%; 90% 80% 70%; 67% 85% 100%]
3/4 Cvi markers were also confirmed in PHYB
Cereon may be a sequencing error
TIGR match is a match
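The 5% and 18% figures on this slide are SAM false discovery thresholds. As a rough illustration of how a permutation-based FDR of that kind can be estimated, the sketch below uses a simplified SAM-style statistic and shuffles the array labels jointly across all features. It is not the SAM software itself: the fudge constant `s0`, the function names, and the toy intensities are all assumptions made for the example.

```python
import random
import statistics


def d_stat(ref, alt, s0=0.1):
    """SAM-style statistic: mean difference over (pooled standard error + s0)."""
    n1, n2 = len(ref), len(alt)
    pooled_var = (
        (n1 - 1) * statistics.variance(ref) + (n2 - 1) * statistics.variance(alt)
    ) / (n1 + n2 - 2)
    se = (pooled_var * (1 / n1 + 1 / n2)) ** 0.5
    return (statistics.mean(ref) - statistics.mean(alt)) / (se + s0)


def permutation_fdr(features, threshold, n_perm=200, seed=0):
    """Estimate the FDR at a given |d| threshold.

    features: list of (reference_values, accession_values), one pair per feature.
    Returns (number of features called, estimated FDR).
    """
    rng = random.Random(seed)
    called = sum(1 for ref, alt in features if abs(d_stat(ref, alt)) >= threshold)
    n_ref = len(features[0][0])
    n_total = n_ref + len(features[0][1])
    false_calls = 0
    for _ in range(n_perm):
        # One relabeling of the arrays, applied to every feature.
        idx = list(range(n_total))
        rng.shuffle(idx)
        for ref, alt in features:
            pooled = list(ref) + list(alt)
            perm = [pooled[i] for i in idx]
            if abs(d_stat(perm[:n_ref], perm[n_ref:])) >= threshold:
                false_calls += 1
    expected_false = false_calls / n_perm  # average calls under shuffled labels
    fdr = min(1.0, expected_false / called) if called else 0.0
    return called, fdr


# Toy data: one feature with a clear intensity drop, five with none.
features = [([10.0, 10.1, 9.9], [5.0, 5.1, 4.9])]
features += [([8.0, 8.1, 7.9], [8.05, 7.95, 8.0])] * 5
called, fdr = permutation_fdr(features, threshold=5.0)
print(called, round(fdr, 2))
```

With only 3 replicates per genotype there are few distinct relabelings, so the estimate is coarse; more replicates give the permutation null more resolution.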

Effect of SNP position
340 candidate polymorphisms
[Figure legend: false negative, true positive]

Chip genotyping of a Recombinant Inbred Line
29 kb interval
Discovery: 6 replicates × $500 / 12,000 SFPs = $0.25 per SFP
Typing: 1 replicate × $500 / 12,000 SFPs = $0.041 per SFP
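The per-SFP costs on this slide follow from dividing the total array cost by the number of SFPs scored. A small sketch of that arithmetic, using the slide's $500 per array and 12,000 SFPs as defaults (the function name `cost_per_sfp` is illustrative):

```python
def cost_per_sfp(replicates, cost_per_array=500.0, n_sfps=12_000):
    """Array cost spread over every SFP scored on those arrays."""
    return replicates * cost_per_array / n_sfps


print(f"discovery: ${cost_per_sfp(6):.3f} per SFP")  # discovery: $0.250 per SFP
print(f"typing:    ${cost_per_sfp(1):.4f} per SFP")  # typing:    $0.0417 per SFP
```

The typing figure is 500 / 12,000 = 0.04167, which the slide shows truncated to $0.041.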

LIGHT1 NIL

Potential Deletions
>500 potential deletions
45 confirmed by Ler sequence
23 (of 114) transposons
Disease Resistance (R) gene clusters
Single R gene deletions
Genes involved in secondary metabolism
Unknown genes

Potential Deletions Suggest Candidate Genes
FLOWERING1 QTL
Flowering time QTL caused by a natural deletion in MAF1
[Figure: Chr1 position (bp); MAF1 natural deletion]

Fast Neutron deletions
FKF1 80 kb deletion, CHR1
cry2 10 kb deletion, CHR1, Het

Map bibb
100 bibb mutant plants
100 wt mutant plants

bibb mapping
Bulk segregant mapping using chip hybridization
bibb maps to Chromosome 2 near ASYMMETRIC LEAVES1
[Figure labels: ChipMap, AS1]

BIBB = ASYMMETRIC LEAVES1
Sequenced AS1 coding region from bib-1
…found g -> a change that would introduce a stop codon in the MYB domain
AS1 (ASYMMETRIC LEAVES1) = MYB closely related to PHANTASTICA, located at 64 cM
[Figure labels: bibb, as1-101; MYB domain with bib-1 W49* and as-101 Q107*; as1, bibb]

arrhythmic11: mapping confirmed (Sam Hazen)

arrhythmic90: gene cloned (Sam Hazen)
arrhythmic21: allelic to arr90 (Sam Hazen)

stamenstay (Ler): mapping confirmed (Sarah Liljegren)

ein6 een double mutant: mapping confirmed (Ramlah Nehring)

eXtreme Array Mapping 15 tallest RILs pooled vs 15 shortest RILs pooled

eXtreme Array Mapping
Red light QTL RED2 from 100 Kas/Col RILs
15 tallest RILs pooled vs 15 shortest RILs pooled
Allele frequencies determined by SFP genotyping. Thresholds set by simulations.
[Figure: eXtreme Array Mapping LOD profile for Chromosome 2 with the RED2 QTL; Composite Interval Mapping LOD profile (cM) with the RED2 QTL, 12 cM]
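The slide states that thresholds were set by simulation but gives no detail. One way such a threshold could be obtained for a single marker is sketched below, under strong simplifying assumptions: each RIL is homozygous for one parental allele with probability 0.5, the two pools are drawn at random under the null, and linkage and genome-wide multiple testing are ignored. On the array the pool frequencies would come from hybridization intensities; here per-line genotypes stand in for them. All names and the example pools are hypothetical.

```python
import random


def pool_frequency(genotypes):
    """Fraction of lines in a pool carrying the 'A' parental allele."""
    return sum(g == "A" for g in genotypes) / len(genotypes)


def null_threshold(pool_size=15, n_sim=10_000, quantile=0.95, seed=1):
    """Simulated quantile of |frequency difference| when pools are random."""
    rng = random.Random(seed)
    diffs = []
    for _ in range(n_sim):
        tall = [rng.choice("AB") for _ in range(pool_size)]
        short = [rng.choice("AB") for _ in range(pool_size)]
        diffs.append(abs(pool_frequency(tall) - pool_frequency(short)))
    diffs.sort()
    return diffs[int(quantile * (n_sim - 1))]


threshold = null_threshold()
# Hypothetical marker near a QTL: 13 of 15 tall lines carry 'A', 3 of 15 short lines do.
observed = abs(pool_frequency("A" * 13 + "B" * 2) - pool_frequency("A" * 3 + "B" * 12))
print(observed > threshold)  # True
```

A marker unlinked to the trait should show a frequency difference near zero between the pools, while a marker at the QTL is pushed toward opposite alleles in the tall and short pools.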

Fine Mapping with Arrays
Single additive gene
1000 F2s
Select recombinants by PCR
1 Mb region

SFPs for reverse genetics
14 accessions
30,950 SFPs

Barley SFPs gDNA
9 arrays, randomly labeled genomic DNA
3 wild type, 3 parent 1, 3 parent 2
Hope to verify some RNA SFPs
Pairs plots, correlation matrix
SFP table

Just better than permutations
[Table columns: delta, ori.data, perm.data, difference, FDR; values not recovered]
Increase specific activity with other labeling methods
Perform more replicates

Single Feature Polymorphisms
–Improve with replicates (easy)
–Improved statistical models
Genotyping
–Precisely define recombination breakpoints
–Fine mapping
Potential Deletions
–Candidate genes/induced mutations
Bulk Segregant Mapping
–eXtreme Array Mapping, F2s etc.

Look for gene expression differences between genotypes
Identify candidate genes that map to mutation
Downstream targets that map elsewhere
Transcription based cloning

Differences may be due to expression or hybridization

PAG1 down-regulated in Cvi
PALE GREEN1 knockout has long hypocotyl in red light

SFPs from RNA
Barley Affy array probe sets
–Most probe sets have 11 probes
–Background correction “rma2”
–Quantile normalization
36 arrays total
–3 replicates
–6 tissues: leaf, crown, root, radicle, gem, col?
–2 genotypes: Golden Promise (7,459 ESTs), Morex (52,695 ESTs)

Look at some plots raw data

Remove probe effect

Remove tissue effect

Remove Genotype effect
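The three preceding slides name the effects removed in turn (probe, tissue, genotype) but not the procedure. One plausible reading, sketched here for a single probe set, is sequential mean-centering: subtract the mean of each level of one factor, then the next. In a balanced design what remains is the probe-by-genotype signal, which is where an SFP would show up. The record layout, function names, and toy effect sizes are assumptions for illustration only.

```python
from collections import defaultdict
from itertools import product


def remove_effect(records, key):
    """Subtract the mean of each level of `key` from the values at that level."""
    groups = defaultdict(list)
    for r in records:
        groups[r[key]].append(r["value"])
    means = {level: sum(v) / len(v) for level, v in groups.items()}
    return [{**r, "value": r["value"] - means[r[key]]} for r in records]


def probe_by_genotype_residuals(records):
    """Remove probe, tissue, and genotype effects, then average what is left."""
    for key in ("probe", "tissue", "genotype"):
        records = remove_effect(records, key)
    cells = defaultdict(list)
    for r in records:
        cells[(r["probe"], r["genotype"])].append(r["value"])
    return {cell: sum(v) / len(v) for cell, v in cells.items()}


# Toy probe set: additive effects plus one hypothetical SFP on probe p2 in Morex.
probe_effect = {"p1": 10.0, "p2": 8.0, "p3": 9.0}
tissue_effect = {"leaf": 1.0, "root": 0.0}
genotype_effect = {"Golden Promise": 0.5, "Morex": 0.0}
records = []
for p, t, g in product(probe_effect, tissue_effect, genotype_effect):
    value = probe_effect[p] + tissue_effect[t] + genotype_effect[g]
    if (p, g) == ("p2", "Morex"):
        value -= 3.0  # weaker hybridization at the polymorphic probe
    records.append({"probe": p, "tissue": t, "genotype": g, "value": value})

residuals = probe_by_genotype_residuals(records)
for cell in sorted(residuals):
    print(cell, residuals[cell])
```

In this toy example probe p2 ends with residuals of +1.0 (Golden Promise) and -1.0 (Morex), while p1 and p3 sit at -0.5 and +0.5, so p2 is the probe whose genotype contrast departs from the rest of the probe set.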

SAM False Discovery Rate
[Table columns: delta, ori.data, perm.data, difference, FDR; values not recovered]
Both + and – SFPs since no reference comparison
Need to compare with ESTs

Review
Single Feature Polymorphisms (SFPs) can be used to identify recombination breakpoints and potential deletions, and for eXtreme Array Mapping and haplotyping.
Expression analysis that considers polymorphisms can identify QTL candidate genes and downstream responses.

Universal Whole Genome Array (RNA, DNA)
Transcriptome Atlas: expression levels, tissue specificity
Gene Discovery: gene model correction, non-coding/micro-RNA, antisense transcription
Alternative Splicing
Comparative Genome Hybridization (CGH): insertions/deletions
Methylation
Chromatin Immunoprecipitation: ChIP chip
Polymorphism SFPs: discovery/genotyping
~35 bp tile, non-repetitive regions, “good” binding oligos, evenly spaced

NaturalVariation.org
Syngenta: Hur-Song Chang, Tong Zhu
University of Guelph, Canada: Dave Wolyn
Salk: Jon Werner, Todd Mockler, Sarah Liljegren, Ramlah Nehring, Joanne Chory, Detlef Weigel, Joseph Ecker
UC Davis: Julin Maloof
UC San Diego: Charles Berry
Scripps: Sam Hazen, Elizabeth Winzeler