Tiling arrays for genetic, epigentic, and environmental variation in Arabidopsis thaliana Justin Borevitz Ecology & Evolution University of Chicago

Slides:



Advertisements
Similar presentations
Modeling sequence dependence of microarray probe signals Li Zhang Department of Biostatistics and Applied Mathematics MD Anderson Cancer Center.
Advertisements

Genetics of Adaptation: Arabidopsis thaliana as an ecological model Justin Borevitz Ecology & Evolution University of Chicago naturalvariation.org.
1.Generate mutants by mutagenesis of seeds Use a genetic background with lots of known polymorphisms compared to other genotypes. Availability of polymorphic.
Next-generation sequencing and PBRC. Next Generation Sequencer Applications DeNovo Sequencing Resequencing, Comparative Genomics Global SNP Analysis Gene.
Genomic Approaches to the Genetics of Adaptation Justin Borevitz Ecology & Evolution University of Chicago
Toward the genomics of Adaptation to seasonal environments in Arabidopsis thaliana Justin Borevitz Ecology & Evolution University of Chicago
Genomics of Natural Variation in Arabidopsis thaliana Justin Borevitz Salk Institute naturalvariation.org.
Natural Variation in Arabidopsis thaliana Light Response: Genomic Approaches Justin Borevitz Salk Institute naturalvariation.org.
MicroArray Evolution: expression to mapping and back again Justin Borevitz Salk Institute naturalvariation.org.
High Resolution Patterns of Variation in the Arabidopsis Genome Justin Borevitz University of Chicago naturalvariation.org.
Light response QTL in Arabidopsis thaliana: LIGHT1 cloning Justin Borevitz Ecology & Evolution University of Chicago naturalvariation.org.
Microarrays for mapping and expression analysis: Toward the genetic determinants of light response adaptation in Arabidopsis and Aquilegia Justin Borevitz.
Ecological Genomics Underlying Plant Evolution Deer mouse burrowBirds/insects in a cotton woodFresh water and marine invasives Aquilegia, Arabidopsis,
Genomic Methods for Cloning QTL Justin Borevitz University of Chicago naturalvariation.org.
Composite/ LegumeCotton wood, OakArabidopsis lyrata Miriam grass Aquilegia, Arabidopsis, Mimulus? Indiana Dunes National Lakeshore Justin Borevitz Ecology.
Markers, mapping, and expression using arrays Justin Borevitz Salk Institute naturalvariation.org.
Arrays as tools for Natural Variation studies: Mapping, Haplotyping, and gene expression Justin Borevitz University of Chicago naturalvariation.org`
Genetics and Genomics of Light Response adaptation in Arabidopsis thaliana Justin Borevitz Ecology & Evolution University of Chicago
Toward the genetic basis of adaptation using arrays Justin Borevitz Ecology & Evolution University of Chicago
Identification and Genotyping of Single Feature Polymorphisms in Complex Genomes Justin Borevitz University of Chicago naturalvariation.org.
Natural Variation in Light Response using Whole Genome Tiling Arrays Justin Borevitz Ecology & Evolution University of Chicago
EcoSystems Biology EcoSystems Biology
Toward the genetic basis of adaptation using arrays Justin Borevitz Ecology & Evolution University of Chicago
Toward the Ecological Genomics Underlying Plant Adaptation Deer mouse burrowBirds/insects in a cotton woodFresh water and marine invasives Aquilegia, Arabidopsis,
Genomics tools to identify the molecular basis of complex traits Justin Borevitz Salk Institute naturalvariation.org.
Comparative Genomic Hybridization (CGH). Outline Introduction to gene copy numbers and CGH technology DNA copy number alterations in breast cancer (Pollack.
Genetics and Genomics of Light Response adaptation in Arabidopsis thaliana Justin Borevitz Ecology & Evolution University of Chicago
Global dissection of cis and trans regulatory variations in Arabidopsis thaliana Xu Zhang Borevitz Lab.
Toward the genetic basis of adaptation: Arrays/Association Mapping Justin Borevitz Ecology & Evolution University of Chicago
High Density Oligo Arrays for Single Feature Polymorphism Genotyping and Mapping Justin Borevitz Ecology & Evolution University of Chicago
A dynamic program algorithm for haplotype block partitioning Zhang, et. al. (2002) PNAS. 99, 7335.
QTL mapping using Single Feature Polymorphisms Justin Borevitz Salk Institute naturalvariation.org.
Haplotype mapping with Single Feature Polymorphisms in Arabidopsis Justin Borevitz Ecology & Evolution University of Chicago
High Resolution Patterns of Variation in the Arabidopsis Genome Justin Borevitz Ecology & Evolution University of Chicago naturalvariation.org.
EXtreme Array Mapping and Haplotype analysis Using Arrays Justin Borevitz Salk Institute naturalvariation.org.
Genomic Systems underlying the genetics of adaptation in Arabidopsis thaliana Justin Borevitz Ecology & Evolution University of Chicago
SNP/Tiling arrays for very high density marker based breeding and QTL candidate gene identification Justin Borevitz Ecology & Evolution University of Chicago.
Mechanisms of Sustainable re Development: Lessons from Plants Justin Borevitz Ecology & Evolution University of Chicago
Towards the Arabidopsis Haplotype Map using Arrays Justin Borevitz Salk Institute naturalvariation.org.
Studies of Genome Wide Molecular Variation in Arabidopsis thaliana using Arrays Justin Borevitz Salk Institute naturalvariation.org.
Toward the genetic basis of adaptation: Arrays/Association Mapping Justin Borevitz Ecology & Evolution University of Chicago
Whole genome transcriptome variation in Arabidopsis thaliana Xu Zhang Borevitz Lab Whole genome transcriptome variation in Arabidopsis thaliana Xu Zhang.
Array Genotyping to Dissect Quantitative Trait Loci in Arabidopsis thaliana Justin Borevitz Ecology and Evolution University of Chicago naturalvariation.org.
Paola CASTAGNOLI Maria FOTI Microarrays. Applicazioni nella genomica funzionale e nel genotyping DIPARTIMENTO DI BIOTECNOLOGIE E BIOSCIENZE.
Special Topics in Genomics Lecture 1: Introduction Instructor: Hongkai Ji Department of Biostatistics
Large-Scale Copy Number Polymorphism in the Human Genome J. Sebat et al. Science, 305:525 Luana Ávila MedG 505 Feb. 24 th /24.
Fine Structure and Analysis of Eukaryotic Genes
Natural Variation in Arabidopsis ecotypes. Using natural variation to understand diversity Correlation of phenotype with environment (selective pressure?)
The Genome is Organized in Chromatin. Nucleosome Breathing, Opening, and Gaping.
Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology
Fig Chapter 12: Genomics. Genomics: the study of whole-genome structure, organization, and function Structural genomics: the physical genome; whole.
Technology for Systems Biology. Nucleic Acid Hybridization In principle complementary strands will associate Chemistry is quite different on surfaces.
Biology 101 DNA: elegant simplicity A molecule consisting of two strands that wrap around each other to form a “twisted ladder” shape, with the.
Microarrays and Their Uses Brad Windle, Ph.D
1 Transcript modeling Brent lab. 2 Overview Of Entertainment  Gene prediction Jeltje van Baren  Improving gene prediction with tiling arrays Aaron Tenney.
High Resolution Patterns of Variation in the Arabidopsis Genome Justin Borevitz University of Chicago naturalvariation.org.
Ecological and Evolutionary Systems biology: Conceptual and molecular tools for analysis Justin Borevitz Ecology & Evolution University of Chicago
Toward the genetic basis of adaptation using arrays Justin Borevitz Ecology & Evolution University of Chicago
1 Global expression analysis Monday 10/1: Intro* 1 page Project Overview Due Intro to R lab Wednesday 10/3: Stats & FDR - * read the paper! Monday 10/8:
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
MEME homework: probability of finding GAGTCA at a given position in the yeast genome, based on a background model of A = 0.3, T = 0.3, G = 0.2, C = 0.2.
Chapter 2 From Genes to Genomes. 2.1 Introduction We can think about mapping genes and genomes at several levels of resolution: A genetic (or linkage)
No reference available
Transcriptome What is it - genome wide transcript abundance How do you obtain it - Arrays + MPSS What do you do with it when you have it - ?
Different microarray applications Rita Holdhus Introduction to microarrays September 2010 microarray.no Aim of lecture: To get some basic knowledge about.
SNP Detection Congtam Pham 2/24/04 Dr. Marth’s Class.
Global Variation in Copy Number in the Human Genome
Microarray Technology and Applications
Rotation review Gaurav Moghe Genetics Program
MicroRNA Binding Sites in Arabidopsis Class III HD-ZIP mRNAs Are Required for Methylation of the Template Chromosome  Ning Bao, Khar-Wai Lye, M.Kathryn.
Presentation transcript:

Tiling arrays for genetic, epigentic, and environmental variation in Arabidopsis thaliana Justin Borevitz Ecology & Evolution University of Chicago

Widely Distributed Olivier Loudet

Local Population Variation Scott Hodges Ivan Baxter

Seasonal Variation Matt Horton Megan Dunning

Seasons in the Growth Chamber Changing Day length Cycle Light Intensity Cycle Light Colors Cycle Temperature Sweden Spain Seasons in the Growth Chamber Changing Day length Cycle Light Intensity Cycle Light Colors Cycle Temperature Developmental Plasticity == Behavior

Which arrays should be used? cDNA array Long oligo array BAC array

Which 25mer arrays should be used? Gene array Exon array Tiling array 35bp tile, 25mers 10bp gaps

Which 25mer arrays should be used? Tiling/SNP array SNP array Ressequencing array

RNA DNA Universal Whole Genome Array Transcriptome Atlas Expression levels Tissues specificity Transcriptome Atlas Expression levels Tissues specificity Gene/Exon Discovery Gene model correction Non-coding/ micro-RNA Gene/Exon Discovery Gene model correction Non-coding/ micro-RNA Alternative Splicing Comparative Genome Hybridization (CGH) Insertion/Deletions Copy Number Polymorphisms Comparative Genome Hybridization (CGH) Insertion/Deletions Copy Number Polymorphisms Methylation Chromatin Immunoprecipitation ChIP chip Chromatin Immunoprecipitation ChIP chip Polymorphism SFPs Discovery/Genotyping Polymorphism SFPs Discovery/Genotyping Control for hybridization/genetic polymorphisms to understand true EXPRESSION polymorphisms RNA Immunoprecipitation RIP chip RNA Immunoprecipitation RIP chip Antisense transcription Allele Specific Expression

SNP SFP MMMMMM MMMMMM Chromosome (bp) conservation SNP ORFa start AAAAA Transcriptome Atlas ORFb deletion Improved Genome Annotation

Talk Outline Whole Genome Tiling Arrays –Spatial Correction, grid alignment –Alternative splicing –Methylation –Single Feature Polymorphisms (SFPs) –Genetic Mapping –Potential deletions/ Copy Number Variants –Allele Specific Expression Resequencing/ Haplotypes –Variation Scanning Whole Genome Tiling Arrays –Spatial Correction, grid alignment –Alternative splicing –Methylation –Single Feature Polymorphisms (SFPs) –Genetic Mapping –Potential deletions/ Copy Number Variants –Allele Specific Expression Resequencing/ Haplotypes –Variation Scanning

Tiling Array Re annotation 6.25Million probes 3.125Million PM probes 1.67Million unique PM probes 17bp (blast) 736k PM features in TUs (exon array) 130k TUs 28k genes

Spatial Correction, grid Alignment Background correction for RNA, ! For DNA

Transcription subUnits (TUs) Exon1Exon2 Intron1 Tu1Tu2Tu3

Alternative Splicing V V V C C C Van Col Xu Zhang

Gene/Tu model for alternative splicing

ChIP chip treatment effect! Experimental Design same protocol/antibody dynamic binding model treatment effect Actual biological signal

Potential Deletions

Methods for labeling Extract genomic 100ng DNA (single leaf) Digest with either msp1 or hpa2 CCGG Label with biotin random primers Hybridize to array Fit model Y =  + E * G + 

Deltap0FALSECalledFDR % % % % % SFP detection on tiling arrays IntergenicExonintron SFPs total %8.86%3.53%5.71% SFPs/gene0>=1>=2>=3>=4>=5 genes

methylated features and mSFPs >10,000 of 100,000 at 5% FDR Enzyme effect, on CCGG featuresGxE 276 at 15% FDR mQTL?

Chip genotyping of a Recombinant Inbred Line 29kb interval

Mapbibb 100bibb mutant plants 100wt mutant plants

Array Mapping Hazen et al Plant Physiology 2005

Potential Deletions (wild lines) >500 potential deletions 45 confirmed by Ler sequence 23 (of 114) transposons Disease Resistance (R) gene clusters Single R gene deletions Genes involved in Secondary metabolism Unknown genes

Fast Neutron deletions FKF1 80kb deletion CHR1cry2 10kb deletion CHR1 Het

Natural Variation on Tiling Arrays

Potential Deletions Suggest Candidate Genes FLOWERING1 QTL Chr1 (bp) Flowering Time QTL caused by a natural deletion in FLM FLM FLM natural deletion (Werner et al PNAS 2005)

Allele specific expression

cis regulatory variation Col/Col Col/Van Van/Col Van/Van Van allele expressed Col allele expressed Col Female imprint

Allele specific expression between Col and Van

Array Haplotyping What about Diversity/selection across the genome? A genome wide estimate of population genetics parameters, θ w, π, Tajima’D, ρ LD decay, Haplotype block size Deep population structure? Col, Lz, Bur, Ler, Bay, Shah, Cvi, Kas, C24, Est, Kin, Mt, Nd, Sorbo, Van, Ws2 Fl-1, Ita-0, Mr-0, St-0, Sah-0

Array Haplotyping Inbred lines Low effective recombination due to partial selfing Extensive LD blocks ColLerCviKasBayShahLzNd Chromosome1 ~500kb

SFPs for reverse genetics 14 Accessions 30,950 SFPs`

Chromosome Wide Diversity

Diversity 50kb windows

Tajima’s D like 50kb windows RPS4 unknown

R genes vs bHLH

NaturalVariation.org USC Magnus Nordborg Paul Marjoram Max Planck Detlef Weigel Scripps Sam Hazen University of Michigan Sebastian Zollner University of Chicago Xu Zhang Evadne Smith Ken Okamoto Yan Li Michigan State Shinhan Shui Purdue Ivan Baxter Sainsbury Laboratory Jonathan Jones USC Magnus Nordborg Paul Marjoram Max Planck Detlef Weigel Scripps Sam Hazen University of Michigan Sebastian Zollner University of Chicago Xu Zhang Evadne Smith Ken Okamoto Yan Li Michigan State Shinhan Shui Purdue Ivan Baxter Sainsbury Laboratory Jonathan Jones