Global dissection of cis and trans regulatory variations in Arabidopsis thaliana Xu Zhang Borevitz Lab
Col and Van are collected from distinct geographic locations
Expression regulatory variation can act in cis or in trans cis polymorphism gene 1 gene 2 gene 3 trans polymorphism regulator 1 regulator 2
eQTL mapping in segregating populations CAUSAL cis and trans variation can be mapped individually Additive effects and non-additive interactions among the eQTL can be assessed limited detection power due to the sample size and extent of recombination of mapping population; it actually maps LOCAL and DISTANT variation Dissection of cis and trans variation
Allele specific expression in a heterozygous system allele A allele B Allele A and B are exposed to the same pool of trans factors. Thus allelic expression should be due to cis difference within or nearby the gene
Col Van hybrids Composite trans effects are inferred by comparison of hybrids and parents cis effect trans effect expression level
Allele specific expression in a heterozygous system closely locates cis causal variation composite effect of trans variation can be inferred by testing the deviation of allelic expression in heterozygous from the differential expression between homozygous parents individual causal loci can NOT be mapped, their interaction can NOT be assessed Dissection of cis and trans variation
Van ♀ x Van ♂ Col ♀ x Van ♂ Van ♀ x Col ♂ 4 replicates for parental gDNA1:1 mix 4 replicates for parental mRNA1:1 mix 4 replicates for Col (mother) x Van F1s 4 replicates for Van (mother) x Col F1s Biological samples for microarray hybridization Col ♀ x Col ♂
cis and trans The measurement: allele intensity ratio of the transcript – Col allele/Van allele parental expression difference mRNA 1:1 mix vs gDNA 1:1 mix cis regulatory difference F1s vs gDNA 1:1 mix trans regulatory difference F1s vs mRNA 1:1 mix imprinting F1c vs F1v
~1.4M tiling probes at ~35bp resolution ~1M SNP probes for 250K SNPs each SNP 2 alleles x 2 strands allele A antisense strand: GACCAATTTTGACCCTAGATCGCCA allele A sense strand : CTGGTTAAAACTGGGATCTAGCGGT allele B antisense strand: GACCAATTTTGAACCTAGATCGCCA allele B sense strand : CTGGTTAAAACTTGGATCTAGCGGT AtSNPtile: A SNP/tiling array
Allele intensity ratio as a measurement of allelic difference log (A / B) of probe intensity log (A / B) of template mixturetarget amount in ug Across strands and SNPs A: Col allele; B: Van allele
Linear regression for each SNP and strand The effect of overall target amount is small
SNPs within transcribed region
The model ContrastmRNA1:1 mixgDNA1:1 mixF1cF1v Model 1 1 (parental difference) (imprinting effect)001 Model 2 11/3 2 (cis variation)01/ Model 3 11/31/3 2 (trans variation)01/2 3001
cis only Here cis variation up-regulates Van allele
trans only Here trans variation up-regulates Col allele
FLC FLC – cis and trans cis variation up-regulates Col allele trans variation up-regulates Van allele Col Van reciprocal F1 hybrids
FRIGIDA FRIGIDA – No difference
Parental differencecis DeltaSig+Sig-TotalFALSEFDRSig+Sig-TotalFALSEFDR % % % % % % % % % % % % % % % % % % transimprinting DeltaSig+Sig-TotalFALSEFDRSig+Sig-TotalFALSEFDR % % % % % % %303137% %0000NA %0000NA %0000NA %0000NA %0000NA Summary for 12,311 analyzed genes An over-estimation of cis variation genes
The direction of cis and trans effects relative to that of parental expression difference
The size of cis and trans effects relative to that of parental expression difference cis vs parental difference trans vs parental difference trans vs cis
Validation for microarray result by single base extension coupled with Mass-spectrometry (ρ = 0.74, p < 1.08E -06, n=32)
Regional difference in sequence polymorphism between cis and trans genes
(Wittkopp 2004) Regulatory genes Structural genes Gene regulatory network
Chromosomal distributions of cis and trans genes Sliding window of 120 genes
model chr1 (df=3105)chr2 (df=1500)chr3 (df=2418)chr4 (df=1831)chr5 (df=2852) overall (df=11714) ρp-valueρ ρ ρ ρ ρ cis~trans E E E E E E-16 cis~distance E E E E E E-16 trans~distance E E E E E E-16 cis~polymorphism E E E E E E-16 trans~polymorphism E E E E E E-16 polymorphism~distance E E E E E E-16 Correlation of chromosomal distributions among cis, trans, sequence polymorphism and gene distance Sliding window of 120 genes
CG methylation for cis or trans genes CG methylation within promoter: repression gene expression CG methylation within gene proper: facilitate gene expression
Histone modification for cis and trans genes H3K27me3: Histone 3 lysine 27 trimethylation H3K9me3: Histone 3 lysine 9 trimethylation LND: low nucleosome density regions
Gene expression specificity for cis and trans genes Data from Schmid et al, diverse tissues on Col wild type background expression levelexpression entropy gene length
Conclusions Large cis effect but more trans effect cis genes tend to locate in polymorphic, gene-poor chromosomal regions, where repressive epigenetic modifications are enriched and gene expression is tightly regulated trans genes tend to locate in conserved, gene-rich chromosomal regions, where activating epigenetic modifications are dominant and gene expression is more constitutive
Allele specific intron expression Intron retention is common in plant Allele specific intron expression suggests differential intron splicing
Summary for 6,707 analyzed introns Parental differencecis DeltaSig+Sig-TotalFALSEFDRSig+Sig-TotalFALSEFDR % % % % % % % % % % % % % % % % % % transimprinting DeltaSig+Sig-TotalFALSEFDRSig+Sig-TotalFALSEFDR % % % % %0003NA %0001NA NA NA NA NA NA0000
Distribution of sequence polymorphisms along up- and down-stream exons and the cis intron upstream exon downstream exon intron
Conclusions extensive intron splicing variation largely contribute by cis variation, no trans effect detected
Borevitz Lab: Justin Borevitz Yan Li Christos Noutsos Geoffrey Morris Andrew Cal Paul Grabowski Traci Viinanen Whitney Panneton Acknowledgements Greenhouse: Judy Coswell Sandra Suwanski John Zdenek