Global dissection of cis and trans regulatory variations in Arabidopsis thaliana Xu Zhang Borevitz Lab.

Slides:



Advertisements
Similar presentations
Functional Genomics with Next-Generation Sequencing
Advertisements

Genetic Analysis of Genome-wide Variation in Human Gene Expression Morley M. et al. Nature 2004,430: Yen-Yi Ho.
Epigenetics Xiaole Shirley Liu STAT115, STAT215, BIO298, BIST520.
Identification of markers linked to Selenium tolerance genes
Geuvadis RNAseq UNIGE Genetic regulatory variants
SHI Meng. Abstract The genetic basis of gene expression variation has long been studied with the aim to understand the landscape of regulatory variants,
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Epigenetics 12/05/07 Statisticians like data.
Composite/ LegumeCotton wood, OakArabidopsis lyrata Miriam grass Aquilegia, Arabidopsis, Mimulus? Indiana Dunes National Lakeshore Justin Borevitz Ecology.
Markers, mapping, and expression using arrays Justin Borevitz Salk Institute naturalvariation.org.
Genomics tools to identify the molecular basis of complex traits Justin Borevitz Salk Institute naturalvariation.org.
Chromatin Structure in Water-Deficit Stress in Arabidopsis Yong Ding, Karin van Dijk, Sridhar Malkaram, Rong Liu, J.J.M. Riethoven, Jingi Yang, Han Chen,
QTL mapping using Single Feature Polymorphisms Justin Borevitz Salk Institute naturalvariation.org.
Hybridization Diagnostic tools Nucleic acid Basics PCR Electrophoresis
CS 374: Relating the Genetic Code to Gene Expression Sandeep Chinchali.
EXtreme Array Mapping and Haplotype analysis Using Arrays Justin Borevitz Salk Institute naturalvariation.org.
Whole genome transcriptome variation in Arabidopsis thaliana Xu Zhang Borevitz Lab Whole genome transcriptome variation in Arabidopsis thaliana Xu Zhang.
Tiling arrays for genetic, epigentic, and environmental variation in Arabidopsis thaliana Justin Borevitz Ecology & Evolution University of Chicago
Special Topics in Genomics Lecture 1: Introduction Instructor: Hongkai Ji Department of Biostatistics
Geuvadis RNAseq analysis at UNIGE Analysis plans
Natural Variation in Arabidopsis ecotypes. Using natural variation to understand diversity Correlation of phenotype with environment (selective pressure?)
The Genome is Organized in Chromatin. Nucleosome Breathing, Opening, and Gaping.
The Center for Medical Genomics facilitates cutting-edge research with state-of-the-art genomic technologies for studying gene expression and genetics,
Regulation of Gene Expression Eukaryotes
IAP workshop, Ghent, Sept. 18 th, 2008 Mixed model analysis to discover cis- regulatory haplotypes in A. Thaliana Fanghong Zhang*, Stijn Vansteelandt*,
Experimental validation. Integration of transcriptome and genome sequencing uncovers functional variation in human populations Tuuli Lappalainen et al.
©Edited by Mingrui Zhang, CS Department, Winona State University, 2008 Identifying Lung Cancer Risks.
Biology 101 DNA: elegant simplicity A molecule consisting of two strands that wrap around each other to form a “twisted ladder” shape, with the.
Supplemental Figure 1A. A small fraction of genes were mapped to >=20 SNPs. Supplemental Figure 1B. The density of distance from the position of an associated.
The generalized transcription of the genome Víctor Gámez Visairas Genomics Course 2014/15.
Large-scale recombination rate patterns are conserved among human populations David Serre McGill University and Genome Quebec Innovation Center UQAM January.
Sackler Medical School
1 Global expression analysis Monday 10/1: Intro* 1 page Project Overview Due Intro to R lab Wednesday 10/3: Stats & FDR - * read the paper! Monday 10/8:
February 20, 2002 UD, Newark, DE SNPs, Haplotypes, Alleles.
Microarray analysis Quantitation of Gene Expression Expression Data to Networks BIO520 BioinformaticsJim Lund Reading: Ch 16.
1 Before considering selection, it’s important to characterize how gene expression varies within and between species. What evolutionary forces act on gene.
Biol 456/656 Molecular Epigenetics Lecture #5 Wed. Sept 2, 2015.
Supplemental Figure 1. False trans association due to probe cross-hybridization and genetic polymorphism at single base extension site. (A) The Infinium.
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
1 What forces constrain/drive protein evolution? Looking at all coding sequences across multiple genomes can shed considerable light on which forces contribute.
Genomics 2015/16 Silvia del Burgo. + Same genome for all cells that arise from single fertilized egg, Identity?  Epigenomic signatures + Epigenomics:
Genetics of Gene Expression BIOS Statistics for Systems Biology Spring 2008.
Quest for epigenetic determinants of local coexpression clusters Wieslawa Mentzen Labrador and Corces, 2002.
Enhancers and 3D genomics Noam Bar RESEARCH METHODS IN COMPUTATIONAL BIOLOGY.
Understanding GWAS SNPs Xiaole Shirley Liu Stat 115/215.
Integrative Genomics. Double-helix DNA strands are separated in the gene coding region Which enzyme detects the beginning of a gene ? RNA Polymerase (multi-subunit.
Squeezing out the histone modifications data Wieslawa Mentzen with Matteo Floris and Paolo Uva Connections between epigenetics and microRNAs during embryonic.
Mapping of high temperature growth genes derived from industrial yeast strains Justin Goh, Richard Gardner School of Biological Sciences, University of.
EQTLs.
Xiaoshu Chen, Jianzhi Zhang  Cell Systems 
ChipViewer is coded to visualize and analyze the tiling chip data.
Transcription & Gene Expression
Gene Hunting: Design and statistics
MICROBIAL GENETICS CHAPTER 7.
Identification of imprinted genes and imprinted DMRs
Volume 28, Issue 3, Pages (November 2007)
1. Interpreting rich epigenomic datasets
by Holger Weishaupt, Mikael Sigvardsson, and Joanne L. Attema
Volume 33, Issue 4, Pages (February 2009)
Epigenetic regulation of miR-193b in liposarcomagenesis.
Edwards Allen, Zhixin Xie, Adam M. Gustafson, James C. Carrington  Cell 
Mechanisms and Consequences of Alternative Polyadenylation
Alex M. Plocik, Brenton R. Graveley  Molecular Cell 
Volume 126, Issue 6, Pages (September 2006)
Volume 7, Issue 9, Pages (September 2014)
Dynamic Regulation of Nucleosome Positioning in the Human Genome
Volume 10, Issue 3, Pages (March 2017)
Genetic and Epigenetic Regulation of Human lincRNA Gene Expression
MicroRNA Binding Sites in Arabidopsis Class III HD-ZIP mRNAs Are Required for Methylation of the Template Chromosome  Ning Bao, Khar-Wai Lye, M.Kathryn.
Xiaoshu Chen, Jianzhi Zhang  Cell Systems 
Presentation transcript:

Global dissection of cis and trans regulatory variations in Arabidopsis thaliana Xu Zhang Borevitz Lab

Col and Van are collected from distinct geographic locations

Expression regulatory variation can act in cis or in trans cis polymorphism gene 1 gene 2 gene 3 trans polymorphism regulator 1 regulator 2

eQTL mapping in segregating populations  CAUSAL cis and trans variation can be mapped individually  Additive effects and non-additive interactions among the eQTL can be assessed  limited detection power due to the sample size and extent of recombination of mapping population; it actually maps LOCAL and DISTANT variation Dissection of cis and trans variation

Allele specific expression in a heterozygous system allele A allele B Allele A and B are exposed to the same pool of trans factors. Thus allelic expression should be due to cis difference within or nearby the gene

Col Van hybrids Composite trans effects are inferred by comparison of hybrids and parents cis effect trans effect expression level

Allele specific expression in a heterozygous system  closely locates cis causal variation  composite effect of trans variation can be inferred by testing the deviation of allelic expression in heterozygous from the differential expression between homozygous parents  individual causal loci can NOT be mapped, their interaction can NOT be assessed Dissection of cis and trans variation

Van ♀ x Van ♂ Col ♀ x Van ♂ Van ♀ x Col ♂  4 replicates for parental gDNA1:1 mix  4 replicates for parental mRNA1:1 mix  4 replicates for Col (mother) x Van F1s  4 replicates for Van (mother) x Col F1s Biological samples for microarray hybridization Col ♀ x Col ♂

cis and trans The measurement: allele intensity ratio of the transcript – Col allele/Van allele  parental expression difference mRNA 1:1 mix vs gDNA 1:1 mix  cis regulatory difference F1s vs gDNA 1:1 mix  trans regulatory difference F1s vs mRNA 1:1 mix  imprinting F1c vs F1v

 ~1.4M tiling probes at ~35bp resolution  ~1M SNP probes for 250K SNPs each SNP 2 alleles x 2 strands allele A antisense strand: GACCAATTTTGACCCTAGATCGCCA allele A sense strand : CTGGTTAAAACTGGGATCTAGCGGT allele B antisense strand: GACCAATTTTGAACCTAGATCGCCA allele B sense strand : CTGGTTAAAACTTGGATCTAGCGGT AtSNPtile: A SNP/tiling array

Allele intensity ratio as a measurement of allelic difference log (A / B) of probe intensity log (A / B) of template mixturetarget amount in ug Across strands and SNPs A: Col allele; B: Van allele

Linear regression for each SNP and strand The effect of overall target amount is small

SNPs within transcribed region

The model ContrastmRNA1:1 mixgDNA1:1 mixF1cF1v Model 1 1 (parental difference) (imprinting effect)001 Model 2 11/3 2 (cis variation)01/ Model 3 11/31/3 2 (trans variation)01/2 3001

cis only Here cis variation up-regulates Van allele

trans only Here trans variation up-regulates Col allele

FLC FLC – cis and trans cis variation up-regulates Col allele trans variation up-regulates Van allele Col Van reciprocal F1 hybrids

FRIGIDA FRIGIDA – No difference

Parental differencecis DeltaSig+Sig-TotalFALSEFDRSig+Sig-TotalFALSEFDR % % % % % % % % % % % % % % % % % % transimprinting DeltaSig+Sig-TotalFALSEFDRSig+Sig-TotalFALSEFDR % % % % % % %303137% %0000NA %0000NA %0000NA %0000NA %0000NA Summary for 12,311 analyzed genes An over-estimation of cis variation genes

The direction of cis and trans effects relative to that of parental expression difference

The size of cis and trans effects relative to that of parental expression difference cis vs parental difference trans vs parental difference trans vs cis

Validation for microarray result by single base extension coupled with Mass-spectrometry (ρ = 0.74, p < 1.08E -06, n=32)

Regional difference in sequence polymorphism between cis and trans genes

(Wittkopp 2004) Regulatory genes Structural genes Gene regulatory network

Chromosomal distributions of cis and trans genes Sliding window of 120 genes

model chr1 (df=3105)chr2 (df=1500)chr3 (df=2418)chr4 (df=1831)chr5 (df=2852) overall (df=11714) ρp-valueρ ρ ρ ρ ρ cis~trans E E E E E E-16 cis~distance E E E E E E-16 trans~distance E E E E E E-16 cis~polymorphism E E E E E E-16 trans~polymorphism E E E E E E-16 polymorphism~distance E E E E E E-16 Correlation of chromosomal distributions among cis, trans, sequence polymorphism and gene distance Sliding window of 120 genes

CG methylation for cis or trans genes CG methylation within promoter: repression gene expression CG methylation within gene proper: facilitate gene expression

Histone modification for cis and trans genes H3K27me3: Histone 3 lysine 27 trimethylation H3K9me3: Histone 3 lysine 9 trimethylation LND: low nucleosome density regions

Gene expression specificity for cis and trans genes Data from Schmid et al, diverse tissues on Col wild type background expression levelexpression entropy gene length

Conclusions  Large cis effect but more trans effect  cis genes tend to locate in polymorphic, gene-poor chromosomal regions, where repressive epigenetic modifications are enriched and gene expression is tightly regulated  trans genes tend to locate in conserved, gene-rich chromosomal regions, where activating epigenetic modifications are dominant and gene expression is more constitutive

Allele specific intron expression  Intron retention is common in plant  Allele specific intron expression suggests differential intron splicing

Summary for 6,707 analyzed introns Parental differencecis DeltaSig+Sig-TotalFALSEFDRSig+Sig-TotalFALSEFDR % % % % % % % % % % % % % % % % % % transimprinting DeltaSig+Sig-TotalFALSEFDRSig+Sig-TotalFALSEFDR % % % % %0003NA %0001NA NA NA NA NA NA0000

Distribution of sequence polymorphisms along up- and down-stream exons and the cis intron upstream exon downstream exon intron

Conclusions  extensive intron splicing variation  largely contribute by cis variation, no trans effect detected

Borevitz Lab: Justin Borevitz Yan Li Christos Noutsos Geoffrey Morris Andrew Cal Paul Grabowski Traci Viinanen Whitney Panneton Acknowledgements Greenhouse: Judy Coswell Sandra Suwanski John Zdenek