An integrative genomics approach to infer causal associations between gene expression and disease Schadt, E. E., Lamb, J., Yang, X., Zhu, J., Edwards,


An integrative genomics approach to infer causal associations between gene expression and disease
Schadt, E. E., Lamb, J., Yang, X., Zhu, J., Edwards, S., Guhathakurta, D., Sieberts, S. K., Monks, S., Reitman, M., Zhang, C., Lum, P. Y., Leonardson, A., Thieringer, R., Metzger, J. M., Yang, L., Castle, J., Zhu, H., Kash, S. F., Drake, T. A., Sachs, A., and Lusis, A. J. Nature Genetics 37 (2005)
Speaker: Yen-Yi Ho
Advisor: Giovanni Parmigiani
Department of Biostatistics, Johns Hopkins University

Outline
Introduction
– Background & Definitions
– Scientific Questions
Previous eQTL Studies
– Gene Expression Data in Humans
– Statistical Analytic Approaches
– Results
Schadt et al. 2005: An Integrative Approach
– Causality Models
– Application: Gene Expression in BXD Mice
– Results from Application
Discussion of New Approach

Introduction: QTL (Quantitative Trait Locus)
Notation: genetic locus (QTL; L), disease (D). Diagram: L → D
Genes controlling more than 1,000 monogenic Mendelian diseases have been identified using traditional gene-mapping approaches.
For complex human traits (such as cancer, diabetes, and asthma), multiple genes, environmental factors, and their interactions have limited the success of these approaches.

We have more information …
DNA: genotype data (SNPs)
mRNA: gene expression data
Expression QTL (eQTL)
Goal: identify genomic locations where genotype significantly affects gene expression.

Cis-, trans-, and master trans-eQTLs
[Figure: schematic of a cis-eQTL, a trans-eQTL, and a master trans-eQTL]
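The slide only names the three eQTL types. By the usual convention, a cis-eQTL lies close to the gene whose expression it affects, a trans-eQTL lies elsewhere in the genome, and a master trans-eQTL is a single locus that affects many genes in trans. A minimal sketch of that classification follows; the distance window, the minimum target count, the function names, and the toy records are all illustrative assumptions, not values from the talk.

```python
from collections import Counter

CIS_WINDOW_BP = 5_000_000   # hypothetical cutoff for "close to the gene"
MASTER_MIN_TARGETS = 3      # hypothetical cutoff for "affects many genes"

def classify_eqtl(gene_chrom, gene_pos, locus_chrom, locus_pos, window=CIS_WINDOW_BP):
    """Label an eQTL as cis if the locus is on the gene's chromosome and within the window."""
    if gene_chrom == locus_chrom and abs(gene_pos - locus_pos) <= window:
        return "cis"
    return "trans"

def master_trans_loci(eqtls, min_targets=MASTER_MIN_TARGETS):
    """Return loci that act in trans on at least min_targets records.

    eqtls: iterable of (gene, gene_chrom, gene_pos, locus_chrom, locus_pos).
    """
    counts = Counter(
        (lc, lp)
        for _, gc, gp, lc, lp in eqtls
        if classify_eqtl(gc, gp, lc, lp) == "trans"
    )
    return sorted(locus for locus, n in counts.items() if n >= min_targets)

# Toy records (made up for illustration): one cis pair, one locus hitting three genes.
toy = [
    ("geneA", "1", 1_000_000, "1", 1_200_000),
    ("geneB", "2", 5_000_000, "14", 90_000_000),
    ("geneC", "3", 7_000_000, "14", 90_000_000),
    ("geneD", "4", 9_000_000, "14", 90_000_000),
    ("geneE", "5", 2_000_000, "20", 50_000_000),
]
labels = [classify_eqtl(gc, gp, lc, lp) for _, gc, gp, lc, lp in toy]
masters = master_trans_loci(toy)
```

In practice the window would depend on marker density and on whether positions are physical (bp) or genetic (cM).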

Constructing regulatory networks (hypothetical example)
[Figure: genetic locus plotted against expression]
1. 1 (B) = cis
2. 2 (A) = cis, controlled by 1 (B)
3. No controls
4. 4 (D) = cis, controlled by 3 (F)
5. Not a cis, controlled by …
6. Not a cis, controlled by all
Jansen, R.C. & Nap, J.P. (2001) Trends Genet, 17,

Scientific Questions
What are the variation and heritability of gene expression?
Are there associations between genetic loci and target gene expression?
What is the proportion of cis- versus trans-eQTLs?
How do we verify cis-eQTLs?
Are there any master trans-eQTLs?
Which annotations and functional categories (KEGG, GO, …) do cis-, trans-, and master trans-eQTLs fall into?

Scientific questions and goals
Transcript abundance may act as an intermediate phenotype between genetic loci and the clinical phenotype.
Secondary goal: combine genotype, expression, and clinical trait information to construct regulatory networks and improve understanding of disease etiology.

Data

The data
All of these studies measured expression in lymphoblastoid cell lines from CEPH families.
Differences:
1. Selection of expression traits
2. Platforms used to measure expression, and preprocessing
3. SNP marker density
4. Statistical approaches

Statistical methods for human eQTL mapping studies
Linkage
– Nonparametric linkage analysis
  1. Sib-pair analysis for quantitative traits (ASP)
  2. Variance component analysis (VC)
Association (linkage disequilibrium)
– Family-based association analysis (QTDT)
– Population-based association analysis (GWA)
Association approaches generally offer finer resolution than linkage.

Comparison of resolution between linkage and association analysis Literature Review

Literature review: genes with between-/within-individual variation > 1

Heritability


Literature review: eQTL findings from previous studies
Hit rate: the proportion of expression traits significantly linked to eQTLs (ranging from 0.8% to 4%).
About 30% of eQTLs are cis-eQTLs.
Two master trans-eQTLs were identified.

Literature review: master trans-eQTLs at 14q32 and 20q13

An Integrative Approach: Schadt et al., Nature Genetics, 2005

An integrative approach: models for causality
– Causal model: L → mRNA → Disease
– Reactive model: L → Disease → mRNA
– Independent model: L → mRNA and L → Disease

Notation for the three models: L: genotype; R: mRNA level; D: disease.

Causal Model (M1): L → mRNA → Disease
– Joint probability
– Likelihood

Reactive Model (M2): L → Disease → mRNA
– Joint probability
– Likelihood

Independent Model (M3): L → mRNA and L → Disease
– Joint probability
– Likelihood
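The joint probabilities and likelihoods named on these three slides are not reproduced in the transcript. As a sketch only, assuming each graph encodes exactly the conditional independences it draws, the factorizations in the slide notation (L, R, D) would be as follows. The published method may parameterize the independent model differently, for example by allowing residual dependence between R and D.

\[
\begin{aligned}
M_1\ (\text{causal}):\quad & P(L,R,D) = P(L)\,P(R \mid L)\,P(D \mid R) \\
M_2\ (\text{reactive}):\quad & P(L,R,D) = P(L)\,P(D \mid L)\,P(R \mid D) \\
M_3\ (\text{independent}):\quad & P(L,R,D) = P(L)\,P(R \mid L)\,P(D \mid L)
\end{aligned}
\]

For n individuals treated as independent, the likelihood of a model is the product of its joint probability over individuals, for example for the causal model with parameters \(\theta_1\) (a symbol introduced here for illustration):

\[
\mathcal{L}(\theta_1) = \prod_{i=1}^{n} P(L_i)\,P(R_i \mid L_i;\theta_1)\,P(D_i \mid R_i;\theta_1)
\]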

Model Selection
Likelihood-based Causality Model Selection (LCMS)
– Calculate the likelihood of each model from the data.
– The model best supported by the data is the one with the smallest AIC (Akaike Information Criterion).
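To make the selection step concrete, here is a minimal sketch of AIC-based comparison of the three models, with AIC = 2k − 2 ln(maximized likelihood), where k is the number of fitted parameters. It rests on assumptions that are not in the slides: every conditional distribution is a linear Gaussian regression, genotype is coded additively as 0/1/2, the data are simulated, and all function names are hypothetical. It picks the smallest AIC and does not test whether the difference is significant.

```python
import numpy as np

def gaussian_loglik(y, X):
    """Maximized log-likelihood of y ~ Normal(X @ beta, sigma^2), plus parameter count."""
    n = len(y)
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    sigma2 = resid @ resid / n  # maximum-likelihood variance estimate
    loglik = -0.5 * n * (np.log(2 * np.pi * sigma2) + 1)
    return loglik, X.shape[1] + 1  # regression coefficients plus the variance

def design(*cols):
    """Design matrix: an intercept column followed by the given predictors."""
    return np.column_stack([np.ones(len(cols[0])), *cols])

def lcms_aic(L, R, D):
    """AIC of each model; P(L) is shared by all three, so it is left out."""
    models = {
        "M1 causal": [(R, design(L)), (D, design(R))],       # L -> R -> D
        "M2 reactive": [(D, design(L)), (R, design(D))],     # L -> D -> R
        "M3 independent": [(R, design(L)), (D, design(L))],  # R <- L -> D
    }
    aic = {}
    for name, parts in models.items():
        loglik, k = 0.0, 0
        for y, X in parts:
            ll, p = gaussian_loglik(y, X)
            loglik += ll
            k += p
        aic[name] = 2 * k - 2 * loglik
    return aic

def simulate_causal(n=500, seed=0):
    """Simulated data in which the causal chain L -> R -> D is true."""
    rng = np.random.default_rng(seed)
    L = rng.integers(0, 3, size=n).astype(float)  # additive genotype coding (assumption)
    R = 1.0 * L + rng.normal(size=n)
    D = 1.0 * R + rng.normal(size=n)
    return L, R, D

L, R, D = simulate_causal()
aic = lcms_aic(L, R, D)
best = min(aic, key=aic.get)  # model with the smallest AIC
```

Under this parameterization all three models have the same number of parameters, so the AIC ranking coincides with the likelihood ranking; with a categorical genotype or richer conditional models the penalty term would differ between models.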

Simulation study
For each simulated data set, the model whose AIC was significantly smaller than the AICs of the competing models was noted.

Application to BXD mice data (new approach)
The data
BXD mice: F2 offspring of a cross between C57BL/6J (B6) and DBA/2J (DBA).
– C57BL/6J: the ob mutation in the C57BL/6J mouse background (B6-ob/ob) causes obesity, but only mild and transient diabetes (Coleman and Hummel, 1973).
– DBA/2J: mice show a low susceptibility to developing atherosclerotic aortic lesions.
Gene expression
– Liver extracted at 16 months of age
– Expression of 23,574 genes measured using Agilent arrays
Genetic loci
– 139 autosomal genetic loci (microsatellite markers, 13 cM)
Disease
– Omental fat pad mass (OFPM) trait

Filtering
1. Identify 4 candidate regions for the OFPM trait: chr1 at 95 cM, chr6 at 43 cM, chr9 at 8 cM, chr19 at 28 cM.
2. Select expression traits significantly correlated with OFPM: 440 intermediate expression traits were selected (P < 0.001).
3. Keep expression traits with significant eQTL linkage at the candidate regions: 113 expression traits and 267 eQTLs were identified.
4. Perform LCMS model selection for the 113 expression traits and rank them by the percentage of genetic variation in OFPM causally explained by each trait.
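The two data-driven filters (correlation with OFPM, then eQTL linkage at a candidate region) can be sketched as below. This is an illustration under stated assumptions rather than the authors' pipeline: the correlation p-value uses the Fisher z normal approximation, the 10 cM tolerance around each candidate region is a hypothetical choice, the trait names and toy data are made up, and eQTL positions are assumed to be already available from a separate linkage scan. Only the four candidate regions and the P < 0.001 threshold come from the slide.

```python
import math
import numpy as np

# Candidate OFPM regions from the slide, as (chromosome, position in cM).
REGIONS = [(1, 95.0), (6, 43.0), (9, 8.0), (19, 28.0)]

def correlation_pvalue(x, y):
    """Two-sided p-value for a Pearson correlation via the Fisher z approximation."""
    r = float(np.corrcoef(x, y)[0, 1])
    z = math.atanh(r) * math.sqrt(len(x) - 3)
    return math.erfc(abs(z) / math.sqrt(2))

def near_candidate(eqtl, regions, tol_cm=10.0):
    """True if the eQTL lies within tol_cm (hypothetical tolerance) of a candidate region."""
    chrom, pos = eqtl
    return any(chrom == c and abs(pos - p) <= tol_cm for c, p in regions)

def filter_traits(expr, ofpm, eqtls, regions, alpha=0.001):
    """Keep traits correlated with OFPM (p < alpha) that have an eQTL at a candidate region.

    expr: dict trait -> expression values; eqtls: dict trait -> list of (chrom, cM).
    """
    kept = []
    for trait, values in expr.items():
        if correlation_pvalue(values, ofpm) >= alpha:
            continue
        if any(near_candidate(q, regions) for q in eqtls.get(trait, [])):
            kept.append(trait)
    return kept

# Deterministic toy data (made up): a ramp for OFPM and an alternating +/-1 pattern.
ofpm = np.linspace(0.0, 1.0, 50)
alt = np.where(np.arange(50) % 2 == 0, 1.0, -1.0)
expr = {
    "corr_linked": 2.0 * ofpm + 0.1 * alt,    # correlated, eQTL at a candidate region
    "corr_unlinked": 3.0 * ofpm - 0.1 * alt,  # correlated, eQTL elsewhere
    "noise_linked": alt,                      # uncorrelated, eQTL at a candidate region
}
eqtls = {
    "corr_linked": [(6, 45.0)],
    "corr_unlinked": [(3, 20.0)],
    "noise_linked": [(1, 95.0)],
}
kept = filter_traits(expr, ofpm, eqtls, REGIONS)
```

The traits that survive both filters would then be passed to the model-selection step.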

Results from Application
Zfp90: zinc finger protein 90
Hsd11b1: 11-beta hydroxysteroid dehydrogenase isoform 1
C3ar1: complement component 3a receptor 1
Tgfbr2: transforming growth factor, beta receptor II

C3ar1 -/- knockout mice (n=5-7)
Tgfbr2 +/- knockout mice (n=5-7)
10 weeks of age

Discussion
– The approach fails to discriminate between highly correlated traits.
– Multiple filtering steps are involved.
– More development is needed before it can be applied automatically to general data sets.
– Measurement error of mRNA may exceed that of D.
– Advantage of constructing eQTL networks is less likely.

References
Morley, M.; Molony, C.M.; Weber, T.M.; Devlin, J.L.; Ewens, K.G.; Spielman, R.S. & Cheung, V.G. Genetic analysis of genome-wide variation in human gene expression. Nature, 2004, 430.
Monks, S.A.; Leonardson, A.; Zhu, H.; Cundiff, P.; Pietrusiak, P.; Edwards, S.; Phillips, J.W.; Sachs, A. & Schadt, E.E. Genetic inheritance of gene expression in human cell lines. Am J Hum Genet, 2004, 75.
Cheung, V.G.; Spielman, R.S.; Ewens, K.G.; Weber, T.M.; Morley, M. & Burdick, J.T. Mapping determinants of human gene expression by regional and genome-wide association. Nature, 2005, 437.
Stranger, B.E.; Forrest, M.S.; Clark, A.G.; Minichiello, M.J.; Deutsch, S.; Lyle, R.; Hunt, S.; Kahl, B.; Antonarakis, S.E.; Tavaré, S.; Deloukas, P. & Dermitzakis, E.T. Genome-wide associations of gene expression variation in humans. PLoS Genet, 2005, 1, e78.
Deutsch, S.; Lyle, R.; Dermitzakis, E.T.; Attar, H.; Subrahmanyan, L.; Gehrig, C.; Parand, L.; Gagnebin, M.; Rougemont, J.; Jongeneel, C.V. & Antonarakis, S.E. Gene expression variation and expression quantitative trait mapping of human chromosome 21 genes. Hum Mol Genet, 2005, 14.
Jansen, R.C. & Nap, J.P. Genetical genomics: the added value from segregation. Trends Genet, 2001, 17.
Schadt, E.E.; Lamb, J.; Yang, X.; Zhu, J.; Edwards, S.; Guhathakurta, D.; Sieberts, S.K.; Monks, S.; Reitman, M.; Zhang, C.; Lum, P.Y.; Leonardson, A.; Thieringer, R.; Metzger, J.M.; Yang, L.; Castle, J.; Zhu, H.; Kash, S.F.; Drake, T.A.; Sachs, A. & Lusis, A.J. An integrative genomics approach to infer causal associations between gene expression and disease. Nat Genet, 2005, 37.

Thank you ☺