Colocalization of GWAS and eQTL Signals Detects Target Genes

Slides:



Advertisements
Similar presentations
Michael Dannemann, Janet Kelso  The American Journal of Human Genetics 
Advertisements

Marc A. Coram, Huaying Fang, Sophie I. Candille, Themistocles L
Colocalization of GWAS and eQTL Signals Detects Target Genes
Genetic-Variation-Driven Gene-Expression Changes Highlight Genes with Important Functions for Kidney Disease  Yi-An Ko, Huiguang Yi, Chengxiang Qiu, Shizheng.
Claudio Verzilli, Tina Shah, Juan P
Common Variants of Large Effect in F12, KNG1, and HRG Are Associated with Activated Partial Thromboplastin Time  Lorna M. Houlihan, Gail Davies, Albert.
Disentangling the Effects of Colocalizing Genomic Annotations to Functionally Prioritize Non-coding Variants within Complex-Trait Loci  Gosia Trynka,
Was ADH1B under Selection in European Populations?
Integrating Gene Expression with Summary Association Statistics to Identify Genes Associated with 30 Complex Traits  Nicholas Mancuso, Huwenbo Shi, Pagé.
Daniel Greene, Sylvia Richardson, Ernest Turro 
High-Resolution Genetic Maps Identify Multiple Type 2 Diabetes Loci at Regulatory Hotspots in African Americans and Europeans  Winston Lau, Toby Andrew,
Comparing Algorithms for Genotype Imputation
Huwenbo Shi, Nicholas Mancuso, Sarah Spendlove, Bogdan Pasaniuc 
Miao-Xin Li, Hong-Sheng Gui, Johnny S.H. Kwan, Pak C. Sham 
Genome-wide Analysis of Body Proportion Classifies Height-Associated Variants by Mechanism of Action and Implicates Genes Important for Skeletal Development 
So Many Correlated Tests, So Little Time
Improved Heritability Estimation from Genome-wide SNPs
Brian K. Maples, Simon Gravel, Eimear E. Kenny, Carlos D. Bustamante 
Rounak Dey, Ellen M. Schmidt, Goncalo R. Abecasis, Seunggeun Lee 
Signatures of Purifying and Local Positive Selection in Human miRNAs
Weight Loss after Gastric Bypass Is Associated with a Variant at 15q26
Parisa Shooshtari, Hailiang Huang, Chris Cotsapas 
Arpita Ghosh, Fei Zou, Fred A. Wright 
Relationship between Deleterious Variation, Genomic Autozygosity, and Disease Risk: Insights from The 1000 Genomes Project  Trevor J. Pemberton, Zachary.
Michael Dannemann, Janet Kelso  The American Journal of Human Genetics 
HYST: A Hybrid Set-Based Test for Genome-wide Association Studies, with Application to Protein-Protein Interaction-Based Association Analysis  Miao-Xin.
Gene Expression in Skin and Lymphoblastoid Cells: Refined Statistical Method Reveals Extensive Overlap in cis-eQTL Signals  Jun Ding, Johann E. Gudjonsson,
Genomic Signatures of Selective Pressures and Introgression from Archaic Hominins at Human Innate Immunity Genes  Matthieu Deschamps, Guillaume Laval,
A Flexible Bayesian Framework for Modeling Haplotype Association with Disease, Allowing for Dominance Effects of the Underlying Causative Variants  Andrew.
Towfique Raj, Manik Kuchroo, Joseph M
A Selection Operator for Summary Association Statistics Reveals Allelic Heterogeneity of Complex Traits  Zheng Ning, Youngjo Lee, Peter K. Joshi, James.
Transethnic Genetic-Correlation Estimates from Summary Statistics
An Excess of Risk-Increasing Low-Frequency Variants Can Be a Signal of Polygenic Inheritance in Complex Diseases  Yingleong Chan, Elaine T. Lim, Niina.
Malika Kumar Freund, Kathryn S
Random-Effects Model Aimed at Discovering Associations in Meta-Analysis of Genome- wide Association Studies  Buhm Han, Eleazar Eskin  The American Journal.
Sherlock: Detecting Gene-Disease Associations by Matching Patterns of Expression QTL and GWAS  Xin He, Chris K. Fuller, Yi Song, Qingying Meng, Bin Zhang,
Studying Gene and Gene-Environment Effects of Uncommon and Common Variants on Continuous Traits: A Marker-Set Approach Using Gene-Trait Similarity Regression 
Simultaneous Genotype Calling and Haplotype Phasing Improves Genotype Accuracy and Reduces False-Positive Associations for Genome-wide Association Studies 
Haplotypes at ATM Identify Coding-Sequence Variation and Indicate a Region of Extensive Linkage Disequilibrium  Penelope E. Bonnen, Michael D. Story,
Genotype Imputation with Millions of Reference Samples
Jon Wakefield  The American Journal of Human Genetics 
Volume 25, Issue 15, Pages (August 2015)
Multipoint Approximations of Identity-by-Descent Probabilities for Accurate Linkage Analysis of Distantly Related Individuals  Cornelis A. Albers, Jim.
Pier Francesco Palamara, Laurent C. Francioli, Peter R
Diego Calderon, Anand Bhaskar, David A
Accurate Non-parametric Estimation of Recent Effective Population Size from Segments of Identity by Descent  Sharon R. Browning, Brian L. Browning  The.
Hugues Aschard, Bjarni J. Vilhjálmsson, Amit D. Joshi, Alkes L
Are Interactions between cis-Regulatory Variants Evidence for Biological Epistasis or Statistical Artifacts?  Alexandra E. Fish, John A. Capra, William.
An Expanded View of Complex Traits: From Polygenic to Omnigenic
Huwenbo Shi, Gleb Kichaev, Bogdan Pasaniuc 
A Common Genetic Variant in the Neurexin Superfamily Member CNTNAP2 Increases Familial Risk of Autism  Dan E. Arking, David J. Cutler, Camille W. Brune,
Imputing Phenotypes for Genome-wide Association Studies
GWAS-eQTL signal colocalisation methods
Xiaoquan Wen, Yeji Lee, Francesca Luca, Roger Pique-Regi 
Wei Pan, Il-Youp Kwak, Peng Wei  The American Journal of Human Genetics 
L-GATOR: Genetic Association Testing for a Longitudinally Measured Quantitative Trait in Samples with Related Individuals  Xiaowei Wu, Mary Sara McPeek 
Benjamin Tycko  The American Journal of Human Genetics 
Widespread Allelic Heterogeneity in Complex Traits
Xiang Wan, Can Yang, Qiang Yang, Hong Xue, Xiaodan Fan, Nelson L. S
Complex History of Admixture between Modern Humans and Neandertals
Pleiotropic Effects of Trait-Associated Genetic Variation on DNA Methylation: Utility for Refining GWAS Loci  Eilis Hannon, Mike Weedon, Nicholas Bray,
Towfique Raj, Joshua M. Shulman, Brendan T. Keenan, Lori B
Yu Zhang, Tianhua Niu, Jun S. Liu 
Common Variants of Large Effect in F12, KNG1, and HRG Are Associated with Activated Partial Thromboplastin Time  Lorna M. Houlihan, Gail Davies, Albert.
Evaluating the Effects of Imputation on the Power, Coverage, and Cost Efficiency of Genome-wide SNP Platforms  Carl A. Anderson, Fredrik H. Pettersson,
Genotype-Imputation Accuracy across Worldwide Human Populations
Leveraging Multi-ethnic Evidence for Mapping Complex Traits in Minority Populations: An Empirical Bayes Approach  Marc A. Coram, Sophie I. Candille, Qing.
Quanhe Yang, W. Dana Flanders, Ramal Moonesinghe, John P. A
Sarah A. Gagliano, Carolyn Ptak, Denise Y. F
Beyond GWASs: Illuminating the Dark Road from Association to Function
Presentation transcript:

Colocalization of GWAS and eQTL Signals Detects Target Genes Farhad Hormozdiari, Martijn van de Bunt, Ayellet V. Segrè, Xiao Li, Jong Wha J. Joo, Michael Bilow, Jae Hoon Sul, Sriram Sankararaman, Bogdan Pasaniuc, Eleazar Eskin  The American Journal of Human Genetics  Volume 99, Issue 6, Pages 1245-1260 (December 2016) DOI: 10.1016/j.ajhg.2016.10.003 Copyright © 2016 American Society of Human Genetics Terms and Conditions

Figure 1 Overview of Our Method for Detecting the Target Gene and Most Relevant Tissue We compute the CLPP for all genes and all tissues. (A) A simple case where we have only one tissue and want to find the target gene. We consider all genes for this GWAS risk locus and observe that gene 4 has the highest CLPP. Thus, the target gene is gene 4. (B) We have three tissues and utilize the quantity of CLPP. Thus, the target gene is gene 4 again. Moreover, in this example, liver and blood are considered the relevant tissues for this GWAS risk locus, whereas the pancreas is not relevant. The American Journal of Human Genetics 2016 99, 1245-1260DOI: (10.1016/j.ajhg.2016.10.003) Copyright © 2016 American Society of Human Genetics Terms and Conditions

Figure 2 Overview of eCAVIAR Broadly, eCAVIAR aligns the causal variants in an eQTL study and GWAS. The x axis is the variant (SNP) location, and the y axis is the significance score (−log of p value) for each variant. The gray triangle indicates the LD structure, and every diamond in this triangle indicates the Pearson’s correlation. The darker the diamond, the higher the correlation; and the lighter the diamond, the lower the correlation between the variants. (A) In the case where the causal variants are aligned, the colocalization posterior probability (CLPP) is high for the variant that is embedded in the dashed black rectangle. (B) However, in the case where the causal variants are not aligned (the causal variants are not the same variants), the quantity of CLPP is low for the variant that is embedded in the dashed black rectangle. (C) In this case, the LD is high, which implies that the uncertainty is high as a result of LD, and the CLPP value is low for the variant that is embedded in the dashed black rectangle. (D) A case where a locus has two independent causal variants. If we consider that we have only one causal variant in a locus, then the CLPP of the causal variants is estimated to be 0.25. However, if we allow more than one causal variant in the locus, eCAVIAR estimates the CLPP to be 1. The American Journal of Human Genetics 2016 99, 1245-1260DOI: (10.1016/j.ajhg.2016.10.003) Copyright © 2016 American Society of Human Genetics Terms and Conditions

Figure 3 eCAVIAR Is Robust to the Presence of AH We simulated marginal statistics directly from the LD structure for an eQTL study and GWAS. In both studies, we implanted one, two, or three causal variants on which the statistical power was 50% (A–C, respectively) or 80% (D–F, respectively). eCAVIAR had a low TP for a high cutoff and a low FP. This indicates that eCAVIAR has high confidence in detecting a colocalized locus in both the GWAS and eQTL study, even in the presence of AH. The American Journal of Human Genetics 2016 99, 1245-1260DOI: (10.1016/j.ajhg.2016.10.003) Copyright © 2016 American Society of Human Genetics Terms and Conditions

Figure 4 eCAVIAR Is More Accurate Than Existing Methods for Regions with One Causal Variant We compare the accuracy and precision of eCAVIAR with those of the two existing methods (RTC and COLOC). The x axis is the colocalization cutoff threshold. In these datasets, we implanted one causal variant, and we utilized simulated genotypes. We simulated the genotypes by using HAPGEN235 software. We used the European population from 1000 Genomes data33,34 as the starting point to simulate the genotypes. The accuracy and precision of all three methods are shown in (A) and (B), respectively. We computed the TP (true-positive rate), TN (true-negative rate), FN (false-negative rate), and FP (false-positive rate) for the set of simulated datasets for which we generated the marginal statistics in a linear model. Accuracy = (TP + TN)/(TP + FP + FN + TN), and precision = TP/(TP + FP). We set the non-colocalization cutoff threshold to 0.001. We observed that eCAVIAR and COLOC had higher accuracy and precision than RTC. The American Journal of Human Genetics 2016 99, 1245-1260DOI: (10.1016/j.ajhg.2016.10.003) Copyright © 2016 American Society of Human Genetics Terms and Conditions

Figure 5 eCAVIAR Is More Accurate Than Existing Methods in the Presence of AH To generate the datasets, we used a process similar to that shown in Figure 4. However, in this case, we implanted two causal variants. We simulated the genotypes by using HAPGEN235 software. We used the European population from 1000 Genomes data33,34 as the starting point to simulate the genotypes. We compared the accuracy, precision, and recall rate. In these results, eCAVIAR tended to have higher accuracy and precision than RTC and COLOC. However, RTC had a slightly higher recall rate. The American Journal of Human Genetics 2016 99, 1245-1260DOI: (10.1016/j.ajhg.2016.10.003) Copyright © 2016 American Society of Human Genetics Terms and Conditions