Widespread Allelic Heterogeneity in Complex Traits


Farhad Hormozdiari, Anthony Zhu, Gleb Kichaev, Chelsea J.-T. Ju, Ayellet V. Segrè, Jong Wha J. Joo, Hyejung Won, Sriram Sankararaman, Bogdan Pasaniuc, Sagiv Shifman, Eleazar Eskin
The American Journal of Human Genetics, Volume 100, Issue 5, Pages 789-802 (May 2017). DOI: 10.1016/j.ajhg.2017.04.005. Copyright © 2017

Figure 1. Overview of CAVIAR for Detecting Allelic Heterogeneity Regions. (A and B) The marginal statistics for a locus where we have implanted one causal variant. In (A), SNP33 is causal, and in (B), SNP23 is causal. (C) The same locus where both SNP23 and SNP33 are causal. In these figures, the y axis is the negative logarithm of the p value of each variant, which indicates the strength of the marginal statistics. The gray triangle below each figure indicates the LD pattern. Each square indicates the correlation between two variants, and the magnitude of the correlation is shown by the color intensity of the square: the darker the square, the higher the correlation between the two variants.
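The two quantities shown in Figure 1 can be sketched in a few lines. This is a minimal illustration, not the CAVIAR implementation: it assumes the marginal statistics are z-scores that follow a standard normal distribution under the null, and that LD is measured as the Pearson correlation between genotype vectors coded as allele counts. The function names `neg_log10_p`, `pearson`, and `ld_matrix` are hypothetical.

```python
import math


def neg_log10_p(z):
    """-log10 of the two-sided p value of a z-score (assumes N(0, 1) under the null)."""
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return -math.log10(p)


def pearson(x, y):
    """Pearson correlation between two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def ld_matrix(genotypes):
    """Pairwise correlation matrix; genotypes[i] holds the allele counts (0/1/2) of variant i."""
    m = len(genotypes)
    return [[pearson(genotypes[i], genotypes[j]) for j in range(m)] for i in range(m)]


# Toy data (made up): three variants genotyped in four individuals.
toy_genotypes = [[0, 1, 2, 1], [0, 1, 2, 1], [2, 1, 0, 1]]
toy_ld = ld_matrix(toy_genotypes)
```

In a plot like Figure 1, the absolute value of each entry of `toy_ld` would set the color intensity of the corresponding square.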

Figure 2. ROC Curve for CAVIAR and CM. We implant one causal variant to compute the false positive (FP) rate: an FP is a locus that harbors one causal variant but is detected as AH. We implant two causal variants to compute the true positive (TP) rate: a TP is a locus that harbors AH and is detected correctly. We vary the effect size such that the power at the causal variant is 20%, 40%, 60%, and 80% at the genome-wide significance level of 10^-8. We obtain these results from simulated data with no epistatic interaction. We simulated data using 1,000 individuals and set γ to 0.001.
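The FP and TP rates defined in this caption can be computed as below. This is a sketch under stated assumptions: each simulated locus is assumed to receive a single score (for example, a probability of harboring AH), and a locus is called AH when its score exceeds a threshold. The function name `roc_points` and the example scores are hypothetical and are not taken from the paper.

```python
def roc_points(scores_one_causal, scores_two_causal, thresholds):
    """Return (threshold, FP rate, TP rate) triples.

    scores_one_causal: scores of loci simulated with one causal variant
                       (any AH call here is a false positive).
    scores_two_causal: scores of loci simulated with two causal variants
                       (an AH call here is a true positive).
    """
    points = []
    for t in thresholds:
        fp_rate = sum(s > t for s in scores_one_causal) / len(scores_one_causal)
        tp_rate = sum(s > t for s in scores_two_causal) / len(scores_two_causal)
        points.append((t, fp_rate, tp_rate))
    return points


# Made-up scores for four single-causal and four two-causal loci.
example = roc_points([0.1, 0.2, 0.9, 0.3], [0.8, 0.95, 0.4, 0.7], [0.5])
```

Sweeping the threshold from 1 down to 0 traces one ROC curve; repeating this for each power setting would give one curve per setting.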

Figure 3. CAVIAR Has Low FP Even When the True Causal Variant Is Not Collected. Thus, most loci that CAVIAR detects as harboring AH are most probably true positives. The x axis indicates the prior probability of a causal variant (γ). We set γ to 0.01, 0.005, 0.001, 0.0005, 0.0001, 0.00005, 0.000001, and 0.000005.

Figure 4. CAVIAR Is More Accurate than CM to Detect the Number of Causal Variants. The x axis is the power at the causal variants, and the y axis is the accuracy of detecting the number of causal variants in a locus. We implanted one, two, and three causal variants. We compute the recall rate as the fraction of simulations in which the number of causal variants in a locus is predicted correctly. Recall rate of each method for different numbers of causal variants: (A) one causal variant, (B) two causal variants, and (C) three causal variants. We vary the statistical power to detect the causal variant among 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, and 0.8.
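The recall rate described in this caption reduces to a simple fraction. The sketch below assumes each simulation yields one predicted count of causal variants; the function name `recall_rate` and the example predictions are hypothetical.

```python
def recall_rate(predicted_counts, true_count):
    """Fraction of simulations whose predicted number of causal variants equals the implanted number."""
    correct = sum(p == true_count for p in predicted_counts)
    return correct / len(predicted_counts)


# Made-up predictions from five simulations with two implanted causal variants.
rate_two_causal = recall_rate([2, 2, 1, 3, 2], true_count=2)
```

Computing this separately for simulations with one, two, and three implanted variants, at each power setting, would give the values plotted in the three panels.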

Figure 5. CAVIAR Distinguishes between Epistatic Interaction and Allelic Heterogeneity. The x axis is the sample size, which we vary among 500, 1,000, 1,500, 2,000, 2,500, and 3,000 individuals. The y axis is the false positive (FP) rate. We simulated datasets with epistatic interaction and counted as FPs the cases where CAVIAR incorrectly detects these loci as harboring AH. Shown is the FP rate for different effect sizes of the epistatic interaction.

Figure 6. Levels of Allelic Heterogeneity in eQTL Studies. (A) Linear relationship between the amount of AH and sample size. Each red circle indicates a different tissue type from the GTEx dataset. The size of each red circle is proportional to the number of genes that harbor a significant eQTL (eGenes). (B–D) Significant overlap between AH estimations for different eQTL datasets, shown for (B) blood (p = 7.9 × 10^-97), (C) skin (p = 4.9 × 10^-63), and (D) adipose (p = 1.1 × 10^-69) tissue. p values are computed using a hypergeometric test that is implemented in the SuperExactTest software (ref. 43).
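For two gene sets, a hypergeometric overlap test of the kind mentioned in this caption can be sketched as follows. This is not the SuperExactTest implementation; it is a plain upper-tail hypergeometric probability written with the Python standard library, and the function name `hypergeom_overlap_pvalue` and its parameters are hypothetical.

```python
from fractions import Fraction
from math import comb


def hypergeom_overlap_pvalue(population, size_a, size_b, overlap):
    """P(X >= overlap), where X is the overlap of two random gene sets.

    population: number of genes tested in both datasets (assumed common background)
    size_a, size_b: number of AH genes reported by each dataset
    overlap: number of AH genes shared by the two datasets
    """
    total = comb(population, size_b)
    upper = min(size_a, size_b)
    tail = sum(
        comb(size_a, k) * comb(population - size_a, size_b - k)
        for k in range(overlap, upper + 1)
    )
    # Exact rational arithmetic keeps very small p values from rounding to zero early.
    return float(Fraction(tail, total))


# Toy example: 10 genes in total, two sets of 5 genes that overlap completely.
p_full_overlap = hypergeom_overlap_pvalue(10, 5, 5, 5)
```

Using exact integers and `Fraction` is a design choice for this sketch: p values as small as those in panels B to D are far below what a naive floating-point sum of ratios would resolve reliably.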

Figure 7. Allelic Heterogeneity in the TCF4 Locus Associated with Schizophrenia. (A) The Manhattan plot, obtained from Ricopili, shows all 7,193 variants in a 1 Mbp window centered on the most significant SNP in the locus (rs9636107). We use the PGC-SCZ52-may13 version of the data. This plot shows multiple significant variants that are not in tight LD with the peak variant. (B) LD plot of the 50 most significant SNPs, showing several distinct LD blocks. (C) Histogram of the probability of each number of causal variants.