Rare-Variant Extensions of the Transmission Disequilibrium Test: Application to Autism Exome Sequence Data  Zongxiao He, Brian J. O’Roak, Joshua D. Smith,

Slides:



Advertisements
Similar presentations
Association Tests for Rare Variants Using Sequence Data
Advertisements

Previous Estimates of Mitochondrial DNA Mutation Level Variance Did Not Account for Sampling Error: Comparing the mtDNA Genetic Bottleneck in Mice and.
Alternative Splicing QTLs in European and African Populations Halit Ongen, Emmanouil T. Dermitzakis The American Journal of Human Genetics Volume 97, Issue.
PRIMUS: Rapid Reconstruction of Pedigrees from Genome-wide Estimates of Identity by Descent Jeffrey Staples, Dandi Qiao, Michael H. Cho, Edwin K. Silverman,
The Trimmed-Haplotype Test for Linkage Disequilibrium
Genetic Landscape of Eurasia and “Admixture” in Uyghurs
Tracing the Route of Modern Humans out of Africa by Using 225 Human Genome Sequences from Ethiopians and Egyptians  Luca Pagani, Stephan Schiffels, Deepti.
SEQSpark: A Complete Analysis Tool for Large-Scale Rare Variant Association Studies Using Whole-Genome and Exome Sequence Data  Di Zhang, Linhai Zhao,
2016 Curt Stern Award Address: From Rare to Common Diseases: Translating Genetic Discovery to Therapy1  Brendan Lee  The American Journal of Human Genetics 
Comparing Algorithms for Genotype Imputation
Yu Jiang, Glen A. Satten, Yujun Han, Michael P. Epstein, Erin L
Ren-Hua Chung, Richard W. Morris, Li Zhang, Yi-Ju Li, Eden R. Martin 
Haplotype Estimation Using Sequencing Reads
Miao-Xin Li, Hong-Sheng Gui, Johnny S.H. Kwan, Pak C. Sham 
Tuuli Lappalainen, Stephen B. Montgomery, Alexandra C
Accuracy of Haplotype Frequency Estimation for Biallelic Loci, via the Expectation- Maximization Algorithm for Unphased Diploid Genotype Data  Daniele.
Improved Heritability Estimation from Genome-wide SNPs
Brian K. Maples, Simon Gravel, Eimear E. Kenny, Carlos D. Bustamante 
Rounak Dey, Ellen M. Schmidt, Goncalo R. Abecasis, Seunggeun Lee 
Weight Loss after Gastric Bypass Is Associated with a Variant at 15q26
Jingjing Li, Xiumei Hong, Sam Mesiano, Louis J
Variant Association Tools for Quality Control and Analysis of Large-Scale Sequence and Genotyping Array Data  Gao T. Wang, Bo Peng, Suzanne M. Leal  The.
Guidelines for Large-Scale Sequence-Based Complex Trait Association Studies: Lessons Learned from the NHLBI Exome Sequencing Project  Paul L. Auer, Alex.
Volume 173, Issue 1, Pages e9 (March 2018)
Maximizing the Power of Principal-Component Analysis of Correlated Phenotypes in Genome-wide Association Studies  Hugues Aschard, Bjarni J. Vilhjálmsson,
An Excess of Risk-Increasing Low-Frequency Variants Can Be a Signal of Polygenic Inheritance in Complex Diseases  Yingleong Chan, Elaine T. Lim, Niina.
Sanger Confirmation Is Required to Achieve Optimal Sensitivity and Specificity in Next- Generation Sequencing Panel Testing  Wenbo Mu, Hsiao-Mei Lu, Jefferey.
Random-Effects Model Aimed at Discovering Associations in Meta-Analysis of Genome- wide Association Studies  Buhm Han, Eleazar Eskin  The American Journal.
Robust Inference of Identity by Descent from Exome-Sequencing Data
The Rare-Variant Generalized Disequilibrium Test for Association Analysis of Nuclear and Extended Pedigrees with Application to Alzheimer Disease WGS.
Family-Based Association Studies for Next-Generation Sequencing
Alkes L. Price, Gregory V. Kryukov, Paul I. W. de Bakker, Shaun M
Characteristics of Neutral and Deleterious Protein-Coding Variation among Individuals and Populations  Wenqing Fu, Rachel M. Gittelman, Michael J. Bamshad,
Haplotypes at ATM Identify Coding-Sequence Variation and Indicate a Region of Extensive Linkage Disequilibrium  Penelope E. Bonnen, Michael D. Story,
Genotype Imputation with Millions of Reference Samples
Christoph Lange, Nan M. Laird  The American Journal of Human Genetics 
Hugues Aschard, Bjarni J. Vilhjálmsson, Amit D. Joshi, Alkes L
Rare-Variant Association Testing for Sequencing Data with the Sequence Kernel Association Test  Michael C. Wu, Seunggeun Lee, Tianxi Cai, Yun Li, Michael.
Johanna Jakobsdottir, Mary Sara McPeek 
Dan-Yu Lin, Zheng-Zheng Tang  The American Journal of Human Genetics 
Family-Based Tests of Association in the Presence of Linkage
Shuhua Xu, Wei Huang, Ji Qian, Li Jin 
Erratum The American Journal of Human Genetics
Estimating Genetic Effects and Quantifying Missing Heritability Explained by Identified Rare-Variant Associations  Dajiang J. Liu, Suzanne M. Leal  The.
An Efficient Multiple-Testing Adjustment for eQTL Studies that Accounts for Linkage Disequilibrium between Variants  Joe R. Davis, Laure Fresard, David A.
A Unified Approach to Genotype Imputation and Haplotype-Phase Inference for Large Data Sets of Trios and Unrelated Individuals  Brian L. Browning, Sharon.
A Common Genetic Variant in the Neurexin Superfamily Member CNTNAP2 Increases Familial Risk of Autism  Dan E. Arking, David J. Cutler, Camille W. Brune,
James A. Lautenberger, J. Claiborne Stephens, Stephen J
Exploring Population Admixture Dynamics via Empirical and Simulated Genome-wide Distribution of Ancestral Chromosomal Segments  Wenfei Jin, Sijia Wang,
Daniel Greene, Sylvia Richardson, Ernest Turro 
Suzanne M. Leal, Jurg Ott  The American Journal of Human Genetics 
Population Structure in Admixed Populations: Effect of Admixture Dynamics on the Pattern of Linkage Disequilibrium  C.L. Pfaff, E.J. Parra, C. Bonilla,
Haplotype Diversity across 100 Candidate Genes for Inflammation, Lipid Metabolism, and Blood Pressure Regulation in Two Populations  Dana C. Crawford,
Wei Pan, Il-Youp Kwak, Peng Wei  The American Journal of Human Genetics 
Jared R. Kohler, David J. Cutler 
Selecting a Maximally Informative Set of Single-Nucleotide Polymorphisms for Association Analyses Using Linkage Disequilibrium  Christopher S. Carlson,
L-GATOR: Genetic Association Testing for a Longitudinally Measured Quantitative Trait in Samples with Related Individuals  Xiaowei Wu, Mary Sara McPeek 
Unified Sequence-Based Association Tests Allowing for Multiple Functional Annotations and Meta-analysis of Noncoding Variation in Metabochip Data  Zihuai.
Parental Genotypes in the Risk of a Complex Disease
Are Variants in the CAPN10 Gene Related to Risk of Type 2 Diabetes
Test for Interaction between Two Unlinked Loci
Jung-Ying Tzeng, Chih-Hao Wang, Jau-Tsuen Kao, Chuhsing Kate Hsiao 
Tao Wang, Robert C. Elston  The American Journal of Human Genetics 
Iuliana Ionita-Laza, Seunggeun Lee, Vlad Makarov, Joseph D
Detection and Integration of Genotyping Errors in Statistical Genetics
Regie Lyn P. Santos-Cortez, Rabia Faridi, Atteeq U
Alice S. Whittemore, Jerry Halpern 
Michael P. Epstein, Richard Duncan, Erin B. Ware, Min A
Kung-Yee Liang, Fang-Chi Hsu, Terri H. Beaty, Kathleen C. Barnes 
Gonçalo R. Abecasis, Janis E. Wigginton 
Presentation transcript:

Rare-Variant Extensions of the Transmission Disequilibrium Test: Application to Autism Exome Sequence Data  Zongxiao He, Brian J. O’Roak, Joshua D. Smith, Gao Wang, Stanley Hooker, Regie Lyn P. Santos-Cortez, Biao Li, Mengyuan Kan, Nik Krumm, Deborah A. Nickerson, Jay Shendure, Evan E. Eichler, Suzanne M. Leal  The American Journal of Human Genetics  Volume 94, Issue 1, Pages 33-46 (January 2014) DOI: 10.1016/j.ajhg.2013.11.021 Copyright © 2014 The American Society of Human Genetics Terms and Conditions

Figure 1 Two-by-Two Table for the McNemar's Test Displays the manner in which transmission and nontransmission of the parental minor alleles are counted for the transmission disequilibrium test. The American Journal of Human Genetics 2014 94, 33-46DOI: (10.1016/j.ajhg.2013.11.021) Copyright © 2014 The American Society of Human Genetics Terms and Conditions

Figure 2 QQ Plot of Negative Natural Log p Values Obtained for Trio Data under the Null Hypothesis of No Association when the Variant Sites that Are Tested Are in Perfect LD For each scenario, a total of 1,500 trios were analyzed and 20,000 replicates were generated. For the TDT-CMC and TDT-BRV, variants with MAF ≤ 1% were analyzed while for the TDT-VT-BRV, TDT-VT-CMC, and TDT-WSS, variants with MAF ≤ 5% were analyzed. (A) Displays the results for the TDT-BRV and TDT-CMC when p values were obtained analytically (Anal). (B) Displays the results for the TDT-BRV, TDT-CMC, TDT-VT-BRV, TDT-VT-CMC, and TDT-WSS. All p values were obtained empirically by performing 10,000 genotype (Geno) permutations for each replicate. (C) Displays the results for the TDT-BRV, TDT-CMC, TDT-VT-BRV, TDT-VT-CMC, and TDT-WSS. All p values were obtained empirically by performing 10,000 haplotype (Haplo) permutations for each replicate. The American Journal of Human Genetics 2014 94, 33-46DOI: (10.1016/j.ajhg.2013.11.021) Copyright © 2014 The American Society of Human Genetics Terms and Conditions

Figure 3 QQ plot of p Values Obtained from the Analysis of African and European Admixed Populations Genetic variant data for African and European populations were generated under the Boyko model. A total of 1,500 trios were analyzed using 20,000 replicates. Type I error rates were evaluated for the TDT-BRV, TDT-CMC, TDT-VT-BRV, TDT-VT-CMC, and TDT-WSS. For the TDT-CMC and TDT-BRV, variants with a MAF ≤ 1% were analyzed while for the TDT-VT-BRV, TDT-VT-CMC, and TDT-WSS, variants with MAF ≤ 5% were analyzed. All p values were obtained empirically by performing 10,000 haplotype permutations for each replicate, except for the TDT-CMC analytical. The data were generated with different proportions of African and European admixture: in (A) 75% African and 25% European, (B) 50% African and 50% European, and (C) 25% African and 75% European. The American Journal of Human Genetics 2014 94, 33-46DOI: (10.1016/j.ajhg.2013.11.021) Copyright © 2014 The American Society of Human Genetics Terms and Conditions

Figure 4 Comparison of Power for the RV-TDT Methods and FB-SKAT Power was evaluated for an α level of 0.05 for 1,500 trios by generating 2,000 replicates. Analysis was performed with TDT-BRV, TDT-CMC, TDT-VT-BRV, TDT-VT-CMC, TDT-WSS, and FB-SKAT. For the TDT-CMC, TDT-BRV, and FB-SKAT, variants with a MAF ≤ 1% were analyzed while for the TDT-VT-BRV, TDT-VT-CMC, and TDT-WSS, variants with MAF ≤ 5% were analyzed. For the TDT-BRV, TDT-CMC, TDT-VT-BRV, TDT-VT-CMC, and TDT-WSS, p values were obtained empirically by performing 2,000 haplotype permutations for each replicate. For the TDT-CMC, p values were also obtained analytically. For the FB-SKAT, p values were obtained with a moment matching approach by using 10,000 Monte Carlo simulations. Genetic variant data were generated under the Kryukov model and the proband’s affection status was obtained with two different penetrance models: variable-effects model (A, B, C) and equal-effect model (D, E, F). Different proportions of the variant sites were deemed to be causal: (A and D) 50%, (B and E) 75%, and (C and F) 100%. The American Journal of Human Genetics 2014 94, 33-46DOI: (10.1016/j.ajhg.2013.11.021) Copyright © 2014 The American Society of Human Genetics Terms and Conditions

Figure 5 Comparison of Power to Detect Rare-Variant Associations with Population-Based and Trio Data The BRV was used to analyze samples of size 1,000 cases and 1,000 controls and 1,500 cases and 1,500 controls, and the TDT-BRV was used to analyze 1,000 trios. Power was evaluated for an α level of 0.05 for both case-control and trio data by generating 2,000 replicates. P values were obtained empirically by performing 2,000 haplotype permutations for each replicate. Genetic variant data were generated with Kryukov model. Affection status was determined with an equal-effect penetrance model with ORs varying between 1.8 and 2.5. Different proportions of causal variants were used in the analysis with 50% (A), 75% (B), and 100% (C). The American Journal of Human Genetics 2014 94, 33-46DOI: (10.1016/j.ajhg.2013.11.021) Copyright © 2014 The American Society of Human Genetics Terms and Conditions