Zheng-Zheng Tang, Dan-Yu Lin  The American Journal of Human Genetics 

Slides:



Advertisements
Similar presentations
Replication Stress Induces Genome-wide Copy Number Changes in Human Cells that Resemble Polymorphic and Pathogenic Variants Martin F. Arlt, Jennifer G.
Advertisements

Length Distributions of Identity by Descent Reveal Fine-Scale Demographic History Pier Francesco Palamara, Todd Lencz, Ariel Darvasi, Itsik Pe’er The American.
TFIIH Subunit Alterations Causing Xeroderma Pigmentosum and Trichothiodystrophy Specifically Disturb Several Steps during Transcription Amita Singh, Emanuel.
Previous Estimates of Mitochondrial DNA Mutation Level Variance Did Not Account for Sampling Error: Comparing the mtDNA Genetic Bottleneck in Mice and.
Differential Relationship of DNA Replication Timing to Different Forms of Human Mutation and Variation Amnon Koren, Paz Polak, James Nemesh, Jacob J. Michaelson,
Functional Analysis of the Neurofibromatosis Type 2 Protein by Means of Disease- Causing Point Mutations Renee P. Stokowski, David R. Cox The American.
Gene Preference in Maple Syrup Urine Disease Mary M. Nellis, Dean J. Danner The American Journal of Human Genetics Volume 68, Issue 1, Pages (January.
Alternative Splicing QTLs in European and African Populations Halit Ongen, Emmanouil T. Dermitzakis The American Journal of Human Genetics Volume 97, Issue.
Genome Scan Meta-Analysis of Schizophrenia and Bipolar Disorder, Part I: Methods and Power Analysis Douglas F. Levinson, Matthew D. Levinson, Ricardo Segurado,
Fragile X and X-Linked Intellectual Disability: Four Decades of Discovery Herbert A. Lubs, Roger E. Stevenson, Charles E. Schwartz The American Journal.
Sequence Kernel Association Tests (SKAT) for the Combined Effect of Rare and Common Variants 統計論文 奈良原.
Peopling of Sahul: mtDNA Variation in Aboriginal Australian and Papua New Guinean Populations Alan J. Redd, Mark Stoneking The American Journal of Human.
Chen Yao, Roby Joehanes, Andrew D
Genetic Landscape of Eurasia and “Admixture” in Uyghurs
DOMINO: Using Machine Learning to Predict Genes Associated with Dominant Disorders  Mathieu Quinodoz, Beryl Royer-Bertrand, Katarina Cisarova, Silvio.
Practical Approaches for Whole-Genome Sequence Analysis of Heart- and Blood- Related Traits  Alanna C. Morrison, Zhuoyi Huang, Bing Yu, Ginger Metcalf,
Daniel J. Schaid, Charles M. Rowland, David E. Tines, Robert M
Marc A. Coram, Huaying Fang, Sophie I. Candille, Themistocles L
Gene-centric Association Signals for Lipids and Apolipoproteins Identified via the HumanCVD BeadChip  Philippa J. Talmud, Fotios Drenos, Sonia Shah, Tina.
Genetic Association Analysis under Complex Survey Sampling: The Hispanic Community Health Study/Study of Latinos  Dan-Yu Lin, Ran Tao, William D. Kalsbeek,
SEQSpark: A Complete Analysis Tool for Large-Scale Rare Variant Association Studies Using Whole-Genome and Exome Sequence Data  Di Zhang, Linhai Zhao,
CHEK2*1100delC and Susceptibility to Breast Cancer: A Collaborative Analysis Involving 10,860 Breast Cancer Cases and 9,065 Controls from 10 Studies 
2016 Curt Stern Award Address: From Rare to Common Diseases: Translating Genetic Discovery to Therapy1  Brendan Lee  The American Journal of Human Genetics 
Reliable Identification of Genomic Variants from RNA-Seq Data
Yu Jiang, Glen A. Satten, Yujun Han, Michael P. Epstein, Erin L
Meta-analysis of Genetic-Linkage Analysis of Quantitative-Trait Loci
Huwenbo Shi, Nicholas Mancuso, Sarah Spendlove, Bogdan Pasaniuc 
Tuuli Lappalainen, Stephen B. Montgomery, Alexandra C
Improved Heritability Estimation from Genome-wide SNPs
Rounak Dey, Ellen M. Schmidt, Goncalo R. Abecasis, Seunggeun Lee 
Genetic Association Analysis under Complex Survey Sampling: The Hispanic Community Health Study/Study of Latinos  Dan-Yu Lin, Ran Tao, William D. Kalsbeek,
Elizabeth Theusch, Analabha Basu, Jane Gitschier 
XMCPDT Does Have Correct Type I Error Rates
Variant Association Tools for Quality Control and Analysis of Large-Scale Sequence and Genotyping Array Data  Gao T. Wang, Bo Peng, Suzanne M. Leal  The.
A Flexible Bayesian Framework for Modeling Haplotype Association with Disease, Allowing for Dominance Effects of the Underlying Causative Variants  Andrew.
Kristina Allen-Brady, Peggy A. Norton, James M
A Subset-Based Approach Improves Power and Interpretation for the Combined Analysis of Genetic Association Studies of Heterogeneous Traits  Samsiddhi.
A Selection Operator for Summary Association Statistics Reveals Allelic Heterogeneity of Complex Traits  Zheng Ning, Youngjo Lee, Peter K. Joshi, James.
Transethnic Genetic-Correlation Estimates from Summary Statistics
Guidelines for Large-Scale Sequence-Based Complex Trait Association Studies: Lessons Learned from the NHLBI Exome Sequencing Project  Paul L. Auer, Alex.
Xiangqing Sun, Robert Elston, Nathan Morris, Xiaofeng Zhu 
Xing Hua, Haiming Xu, Yaning Yang, Jun Zhu, Pengyuan Liu, Yan Lu 
Random-Effects Model Aimed at Discovering Associations in Meta-Analysis of Genome- wide Association Studies  Buhm Han, Eleazar Eskin  The American Journal.
Meta-analysis of Correlated Traits via Summary Statistics from GWASs with an Application in Hypertension  Xiaofeng Zhu, Tao Feng, Bamidele O. Tayo, Jingjing.
Family-Based Association Studies for Next-Generation Sequencing
Are Rare Variants Responsible for Susceptibility to Complex Diseases?
Structural Architecture of SNP Effects on Complex Traits
Christoph Lange, Nan M. Laird  The American Journal of Human Genetics 
Diego Calderon, Anand Bhaskar, David A
Hugues Aschard, Bjarni J. Vilhjálmsson, Amit D. Joshi, Alkes L
Rare-Variant Association Testing for Sequencing Data with the Sequence Kernel Association Test  Michael C. Wu, Seunggeun Lee, Tianxi Cai, Yun Li, Michael.
Johanna Jakobsdottir, Mary Sara McPeek 
Dan-Yu Lin, Zheng-Zheng Tang  The American Journal of Human Genetics 
Yu Jiang, Yujun Han, Slavé Petrovski, Kouros Owzar, David B
Erratum The American Journal of Human Genetics
Michael P. Epstein, Xihong Lin, Michael Boehnke 
Estimating Genetic Effects and Quantifying Missing Heritability Explained by Identified Rare-Variant Associations  Dajiang J. Liu, Suzanne M. Leal  The.
Imputing Phenotypes for Genome-wide Association Studies
Increasing the Power and Efficiency of Disease-Marker Case-Control Association Studies through Use of Allele-Sharing Information  Tasha E. Fingerlin,
Wei Pan, Il-Youp Kwak, Peng Wei  The American Journal of Human Genetics 
Statistical Treatment of Fluorescence in Situ Hybridization Validation Data to Generate Normal Reference Ranges Using Excel Functions  Allison L. Ciolino,
Interpretation of Association Signals and Identification of Causal Variants from Genome- wide Association Studies  Kai Wang, Samuel P. Dickson, Catherine.
Unified Sequence-Based Association Tests Allowing for Multiple Functional Annotations and Meta-analysis of Noncoding Variation in Metabochip Data  Zihuai.
Test for Interaction between Two Unlinked Loci
Tao Wang, Robert C. Elston  The American Journal of Human Genetics 
Iuliana Ionita-Laza, Seunggeun Lee, Vlad Makarov, Joseph D
Xing Hua, Haiming Xu, Yaning Yang, Jun Zhu, Pengyuan Liu, Yan Lu 
Leveraging Multi-ethnic Evidence for Mapping Complex Traits in Minority Populations: An Empirical Bayes Approach  Marc A. Coram, Sophie I. Candille, Qing.
Estimating SNP-Based Heritability and Genetic Correlation in Case-Control Studies Directly and with Summary Statistics  Omer Weissbrod, Jonathan Flint,
Jung-Ying Tzeng, Daowen Zhang  The American Journal of Human Genetics 
Presentation transcript:

Meta-analysis for Discovering Rare-Variant Associations: Statistical Methods and Software Programs  Zheng-Zheng Tang, Dan-Yu Lin  The American Journal of Human Genetics  Volume 97, Issue 1, Pages 35-53 (July 2015) DOI: 10.1016/j.ajhg.2015.05.001 Copyright © 2015 The American Society of Human Genetics Terms and Conditions

Figure 1 The Impact of Transformation on Meta-analysis of Two Studies The original trait distributions are skewed in both studies such that variance is larger in study 2 than in study 1. After INT, both distributions are normal and have the same variance. After R-INT, both distributions are normal, and the two variances are the same as their original values. If the effect sizes are originally equal between the two studies, then the effect sizes are unequal after INT but are approximately equal after R-INT. The American Journal of Human Genetics 2015 97, 35-53DOI: (10.1016/j.ajhg.2015.05.001) Copyright © 2015 The American Society of Human Genetics Terms and Conditions

Figure 2 Power of FE and RE Tests as a Function of the SD of the Effect Size when the Mean Effects of Causal Variants Are Either 0.25 or 0 Under model 1 (rare-variant model), 50% of variants with a MAF < 0.5% are causal such that each increases the trait value by the same amount. Under model 2 (low-frequency-variant model), 50% of all variants are causal such that each increases the trait value by the same amount. Under model 3 (opposite-effects model), 50% of all variants are causal such that 80% increase the trait value by the same amount, and the remaining 20% decrease the trait value by the same amount. The nominal significance level is 2.5 × 10−6. The American Journal of Human Genetics 2015 97, 35-53DOI: (10.1016/j.ajhg.2015.05.001) Copyright © 2015 The American Society of Human Genetics Terms and Conditions

Figure 3 Power of FE Tests as a Function of the Coefficient of Variation of the Trait Variance in Six Studies Under model 1 (rare-variant model), 50% of variants with a MAF < 0.5% are causal such that each increases the trait value by the same amount. Under model 2 (low-frequency-variant model), 50% of all variants are causal such that and each increases the trait value by the same amount. Under model 3 (opposite-effects model), 50% of all variants are causal such that 80% increase the trait value by the same amount, and the remaining 20% decrease the trait value by the same amount. Burden, burden-INT, and burden-R-INT are based on normalized score statistics with no transformation, INT, and R-INT, respectively, and burden-UN is based on un-normalized (UN) score statistics without transformation. VT, VT-UN, VT-INT, VT-R-INT, SKAT, SKAT-UN, SKAT-INT, and SKAT-R-INT are defined analogously. The nominal significance level is 2.5 × 10−6. Under normal distributions, the curves for normalized score statistics without transformation and with R-INT largely overlap. CV, coefficient of variation. The American Journal of Human Genetics 2015 97, 35-53DOI: (10.1016/j.ajhg.2015.05.001) Copyright © 2015 The American Society of Human Genetics Terms and Conditions

Figure 4 Four Pipelines for Meta-analysis and the Role of PreMeta Each pipeline inputs individual-participant data, performs study-specific analysis to produce relevant summary statistics, combines summary statistics across studies, and performs gene-level association tests. PreMeta reformats the files for the summary statistics such that the information can be exchanged across the four pipelines. The American Journal of Human Genetics 2015 97, 35-53DOI: (10.1016/j.ajhg.2015.05.001) Copyright © 2015 The American Society of Human Genetics Terms and Conditions

Figure 5 The Role of PreMeta in the Meta-analysis of Four Studies with Different Operators SCORE-Seq is used to generate summary statistics for study 1, and the output can be directly used as input for MASS. RAREMETALWORKER, MetaSKAT, and seqMeta are used to generate summary statistics for the other three studies, and the output files are reformatted by PreMeta to be input into MASS. The American Journal of Human Genetics 2015 97, 35-53DOI: (10.1016/j.ajhg.2015.05.001) Copyright © 2015 The American Society of Human Genetics Terms and Conditions