Sequence Kernel Association Tests (SKAT) for the Combined Effect of Rare and Common Variants 2013.06.17 統計論文奈良原.

Slides:

Advertisements

Similar presentations

Pattern Recognition and Machine Learning

Advertisements

Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee.

© Department of Statistics 2012 STATS 330 Lecture 32: Slide 1 Stats 330: Lecture 32.

ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.

An Introduction of Support Vector Machine

Machine learning continued Image source:

CS Statistical Machine learning Lecture 13 Yuan (Alan) Qi Purdue CS Oct

Second order cone programming approaches for handing missing and uncertain data P. K. Shivaswamy, C. Bhattacharyya and A. J. Smola Discussion led by Qi.

Slide 1 EE3J2 Data Mining EE3J2 Data Mining Lecture 10 Statistical Modelling Martin Russell.

Linear Methods for Classification

CS Pattern Recognition Review of Prerequisites in Math and Statistics Prepared by Li Yang Based on Appendix chapters of Pattern Recognition, 4.

Lecture outline Support vector machines. Support Vector Machines Find a linear hyperplane (decision boundary) that will separate the data.

What is Learning All about ?  Get knowledge of by study, experience, or being taught  Become aware by information or from observation  Commit to memory.

Pattern Recognition. Introduction. Definitions.. Recognition process. Recognition process relates input signal to the stored concepts about the object.

Random Variable and Probability Distribution

5-1 Two Discrete Random Variables Example Two Discrete Random Variables Figure 5-1 Joint probability distribution of X and Y in Example 5-1.

5-1 Two Discrete Random Variables Example Two Discrete Random Variables Figure 5-1 Joint probability distribution of X and Y in Example 5-1.

Chapter 21 Random Variables Discrete: Bernoulli, Binomial, Geometric, Poisson Continuous: Uniform, Exponential, Gamma, Normal Expectation & Variance, Joint.

Ensemble Learning (2), Tree and Forest

Sampling Distributions  A statistic is random in value … it changes from sample to sample.  The probability distribution of a statistic is called a sampling.

Today Wrap up of probability Vectors, Matrices. Calculus

Binary Variables (1) Coin flipping: heads=1, tails=0 Bernoulli Distribution.

1 Linear Methods for Classification Lecture Notes for CMPUT 466/551 Nilanjan Ray.

Ch. Eick: Support Vector Machines: The Main Ideas Reading Material Support Vector Machines: 1.Textbook 2. First 3 columns of Smola/Schönkopf article on.

Machine Learning CUNY Graduate Center Lecture 3: Linear Regression.

Outline Separating Hyperplanes – Separable Case

Discriminant Function Analysis Basics Psy524 Andrew Ainsworth.

Generalized Linear Mixed Model (GLMM) & Weighted Sum Test (WST) Detecting Association between Rare Variants and Complex Traits Qunyuan Zhang, Ingrid Borecki,

Support Vector Machines Mei-Chen Yeh 04/20/2010. The Classification Problem Label instances, usually represented by feature vectors, into one of the predefined.

1 Association Analysis of Rare Genetic Variants Qunyuan Zhang Division of Statistical Genomics Course M Computational Statistical Genetics.

Support Vector Machines Reading: Ben-Hur and Weston, “A User’s Guide to Support Vector Machines” (linked from class web page)

Learning Theory Reza Shadmehr Linear and quadratic decision boundaries Kernel estimates of density Missing data.

Applying Statistical Machine Learning to Retinal Electrophysiology Matt Boardman January, 2006 Faculty of Computer Science.

Computational Intelligence: Methods and Applications Lecture 23 Logistic discrimination and support vectors Włodzisław Duch Dept. of Informatics, UMK Google:

New Measures of Data Utility Mi-Ja Woo National Institute of Statistical Sciences.

CS 478 – Tools for Machine Learning and Data Mining SVM.

Sparse Kernel Methods 1 Sparse Kernel Methods for Classification and Regression October 17, 2007 Kyungchul Park SKKU.

1  The Problem: Consider a two class task with ω 1, ω 2   LINEAR CLASSIFIERS.

CSE4334/5334 DATA MINING CSE4334/5334 Data Mining, Fall 2014 Department of Computer Science and Engineering, University of Texas at Arlington Chengkai.

Linear Methods for Classification Based on Chapter 4 of Hastie, Tibshirani, and Friedman David Madigan.

Support Vector Machines. Notation Assume a binary classification problem. –Instances are represented by vector x   n. –Training examples: x = (x 1,

Linear Correlation (12.5) In the regression analysis that we have considered so far, we assume that x is a controlled independent variable and Y is an.

Feature Selction for SVMs J. Weston et al., NIPS 2000 오장민 (2000/01/04) Second reference : Mark A. Holl, Correlation-based Feature Selection for Machine.

Powerful Regression-based Quantitative Trait Linkage Analysis of General Pedigrees Pak Sham, Shaun Purcell, Stacey Cherny, Gonçalo Abecasis.

Support Vector Machines Reading: Ben-Hur and Weston, “A User’s Guide to Support Vector Machines” (linked from class web page)

Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 7: Regression.

Chapter 5 Joint Probability Distributions and Random Samples  Jointly Distributed Random Variables.2 - Expected Values, Covariance, and Correlation.3.

Tree and Forest Classification and Regression Tree Bagging of trees Boosting trees Random Forest.

Biostatistics Class 3 Probability Distributions 2/15/2000.

1 C.A.L. Bailer-Jones. Machine Learning. Data exploration and dimensionality reduction Machine learning, pattern recognition and statistical data modelling.

Estimating standard error using bootstrap

Deep Feedforward Networks

CH 5: Multivariate Methods

Propagating Uncertainty In POMDP Value Iteration with Gaussian Process

Beyond GWAS Erik Fransen.

What is Regression Analysis?

Pattern Recognition and Machine Learning

More Parameter Learning, Multinomial and Continuous Variables

Zheng-Zheng Tang, Dan-Yu Lin The American Journal of Human Genetics

Generally Discriminant Analysis

Support Vector Machines

Parametric Methods Berlin Chen, 2005 References:

Multivariate Methods Berlin Chen

Mathematical Foundations of BME

Multivariate Methods Berlin Chen, 2005 References:

Rare-Variant Association Testing for Sequencing Data with the Sequence Kernel Association Test Michael C. Wu, Seunggeun Lee, Tianxi Cai, Yun Li, Michael.

Linear Discrimination

Unified Sequence-Based Association Tests Allowing for Multiple Functional Annotations and Meta-analysis of Noncoding Variation in Metabochip Data Zihuai.

Kernel Methods for large-scale Genomics Data Analysis

Support Vector Machines 2

Presentation transcript:

Sequence Kernel Association Tests (SKAT) for the Combined Effect of Rare and Common Variants 2013.06.17 統計論文奈良原

The American journal of Human Genetics (2013) SKAT Wu, M.C., Lee, S., Cai, T., Li, Y., Boehnke, M., and Lin, X. (2011). Rare-variant association testing for sequencing data with the sequence kernel association test. Am. J. Hum. Genet. 89, 82–93. Developed for rare-variant analysis

Background of rare variant analysis Classic: burden tests Collapsing method: rare variant +/- in a region Counts of rare alleles Combined multivariate and collapsing (CMC) method rare variants: collapsed, common variants: each forms a separate group --> Combined by Hotelling's T2 statistic Weighted sum Non-burden tests C-alpha test Sequence kernel association test (SKAT) Problem of Burden tests Burden tests assume that all rare variants influence the phenotype in the same direction with the same magnitude of effects (after weighting). methods that are robust to different direction and magnitude of effects

Development of SKAT SKAT (2011) A kernel regression approach non-parametric non-linear regression flexible weighting function weights based on minor allele frequency based on SNP functional annotation Wide range of application binary/continuous traits adjustment for covariates both rare and common variants (up-weighting rare variants) Efficient computation Score test for variance-component in linear mixed model

Development of SKAT (2) SKAT-O: Optimal unified approach (AJHG, 2012) Combination of a burden test and SKAT Burden test ... optimal when most variants in a region are causal and the effects are in the same direction SKAT ... optimal when a large fraction of the variants in a regions are non-causal or the effects of causal variants are in different directions Extension of SKAT-O to testing a combined effect of rare and common variants (AJHG, 2013)

methods

Linear mixed model Genetic effect: random effect

Variance component score test Choice of a kernel function weighted linear weighted quadratic weighted IBS = kernel function Genetic similarity between subjects (weighted) Choice of weights Typical parameter: a1=1, a2=25 P value given by the Davies method Approximation of Q statistic

Optimal unified approach, SKAT-O Next, they unified burden test and SKAT to optimize the rare variant analysis. Burden test More powerful than SKAT when most variants in a region are causal and the effects are in the same direction SKAT More powerful than burden test when a large fraction of the variants in a regions are non-causal or the effects of causal variants are in different directions

Weighted burden test statistic SKAT statistic aggregates the variants before regression first regresses and aggregates the individual variant statistics

Unifying two test statistics Optimal value of ρ is determined by grid search. Qρ is equivalently calculated by the formula of score test statistic ρ: correlation between different βj's ρ=0: regression coefficients are not correlated to each other --> SKAT ρ=1: regression coefficients are perfectly correlated --> Burden test

Rare and common variants together in the SKAT-O framework Different weighting functions are defined for rare and common variants. The effects of rare and common variants are fitted together using separate random effect terms.

Model

Statistic Weighted sum of statistics of rare and common variants

Predefined parameters Weights Rare variants Common variants Contribution, φ Equal contribution or searching the optimal value of φ Beta(1, 25) Beta(0.5, 0.5) MAF

Appendix

Kernel in statistics In Bayesian statistics The kernel of a probability density function or probability mass function is the form of PDF or PMF in which any factors that are not functions of any of the variables in the domain are omitted (normalization factor). Ex. kernel of a normal distribution PDF: Kernel:

Kernel in statistics (2) In non-parametric statistics A kernel is a weighting function Usage Kernel density estimation to estimate random variables' density functions In kernel regression to estimate the conditional expectation of a random variable In time-series to estimate the spectral density Estimation of a time-varying intensity for a point process Definition A kernel is a non-negative real-valued integrable function K satisfying the following two requirements: If K is a kernel, then so is the function K*. K*(u) = λK(λu), where λ > 0 --> A kernel is a PDF. --> A kernel is symmetric about u=0.

Kernel regression The kernel regression is a non-parametric approach to find a non-linear relation between a pair of random variables X and Y. The goal is to estimate a function m that gives conditional expectation of a variable Y relative to a variable X: A kernel is used to estimate a function m.

Kernel trick A kernel trick is a method to project data into a higher-dimensional space so that non-linear data can be separated by a hyperplane. non-linear --> linear Kernel function K(x, z) = <Φ(x), Φ(z)> Φ(・): a function to project data into higher-dimensional space <・, ・> : inner product

Application of kernel method Kernel PCA (non-linear PCA) Kernel CCA Support vector machine

Support vector machine SVM is a machine learning approach that utilizes the kernel function to project data in a higher-dimensional space that can separate the data by a hyperplane. SVM is a non-linear classifier.

Variance component score test Lin, X. (1997). Variance component testing in generalised linear models with random effects. Biometrika84, 309–326. Variance component tests in linear mixed model Likelihood ratio test Score statistic Computationally efficient Wald statistic