Rank-Based Approach to Optimal Score via Dimension Reduction Shao-Hsuan Wang National Taiwan University, Taiwan Nov 2015 1.

Slides:



Advertisements
Similar presentations
COMPUTER INTENSIVE AND RE-RANDOMIZATION TESTS IN CLINICAL TRIALS Thomas Hammerstrom, Ph.D. USFDA, Division of Biometrics The opinions expressed are those.
Advertisements

How Should We Assess the Fit of Rasch-Type Models? Approximating the Power of Goodness-of-fit Statistics in Categorical Data Analysis Alberto Maydeu-Olivares.
Face Recognition Ying Wu Electrical and Computer Engineering Northwestern University, Evanston, IL
Chapter 7 Title and Outline 1 7 Sampling Distributions and Point Estimation of Parameters 7-1 Point Estimation 7-2 Sampling Distributions and the Central.
Haftu Shamini Thomas Temesgen Seyoum
The General Linear Model. The Simple Linear Model Linear Regression.
STAT 497 APPLIED TIME SERIES ANALYSIS
CJT 765: Structural Equation Modeling Class 3: Data Screening: Fixing Distributional Problems, Missing Data, Measurement.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 14 Using Multivariate Design and Analysis.
Chapter 10 Simple Regression.
0 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
REGRESSION What is Regression? What is the Regression Equation? What is the Least-Squares Solution? How is Regression Based on Correlation? What are the.
Statistics 200b. Chapter 5. Chapter 4: inference via likelihood now Chapter 5: applications to particular situations.
Data mining and statistical learning, lecture 5 Outline  Summary of regressions on correlated inputs  Ridge regression  PCR (principal components regression)
Data Basics. Data Matrix Many datasets can be represented as a data matrix. Rows corresponding to entities Columns represents attributes. N: size of the.
Psychology 202b Advanced Psychological Statistics, II February 17, 2011.
Basic Mathematics for Portfolio Management. Statistics Variables x, y, z Constants a, b Observations {x n, y n |n=1,…N} Mean.
Measures of Character Fit Measures of Character Fit.
Probability theory 2008 Outline of lecture 5 The multivariate normal distribution  Characterizing properties of the univariate normal distribution  Different.
Maximum likelihood (ML)
Statistics for the Social Sciences Psychology 340 Spring 2005 Course Review.
Correlation. The sample covariance matrix: where.
Lecture II-2: Probability Review
Learning Letter Sounds Jack Hartman Shake, Rattle, and Read
Structural Equation Modeling 3 Psy 524 Andrew Ainsworth.
1 Enviromatics Environmental statistics Environmental statistics Вонр. проф. д-р Александар Маркоски Технички факултет – Битола 2008 год.
Overview G. Jogesh Babu. Probability theory Probability is all about flip of a coin Conditional probability & Bayes theorem (Bayesian analysis) Expectation,
On ranking in survival analysis: Bounds on the concordance index
Chapter 18 Measures of Association McGraw-Hill/Irwin Copyright © 2011 by The McGraw-Hill Companies, Inc. All Rights Reserved.
The Multiple Correlation Coefficient. has (p +1)-variate Normal distribution with mean vector and Covariance matrix We are interested if the variable.
From GLM to HLM Working with Continuous Outcomes EPSY 5245 Michael C. Rodriguez.
A statistical model Μ is a set of distributions (or regression functions), e.g., all uni-modal, smooth distributions. Μ is called a parametric model if.
1 G Lect 8b G Lecture 8b Correlation: quantifying linear association between random variables Example: Okazaki’s inferences from a survey.
Relationship between two variables Two quantitative variables: correlation and regression methods Two qualitative variables: contingency table methods.
 Muhamad Jantan & T. Ramayah School of Management, Universiti Sains Malaysia Data Analysis Using SPSS.
ELEC 303 – Random Signals Lecture 18 – Classical Statistical Inference, Dr. Farinaz Koushanfar ECE Dept., Rice University Nov 4, 2010.
New Measures of Data Utility Mi-Ja Woo National Institute of Statistical Sciences.
Linear Discriminant Analysis (LDA). Goal To classify observations into 2 or more groups based on k discriminant functions (Dependent variable Y is categorical.
Estimation in Marginal Models (GEE and Robust Estimation)
Brief Review Probability and Statistics. Probability distributions Continuous distributions.
Robust Regression. Regression Methods  We are going to look at three approaches to robust regression:  Regression with robust standard errors  Regression.
Logistic regression. Recall the simple linear regression model: y =  0 +  1 x +  where we are trying to predict a continuous dependent variable y from.
ReCap Part II (Chapters 5,6,7) Data equations summarize pattern in data as a series of parameters (means, slopes). Frequency distributions, a key concept.
Tutorial I: Missing Value Analysis
Armando Teixeira-Pinto AcademyHealth, Orlando ‘07 Analysis of Non-commensurate Outcomes.
McGraw-Hill/Irwin Business Research Methods, 10eCopyright © 2008 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 18 Measures of Association.
CJT 765: Structural Equation Modeling Class 9: Putting it All Together.
- 1 - Preliminaries Multivariate normal model (section 3.6, Gelman) –For a multi-parameter vector y, multivariate normal distribution is where  is covariance.
Université d’Ottawa / University of Ottawa 2001 Bio 8100s Applied Multivariate Biostatistics L11.1 Lecture 11: Canonical correlation analysis (CANCOR)
Antonym means opposite: Come - Go Synonym means same Happy - Glad Homophones are two words that sound the same Red - Read Standard Form 85 Expanded Form.
Basis Expansions and Generalized Additive Models Basis expansion Piecewise polynomials Splines Generalized Additive Model MARS.
1 Ka-fu Wong University of Hong Kong A Brief Review of Probability, Statistics, and Regression for Forecasting.
© 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.McGraw-Hill/Irwin 19-1 Chapter 19 Measures of Association.
Estimating standard error using bootstrap
Inference about the slope parameter and correlation
Multiple Random Variables and Joint Distributions
HGEN Thanks to Fruhling Rijsdijk
CH 5: Multivariate Methods
Inference for the mean vector
Classification Discriminant Analysis
Statistical Methods For Engineers
Classification Discriminant Analysis
From GLM to HLM Working with Continuous Outcomes
Ch11 Curve Fitting II.
Basis Expansions and Generalized Additive Models (2)
(Approximately) Bivariate Normal Data and Inference Based on Hotelling’s T2 WNBA Regular Season Home Point Spread and Over/Under Differentials
STATISTICAL INFERENCE PART I POINT ESTIMATION
RES 500 Academic Writing and Research Skills
Principal Component Analysis
Factor Analysis.
Presentation transcript:

Rank-Based Approach to Optimal Score via Dimension Reduction Shao-Hsuan Wang National Taiwan University, Taiwan Nov

Rank-based measures Kendall’s  Concordance Index Rank correlation Widely used in medical statistics, epidemiology, economics, and sociology, etc. 2

Rank-based measures Regression Model Y : a univariate response Z Z  (Z(Z,, Z ) : multiple covariates 1 p 3

Rank-based measures YRYR Response Y TZTZ TZRTZR Composite score 4

Rank-based measures (Y(Y T,  Z ) ( Y,  Z ) For pair of observations concordant : T 1 and 2 T, Y  Y and  T Z  Z Y  Y and  T Z  Z 1212 discordant : T Y  Y and  T Z  Z Y T  Y and  T Z  Z

Rank-based measures Kendall’s   P(Y P(Y T  Y,  Z  T Z )  P ( Y  Y T,  Z  T Z)Z) 1212 Rank correlation T rc  P ( Y  Y,  Z  Z ) Concordance Index TT CI  P (  Z  Z | Y  Y )

Rank-based measures YRYR Response Y TZTZ TZRTZR Composite score 7

Rank-based measures There could not exist a monotonic association !! 8

Motivation

Composite score TZTZ g (Z) measurable functions 10

C-max YRYR Response Concordance-index function : C ( g )  P ( g ( Z g(Z)Rg(Z)R Composite score )  g ( Z )| Y  Y ) C (g)(g) C-max : max  sup gFgF c Optimal score : m ( Z ) such that m  sup C ( g ) g  F 11 c

Intrinsic model behind Rank-based measures M1 Distributional assumption : Generalized Regression Model (Han 1987) M2 Structural assumption : Dimension Reduction (Li 1991, Cook 1991) 12

Intrinsic model behind Rank-based measures M1 a non-degenerate monotonic function on R YG(mYG(m d (Z),)(Z),) 0 13

Intrinsic model behind Rank-based measures M1 a non-degenerate monotonic function on R YG(mYG(m d (Z),)(Z),) 0 an unspecifed bivariate function strictly increasing at each component for the other one being fixed 14

Intrinsic model behind Rank-based measures M2 Y  D G ( m d (Z),)(Z),) 0 a multivariate polynomial of the unknown degree d 0 15

Intrinsic model behind Rank-based measures M2 Dimension Reduction m(Z)m(Z) T m(Bm(B Z)Z) d dk (1) d 0 be the smallest degree such that YZYZ | m d (Z)(Z) 0 B (2) 0 {{ 01,,  0k00k0 } is a basis of the central subspace (CS) 16

Model Flexibility Linear regression model Y T 0 Z T Binary Choice model Accelerated Failure time model Y I ( log( Y ) 0 Z T 0 0) Z Generalized linear regression model (GLM) Non-monotonic regression model Y(Y( T 0 2 Z)Z) 17

Types of covariates all discrete but continuous covariates Covariates which moments could not exist 18

Theories Propositions: (1) Existence m ( Z  arg max C ( g ) d0d0 g (2) Uniqueness f ( Z )  arg max C ( g ) f(Z)f(Z) cm ( Z )  c (3) Optimality d0d0 g for a ploynomial f d0d0 d01d01 ( z ) of the degree d02d02 d 0 g( Z )  arg max C ( g ) g(Z)T(m(Z))g(Z)T(m(Z)) d g for some monotonic function T 0 19

Summary TZTZ could not be the best composite score Model flexibility Various types of covariates Optimal score : existence, uniqueness, and optimality 20

How to estimate d k 0 0 : structural degree : structural dimension S(BS(B ) : the central subspace 0 m ( BZ ) : the optimal score d k 0 C 0 max : the C-max

Estimation Procedure

Derive m ( Z ) by maximizing the concordance index function via Step1 d the generalized single-index form of the polynomial Tips:(1) dpdp m(Z)cZm(Z)cZ rjrj  T Z d    r  1p1p r0r1rprj1r0r1rprj1 n I(I( T Z  T Z,Y  Y ) (2) C (m ( Z )) C()C()  ijij i1j1i1j1 nd 0 nn n  i1j1i1j1 I (Y  Y ) ijij

Estimation Procedure Step 2 Apply the outer grandient approach to obtain B Tips :(1) T k m (u) mm (B(Bu)u) d dk (2) col( S ( B ))  col(  m ( u )(  m T (u)) dW (u)) 0  p uRuR d0d0d0d0

Estimation Procedure Step 3 Derive the estimator of Tips :(1) m dk T (B(B k Z)Z) ZBTZBTZ k n I(I( T Z  T Z,Y  Y ) (2)  ˆ  arg max  (3) T  i1j1i1j1 T i n  i1j1i1j1 jijjij I (Y  Y ) ijij m (BZ) ˆ Z dk k

Estimation Procedure Step 4 Adopt the concordance-based generalized BIC to estimate T d, k, S ( B ), m ( BZ ), and C 000d0k0000d0k0 Tips : (1) IC ( d, k ) 0max T  nC (m(B log n kdkd Z ))  (C  1) ndkk with IC (0, k )  1/2 (2) (d,k)  arg max IC ( d, k ) 0  d,1   p  1 2 k

Asymptotic results Consistent model selection --- parsimonious model among the class of Correct models (d(d,k,k ) 0 0 n -consistency of estimators of T S ( B ) and m ( B Z)Z) 0 Asymptotic normality of estimators of C d0k00d0k00 max 27

Wine Data Vinho verde wine : red wine and white wine (from the Minho Region of Northern Portugal) Collected from May/2004 -February/2007 Red wine : sample size (n)=1599 White wine : n=4898 Physicochemical and sensory tests

Wine data Response (Y): Preferences 0 (bad) -10 (excellent) 11 Covariates (Z) : fixed acidity, volatile acidity, citric acid, residual sugar, chlorides, free sulfur dioxide, total sulfur dioxide, density, PH, sulphates, and alcohol

Wine data 30

Wine data 31

Thank You !