Mai Zhou Dept. of Statistics, University of Kentucky

Slides:



Advertisements
Similar presentations
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 6 Hypothesis Tests with Means.
Advertisements

Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
STA/BST 222 Fall 2011 Nov 22,  K-M estimate  COX MODEL  AFT MODEL.
Fundamentals of Data Analysis Lecture 12 Methods of parametric estimation.
Sampling Distributions (§ )
Sampling Distributions
Parametric Inference.
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
Inference about a Mean Part II
Simple Linear Regression and Correlation
AM Recitation 2/10/11.
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
Overview G. Jogesh Babu. Probability theory Probability is all about flip of a coin Conditional probability & Bayes theorem (Bayesian analysis) Expectation,
Confidence Intervals for Means. point estimate – using a single value (or point) to approximate a population parameter. –the sample mean is the best point.
CHAPTER 14 MULTIPLE REGRESSION
● Final exam Wednesday, 6/10, 11:30-2:30. ● Bring your own blue books ● Closed book. Calculators and 2-page cheat sheet allowed. No cell phone/computer.
FOUNDATIONS OF NURSING RESEARCH Sixth Edition CHAPTER Copyright ©2012 by Pearson Education, Inc. All rights reserved. Foundations of Nursing Research,
1 10 Statistical Inference for Two Samples 10-1 Inference on the Difference in Means of Two Normal Distributions, Variances Known Hypothesis tests.
HSRP 734: Advanced Statistical Methods July 17, 2008.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
Empirical Likelihood Mai Zhou Dept. of Statistics, University of Kentucky.
Mathematical Model for the Law of Comparative Judgment in Print Sample Evaluation Mai Zhou Dept. of Statistics, University of Kentucky Luke C.Cui Lexmark.
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 6 Hypothesis Tests with Means.
Empirical Likelihood for Right Censored and Left Truncated data Jingyu (Julia) Luan University of Kentucky, Johns Hopkins University March 30, 2004.
Bootstrap Event Study Tests Peter Westfall ISQS Dept. Joint work with Scott Hein, Finance.
Reasoning in Psychology Using Statistics Psychology
Lecture 4: Likelihoods and Inference Likelihood function for censored data.
Inferences Concerning Variances
Chapter 10 Statistical Inference for Two Samples More than one but less than three! Chapter 10B < X
Two Types of Empirical Likelihood Zheng, Yan Department of Biostatistics University of California, Los Angeles.
Lecture 22 Dustin Lueker.  Similar to testing one proportion  Hypotheses are set up like two sample mean test ◦ H 0 :p 1 -p 2 =0  Same as H 0 : p 1.
Statistics and probability Dr. Khaled Ismael Almghari Phone No:
Overview G. Jogesh Babu. R Programming environment Introduction to R programming language R is an integrated suite of software facilities for data manipulation,
AP Statistics Chapter 14 Section 1.
CHAPTER 10 Comparing Two Populations or Groups
Chapter 4. Inference about Process Quality
Chapter 9: Inferences Involving One Population
Math 4030 – 10b Inferences Concerning Variances: Hypothesis Testing
This Week Review of estimation and hypothesis testing
Mai Zhou Dept. of Statistics, University of Kentucky Chengwu Cui
Statistics 262: Intermediate Biostatistics
Reasoning in Psychology Using Statistics
Confidence Intervals Tobias Econ 472.
Chapter 12 Inference on the Least-squares Regression Line; ANOVA
Review of Statistics (SW Chapters 3)
Reasoning in Psychology Using Statistics
Statistical Inference about Regression
Tutorial 4 For the Seat Belt Data, the Death Penalty Data, and the University Admission Data, (1). Identify the response variable and the explanatory.
CHAPTER 10 Comparing Two Populations or Groups
Confidence Intervals Tobias Econ 472.
If we can reduce our desire,
Introduction to SAS Essentials Mastering SAS for Data Analytics
Chapter 11: Testing a Claim
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Reasoning in Psychology Using Statistics
Love does not come by demanding from others, but it is a self initiation. Survival Analysis.
Chapter Outline Inferences About the Difference Between Two Population Means: s 1 and s 2 Known.
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Chapter 18: The Chi-Square Statistic
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
STA 291 Spring 2008 Lecture 22 Dustin Lueker.
Lecture 4: Likelihoods and Inference
Inference for Distributions
CHAPTER 10 Comparing Two Populations or Groups
Lecture 4: Likelihoods and Inference
Introductory Statistics
How Confident Are You?.
Presentation transcript:

Mai Zhou Dept. of Statistics, University of Kentucky Empirical Likelihood Mai Zhou Dept. of Statistics, University of Kentucky

Any first year Statistical Inference course will talk about (parametric) “likelihood function”. Three inference methods (tests) based on likelihood: 1. Wald test 2. Score test (Rao’s Score test) 3. Likelihood ratio test (Wilks)

Empirical likelihood is a nonparametric version of 3.

Empirical Likelihood allows the statistician to employ likelihood methods, without having to pick a parametric family of distributions for the data. --- Owen Empirical Likelihood allows for hypothesis testing and confidence region construction without an information/variance estimator.-- me Plus many additional nice properties.

First book on this subject (2001) by A. Owen “Empirical Likelihood” . But in Cox model the (partial) likelihood ratio exists for a long time (over 20 years). SAS proc phreg, Splus function coxph( ) all have it computed. Claim: The (partial) likelihood ratio statistic for the regression coefficients in the Cox model can be interpreted as a case of Empirical Likelihood Ratio.

For n observations, independent, from the empirical likelihood is EL(F) = Where

EL(F) is maximized by the empirical distribution function:

An additional parameter of interest, when maximizing the EL(F) F(t) or can be considered as nuisance parameters

Censored Observations For a right censored observation The likelihood contribution is For a left censored observation the contribution is Interval censored:

Truncated observations For a left truncated observation (often referred to as delayed entry) : (entry time, survival time) = The likelihood contribution is If the survival time is also right censored, then the likelihood contribution is

Empirical Likelihood Theorem: If the null hypothesis is true then

R = Gnu S/Splus http://cran.us.r-project.org + many add-on packages A Package for empirical likelihood with censored/truncated data Contributed package – emplik (maintained by Mai Zhou) It Does Empirical likelihood ratio tests for means or weighted hazard, based on left-truncated, right censored or left, right, doubly censored data.

Tests hypothesis of the form: with right, left, doubly censored data. Or with left-truncated, right censored data.

Example: Data taken from Klein & Moeschberger (1997) book as reported in their table 1.7 y = left truncation time = (51, 58, 55, 28, 25, 48, 47, 25, 31, 30, 33, 43, 45, 35, 36) x = survival times of female psychiatric inpatients = (52, 59, 57, 50, 57, 59, 61, 61, 62, 67, 68, 69, 69, 65, 76) d = censoring status = ( 1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 1, 1, 0, 1 )

> library(emplik) > el.ltrc.EM( y, x, d, mu=62) The mean of the NPMLE is 63.18557. (if ‘fun’ is left out, then fun=t, by default). Two of the outputs are -2LLR = 0.2740571 Pval = 0.6006231

Repeat the test for many different values of the mean. (mu=59, etc. ) If the hypothesized mean is inside the interval [58.78936, 67.81304], the p-value of the test is larger then 0.05. ----- the 95% confidence interval for the mean is [58.78936, 67.81304]

For doubly censored data, the standard deviation of the NPMLE is hard to compute. The Wald test/confidence interval is hard to do. No problem with empirical likelihood ratio! No need to estimate the standard deviation, instead, we need to maximize EL under some constraint. The maximization can be achieved with the help of modern computer. (E-M algorithm etc.)