Bootstrap Estimation of Disease Incidence Proportion with Measurement Errors Masataka Taguri (Univ. of Tokyo.) Hisayuki Tsukuma (Toho Univ.)

Slides:



Advertisements
Similar presentations
The Simple Linear Regression Model Specification and Estimation Hill et al Chs 3 and 4.
Advertisements

Review bootstrap and permutation
3.3 Hypothesis Testing in Multiple Linear Regression
Brief introduction on Logistic Regression
Uncertainty and confidence intervals Statistical estimation methods, Finse Friday , 12.45–14.05 Andreas Lindén.
Logistic Regression.
April 21, 2010 STAT 950 Chris Wichman. Motivation Every ten years, the U.S. government conducts a population census, and every five years the U. S. National.
Section 3.4 Bootstrap Confidence Intervals using Percentiles.
1 Chapter 2 Simple Linear Regression Ray-Bing Chen Institute of Statistics National University of Kaohsiung.
1 Summarizing Performance Data Confidence Intervals Important Easy to Difficult Warning: some mathematical content.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Point estimation, interval estimation
Statistical Inference Chapter 12/13. COMP 5340/6340 Statistical Inference2 Statistical Inference Given a sample of observations from a population, the.
Sample size computations Petter Mostad
Statistical inference form observational data Parameter estimation: Method of moments Use the data you have to calculate first and second moment To fit.
2. Point and interval estimation Introduction Properties of estimators Finite sample size Asymptotic properties Construction methods Method of moments.
2008 Chingchun 1 Bootstrap Chingchun Huang ( 黃敬群 ) Vision Lab, NCTU.
Inferences About Process Quality
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Regression Eric Feigelson. Classical regression model ``The expectation (mean) of the dependent (response) variable Y for a given value of the independent.
AM Recitation 2/10/11.
Chapter 7 Estimation: Single Population
Bootstrap and Cross-Validation Bootstrap and Cross-Validation.
Statistics & Biology Shelly’s Super Happy Fun Times February 7, 2012 Will Herrick.
Prof. Dr. S. K. Bhattacharjee Department of Statistics University of Rajshahi.
Estimation Bias, Standard Error and Sampling Distribution Estimation Bias, Standard Error and Sampling Distribution Topic 9.
Statistical Interval for a Single Sample
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
Estimating Incremental Cost- Effectiveness Ratios from Cluster Randomized Intervention Trials M. Ashraf Chaudhary & M. Shoukri.
1 Nonparametric Methods II Henry Horng-Shing Lu Institute of Statistics National Chiao Tung University
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Performance of Resampling Variance Estimation Techniques with Imputed Survey data.
Sampling and Confidence Interval Kenneth Kwan Ho Chui, PhD, MPH Department of Public Health and Community Medicine
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
1 Lecture 16: Point Estimation Concepts and Methods Devore, Ch
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
ELEC 303 – Random Signals Lecture 18 – Classical Statistical Inference, Dr. Farinaz Koushanfar ECE Dept., Rice University Nov 4, 2010.
Sample size and power estimation when covariates are measured with error Michael Wallace London School of Hygiene and Tropical Medicine.
RDPStatistical Methods in Scientific Research - Lecture 41 Lecture 4 Sample size determination 4.1 Criteria for sample size determination 4.2 Finding the.
Lecture 4: Statistics Review II Date: 9/5/02  Hypothesis tests: power  Estimation: likelihood, moment estimation, least square  Statistical properties.
Limits to Statistical Theory Bootstrap analysis ESM April 2006.
CHEMISTRY ANALYTICAL CHEMISTRY Fall Lecture 6.
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
Simple linear regression Tron Anders Moger
1 Summarizing Performance Data Confidence Intervals Important Easy to Difficult Warning: some mathematical content.
1 Follow the three R’s: Respect for self, Respect for others and Responsibility for all your actions.
Summarizing Risk Analysis Results To quantify the risk of an output variable, 3 properties must be estimated: A measure of central tendency (e.g. µ ) A.
1 G Lect 7a G Lecture 7a Comparing proportions from independent samples Analysis of matched samples Small samples and 2  2 Tables Strength.
Nonparametric Methods II 1 Henry Horng-Shing Lu Institute of Statistics National Chiao Tung University
Case Selection and Resampling Lucila Ohno-Machado HST951.
Chapter 9 Inferences Based on Two Samples: Confidence Intervals and Tests of Hypothesis.
Hypothesis Testing. Statistical Inference – dealing with parameter and model uncertainty  Confidence Intervals (credible intervals)  Hypothesis Tests.
Computacion Inteligente Least-Square Methods for System Identification.
Quantifying Uncertainty
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
Estimating standard error using bootstrap
Bootstrap and Model Validation
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
Statistics in Applied Science and Technology
Georgi Iskrov, MBA, MPH, PhD Department of Social Medicine
SA3202 Statistical Methods for Social Sciences
Stats Club Marnie Brennan
Quantifying uncertainty using the bootstrap
Review for Exam 2 Some important themes from Chapters 6-9
Simple Linear Regression
Confidence Interval.
Logistic Regression.
Georgi Iskrov, MBA, MPH, PhD Department of Social Medicine
Presentation transcript:

Bootstrap Estimation of Disease Incidence Proportion with Measurement Errors Masataka Taguri (Univ. of Tokyo.) Hisayuki Tsukuma (Toho Univ.)

Contents Motivation Formulation of the Problem Regression Calibration Method Proposed Estimator Interval Estimation (Delta & Bootstrap) Numerical Examination

Motivation Measurement error problem –The measurement error of exposure variable will tend to give bias estimates of relative risk 〈 ex 〉 In nutritional studies, FFQ (food frequency questionnaire) is not exact measurement ★ How to obtain correct risk estimates?

Formulation of Problem (1) Data form –Main data : : binary response (Disease or not) : exposure measured with error : No. of main data-set –Validation data : : true exposure : No. of validation data-set

Formulation of Problem (2) Disease model Assume linear relationship of and ~ Naive model ( : observable) ★ Usual interest : estimation of

Regression Calibration (RC) method Model ~ Rosner, et al.1989 (i) estimate by maximum likelihood (ii) estimate by ordinary least squares (iii) estimate by

Asymptotic Variance of Apply the delta method, then asymptotic variance of is given by

Subject for investigation ★ RC estimator : might be unstable (heavy tailed) ・ : asymptotically ratio of two normal variables MLE OLSE

Objective of our study ① To propose an estimator of –examination of the stability of estimators ② Interval estimation of –Bootstrap Method [Percentile & BCa] –Delta Method ③ Interval estimation of –Bootstrap Method [Percentile & BCa] –Delta Method

Inverse Regression Estimator To construct alternative estimator of, consider the inverse regression model; The ordinary least squares estimator of

Generalized Estimator Ridge-type Estimator for Generalized Inverse Regression Estimator ★ ⇒ RC estimator ★ The asymptotic distribution of has moments.

Interval Estimation of (by Delta method) [Algorithm] 1) Use and to construct the confidence intervals of 2) Estimate the asymptotic variance of and by delta method 3) Make confidence intervals by normal approximation Rosner, et al.1989

Interval Estimation of (by Bootstrap method) Estimator : [Algorithm] 1)Compute by resamples from main data-set. 2) Compute by resamples from validation data-set. 3) Compute for 4) Construct confidence intervals of by Bootstrap Method [Percentile & BCa]. (Efron & Tibshirani, 1993)

Interval Estimation of Estimator of p : : RC / Inverse Regression Est. Confidence Interval (C.I.) of p : by Delta or Bootstrap (Percentile & BCa) ★ p should exist between 0 and 1 ⇒ C.I. of logit(p) → C.I. of p

Numerical Examination – Set-up Validation sample 1°Generate from N(0,1) 2°Assume the model Generate Make validation sample for ★ The above model is a special case of

Main sample 3°Generate Generate from Bernoulli dist.( ). Combine with (paired value of ) ⇒ 4°Set Compute 95% C.I. of logit(p), then p by Normal approximation, Percentile & BCa. ★ Set ・ Computation : for all combination of

Table 1 Estimates of α 1 ( R=1000, B=2000, n=1000, m=100) Esti- mator MeanSD Skew- ness Kurto- sis MinMax Range α 1 = (exp(α 1 )=1.5) , λ 1 = 0.5 α1Rα1R α1Gα1G α 1 = (exp(α 1 )=2.0) , λ 1 = 0.5 α1Rα1R α1Gα1G α 1 = (exp(α 1 )=3.0) , λ 1 = 0.5 α1Rα1R α1Gα1G RC GI

Examination on stability of α 1 (1) underestimates except : monotone increasing w.r.t. except for is smaller than for all cases. Biases of, : not so different on the whole. (2) SD, skewness, kurtosis, range : values for > values for ⇒ Tail of distribution is heavier than that of → Outliers may often appear.

Table % Confidence Intervals of α 1 ( R=1000, B=2000, n=1000, m=100) α 1 = (exp(α 1 )=1.5) , λ 1 = 0.5 MethodCov. Prob.LengthShape NR NG PR PG BR BG Nor.app. BCa Parcentile

Table % Confidence Intervals of α 1 ( R=1000, B=2000, n=1000, m=100) α 1 = (exp(α 1 )=2.0) , λ 1 = 0.5 MethodCov. Prob.LengthShape NR NG PR PG BR BG

Table % Confidence Intervals of α 1 ( R=1000, B=2000, n=1000, m=100) α 1 = (exp(α 1 )=3.0) , λ 1 = 0.5 MethodCov. Prob.LengthShape NR NG PR PG BR BG

Examination on confidence interval of α 1 (1) Cov. Prob. of NG < Cov. Prob. of NR. Cov. Prob. of normal approximation : monotone decreasing w.r.t. (odds ratio) (2) Length of C.I. by < Length of C.I. by (for all cases) ⇒ stable property of (3) Bootstrap : slightly worse than Normal approxim. ( from viewpoint of Cov. Prob. ) ★ Percentile is best in case of (4) BCa : not better than other methods ⇔ The distribution of estimates is not so skew?

Table % Confidence Intervals of p ( R=1000, B=2000, n=1000, m=100 ; exp(α 1 )=2.0, λ 1 =0.5) MethodCov. Prob.LengthShape T=-1, Pr[D|T]= NR NG PR PG BR BG T=0, Pr[D|T]=0.05 NR NG PR PG BR BG

Table % Confidence Intervals of p ( R=1000, B=2000, n=1000, m=100 ; exp(α 1 )=2.0, λ 1 =0.5) MethodCov. Prob.LengthShape T=1, Pr[D|T]= NR NG PR PG BR BG T=2, Pr[D|T]= NR NG PR PG BR BG

Examination on confidence interval of p (1) Cov. Prob. for Percentile : does not keep the nominal level, especially in case of Cov. Prob. for BCa : quite satisfactory for almost all cases. (2) Length of C.I. is minimum when T=0. I t becomes large rapidly with increase of T. (3) Length for NG is always shorter than that for NR.

References [1] Carroll, R.J., Ruppert, D., Stefanski, L.A.: Measurement Error in Nonlinear Models, Chapman & Hall, New York (1995). [2] Efron, B., Tibshirani, R.J.: An Introduction to the Bootstrap, Chapman & Hall, New York (1993). [3] Rosner, B, Willett, W.C., Spiegelman D.: Correction of logistic regression relative risk estimates and confidence intervals for systematic within-person measurement error, Statistics in Medicine, 8, (1989).