Reference Value Estimators in Key Comparisons Margaret Polinkovsky Advisor: Nell Sedransk NIST May 2004.

Slides:



Advertisements
Similar presentations
DATA & STATISTICS 101 Presented by Stu Nagourney NJDEP, OQA.
Advertisements

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 10 The Analysis of Variance.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Chapter 7 Title and Outline 1 7 Sampling Distributions and Point Estimation of Parameters 7-1 Point Estimation 7-2 Sampling Distributions and the Central.
Exploring uncertainty in cost effectiveness analysis NICE International and HITAP copyright © 2013 Francis Ruiz NICE International (acknowledgements to:
Decision Making: An Introduction 1. 2 Decision Making Decision Making is a process of choosing among two or more alternative courses of action for the.
Primary and Derived Measures Terminology Prioritization 1.
Elementary hypothesis testing Purpose of hypothesis testing Type of hypotheses Type of errors Critical regions Significant levels Hypothesis vs intervals.
Chapter Topics Confidence Interval Estimation for the Mean (s Known)
ANALYTICAL CHEMISTRY CHEM 3811
1 D r a f t Life Cycle Assessment A product-oriented method for sustainability analysis UNEP LCA Training Kit Module k – Uncertainty in LCA.
Chapter 2: The Research Enterprise in Psychology
Statistical Methods For Engineers ChE 477 (UO Lab) Larry Baxter & Stan Harding Brigham Young University.
Section 2: Science as a Process
Chapter 2: The Research Enterprise in Psychology
Fundamentals of Statistical Analysis DR. SUREJ P JOHN.
Fundamentals of Data Analysis Lecture 7 ANOVA. Program for today F Analysis of variance; F One factor design; F Many factors design; F Latin square scheme.
Regression Analysis (2)
QNT 531 Advanced Problems in Statistics and Research Methods
INT 506/706: Total Quality Management Introduction to Design of Experiments.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Comparing Three or More Means 13.
Today’s lesson Confidence intervals for the expected value of a random variable. Determining the sample size needed to have a specified probability of.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Basic Business Statistics 11 th Edition.
Confidence Interval Estimation
American Association for Laboratory Accreditation Practical Solutions to Traceability and Uncertainty in Accreditation Presented to CITAC-NCSLI Joint Workshop.
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 08/10/ :23 PM 1 Some basic statistical concepts, statistics.
Andrew Thomson on Generalised Estimating Equations (and simulation studies)
The Scientific Method Formulation of an H ypothesis P lanning an experiment to objectively test the hypothesis Careful observation and collection of D.
1 Design and Analysis of Engineering Experiments Chapter 1: Introduction.
Introduction Osborn. Daubert is a benchmark!!!: Daubert (1993)- Judges are the “gatekeepers” of scientific evidence. Must determine if the science is.
Meta-analysis 統合分析 蔡崇弘. EBM ( evidence based medicine) Ask Acquire Appraising Apply Audit.
Brian Macpherson Ph.D, Professor of Statistics, University of Manitoba Tom Bingham Statistician, The Boeing Company.
Chapter 7 The Logic Of Sampling. Observation and Sampling Polls and other forms of social research rest on observations. The task of researchers is.
Lecture 2 Forestry 3218 Lecture 2 Statistical Methods Avery and Burkhart, Chapter 2 Forest Mensuration II Avery and Burkhart, Chapter 2.
1 Review from previous class  Error VS Uncertainty  Definitions of Measurement Errors  Measurement Statement as An Interval Estimate  How to find bias.
1 Chapter 1: Introduction to Design of Experiments 1.1 Review of Basic Statistical Concepts (Optional) 1.2 Introduction to Experimental Design 1.3 Completely.
Copyright  2003 by Dr. Gallimore, Wright State University Department of Biomedical, Industrial Engineering & Human Factors Engineering Human Factors Research.
ANOVA: Analysis of Variance.
POOLED DATA DISTRIBUTIONS GRAPHICAL AND STATISTICAL TOOLS FOR EXAMINING COMPARISON REFERENCE VALUES Alan Steele, Ken Hill, and Rob Douglas National Research.
Introduction to Earth Science Section 2 Section 2: Science as a Process Preview Key Ideas Behavior of Natural Systems Scientific Methods Scientific Measurements.
Lecture 2d: Performance Comparison. Quality of Measurement Characteristics of a measurement tool (timer) Accuracy: Absolute difference of a measured value.
ICCS 2009 IDB Workshop, 18 th February 2010, Madrid 1 Training Workshop on the ICCS 2009 database Weighting and Variance Estimation picture.
Measures of Reliability in Sports Medicine and Science Will G. Hopkins Sports Medicine 30(4): 1-25, 2000.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7-5 Estimating a Population Variance.
Design and Analysis of Experiments Dr. Tai-Yue Wang Department of Industrial and Information Management National Cheng Kung University Tainan, TAIWAN,
Chapter 10 The t Test for Two Independent Samples
1 Module One: Measurements and Uncertainties No measurement can perfectly determine the value of the quantity being measured. The uncertainty of a measurement.
ICCS 2009 IDB Seminar – Nov 24-26, 2010 – IEA DPC, Hamburg, Germany Training Workshop on the ICCS 2009 database Weights and Variance Estimation picture.
1 URBDP 591 A Analysis, Interpretation, and Synthesis -Assumptions of Progressive Synthesis -Principles of Progressive Synthesis -Components and Methods.
Statistical methods for real estate data prof. RNDr. Beáta Stehlíková, CSc
European Patients’ Academy on Therapeutic Innovation The Purpose and Fundamentals of Statistics in Clinical Trials.
Course: Research in Biomedicine and Health III Seminar 5: Critical assessment of evidence.
Week 21 Order Statistics The order statistics of a set of random variables X 1, X 2,…, X n are the same random variables arranged in increasing order.
Evaluation of statistical methods for meta-analysis Julian Higgins School of Social and Community Medicine University of Bristol, UK 1 Cochrane Methods.
Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”
Indicate the length up to one estimated value…. A. B. C.
Home Reading Skoog et al. Fundamental of Analytical Chemistry. Chapters 5 and 6.
1 Life Cycle Assessment A product-oriented method for sustainability analysis UNEP LCA Training Kit Module k – Uncertainty in LCA.
Statistical issues in the validation of surrogate endpoints Stuart G. Baker, Sc.D.
Stats Methods at IC Lecture 3: Regression.
Comparing Three or More Means
Section 2: Science as a Process
Psychology 202a Advanced Psychological Statistics
Chapter 2 Simple Comparative Experiments
Discrete Event Simulation - 4
Types of Control I. Measurement Control II. Statistical Control
1-Way Random Effects Model
Chapter 7 Lecture 3 Section: 7.5.
Design Of Experiment Eng. Ibrahim Kuhail.
STATISTICS INFORMED DECISIONS USING DATA
Presentation transcript:

Reference Value Estimators in Key Comparisons Margaret Polinkovsky Advisor: Nell Sedransk NIST May 2004

Statistics, Metrology and Trade Statistics Estimation for measurements (1 st moment) Attached Uncertainty (2 nd moment) Incredible precision in National Metrology Institute (NMI) Superb science Exquisite engineering Statistical analysis

What are Key Comparisons? Each comparability experiment Selected critical and indicative settings – “Key” Tightly defined and uniform experimental procedures Purpose Establish degree of equivalence between national measurement standards Mutual Recognition Arrangement (MRA) 83 nations Experiments for over 200 different criteria

Seller a Buyer b SpecificationsRequirements Measurements NMI b NMI a Comparability NMI: National Metrology Institute products money Equivalence

Elements of Key Comparisons Key points for comparisons Experimental design for testing Participating NMIs Measurement and procedure for testing Statistical design of experiments Analysis of target data Statistical analysis of target data Scientific review of measurement procedure

Issues for Key Comparisons Goals: To estimate NMI-NMI differences To attach uncertainty to NMI-NMI differences To estimate Key Comparison Reference Value (KCRV) To establish individual NMI conformance to the group of NMIs To estimate associated uncertainty Complexity Artifact stability; Artifact compatibility; Other factors Pilot NMI NMIs

Statistical Steps Step 1 Design Experiment (statistical) Step 2 Data collected and statistically analyzed Full statistical analysis Step 3 Reference value and degree of equivalence determined Corresponding uncertainties estimated

Present State of Key Comparisons No consensus among NMIs on best choice of procedures at each step Need for a statistical roadmap Clarify choices Optimize process

Outcomes of Key Comparisons Idea “True value” Near complete adjustment for other factors Model based, physical law based Non-measurement factors Below threshold for measurement Precision methodology assumptions Highly precise equipment used to minimize variation Repetition to reduce measurement error

Outcomes of Key Comparisons Each NMI: Observation= “True Value”+ measurement +non-measurable error error same for all NMIs varies for NMI varies for NMI (after adjustment,if any ) data based estimate different expert for each NMI common artifact “statistical uncertainty“ “non-statistical or physical event uncertainty” Combined uncertainty Goal Estimate “True Value”: KCRV Estimated combined uncertainty and degrees of equivalence

Problems to Solve Define Best Estimator for KCRV Data from all NMIs combined Many competing estimators Unweighted estimators Median ( for all NMIs) Simple mean Weighted by Type A Weighted by 1/Type A (Graybill-Deal) Weighted by both Type A and Type B Weighted by 1/(Type A + Type B) (weighted sum) DerSimonian-Laird Mandel-Paule

Role of KCRV Used as reference value “95% confidence interval” Equivalence condition for NMI

Research Objectives Objectives Characterize behavior of 6 estimators Examine differences among 6 estimators Identify conditions governing estimator performance Method Define the structure of inputs to data Simulate Analyze results of simulation Estimator performance Comparison of estimators

Model for NMI i  Reference value: KCRV   (Laboratory/method) bias : Type B   (Laboratory/method) deviation : Type B   Measurement error: Type A 

Process  2 =  2 +   2 (Type B)  = systematic bias   2 = extra-variation NMI y =  +  +   * +  s 2 = data-based variance estimate (Type A)  = experiment-specific bias   * = experiment-specific deviation  = random variation Data y s 2 Expertise  2 Sources of Uncertainty NMI Process

Translation to Simulation  Scientist: unobservable  Systematic Bias ~N(, ) Simulate random  Extra-variation Simulate random  Data: observable  Observed “Best Value”  Variance estimate s 2 ~ 1/df( ) Simulate random  Experimental Bias Simulate random  Experimental Deviation Simulate random Uncertainty Type A: s 2 Type B:

Conclusions and Future Work Conclusions Uncertainty affects MSE more than Bias Estimators performance Graybill-Deal estimator is least robust Dersimonian-Laird and Mandel-Paule perform well When 1 NMI is not exchangeable the coverage is effected Number of labs changes parameters Future work Use on real data of Key Comparisons Examine other possible scenarios Further study degrees of equivalence Pair-wise differences

Looking Ahead Use on real data of Key Comparisons Examine other possible scenarios Further study degrees of equivalence Pair-wise differences

References R. DerSimonian and N. Laird. Meta-analysis in clinical trials. Controlled Clinical Trials, 75: , 1980 F. A. Graybill and R. B. Deal. Combining unbiased estimators. Biometrics, 15: , 1959 R. C. Paule and J. Mandel. Consensus values and weighting factors. Journal of Research of the National Bureau of Standards, 87: P.S.R.S. Rao. Cochran’s contributions to the variance component models for combining estimators. In P. Rao and J. Sedransk, editors. W.G. Cochran’s Impact on Statistics, Volume II. J. Wiley, New York, 1981 A. L. Rukhin. Key Comparisons and Interlaboratory Studies (work in progress) A. L. Rukhin and M.G. Vangel. Estimation of common mean and weighted mean statistics. Jour. Amer. Statist. Assoc., 73: , 1998 J.S. Maritz and R.G. Jarrett. A note on estimating the variance of the sample median. Jour. Amer. Statist. Assoc., 93: , 1998 SED Key Comparisons Group (work in progress)