Standardizing VBP's Patient Satisfaction Data into Percentile Ranks Using Reference Distribution Jenhao Jacob Cheng, PhD, MS Alice Li, MS Nikolas Matthes,

Slides:



Advertisements
Similar presentations
Statistical Techniques I EXST7005 Start here Measures of Dispersion.
Advertisements

HCAHPS and Hospital Value-Based Purchasing (Hospital VBP) Agency for Healthcare Research and Quality Centers for Medicare and Medicaid Services.
The HCAHPS and Competency Connection HealthStream, Inc. The HCAHPS and Competency Connection HealthStream, Inc.
Chapter 10 Regression. Defining Regression Simple linear regression features one independent variable and one dependent variable, as in correlation the.
Correlation & Regression Chapter 15. Correlation statistical technique that is used to measure and describe a relationship between two variables (X and.
Meeting Patient Expectations Presentation by: Rhonda Jones & Rachael Seiter.
More about Correlations. Spearman Rank order correlation Does the same type of analysis as a Pearson r but with data that only represents order. –Ordinal.
1 Econometric Load Forecasting Peak and Energy Forecast 06/14/2005 Econometric Load Forecasting Peak and Energy Forecast 06/14/2005.
Chapter 9 - Lecture 2 Computing the analysis of variance for simple experiments (single factor, unrelated groups experiments).
Understanding Research Results
Patient Experience/ Satisfaction What’s at Stake ?
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
David Corne, and Nick Taylor, Heriot-Watt University - These slides and related resources:
Correlation.
QUALITY ASSURANCE Reference Intervals Lecture 4. Normal range or Reference interval The term ‘normal range’ is commonly used when referring to the range.
Nursing Excellence Conference April 19,2013
Jennifer Faerberg Director Health Care Affairs Medicare Hospital Value-Based Purchasing (VBP) Program – Proposed Rule Published January.
Chapter 11 Descriptive Statistics Gay, Mills, and Airasian
1 The Patient Perspective: Satisfaction Survey Presented at: Disease Management Colloquium June 22, 2005 Shulamit Bernard, RN, PhD.
● Final exam Wednesday, 6/10, 11:30-2:30. ● Bring your own blue books ● Closed book. Calculators and 2-page cheat sheet allowed. No cell phone/computer.
Components of Care Vary in Importance for Overall Patient-Reported Experience by Type of Hospitalization in the HCAHPS Survey Marc N. Elliott, PhD David.
Descriptive Statistics Prepared by: Asma Qassim Al-jawarneh Ati Sardarinejad Reem Suliman Dr. Dr. Balakrishnan Muniandy PTPM-USM.
Biostatistics Unit 5 – Samples. Sampling distributions Sampling distributions are important in the understanding of statistical inference. Probability.
Leapfrog’s Resource Utilization Measures & Severity-Adjustment Models April 25, 2008.
Descriptive statistics Petter Mostad Goal: Reduce data amount, keep ”information” Two uses: Data exploration: What you do for yourself when.
CROSS-VALIDATION AND MODEL SELECTION Many Slides are from: Dr. Thomas Jensen -Expedia.com and Prof. Olga Veksler - CS Learning and Computer Vision.
CORRELATION. Correlation key concepts: Types of correlation Methods of studying correlation a) Scatter diagram b) Karl pearson’s coefficient of correlation.
Measures of variability: understanding the complexity of natural phenomena.
Midterm Review Ch 7-8. Requests for Help by Chapter.
Sample Size Determination
Statistics Unit 9 only requires us to do Sections 1 & 2. * If we have time, there are some topics in Sections 3 & 4, that I will also cover. They tie in.
1.  In the words of Bowley “Dispersion is the measure of the variation of the items” According to Conar “Dispersion is a measure of the extent to which.
Introduction to statistics I Sophia King Rm. P24 HWB
Educational Research: Data analysis and interpretation – 1 Descriptive statistics EDU 8603 Educational Research Richard M. Jacobs, OSA, Ph.D.
Statistics Josée L. Jarry, Ph.D., C.Psych. Introduction to Psychology Department of Psychology University of Toronto June 9, 2003.
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
The Hospital CAHPS Program Presented by Maureen Parrish.
Copyright © Cengage Learning. All rights reserved. 8 9 Correlation and Regression.
Educational Research Descriptive Statistics Chapter th edition Chapter th edition Gay and Airasian.
Slide Slide 1 Chapter 10 Correlation and Regression 10-1 Overview 10-2 Correlation 10-3 Regression 10-4 Variation and Prediction Intervals 10-5 Multiple.
CMS SAS Users Group Conference Learn more about THE POWER TO KNOW ® October 17, 2011 Medicare Payment Standardization Modeling using SAS Enterprise Miner.
STA248 week 121 Bootstrap Test for Pairs of Means of a Non-Normal Population – small samples Suppose X 1, …, X n are iid from some distribution independent.
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
MM150 ~ Unit 9 Statistics ~ Part II. WHAT YOU WILL LEARN Mode, median, mean, and midrange Percentiles and quartiles Range and standard deviation z-scores.
Opinion spam and Analysis 소프트웨어공학 연구실 G 최효린 1 / 35.
Estimating standard error using bootstrap
Bootstrap and Model Validation
Examining Achievement Gaps
Sample Size Determination
CHAPTER 3 Describing Relationships
Data Mining: Concepts and Techniques
Impact of State Reporting Laws on Central Line– Associated Bloodstream Infection Rates in U.S. Adult Intensive Care Units Hangsheng Liu, Carolyn T. A.
Data Analysis & Report Writing
Sec. 2.1 Review 10th, z = th, z = 1.43.
Elementary Statistics
L. Isella, A. Karvounaraki (JRC) D. Karlis (AUEB)
CHAPTER 3 Describing Relationships
Quality Assurance Reference Intervals.
What is Regression Analysis?
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CAASPP Results 2015 to 2016 Santa Clara Assessment and Accountability Network May 26, 2017 Eric E, Zilbert Administrator, Psychometrics, Evaluation.
CHAPTER 3 Describing Relationships
MGS 3100 Business Analysis Regression Feb 18, 2016
Using the Rule Normal Quantile Plots
Using the Rule Normal Quantile Plots
CHAPTER 3 Describing Relationships
Presentation transcript:

Standardizing VBP's Patient Satisfaction Data into Percentile Ranks Using Reference Distribution Jenhao Jacob Cheng, PhD, MS Alice Li, MS Nikolas Matthes, MD, PhD, MPH Quality Indicator Project Press Ganey Associates, Inc. APHA Annual Meeting, Denver CO, November

2 Background –The 2010 Heath Care Reform Bill mandates the implementation of Medicare’s value-based Purchasing (VBP) program for fiscal year 2013 –VBP evaluates hospitals’ performance by 17 clinical measures for process of care and 9 patient satisfaction measures on a rate-based (percent) scale –Prior year’s performance serves as the baseline reference from which the benchmarks (median and 95 th percentile) are derived for assessment year period –Patient satisfaction data must be converted into percentile rank (0-100) based on baseline data to facilitate the scoring process

Percentile Rank –People usually get confused between percentile rate (in original data scale) and percentile ranks (always 0-100) –Typically we are interested to know a data distribution by percentile rates on some commonly used percentile ranks (e.g., 25, 50, 75) –Sometimes we convert the data into percentile ranks, like in this study, to reflect the position in the reference distribution for comparison purpose –Therefore, percentile rank is a standardized score to tell how good you are for any of the target measures –Determining percentile ranks within one dataset is easy but is more challenging when two datasets involved 3

4 Scoring for Patient Satisfaction Patient Satisfaction MeasureRatePercentile RankScore 1. Nurse Communication Doctor Communication Cleanliness and Quiet Staff Responsiveness Medications Communication Pain Management Discharge Information Overall Hospital Rating Lowest Percentile Rank Total First 8 measures are assigned a score up to 10 points based on percentile ranks and up to 20 additional points are awarded for the lowest percentile rank

5 Study Design –Retrospective cohort study based on hospital- level data. –Study population: 3,725 hospitals with HCAHPS data for both 2007 and 2008 from Hospital Compare website, maintained by Centers for Medicare and Medicaid Services (CMS) –For demonstration purposes only one patient satisfaction measure, Overall Hospital Rating, has been used.

6 Methodology –We investigated three methods to assign percentile ranks based on reference distribution: Cross Ranking (CR) Percentile Mapping (PM) Distribution Modeling (DM) –Each method is individually evaluated for accuracy, variation, and efficiency –Pairwise comparison is performed for each pair of methods to evaluate agreement with each other

Cross Ranking 7 –Ranking each data point repeatedly within the entire reference distribution, implemented by SAS Proc SQL –When using properly, SQL can be viewed as a rank-based statistical tool for data analysis –The feature of Cartesian product allows SQL to handle the repeated ordering of each data point (Y i ) in reference distribution (X) –Percentile rank of each data point is calculated by Count (X<Y i ) / N –N x N processing

SQL Code: Cartesian Product select y.rate, sum(case when x.rate<=y.rate then 1 else 0 end) as num, count(*) as den, 100*(calculated num / calculated den) as pctl from x, y group by id, y.rate; 8

Percentile Mapping –Create a percentile lookup table which contains 101 pairs of “rank” and “rate” listings, implemented by SAS Proc UNIVARIATE (or Oracle percentile function) –Mapping each data point (rate) to the nearest percentile value (rate) to obtain the corresponding percentile rank –Mapping can be implemented by a compound SQL technique called “Nearest Neighbor Join” which involves two “Unequal Join” procedures in opposite directions –Two unequal joins not only make sure the nearest mapping is obtained but also take care of the potential outlier in either direction of unequal join –N x 101 processing 9

SQL Code: Nearest Neighbor Join select rate, pctl from ( select y.rate, max(look.pctl) as pctl, min(abs(y.rate-look.rate)) as diff from y, look where y.rate>=look.rate group by id, y.rate union select y.rate, min(look.pctl) as pctl, min(abs(y.rate-look.rate)) as diff from y, look where y.rate<=look.rate group by id, y.rate ) having diff=min(diff); 10

Distribution Modeling –Compute the empirical cumulative distribution function (CDF) based on Kernel density estimation, implemented by SAS Proc KDE –Fitting CDF data by piecewise linear regression with 9 knots, implemented by SAS Proc REG –Note that the rounded predictions represent percentile ranks –Increase # of knots or include quadratic functions can improve the model fit but the gain is limited –Cubic functions are not suggested to avoid fitting a decreasing curve section 11

12 Modeling CDF Curve

13 Comparison Analysis –Accuracy of each method is evaluated by Kendall’s t coefficient for rank correlation between percentile rank and original data (rate) –Variation of each method is evaluated by the standard deviation –Efficiency of each method is evaluated by computation time using SAS 9.2 on a reference desktop computer –Pairwise agreement is evaluated by root mean square error (RMSE) between each pair of methods

Individual ComparisonPairwise Comparison MethodKendall tSDTime [sec]MethodRMSE CR CR-PM0.68 PM CR-DM0.89 DM PM-DM Comparison Results

15 Conclusion –DM is the most parsimonious and efficient method to convert data into percentile ranks based on reference distribution –Its statistical properties in terms of accuracy and variation are also a bit superior but requires more mathematical knowledge to understand the approach –CR can be easily implemented by standard IT solution like SQL but is much more computationally intensive and this may prevent it to be used as an online tool –PM presents an in-between choice which is still efficient for online applications but not that difficult for non- statistician (IT) to understand and implement it

16 Discussion –Statistical properties like accuracy and variation are not very different since they simply reflect how smoothly the percentile ranks are calculated –Therefore, they can be more different when N is smaller and/or many ties exist –On the other hand, efficiency is more distinguishing in this study when N is large and vise versa –PM is conceptually similar to DM but its calculation is more like CR in a discrete domain (while DM in continuous domain) –Pairwise comparison also confirms that CR and PM are in more agreement