Concurrent Validity of Alternative CANS Outcome Metrics William A. Shennum Julian Leiro Delisa Young Five Acres Altadena, California
What is test validity? The concept of validity was formulated by T.L. Kelley in 1927, who stated that a test is valid if it measures what it claims to measure. For example a test of intelligence should measure intelligence and not something else, such as memory. Simplypsychology.org/validity
Concurrent validity Concurrent validity is a measure of how well a particular test correlates with a previously validated measure of the same construct. Concurrent validity relies upon measurements that take place at the same time. Explorable.com
Who we are Non profit organization serving greater Los Angeles community Research & Evaluation Department – Outcome & performance measurement – Quality & business process improvement – Program evaluation – Project management
Five Acres’ service programs
Why CANS? Need for a common outcome measure across programs for use in planning & CQI. CANS roll-out Training Inspiration – Dr. Lyons’ visit Technology Reporting
Individual Score Report (Excerpt) Client:“I” CANS Date:9/24/20132/3/2014 Life Domain Functioning IntakeDischarge 1. Family Living Situation Medical Sexuality School Achievement School Attendance 10 Youth Strengths 15. Family Interpersonal Educational 21
What About Program –Level CANS Outcomes? Several scoring options available Which ones work best? – Easily understood by stakeholders – Closely associated with other outcomes important to the organization (concurrent validity)
Study Design Three service levels were included. 1. Community based mental health 2. Wraparound 3. Residential treatment
Study Design - continued Youth intake and discharge CANS were compared (N=233) Five CANS outcome scoring metrics were computed for each youth Statistical associations of each CANS metric with concurrently measured discharge outcomes were examined.
Concurrently Measured Outcomes Service Intensity ProgramProgram Outcome/Success Measures Lower Higher Community based mental health % Service completion (EBP’s) Avg. YOQ points improved (caregiver) Wraparound % Program completion/ graduation % In community placement at discharge Residential treatment (RBS) % Program completion/ graduation % In less restrictive placement at discharge
CANS Outcome Metrics Examined in This Study
Metric 1. Intake-discharge change scores Method: Compute the average change from intake to discharge, for total score and for each domain Strength- easy to calculate Weaknesses – unit of measurement not common-sense, not tied to clients’ specific needs
Metric 1 – Intake/Discharge Change Scores, Example Youth # CANS Total Score Intake CANS Total Score DischargeChange RBS_ RBS_ RBS_ RBS_ RBS_ RBS_ RBS_ RBS_ RBS_ RBS_ Total Score
Program Results for Metric 1 Community Based Services WraparoundResidential Treatment Avg. Change SignificanceP<.001P<.004P<.02
Metric 2 - Reliable Change Index Analysis Method: Determine whether each individual’s change score is statistically significant, by comparing it with a Reliable Change cutoff value RCI Formula: change in an individual’s score divided by the standard error of the difference for the test. Parameters in formula include Intake and Discharge scores, test standard deviations, and test (rater) reliability. Result is a yes-no score for each youth, for CANS total score, and each domain. – Strength: statistically sound – Weaknesses: difficult to compute and explain to stakeholders
How to Calculate the RCI M. Horsill, 1. Calculate standard error of measurement 2. Calculate standard error of the difference 3. Calculate RCI If RCI is 1.96 or greater, the difference is statistically significant (95% confidence level, 2-tailed) = one-tailed test cutoff
Metric 2 - Reliable Change Index Analysis, Example Youth # CANS Total Score ChangeRCI Score >=1.96? (1=yes, 0-no) RBS_ RBS_ RBS_ RBS_ RBS_ RBS_ RBS_ RBS_ RBS_ RBS_ % Exceed RCI Cutoff 40.0%
Program Results for Metric 2 RCI Community Based Services WraparoundResidential Treatment >= %10.2%24.2% >= %12.6%30.3%
Metric 3. Reduction in Number of Actionable Needs Method: Compare the number of actionable needs at intake and discharge, total and by domain. – Strength- addresses standout needs – Weakness- not tied to an individual’s specific needs identified at intake.
Metric 3. Reduction in Number of Actionable Needs, Example Youth # # Actionable Needs Intake # Actionable Needs DischargeChange RBS_22324 RBS_ RBS_ RBS_ RBS_ RBS_71266 RBS_ RBS_ RBS_1356 RBS_ Total Score
Program Results for Metric 3 Community Based Services WraparoundResidential Treatment Avg. Change SignificanceP<.001P<.12P<.03
Metric 4 - % Actionable Needs Resolved Method: Track specific actionable items identified at intake, and calculate the % of these that are no longer actionable at discharge, total and by domain. – Strengths - tailored to individual, easy for stakeholders to understand – Weakness – somewhat complicated scoring
Metric 4 - % Actionable Needs Resolved, Example CANS Domain: Youth Strengths IntakeDischarge Need Resolved? (1=yes, 0=no) 15. Family Interpersonal Optimism Educational Vocational UU 20. Talents/Interests Spiritual/Religious Community Life Relationship Permanence Resiliency Resourcefulness 211 % Needs Resolved 75%
Program Results for Metric 4 Community Based Services WraparoundResidential Treatment % Needs Met72.8%49.2%61.6%
Metric 5 – Actionable Needs Improved Method: Track specific actionable items identified at intake, and calculate % of these that show any improvement, even if still actionable at discharge, total and by domain. – Strength - tailored to individual, easy for stakeholders to understand – Weaknesses – includes needs still remaining at discharge, somewhat complicated scoring
Metric 5 – Actionable Needs Improved, Example Youth Behavioral/Emotional Needs IntakeDischarge Any Improvement? (1=yes, 2=no) 41. Psychosis Impulse/Hyper Depression Anxiety Oppositional Conduct Adjustment to Trauma Anger Control Substance Use 00 % Any Improvement in Actionable Needs 80%
Program Results for Metric 5 Community Based Services WraparoundResidential Treatment % Needs Improved 75.3%59.4%66.6%
Concurrent Validity of the 5 CANS Program Outcome Metrics Are the metrics statistically associated with concurrently collected measures of program success?
All CANS Metrics Significantly Associated with Concurrent Program Outcomes CANS Outcome Metrics Association w/ Concurrent Outcomes 1. Avg. Change 2. RCI Cutoff 3. Change in # Needs 4. Needs Resolve d 5. Needs Improved Median Statistical Significance p =.020 p =.043 p =.009 p =.013 p =.018
Specific CANS Domains were Positively Associated with Concurrent Outcomes CANS Outcome Metrics for Selected Domains FunctioningYouth Strength Family Needs & Strengths Behavior & Emotional Needs Median Correlation w/ Outcomes r =.23 r =.33 r =.22 r =.28
Some Differences Across Program Were Evident CANS Outcome Metrics Program / OutcomeAvg. Change RCI Cutoff Change in # Needs Needs Resolved Needs Improved Community Based Service completionp<.001p<.05p<.001p<.01p<.05 YOQ Pointsp<.05 p<.01 Wraparound Program completion p<.001 In communityp<.01nsp<.05p<.10ns Residential (RBS) Program completion p<.10p<.05 Less restrictivens p<.05p<.10
CANS outcomes for ‘successful’ youth CANS Outcome Metrics Program Avg. Change in Total Score % Youth exceeding RCI 1.65 Avg. Change in # of Actionable Needs Avg. % Actionable Needs Met Avg. % Actionable Needs Improved Community MH % %78.9% Wraparound % %79.9% Residential % %74.6% Average % %77.8%
Trends “Reduction in number of actionable needs” and “% of actionable needs resolved” metrics tended to work best for most programs “Average change score” metrics worked well in program serving less severe youth Suggestive evidence favored youth strengths domain as a driver of overall improvement CANS outcome profiles for successful youth were similar across programs – benchmarks would assist in setting performance targets.
Take Aways All CANS metrics examined showed evidence of concurrent validity across a range of programs All CANS domain improvements were related to concurrent outcomes, with suggestive evidence that “youth strengths” had strongest relationship to success. Programs serving higher severity needs may benefit from measuring metrics related to actionable need resolution and improvement. Benchmark databases could assist programs in setting outcome targets for youth.
Contact Information Bill Shennum Director of Research & Evaluation Five Acres (626)