Presentation is loading. Please wait.

Presentation is loading. Please wait.

Measuring Coding Accuracy Artificial Intelligence in Medicine National Cancer Institute.

Similar presentations


Presentation on theme: "Measuring Coding Accuracy Artificial Intelligence in Medicine National Cancer Institute."— Presentation transcript:

1 Measuring Coding Accuracy Artificial Intelligence in Medicine National Cancer Institute

2 Project This project was funded in part by Contract Number 263-MQ-514922 from the National Cancer Institute Participating registries Kentucky Cancer Registry Los Angeles Cancer Registry Atlanta Cancer Registry New Jersey Cancer Registry

3 Objective Develop a software tool that measures the accuracy of an automated coding system against a reference data set. Sub-tasks Define a coding accuracy model. Create a software tool that accepts input from any automated coding system to produce accuracy data.

4 Automated coding CLINICAL HISTORY/MACROSCOPY Right mastectomy and axillary tissue. A right mastectomy specimen with overlying skin measuring 220mm x 85mm and underlying breast tissue measuring 220mm x 100mm x 70mm. The axillary tail measures 125 x 60mm. The nipple is slightly retracted and located centrally. The superior margin is painted red, the inferior margin painted green and the deep cut margin is painted blue. Cut sections of the underlying breast tissue shows an ill-defined grey white yellow lesion with patchy areas of haemorrhage measuring 35 x 35 x 35mm located immediately below the nipple, 20mm from the inferior margin, 45mm from the deep cut margin, 50mm from the superior margin, 85mm from the medial margin and 100mm from the lateral cut margin. A1 - nipple, B1 - upper outer quadrant, C1 - upper inner quadrant, D1 - lower outer quadrant, E1 - lower inner quadrant, F1, G1 - tumour composite blocks, H1, I1 - tumour composite blocks, J1 - deep cut margin, K1 - superior margin, L1 – inferior margin, M4 - lymph nodes, N4 - lymph nodes, O - 3 serial slices, lymph node, P - 3 lymph nodes. MICROSCOPY This right mastectomy specimen demonstrates an invasive ductal carcinoma with the following pathological features: TUMOUR HISTOLOGY & GRADE The tumour is of an infiltrating poorly differentiated ductal carcinoma of non-otherwise specified type. The tumour is poorly defined and extremely infiltrative, comprising poorly-formed tubules, nests or strands of cuboidal tumour cells displaying high grade nuclei. The tumour cells are set within fibrotic desmoplastic stroma. Many lactiferous ducts are entrapped within the tumour. Frequent tumour mitoses are seen. Microcalcification is seen in some neoplastic tubules. Tumour grade (Modified Bloom-Richardson Scoring System): Tubular formation: 3 Nuclear atypia: 3 Tumour mitoses: 2 Total score: 8 (Grade III) TUMOUR LOCATION, SIZE AND EXTENT The tumour is located 5mm below the nipple and has a macroscopic size of 35mm across. The border of the tumour is poorly circumscribed and infiltrative. INTRA-LYMPHOVASCULAR OR PERINEURAL TUMOUR PERMEATION Focal intralymphatic tumour permeation is noted. No perineural tumour invasion is seen in sections submitted. M-80103 M-85003 M-80003 C50.9 C77.9

5 What is measured? Sensitivity, specificity, reducibility and confidence Of a single code, either (topography) or (morphology) Of a pair of codes (topography, morphology)

6 Notation: Adenocarcinoma “M-81403” as the subject code NotationThe report was coded by the: Reference methodAutomatic method X : XAs X (M-81403) As X (M-81403) X : OAs X (M-81403) As other than X (M-82113) O : XAs other than X (M-82003) As X (M-81403) O : OAs other than X (M-82003) As other than X (M-82003) X : X+OAs X (M-81403) As X plus other codes (M-81403, M-80103) O : X+OAs other than X (M-82003) As X plus other codes (M-81403, M-82003) 6 Possibilities

7 Definitions NotationThe report was coded by the: Reference methodAutomatic method X : XAs X X : OAs XAs other than X O : XAs other than XAs X O : OAs other than X X : X+OAs XAs X plus other codes O : X+OAs other than XAs X plus other codes Note: The labeled areas include only the portions that are bounded by the arcs, not the entire circle. Q O:O B X:O A X:X R O:X C X:X+O S O:X+O Venn Diagram

8 Definitions NotationThe report was coded by the: Reference methodAutomatic method X : XAs XAs X only X : OAs XAs other than X O : XAs other than XAs X only O : OAs other than X X : X+OAs XAs X plus other codes O : X+OAs other than XAs X plus other codes Note: The labeled areas include only the portions that are bounded by the arcs, not the entire circle. Q O:O B X:O A X:X R O:X C X:X+O S O:X+O Venn Diagram

9 Definitions NotationThe report was coded by the: Reference methodAutomatic method X : XAs XAs X only X : OAs XAs other than X O : XAs other than XAs X only O : OAs other than X X : X+OAs XAs X plus other codes O : X+OAs other than XAs X plus other codes Venn Diagram

10 Definitions NotationThe report was coded by the: Reference methodAutomatic method X : XAs XAs X only X : OAs XAs other than X O : XAs other than XAs X only O : OAs other than X X : X+OAs XAs X plus other codes O : X+OAs other than XAs X plus other codes Note: The labeled areas include only the portions that are bounded by the arcs, not the entire circle. Q O:O B X:O A X:X R O:X C X:X+O S O:X+O Venn Diagram

11 Definitions NotationThe report was coded by the: Reference methodAutomatic method X : XAs XAs X only X : OAs XAs other than X O : XAs other than XAs X only O : OAs other than X X : X+OAs XAs X plus other codes O : X+OAs other than XAs X plus other codes Note: The labeled areas include only the portions that are bounded by the arcs, not the entire circle. Q O:O B X:O A X:X R O:X C X:X+O S O:X+O Venn Diagram

12 Definitions NotationThe report was coded by the: Reference methodAutomatic method X : XAs XAs X only X : OAs XAs other than X O : XAs other than XAs X only O : OAs other than X X : X+OAs XAs X plus other codes O : X+OAs other than XAs X plus other codes Note: The labeled areas include only the portions that are bounded by the arcs, not the entire circle. Q O:O B X:O A X:X R O:X C X:X+O S O:X+O Venn Diagram

13 Definitions NotationThe report was coded by the: Reference methodAutomatic method X : XAs XAs X only X : OAs XAs other than X O : XAs other than XAs X only O : OAs other than X X : X+OAs XAs X plus other codes O : X+OAs other than XAs X plus other codes Note: The labeled areas include only the portions that are bounded by the arcs, not the entire circle. Q O:O B X:O A X:X R O:X C X:X+O S O:X+O Venn Diagram

14 Definitions Note: The labeled areas include only the portions that are bounded by the arcs, not the entire circle. Q O:O B X:O A X:X R O:X C X:X+O S O:X+O Coding accuracy measures Confidence = A / (A+R) How much confidence can we place in the result. Reducibility = (A+R) / (A+R+C+S) How often is the subject code the only code identified when the code is identified. Specificity = Q / (Q+R+S) How often is the subject code not returned in those reports where it is not the reference code. Sensitivity = (A+C) / (A+B+C) How often is the reference code returned in those reports where it is the subject code.

15 Definitions Note: The labeled areas include only the portions that are bounded by the arcs, not the entire circle. Q O:O B X:O A X:X R O:X C X:X+O S O:X+O Coding accuracy measures Confidence = A / (A+R) How much confidence can we place in the result. Reducibility = (A+R) / (A+R+C+S) How often is the subject code the only code identified when the code is identified. Specificity = Q / (Q+R+S) How often is the subject code not returned in those reports where it is not the reference code. Sensitivity = (A+C) / (A+B+C) How often is the reference code returned in those reports where it is the subject code.

16 Definitions Note: The labeled areas include only the portions that are bounded by the arcs, not the entire circle. Q O:O B X:O A X:X R O:X C X:X+O S O:X+O Coding accuracy measures Confidence = A / (A+R) How much confidence can we place in the result. Reducibility = (A+R) / (A+R+C+S) How often is the subject code the only code identified when the code is identified. Specificity = Q / (Q+R+S) How often is the subject code not returned in those reports where it is not the reference code. Sensitivity = (A+C) / (A+B+C) How often is the reference code returned in those reports where it is the subject code.

17 Definitions Note: The labeled areas include only the portions that are bounded by the arcs, not the entire circle. Q O:O B X:O A X:X R O:X C X:X+O S O:X+O Coding accuracy measures Confidence = A / (A+R) How much confidence can we place in the result. Reducibility = (A+R) / (A+R+C+S) How often is the subject code the only code identified when the code is identified. Specificity = Q / (Q+R+S) How often is the subject code not returned in those reports where it is not the reference code. Sensitivity = (A+C) / (A+B+C) How often is the reference code returned in those reports where it is the subject code.

18 Data Flow Input Data Comparison, Matching and Analysis SQL Data Base With Query Assist Output Spread Sheets & Graphs SQL Database CODAC Source Data Coded Pathology Reports Reference Codes Automated Coding System Machine Generated Codes Accuracy Calculations Display Accuracy Data Input Data Comparison, Matching and Analysis SQL Data Base With Query Assist Output Spread Sheets & Graphs SQL Database CODAC Source Data Coded Pathology Reports Reference Codes Automated Coding System Machine Generated Codes Accuracy Calculations Display Accuracy Data

19 Software inputs

20 CODAC Front End

21 Software description Written in C#, uses latest.NET technology Runs on Standard Pentium workstation Imports and exports (CSV). Files can be edited with use text editor or Excel Optional Links to SQL database engine The performance of any automated coding system can be measured by using the specified data format

22 Software operation We ran 17128 pathology reports through the software. Software automatically calculates accuracy parameters by comparing reference data to test data.

23 Example of high confidence M-81403 (Adenocarcinoma) Sensitivity 0.82 Specificity 0.90 Reducibility 0.08 Confidence 0.87 Reference count 2647=15%

24 Example of high confidence M-81403,C61.9 (Adenocarcinoma, Prostate) Sensitivity 0.87 Specificity 0.99 Reducibility 0.06 Confidence 1.00 Reference count 1008=6%

25 Example of low confidence C44.9 (Skin) Sensitivity.57 Specificity.76 Reducibility.05 Confidence.03 Reference count 67 =.4%

26 Morphology Accuracy Plots

27 Code Pairs

28 An Experiment Modify AutoCode to produce output as follows: Take the largest morphology value Take the smallest topography value Example: Reduce M-82403 M-80001 C17.0 C16.9 C17.9 To M-82403 C16.9

29 Morphology – MinMax rule

30 Morphology – Before & After

31 Code Pairs – Before & After

32 Wrap Up Created a coding accuracy measurement system Applied to AIM’s AutoCode, but can be used to measure any coding system. Software available to public domain

33 Topography

34 Code Pairs – Min Max Rule


Download ppt "Measuring Coding Accuracy Artificial Intelligence in Medicine National Cancer Institute."

Similar presentations


Ads by Google