Donna Morrell, CTR NAACCR 2014 Annual Conference Ottawa, Ontario, Canada June 25, 2014 Using Scanners and Optical Character Recognition for Pathology Report.

Slides:



Advertisements
Similar presentations
Corporate Imaging Kathy Mumford Session 2 Room C Tennessees BEST 2005 Tech Tips User Conference.
Advertisements

Why AAA Digital Document Solutions? Are you still storing documents in file cabinets? Do you have a plan in case of fire or natural disaster? Is your company.
The following is designed to give a brief understanding of the different methods you can use to scan, file and search documents in FILEstream. FILEstream.
 A word processor is a computer that can be used for writing, editing and printing text.  We are going to use a word processing package called Microsoft.
Premier Director Document Imaging
CAPTURE SOFTWARE Please take a few moments to review the following slides. Please take a few moments to review the following slides. The filing of documents.
Web Plus Overview Division of Cancer Prevention and Control National Center for Chronic Disease Prevention and Health Promotion CDC Registry Plus Training.
IMAGES OF THE PAST AND FUTURE Donna Calabrese Director, Office of Vital Statistics Center for Health and Environmental Statistics Kansas Department of.
Importing Transfer Equivalencies: How to Maximize Efficiency How Columbia College Office of Registrar improved productivity through third party solutions.
Got Paper? Thinking about going paperless or at least as paperless as possible? NAMVBC-2013.
Digitizing Jury Records QUEENS DIVISION OF JURORS AUDREY I. PHEFFER QUEENS COUNTY CLERK COMISSIONER OF JURORS 1.
Software to Manage EEP Vegetation Plot Data A design proposal Michael Lee January 31, 2011.
Chapter 7 The Paper Claim: CMS-1500 (08-05) Elsevier items and derived items © 2010, 2008 by Saunders, an imprint of Elsevier Inc.
The Geant4 physics validation repository
© 2013 The McGraw-Hill Companies, Inc. All rights reserved. Chapter 9 Tests, Procedures, and Codes.
Unit 4: Monitoring Data Quality For HIV Case Surveillance Systems #6-0-1.
Manifest Program Review and QA/QC Discussion. CIWMB
TS16949 requirements Subjects –Audit planning –Recertification audit requirements –Auditing Remote supporting functions.
Access 2007 ® Use Databases How can Microsoft Access 2007 help you to enter and organize information?
It’s an Element, Dear Watson! Project Overview Teacher Planning Work Samples & Reflections Teaching Resources Assessment & Standards Classroom Teacher.
Database Design IST 7-10 Presented by Miss Egan and Miss Richards.
Document Management System for Healthcare Industry From Infocrew Solutions Pvt.Ltd.
Page 1 Copyright Jill M. Forrester This work is the intellectual property of the author. Permission is granted for this material to be shared for.
CHAPTER 6 ELECTRONIC DATA PROCESSING SYSTEMS
TH-OCR NK. content introduction go to next page background assumptions overall structure chart IPO for overall structure dataflow diagram of overall structure.
Scanning on Your Ricoh 2232 / 2238 Instructions for scanning documents on your Ricoh copier (takes about 5 minutes) Click your mouse or press your “Down.
HEALTH FLOW Eliminate paper forms and documents Eliminate ordering and inventorying costs of paper forms Substantially reduce operating costs Improve.
Topics Covered: Data preparation Data preparation Data capturing Data capturing Data verification and validation Data verification and validation Data.
University of Idaho Business 378 – Project Management Yoshi Pitkin.
UNSD-ESCWA Regional Workshop on Census Data Processing in the ESCWA region: Contemporary technologies for data capture, methodology and practice of data.
The ePortfolio and Student Evaluation A training presentation by: Amy Cannady Robin Drewry Bonnie Hicks.
Poster Print Size: This poster template is 21” high by 45” wide and is printed at 200% for a 42” high by 90” wide poster. It can be used to print any poster.
IN THE MEANTIME…. INTERIM SOLUTIONS TO AUTOMATED DATA CAPTURE.
Questys Text & Image Management System Records Management for the Information Age.
UNSD Census Workshop Day 2 - Session 7 Data Capture: Intelligent Character Recognition Andy Tye – International Manager DRS are Worldwide specialists in.
BNR – Stroke: data entry and data management CAREC/PAHO Curacoa,15-16 November 2010 Gina Pitts, BNR-CVD Registrar Chronic Disease Research Centre, Jemmotts.
ITGS Databases.
Poster Print Size: This poster template is 44” high by 44” wide. It can be used to print any poster with a 1:1 aspect ratio. Placeholders: The various.
Back to the Future: The Brave New World of Electronic Documentation/Processes Society of Ontario Adjudicators and Regulators (SOAR) 2014 SOAR Annual Conference.
Verification & Validation. Batch processing In a batch processing system, documents such as sales orders are collected into batches of typically 50 documents.
Word Editing Tools. Word Automatic Editing Tools §Word has three features that automatically change or insert text and graphics as you type §You can easily.
Copyright © 2011 Delmar, Cengage Learning. ALL RIGHTS RESERVED. Chapter 5 Electronic Health Records.
Chapter 11 Enhancing an Online Form and Using Macros Microsoft Word 2013.
1. STARTING POINT Data is an important part of helping State and Local HDs achieve better health outcomes for their constituencies. Currently there appear.
Session 6: Data Flow, Data Management, and Data Quality.
ImageNow -- An Overview --. What is ImageNow?  Loyola’s document imaging and workflow application  Primary application (web based and desktop) of the.
Automation Living in a Paper Oriented World and The Steps to Automation.
Capture This! PO105 James Green. Table of Contents Capture Overview Laserfiche Tools Case Scenarios Questions and Answers.
Using NAPIIA to Improve the Accuracy of Asian Race Code in Registry Data Mei-Chin Hsieh, MSPH, CTR Lisa A. Pareti, BS, RHIT, CTR Vivien W. Chen, PhD NAACCR.
Scott Van Heest IT Specialist, Data Analysis and Support Team, NPCR, CDC Denise Farmer CDC/NPCR Contractor Division of Cancer Prevention and Control National.
Poster Print Size: This poster template is 36” high by 48” wide. It can be used to print a Tri-Fold poster with 12” wings. Placeholders: The various elements.
Automatic Data Capture  Process where many techniques are used to automatically collect data without need for manual entry. Manual entry has following.
Treatment Capture from Follow Back to Oncology Offices by Frances Ross Presented at the 2013 NAACCR Annual Conference Austin, TX.
NPCR – Advancing E-cancer Reporting and Registry Operations (NPCR-AERRO): An Update on Innovative Activities NAACCR Annual Conference June 16, 2009 Sandy.
Melody A. Currey, Commissioner State of Connecticut Out of State Compact Notice Scanning and Data Entry System.
Dr. Kęstutis Adamonis, Dr. Romanas Zykus,
Automating Accounts Payable
Your Name Proposal Creation Module 5 Your Name
Optimizing your EMR in the Cancer Registry
Presented by: Jeff Moore – Artsyl Technologies, Inc.
ACE Presentation for Industry
Web Plus Version 2: Secure Web-based Functions For Death Certificate and Pathology Lab Follow-back Efforts Kathleen Thoburn, Sanjeev Baral (CDC/NPCR.
Why did you choose us? To address and provide a solution to the many problems associated with your current manual filing system -Problems include: -Lack.
Word Editing Tools.
Cayuse 424 Proposal Creation.
Data Capture Process Stages
Component 11/Unit 5 Creating Data Entry Templates
Presented by: Jeff Moore – Artsyl Technologies, Inc.
Final Design Authorization
CHAPTER 6 ELECTRONIC DATA PROCESSING SYSTEMS
Presentation transcript:

Donna Morrell, CTR NAACCR 2014 Annual Conference Ottawa, Ontario, Canada June 25, 2014 Using Scanners and Optical Character Recognition for Pathology Report Collection

Background Background Plan Plan Method Method OCR Process OCR Process Results Results Conclusion Conclusion

Using Scanners and Optical Character Recognition for Pathology Report Collection A hallmark of the Los Angeles Cancer Surveillance Program (CSP) is 100% pathology report collection since 1972 A hallmark of the Los Angeles Cancer Surveillance Program (CSP) is 100% pathology report collection since % of pathology reports are received electronically through ePath 65% of pathology reports are received electronically through ePath 35% are obtained as paper pathology reports from the hospitals or labs 35% are obtained as paper pathology reports from the hospitals or labs In this presentation we describe a new approach to obtaining the paper pathology reports In this presentation we describe a new approach to obtaining the paper pathology reports

Background Pathology reports are used for: Pathology reports are used for: –casefinding - assuring a complete cancer case report is received for every reportable pathology report –research studies –quality assurance visual editing

Background Paper pathology reports were stapled to paper copies of the reported case abstract Paper pathology reports were stapled to paper copies of the reported case abstract –For all cases –Even though abstracts were received electronically and ePath was implemented at some facilities All paper documents have been digitized, capturing an image of the pathology report All paper documents have been digitized, capturing an image of the pathology report Key identifiers (regional admission and tumor numbers) have been captured for easy retrieval of the pathology report image Key identifiers (regional admission and tumor numbers) have been captured for easy retrieval of the pathology report image Images of abstracts and pathology reports are available for use by researchers and registry staff Images of abstracts and pathology reports are available for use by researchers and registry staff

Plan Realizing the importance of a more secure method for capturing the non-ePath pathology reports, in 2010 we began development of a “paperless” process Realizing the importance of a more secure method for capturing the non-ePath pathology reports, in 2010 we began development of a “paperless” process  Replaces insecure transportation of paper pathology reports containing personal health information from hospitals and labs to the CSP office  Replaces storage of paper pathology reports at the registry office Added benefits: Decreases both field staff and in-house staff effort in acquiring and processing and storing paper pathology reports Eliminates the need to digitize the reports

Plan Create the capacity to electronically capture key data elements using Optical Character Recognition (OCR) technology Create the capacity to electronically capture key data elements using Optical Character Recognition (OCR) technology –Patient name, birthdate, pathology report number, pathology report date and originating facility Create and electronically store an image of the original pathology report Create and electronically store an image of the original pathology report Ultimately, merge the OCR data and images with the ePath pathology reports for use by researchers and registry staff Ultimately, merge the OCR data and images with the ePath pathology reports for use by researchers and registry staff

Methods Software was created to allow on-site scanning of pathology reports by registry field staff into an encrypted laptop Software was created to allow on-site scanning of pathology reports by registry field staff into an encrypted laptop –Replaced photocopying pathology reports and transporting paper copies to the registry office –Staff have 2 scanners and a laptop  Heavy duty scanner that can scan 80 pages per minute and weighs a bit under 7 pounds  Light scanner that scans 15 pages per minute and weighs 2.2 pounds –Depending on the volume of pathology reports at a facility, the staff has a choice between the two scanners

Field Tech Equipment

Methods Pathology report templates were created for over 120 unique pathology report formats Pathology report templates were created for over 120 unique pathology report formats –Labor-intensive process –Never-ending, as formats change constantly, often with minor changes, such as insertion or deletion of a comma – or a space, which severely impacts the OCR process

Methods The software used is Abbyy Flexi-Layout 10 for the template creation and Flexi-Capture for the OCR process The software used is Abbyy Flexi-Layout 10 for the template creation and Flexi-Capture for the OCR process The software can be programed to recognize the specific data items to be captured for OCR process The software can be programed to recognize the specific data items to be captured for OCR process The next slide shows capture of patient name, medical record number, birthdate, pathology report date, and pathology report number The next slide shows capture of patient name, medical record number, birthdate, pathology report date, and pathology report number

Creating a Template for an Individual Hospital

OCR Process Divider pages are manually inserted to identify each individual pathology report Divider pages are manually inserted to identify each individual pathology report –Purpose of divider page is so the OCR program can identify the beginning page for each individual pathology report –Boring, monotonous process performed by non- CTR staff person –We originally had CTR field staff inserting dividers (not a good use of their skills) and are still investigating a more automated process

USC Divider Page

OCR Process Files with divider pages inserted are electronically run through the OCR process, using Flexi-Capture software Files with divider pages inserted are electronically run through the OCR process, using Flexi-Capture software The OCR process runs at night, without human intervention The OCR process runs at night, without human intervention After the OCR process, the software will electronically split the pathology reports into two parts: After the OCR process, the software will electronically split the pathology reports into two parts: –If no data items need to be reviewed, the pathology reports will be exported to a CSP server –If a problem(s) is detected, the pathology report will be exported to a verification process

Verification Process –Verification process is for pathology reports identified as needing review of data item(s)  Questionable items are highlighted by OCR system –examples:  0 vs O; l vs 1; c vs e, etc –spacing on scanned report moves a data item out of identified space on template –Non-CTR CSP staff review the highlighted data item(s) and manually make corrections

OCR Problems

Aligned Scan Non Aligned Scan

Verification The red check indicates this pathology report has a problem that needs review The red check indicates this pathology report has a problem that needs review

Verification Example of an opened pathology report needing verification. The left view shows the pathology report, the right view shows the field in red that needs to be verified.

Verification Once all corrections have been made, a green “Verified” will appear Once all corrections have been made, a green “Verified” will appear After all pathology reports in the batch have been corrected, the OCR process automatically exports all of the pathology reports to a CSP server After all pathology reports in the batch have been corrected, the OCR process automatically exports all of the pathology reports to a CSP server

OCR Process Finally, both the problem-free pathology reports and verified pathology reports are processed through a final “checker” program Finally, both the problem-free pathology reports and verified pathology reports are processed through a final “checker” program The checker program identifies additional incorrect information that was not identified by the OCR process or verification process The checker program identifies additional incorrect information that was not identified by the OCR process or verification process –duplicate pathology reports –problems with the captured data items, such as dates with too few characters –missing data items –problems with patient name and/or age This program requires registry staff to manually review and correct the problems This program requires registry staff to manually review and correct the problems After this final check, all pathology reports are made available for viewing and research uses After this final check, all pathology reports are made available for viewing and research uses –Are also available for linkage to full case reports

Checker Program

Results All pathology reports from 2011-forward are being processed through the OCR system All pathology reports from 2011-forward are being processed through the OCR system The security of pathology reports and the patient PHI is vastly improved The security of pathology reports and the patient PHI is vastly improved The 2-step verification achieves near 100% accuracy The 2-step verification achieves near 100% accuracy

Lessons Learned Changes and enhancements are continually improving the entire process Changes and enhancements are continually improving the entire process The OCR process is not error-proof The OCR process is not error-proof Implementing the process has taken more time than envisioned and has been very labor-intensive Implementing the process has taken more time than envisioned and has been very labor-intensive

Lessons Learned Even those reports indicated as problem- free by the OCR process may contain errors Even those reports indicated as problem- free by the OCR process may contain errors –We are still discovering errors and defining processes to both correct the errors and to prevent them from occurring in the future The “checker” program has greatly enhanced accuracy The “checker” program has greatly enhanced accuracy

Lessons Learned Continual staff interaction is needed to assure accuracy Continual staff interaction is needed to assure accuracy –pathology report templates constantly need monitoring for changes –the verification of data items and final check process must be completed –continual monitoring that all files are processed in a timely manner is important

Conclusions We are encouraged that use of this technology has increased security, minimized duplicative data entry and eliminated the redundant digitizing of already-electronic reports compared to our previous paper-based processes We are encouraged that use of this technology has increased security, minimized duplicative data entry and eliminated the redundant digitizing of already-electronic reports compared to our previous paper-based processes It has not been easy, but it has been worthwhile! It has not been easy, but it has been worthwhile!

Acknowledgements Moses Villa, OCR Specialist John Casagrande, DrPH Meryl Leventhal, MA, CTR Dianne Kerford, CTR Dennis Deapen, DrPH Please feel free to contact Moses Villa with any further questions: