Clinical Natural Language Processing: Part I Guergana K. Savova, PhD Children's Hospital Boston and Harvard Medical School.



Investigators (in alphabetical order)
• Children's Hospital Boston and HMS (site PI: Guergana Savova)
• MIT (site PI: Peter Szolovits)
• MITRE Corporation (site PI: Lynette Hirschman)
• Seattle Group Health (site PI: David Carrell)
• SUNY Albany (site PI: Ozlem Uzuner)
• University of California, San Diego (site PI: Wendy Chapman)
• University of Colorado (site PI: Martha Palmer)
• University of Pittsburgh (site PI: Henk Harkema)
• University of Utah and Intermountain Healthcare (site PI: Peter Haug)

Special Acknowledgement
• Our talented super software developers
  – Vinod Kaggal, lead
  – Dingcheng Li
  – Pei Chen
  – James Masanz

Overview
• Part 1:
  – Background and objectives of SHARP 4 cNLP project
  – Year 1 achievements
  – Clinical Text Analysis and Knowledge Extraction System (cTAKES)
  – Year 2 proposed projects
  – Graphical User Interface to cTAKES: demo
• Part 2:
  – cTAKES: demo

Aims
• Information extraction (IE): transformation of unstructured text into structured representations and merging clinical data extracted from free text with structured data
  – Entity and event discovery
  – Relation discovery
  – Normalization template: Clinical Element Model (CEM)
• Overarching goal
  – High-throughput phenotype extraction from clinical free text based on standards and the principles of interoperability
  – General-purpose clinical NLP tool with applications to the majority of all imaginable use cases

Processing Clinical Notes
A 43-year-old woman was diagnosed with type 2 diabetes mellitus by her family physician 3 months before this presentation. Her initial blood glucose was 340 mg/dL. Glyburide 2.5 mg once daily was prescribed. Since then, self-monitoring of blood glucose (SMBG) showed blood glucose levels of mg/dL. She was referred to an endocrinologist for further evaluation. On examination, she was normotensive and not acutely ill. Her body mass index (BMI) was 18.7 kg/m2 following a recent 10 lb weight loss. Her thyroid was symmetrically enlarged and ankle reflexes absent. Her blood glucose was 272 mg/dL, and her hemoglobin A1c (HbA1c) was 10.3%. A lipid profile showed a total cholesterol of 261 mg/dL, triglyceride level of 321 mg/dL, HDL level of 48 mg/dL, and an LDL of 150 mg/dL. Thyroid function was normal. Urinalysis showed trace ketones. She adhered to a regular exercise program and vitamin regimen, smoked 2 packs of cigarettes daily for the past 25 years, and limited her alcohol intake to 1 drink daily. Her mother's brother was diabetic.

Clinical Element Model
Disorder CEM
  text: diabetes mellitus
  code:
  subject: patient
  relative temporal context: 3 months ago
  negation indicator: not negated
Disorder CEM
  text: diabetes mellitus
  code:
  subject: family member
  relative temporal context:
  negation indicator: not negated
Tobacco Use CEM
  text: smoking
  code:
  subject: patient
  relative temporal context: 25 years
  negation indicator: not negated
Medication CEM
  text: Glyburide
  code:
  subject: patient
  frequency: once daily
  negation indicator: not negated
  strength: 2.5 mg
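The CEM instances on this slide can be pictured as simple typed records. The sketch below is only an illustration of that idea, not the actual CEM specification: the class name `ClinicalElement`, the snake_cased field names, and the helper `problem_list` are all hypothetical, and the `code` field is left empty because the slide shows no value for it.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ClinicalElement:
    """One filled CEM-style template (hypothetical layout mirroring the slide)."""
    template: str                                   # "Disorder", "Tobacco Use", "Medication"
    text: str                                       # mention as it appears in the note
    subject: str                                    # "patient" or "family member"
    negation_indicator: str = "not negated"
    code: Optional[str] = None                      # no value is shown on the slide
    relative_temporal_context: Optional[str] = None
    frequency: Optional[str] = None                 # Medication only
    strength: Optional[str] = None                  # Medication only


# The four instances listed on the slide.
cems = [
    ClinicalElement("Disorder", "diabetes mellitus", "patient",
                    relative_temporal_context="3 months ago"),
    ClinicalElement("Disorder", "diabetes mellitus", "family member"),
    ClinicalElement("Tobacco Use", "smoking", "patient",
                    relative_temporal_context="25 years"),
    ClinicalElement("Medication", "Glyburide", "patient",
                    frequency="once daily", strength="2.5 mg"),
]


def problem_list(elements):
    """Disorders that belong to the patient and are not negated."""
    return [e.text for e in elements
            if e.template == "Disorder"
            and e.subject == "patient"
            and e.negation_indicator == "not negated"]


print(problem_list(cems))
```

Keeping `subject` and `negation_indicator` on every record is what lets a downstream consumer separate the patient's own diabetes from the family history entry.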

Comparative Effectiveness
Disorder CEM
  text: diabetes mellitus
  code:
  subject: patient
  relative temporal context: 3 months ago
  negation indicator: not negated
Disorder CEM
  text: diabetes mellitus
  code:
  subject: family member
  relative temporal context:
  negation indicator: not negated
Tobacco Use CEM
  text: smoking
  code:
  subject: patient
  relative temporal context: 25 years
  negation indicator: not negated
Medication CEM
  text: Glyburide
  code:
  subject: patient
  frequency: once daily
  negation indicator: not negated
  strength: 2.5 mg
• Compare the effectiveness of different treatment strategies (e.g., modifying target levels for glucose, lipid, or blood pressure) in reducing cardiovascular complications in newly diagnosed adolescents and adults with type 2 diabetes.
• Compare the effectiveness of traditional behavioral interventions versus economic incentives in motivating behavior changes (e.g., weight loss, smoking cessation, avoiding alcohol and substance abuse) in children and adults.

Meaningful Use
Disorder CEM
  text: diabetes mellitus
  code:
  subject: patient
  relative temporal context: 3 months ago
  negation indicator: not negated
Disorder CEM
  text: diabetes mellitus
  code:
  subject: family member
  relative temporal context:
  negation indicator: not negated
Tobacco Use CEM
  text: smoking
  code:
  subject: patient
  relative temporal context: 25 years
  negation indicator: not negated
Medication CEM
  text: Glyburide
  code:
  subject: patient
  frequency: once daily
  negation indicator: not negated
  strength: 2.5 mg
• Maintain problem list
• Maintain active med list
• Record smoking status
• Provide clinical summaries for each office visit
• Generate patient lists for specific conditions
• Submit syndromic surveillance data

Clinical Practice
Disorder CEM
  text: diabetes mellitus
  code:
  subject: patient
  relative temporal context: 3 months ago
  negation indicator: not negated
Medication CEM
  text: Glyburide
  code:
  subject: patient
  frequency: once daily
  negation indicator: not negated
  strength: 2.5 mg
• Provide problem list and meds from the visit

Applications
• Meaningful use of the EMR
• Comparative effectiveness
• Clinical investigation
  – Patient cohort identification
  – Phenotype extraction
• Epidemiology
• Clinical practice
• …

How does NLP fit?
• Demo pipeline, v1
  – All medications in Mayo dataset extracted with cTAKES (NLP method)
  – Processed 360,452 notes for 10,000 patients
  – 3,442,000 CEMs were created
  – Processing time: 1.6 sec/doc
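A quick back-of-the-envelope check puts the demo pipeline figures in perspective. The calculation below assumes the 1.6 sec/doc rate applies to every note and that notes are processed one at a time on a single worker. The slide does not state either assumption, so the total is only a rough serial estimate.

```python
# Figures quoted on the slide.
notes = 360_452
patients = 10_000
cems = 3_442_000
seconds_per_doc = 1.6

# Derived quantities (serial, single-worker assumption).
total_seconds = notes * seconds_per_doc
total_hours = total_seconds / 3600
total_days = total_hours / 24
notes_per_patient = notes / patients
cems_per_note = cems / notes
cems_per_patient = cems / patients

print(f"serial processing time: {total_hours:.1f} h ({total_days:.1f} days)")
print(f"notes per patient:      {notes_per_patient:.1f}")
print(f"CEMs per note:          {cems_per_note:.2f}")
print(f"CEMs per patient:       {cems_per_patient:.1f}")
```

Under these assumptions the run takes roughly 160 hours of serial processing, with about 36 notes and 344 medication CEMs per patient on average.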

Year 1

Y1 Technical and Scientific Activities
• Gold standard corpus development:
  – corpus creation methodology
  – de-id and PHI surrogate generation tools
  – seed corpus generation (PAD, pneumonia, breast cancer)
  – annotation schema development based on CEM normalization target
  – annotation guidelines and pilot annotations
  – gold standard annotations are in progress
• Type System for software development
• Development of Evaluation workbench
• Methods development
  – entity and event discovery
  – relation discovery

Y1 Software Deliverables (cTAKES modules)
Timeline chart, months on the axis: JUL AUG SEP OCT NOV DEC JAN FEB MAR APR MAY JUN
• Dependency Parser
• Drug Profile Module
• Smoking Status Classifier
• CEM ‘orderMedAmb’ Population
• Full-Cycle Pipeline v1

SHARP Security Roundtable for Cloud-Deployed cNLP
• May 23-24, 2011
• Participants: SHARP 1, SHARP 4, health care organizations, the Veterans Administration, industry, and other research institutions
• Providing guidance to institutions seeking to use cloud technologies to support development and application of cNLP tools
• A set of recommendations for the novel legal and governance issues regarding the proper stewardship and use of clinical data

SHARP Collaborations
• SHARP 1:
  – Around security in a cloud computing environment
• SHARP 3 (SMaRT):
  – Around extraction of data from the clinical narrative
  – i2b2 database for data persistence?

Partnerships
• NCBC-funded initiatives
  – Integrating Informatics and Biology to the Bedside (i2b2)
  – Integrating Data for Analysis, Anonymization and Sharing (iDASH)
  – Ontology Development and Information Extraction (ODIE)
• Veterans Administration
• R01s
  – Shared annotated lexical resource
  – Temporal relation discovery for the clinical domain
  – Multi-source integrated platform for answering clinical questions
• University of York (UK), University of Trento (Italy), Brandeis University (USA)
• eMERGE, PGRN (Pharmacogenomics Research Network)

clinical Text Analysis and Knowledge Extraction System (cTAKES)

Overview
Goal: Phenotype extraction
• Generic – to be used for a variety of retrievals and use cases
• Expandable – at the information model level and methods
• Modular
• Cutting edge technologies – best methods combining existing practices and novel research with rapid technology transfer
• Terminology agnostic: able to plug in any terminology
• Best software practices (80M+ notes)
• Stand-alone tool easily pluggable within other platforms/toolsets
• Apache v2.0 license
• Commitment to both R and D in R&D

cTAKES Adoption
• May, 2011:
  – 2306 downloads*
• i2b2 NLP cell integration; relevance to CTSAs
• eMERGE (SGH, NW)
• PGRN (HMS, NW)
• Extensions: Yale (YTEX), MITRE
* Source:

cTAKES Technical Details
• Open source
  – Apache v2.0 license
  – Java 1.5
• Framework
  – IBM’s Unstructured Information Management Architecture (UIMA) open source framework, Apache project
• Methods
  – Natural Language Processing methods (NLP)
  – Based on standards and conventions to foster interoperability
• Application
  – High-throughput system

cTAKES: Components
• Sentence boundary detection (OpenNLP technology)
• Tokenization (rule-based)
• Morphologic normalization (NLM’s LVG)
• POS tagging (OpenNLP technology)
• Shallow parsing (OpenNLP technology)
• Named Entity Recognition
  – Dictionary mapping (lookup algorithm)
  – Machine learning (MAWUI)
  – Types: diseases/disorders, signs/symptoms, anatomical sites, procedures, medications
• Negation and context identification (NegEx)
• Dependency parser
• Drug Profile module
• Smoking status classifier
• CEM normalization module
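To make the ordering of these components concrete, here is a deliberately tiny stand-in for the first few stages: sentence boundary detection, rule-based tokenization, dictionary lookup, and a negation check in the spirit of NegEx. It is not how cTAKES, OpenNLP, or NegEx are implemented. The regular expressions, the four-entry `DICTIONARY`, the cue list, and every function name are assumptions made for illustration.

```python
import re

# Hypothetical mini-dictionary mapping terms to entity types named on the slide.
DICTIONARY = {
    "diabetes mellitus": "disease/disorder",
    "weight loss": "sign/symptom",
    "thyroid": "anatomical site",
    "glyburide": "medication",
}

# A handful of negation cues (illustrative subset only).
NEGATION_CUES = re.compile(r"\b(no|not|denies|without)\b")


def split_sentences(text):
    """Naive boundary detection: break after ., ! or ? followed by whitespace."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def tokenize(sentence):
    """Rule-based tokens: numbers (with decimals), words, single punctuation marks."""
    return re.findall(r"\d+(?:\.\d+)?|[A-Za-z]+|[^\sA-Za-z0-9]", sentence)


def find_entities(sentence):
    """Dictionary lookup; a mention is negated if a cue precedes it in the sentence."""
    lowered = sentence.lower()
    entities = []
    for term, entity_type in DICTIONARY.items():
        start = lowered.find(term)
        if start == -1:
            continue
        negated = NEGATION_CUES.search(lowered[:start]) is not None
        entities.append({
            "start": start,
            "text": sentence[start:start + len(term)],
            "type": entity_type,
            "negated": negated,
        })
    return sorted(entities, key=lambda e: e["start"])


def run_pipeline(text):
    """Run the stages in order and return one record per sentence."""
    return [
        {"sentence": s, "tokens": tokenize(s), "entities": find_entities(s)}
        for s in split_sentences(text)
    ]


if __name__ == "__main__":
    note = ("She reports no weight loss. "
            "Glyburide 2.5 mg once daily was prescribed for diabetes mellitus.")
    for row in run_pipeline(note):
        print(row["sentence"], row["entities"])
```

Each stage consumes the output of the previous one, which is the same layering the component list describes. The real modules replace these toy rules with trained models and full terminologies.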

Output Example: Drug Object
“Tamoxifen 20 mg po daily started on March 1, 2005.”
Drug
  Text: Tamoxifen
  Associated code: C
  Strength: 20 mg
  Start date: March 1, 2005
  End date: null
  Dosage: 1.0
  Frequency: 1.0
  Frequency unit: daily
  Duration: null
  Route: Enteral Oral
  Form: null
  Status: current
  Change Status: no change
  Certainty: null
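As a rough illustration of how a sentence like this maps onto a handful of the Drug fields, the sketch below uses a single regular expression. This is an assumption-laden toy, not the Drug Profile module. The pattern only covers this exact phrasing, `parse_drug_mention` is a hypothetical name, and the only route mapping included is the one visible on the slide (po to Enteral Oral).

```python
import re

# Route abbreviation -> normalized value; only the mapping shown on the slide.
ROUTES = {"po": "Enteral Oral"}

# Hypothetical pattern: <Drug> <strength> mg po daily [started on <Month D, YYYY>]
DRUG_PATTERN = re.compile(
    r"(?P<text>[A-Z][a-z]+)\s+"
    r"(?P<strength>\d+(?:\.\d+)?\s*mg)\s+"
    r"(?P<route>po)\s+"
    r"(?P<frequency_unit>daily)"
    r"(?:\s+started on\s+(?P<start_date>[A-Z][a-z]+ \d{1,2}, \d{4}))?"
)


def parse_drug_mention(sentence):
    """Return a dict of drug attributes, or None if the pattern does not match."""
    match = DRUG_PATTERN.search(sentence)
    if match is None:
        return None
    fields = match.groupdict()
    fields["route"] = ROUTES[fields["route"]]
    fields["end_date"] = None  # nothing in the sentence states an end date
    return fields


print(parse_drug_mention("Tamoxifen 20 mg po daily started on March 1, 2005."))
```

Fields such as Dosage, Status, and Certainty are left out on purpose, since deriving them needs more context than one pattern can supply.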

Conversion to CEMs
Diagram labels: cTAKES Drug NER; jCAS; Consumer; CASTransformCEM; Freemarker Transform Template
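The slide only names the pieces involved, so the following is one possible reading rather than the documented design: extracted drug attributes are handed to a consumer that fills a text template to produce a CEM. The sketch uses Python's `string.Template` purely as a stand-in for a Freemarker template. The layout of `MEDICATION_CEM`, the function `render_medication_cem`, and the default values for subject and negation are all hypothetical.

```python
from string import Template

# Hypothetical plain-text layout; the real transform template is not shown on the slide.
MEDICATION_CEM = Template(
    "Medication CEM\n"
    "  text: $text\n"
    "  subject: $subject\n"
    "  frequency: $frequency\n"
    "  negation indicator: $negation_indicator\n"
    "  strength: $strength\n"
)


def render_medication_cem(drug):
    """Fill the template from a dict of extracted drug attributes."""
    # Assumed defaults, used only when the extractor supplies no value.
    values = {"subject": "patient", "negation_indicator": "not negated"}
    values.update(drug)
    return MEDICATION_CEM.substitute(values)


extracted = {"text": "Glyburide", "frequency": "once daily", "strength": "2.5 mg"}
print(render_medication_cem(extracted))
```

Keeping the output layout in a template rather than in code means the target format can change without touching the extraction step, which is presumably the appeal of a template-based transform.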

Year 2 and Forward

Annotated example (courtesy of Martha Palmer)
Text:
  – The patient returns to the outpatient clinic today for follow-up
  – the patient will complete his thiotepa dose today, and he will return tomorrow for the last dose of his thiotepa.
  – His donor completed stem-cell collection yesterday
Labels built up across the slides: Agent, Action, Loc, Theme, Purpose; Coreference: “patient’s donor”; TERMINATES, OVERLAP

Y2 Proposed Deliverables
• Release of a library of de-identification tools (Sept, 2011)
  – MIST
  – MIT/SUNY
• Evaluation workbench (Sept, 2011)
• cTAKES Side Effects module (Aug, 2011)
• Modules for relation extraction (Dec, 2011)
  – Semantic role labeler
  – Relation classifier
  – Integration of CLEAR-TK (University of Colorado)
• End-to-end tool, v2 (cTAKES v2) (April, 2012)
  – NLP to populate CEMs for Diseases, Sign/Symptoms, Procedures, Labs, Anatomical sites
  – Integration of LexGrid/LexEVS services

Development Challenges and Opportunities
• Open source strategy
• Release early, release often
• Test-driven development with continuous integration
• All milestones measured by what we can get IRB and DUA approved and deployed with real or de-identified clinical data

Courtesy of David Carrell

Partnerships
• Strengthen existing SHARP collaborations
  – Initiate collaborations with SHARP 2 around usability
  – SHARP 1: methods for data security in a cloud-deployed framework
  – i2b2: the glue between SHARP 3 and SHARP 4
• Non-SHARP collaborations

Graphical User Interface (GUI) to cTAKES: a Prototype
Pei Chen, Children's Hospital Boston

cTAKES as a Service
• Objectives
  1. Demo cTAKES prototype web application
     – Empower end users to leverage cTAKES
  2. Gather feedback for future cTAKES GUI
  3. Potential system integrations with other applications (e.g., i2b2, ARC, Web Annotator)
• Developed within i2b2 to integrate cTAKES in the i2b2 NLP cell

cTAKES Web Application

Single clinical note

Technologies
• Front-End Web GUI
  – ExtJS
  – JavaScript
• Back-End cTAKES
  – Java
  – UIMA
• Middleware Web Services
  – Java
  – Apache CXF
  – JSON

Deployment Considerations
• Deployment Model
• Security
• Performance
• Licensing (UMLS, Apache, GPL v.3)