Semantic Web Technologies: A Paradigm for Medical Informatics Chimezie Ogbuji (Owner, Metacognition LLC.)

Slides:



Advertisements
Similar presentations
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Advertisements

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
1 ICS-FORTH EU-NSF Semantic Web Workshop 3-5 Oct Christophides Vassilis Database Technology for the Semantic Web Vassilis Christophides Dimitris Plexousakis.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Semantic Web use cases in outcomes research Experiences from building a patient repository and developing standards Chimezie Ogbuji Metacognition Inc.
Semantic Web Thanks to folks at LAIT lab Sources include :
CS570 Artificial Intelligence Semantic Web & Ontology 2
Common Terminology Services 2 (CTS2)
Amy Sheide Clinical Informaticist 3M Health Information Systems USA Achieving Data Standardization in Health Information Exchange and Quality Measurement.
Guoqian Jiang, MD, PhD Mayo Clinic
Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
The work proposed in this study is an attempt to use Semantic Web technologies for integrating patient clinical data derived from Electronic Health Records.
Who am I Gianluca Correndo PhD student (end of PhD) Work in the group of medical informatics (Paolo Terenziani) PhD thesis on contextualization techniques.
OASIS Reference Model for Service Oriented Architecture 1.0
Ontology Notes are from:
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
A Secure Interoperable Infrastructure For Healthcare Information System Ehsan ul Haq Abrar Ahmed Sair
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
IACT303 – INTI 2005 World Wide Networking Security and Next Generation Networking Technologies University of Wollongong.
1 DCS861A-2007 Emerging IT II Rinaldo Di Giorgio Andres Nieto Chris Nwosisi Richard Washington March 17, 2007.
DCMO - CIO Architecture Federation Pilot Larry Singer 5 January, 2012.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Knowledge Mediation in the WWW based on Labelled DAGs with Attached Constraints Jutta Eusterbrock WebTechnology GmbH.
Ontologies: Making Computers Smarter to Deal with Data Kei Cheung, PhD Yale Center for Medical Informatics CBB752, February 9, 2015, Yale University.
1 Electronic Health Records with Cleveland Clinic and Oracle Semantic Technologies David Booth, Ph.D., Cleveland Clinic (contractor) Oracle OpenWorld 20-Sep-2010.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
Working Together to Advance Terminology Tooling Presentation to OHT Board, Birmingham Jennifer Zelmer & Karen Gibson.
Logics for Data and Knowledge Representation
Advancing translational research with the Semantic Web Ruttenberg, Clark, Bug, Samwald, Bodenreider, Chen, Doherty, Forsberg, Gao, Kashyap, Kinoshita,
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
A Model-Driven Approach to Interoperability and Integration in Systems of Systems Gareth Tyson Adel Taweel Steffen Zschaler Tjeerd Van Staa Brendan Delaney.
LexRDF: A Semantic-Web Compatible Extension of LexGrid Cui Tao Jyotishman Pathak Harold R. Solbrig Wei-Qi Wei Christopher G. Chute Division of Biomedical.
19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Schema Interoperability Liam Magee Global Cities Institute RMIT University Melbourne, Australia.
A School of Information Science, Federal University of Minas Gerais, Brazil b Medical University of Graz, Austria, c University Medical Center Freiburg,
Semantic Web - an introduction By Daniel Wu (danielwujr)
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Lifecycle Metadata for Digital Objects November 1, 2004 Descriptive Metadata: “Modeling the World”
Proposed NWI KIF/CG --> Common Logic Standard A working group was recently formed from the KIF working group. John Sowa is the only CG representative so.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lecture 5, Jan 23 th, 2003 Lotzi Bölöni.
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
A Semantic-Web Representation of Clinical Element Models
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
PHS / Department of General Practice Royal College of Surgeons in Ireland Coláiste Ríoga na Máinleá in Éirinn Knowledge representation in TRANSFoRm AMIA.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
ESIP Semantic Web Products and Services ‘triples’ “tutorial” aka sausage making ESIP SW Cluster, Jan ed.
THE BIBFRAME EDITOR AND THE LC PILOT Module 3 – Unit 1 The Semantic Web and Linked Data : a Recap of the Key Concepts Library of Congress BIBFRAME Pilot.
1 Class exercise II: Use Case Implementation Deborah McGuinness and Peter Fox CSCI Week 8, October 20, 2008.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
Author: Akiyoshi Matonoy, Toshiyuki Amagasay, Masatoshi Yoshikawaz, Shunsuke Uemuray.
Christopher Pierce (Cleveland Clinic)
Chapter 04 Semantic Web Application Architecture 23 November 2015 A Team 오혜성, 조형헌, 권윤, 신동준, 이인용.
XML and Distributed Applications By Quddus Chong Presentation for CS551 – Fall 2001.
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
Informatics for Scientific Data Bio-informatics and Medical Informatics Week 9 Lecture notes INF 380E: Perspectives on Information.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
OWL (Ontology Web Language and Applications) Maw-Sheng Horng Department of Mathematics and Information Education National Taipei University of Education.
Components.
The Semantic Web By: Maulik Parikh.
Agenda Federated Enterprise Architecture Vision
WHIT 3.0 December 11, 2007 Christopher Pierce and Chimezie Ogbuji
Using OWL for the RESO Data Dictionary
Meaningful Use of Electronic Medical Records through Semantic Technologies: The Cleveland Clinic Experience Christopher Pierce, Ph.D. (Cleveland Clinic)
Analyzing and Securing Social Networks
LOD reference architecture
Information Networks: State of the Art
Presentation transcript:

Semantic Web Technologies: A Paradigm for Medical Informatics Chimezie Ogbuji (Owner, Metacognition LLC.)

Who I am Circa 2001: Introduced to web standards and Semantic Web technologies : Lead architect of CCF in-house clinical repository project : Member representative of CCF in World-wide Web Consortium (W3C) ◦ Editor of various standards and Semantic Web Health Care and Life Sciences Interest Group chair : Senior Research Associate at CWRU Center for Clinical Investigations 2012-current: Started business providing resource and data management software for home healthcare agencies (Metacognition LLC)

Medical Informatics Challenges Semantic interoperability ◦ Exchange of data with common meaning between sender and receiver Most of the intended benefits of HIT depend on interoperability between systems Difficulties integrating patient record systems with other information resources are among the major issues hampering their effectiveness ◦ Interoperability is a major goal for meaningful use of Electronic Health Records (EHR) Rodrigues et al. 2013; Kadry et al. 2010; Shortliffe and Cimino, 2006

Requirements and Solutions Semantic interoperability requires: ◦ Structured data ◦ A common controlled vocabulary Solutions emphasize the meaning of data rather than how they are structured ◦ “Semantic” paradigms

Registries and Research DBs Patient registries and clinical research repositories capture data elements in a uniform manner The structure of the underlying data needs to be able to evolve along with the investigations they support Thus, schema extensibility is important

Querying Interfaces Standardized interfaces for querying facilitate: ◦ Accessibility to clinical information systems ◦ Distributed querying of data from where they reside Requires: ◦ Semantically-equivalent data structures Alternatively, data are centralized in data warehouses Austin et al. 2007, “Implementation of a query interface for a generic record server”

Biomedical Ontologies Ontologies are artifacts that conceptualize a domain as a taxonomy of classes and constraints on relationships between their members Represented in a particular formalism Increasingly adopted as a foundation for the next generation of biomedical vocabularies Construction involves representing a domain of interest independent of behavior of applications using an ontology Important means towards achieving semantic interoperability

Biomedical Ontology Communities Prominent examples of adoption by life science and healthcare terminology communities: ◦ The Open Biological and Biomedical Ontologies (OBO) Foundry ◦ Gene Ontology (GO) ◦ National Center for Biomedical Ontology (NCBO) Bioportal ◦ International Health Terminology Standards Development Organization (IHTSDO)

Semantic Web and Technologies The Semantic Web is a vision of how the existing infrastructure of the World-wide Web (WWW) can be extended such that machines can interpret the meaning of data on it Semantic Web technologies are the standards and technologies that have been developed to achieve the vision

An Analogy (Technological) singularity is a theoretical moment when artificial intelligence (AI) will have progressed to a greater-than- human intelligence Despite remaining in the realm of science fiction, it has motivated many useful developments along the way ◦ The use of ontologies for knowledge representation and IBM Watson capabilities, for example

Background: Graphs Graphs are data structures comprising nodes and edges that connect them The edges can be directional Either the nodes, the edges, or both can be labeled The labels provide meaning to the graphs (edge labels in particular)

Resource Description Framework The Resource Description Framework (RDF) is a graph-based knowledge representation language for describing resources It’s edges are directional and both nodes and edges are labeled It uses Universal Resource Identifiers (URI) for labeling Foundation for Semantic Web technologies

RDF: Continued The edges are statements (triples) that go from a subject to an object Some objects are text values Some subjects and objects can be left unlabeled (Blank nodes) ◦ Anonymous resources: not important to label them uniquely The URI of the edge is the predicate Predicates used together for a common purpose are a vocabulary

Subject: Dr. X (a URI) Object: Chime Predicate: treats Vocabulary: ◦ treats, subject of record, author, and full name

RDF vocabularies How meaning is interpreted from an RDF graph There are vocabularies that constrain how predicates are used ◦ Want a sense of treats where the subject is a clinician and the object is a patient There is a predicate relating resources to the classes they are a member of (type) There are vocabularies that define constraints on class hierarchies These comprise a basic RDF Schema (RDFS) language Represented as an RDF graph

Ontologies for RDF The Ontology Web Language (OWL) is used to describe ontologies for RDF graphs More sophisticated constraints than RDFS Commonly expressed as an RDF graph Defines the meaning of RDF statements through constraints: ◦ On their predicates ◦ On the classes the resources they relate belong to

OWL Formats Most common format for describing ontologies Distribution format of ontologies in the NCBO BioPortal SNOMED CT distributions include an OWL representation ◦ RDF graphs can describe medical content in a SNOMED CT-compliant way through the use of this vocabulary

Validation and Deduction OWL is based on a formal, mathematical logic that can be used for validating the structure of an ontology and RDF data that conform to it (consistency checking) Used to deduce additional RDF statements implied by the meaning of a given RDF graph (logical inference) Logical reasoners are used for this

Inference Can infer anatomical location from SNOMED CT definitions Hypertension DX / “Benign essential hypertension (disorder)”

Querying RDF Graphs SPARQL is the official query language for RDF graphs Comparable to relational query languages ◦ Primary difference: it queries RDF triples, whereas SQL queries tables of arbitrary dimensions Includes various web protocols for querying RDF graphs Foundation of SPARQL is the triple pattern (?clinician, treats, ?patient) ◦ ?clinician and ?patient are variables (like a wildcard)

Which physicians have given essential hypertension diagnoses and to whom? (?physician, author, ?dx) (?physician, treats, ?patient) (?dx, subject of record, ?patient) (?dx, type, Hypertension DX) ?physician?patient?dx Dr. XChime…

SPARQL over Relational Data Most common implementations convert SPARQL to SQL and evaluate over: ◦ a relational databases designed for RDF storage ◦ an existing relational database There are products for both approaches Former requires native storage of RDF ◦ Relational structure doesn’t change even as RDF vocabulary does (schema extensibility) Elliot et al. 2009, “A Complete Translation from SPARQL into Efficient SQL”

SPARQL over Existing Relation Data “Virtual RDF view” ◦ Translation to SQL follows a given mapping from existing relational structures to an RDF vocabulary ◦ Allows non-disruptive evolution of existing systems ◦ Well-suited as a standard querying interface over clinical data repositories ◦ They can be queried as SPARQL, securely over encrypted HTTP

Example: Cleveland Clinic (SemanticDB) Content repository and data production system released in Jan million (native) RDF statements ◦ Uses vocabulary from a patient record OWL ontology for the registry Based on ◦ Existing registry of heart surgery and CV interventions ◦ 200,000 patient records ◦ Generating over 100 publications per year Pierce et al. 2012, “SemanticDB: A Semantic Web Infrastructure for Clinical Research and Quality Reporting”

Cohort Identification Interface developed in conjunction with Cycorp Leverage their logical reasoning system (Cyc) ◦ Identifies cohorts using natural language (NL) sentence fragments ◦ Converts fragments to SPARQL ◦ SPARQL is evaluated against RDF store

Example: Mayo Clinic (MCLSS) Mayo Clinic Life Sciences System (MCLSS) ◦ Effort to represent Mayo Clinic EHR data as RDF graphs ◦ Patient demographics, diagnoses, procedures, lab results, and free-text notes ◦ Goal was to wrap MCLSS relational database and expose as read-only, query-able RDF graphs that conform to standard ontologies ◦ Virtual RDF view Pathak et al. 2012, "Using Semantic Web Technologies for Cohort Identification from Electronic Health Records for Clinical Research"

Example: Mayo Clinic (CEM) Clinical Element Model (CEM) ◦ Represents logical structure of data in EHR ◦ Goal: translate CEM definitions into OWL and patient (instance) data into conformant RDF ◦ Use tools (logical reasoners) to check semantic consistency of the ontology, instance data, and to extract new knowledge via deduction ◦ Instance data validation:  correct number of linked components, value within data range, existence of units, etc. Tao et al. 2012, ”A semantic-web oriented representation of the clinical element model for secondary use of electronic health records data"

Summary Schema extensibility ◦ Use of RDF Semantic Interoperability ◦ Domain modeling using OWL and RDFS Standardized query interfaces ◦ Querying over SPARQL Incremental, non-disruptive adoption ◦ Virtual RDF views Main challenge: highly disruptive innovation