Terminology Metadata Extension of the Service Meta Model SWG Proposal January 2008.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Consistent and standardized common model to support large-scale vocabulary use and adoption Robust, scalable, and common API to reduce variation in clinical.
The OCLC Metadata Switch Project Jean Godby, Thomas Hickey, Diane Vizine-Goetz OCLC Office of Research Digital Library Federation May 14, 2003.
Edition 3 Metadata registry (MDR) Ray Gates May 12, /05/20151.
© Copyright 2008, Mayo Clinic College of Medicine Mayo Clinic Open Health Tools Application for Membership OHT Board Meeting, Birmingham, UK July 1, 2008.
CaBIG™ Terminology Services Path to Grid Enablement Thomas Johnson 1, Scott Bauer 1, Kevin Peterson 1, Christopher Chute 1, Johnita Beasley 2, Frank Hartel.
Ontology Notes are from:
CaGrid Service Metadata Scott Oster - Ohio State
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 8 The Enhanced Entity- Relationship (EER) Model.
The MetaDater Model and the formation of a GRID for the support of social research John Kallas Greek Social Data Bank National Center for Social Research.
Mayo LexWiki: A Prototype of Collaborative Platform for Terminology/Ontology Content Development Guoqian Jiang, Ph.D. Division of Biomedical Informatics,
1 CS 502: Computing Methods for Digital Libraries Lecture 17 Descriptive Metadata: Dublin Core.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
8/28/97Information Organization and Retrieval Files and Databases University of California, Berkeley School of Information Management and Systems SIMS.
CORDRA Philip V.W. Dodds March The “Problem Space” The SCORM framework specifies how to develop and deploy content objects that can be shared and.
OCLC Online Computer Library Center A Global OpenURL Resolver Registry Phil Norman OCLC Dlsr4lib Workshop March 23 rd, 2006 Arlington VA.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
ISO Standards: Status, Tools, Implementations, and Training Standards/David Danko.
Improving Data Discovery in Metadata Repositories through Semantic Search Chad Berkley 1, Shawn Bowers 2, Matt Jones 1, Mark Schildhauer 1, Josh Madin.
OpenMDR: Generating Semantically Annotated Grid Services Rakesh Dhaval Shannon Hastings.
Publishing Digital Content to a LOR Publishing Digital Content to a LOR 1.
Using Taxonomies Effectively in the Organization v. 2.0 KnowledgeNets 2001 Vivian Bliss Microsoft Knowledge Network Group
LexEVS 6.0 Overview Scott Bauer Mayo Clinic Rochester, Minnesota February 2011.
Terminology Metadata Salvatore Mungal Duke University Extension of the Service Meta Model Faro, Portugal, 16 th November 2008.
Environmental Terminology Research in China HE Keqing, HE Yangfan, WANG Chong State Key Lab. Of Software Engineering
1 LexEVS 5.0 Advanced Topics Configuration Options LexEVS Boot Camp November, 2009.
Annual reports and feedback from UMLS licensees Kin Wah Fung MD, MSc, MA The UMLS Team National Library of Medicine Workshop on the Future of the UMLS.
LexEVS Overview Mayo Clinic Rochester, Minnesota June 2009.
Nancy Lawler U.S. Department of Defense ISO/IEC Part 2: Classification Schemes Metadata Registries — Part 2: Classification Schemes The revision.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
NCSU Libraries Kristin Antelman NCSU Libraries June 24, 2006.
Using Taxonomies Effectively in the Organization KMWorld 2000 Mike Crandall Microsoft Information Services
H Using the Open Metadata Registry (OpenMDR) to generate semantically annotated grid services Rakesh Dhaval, MS, Calixto Melean,
LexBIG Release Overview Aug 21, LexBIG Context Project Goals for Sept –Incremental point release of LexBIG infrastructure to support EVS activities.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
The International Classification for Patient Safety: an overview In collaboration with WHO Classifications, Standards and Terminology March 2011.
Value Set Resolution: Build generalizable data normalization pipeline using LexEVS infrastructure resources Explore UIMA framework for implementing semantic.
Open Terminology Portal (TOP) Frank Hartel, Ph.D. Associate Director, Enterprise Vocabulary Services National Cancer Institute, Center for Biomedical Informatics.
CaDSR Software Users Meeting 3.1 Requirements Review 9/19/2005 caDSR Software Team Host: Denise Warzel NCICB, Assistant Director, caDSR.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Alternative Architecture for Information in Digital Libraries Onno W. Purbo
CaGrid Overview and Core Services caGrid Knowledge Center February 2011.
Common Terminology Services 2 CTS 2 Submission Team Status Update HL7 Vocabulary Working Group May 17, 2011.
LexGrid Philosophy, Model and Interfaces Harold R Solbrig Division of Biomedical Statistics and Informatics Mayo Clinic.
A LexWiki-based Representation and Harmonization Framework for caDSR Common Data Elements Guoqian Jiang, Ph.D. Robert Freimuth, Ph.D. Harold Solbrig Mayo.
- EVS Overview - Biomedical Terminology and Ontology Resources Frank Hartel, Ph.D. Director, Enterprise Vocabulary Services NCI Center for Bioinformatics.
1 Service Creation, Advertisement and Discovery Including caCORE SDK and ISO21090 William Stephens Operations Manager caGrid Knowledge Center February.
Ontology Resource Discussion
May 2007 Registration Status Small Group Meeting 1: August 24, 2009.
Ontology Evaluation, Metrics, and Metadata in NCBO BioPortal Natasha Noy Stanford University.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
CaBIG™ Terminology Services Path to Grid Enablement Thomas Johnson 1, Scott Bauer 1, Kevin Peterson 1, Christopher Chute 1, Johnita Beasley 2, Frank Hartel.
Supporting Collaborative Ontology Development in Protégé International Semantic Web Conference 2008 Tania Tudorache, Natalya F. Noy, Mark A. Musen Stanford.
Selected Semantic Web UMBC CoBrA – Context Broker Architecture  Using OWL to define ontologies for context modeling and reasoning  Taking.
National Cancer Institute caDSR Briefing for Small Scale Harmonication Project Denise Warzel Associate Director, Core Infrastructure caCORE Product Line.
SNOMED-CT Vocabulary Standard (Certification) Review Final Recommendations VCDE-WS bi-monthly meeting | 2 Oct 2008 Review Team: Christopher Chute Brian.
Online Information and Education Conference 2004, Bangkok Dr. Britta Woldering, German National Library Metadata development in The European Library.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
VCDE WS in EY2 Where we are, where we’re going ICR WS Teleconference Brian Davis – VCDE WS Lead March 26, 2008.
Metadata models to support the statistical cycle: IMDB
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
Session 3A: Catalog Services and Metadata Models
Ontology Evolution: A Methodological Overview
The Re3gistry software and the INSPIRE Registry
OBO Foundry Principles
Health Ingenuity Exchange - HingX
Session 2: Metadata and Catalogues
HingX Project Overview
Database Design Hacettepe University
Introduction to the MIABIS SOP Working Group
Presentation transcript:

Terminology Metadata Extension of the Service Meta Model SWG Proposal January 2008

Agenda Background (5 min) Review Proposed Model (15 min) Discussion (5 min) Vote Next Steps (5 min)

Team Members Tom Johnson (Mayo) Frank Hartel (NCI) George Komatsoulis (NCI) Sal Mungal (Duke) Hua Min (Fox Chase) Scott Oster (OSU) Mike Riben (MD Anderson) Brian Davis (3rd Millennium)

Background - Goals Goals: Identify metadata queryable at the index service level Narrow focus for first revision … discoveryInitial model defined to satisfy discovery use cases Support development of enhanced grid discovery client Resolve runtime services for terminologies of interest Additional metadata available through runtime services Allow/anticipate future expansion

Background – Use Cases Use Case Collection & Classification of Attributes Identification Internationalization Intended/Allowed Usage Provenance Administration

Background - Use Cases Samples Browse Existing Ontologies Viewing Differences Detecting Recently Added Ontologies Web of Trust

(1) Browse Existing Ontologies An ontology developer is interested in creating an ontology for a domain (e.g., radiographic anatomy). Determine if there are already similar ontologies in that domain. Evaluates assigned categories for registered ontologies. Discovers match for “anatomy” Views available titles and descriptions Finds listings for “human” and “mouse” anatomy, but not “radiology” Looks at the human anatomy ontology to see if it fits the need  Attributes: category, title, description Background - Use Cases

(2) Viewing Differences An ontology developer wants to view what has changed between two versions of an ontology. Retrieve listing of registered terminology services Sort by URI, then version Select and resolve grid services for differing versions Invokes runtime services to resolve and compare content  Attributes: uri, version Background - Use Cases

(3) Detecting Recently Added Ontologies A user wants to contact the providers for new ontologies registered within the last quarter. Query registered ontologies by registration date Pull point of contact information (source, curator, registration authority) from listed items  Attributes: registration date, registration authority, source, curator Background - Use Cases

(4) Web of Trust Quality of ontologies: User is aware that there are several anatomy ontologies, and is unclear which to use. Trusts certain ontology sources (anatomists) more than others Views ontology source to determine content origin Views intended and example use to consider alignment with application Considers caBIG certification level  Attributes: source, intended use, example use, certification level Background - Use Cases

Background – Model Focus of work on … Model alignment External … Incorporate feedback from review and alignment with relevant specifications and standards. Internal … Take better advantage of previously registered models and classes. Incorporating specific feedback on model classes and attributes.

Background - Alignment Specifications/standards considered … Dublin Core ISO /3/6: classification, registries, admin LexGrid/LexBIG model National Center for Biomedical Ontology (NCBO) BioPortal Public Health Information Network (CDC/PHIN) Simple Knowledge Organization System (SKOS core) UMLS Rich Release Format (RRF) CTS/CTS2

Background – Model Alignment

Findings … No silver bullet General alignment for defined items All SWG items and definitions represented conceptually in one or more specifications Adequate, but not perfect, alignment of semantics Some name changes Some new attributes identified Supplement existing use case Generally not found to be required unless we add use cases

Model - Overview

Model – Core Identification & Description uri (1) Unique persistent identifier. urn:oid: title (1) Formal or published name for display. International Classification of Disease, 9th… localName (1..n) Name used to refer to the terminology within a localized context; often a mnemonic. ICD-9-CM, ICD-9 description (0..1) Human-readable explanation or narrative. The International Classification of … category (0..n) Applicable domains or scientific fields. e.g. anatomy, genomic, proteomic, phenotype…

type (0..1) Nature of content relative to the category. application – describes domain in an application dependent manner core – describes domain in an application independent manner domain – describes the most important concepts in a domain task – describes generic types of tasks or activities (e.g. selling, selecting) upperLevel – describes general, domain independent concepts (e.g. space, time) structure (1) Indicates complexity of maintained relationships flat – no hierarchy simple - supports a single inheritance mono- hierarchical structure. complex - supports multiple relationships and/or relationship types Model – Core Identification & Description

defaultLanguage (1) Language for text unless otherwise specified eng supportedLanguage (1..n) Languages supported for text-based content eng, spa, … supportedContentType (1..n) Supported type of text or imbedded multimedia e.g. mime type (text/plain, image) keyword (0..n) Words or phrases of special significance. patient record, nursing protocol, … Model – Core Identification & Description

Model - Usage intendedUse (0..n) Human-readable description of intended use. data integration exampleUse (0..n) Human-readable example of use. Integration of protein data. isRestricted (1) Indication of intellectual property boundaries. true rights (0..n) Human-readable description of IP rights. NCI Thesaurus terms of use … rightsHolder (point of contact) (0..1) Contact point for intellectual property rights. National Cancer Institute

Model - Provenance source (0..1) Origin or provider of content National Center for Health Statistics (NCHS) curator (0..1) Maintains the content in the release format (e.g. OWL, OBO, RRF) National Library of Medicine releaseDate (0..1) Date of availability in released format releaseFormat (0..1) Format as released by the curator. e.g. OWL, OBO, RRF releaseLocation (0..1) Location of resource in the releaseFormat. ftp://ftp1.nci.nih.gov/pub/cacore/EVS/NCI_Thes aurus/Thesaurus_07.12a.OWL.zip

Model - Provenance releasePackage (0..1) Name of the composite ontology or meta distribution containing the terminology as released. e.g. UMLS, NCI_MetaThesaurus, BiomedGT releaseVersion (0..1) Represented version identifier. 2007

Model - Administration registrationAuthority (1) Responsible for maintaining content on the grid National Cancer Institute registrationDate (1) Date of grid availability or last change of registration status registrationStatus (1) Designation of terminology status in life cycle. Possible values from registration life cycle status category. registrationTag (0..1) Supports lookup by version-agnostic designation development, test, production certification (0..1) caBIG level of compliance. bronze, silver, gold

Model – Anticipated Alignment against available classes Superclasses Based on 11179

Vote Vote will be for … Approval of the identified criteria Acknowledgement that model will be aligned with existing (e.g based) superclasses, with model and attribute details to be addressed as required.

Questions/Discussion before Vote

Next Steps Model harmonization w/ recommended superclasses Change caGRID tooling to capture additional metadata when registering terminology Create custom discovery client for terminology services, to take advantage of additional metadata in support of identified use cases