Metadata Open Forum 2008 ISO/IEC/IEC 11179: Metadata Registries A Tutorial from the National Cancer Institute Dianne M. Reeves, RN, MSN National Cancer.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

1 Metadata Registry Standards: A Key to Information Integration Jim Carpenter Bureau of Labor Statistics MIT Seminar June 3, 1999 Previously presented.
Direction of Proposals for New Edition (E3) of ISO/IEC 11179
1 Submitted to: NCI Center for Bioinformatics Prepared by: 101 West Renner Road, Suite 130 Richardson, TX September 22, 2004 Contact Information:
Is Your Data Facility ISO Compliant? Progress Towards Harmonizing the DDI and ISO/IEC Dan Gillman Information Scientist US Bureau of Labor Statistics.
LEVERAGING THE ENTERPRISE INFORMATION ENVIRONMENT Louise Edmonds Senior Manager Information Management ACT Health.
Best Practices for Including Enumerated Value Domains in UML Models What are the mechanics of creating CDEs associated with enumerated value domains in.
Procedures to Develop and Register Data Elements in Support of Data Standardization September 2000.
Form Builder Iteration 2 User Acceptance Testing (UAT) Denise Warzel Semantic Infrastructure Operations Team Presented to caDSR Curation Team March.
Future of MDR - ISO/IEC Metadata Registries (MDR) Larry Fitzwater, SC 32 WG 2 Convener Computer Scientist U.S. Environmental Protection Agency May.
OpenMDR: Generating Semantically Annotated Grid Services Rakesh Dhaval Shannon Hastings.
SC32 WG2 Metadata Standards Tutorial Metadata Registries and Big Data WG2 N1945 June 9, 2014 Beijing, China.
OpenMDR: Alternative Methods for Generating Semantically Annotated Grid Services Rakesh Dhaval Shannon Hastings.
Case Studies: Statistics Canada (WP 11) Alice Born Statistics UNECE Workshop on Statistical Metadata.
1 SAIC Phong Ngo Considerations for Establishing and Managing A Registry for Metadata Phong Ngo (NCITS/L8 - SAIC) April 15-17, 1998 Metadata Registry Workshop.
Status report of : Framework for generating ontologies ISO/IEC JTC 1/SC 32/WG 2 Interim Meeting, Redwood City, USA, November 17, 2010 Dongwon Jeong,
Representing variables according to the ISO/IEC standard.
United States Health Information Knowledgebase (USHIK) A Data Registry Project 17 February 1999 Open Forum on METADATA REGISTRIES.
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
Nancy Lawler U.S. Department of Defense ISO/IEC Part 2: Classification Schemes Metadata Registries — Part 2: Classification Schemes The revision.
SDMX Standards Relationships to ISO/IEC 11179/CMR Arofan Gregory Chris Nelson Joint UNECE/Eurostat/OECD workshop on statistical metadata (METIS): Geneva.
IMDB Registration of Survey Variables Dec 19, 2005.
Metadata Registries Workshop April 15, 1998 Slide 1 of 20 ANSI X Douglas D. Mann Stewardship Naming & Identification Classification.
The Final Study Period Report on MFI 6: Model registration procedure SC32WG2 Meeting, Sydney May 26, 2008 H. Horiuchi, Keqing He, Doo-Kwon Baik SC32WG2.
Metadata Models in Survey Computing Some Results of MetaNet – WG 2 METIS 2004, Geneva W. Grossmann University of Vienna.
Tommie Curtis SAIC January 17, 2000 Open Forum on Metadata Registries Santa Fe, NM SDC JE-2023.
H Using the Open Metadata Registry (OpenMDR) to generate semantically annotated grid services Rakesh Dhaval, MS, Calixto Melean,
EPA’s Environmental Terminology System and Services (ETSS) Michael Pendleton Data Standards Branch, EPA/OEI Ecoiformatics Technical Collaborative Indicators.
Clinical Data Interchange Standards Consortium (CDISC) uses NCIt for its Study Data Tabulation Model (SDTM) and other global data standards for medical.
9 th Open Forum on Metadata Registries Harmonization of Terminology, Ontology and Metadata 20th – 22nd March, 2006, Kobe Japan. Presentation Title: Day:
ISO/IEC : Framework for a Metadata Registry By Daniel W. Gillman Bureau of Labor Statistics USA.
FEA DRM Management Strategy Presented by : Mary McCaffery, US EPA.
CaCORE Software Development Kit George Komatsoulis 25-Feb-2005.
CaDSR Software Users Meeting 3.1 Requirements Review 9/19/2005 caDSR Software Team Host: Denise Warzel NCICB, Assistant Director, caDSR.
1 Introduction to the caDSR Presented to HL7 Vocab SIG January 24, 2005 Denise Warzel National Cancer Institute, Center for Bioinformatics caDSR Project.
1 ECCF Training 2.0 Introduction ECCF Training Working Group January 2011.
SDC JE What is a Data Registry? v A place to keep facts about characteristics of data that are necessary to clearly describe, inventory,
LoG: A Methodology for Metadata Registry-based Management of Scientific Data July 5, 2002 Doo-Kwon Baik
CaDSR O&M Draft Scope September 2010 Denise Warzel National Cancer Institute Center for Biomedical Informatics and Information Technology.
This material was developed by Duke University, funded by the Department of Health and Human Services, Office of the National Coordinator for Health Information.
A LexWiki-based Representation and Harmonization Framework for caDSR Common Data Elements Guoqian Jiang, Ph.D. Robert Freimuth, Ph.D. Harold Solbrig Mayo.
Overview of SC 32/WG 2 Standards Projects Supporting Semantics Management Open Forum 2005 on Metadata Registries 14:45 to 15:30 13 April 2005 Larry Fitzwater.
Shawn Jones INDUS Corporation January 18, 2000 Open Forum on Metadata Registries Santa Fe, NM SDC JE-2029.
Tutorial on XML Tag and Schema Registration in an ISO/IEC Metadata Registry Open Forum 2003 on Metadata Registries Tuesday, January 21, 2003; 4:45-5:30.
Extending the MDR for Semantic Web November 20, 2008 SC32/WG32 Interim Meeting Vilamoura, Portugal - Procedure for the Specification of Web Ontology -
ISO/IEC JTC 1/SC 32 Plenary and WGs Meetings Jeju, Korea, June 25, 2009 Jeong-Dong Kim, Doo-Kwon Baik, Dongwon Jeong {kjd4u,
International Security Management Standards. BS ISO/IEC 17799:2005 BS ISO/IEC 27001:2005 First edition – ISO/IEC 17799:2000 Second edition ISO/IEC 17799:2005.
May 2007 Registration Status Small Group Meeting 1: August 24, 2009.
Common Queries for MDRs WG4 SQL16 ISO/IEC JTC1 SC 32 WG2 input to WG4 on SQL-MM Part 8 November, 2010 ISO/IEC JTC1/SC32/WG2 N1484.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
Statistical Data and Metadata Exchange SDMX Metadata Common Vocabulary Status of project and issues ( ) Marco Pellegrino Eurostat
Considerations for Establishing and Managing A Metadata Registry Phong Ngo (SAIC) February, 1999 Metadata Registration Open Forum Washington, D.C., USA.
Copyright © 2007, Oracle. All rights reserved. Managing Items and Item Catalogs.
CaBIG ™ is an initiative of the National Cancer Institute, NIH, DHHS Semantic Integration Workbench (SIW) v3.1 and UML Model Browser v.5  Session Date:
Body Mass Index VCDE Small Group Lynne Wilkens, Lewis Frey, Mary Cooper, Brian Davis, Mike Keller, Daniela Smith 3/22/2007.
CaCORE Training Forms- based Metadata Curation Session 1 Course Number:1061 Duration: 90 Minutes Intended Audience: Metadata Curators – Using Forms Instructor:
CaCORE In Action: An Introduction to caDSR and EVS Browsers for End Users A Tool Demonstration from caBIG™ caCORE (Common Ontologic Representation Environment)
Introduction: Databases and Database Systems Lecture # 1 June 19,2012 National University of Computer and Emerging Sciences.
National Cancer Institute caDSR Briefing for Small Scale Harmonication Project Denise Warzel Associate Director, Core Infrastructure caCORE Product Line.
CgMDR and Excel Addin Overview Denise Warzel Nano WG May 5, 2011.
Metadata Schema Registries: background and context MEG Registry Workshop, Bath, 21 January 2003 Rachel Heery UKOLN, University of Bath Bath, BA2 7AY UKOLN.
SDTM Metadata Curation Process  Dianne Reeves. Session Outline  Submit Candidate Terminology – Example spreadsheet  Load new terms into EVS (Enterprise.
Semantic Interoperability: caCORE and the Cancer Data Standards Repository (caDSR)  Jennifer Brush.
International Planetary Data Alliance Registry Project Update September 16, 2011.
University of Colorado at Denver and Health Sciences Center Department of Preventive Medicine and Biometrics Contact:
NCI Center for Biomedical Informatics and Information Technology (CBIIT) The CBIIT is the NCI’s strategic and tactical arm for research information management.
Metadata models to support the statistical cycle: IMDB
Template library tool and Kestrel training
Networking and Health Information Exchange
Database Design Hacettepe University
Presentation transcript:

Metadata Open Forum 2008 ISO/IEC/IEC 11179: Metadata Registries A Tutorial from the National Cancer Institute Dianne M. Reeves, RN, MSN National Cancer Institute CBIIT Tommie G. Curtis, MS Science Applications International Corporation (SAIC)

Metadata Open Forum Goals Explain the role of ISO/IESC in capturing structured metadata Discuss the added value of binding vocabulary/terminology, to ISO/IEC administered items Estimate the level of effort needed to collect and maintain metadata Assess and justify metadata registration needs for an organization

Metadata Open Forum 2008 – Activities Review and discuss the ISO/IEC standard Examine a registry implementation of ISO/IEC Map source metadata to registry content Utilize semantics to bind to metadata Assess the value and role of an ISO/IEC registry in an organization

Metadata Open Forum 2008 – ISO/IEC Metadata Registries What is the Standard? Six-part standard defining various aspects of metadata development and metadata registry management Common way of representing metadata A “Grammar” for describing data –Descriptive (pattern for creating meaning) –Prescriptive (pre-existing rules for the pattern)

ISO/IEC Information technology Standard Metadata Open Forum 2008 – ISO/IEC Information technology Standard ISO/IEC Part 1: Framework ISO/IEC Part 2: Classification ISO/IEC Part 3: Registry metamodel and basic attributes ISO/IEC Part 4: Formulation of data definitions ISO/IEC Part 5: Naming and Identification Principles for Data Elements ISO/IEC Part 6: Registration Publicly Available from: standards.org/11179/

Basic ISO/IEC Metamodel Components Metadata Open Forum 2008 – Basic ISO/IEC Metamodel Components Data_Element_Concept * +specifing having 0..* data_element_concept_conceptual_domain_relationship 0..* providing_representation_to 0..* +represented_by 1..1 expression 0..* represented_with 0..* +providing_representation_for 1..1 representation 0..* representing 0..* +specified_by 1..1 specification Data Element Concept Conceptual_Domain Conceptual Domain Data_Element Data Element Value_Domain Value Domain Perception Representation

Metadata Open Forum 2008 – Terms and Definitions for ISO/IEC Data Element: A unit of data for which the definition, identification, representation, and permissible values are specified by means of a set of attributes. Data Element Concept: An idea that can be represented in the form of a data element, described independently of any particular representation. Conceptual Domain: A set of valid Value Meanings. Representation Class: A classification of data elements based upon the type of representational form. Value Domain: A set of attributes describing representational characteristics of instance data with or without enumerated permissible values. Value Meaning: A member of the set of finite allowed inventory of notions that can be categorized for a conceptual domain. Permissible Value: An expression of a Value Meaning expressed in a Value Domain. Data Element Data Element Concept Value Domain Value Meaning Permissible Value Conceptual Domain Representation Class

Metadata Open Forum 2008 – Terms and Definitions for ISO/IEC Data Element Concept: An idea that can be represented in the form of a data element, described independently of any particular representation. - The suggested pattern for creating the meaning of a DEC is further described using Object Class and Property Object Class: The part of the DEC ‘pattern’ pertaining to the thing in the real world. A person, a gene, a vehicle. Property: The part of the DEC ‘pattern’ pertaining to an observable or recordable characteristic of the thing in the real world. These characteristics, or attributes, are those things that help to differentiate instances of one thing of the same type or kind, from another. For example characteristics of a person that differentiate one person from another: Hair color, Eye color, Height, Weight, BSA Data Element Concept Object Class Qualifiers Property

Metadata Open Forum 2008 – ISO caDSR Implementation Diagram Object Class Chemopreventative Agent Property Name Conceptual Domain Agent Data Element Concept Chemopreventive Agent Name Data Element Chemopreventive Agent Name Value Domain CTEP Drug Names Representation Name Valid Values Cyclooxygenase Inhibitor Doxercalciferol Eflornithine … Ursodiol

Metadata Open Forum 2008 – NCI CBIIT Extensions Mandatory Object Class and Property –NCI Compliance ensures that the parts of the semantics are clearly, unambiguously identified –Simplifies development of programs and interfaces that can reliably detect similar or different content (uses the ‘grammar’ to interpret metadata) Value Meanings as Administered Items –Alternate names and definitions –Reference documents –Origins Forms and parts of forms as administered items –Unique identifier –Versioning –Simplify creating and sharing Data Elements –Promote reuse of standards

Metadata Open Forum 2008 – NCI CBIIT Extensions Concepts as Administered Items –Provides links to external vocabularies and code systems –Minimal concept information extracted from external vocabulary systems to populate the Administered Item Record to simplify reuse of NCI standardized concepts Preferred name, definition, concept identifier, source vocabulary identification Concepts bound to Controlled Vocabulary –Binding registry semantics to immutable external vocabulary concepts –Provides access to extensive synonymy and semantics represented in ontologies, taxonomies and code systems where the concepts are more fully described Extended use of Concepts: Property, Representation, Value Meanings, Value Domains, Conceptual Domain, etc. –Enhances programmatic interpretation of semantics –(*ISO/IEC Ed. 2 specifies concepts as optionally associated with Object Class)

Metadata Open Forum 2008 – NCI CBIIT Extensions Applied business rules to make the addition of semantics mandatory for Object Class, Property, Representation, Qualifiers, and Value Meanings Include Preferred Question Text Next steps:  Forms as administered items  CSI as administered items

Metadata Open Forum 2008 – NCI CBIIT Business Rules for Metadata Development and Maintenance Metadata Development –Naming and Definitions –Semantic Assignment –Completeness Criteria –Ownership and Usage –Status Assignment Metadata Maintenance –Updating/Modifying –Versioning –Status assignment

Metadata Open Forum 2008 – NCI CBIIT Best Practices Describe common processes Improve quality and encourage reuse Facilitate training and understanding Documented in FAQs and documents Encourage use of data standards

Metadata Open Forum 2008 – Enterprise Vocabulary Services - Thesaurus Controlled vocabulary resources for caCORE and the cancer research community Vocabulary Products and Services –NCI Thesaurus –NCI Metathesaurus –External vocabularies NCI Thesaurus - controlled vocabulary source for metadata –Has excellent coverage of cancer terminology –Expands based on needs for additional terminology –Based on concepts rather than terms –Each concept has a unique identifier or CUI with definitions and synonym

Preferred Name Synonyms Definition Relationships Concept Code Metadata Open Forum 2008 – Enterprise Vocabulary Services - Thesaurus

Metadata Open Forum 2008 – Curation: Manual Curation Use a suite of caDSR Tools: CDE Browser to locate existing metadata Curation tool to create metadata –Applies rules for well formed metadata Administration tool to create classifications, classification scheme items

Metadata Open Forum 2008 – ISO/IEC Implementation in NCI CBIIT- Browser

Metadata Open Forum 2008 – NCI CBIIT and caBIG™ Data Standards

Metadata Open Forum 2008 – NCI CBIIT and caBIG™ Data Standards - Details

Metadata Open Forum 2008 – CDE Browser – Advance Search Long name: Permissible Value: Workflow Status:

Metadata Open Forum 2008 – Curation Tool

Example – Searching for a Representation Term in the Curation Tool brings up The list of 37 preferred Representation terms.

Metadata Open Forum 2008 – Preferred Representation Terms Anatomic Site Category Code Count Date Date/Time Dose Duration Float Frequency Grade Identifier Ind-2 Ind-3 Indicator Integer Interval Measurement Name Number Range Rate Reason Result Scale Score Source Specify Stage Status Text Time Type Unit of Measure Value

Metadata Open Forum 2008 – Curation Tool

Metadata Open Forum 2008 – Administration Tool

Metadata Open Forum 2008 – Ways to Register Metadata into the caDSR Manual Curation Model Loading Batch Loader

Metadata Open Forum 2008 – Sources of Metadata

Metadata Open Forum 2008 – ISO/IEC Implementation in NCICBIIT

Metadata Open Forum 2008 – Curation of Content: Data Element

Metadata Open Forum 2008 – Curation: Loading a Model into caDSR

Metadata Open Forum 2008 – ISO/IEC Administered Items

Metadata Open Forum 2008 – ISO/IEC Administration Record

Metadata Open Forum 2008 – Creation of Metadata: Data Element Concept What guidance does the ISO/IEC Standard give for DEC creation? Conceptual Domain Object + Qualifiers (optional) Property + Qualifiers (optional) Administration Record: –Data Identifier (‘Public ID’) –Version –Long, Short, and alternate names –Definitions (we use 3 types) –Effective date –Until date –Classifications –Origin –Administrative status –Registration status –And more characteristics…

Metadata Open Forum 2008 – Creation of Metadata: Value Domain What guidance does ISO/IEC give for VD creation? Conceptual Domain Representation term + Qualifiers Data Identifier (‘Public ID’) Version Long, Short, and alternate names Definitions Effective date Until date Classifications Origin Administrative Status Registration Status Data type Field length UOM Permissible values/Value meanings/Concepts/Value meaning Descriptions Reference Documents

Metadata Open Forum 2008 – Creation of Content: Data Element What guidance does ISO/IEC give for DE creation? DE VD Document Text – Question used on a form Definition Effective Date Until Date Data Identifier Version Classifications Documents Origin Administrative status Registration Status Reference Documents

Metadata Open Forum 2008 – caDSR Organization of Content Organization of Metadata in caDSR By Context or owning group By Model (UML Browser) By Classification (CS) / Classification Scheme Item (CSI) –Different ‘types’ of CS’s represent Business Categories, Data or Web Services, Items used together, etc. By Form

Metadata Open Forum 2008 – Organization of Metadata in caDSR: Contexts A context is a group owning metadata Context administrator Business rules for aspects of metadata curation and maintenance Privileges for an identified set of users/curators

Metadata Open Forum 2008 – NCI CBIIT Data Quality Metrics Analyze the current content and identify issues Clean-up quality of content in the caDSR by addressing incomplete, inconsistent, and redundant metadata in the caDSR Establish best practices and business rules to prevent the creation of data quality problems in the future Strengthen the reuse of metadata across user communities

Metadata Open Forum 2008 – Cleanup Activities Identify incomplete, redundant, and inconsistent CDEs and their components Reduce duplication Remove orphans Ensure conformance with current business rules Continue to monitor content over time

Metadata Open Forum 2008 – Example Metrics Report - Concepts – Baseline Update Number of Concepts by Workflow Statuses 3/19/20084/16/2008 RELEASED11,51811,603 RETIRED ARCHIVED78 RETIRED DELETED44 RETIRED PHASED OUT17 RETIRED WITHDRAWN37 Total Number of Concepts11,58311,669

Metadata Open Forum 2008 – caDSR Users: Training Courses

Metadata Open Forum 2008 – Role: Context Administrator

Metadata Open Forum 2008 – Role: Subject Matter Expert/Content Expert

Metadata Open Forum 2008 – Training Course Materials

Metadata Open Forum 2008 – What is the value of using ISO/IEC 11179? Standardize structure and content Promote reuse of standards Enhance the ability to successfully search metadata

Metadata Open Forum 2008 – What organizations are using ISO/IEC 11179? Australian Institute of Health & Welfare Nordic Common Data Elements Registry UDEF (Universal Data Element Framework) UK – cgMDR (Cancer Grid Metadata Registry) US - National Cancer Institute US - Department of Justice US - Environmental Protection Agency or EPA US – United States Health Information Knowledge Base Others?

Metadata Open Forum 2008 – What is Needed for a Metadata Program? Management support Commitment of resources A registry tool Business Rules Best Practices Quality Measurement Plan Training program

Metadata Open Forum 2008 – Resources/References caDSR SDK Guide: ftp://ftp1.nci.nih.gov/pub/cacore/SDK/v3.2.1/caCORE_S DK_3.2.1_Programmers_Guide.pdf caCORE User Application Manual: ftp://ftp1.nci.nih.gov/pub/cacore/NCICBapplications/NCI CBAppManual.pdf caCORE Technical Guide: ftp://ftp1.nci.nih.gov/pub/cacore/caCORE3.2_Tech_Guid e.pdf caDSR Homepage:

Metadata Open Forum 2008 – Contact Information NCI CBIIT Instructor: Dianne Reeves caDSR Home Page: rview/cadsr caDSR Training Home Page: caDSR Training ListServ:

Metadata Open Forum 2008 – Your Questions Thank you for your attention! Please join us on a future caDSR Training teleconference, or send an .