LoG: A Methodology for Metadata Registry-based Management of Scientific Data July 5, 2002 Doo-Kwon Baik

July 5, 2002 CODATA/DSAO

Contents
• Motivation
• Objectives
• Related work
• Overview of the MDR
• The scientific data properties
• User levels and the data property
• Data visibility
• The conceptual model of the LoG
• A LoG framework
• An example
• Conclusions and future work

Motivation
• Existing data integration approaches focus on technical research and system development, and do not consider the properties of the domain knowledge.

The Domain Knowledge
• The domain knowledge property is a very important factor in data integration.
  - Many tasks and services depend on the domain knowledge properties.
  - The quality degree and the quantity scope of data integration are defined depending on the domain knowledge property.
  - Many other services, such as data services and application services, depend on it.
[Diagram: domain knowledge, linked to the quality degree of data integration, the quantity scope of data integration, data services (information providing), and application services]

Objectives
• The objectives of our research are:
  - to solve the problems of the existing data integration approaches;
  - to analyze and define the domain knowledge properties (in this paper, we focus on scientific data);
  - to define the relationships among the domain knowledge properties, users, and metadata, i.e., to define the considerations for data integration;
  - to create a new methodology that takes the results of the domain knowledge analysis into account, which we call LoG (Localization-based Global MDR methodology);
  - finally, to design a framework suitable for the methodology.

Related work: Bottom-up approach (1/2)
• Existing data integration approaches are classified into the top-down approach and the bottom-up approach.
• The bottom-up approach is the most general approach.
  - The ontology-based methodology is representative.
[Diagram: analyze all factual databases (number of databases = n); design and create a guideline, such as a global view, from the specified databases; new databases (number = c); total number of databases = n + c]

Related work: Bottom-up approach (2/2)
• Advantages
  - It can reach complete data integration, because it uses a global guideline created through the analysis and design of all databases.
• Disadvantages
  - Creating a global guideline costs a great deal of time and money.
  - It is not suitable for very large-scale data integration.
  - It provides only a static integration management mechanism: whenever a new schema or a new database is added to the integrated database, the previous processes must be repeated, which increases cost and time geometrically.
  - It does not provide a standardized guideline, i.e., it depends on its domain: each application domain defines and uses its own, different guidelines for integration.

Related work: Top-down approach (1/2)
• The top-down approach aims to solve the problems of the bottom-up approach.
  - MDR (ISO/IEC 11179) is representative.
  - MDR is an international standard.
[Diagram: analyze all factual databases; design and create a guideline, such as a global view (metadata elements), from the specified databases; define the schemas of new databases according to the standardized guideline]

Related work: Top-down approach (2/2)
• Advantages
  - It reduces cost, because it does not require rebuilding the global guideline.
  - It provides a standardized schema, so all new databases can be built and managed consistently.
• Disadvantages
  - Like the bottom-up approach, it has a high initial cost, because it requires creating a global view through the analysis of all legacy databases.
  - This is hard work for very large-scale integration.
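
The cost argument for the two approaches can be illustrated with a toy count. The sketch below assumes a cost model that is not in the slides: analyzing one database costs one unit, the bottom-up approach re-analyzes every database each time a new one is added, and the top-down approach analyzes the n legacy databases only once. The function names are hypothetical.

```python
def bottom_up_analyses(n: int, c: int) -> int:
    """Count database analyses when the global guideline is rebuilt on every addition."""
    total = n  # initial analysis of the n existing databases
    for k in range(1, c + 1):
        total += n + k  # assumed: full re-analysis, including the k new databases
    return total


def top_down_analyses(n: int, c: int) -> int:
    """Count database analyses when new databases simply follow the guideline."""
    return n  # assumed: only the n legacy databases are ever analyzed


if __name__ == "__main__":
    for added in (0, 3, 10):
        print(added, bottom_up_analyses(10, added), top_down_analyses(10, added))
```

Under this assumed model the bottom-up count keeps growing with every added database, while the top-down count stays at n.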

Overview of the MDR: Definition
• Definition of MDR
  - Metadata Registry
  - A system for registering, storing, and managing the specifications (metadata) of data elements
  - Evolution of ISO/IEC Metamodel of Data Registry: ANSI X3.285
• Purpose
  - A metadata registry for data standardization
  - Support for data search and data specification
  - Support for data sharing among systems or organizations
  - A supporting system for creating, registering, and managing data elements
  - Helping users understand the meaning, representation, and identification of data

Overview of the MDR: Basic concepts
• Data Element
  - The basic unit of data management
  - The unit that specifies the identification, context, and value representation of data
• Components of a Data Element
  - Object Class: the data to be collected or stored
  - Property: the characteristics needed to identify and explain objects
  - Representation: the description of the representational form and value domain of each data element
[Diagram: Object Class and Property combine into a Data Element Concept (cardinality labels 1:N, 1:1); Object Class, Property, and Representation combine into a Data Element (cardinality labels 1:N, 1:1, 1:1)]
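
The composition described on this slide can be sketched as a small data structure. This is only an illustration: the class and field names, and the rule that joins the object class and property names with an underscore, are assumptions and are not taken from ISO/IEC 11179 or from the slides.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ObjectClass:
    name: str  # the data to be collected or stored, e.g. "Student"


@dataclass(frozen=True)
class Property:
    name: str  # a characteristic that identifies or explains the object


@dataclass(frozen=True)
class Representation:
    data_type: str  # representational form of the values
    layout: str     # value layout, e.g. "N(12)"


@dataclass(frozen=True)
class DataElementConcept:
    object_class: ObjectClass
    prop: Property


@dataclass(frozen=True)
class DataElement:
    concept: DataElementConcept
    representation: Representation

    @property
    def name(self) -> str:
        # Assumed naming rule, used here only for illustration.
        return f"{self.concept.object_class.name}_{self.concept.prop.name}"


student_id = DataElement(
    DataElementConcept(ObjectClass("Student"), Property("ID")),
    Representation(data_type="Numeric", layout="N(12)"),
)
print(student_id.name)
```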

Overview of the MDR: Specification
• Specification of a Data Element: basic attributes for specifying a data element

  Classification    Characteristics
  Identification    Identification of the data element
  Definition        Description of meaning
  Relation          Relations among data elements
  Representation    Description of the data element's representation
  Administration    Description of the data element's management

Overview of the MDR: An Example
• Definition of a metadata element

  Identifying Attributes
    Data Element Name        Student_ID
    Identifier
    Version                  1
    Synonymous name          Student Number
    Context                  Student's ID
  Definitional Attribute
    Definition               The unique number assigned to each student
  Relational and Representational Attributes
    Type                     Data Element
    Representation Category  Number
    Representation Form      Code
    Data Type                Numeric
    Min. size                7
    Max. size                12
    Representation Layout    N(12)
    Data Domain              Reference of student ID classification
  Administrative Attributes
    Registration Authority   KOREA UNIV.
    Registration Status      recorded
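
As an illustration of how such a registered element could be used, the following sketch checks candidate values against the Student_ID representation attributes. It assumes that "size" means the number of characters and that "Numeric" means ASCII digits only; the dictionary keys and the `conforms` function are hypothetical.

```python
# Attribute values copied from the Student_ID example; key names are illustrative.
STUDENT_ID = {
    "name": "Student_ID",
    "version": "1",
    "data_type": "Numeric",
    "min_size": 7,
    "max_size": 12,
    "registration_authority": "KOREA UNIV.",
    "registration_status": "recorded",
}


def conforms(value: str, element: dict) -> bool:
    """Return True if value fits the element's data type and size limits."""
    if element["data_type"] == "Numeric":
        # Assumption: numeric values consist of ASCII digits only.
        if not (value.isascii() and value.isdigit()):
            return False
    # Assumption: size is measured in characters.
    return element["min_size"] <= len(value) <= element["max_size"]


if __name__ == "__main__":
    for candidate in ("2002123", "12345", "20021A3"):
        print(candidate, conforms(candidate, STUDENT_ID))
```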

The scientific data properties
• Scientific data (knowledge) has the following properties:
  - General data
    - Most people can understand and use it easily.
    - Most databases in scientific fields have similar or identical data elements.
  - Specialized data
    - It is more complicated and detailed.
    - General users cannot understand it.
    - The experts in a specific group are interested in the data and can use it.
※ Building one MDR for all data as a whole is not necessary.

User levels and the data property
• Classification of users
  - Users are classified into two groups according to the scientific data property: general users and specialized users.
  - General users use the general data, at a high level and across many fields.
  - Specialized users
    - are domain experts in a specific field;
    - use both general data and specialized data;
    - are further differentiated into more detailed fields, i.e., the specialized users are distributed into several groups, and the experts in each group are independently interested in more specialized data.

Data visibility
• Data visibility
  - The quantity and the degree of specialization of the data are differentiated into several levels according to the knowledge property, and each level has an independent data set.
[Diagram: the whole data set is divided into set 1 to set 5: data used by all users (general users), data used by specialized users, and data used in independent expert domain groups (detailed-specialized users 1 ... n)]
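
One way to picture these visibility levels is to give every data set a scope and every user a position in the field hierarchy. The sketch below is an assumption made for illustration only: scopes are tuples, a user sees a data set when the set's scope is a prefix of the user's own scope, and the data set and field names are invented.

```python
from typing import Dict, List, Tuple

Scope = Tuple[str, ...]

# Hypothetical data sets: () is general data, longer tuples are more specialized.
DATA_SETS: Dict[str, Scope] = {
    "general data": (),
    "biology data": ("biology",),
    "taxonomy data": ("biology", "taxonomy"),
    "chemistry data": ("chemistry",),
}


def visible_sets(user_scope: Scope) -> List[str]:
    """Return the data sets whose scope is a prefix of the user's scope."""
    return [
        name
        for name, scope in DATA_SETS.items()
        if user_scope[: len(scope)] == scope
    ]


if __name__ == "__main__":
    print(visible_sets(()))                       # a general user
    print(visible_sets(("biology",)))             # a specialized user
    print(visible_sets(("biology", "taxonomy")))  # a detailed-specialized user
```

With this rule a general user sees only the general data, while an expert sees the general data plus the sets of their own field and sub-field.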

The conceptual relation diagram
[Diagram components: General User 1, General User 2, ..., General User n; Domain Expert 1, Domain Expert 2, ..., Domain Expert n; Global MDR; Local MDR 1 (Domain 1), Local MDR 2 (Domain 2), ..., Local MDR m (Domain m); databases DB 11 ... DB 1n (Domain 1), DB 21 ... DB 2n (Domain 2), ..., DB m1 ... DB mn (Domain m). Arrow labels: Localization, Globalization, Specialization, Generalization]

The conceptual model of the LoG
• The LoG methodology has four layers.
  - Interface Layer: provides the user interface environments for all users.
  - Global MDR Layer: manages the global MDR for the most generalized and common data, which all users (general and specialized) access and use.
  - Local MDR Layer: manages the local MDRs for the specialized data that the experts use. The local MDRs may have a hierarchical structure.
  - Factual Database Layer: manages the low-level, factual data.
[Diagram: User Interface Layer; Global MDR Layer (Generalized Layer); Local MDR Layer (Specialized Layer); Factual Database Layer]

A LoG Framework (1/2)
[Diagram of the framework layers:
  User Interface Layer: Global User Interface (General User Level Interface); Local User Interface (Expert Level Interface)
  Global MDR Layer: General User Level Interface Agent; GMDR Agent (Registration, Classification); GMDR; GMeta Repository
  Local MDR Layer: Expert Level Interface Agent; LMDR Agent (Registration, Classification, Authorization); LMDRs (LMDR 1, LMDR 2, ..., LMDR n); LMeta Repository (sets of actual metadata)
  Factual DB Layer: DB 11 ... DB 1n (Domain 1); DB 21 ... DB 2n (Domain 2); ...; DB m1 ... DB mn (Domain m)]

A LoG Framework (2/2)
• Interface Layer
  - Global user interface and local user interface sub-layers
• Global MDR layer
  - GMDR agent: manages the GMDR (global MDR) and the GMeta (global metadata repository).
  - GMDR (global MDR): a standardized guideline for general users and experts; the set of metadata elements used commonly in all databases.
  - GMeta (global metadata repository): the set of actual metadata.
• Local MDR layer
  - LMDR agent: manages the LMDRs and the LMeta.
  - LMDRs (local MDRs): a standardized guideline for the specialized users; a set of metadata elements that generalizes the data in each field or detailed field.

An Example
GMDR
  Name
    definition           The unique object name
    version              1
    registration status  standard
    datatype             character
    format               character(20)
LMDRs
  Biological Order Name
    definition           The systematic name that represents the biological species
    version              1
    registration status  standard
    datatype             character
    format               character(50)
  Chemical Molecular Formula Code
    definition           The code that represents the number of atoms of each element in a molecule of a chemical substance
    version              1
    registration status  standard
    datatype             character
    format               character(100)
[Diagram: one local MDR lists Name, Biological Order Name, ...; another lists Name, Chemical Molecular Formula Code, ...]
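
The example can be sketched as a two-level lookup. The element names and formats come from the slide; the domain keys "biology" and "chemistry", the `resolve` function, and the rule "local MDR first, then fall back to the global MDR" are assumptions made for illustration and are not specified in the slides.

```python
from typing import Dict, Optional

Element = Dict[str, str]

# Global MDR: elements common to all databases.
GMDR: Dict[str, Element] = {
    "Name": {"datatype": "character", "format": "character(20)"},
}

# Local MDRs, keyed by a hypothetical domain label.
LMDRS: Dict[str, Dict[str, Element]] = {
    "biology": {
        "Biological Order Name": {"datatype": "character", "format": "character(50)"},
    },
    "chemistry": {
        "Chemical Molecular Formula Code": {"datatype": "character", "format": "character(100)"},
    },
}


def resolve(element_name: str, domain: Optional[str] = None) -> Optional[Element]:
    """Look in the domain's local MDR first, then fall back to the global MDR."""
    if domain is not None and element_name in LMDRS.get(domain, {}):
        return LMDRS[domain][element_name]
    return GMDR.get(element_name)


if __name__ == "__main__":
    print(resolve("Name"))
    print(resolve("Biological Order Name", "biology"))
    print(resolve("Biological Order Name"))
```

A general user, who passes no domain, can resolve only the global element, while an expert in a domain can resolve both the global element and the local elements of that domain.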

Conclusions and Future work
• Conclusions
  - We considered and defined the domain knowledge property.
  - We proposed the LoG methodology, based on the knowledge property. It:
    - partially provides a dynamic integration mechanism;
    - provides a standardization guideline based on ISO/IEC 11179, the international standard;
    - reduces the unnecessary cost of analyzing and designing all databases to create a global view.
• Future work
  - To analyze and define the domain knowledge property in detail
  - To implement a prototype based on the framework we described

Q/A
Thanks!