Download presentation
Presentation is loading. Please wait.
Published byAlannah Clarke Modified over 9 years ago
1
Larry Fitzwater, U.S. EPA Judith Newton, NIST Lois Fritts, SAIC January 17, 2000 Open Forum on Metadata Registries Santa Fe, NM SDC-0002-021-JE-2026
2
Contents of Working Paper 1. Scope 2. References 3. Definitions 4. Types of Abstraction 5. Registry Population Annexes Annexes
3
SDC-0002-021-JE-2026Scope Describes business rules for the registration of data elements and their attributes in a registry, to assist in consistently establishing good quality data elements. Based on the model of a data registry described in ISO/IEC 11179, Part 3. Helps to achieve metadata content consistency through procedures and examples.
4
Judith Newton, NIST Open Forum on Metadata Registries Santa Fe, NM SDC-0002-021-JE-2026
5
Types of Abstraction Relevant to Data Elements 1. Specialization/generalization–all items in the superclass are also in the subclass 2. Decomposition/aggregation–the part-of relationship
6
SDC-0002-021-JE-2026Specialization/Generalization State USPS Code Geographic State USPS Code Mailing Address State USPS Code FACILITY Geographic State USPS Code CUSTOMER Geographic State USPS Code FACILITY Mailing Address State USPS Code CUSTOMER Mailing Address State USPS Code
7
SDC-0002-021-JE-2026Decomposition/Aggregation Country Identifier Country Subdivision Code County Code Borough Code Metropolitan District Code Unitary Auth. Code Special Area Code
8
Lois Fritts, SAIC Open Forum on Metadata Registries Santa Fe, NM SDC-0002-021-JE-2026
9
A Metadata Registry Can Be Overwhelming SDC-0002-021-JE-2026
10
This presentation is a practical approach to populating the content of a data registry for data elements. SDC-0002-021-JE-2026
11
Overview l General Procedures l Examples of Registration l Data Element Groups
12
SDC-0002-021-JE-2026 General Procedures 1. Understanding the data element 2. Content research 3. Population of metadata attributes 4. Classification 5. Quality control
13
SDC-0002-021-JE-2026 Population of Metadata Attributes l Bottom Up Approach l Top Down Approach A data element is attributed with known facts prior to defining the conceptual information about a data element. A classified group is added to the registry, beginning with conceptual domains, value domains, and working down to the individual data elements.
14
Data Element Definition Permissible Values Data Element Name and Identifiers Other Data Element Attributes Data Element Concept Conceptual Domain and Value Meanings Logical Bottom Up Process SDC-0002-021-JE-2026
15
Data Element Definition Permissible Values Data Element Name and Identifiers Other Data Element Attributes Data Element Concept Conceptual Domain and Value Meanings Logical Top Down Process SDC-0002-021-LF-1005SDC-0002-021-JE-2026
16
Other Attributes l Submitting Organization l Data Steward l Comment l Example l Origin u Document u System u Standard l Administrative
17
SDC-0002-021-JE-2026 Data Element Examples l ISO Standard–Enumerated l ISO Standard–Non-enumerated l Application System–Enumerated
18
SDC-0002-021-JE-2026 ISO Standard–Enumerated --United States --United States of America --US--USA--840--ÉTATS-UNIS --États-Unis d’Amérique ISO 3166 Country Identifiers Short English Name Long English Name 2-character abbrev. 3-character abbrev. 3-digit code Short French Name Long French Name
19
SDC-0002-021-JE-2026 Codes for Data Element Registration Definition (Def) Permissible Value (PV) Value Domain (VD) Value Domain Origin (VDO) Data Element Name and Identifiers (DEID) Data Element Name Context (CNTX) Data Element Concept (DEC) Conceptual Domain (CD) Classification (Cl) Layer of Abstraction (LA) Registration Status (RS)
20
SDC-0002-021-JE-2026 ISO 3166–Enumerated Def:The short name of a country, represented in the English language PV:Afghanistan, Albania,…Zimbabwe VD:Short English-language country names VDO:ISO 3166-1:1997 DEID:209033:1 Short English-language country name CNTX:Registry DEC:Country identifier CD:Countries of the world Cl:Geopolitical entities; country identifiers LA:Generalization RS:Standard
21
ISO Standard–Non-enumerated ISO 6709 Geographic Point Locations LatitudeLongitudeAltitude Latitude Sexagesimal Measure SDC-0002-021-LF-XXXX
22
SDC-0002-021-JE-2026 ISO 6709–Non-enumerated Def:The sexagesimal measure of the angular distance of a position on the earth on a meridian north or south of the equator PV: VD:Sexagesimal measures of latitude VDO:Not applicable DEID:312345:1 Latitude sexagesimal measure CNTX:Registry DEC:Latitude distance CD:Latitude coordinates Cl:Geographic point location LA:Generalization RS:Recorded
23
Application System–Enumerated 33c Name Street Address City, State Postal Code Country Mailing Address Country Name SDC-0002-021-JE-2026
24
Application Data Element Def: The name of a country where the addressee is located PV: Afghanistan, Albania,…Zimbabwe VD: Short English-language country names VDO: ISO 3166-1:1997 DEID: 5394:1 Mailing_Address.Country_Name CNTX: Facility data system DEC: Address country identifier CD: Countries of the world Cl: Mailing address LA: Specialization RS: Recorded
25
SDC-0002-021-JE-2026 1. Understanding the classified group 2. Specifying the data elements 3. Understanding the group’s source: – Name – Definition – Authority – Rationale – Potential usage – Identifier Register a Classification of Data Elements
26
SDC-0002-021-JE-2026 Data Element Classifications l Document l Standard l Composite data element
27
SDC-0002-021-JE-2026 Classify by Document Source:Facility Location and Identification Standard Definition:Core set of data elements that supports location and identification of place-based objects Authority:Federal Geographic Data Committee(FGDC) Rationale:Proposed U.S. National Standard Usage:Facilitates data sharing about facilities Identifier:1234
28
SDC-0002-021-JE-2026 Data Elements in Document Facility Name Facility Category Type Facility Identification Number Latitude Measure Longitude Measure
29
SDC-0002-021-JE-2026 Classify by Standard Source:Standard representation of latitude, longitude, and altitude for geographic point locations Definition:The horizontal and vertical coordinates that define a point on the earth Authority: ISO 6709 Rationale: International standard Usage: System developers to design a database entity and transfer data files database entity and transfer data files Identifier: 1345
30
SDC-0002-021-JE-2026 Data Elements in Standard Latitude Degrees Measure Longitude Degrees Measure Altitude Measure Latitude Sexagesimal Measure Longitude Sexagesimal Measure
31
SDC-0002-021-JE-2026 Classify by Composite Data Element Name: Urban-style street address Definition: A set of precise data elements that can be combined into a street address Authority: U.S. Postal Service, Publication 28: Postal Address Standards Rationale: U.S. national standard for creating a mail piece Usage: Parse street address for validation of individual segments Identifier: 2543
32
SDC-0002-021-JE-2026 Data Elements in Street Address Building Number Pre-directional Code Street Name Street Suffix Code Post-directional Code Secondary Unit Code Suite Number
33
Composite Data Element Example of data values for Urban-Style Street Address 200 N Glebe Road SW Suite 300 SDC-0002-021-JE-2026
34
Linking Data Elements l Vertical l Horizontal l Used Together
35
SDC-0002-021-JE-2026 Vertical Linking Generalization to Specialization State USPS Code Mailing Address State Code Facility Mailing Address State Code
36
SDC-0002-021-JE-2026 Horizontal Linking Equivalent Layer of Abstraction PCS_Permit_Facility.Mailing _State BRS_Site_Information.Mail _State RCR_Mailing_Location.State Facility Mailing Address State Code
37
SDC-0002-021-JE-2026 Linking by Use Sample Quantity Units Name Sample Quantity 17 milligrams Example:
38
This is a practical, logical approach to registering “good” data elements. SDC-0002-021-JE-2026
39
Discussion SDC-0002-021-JE-2026
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.