Larry Fitzwater, U.S. EPA Judith Newton, NIST Lois Fritts, SAIC January 17, 2000 Open Forum on Metadata Registries Santa Fe, NM SDC JE-2026
Contents of Working Paper 1. Scope 2. References 3. Definitions 4. Types of Abstraction 5. Registry Population Annexes Annexes
SDC JE-2026Scope Describes business rules for the registration of data elements and their attributes in a registry, to assist in consistently establishing good quality data elements. Based on the model of a data registry described in ISO/IEC 11179, Part 3. Helps to achieve metadata content consistency through procedures and examples.
Judith Newton, NIST Open Forum on Metadata Registries Santa Fe, NM SDC JE-2026
Types of Abstraction Relevant to Data Elements 1. Specialization/generalization–all items in the superclass are also in the subclass 2. Decomposition/aggregation–the part-of relationship
SDC JE-2026Specialization/Generalization State USPS Code Geographic State USPS Code Mailing Address State USPS Code FACILITY Geographic State USPS Code CUSTOMER Geographic State USPS Code FACILITY Mailing Address State USPS Code CUSTOMER Mailing Address State USPS Code
SDC JE-2026Decomposition/Aggregation Country Identifier Country Subdivision Code County Code Borough Code Metropolitan District Code Unitary Auth. Code Special Area Code
Lois Fritts, SAIC Open Forum on Metadata Registries Santa Fe, NM SDC JE-2026
A Metadata Registry Can Be Overwhelming SDC JE-2026
This presentation is a practical approach to populating the content of a data registry for data elements. SDC JE-2026
Overview l General Procedures l Examples of Registration l Data Element Groups
SDC JE-2026 General Procedures 1. Understanding the data element 2. Content research 3. Population of metadata attributes 4. Classification 5. Quality control
SDC JE-2026 Population of Metadata Attributes l Bottom Up Approach l Top Down Approach A data element is attributed with known facts prior to defining the conceptual information about a data element. A classified group is added to the registry, beginning with conceptual domains, value domains, and working down to the individual data elements.
Data Element Definition Permissible Values Data Element Name and Identifiers Other Data Element Attributes Data Element Concept Conceptual Domain and Value Meanings Logical Bottom Up Process SDC JE-2026
Data Element Definition Permissible Values Data Element Name and Identifiers Other Data Element Attributes Data Element Concept Conceptual Domain and Value Meanings Logical Top Down Process SDC LF-1005SDC JE-2026
Other Attributes l Submitting Organization l Data Steward l Comment l Example l Origin u Document u System u Standard l Administrative
SDC JE-2026 Data Element Examples l ISO Standard–Enumerated l ISO Standard–Non-enumerated l Application System–Enumerated
SDC JE-2026 ISO Standard–Enumerated --United States --United States of America --US--USA ÉTATS-UNIS --États-Unis d’Amérique ISO 3166 Country Identifiers Short English Name Long English Name 2-character abbrev. 3-character abbrev. 3-digit code Short French Name Long French Name
SDC JE-2026 Codes for Data Element Registration Definition (Def) Permissible Value (PV) Value Domain (VD) Value Domain Origin (VDO) Data Element Name and Identifiers (DEID) Data Element Name Context (CNTX) Data Element Concept (DEC) Conceptual Domain (CD) Classification (Cl) Layer of Abstraction (LA) Registration Status (RS)
SDC JE-2026 ISO 3166–Enumerated Def:The short name of a country, represented in the English language PV:Afghanistan, Albania,…Zimbabwe VD:Short English-language country names VDO:ISO :1997 DEID:209033:1 Short English-language country name CNTX:Registry DEC:Country identifier CD:Countries of the world Cl:Geopolitical entities; country identifiers LA:Generalization RS:Standard
ISO Standard–Non-enumerated ISO 6709 Geographic Point Locations LatitudeLongitudeAltitude Latitude Sexagesimal Measure SDC LF-XXXX
SDC JE-2026 ISO 6709–Non-enumerated Def:The sexagesimal measure of the angular distance of a position on the earth on a meridian north or south of the equator PV: VD:Sexagesimal measures of latitude VDO:Not applicable DEID:312345:1 Latitude sexagesimal measure CNTX:Registry DEC:Latitude distance CD:Latitude coordinates Cl:Geographic point location LA:Generalization RS:Recorded
Application System–Enumerated 33c Name Street Address City, State Postal Code Country Mailing Address Country Name SDC JE-2026
Application Data Element Def: The name of a country where the addressee is located PV: Afghanistan, Albania,…Zimbabwe VD: Short English-language country names VDO: ISO :1997 DEID: 5394:1 Mailing_Address.Country_Name CNTX: Facility data system DEC: Address country identifier CD: Countries of the world Cl: Mailing address LA: Specialization RS: Recorded
SDC JE Understanding the classified group 2. Specifying the data elements 3. Understanding the group’s source: – Name – Definition – Authority – Rationale – Potential usage – Identifier Register a Classification of Data Elements
SDC JE-2026 Data Element Classifications l Document l Standard l Composite data element
SDC JE-2026 Classify by Document Source:Facility Location and Identification Standard Definition:Core set of data elements that supports location and identification of place-based objects Authority:Federal Geographic Data Committee(FGDC) Rationale:Proposed U.S. National Standard Usage:Facilitates data sharing about facilities Identifier:1234
SDC JE-2026 Data Elements in Document Facility Name Facility Category Type Facility Identification Number Latitude Measure Longitude Measure
SDC JE-2026 Classify by Standard Source:Standard representation of latitude, longitude, and altitude for geographic point locations Definition:The horizontal and vertical coordinates that define a point on the earth Authority: ISO 6709 Rationale: International standard Usage: System developers to design a database entity and transfer data files database entity and transfer data files Identifier: 1345
SDC JE-2026 Data Elements in Standard Latitude Degrees Measure Longitude Degrees Measure Altitude Measure Latitude Sexagesimal Measure Longitude Sexagesimal Measure
SDC JE-2026 Classify by Composite Data Element Name: Urban-style street address Definition: A set of precise data elements that can be combined into a street address Authority: U.S. Postal Service, Publication 28: Postal Address Standards Rationale: U.S. national standard for creating a mail piece Usage: Parse street address for validation of individual segments Identifier: 2543
SDC JE-2026 Data Elements in Street Address Building Number Pre-directional Code Street Name Street Suffix Code Post-directional Code Secondary Unit Code Suite Number
Composite Data Element Example of data values for Urban-Style Street Address 200 N Glebe Road SW Suite 300 SDC JE-2026
Linking Data Elements l Vertical l Horizontal l Used Together
SDC JE-2026 Vertical Linking Generalization to Specialization State USPS Code Mailing Address State Code Facility Mailing Address State Code
SDC JE-2026 Horizontal Linking Equivalent Layer of Abstraction PCS_Permit_Facility.Mailing _State BRS_Site_Information.Mail _State RCR_Mailing_Location.State Facility Mailing Address State Code
SDC JE-2026 Linking by Use Sample Quantity Units Name Sample Quantity 17 milligrams Example:
This is a practical, logical approach to registering “good” data elements. SDC JE-2026
Discussion SDC JE-2026