Data Type Registry Data set descriptions for automation

Slides:



Advertisements
Similar presentations
Three-Step Database Design
Advertisements

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
DC2001, Tokyo DCMI Registry : Background and demonstration DC2001 Tokyo October 2001 Rachel Heery, UKOLN, University of Bath Harry Wagner, OCLC
SDMX in the Vietnam Ministry of Planning and Investment - A Data Model to Manage Metadata and Data ETV2 Component 5 – Facilitating better decision-making.
Database Planning, Design, and Administration
Chapter 4 Enterprise Modeling.
SYSTEM ANALYSIS & DESIGN (DCT 2013)
Database Planning, Design, and Administration Transparencies
PREMIS What is PREMIS? o Preservation Metadata Implementation Strategies When is PREMIS use? o PREMIS is used for “repository design, evaluation, and archived.
Organizing Data Chapter 5. Data Hierachy Table = Entities X Attributes Entities = Records Attributes = Fields.
1 SWE Introduction to Software Engineering Lecture 15 – System Modeling Using UML.
Geospatial standards Beyond FGDC Geog 458: Map Sources and Errors March 3, 2006.
Lecture Nine Database Planning, Design, and Administration
Requirements Gathering : Determining the scope of the system 1. Elicitiation – fact finding 2. Specification 3. Validation.
Chapter 9 Database Planning, Design, and Administration Sungchul Hong.
Database System Development Lifecycle © Pearson Education Limited 1995, 2005.
Overview of the Database Development Process
Project “European CDDA and INSPIRE”: scope, transformation workflow and mapping rules INSPIRE Conference 2014 Workshop: Implementing Existing European.
DATA FOUNDATION TERMINOLOGY WG 4 th Plenary Update THE PLUM GOALS This model together with the derived terminology can be used Across communities and stakeholders.
U.S. Department of the Interior U.S. Geological Survey NWIS, STORET, and XML National Water Quality Monitoring Council August 20, 2003.
ITEC224 Database Programming
SOEN 6011 Software Engineering Processes Section SS Fall 2007 Dr Greg Butler
An Experiment on Spatial Data Exchange October 24, 2001 MURAO, Yoshiaki (IBM Japan) ISO/TC211 Workshop on Standards in Action.
1 Chapter 15 Methodology Conceptual Databases Design Transparencies Last Updated: April 2011 By M. Arief
XSLT transforms Mapping from Different Metadata Standards.
North American Profile: Partnership across borders. Sharon Shin, Metadata Coordinator, Federal Geographic Data Committee Raphael Sussman; Manager, Lands.
BMAN Integrative Team Project Week 2 Professor Linda A Macaulay.
1 Minggu 9, Pertemuan 17 Database Planning, Design, and Administration Matakuliah: T0206-Sistem Basisdata Tahun: 2005 Versi: 1.0/0.0.
Data Management David Nathan & Peter Austin & Robert Munro.
SOEN 343 Software Design Section H Fall 2006 Dr Greg Butler
Database Planning, Design, and Administration Transparencies
Information Systems & Databases 2.2) Organisation methods.
 Three-Schema Architecture Three-Schema Architecture  Internal Level Internal Level  Conceptual Level Conceptual Level  External Level External Level.
AUKEGGS Architecturally Significant Issues (that we need to solve)
CSDGM Overview CSDGM Tools and Resources. Resources Series Materials: ftp://ftp.ncddc.noaa.gov/pub/Metadata/Online_ISO_Tr aining/Intro_to_Geospatial_Metadata/
Mapping the IntesaGIS model to the EuroRoads model Alberto Belussi Università di Verona Giuseppe Pelagatti, Mauro Negri Politecnico di Milano ESDI Workshop.
Adoption of RDA-DFT Terminology and Data Model to the Description and Structuring of Atmospheric Data Aaron Addison, Rudolf Husar, Cynthia Hudson-Vitale.
CaDSR Software Users Meeting 3.1 Requirements Review 9/19/2005 caDSR Software Team Host: Denise Warzel NCICB, Assistant Director, caDSR.
XML Schema. Why Validate XML? XML documents can generally have any structure XML grammars define specific document structures Validation is the act of.
WIGOS Data model – standards introduction.
2.An overview of SDMX (What is SDMX? Part I) 1 Edward Cook Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October 2015.
Winter 2011SEG Chapter 11 Chapter 1 (Part 1) Review from previous courses Subject 1: The Software Development Process.
Silverstein Group Presenter Moshe Silverstein A Content Assembly Mechanism Technology Overview Context & Integration A Content Assembly Mechanism Technology.
Metadata Driven Aspect Specification Ricardo Ferreira, Ricardo Raminhos Uninova, Portugal Ana Moreira Universidade Nova de Lisboa, Portugal 7th International.
Viewpoint Modeling and Model-Based Media Generation for Systems Engineers Automatic View and Document Generation for Scalable Model- Based Engineering.
Geospatial metadata Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
Data Type Registries #2 Co-Chairs: RDA Chairs’ Mtg Gothenburg
Data Modeling Using the Entity- Relationship (ER) Model
OGP Seabed Survey Data Model (SSDM)
Lesson 10 Databases.
Entity- Relationship (ER) Model
Data Type Registries #2 12 Month Status Larry Lannom, Tobias Weigel Date Location TBD? CC BY-SA 4.0.
System sequence diagrams
Data Type Registries Breakout
RDA Plenary 9 Breakout Session
Data Type Registries (DTR)
Data Modeling II XML Schema & JAXB Marc Dumontier May 4, 2004
The Re3gistry software and the INSPIRE Registry
Integrating Survey data into a geographic information system
Chapter 4 Entity Relationship (ER) Modeling
2. An overview of SDMX (What is SDMX? Part I)
2. An overview of SDMX (What is SDMX? Part I)
Data Model.
Database Systems Instructor Name: Lecture-3.
Session 2: Metadata and Catalogues
Grid Based Data Integration with Automatic Wrapper Generation
Database Design Hacettepe University
Digital AIM Training - AIXM
Entity vs Datatype.
Task 55 Scope – TOSCA Profile
Presentation transcript:

Data Type Registry Data set descriptions for automation Stephen M Richard, IEDA, EarthCube RDA Plenary 8, Denver, CO

Use cases Document the meaning of entities and attributes in data. Re-use of data type and attribute definitions Machine-assisted data integration: matching attribute content. Validation of data instances against a type definition. Tools that spin up a UI for a particular data type. Link software to data sources that it can use Support file introspection to assist with deep data registration

Use Case Scenario: Data integration For demonstration, we are interested in hydraulic conductivity Different field names, table layouts Workflow: Import to document store using source data structure Inspect the field names and values to propose mapping to known data types Curate to valid and refine mappings

Role of DTR in processing Pipeline

DTR model update: Some Input Sources RDA data Type registry prototype (WG output, March 2015) ISO19110 ISO11179 Clause 11 CSDGM Entity/Attribute OGC SWE Common Data Model NetCDF common data model Apache AVRO XML schema

The model.. Thing Property DataObject Attribute Conceptualization of the world Thing has Property 0..many meaning representation has DataObject Attribute Some properties on a thing might be essential, others might be optional. A property may have links back to the Things (ObjectClass in the UML) that can carry that property. A property may have a conceptual domain defined that scopes the intention of the values for that property. 0..many Representation of the world

Representation.. Attribute DataType DataObject Array Dictionary List has Attribute 0..many has DataType DataObject Array Dictionary List Primitive

Simple ‘Flat File’ DataObject = meanPorosity observation Well Name location depthTop_ft depthBottom_ft meanPorosity lithology formation source DCL & FA #1 SE1/4NW1/4NW1/4, sec. 13, T. 12 S., R. 17 E. 2,420 2,443 0.105 vuggy dolostone Jefferson City Dolomite. Gogel, 1981; reported by usgs2321 Table 2 Geis #1 SW1/4SW1/4SW1/4, sec. 32, T. 13 S., R. 2 W. 1,980 2,200 0.13 oolitic limestone Lansing and Kansas City Groups. 3,482 3,493 0.074 dolostone Roubidoux Formation. SE1/4NW1/4NW1/4, sec. 13, R. 17 E. 2,934 2,985 0.92 calcareous sandstone and granite Lamotte Sandstone and Precambrian rock. 2,616 2,804 0.181 cherty limestone and dolostone Warsaw, Keokuk, and Burlington Limestones. 2,944 3,046 0.125 porous dolostone Hunton Group. DataObject = meanPorosity observation Attributes: Well Name Location depthTop_ft depthBottom_ft meanPorosity lithology formation source Values all have datatype = primitive DataObject Attribute DataType Primitive

More complex DataObject <Description> <scope> <scopeCode>service</scopeCode> <name>Internal service</name> </scope> <dateInfo> <date>2011-11-11T00:00:00</date> <dateType>creation</dateType> </dateInfo> </Description> attribute attribute Data object attribute Data object attribute attribute Data object attribute

For Proposed Model Details see http://usgin.github.io/usginspecs/DataTypeModel- current.htm