Extended Metadata Registry (XMDR)
September 2004
Bruce Bargmeyer, +1 (510)
Interagency/International Cooperation on Ecoinformatics
Brussels, Belgium
2 Past, Present, Future
Lots of users, lots of information systems, lots of data sources.
[Figure labels: Users; EEA, DOE, DoD, EPA, Others... Each source holds text and data on the same topics (environ, agriculture, climate, human health, industry, tourism, soil, water, air), some of them labeled in Spanish (ambiente, agricultura, tiempo, salud humano, industria, turismo, tierra, agua, aero).]
3 Data Standards
- Avoid a combinatorial explosion of data content, description, and metadata arrangements for information storage, access and interchange.
Data standards and metadata registries can help.
4 Data Element Concept
Data Element Concept record: Name: Country Identifiers; Context; Definition; Unique ID: 5769; Conceptual Domain (Afghanistan, Belgium, China, Denmark, Egypt, France, Germany, ...); Maintenance Org.; Steward; Classification; Registration Authority; Others.
Data Elements (ISO 3166): English Name (Afghanistan, Belgium, China, Denmark, Egypt, France, Germany, ...); ISO Numeric Code (...); ISO Alpha Code (AFG, BEL, CHN, DNK, EGY, FRA, DEU, ...).
Three data element records (Unique IDs 4572, 3820, 1047), each with: Name; Context; Definition; Unique ID; Value Domain; Maintenance Org.; Steward; Classification; Registration Authority; Others.
5 [Figure: ISO alpha codes mapped to country names]
AFG  Afghanistan
BEL  Belgium
CHN  China
DNK  Denmark
EGY  Egypt
FRA  France
DEU  Germany
...
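The relationship on slides 4 and 5 (one data element concept, several data elements that represent it with different value domains) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the class names `DataElementConcept` and `DataElement` and the helper `translate` are hypothetical and are not taken from the registry standard or from XMDR; only the country names, the alpha codes, the concept name, and its Unique ID 5769 come from the slides.

```python
from dataclasses import dataclass
from typing import Dict, Tuple


@dataclass
class DataElementConcept:
    """The shared meaning behind several representations (hypothetical class)."""
    unique_id: int
    name: str
    conceptual_domain: Tuple[str, ...]


@dataclass
class DataElement:
    """One representation of a concept: permissible value -> value meaning."""
    name: str
    concept: DataElementConcept
    value_domain: Dict[str, str]


COUNTRIES = ("Afghanistan", "Belgium", "China", "Denmark", "Egypt", "France", "Germany")
ALPHA_CODES = ("AFG", "BEL", "CHN", "DNK", "EGY", "FRA", "DEU")

# Concept name and Unique ID as shown on slide 4.
country_identifiers = DataElementConcept(5769, "Country Identifiers", COUNTRIES)

# Two data elements that represent the same concept in different ways.
alpha_code = DataElement(
    "ISO Alpha Code", country_identifiers, dict(zip(ALPHA_CODES, COUNTRIES))
)
english_name = DataElement(
    "ISO 3166 English Name", country_identifiers, {c: c for c in COUNTRIES}
)


def translate(value: str, source: DataElement, target: DataElement) -> str:
    """Map a value from one data element to another via their shared meaning."""
    if source.concept.unique_id != target.concept.unique_id:
        raise ValueError("data elements do not share a data element concept")
    meaning = source.value_domain[value]
    for candidate, candidate_meaning in target.value_domain.items():
        if candidate_meaning == meaning:
            return candidate
    raise KeyError(meaning)
```

With these definitions, `translate("DEU", alpha_code, english_name)` yields the English name for Germany, and the reverse call maps a name back to its alpha code. The point of the design is that the two systems never need a direct mapping between each other, only a registration against the shared concept.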
6 Metadata Registries Semantics Management Evolution
- Database (schema) integration
  - System design
- Data use: metadata
- Warehouse support: schema and metadata
- XML support (schema)
- Backed into terminology support
- Next: Semantics servers, for the Semantic Web and semantics-based computing
7 Metadata Registries
[Figure: a Metadata Registry linking Terminology, Thesaurus, Themes, Data Standards, Ontology, GEMET, Structured Metadata, and ISO/IEC Metadata Registries]
8 Elements of Terminology
[Figure: Concept, Sign, Object]
Signs: trout, Salmo trutta, brown trout, truite
Concept: any of several game fishes of the genus Salmo, related to the salmon...
9 Terminology
Term          Context
trout         common name
Salmo trutta  scientific name
truite        French name
Concept (UIN=6349): any of several game fishes of the genus Salmo, related to the salmon...
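The term/context/concept structure on this slide can be expressed as a small lookup sketch. The class names `Concept` and `Terminology` and their methods are hypothetical and chosen only for illustration; the terms, contexts, definition text, and UIN are those shown on the slide.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class Concept:
    """A concept identified by a UIN, with one term per context (hypothetical)."""
    uin: int
    definition: str
    terms: Dict[str, str] = field(default_factory=dict)  # context -> term

    def term_for(self, context: str) -> Optional[str]:
        """Return the term used for this concept in the given context, if any."""
        return self.terms.get(context)


class Terminology:
    """Index from any registered term (case-insensitive) to its concept."""

    def __init__(self) -> None:
        self._by_term: Dict[str, Concept] = {}

    def register(self, concept: Concept) -> None:
        for term in concept.terms.values():
            self._by_term[term.lower()] = concept

    def lookup(self, term: str) -> Optional[Concept]:
        return self._by_term.get(term.lower())


trout = Concept(
    uin=6349,
    definition="any of several game fishes of the genus Salmo, related to the salmon...",
    terms={
        "common name": "trout",
        "scientific name": "Salmo trutta",
        "French name": "truite",
    },
)

terminology = Terminology()
terminology.register(trout)
```

Looking up any of the three terms returns the same concept, so a system that only knows "truite" can still ask for the scientific name via `terminology.lookup("truite").term_for("scientific name")`.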
10 Terms and contexts for the concept UIN=6349: Brown trout (common name), Salmo trutta (scientific name), truite (French name)
Data Elements
Name: trout species
Definition: The names of species of trout.
Values:
  brook trout      Salvelinus fontinalis
  brown trout      Salmo trutta
  cutthroat trout  Oncorhynchus clarkii
11 [Figure: the concept UIN=6349 (Brown trout, common name; Salmo trutta, scientific name; truite, French name) and its Data Elements, used in:]
- DBMS Query. Systems: STORET, Envirofacts...
- Data Interchange: XML Schemas, EDI Messages
- Publishing: Federal Register, Regulations, Reports, Documents
12 Search Example: Trout
Search terms: Fish, Trout
[Figure: a Search Engine over Documents and Data, backed by a Thesaurus]
Concept (UIN=6349): Brown trout (common name), Salmo trutta (scientific name), truite (French name)
Thesaurus: Salmo trutta, Brown Trout, trout, fish
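A thesaurus-backed search of the kind this slide illustrates might work roughly as follows. This is a minimal sketch, not the search engine shown in the deck: the `NARROWER` and `SYNONYMS` tables encode an assumed reading of the thesaurus entries (fish is broader than trout, trout is broader than brown trout, and Salmo trutta is a synonym of brown trout), and the document titles are invented for illustration.

```python
from typing import Dict, List

# Assumed thesaurus relations, read off the slide's thesaurus entries.
NARROWER: Dict[str, List[str]] = {
    "fish": ["trout"],
    "trout": ["brown trout"],
}
SYNONYMS: Dict[str, List[str]] = {
    "brown trout": ["salmo trutta"],
}


def expand(term: str) -> List[str]:
    """Return the term plus all narrower terms and synonyms, lowercased."""
    seen: List[str] = []
    stack = [term.lower()]
    while stack:
        current = stack.pop()
        if current in seen:
            continue
        seen.append(current)
        stack.extend(SYNONYMS.get(current, []))
        stack.extend(NARROWER.get(current, []))
    return seen


def search(term: str, documents: List[str]) -> List[str]:
    """Return documents that mention the term or any of its expansions."""
    terms = expand(term)
    return [doc for doc in documents if any(t in doc.lower() for t in terms)]


# Hypothetical document titles, for illustration only.
DOCUMENTS = [
    "Salmo trutta population survey",
    "Soil quality report",
    "Trout stocking schedule",
]
```

A plain keyword match on "fish" finds none of these titles, while `search("fish", DOCUMENTS)` returns the first and third, because the query is expanded through the thesaurus before matching.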
13 Intelligent Information Services (IIS): Ontology
Example: fish, trout, brown trout (broader to narrower)
[Figure: Query Agent, Broker, Mediator, and Resource Agent, with Local Mapping and Central Mapping to the concept UIN=6349: Brown trout (common name), Salmo trutta (scientific name), truite (French name)]
14 Semantic Mapping
15 Terminology Sources
- GEneral Multilingual Environmental Thesaurus (GEMET) and CNR Earth Thesaurus
- DOE, NIH and NCI
- Safety and Health
Concepts/terms and Data Elements. Concept (UIN=6349): Brown trout (common name), Salmo trutta (scientific name), truite (French name)
I/ICE Participants: Ecoterm, Government, State/Local, Private Enterprise, Academe
16 Terminology Management
Dictionary, Keyword, Ontology, Thesaurus, Data Elements
[Figure: Search Engine, DBMS/EDI/Documents, and IIS, shown with a Semantics Server]
Example definition: a category of vertebrate, cold-blooded craniate animals with permanent gills...
17 Purpose of XMDR
- Extend Semantics Management Capabilities of ISO/IEC
- Test & Demo Extended Capabilities in a Reference Implementation
- Produce Design for Next Generation Operational Registries
  - Propose Revisions to Parts 2 & 3 (Ver. 3)
- Adapt & Adopt Emerging (Semantic) Technologies
- Help Resolve Registration & Interrelation Issues for Complex Metadata Standards
Forging Semantics Based Computing
18 Project Background
- Collaborative, Interagency Effort
  - DOD, EPA, LBNL, USGS, NCI, Mayo Clinic…Others?
- Draws on and Contributes to Interagency/International Cooperation on Ecoinformatics
- Involves International, National, State, Local Government Agencies, other Organizations
- Recognizes Great Potential of Semantics-based Computing, Management of Metadata
  - Improving Collection, Maintenance, Dissemination, Processing of Very Diverse Data Structures
- Collaboration Arises from Need to Share Diverse Data Across Multiple Organizations
- Project Duration Expected to be July 04 – Jun 05, +
Many Players, Many Interests…Shared Context
19 Concept Of Operations
- Service Oriented Architecture
  - Enables Heterogeneous, Disparate Systems to Interoperate
  - Agreement in the Interface, Not the Implementation
  - Publish, Find, Bind
- Standards Based Design
  - Lifecycle Application Support
  - Abstract Technology Commonalities
  - Open Standards, Technology Agnostic
  - Durable: Used for Current and Future Technologies
- Semantic Web Service
  - Publish, Find, Bind…Automatically
  - Component of Semantic Web
  - Bootstrap Semantic Web?
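The "Publish, Find, Bind" pattern and the idea of "Agreement in the Interface, Not the Implementation" can be illustrated with an in-process sketch. Everything here is hypothetical: `ServiceRegistry`, `bind_and_call`, and the `"term-lookup"` interface name are invented for the example and say nothing about how XMDR itself is built. A real service-oriented deployment would publish to a network-accessible registry rather than a Python dictionary.

```python
from typing import Callable, Dict, Optional

# The agreed interface: a service takes a term and returns a string.
Service = Callable[[str], str]


class ServiceRegistry:
    """Minimal in-memory registry keyed by interface name (hypothetical)."""

    def __init__(self) -> None:
        self._services: Dict[str, Service] = {}

    def publish(self, interface: str, provider: Service) -> None:
        """Publish: a provider advertises an implementation of an interface."""
        self._services[interface] = provider

    def find(self, interface: str) -> Optional[Service]:
        """Find: a consumer looks up a provider by interface name only."""
        return self._services.get(interface)


def bind_and_call(registry: ServiceRegistry, interface: str, argument: str) -> str:
    """Bind: the consumer invokes whatever provider is currently published."""
    provider = registry.find(interface)
    if provider is None:
        raise LookupError(f"no provider published for {interface!r}")
    return provider(argument)


# Two interchangeable implementations of the same interface.
SCIENTIFIC_NAMES = {"brown trout": "Salmo trutta"}


def table_lookup(term: str) -> str:
    return SCIENTIFIC_NAMES.get(term.lower(), "unknown")


def stub_lookup(term: str) -> str:
    return "unknown"


registry = ServiceRegistry()
registry.publish("term-lookup", table_lookup)
```

The consumer code in `bind_and_call` never names `table_lookup` or `stub_lookup`; replacing one with the other via `registry.publish` changes the behavior without changing the consumer, which is the interoperability property the slide describes.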
20 Major Tasks, Deliverables & Milestones
Task/Deliverable                                              Date
Develop Project Plan                                          Jul 04
Identify, Select Technologies                                 Dec 04
Identify, Select Metadata Sources                             Dec 04
Initial Architecture Design                                   Aug 04
Research and Development                                      < EOP
System Test & Evaluation (Internal Participants)              < Mar 05
Test Implementation (External Users)                          Mar 05
Present Proposed Part 2 Revisions to SC32 WG2 mtg in DC       Nov 04
Prepare Draft Revision of Part 2 for SC32 mtg in Berlin       Apr 05
21 Potential Standards/Technologies
- DBMS
  - Object, XML, Relational, RDF/Graph, Logic, Text, Document, Multimedia
- Knowledge Representation
  - Web Ontology Language (OWL)
  - Simple Common Logic (SCL)
- Middleware/Messaging
  - Cocoon 2, Jini, CoABS, JMS, XMLBlaster, SOAP
- XML [Semantic] Web Services
  - Axis, JWSDP
- Agent Development
  - ABLE, JADE
- Engines/Servers
  - OMS (IBM), Federator/OMS (OWI)
  - Jess
22 ISO/IEC Expressed as an Ontology <rdf:RDF xmlns:rdf=" xmlns:rdfs=" xmlns:owl=" xmlns=" xml:base=" <owl:cardinality rdf:datatype=" >1
23 Potential Content Domains
- Environmental (Ecoterm, GBIF, …)
- Biomedical
- Chemical
- Geographic Information Systems
- Bibliographic Ontologies/Metadata Standards
- General Terminologies/Ontologies
- Economic Code Sets
- Other Diverse Domains, Structures…Representative Samples
24 Other Calendar Events
Date         Event                              Location      Comments
Jul 04       Project Kickoff                    Berkeley, CA  LBNL Hosted
7-11 Sep 04  MedInfo 2004                       San Fran, CA
Oct          QPR                                Berkeley, CA  Quarterly Project Review
4-6 Nov 04   FOIS                               Torino, IT    Formal Ontologies in Info. Systems
7-11 Nov 04  ISWC                               Hiroshima     Intl Semantic Web Conference
8-12 Nov 04  SC32/WG2 Meeting                   DC            Provide Progress Report
Apr 05       Open Forum on Metadata Registries  Berlin        Report Revisions
Involved in Other Disciplines
[Figure: Metadata Registries; Users (Companies, Universities, Agencies, Others); Data Services; Semantic Services; Environmental Data Grid; Environmental Computer Grid (High Performance, cluster, Personal); Environmental Semantics Grid (Terminology, Thesaurus, Ontology, Taxonomy, Structured Metadata); Computation Services; Software (Models, Visualization, Analysis); Agent systems; Semantic Based Computing; Data Standards]
26 XMDR & I/ICE
- How do we collaborate?
  - Ecoterm
  - GBIF
  - EPA EDR/TRS
  - EEA Data dictionary
  - NIH/NCI
  - Agriculture