Presentation is loading. Please wait.

Presentation is loading. Please wait.

1 Roadmap to XML Richard Marciano Research Scientist San Diego Supercomputer Center

Similar presentations


Presentation on theme: "1 Roadmap to XML Richard Marciano Research Scientist San Diego Supercomputer Center"— Presentation transcript:

1 1 Roadmap to XML Richard Marciano Research Scientist San Diego Supercomputer Center marciano@sdsc.edu

2 XML Roadmap @SCA 2001 – Santa Rosa 2 Outline ~ 9:00 – 10:00 –XML core overview, the XML 1.0 Specification: syntax, namespaces, DTDs,... ~ 10:15 – 11:15 –XML content creation tools used to create XML, … ~ 11:30 – 12:30 –XML content retrieval browsers, XSLT, … ---------------------------------------------------------------------------- ~ 2:00 – 3:30 –New XML directions knowledge and XML: Topic Maps, Semantic Web, Maps, … ~ 4:00 – 5:00 –XML for archivists uses of XML for archivists, tools?, other uses?, needs?

3 3 New XML Directions 2:00 p.m. – 3:30 p.m. ** 3:30 p.m. – 4:00 p.m. BREAK **

4 XML Roadmap @SCA 2001 – Santa Rosa 4 The Semantic Web The Semantic Web, Scientific American, May 2001, Tim Berners-Lee The Semantic Web Extension of the Web (with Knowledge & Meaning) –Where: data on the Web is defined and linked in a way that it can be used by machines not just for display purposes, but for automation, integration and reuse of data across various applications Provide a language that expresses both data and rules for reasoning about the data and that allows rules from any existing knowledge- representation system to be exported onto the Web Adding logic to the Web—the means to use rules to make inferences, choose courses of action and answer questions Important technologies: –XML –RDF with RDF, triples form webs of information about related things –collections of information called Ontologies In philosophy, an ontology is a theory about the nature of existence Here it’s a document or file that formally defines the relations among terms. The most typical kind of ontology for the Web has a taxonomy and a set of inference rules. –The taxonomy defines classes of objects and relations among them –Inference rules help further manipulate the terms

5 XML Roadmap @SCA 2001 – Santa Rosa 5 Normalized Data/Metadata Representation Resource Description Framework (RDF)RDF –Metadata model –The designer can describe objects, add properties to define and describe them, and also make complicated statements about the objects (statements about relationships between resources). –The specification comes in two sections: Model & Syntax (viewed as directed, labeled graphs) RDF Schemas (using an XML vocabulary)

6 XML Roadmap @SCA 2001 – Santa Rosa 6 Resource Description Framework (RDF) Metadata is useful for information retrieval (esp. if no other schema info or semantics is available) Idea: representation independent encoding of metadata as triples (Resource, PropertyType, Value): –(uri1, DC:creator, uri2), (uri2, vCard:name, smith),... "Semantic Net" uri1 uri2 DC:creator smith vCard:name

7 XML Roadmap @SCA 2001 – Santa Rosa 7 Ora Lassila is the creator of the resource http://www.w3.org/Home/Lassila. Subject (Resource) http://www.w3.org/Home/Lassila Predicate (Property) Creator Object (literal) "Ora Lassila" Figure 1: Simple node and arc diagram http://www.w3.org/Home/Lassila Ora Lassila Creator Figure 1: Simple node and arc diagram

8 XML Roadmap @SCA 2001 – Santa Rosa 8 Ora Lassila is the creator of the resource http://www.w3.org/Home/Lassila. Figure 1: Simple node and arc diagram RDF/XML: Ora Lassila namespace prefix 's' refers to a specific namespace prefix chosen by the author of this RDF expression and defined in an XML namespace declaration such as: xmlns:s="http://description.org/schema/"

9 9 TOPIC MAPS ISO/IEC 13250 (Jan. 2000) Bridging knowledge representation & information management STANDARD FOR: describing knowledge structures associating them with information resources solution for organizing and navigating large and large information pools XTM SPECIFICATION

10 10 New paradigm for K. navigation & synthesis Concept of creating style sheets for K.- based information access and navigation “GPS for the Web” TM’s define semantically customized views TOPIC MAPS

11 XML Roadmap @SCA 2001 – Santa Rosa 11 T is for Topic TopicsTopic typesTopic names The TAO of Topic Maps McCain, John North Carolina S.1019 Senate: Budget Senate: Finance Helms, Jesse McCain, John North Carolina S.1019 Senate: Budget Senate: Finance Helms, Jesse Relief of Edwards North Carolina Senate: Budget SBC School Lunch Senate: Budget Helms, Jesse SHJ Nov 4, 19999

12 XML Roadmap @SCA 2001 – Santa Rosa 12 O is for Occurence OccurrencesOccurrence Roles The TAO of Topic Maps (cont.)

13 XML Roadmap @SCA 2001 – Santa Rosa 13 A is for Association Topic associationsAssociation types The TAO of Topic Maps (cont.) Helms, Jesse Raleigh North Carolina S.1078 S.43 D.C. McCain, John North Carolina McCain, John Raleigh Helms, Jesse D.C. S.43 S.1078 North Carolina

14 XML Roadmap @SCA 2001 – Santa Rosa 14 ==> Independence of topic associations & topic occurrences (information resources) Topic maps as portable semantic networks The TAO of Topic Maps (cont.) McCain, John S.43 Helms, Jesse S.1078 D.C. Raleigh North Carolina

15 XML Roadmap @SCA 2001 – Santa Rosa 15 References XTM DTD --XTM DTD http://www.topicmaps.org/xtm/index.html

16 XML Roadmap @SCA 2001 – Santa Rosa 16 “Senate Legislative Activities” Collection: NARA: 106th Senate Raw Data Raw Data: rtf Senator 1: Senator 2: Senator 99:... **** S. 151 Date Introduced: 01/19/1999 Cosponsors : NONE Official title: A bill to amend the International Maritime Satellite Telecommunications Act… Jan 19, 1999 Latest status: Jan 19, 1999 Read twice and referred to the Committee on Commerce Abstract: NONE Subject Index: Academic Performance:S.7, S.514, S.564 Access to Health Care:S.6, S.1678, S.1690 … Zoning and zoning law:S.9, S.Con.Res.10, S.Res.41, S.J.Res.39 Paul S. Sarbanes of Maryland Paul S. Sarbanes of Maryland (see p. 135, p. 151, etc.) January 06, 1999 to March 31, 2000 Section I:Sponsored measures Section II: Cosponsored measures Section III: Sponsored measures organized by committee referral * Senate: Armed Services * Senate: Banking * House: Judiciary Section IV: Cosponsored measures organized by committee referral * Senate: Agriculture * House: Science Section V: Sponsored amendments Section VI: Cosponsored amendments Section VII: Subject index to measures and amendments

17 XML Roadmap @SCA 2001 – Santa Rosa 17 TM Example (“XTM-like”) DTD 1/2 <!DOCTYPE topicmap [ <!ATTLIST topic id ID #REQUIRED types CDATA #IMPLIED>

18 XML Roadmap @SCA 2001 – Santa Rosa 18 DTD 2/2 <!ATTLIST locatorrole CDATA #REQUIRED href CDATA #REQUIRED> <!ATTLIST assocrlrole CDATA #REQUIRED href CDATA #REQUIRED> ]>

19 XML Roadmap @SCA 2001 – Santa Rosa 19 TM Example – The XML doc itself (1/4) Apartment houses Apt. Houses APARTMENTHOUSES

20 XML Roadmap @SCA 2001 – Santa Rosa 20 TM XML Document (2/4) Children Child. CHILDREN

21 XML Roadmap @SCA 2001 – Santa Rosa 21 TM XML Document (3/4) Welfare Welf. WELFARE Youth employment Youth empl. YOUTEMPLOYMENT

22 XML Roadmap @SCA 2001 – Santa Rosa 22 TM XML Document (4/4)

23 XML Roadmap @SCA 2001 – Santa Rosa 23 Topic Maps Self-Control Extreme ML 2000, Montreal – Hans Holger Rath Topic Map templates –Logical container for the “schema” part of the map: Type/theme declarations Constraints Inference rules Association properties –Transitivity –Support inferencing capabilities Type hierarchies: commercial site (www.ontopia.net) commercial site –Super-subclassing –Inferencing Consistency checking with constraints –Rule-based constraints control validation process –Constraint patterns

24 XML Roadmap @SCA 2001 – Santa Rosa 24 Topic Maps Self-Control (… continued) Inference rules –Deduce additional knowledge –Inference patterns –Examples: If $topic1 is a sibling of $topic2 and $topic1 is a male then $topic1 is a brother <assoc id=“ir-male” type=“class-instance” scope=“ir-schema”> ir-topic-A-PERSON male  THE TM control their own structure and content!

25 XML Roadmap @SCA 2001 – Santa Rosa 25 Model-Based Mediation Raw Data A = (B*|C),D B =... XML DTDs Integrated-DTD := XML-QL(Src1-DTD,...) IF  THEN  Logical Domain Constraints Integrated-CM := CM-QL(Src1-CM,...)...... (XML) Objects Conceptual Models XML Elements XML Models C2 C3 C1 R Classes, Relations, is-a, has-a,... Domain Map

26 XML Roadmap @SCA 2001 – Santa Rosa 26 Simplest Definitions Data –Digital object –Objects are streams of bits Information –Any tagged data, which is treated as an attribute. –Attributes may be tagged data within the digital object, or tagged data that is associated with the digital object Knowledge –Relationships between attributes –Relationships can be procedural/temporal, structural/spatial, logical/semantic, functional

27 XML Roadmap @SCA 2001 – Santa Rosa 27 Types of Knowledge Relationships Logical / semantic –Digital Library cross-walks Temporal / procedural –Workflow systems Spatial / structural –GIS systems Functional / algorithmic –Scientific feature analysis

28 XML Roadmap @SCA 2001 – Santa Rosa 28 Knowledge Based Persistent Archive Attributes Semantics Knowledge Information Data Ingest Services ManagementAccess Services (Topic Maps / Buckets / Model-based Access) (Data Handling System - SRB / FTP / HTTP) MCAT/HDF Grids XML DTD SDLIP XTM DTD Rules - KQL Information Repository Attribute- based Query Feature-based Query Knowledge or Topic-Based Query / Browse Knowledge Repository for Rules Relationships Between Concepts Fields Containers Folders Storage (Replicas, Persistent IDs)

29 XML Roadmap @SCA 2001 – Santa Rosa 29 Further Information http://www.npaci.edu/DICE


Download ppt "1 Roadmap to XML Richard Marciano Research Scientist San Diego Supercomputer Center"

Similar presentations


Ads by Google