1 Metadata, Structured Documents, and XML
2
2 Metadata Literally “data about data” –“a set of data that describes and gives information about other data” ― Oxford English Dictionary
3
3 Metadata How do we encode metadata? How do we encode metadata to support interoperability? Simple example:January 31, 2001 31 janvier 2001 2001-01-31 01-31-2001 31012001
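The five spellings of the same date above can be normalized to a single form. The sketch below is one possible approach, not part of the original slides: it assumes ISO 8601 (2001-01-31) as the common target, and the function name to_iso, the list of candidate formats, and the hand-written French month table are all illustrative assumptions.

```python
from datetime import date, datetime

# French month names are mapped by hand so the sketch does not depend on the
# system locale (an assumption made for portability).
FRENCH_MONTHS = {
    "janvier": 1, "février": 2, "mars": 3, "avril": 4, "mai": 5, "juin": 6,
    "juillet": 7, "août": 8, "septembre": 9, "octobre": 10,
    "novembre": 11, "décembre": 12,
}

# Candidate formats, tried in order. The order encodes a guess about how to
# read ambiguous input such as 01-02-2001 (month first here).
CANDIDATE_FORMATS = ("%B %d, %Y", "%Y-%m-%d", "%m-%d-%Y", "%d%m%Y")


def to_iso(text: str) -> str:
    """Return the date in ISO 8601 form, or raise ValueError if unrecognized."""
    text = text.strip()
    for fmt in CANDIDATE_FORMATS:
        try:
            return datetime.strptime(text, fmt).date().isoformat()
        except ValueError:
            continue
    # Fallback: "day month-name year" with a French month name.
    parts = text.split()
    if len(parts) == 3 and parts[1].lower() in FRENCH_MONTHS:
        day, month, year = int(parts[0]), FRENCH_MONTHS[parts[1].lower()], int(parts[2])
        return date(year, month, day).isoformat()
    raise ValueError(f"unrecognized date: {text!r}")


SLIDE_DATES = ["January 31, 2001", "31 janvier 2001", "2001-01-31",
               "01-31-2001", "31012001"]
normalized = [to_iso(s) for s in SLIDE_DATES]
print(normalized)
```

The consumer has to guess the producer's convention. Agreeing on one encoding up front removes that guesswork, which is the interoperability point the slide raises.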
4
4 What is the Dublin Core? A metadata standard for describing digital resources An initiative to create a digital “library card catalog” for the Web Dublin Core fields: (all optional) TitleCreatorSubject DescriptionPublisherContributor DateTypeFormat IdentifierSourceLanguage Relation Coverage Rights
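A Dublin Core description is often serialized as XML. The sketch below uses Python's standard xml.etree.ElementTree and the Dublin Core element namespace http://purl.org/dc/elements/1.1/. The wrapper element name "record", the helper name dublin_core_record, and the sample field values are assumptions chosen for illustration; only the title comes from the first slide.

```python
import xml.etree.ElementTree as ET

DC_NS = "http://purl.org/dc/elements/1.1/"
# Ask ElementTree to serialize this namespace with the conventional "dc" prefix.
ET.register_namespace("dc", DC_NS)


def dublin_core_record(fields):
    """Build an XML element holding one child per supplied Dublin Core field.

    Every field is optional, so only the keys present in `fields` are emitted.
    """
    record = ET.Element("record")  # hypothetical wrapper element
    for name, value in fields.items():
        child = ET.SubElement(record, f"{{{DC_NS}}}{name}")
        child.text = value
    return record


record = dublin_core_record({
    "title": "Metadata, Structured Documents, and XML",
    "language": "en",      # illustrative value
    "format": "text/html"  # illustrative value
})
xml_text = ET.tostring(record, encoding="unicode")
print(xml_text)
```

Because all fields are optional, a record with only a title is as valid as one that fills in all fifteen.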
5
5 What’s a structured document? A structured document is a document whose structure conforms to a certain set of rules –Data and metadata encoded in an interoperable manner
6
6 What is XML? XML = eXtensible Markup Language XML is a standard for exchanging structured data –Provides standardization at the syntactic level –Does not provide “meaning” for the tags XML is a standard recommended by the W3C
7
7 Goals of XML Easy to use Easy to extend and adapt Easy to write programs that use XML Support a wide variety of applications Should be human legible Formal and concise
8
8 The Basic Rules XML is case sensitive All start tags must have end tags Elements must be properly nested XML declaration is the first statement – Every document must contain a root element Attribute values must have quotation marks – Certain characters are reserved for parsing –< = ‘<’
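The rules above can be checked mechanically, since a conforming XML parser rejects any document that breaks them. The sketch below uses Python's standard xml.etree.ElementTree as the parser and xml.sax.saxutils.escape to encode reserved characters. The sample documents and the helper name is_well_formed are made up for illustration.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape


def is_well_formed(text):
    """Return True if the parser accepts `text` as well-formed XML."""
    try:
        ET.fromstring(text)
        return True
    except ET.ParseError:
        return False


# One sample per rule; every document except the first and last breaks a rule.
samples = {
    "valid document": '<?xml version="1.0"?><course id="1"><title>XML</title></course>',
    "case mismatch": "<course></Course>",
    "missing end tag": "<course><title>XML</course>",
    "improper nesting": "<a><b></a></b>",
    "two root elements": "<a/><b/>",
    "unquoted attribute": "<course id=1></course>",
    "raw reserved character": "<note>1 < 2</note>",
    "escaped reserved character": "<note>" + escape("1 < 2") + "</note>",
}

results = {label: is_well_formed(doc) for label, doc in samples.items()}
for label, ok in results.items():
    print(f"{label}: {'well-formed' if ok else 'rejected'}")
```

Only the first and last samples are accepted. The last one shows the usual fix for reserved characters: escape rewrites < as &lt; before the text is placed inside an element.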
9 XML Example LBSC690 Jennifer Golbeck LBSC
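The markup for this example did not survive in the transcript; only the three text values remain. The sketch below shows how such a document might look and how a program would read it. The element names course, code, instructor, and department are hypothetical and are not taken from the original slide.

```python
import xml.etree.ElementTree as ET

# Element names are hypothetical; only the three text values come from the slide.
COURSE_XML = """<?xml version="1.0"?>
<course>
  <code>LBSC690</code>
  <instructor>Jennifer Golbeck</instructor>
  <department>LBSC</department>
</course>"""

root = ET.fromstring(COURSE_XML)

# Map each child element's tag to its text content.
values = {child.tag: child.text for child in root}
print(values)
```

The parser recovers the structure, but nothing in the XML itself says what "instructor" means. That interpretation is left to the applications that agree on the tag names.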
10
10 Benefits Data is represented in a universal syntax This allows any website to read the data Makes the exchange of data easier
11
11 RSS RSS = Really Simple Syndication or Rich Site Summary An XML format for distributing news headlines on the Web
12
12 RSS Example Maryland’s Slots Commission Pressures Anne Arundel County http://www.newsline.umd.edu/blog/index.php/2009/ 11/12/maryland%e2%80%99s-slots-commission-pressures- anne-arundel-county/ Thu, 12 Nov 2009 23:48:41 +0000 rlorente Gambling in Anne Arundel County seemed so close Thursday, but yet so far away. The Video Lottery Facility Location Commission nearly voted Thursday on a motion to approve the proposal for a casino at the Arundel Mills shopping mall without zoning approval from the Anne Arundel County Council. The council has not approved a zoning plan
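The tags for this feed item were lost in the transcript. The sketch below shows how a program might read such an item, assuming the values sat in the standard RSS 2.0 elements title, link, and pubDate, and that the author name was carried in a Dublin Core dc:creator element (a common convention, but an assumption here). The channel is reduced to a single item for brevity, and the description is omitted.

```python
import xml.etree.ElementTree as ET
from email.utils import parsedate_to_datetime

DC = "{http://purl.org/dc/elements/1.1/}"

# Assumed reconstruction of the item; element placement is a guess.
RSS_TEXT = """<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <item>
      <title>Maryland’s Slots Commission Pressures Anne Arundel County</title>
      <link>http://www.newsline.umd.edu/blog/index.php/2009/11/12/maryland%e2%80%99s-slots-commission-pressures-anne-arundel-county/</link>
      <pubDate>Thu, 12 Nov 2009 23:48:41 +0000</pubDate>
      <dc:creator>rlorente</dc:creator>
    </item>
  </channel>
</rss>"""

root = ET.fromstring(RSS_TEXT)
headlines = []
for item in root.findall("channel/item"):
    headlines.append({
        "title": item.findtext("title"),
        "link": item.findtext("link"),
        # RSS dates use the RFC 822 style, which email.utils can parse.
        "published": parsedate_to_datetime(item.findtext("pubDate")).isoformat(),
        "creator": item.findtext(DC + "creator"),
    })

for headline in headlines:
    print(headline["published"], headline["title"])
```

Because every RSS feed uses the same element names, the same loop works on headlines from any site, which is what makes aggregating news feeds practical.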
13
13 And Others… CML – chemical Markup Lang CellML – biological models BSML – bioinformatic sequences MAGE-ML – Microarray Gene Expression XSTAR – for archaeological research XMLMARC – MARC in XML AML – astronomy markup language SportsML – for sharing sports data
14
14 The next best thing since… What’s the big deal about XML? What does XML not do? How do XML tags acquire meaning? How do standards arise?
15
15 What’s wrong with the Web? It was meant for humans, not machines The current Web contains only data, not knowledge –From Web of data to Web of knowledge Difficult to –Aggregate/compare data across sites –Delegate complex tasks to “agents” –Formulate complex queries involving multiple constraints –…
16
16 Web 2.0 Tagging (“folksonomy”) Blogging Web services Wikipedia
17
17 Summary Concepts covered: –Metadata –Structured Documents –XML –Semantic Web –Ontologies