The Promise and Peril of RDF for Formalizing the Humanities James Silas Creel Sarah Potvin Texas A&M University Libraries April 10, 2015 Arlington, Texas.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Semantic Web Thanks to folks at LAIT lab Sources include :
GridVine: Building Internet-Scale Semantic Overlay Networks By Lan Tian.
CS570 Artificial Intelligence Semantic Web & Ontology 2
The Semantic Web. The Web Today Designed for Human to read Cannot express meaning Architecture: URL –Decentralized: Link structure Language: html.
The Web of data with meaning... By Michael Griffiths.
Of 17 course outline. of 17 marek reformat ecerf building, w ece 627, winter'13.
Ontologies and the Semantic Web by Ian Horrocks presented by Thomas Packer 1.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
LINKED DATA COMS E6125 Prof. Gail Kaiser Presented By : Mandar Mohe ( msm2181 )
RDF Kitty Turner. Current Situation there is hardly any metadata on the Web search engine sites do the equivalent of going through a library, reading.
The Semantic Web Week 12 Term 1 Recap Lee McCluskey, room 2/07 Department of Computing And Mathematical Sciences Module Website:
Intelligent Systems Semantic Web. Aims of the session To introduce the basic concepts of semantic web ontologies.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
From SHIQ and RDF to OWL: The Making of a Web Ontology Language
Department of Computer Science, University of Maryland, College Park 1 Sharath Srinivas - CMSC 818Z, Spring 2007 Semantic Web and Knowledge Representation.
Module 2b: Modeling Information Objects and Relationships IMT530: Organization of Information Resources Winter, 2007 Michael Crandall.
JOSH FLECK Semantic Web. What is Semantic Web? Movement led by W3C that promotes common formats for data on the web Describes things in a way that computer.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
CSE 428 Semantic Web Topics Introduction Jeff Heflin Lehigh University.
RDF: Concepts and Abstract Syntax W3C Recommendation 10 February Michael Felderer Digital Enterprise.
ONTOLOGY SUPPORT For the Semantic Web. THE BIG PICTURE  Diagram, page 9  html5  xml can be used as a syntactic model for RDF and DAML/OIL  RDF, RDF.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
RDA and Linking Library Data VuStuff III Conference Villanova University, Villanova, PA October 18, 2012 Dr. Sharon Yang Rider University.
Semantic Web outlook and trends May The Past 24 Odd Years 1984 Lenat’s Cyc vision 1989 TBL’s Web vision 1991 DARPA Knowledge Sharing Effort 1996.
Semantic Web Technologies ufiekg-20-2 | data, schemas & applications | lecture 21 original presentation by: Dr Rob Stephens
Practical RDF Chapter 1. RDF: An Introduction
Knowledge representation
Clément Troprès - Damien Coppéré1 Semantic Web Based on: -The semantic web -Ontologies Come of Age.
An Introduction to the Resource Description Framework Eric Miller Online Computer Library Center, Inc. Office of Research Dublin, Ohio 元智資工所 系統實驗室 楊錫謦.
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
Logics for Data and Knowledge Representation
The Semantic Web Web Science Systems Development Spring 2015.
RDF and OWL Developing Semantic Web Services by H. Peter Alesso and Craig F. Smith CMPT 455/826 - Week 6, Day Sept-Dec 2009 – w6d21.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Semantic Web - an introduction By Daniel Wu (danielwujr)
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Semantically Processing The Semantic Web Presented by: Kunal Patel Dr. Gopal Gupta UNIVERSITY OF TEXAS AT DALLAS.
A Systemic Approach for Effective Semantic Access to Cultural Content Ilianna Kollia, Vassilis Tzouvaras, Nasos Drosopoulos and George Stamou Presenter:
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lecture 5, Jan 23 th, 2003 Lotzi Bölöni.
Semantic Technologies and Application to Climate Data M. Benno Blumenthal IRI/Columbia University CDW /04-01.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Trustworthy Semantic Webs Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #4 Vision for Semantic Web.
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
Chapter 7: Resource Description Framework (RDF) Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley,
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Strategies for subject navigation of linked Web sites using RDF topic maps Carol Jean Godby Devon Smith OCLC Online Computer Library Center Knowledge Technologies.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
Metadata : an overview XML and Educational Metadata, SBU, London, 10 July 2001 Pete Johnston UKOLN, University of Bath Bath, BA2 7AY UKOLN is supported.
THE BIBFRAME EDITOR AND THE LC PILOT Module 3 – Unit 1 The Semantic Web and Linked Data : a Recap of the Key Concepts Library of Congress BIBFRAME Pilot.
The Semantic Web Riccardo Rosati Dottorato in Ingegneria Informatica Sapienza Università di Roma a.a. 2006/07.
Doc.: IEEE /0169r0 Submission Joe Kwak (InterDigital) Slide 1 November 2010 Slide 1 Overview of Resource Description Framework (RFD/XML) Date:
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lotzi Bölöni.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
Semantic Web COMS 6135 Class Presentation Jian Pan Department of Computer Science Columbia University Web Enhanced Information Management.
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Definition and Technologies Knowledge Representation.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
Semantic and geographic information system for MCDA: review and user interface building Christophe PAOLI*, Pascal OBERTI**, Marie-Laure NIVET* University.
The Semantic Web By: Maulik Parikh.
Grid Computing 7700 Fall 2005 Lecture 18: Semantic Grid
RDF For Semantic Web Dhaval Patel 2nd Year Student School of IT
Ontology.
Piotr Kaminski University of Victoria September 24th, 2002
Grid Computing 7700 Fall 2005 Lecture 18: Semantic Grid
Presentation transcript:

The Promise and Peril of RDF for Formalizing the Humanities James Silas Creel Sarah Potvin Texas A&M University Libraries April 10, 2015 Arlington, Texas Texas DH Conference

Talk Outline RDF Basics Knowledge Representation in Computer Science –The Rationalist tradition –A Critique Pragmatics of RDF –Need for human interpretation – the problem of readability and understanding –Need for human composition – the problem of formalizing humane expressions –Pitfalls of logical inference Successful Uses –Geonames, Pleiades, Pelagios –VIVO

Motivating Concerns RDF is extensible and flexible; it is not neutral – it involves commitment to: (1)certain way of structuring expressions, and (2)a community that has adopted this mode of expression Theoretically, RDF can represent anything you want to express In practice, use of RDF without attention to conventions can render you incomprehensible

The humanist is better prepared than most to understand the situated and multivalent nature of expression

Community Situatedness “Metadata is not simply a description of the information contained in a work or web page; the choice of a metadata scheme also signifies community membership. Every aspect of metadata-- from how it is obtained and verified to the expectations of how it will be used by humans or computer systems-- stems from the practices of a particular community.” -Marshall and Shipman, “Which Semantic Web?” 2003

RDF Basics – the W3C Web Stack URIs: Uniform Resource Identifiers - unique, unambiguous and persistent XML: eXtensible Markup Language - a markup language used for HTML, RDF, etc. RDF: Resource Description Framework – a set of conventions and syntaxes (including XML) for expressing information in triples and graphs RDFS: The RDF Schema – a set of RDF expressions that enable expression of classes and properties OWL: Web Ontology Language – an RDF extension for expressions of first-order-logic.

RDF Basics – Enabling the Semantic Web RDF enables machines to read and utilize webpages –Unambiguous references for semantic search –Automatic language translation –Question answering –Intelligent agents

RDF Basics – Triples Triples consist of a Subject, Predicate and Object, e.g. Subject Predicate Object –Expresses that James Creel is the dc:author of the document Objects can also be literals, such as strings or integers

RDF Basics – RDF Schema Extends basic RDF with terms to used to characterize classes and properties Medium for defining new “ontologies” Consists in the rdf and rdfs namespaces documented at schema/ schema/

RDF Basics – SPARQL The query language for RDF Starts with an optional list of prefixes Queries consist of clauses of triples with variables that can connect to other clauses

Traditions in Knowledge Representation Frames – Name an object, fill in its properties/relations (“slots”) with other objects (or literals) Logic programming –FOL usually expressed as Horn clauses Functional programming –Recursive functions of variables Expert Systems –Use logic or functions to express a set of rules leading from premises to conclusions –Interview an expert to get a bunch of rules about their domain and encode them

Some Cautionary Examples in Knowledge Representation Fifth-Generation computing: A multi-million dollar effort that yielded good fundamental research in parallel computing, but was held back by concentration on logic-programming (PROLOG) Knowledge Navigator: Apple’s ambition for a semantic web agent Cyc: Since its start in 1984, the goal of formalizing “common sense” has not been realized. Recent efforts have concentrated on mapping its entities to Wikipedia.

The Phenomenological Critique of the Rationalist Tradition in Knowledge Representation In normal situations, we act without the need for logical modeling of the world. Logical reasoning is an exceptional type of reasoning that we appeal to relatively rarely, considering all the actions we take

Potential pitfalls in RDF Too heavyweight a solution when a relational database will suffice –Useful only if interoperability is intended English or other natural-language labels have different meanings for different folks, and none for computers Namespaces are not references to code, but merely shorthand. They do imply acceptance of a convention - the elements of a namespace are only significant to adopters The deeper and more expressive a formalism, the greater the barriers to adoption and use

Logical inferences in RDF Unrestricted logical inference, one of the potential strengths of RDF, is seldom employed – rather, programs reason heuristically or with canned queries. This is just as well, as formal logical expressions can unexpectedly entail contradiction or false inferences –E.g. owl:sameAs can produce falsehoods by employing reification, modality, and Substitutivity

Some RDF Success Stories Geonames – Pleiades – Pelagios - isaw.nyu.edu/exhibitions/space/pelagios.ht ml isaw.nyu.edu/exhibitions/space/pelagios.ht ml VIVO? -

Geonames An online gazetteer with a webservice and free data download ~ 8 million place names with focused metadata –Latitude and longitude –Feature types –Containing place –Alternate names –Links to Wikipedia articles Geonames’ data are available as RDF, and each geoname has a URI. This availability has afforded data linking, e.g. with DBPedia Under the hood, its data are in MySQL

Pleiades An online gazetteer of the ancient world Extensive information exposed as RDF using a number of schemas –Locations –Relationships to other places –Primary source citations –Time periods Under the hood, its data are in a Zope DB.

Pelagios A collaborative effort among 30 institutions to annotate historic documents with Pleiades-linked data Effort has concentrated on tools to assist annotators concentrating on particular collections

VIVO A Semantic Web tool for describing research, scholarship, people and institutions VIVO-ISF (Integrated Semantic Framework) is a separate but related project whose “ontology” underlines the VIVO app The development of this ontology has been fraught with controversy, and most adopting institutions utilize a small sampling of the defined properties and classes while being inclined to introduce their own

Conclusions Governance and collaboration facilitate wider adoption of ontologies –“ontologies” and schemata are meaningful only to adopters Domain circumscription facilitates expression –By circumscribing your domain, you can be parsimonious about the ontologies, classes, and properties you employ. –By being parsimonious with ontological commitments, one makes expression more efficient. –This efficiency of expression facilitates growth of your knowledge base (i.e. graph) Growth leads to success in linked open data, as big knowledge bases are the big targets for linking