INLS 520 Erik Mitchell INLS 520 Information Organization.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Resource description and access for the digital world Gordon Dunsire Centre for Digital Library Research University of Strathclyde Scotland.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
The Semantic Web – WEEK 4: RDF
CS570 Artificial Intelligence Semantic Web & Ontology 2
By Ahmet Can Babaoğlu Abdurrahman Beşinci.  Suppose you want to buy a Star wars DVD having such properties;  wide-screen ( not full-screen )  the extra.
IPY and Semantics Siri Jodha S. Khalsa Paul Cooper Peter Pulsifer Paul Overduin Eugeny Vyazilov Heather lane.
1 Ontology Language Comparisons doug foxvog 16 September 2004.
Basics of Knowledge Management ICOM5047 – Design Project in Computer Engineering ECE Department J. Fernando Vega Riveros, Ph.D.
6. Applying metadata standards: Controlled vocabularies and quality issues Metadata Standards and Applications Workshop.
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
Ontology Notes are from:
SKOS and Other W3C Vocabulary Related Activities Gail Hodge Information International Assoc. NKOS Workshop Denver, CO June 10, 2005.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
SchemaLogic Workshop Part 2 Tools for Enterprise Metadata Management and Synchronization Prepared for the University of Washington Information School Applied.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
From SHIQ and RDF to OWL: The Making of a Web Ontology Language
A Registry for controlled vocabularies at the Library of Congress
1 Languages for aboutness n Indexing languages: –Terminological tools Thesauri (CV – controlled vocabulary) Subject headings lists (CV) Authority files.
Module 2b: Modeling Information Objects and Relationships IMT530: Organization of Information Resources Winter, 2007 Michael Crandall.
Business Domain Modelling Principles Theory and Practice HYPERCUBE Ltd 7 CURTAIN RD, LONDON EC2A 3LT Mike Bennett, Hypercube Ltd.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Metadata Standards and Applications 4. Metadata Syntaxes and Containers.
Information Organization
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
Ontology Development Kenneth Baclawski Northeastern University Harvard Medical School.
Practical RDF Chapter 1. RDF: An Introduction
Of 39 lecture 2: ontology - basics. of 39 ontology a branch of metaphysics relating to the nature and relations of being a particular theory about the.
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
Logics for Data and Knowledge Representation
RDF and OWL Developing Semantic Web Services by H. Peter Alesso and Craig F. Smith CMPT 455/826 - Week 6, Day Sept-Dec 2009 – w6d21.
By: Dan Johnson & Jena Block. RDF definition What is Semantic web? Search Engine Example What is RDF? Triples Vocabularies RDF/XML Why RDF?
D4: SKOS and HIVE—Enhancing the Creation, Design and Flow of Information Speakers: Hollie White Jane Greenberg Coordinator: Alan Keely.
INLS 520 – Erik Mitchell INLS 520 Information Organization.
The LOM RDF binding – update Mikael Nilsson The Knowledge Management.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Definition of a taxonomy “System for naming and organizing things into groups that share similar characteristics” Taxonomy Architectures Applications.
Umi Laili Yuhana December, Context Aware Group - Intelligent Agent Laboratory Computer Science and Information Engineering National Taiwan University.
Coastal Atlas Interoperability - Ontologies (Advanced topics that we did not get to in detail) Luis Bermudez Stephanie Watson Marine Metadata Interoperability.
Semantic Web - an introduction By Daniel Wu (danielwujr)
1 What is an Ontology? n No exact definition n A tool to help organize knowledge n Or a way to convey a theory on how to represent a class of things n.
Semantic web course – Computer Engineering Department – Sharif Univ. of Technology – Fall Knowledge Representation Semantic Web - Fall 2005 Computer.
The Semantic Web and expert metadata: pull apart then bring together Presented at 12.seminar Arhivi, Knjižnice, Muzeji Nov 2008, Pore č, Croatia.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lecture 5, Jan 23 th, 2003 Lotzi Bölöni.
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Organization of the Lab Three meetings:  today: general introduction, first steps in Protégé OWL  November 19: second part of tutorial  December 3:
Introduction to the Semantic Web and Linked Data
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
THE BIBFRAME EDITOR AND THE LC PILOT Module 3 – Unit 1 The Semantic Web and Linked Data : a Recap of the Key Concepts Library of Congress BIBFRAME Pilot.
1cs The Need “Most of the Web's content today is designed for humans to read, not for computer programs to manipulate meaningfully.” Berners-Lee,
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
INLS 520 – Fall 2007 Erik Mitchell INLS 520 Information Organization.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
INLS 520 – Fall 2007 Erik Mitchell INLS 520 Information Organization.
Ontology Technology applied to Catalogues Paul Kopp.
Ontologies COMP6028 Semantic Web Technologies Dr Nicholas Gibbins
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
Information Organization
The Semantic Web By: Maulik Parikh.
Information Organization
ece 627 intelligent web: ontology and beyond
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
PREMIS Tools and Services
Information Networks: State of the Art
Presentation transcript:

INLS 520 Erik Mitchell INLS 520 Information Organization

INLS 520 Erik Mitchell Review Controlled vocabularies –Term Lists, Hierarchies, Trees, Paradigms, Facets, Folksonomies Knowledge organization systems –Term Lists, Thesauri, Taxonomies, Ontologies

INLS 520 Erik Mitchell Today Protege tutorial –Create a thesaurus –Create an ontology Ontologies –Basic concept –Building in protege –RDF (?) –OWL (?)

INLS 520 Erik Mitchell Assignment 1 recap Required XML tags – Required DC elements –None, need a content wrapper and at least one element,, etc. Advanced Concepts –Namespaces –Schemas/DTDs MARC & DC –Advantages / disadvantages Techniques for discovering data –View SourceView Source –DC DOT Metadata generatorDC DOT Metadata generator

INLS 520 Erik Mitchell CV Concepts & definitions Controlled Vocabularies –Organized Lists –Relationships between concepts Knowledge organization systems –Typed relationships –Direct / inferable knowledge

INLS 520 Erik Mitchell Thesauri Definitions –“Guide to use of terms, showing relationships between them, for the purpose of providing standardized, controlled vocabulary for information storage and retrieval”(Monash)Monash –“A list of words showing similarities, differences, dependencies, and other relationships to each other”(USG)USG

INLS 520 Erik Mitchell Thesauri Concepts Preferred terms Non-preferred terms Semantic relations between terms How to apply terms (guidelines, rules) Scope notes Adding terms (How to produce terms that are not listed explicitly in the thesaurus)

INLS 520 Erik Mitchell Common thesaural identifiers SN Scope Note –Instruction, e.g. don’t invert phrases USE Use (another term in preference to this one) UF Used For BT Broader Term NT Narrower Term RT Related Term

INLS 520 Erik Mitchell Thesauri Guides National Information Standards Organization. (2005). Guidelines for the construction, format, and management of monolingual thesauri. ANSI/NISO Z Bethesda, MD: NISO Press. – pdf?CFID= &CFTOKEN= http:// pdf?CFID= &CFTOKEN= Aitchison, Jean & Gilchirist, Alan. Thesaurus Construction: A Practical Guide. 3 rd ed. London: Aslib, Willpower Information Management Consultants –

INLS 520 Erik Mitchell Ontology Definitions “The study of being or existence” “A conceptualization of a specification” (Gruber)Gruber “An ontology formally defines a common set of terms that are used to describe and represent a domain.” (OWL)

INLS 520 Erik Mitchell Webster’s Dictionary Webster’s Third New International Dictionary defines Ontology as: 1.A science or study of being, specifically a branch of metaphysics* relating to the nature and relations of being. 2.A theory concerning the kinds of entities and specifically the kinds of abstract entities that are to be admitted to a language system. *Metaphysics: Nature of being “or” existence.

INLS 520 Erik Mitchell Ontology Concepts Classes –Names of objects in the domain Relationships between classes Connections between classes Properties of classes Background or identifying knowledge of these objects Constraints on these properties & relationships Limits and parameters of the relationships

INLS 520 Erik Mitchell Class exercise Protégé overview –Orientation –Object types (Classes, Slots, Instances) –Relationships (hierarchies, associative) As a group, we will work through the protege training guide – get-started.pdf

INLS 520 Erik Mitchell What is the semantic web URI (Universal resource identifier) OWL/RDFS All built on top of regular web RDF underlying language of semantic web –Xml represents data (document based) –RDF represents pure information (anyone can use, re-harvestable), you could call this knowledge Examples –SwoogleSwoogle –Goog411Goog411

INLS 520 Erik Mitchell Ontologies (review) “A common set of terms that are used to describe and represent a domain” –Classes, Relationships, Properties, Constraints A formal organization of knowledge –The primary role of an ontology is to define a language which people and computers in a given domain can share

INLS 520 Erik Mitchell A good ontology has Features: –Meaningful – all classes have instances –Accurate / correct –Non-redundant – each class/instance is represented in a single way –Rich in description – context, content Enabled functionality: –Able to use queries to connect new pieces of information –Use XML & definitions to integrate knowledge across domains

INLS 520 Erik Mitchell Ontology Continuum Keyword Lists Basic Thesauri Complex Thesauri Taxonomies Simple Ontologies (wordnet) Complex Ontologies (OWL)

INLS 520 Erik Mitchell SHOE Ontology project – Possible to build an ontology for anything –Simple HTML Ontology Extensions (SHOE) Project Sample projects –Beer Ontology –Document Ontology

INLS 520 Erik Mitchell Ontology Concepts Multiple inheritance Vertical and horizontal relationships Decomposed subject/object Predicate based description (isRelatedto, hasVersion) First Order Predicate Logic –Statements broken down into subjects/predicates Proposition –All men are mortal, Socrates is a man Therefore –Socrates is mortal

INLS 520 Erik Mitchell Creating a CV review Design methods –Re-use existing, start with content & desired use ideas –Committee / community approach Top-down –Concept driven Bottom-up –Document driven –Empirical approach Deductive approach –Select terms, create relationships, perform term control Inductive approach –Establish CV at outset, build hierarchies on as needed basis

INLS 520 Erik Mitchell Creating a CV review (2) Top-Down –Identify audience –Identify all topics, concepts, uses, and context of the domain –Sort topics identified into an appropriate organization scheme (enumerative, hierarchical, faceted) –Solidify structure and clean up gaps & redundancies –Assign documents to categories, test retrieval Bottom-up –Identify audience –Survey documents for topics/concepts. –Build system on the fly – let content drive structure and limits of system –Identify gap & redundancies in system –Test retrieval

INLS 520 Erik Mitchell Creating a CV review (3) Think about scope, use, content, maintenance Gather Terms –Based on existing systems, content –Based on user needs/expectations –Investigate issues of specificity, exhaustivity, granularity Build hierarchies, relationships –Broader/narrower terms, Related terms, Use/Use for, see/see also Establish Rules Implement Evaluate Maintain

INLS 520 Erik Mitchell Creating an Ontology Determine Scope of field, define boundaries Check for existing ontologies, vocabularies Select a top-down/bottom-up approach –Identify concepts, vocabulary, parameters, constraints Identify relationships –Multiple hierarchies, inheritance Build, test, maintain

INLS 520 Erik Mitchell Class exercise Design your own ontology –In Groups, pick a domain of knowledge Type of food (pizza, soup, beer), field of study (library science, math), etc Come up with a basic ontological framework and begin creating it in Protege Be prepared to share a brief overview with the class which will include –Domain area –Top level classses –Instance definitions –Relationships

INLS 520 Erik Mitchell Assignment 2 Overview In this assignment you will create an ontology on a topic of your choice. Your ontology should contain multiple classes and instances and be focused on a specific purpose. This assignment includes an implementation of the ontology in Protégé and a brief paper explaining your ontology. Guidelines Select a topic of interest and determine the top level (i.e. Basketball, Chocolate, etc). Define the scope (depth/breadth) and purpose of the ontology. Define specific classes and facets (known as slots in Protégé) that describe those classes. Your ontology should have between 5-10 classes with multiple (2-5) slots for each class. Think about the use of hierarchy and multiple inheritance in your ontology. Summarize your ontology in a short paper (no more than two pages). Outline your ontology and discuss your rationale and key decisions (e.g. scope, purpose, classes and slots, defining relationships) Implement the ontology in Protégé. Define your classes and instances. Create two queries that illustrate ways in which the data could be retrieved. Dates & groupwork Due – November 6 th Groupwork is acceptable

INLS 520 Erik Mitchell RDF Subject, property, object triples Transmitted in xml RDFS extends RDF with an ontology language –Properties, specialization OWL –More powerful extension of RDFS –Uses same syntax of RDF

INLS 520 Erik Mitchell RDF Model Webpage: “Saki Knafo” Author (Value) Object (Property type) Predicate (Resource) Subject “The author of the stuff webpage is Saki Knafo” - A literal, a triple, a statement

INLS 520 Erik Mitchell How is RDF different? RDF is a descriptive model that –Allows variable contextualized description –Deconstructs the descriptive process –Allows more granular automated processing of data –Uses exact markup to indicate the context of values (namespaces, schemas)

INLS 520 Erik Mitchell Encoding RDF in XML <rdf:RDF xmlns:rdf=" xmlns:dcterms=" xmlns:dc=" xmlns:rdfs=" The Hang: The Island of Black Jeans SAKI KNAFO Sun, 16 Sep :04:40 GMT descriptive content

INLS 520 Erik Mitchell Iterative RDF description <rdf:RDF xmlns:rdf=" xmlns:vcard=" card.xsd" xmlns:dc=" xmlns:rdfs=" The Hang: The Island of Black Jeans Sun, 16 Sep :04:40 GMT descriptive content rdf:about=" Saki Knafo

INLS 520 Erik Mitchell RDFS RDF Schema –Defines additional rdf elements that help type relationships Special Classes –Based on RDF Classes / Properties / Attributes with additional Allows the creation of vocabularies / ontologies

INLS 520 Erik Mitchell OWL (Web Ontology Language) An ontolgy that is geared towards representing information on the web –Classes, properties, and relationships that describe URIs and their facets. Based on the Triple concept –Subject, Predicate, Object –3 versions: OWL-Lite, OWL-DL, OWL-Full Formatted in RDF/XML –Uses RDF and RDFS as a foundation –Adds new elements in the owl namespace

INLS 520 Erik Mitchell OWL Versions OWL-Lite –Simple hierarchies, constraints OWL-DL –Uses description logics Logic-based semantic markup based on first-order predicate logic –Still guarantees finite relationship processing –Best suited for automation OWL-Full –Most complex –Open ended, possible to get into infinite processing

INLS 520 Erik Mitchell OWL Example <rdf:RDF xmlns:rdf= xmlns:rdfs=" xmlns:owl= xmlns= xml:base=" 1.0 An ontology containing the basic part relations: partOf, hasPart, partOf_directly, and hasPart_directly. These are described in the accompanying note. Author: Chris Welty (Chris Welty)

INLS 520 Erik Mitchell More OWL Examples Airport Pizza

INLS 520 Erik Mitchell Next Week(s) Fall Break – Enjoy 10/30 – Guest speaker Lorrie Eakin 11/6 – First Group presentations