BT Exact Technologies - Adastral Park, Ipswich July - October 2003 Linguistic Web Services for Semantic Web Dr. Vassil T. Vassilev London Metropolitan.

Presentation transcript:

BT Exact Technologies - Adastral Park, Ipswich July - October 2003 Linguistic Web Services for Semantic Web Dr. Vassil T. Vassilev London Metropolitan University BT Short Term Research Fellowship

Part I Semantic Web and Linguistic Data Processing

Content
1 Project Background: Semantic Web and NLP
2 RDF – Lingua Franca of Semantic Web
3 The need for linguistic support of Semantic Web
4 WordNet: Universal Linguistic Resource
   - WordNet as a model of the word semantics
   - WordNet as an online thesaurus
   - WordNet as a relational database
5 Step One: Putting WordNet on the Web
6 Step Two: Extending WordNet
7 Step Three: LinguaShare
8 Problems and Directions

1 Project Background: Semantic Web and NLP
Semantic Web: Model-driven framework for semantically rich data processing over the Web
- RDF – Dublin Core (1999), W3C (1999)
- DAML – DARPA (2000); OIL – FP5 (2000)
Semantic Thesaurus: Linguistic database containing word meanings and semantic relations
- WordNet – George Miller, Princeton Univ. (1990)
- EuroWordNet – FP4 (1997); BalkaNet – FP5 (2000)

1.1 Semantic data processing over the Web
- Syntactic markup of the data (RDF, Topic Maps)
- Using a kind of meta-language (schema) to provide the intended semantics of the data represented (RDFS, DAML)
- Specifying domain ontologies that represent the restrictions, dependencies, regularities and rules for inference (KIF, OIL, OWL)

Layer Cake (McGuinness, 2002)

1.2 Computer-based semantic thesaurus
- Explaining the meaning of words
- Finding other words with the same meaning (synonyms)
- Finding other words with similar meaning in the same context (synonymous usage)
- Finding semantically independent, related or dependent word forms (semantic referencing)

Determining ontological information using lexical information
EXAMPLE: Type inference through analysis of the argument structure of verb phrases and their syntactic appearance in texts:
- The varieties of argument structure for EVENT-verbs suggest seven major subtypes: PHENOMENON, ASPECTUAL, STATE, ACT, PSYCHOLOGICAL_EVENT, CHANGE and CAUSE_CHANGE
- Based on them, we can differentiate COGNITIVE_EVENT (experiencer is syntactic subject, e.g. fear) from ACT (experiencer is syntactic object, e.g. frighten)

1.3 Project definition
Aims:
- utilizing the full potential of WordNet multilingual thesauri as a universal linguistic ontology for semantic verification of specialist terminology
- embedding it in applications for semantic data processing over the Web
- using contemporary Semantic Web Services technologies and tools
Methodology:
- Analytical research (WordNet)
- Modeling (relational models, UML)
- Software prototyping (Tomcat, MySQL)

2 RDF – Lingua Franca of Semantic Web
- A language for describing resources, primarily on the Web (has semantics); it can also be used off the Web, e.g. Dublin Core for library catalogues
- Uses XML as the syntax for representing RDF statements (serialization syntax); there are alternative serializations (e.g. triples), but XML is the most popular
- The language can formulate statements about the language itself (meta-description): RDF Schema, or RDFS
- Statements can be stored, processed and transported over the Web (data persistence)

2.1 RDF Model
Resources – Things being described by RDF expressions. Resources are named by URIs. Examples: HTML document, XML element within the document, collection of pages, book
Properties – Specific attributes or relations used to describe a resource. Attributes and relations can also be used as resources. Examples: Creator, Title, Name
Values – Simply literals or references to resources
Statements – e.g. Subject (Resource), Predicate (Property), Object (Value)
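The resource/property/value model can be illustrated with a small Python sketch. This is only a toy in-memory representation, not an RDF library; the `Statement` type, the `values_of` helper, the `example.org` URIs and the page title are all hypothetical names chosen for illustration.

```python
from typing import NamedTuple


class Statement(NamedTuple):
    subject: str    # the resource being described, named by a URI
    predicate: str  # the property (an attribute or a relation)
    obj: str        # the value: a literal or a reference to another resource


# Hypothetical URIs, used only for illustration.
PAGE = "http://example.org/page"
CREATOR = "http://example.org/terms/creator"
TITLE = "http://example.org/terms/title"

statements = [
    Statement(PAGE, CREATOR, "Vassil Vassilev"),
    Statement(PAGE, TITLE, "Example page"),
]


def values_of(subject, predicate, graph):
    """Return every value attached to `subject` through `predicate`."""
    return [s.obj for s in graph if s.subject == subject and s.predicate == predicate]


print(values_of(PAGE, CREATOR, statements))
```

Because properties are themselves named by URIs, a property such as `CREATOR` could appear as the subject of further statements, which is what makes meta-description possible.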

Example
“Vassil Vassilev, whose […] is […], is the creator of web page […]”
Subject (Resource): ‘[…]’
Predicate (Property): ‘Creator’
Object (Value): ‘Vassil Vassilev’

Graphical representation

Serialized representation in XML
<rdf:RDF xmlns:rdf="[…]" xmlns:dc="[…]" xmlns:vcard="[…]"> <rdf:Description about="[…]"> […] Vassil Vassilev […]
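Since the URIs of the serialized example did not survive in this transcript, the following Python sketch shows how a comparable RDF/XML description could be produced with the standard library. It is an assumption-laden illustration rather than the slide's original markup: the resource URI `http://example.org/page` and the `describe` function are hypothetical, the attribute is written in its namespace-qualified form `rdf:about`, and only the `rdf` and Dublin Core (`dc`) namespaces are used.

```python
import xml.etree.ElementTree as ET

# Standard namespace URIs for RDF and the Dublin Core element set.
RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
DC_NS = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("rdf", RDF_NS)
ET.register_namespace("dc", DC_NS)


def describe(resource_uri, creator):
    """Serialize one statement (resource, dc:creator, literal) as RDF/XML."""
    root = ET.Element(f"{{{RDF_NS}}}RDF")
    description = ET.SubElement(
        root, f"{{{RDF_NS}}}Description", {f"{{{RDF_NS}}}about": resource_uri}
    )
    ET.SubElement(description, f"{{{DC_NS}}}creator").text = creator
    return ET.tostring(root, encoding="unicode")


# Hypothetical resource URI, used only for illustration.
xml_text = describe("http://example.org/page", "Vassil Vassilev")
print(xml_text)
```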

2.2 Semantic Web Applications
- Context-based Information Retrieval (search by semantic patterns)
- Personalized Information Delivery (data presentation based on user profiles)
- User tracking (dynamic construction of user profiles based on log analysis)
- Document summarization (text generation based on models of the meaning)
- Automatic translation (text transformation which uses meaning models)

2.3 Semantic Web Tools
- Persistent storage and query interpreters (XML databases/XQuery, RDF repositories/RQL)
- Ontology visualizers and editors (OntoEdit, Protégé, etc.)
- Ontology navigators and semantic search engines (AskJeeves, RDF Quiz, OntoSearch)
- Ontology-based inference engines (Cyc, Kaon, OMM)

Some observations
- Layer separation (data storage, data communication, information description, terminology definition, fact inference)
- Layer isolation (syntactic wrapping vs. semantic mapping)
- Information processing concentrated on the most abstract level (ontology)
- Hierarchy of languages: SQL → XML → RDF → RDFS → OWL

3 The Need for Linguistic Support of Semantic Web
Why:
- For combining multiple namespaces and syntactic name reconciliation
- For word disambiguation in text analysis
- For semantic indexing of text corpora
- For resolving semantic inaccuracies in texts (esp. similarity, alternatives, exclusion, generalization, etc.)
- For representing text meaning in transformations which use an intermediate model of the meaning

4 WordNet as Universal Linguistic Resource
- Word forms (nouns, verbs, adjectives and adverbs) and lexical relations between them
- Synsets and meaning relations (synonymy, antonymy, hyponymy, meronymy, troponymy, etc.)
- Lexical database (set of indexed files or a database)
- Command language interface (originally Tcl/Tk scripts for direct file manipulation, but APIs for Java and other languages are also available)
- Multilingual thesauri (network of WordNet databases for most of the languages)

4.1 WordNet semantics
- Relational model with both standard (ATTRIBUTE, ANTONYM, ENTAILMENT, CAUSE) and transitive relations (HYPERNYM, HOLONYM, MERONYM)
- Formally, it can be interpreted in first-order relational structures (Kripke structures), which requires modal logic
- Adequate representation of the relations requires either an object-relational database or a relational database with additional indexing of the transitive relations (transitive closure)
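To make the transitive-closure point concrete, here is a minimal Python sketch that derives all indirect HYPERNYM links from the direct ones, which is the kind of information an additional index would precompute. The word chain in `direct_hypernyms` is made-up sample data and `transitive_closure` is a hypothetical helper, not part of WordNet or its APIs.

```python
# Hypothetical direct HYPERNYM links (word -> more general words).
direct_hypernyms = {
    "poodle": {"dog"},
    "dog": {"canine"},
    "canine": {"mammal"},
    "mammal": {"animal"},
}


def transitive_closure(direct):
    """Map every node to all nodes reachable by following direct links."""
    closure = {}
    for start in direct:
        reached = set()
        stack = list(direct[start])
        while stack:
            node = stack.pop()
            if node not in reached:  # also guards against cycles in the data
                reached.add(node)
                stack.extend(direct.get(node, ()))
        closure[start] = reached
    return closure


closure = transitive_closure(direct_hypernyms)
print(sorted(closure["poodle"]))  # ['animal', 'canine', 'dog', 'mammal']
```

Storing such a closure lets a plain relational database answer "is X a kind of Y?" with a single lookup instead of a chain of self-joins.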

Fig. 1 WordNet Relations

4.2 Relational schema of the original WordNet thesaurus
- word: represents the syntactic word forms, divided into four main categories (noun phrases, verb phrases, adjectives and adverbs)
- synset: defines the different meaning sets used for giving semantic interpretation of the word forms
- sense: many-to-many relationship between word forms and synsets
- lexrel: purely lexical relationships which hold between the word forms
- semrel: semantic relationships between the word forms; contains the semantic thesaurus
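The five tables can be sketched as follows. This is an illustration under several assumptions: it uses Python's built-in SQLite in place of the project's MySQL, all column names are hypothetical, `semrel` is assumed to link synsets (as WordNet's semantic relations usually do), and the rows are made-up sample data. The `synonyms` query shows how the many-to-many `sense` table connects word forms that share a synset.

```python
import sqlite3

# Hypothetical column names; only the five table names come from the slide.
SCHEMA = """
CREATE TABLE word   (word_id INTEGER PRIMARY KEY, lemma TEXT NOT NULL, pos TEXT NOT NULL);
CREATE TABLE synset (synset_id INTEGER PRIMARY KEY, gloss TEXT);
CREATE TABLE sense  (word_id INTEGER REFERENCES word, synset_id INTEGER REFERENCES synset,
                     PRIMARY KEY (word_id, synset_id));
CREATE TABLE lexrel (word1_id INTEGER REFERENCES word, word2_id INTEGER REFERENCES word,
                     rel_type TEXT NOT NULL);
-- Assumption: semantic relations are stored between synsets.
CREATE TABLE semrel (synset1_id INTEGER REFERENCES synset, synset2_id INTEGER REFERENCES synset,
                     rel_type TEXT NOT NULL);
"""


def synonyms(conn, lemma):
    """Return the other word forms that share at least one synset with `lemma`."""
    rows = conn.execute(
        """
        SELECT DISTINCT w2.lemma
        FROM word w1
        JOIN sense s1 ON s1.word_id = w1.word_id
        JOIN sense s2 ON s2.synset_id = s1.synset_id
        JOIN word w2 ON w2.word_id = s2.word_id
        WHERE w1.lemma = ? AND w2.word_id <> w1.word_id
        ORDER BY w2.lemma
        """,
        (lemma,),
    )
    return [row[0] for row in rows]


conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)
# Made-up sample rows.
conn.executemany("INSERT INTO word VALUES (?, ?, ?)",
                 [(1, "car", "n"), (2, "automobile", "n"), (3, "tree", "n")])
conn.executemany("INSERT INTO synset VALUES (?, ?)",
                 [(10, "a motor vehicle"), (20, "a woody plant")])
conn.executemany("INSERT INTO sense VALUES (?, ?)", [(1, 10), (2, 10), (3, 20)])

print(synonyms(conn, "car"))  # ['automobile']
```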

Fig. 2 Relational schema of WordNet

5 Putting WordNet on the Web
- Synchronous query/response working model (CGI calls)
- Purely relational database for storing the thesaurus (MySQL)
- Front end implemented as a set of servlets which query the thesaurus on behalf of other applications
- Query results returned in XML format
- Separated from the applications, running on an independent server (Tomcat)
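The transcript does not show the XML format the servlets return, so the following Python sketch only illustrates the general idea of wrapping a query result in XML for a calling application. The element and attribute names (`result`, `item`, `word`, `relation`) and the `build_response` function are hypothetical, and the project's actual front end was written as Java servlets rather than Python.

```python
import xml.etree.ElementTree as ET


def build_response(word, relation, related_words):
    """Wrap the result of one thesaurus query in a small XML document."""
    root = ET.Element("result", {"word": word, "relation": relation})
    for related in related_words:
        ET.SubElement(root, "item").text = related
    return ET.tostring(root, encoding="unicode")


# Made-up query result, used only for illustration.
response = build_response("car", "synonym", ["automobile", "motorcar"])
print(response)
```

A client application would parse this document and read the `item` elements, without needing any knowledge of the database behind the service.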

Table 1 Servlets to explore word relations

Part II LinguaShare: Linguistic Web Service for Semantic Web