© Paul Buitelaar: eJustice Presentation, July 15th, 2004 Ontologies Contributions from Language Technology Paul Buitelaar DFKI GmbH Language Techology.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Improving Learning Object Description Mechanisms to Support an Integrated Framework for Ubiquitous Learning Scenarios María Felisa Verdejo Carlos Celorrio.
Multilinguality & Semantic Search Eelco Mossel (University of Hamburg) Review Meeting, January 2008, Zürich.
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
CS570 Artificial Intelligence Semantic Web & Ontology 2
IPY and Semantics Siri Jodha S. Khalsa Paul Cooper Peter Pulsifer Paul Overduin Eugeny Vyazilov Heather lane.
Toward Linguistically Grounded Ontologies by Paul Buitelaar, Philipp Cimiano, Peter Haase, and Michael Sintek (Ireland, Netherlands, Germany) presented.
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
© Paul Buitelaar: KnowledgeWeb Summer School, Spain - July 2004 Human Language Technology in Ontology Engineering Ontology Learning from Text Paul Buitelaar.
Language Technology for the Semantic Web OntoWeb/AgentLink, Barcelona: February 4 th,2003 OntoWeb SIG5 Language Technology in.
Ontology Notes are from:
CS652 Spring 2004 Summary. Course Objectives  Learn how to extract, structure, and integrate Web information  Learn what the Semantic Web is  Learn.
Semantic Web Tools for Authoring and Using Analysis Results Richard Fikes Robert McCool Deborah McGuinness Sheila McIlraith Jessica Jenkins Knowledge Systems.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Methodologies, tools and languages for building ontologies. Where is their meeting point? Oscar Corcho Mariano Fernandez-Lopez Asuncion Gomez-Perez Presenter:
OIL: An Ontology Infrastructure for the Semantic Web D. Fensel, F. van Harmelen, I. Horrocks, D. L. McGuinness, P. F. Patel-Schneider Presenter: Cristina.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
Ontology Lexicalisation In collaboration with John McCrae, Philipp Cimiano (CITEC, Univ. of Bielefeld) Elena Montiel-Ponsado (Universidad Politecnica Madrid)
Ontology Alignment/Matching Prafulla Palwe. Agenda ► Introduction  Being serious about the semantic web  Living with heterogeneity  Heterogeneity problem.
Language Technology for the Semantic Web OntoWeb5,Florida,October 17 th,2003 WP12: Language Technology Overview SIG5 Paul Buitelaar.
Session II: Scientific Publishing and Semantic Web W3C Semantic Web for Life Sciences Workshop October 27, 2004 Moderator: Alan R. Aronson.
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
RDF and OWL Developing Semantic Web Services by H. Peter Alesso and Craig F. Smith CMPT 455/826 - Week 6, Day Sept-Dec 2009 – w6d21.
Nancy Lawler U.S. Department of Defense ISO/IEC Part 2: Classification Schemes Metadata Registries — Part 2: Classification Schemes The revision.
ICS-FORTH January 11, Thesaurus Mapping Martin Doerr Foundation for Research and Technology - Hellas Institute of Computer Science Bath, UK, January.
© Paul Buitelaar, February 2002 Corpus Annotation Day at DI Multi-Layer Annotation for Cross- Lingual Information Retrieval in the Medical Domain Paul.
MPEG-7 Interoperability Use Case. Motivation MPEG-7: set of standardized tools for describing multimedia content at different abstraction levels Implemented.
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
1 Ontology-based Semantic Annotatoin of Process Template for Reuse Yun Lin, Darijus Strasunskas Depart. Of Computer and Information Science Norwegian Univ.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Ontologies Come of Age Deborah L. McGuinness Stanford University “The Semantic Web: Why, What, and How, MIT Press, 2001” Presented by Jungyeon, Yang.
Lifecycle Metadata for Digital Objects November 1, 2004 Descriptive Metadata: “Modeling the World”
Proposed NWI KIF/CG --> Common Logic Standard A working group was recently formed from the KIF working group. John Sowa is the only CG representative so.
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
Sharing Ontologies in the Biomedical Domain Alexa T. McCray National Library of Medicine National Institutes of Health Department of Health & Human Services.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lecture 5, Jan 23 th, 2003 Lotzi Bölöni.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Enabling complex queries to drug information sources through functional composition Olivier Bodenreider Lister Hill National Center for Biomedical Communications.
Oreste Signore- Quality/1 Amman, December 2006 Standards for quality of cultural websites Ministerial NEtwoRk for Valorising Activities in digitisation.
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
Understanding RDF. 2/30 What is RDF? Resource Description Framework is an XML-based language to describe resources. A common understanding of a resource.
Trustworthy Semantic Webs Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #4 Vision for Semantic Web.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Strategies for subject navigation of linked Web sites using RDF topic maps Carol Jean Godby Devon Smith OCLC Online Computer Library Center Knowledge Technologies.
Towards Linguistically Grounded Ontologies Paul Buitelaar, Philipp Cimiano, Peter Haase, and Michael Sintek Proceedings of the 6 th European Semantic Web.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Presented by: Yuhana 12/17/2007 Context Aware Group - Intelligent Agent Laboratory Computer Science and Information Engineering National Taiwan University.
CS621 : Artificial Intelligence Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 12 RDF, OWL, Minimax.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
17 April 2005Sharif University of Tech Page 1 Ontologies Come of Age Amir Hossein Assiaee
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
WonderWeb. Ontology Infrastructure for the Semantic Web. IST Project Review Meeting, 11 th March, WP2: Tools Raphael Volz Universität.
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
Enable Semantic Interoperability for Decision Support and Risk Management Presented by Dr. David Li Key Contributors: Dr. Ruixin Yang and Dr. John Qu.
July 2002, DI Colloquium Semantic Annotation for Semantic Indexing Paul Buitelaar, Martin VolkMuchMore DFKI Language Technology Saarbrücken, Germany Eurospider.
Semantic Web Overview Diane Vizine-Goetz OCLC Research.
©2003 Paula Matuszek CSC 9010: AeroText, Ontologies, AeroDAML Dr. Paula Matuszek (610)
Mapping the NCI Thesaurus and the Collaborative Inter-Lingual Index Amanda Hicks University of Florida HealthInsight Workshop, Oslo, Norway.
Semantic and geographic information system for MCDA: review and user interface building Christophe PAOLI*, Pascal OBERTI**, Marie-Laure NIVET* University.
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
Working meeting of WP4 Task WP4.1
ece 627 intelligent web: ontology and beyond
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Semantic Web - Ontologies
ece 720 intelligent web: ontology and beyond
Semantic Wikis Expedition #52 Conor Shankey CEO July 18, 2006
Presentation transcript:

© Paul Buitelaar: eJustice Presentation, July 15th, 2004 Ontologies Contributions from Language Technology Paul Buitelaar DFKI GmbH Language Techology Lab DFKI Competence Center Semantic Web Saarbrücken, Germany

© Paul Buitelaar: eJustice Presentation, July 15th, 2004 Overview Ontologies and the Semantic Web  Semantic Web Intro  Ontologies and Knowledge Markup  Ontology Development  Ontology Lifecycle & Language Technology Language Technology  Levels of Automatic Linguistic Analysis Ontologies in Multilingual Information Access  A Medical Example: MuchMore Project  Semantic Resources in the Medical Domain  Demo MuchMore System  Language Technology in Annotation and Indexing Conclusions  MuchMore for the Legal Domain…

© Paul Buitelaar: eJustice Presentation, July 15th, 2004 Semantic Web Intelligent Man-Machine Interface Knowledge Markup Ontologies Semantic Web Services

© Paul Buitelaar: eJustice Presentation, July 15th, 2004 Ontology-based Knowledge Markup Semantic Metadata  Metadata, e.g. Dublin Core -- Title, Author, etc.  Semantic:Formal Properties of Objects of Class Author John Smith Knowledge Markup

© Paul Buitelaar: eJustice Presentation, July 15th, 2004 Semantic Web Architecture Layered Architecture (Tim Berners-Lee)

© Paul Buitelaar: eJustice Presentation, July 15th, 2004 Knowledge Markup Languages XML SchemaNamespaces Interpretation Context RDF Schema OWL (DAML+OIL) Formalization: Classes (Inheritance), Properties Formalization: Classes, Class Definitions, Properties, Property Types (e.g. Transitivity) Data Types XML RDF SyntaxSemantics

© Paul Buitelaar: eJustice Presentation, July 15th, 2004 Ontologies: Basic Idea Definition  “… Explicit, Formal Specification of a Shared Conceptualization of a Domain of Interest ” T. Gruber Towards principles for the design of ontologies used for knowledge sharing. Int. J. of Human and Computer Studies, 1994 Purpose  Knowledge Sharing (e.g. between Agents)  Inference (over Sets of Instances) Related Areas, e.g.  Terminologies, Controlled Vocabulary, Thesauri, Taxonomies, Semantic Lexicons, Wordnets, etc.  Conceptual Models, Schemas, etc.

© Paul Buitelaar: eJustice Presentation, July 15th, 2004 Ontologies: Applications, e.g. Semantic Web Services  Interoperability for (Semantic) Web Services Intelligent Agents  Domain Models for Intelligent Agents Text Interpretation  Ontology-aware Information Extraction Multimedia Integration  Ontology-based Alignment of Extracted Objects in Text, Audio, Video Intelligent Search/Navigation  Ontology-based Indexing in Web-Retrieval

© Paul Buitelaar: eJustice Presentation, July 15th, 2004 Ontologies: Development Ontology Editor / KB Management  Most Widely Used: Protégé (Stanford University, Medical Informatics, USA)  Originally for Development and Maintenance of Medical Expert Systems  Other, e.g.  KAON : University of Karlsruhe - AIFB, Germany  WebOde : UPM – Ontology Group, Madrid, Spain  WebOnto : Open University - KMI, UK  Overview at XML.com by Michael Denny: Ontology Building: A Survey of Editing Tools

© Paul Buitelaar: eJustice Presentation, July 15th, 2004 Class Hierarchy Slot Descriptions

© Paul Buitelaar: eJustice Presentation, July 15th, 2004 Ontology Lifecycle Creating Populating Validating Evolving Maintaining Deploying

© Paul Buitelaar: eJustice Presentation, July 15th, 2004 LT in the Ontology Lifecycle Ontology (Knowledge) Creating & Evolving Linguistic Analysis to Extract Classes / Relations Populating (Knowledge Base Generation) Linguistic Analysis to Extract Instances Documents (Text) Language Technology (LT) for Ontology: Language Technology = Automated Linguistic Analysis Classes, Relations/Properties

© Paul Buitelaar: eJustice Presentation, July 15th, 2004 Linguistic Analysis: Example The Dell computer with a flat screen had to be rejected because of a failure in the motherboard. Dell computer flat screen motherboard has-a reject failure location-of animate-entity

© Paul Buitelaar: eJustice Presentation, July 15th, 2004 Part-of-Speech, Morphology Part-of-Speech  e.g.: noun, verb, adjective, preposition, …  PoS tag sets may have between 10 and 50 (or more) tags Morphology  Most languages have inflection and declination, e.g.: Singular/Plural computer, computers Present/Past reject, rejected  Many languages have also complex (de)composition, e.g.: Flachbildschirm (flat screen)> flach + Bildschirm > flach + Bild + Schirm

© Paul Buitelaar: eJustice Presentation, July 15th, 2004 Phrases, Terms, Named Entities Semantic Units  Phrases (e.g. nominal - NP, prepositional - PP) NP a flat screen PP with a flat screen NP (recursive) the Dell computer with a flat screen a failure in the motherboard  Terms (domain-specific phrases) Dell computer Dell computer with a flat screen  Named Entities (phrases corresponding to dates, names, …) COMPANY Dell COMPANY Dell Computer Corporation PERSON Michael Dell

© Paul Buitelaar: eJustice Presentation, July 15th, 2004 Dependency Structure Semantic Structure Dependencies between Predicates and Arguments the Dell computer with a flat screen had to be rejected PRED: reject ARG1: ENTITY ARG2: ‘the Dell computer with a flat screen’ ‘Logical Form’ : reject(x,y) & animate-entity(x) & computer(y) & … The Dell computer that has been rejected was claimed to have suffered from handling. reject(e 1,x 1,y 1 ) & animate-entity(x 1 ) & Dell_computer(y 1 ) & claim(e 2,x 2,e 3 ) & animate-entity(x 2 ) & suffer_from(e 3,y 1,y 2 ) & handling (y 2 )

© Paul Buitelaar: eJustice Presentation, July 15th, 2004 MuchMore Project Demonstration Prototype  Real-Life Medical Scenario for Cross-Lingual Information Retrieval Research & Development  Combined Data- and Knowledge-Driven Performance Evaluation  Performance Comparison of Existing and Novel Methods

© Paul Buitelaar: eJustice Presentation, July 15th, 2004 General WordNet (EN), GermaNet (DE), EuroWordNet (“linked”) Medical Domain UMLS: Unified Medical Language System Medical MetaThesaurus (only MeSH2001 is used) English, German, Spanish, … Concepts 9 Relations (Broader, Narrower,…) Semantic Network 134 Semantic Types 54 Semantic Relations Semantic Resources

© Paul Buitelaar: eJustice Presentation, July 15th, 2004 C |ENG|P|L |PF|S |HIV|0| C |ENG|S|L |PF|S |HTLV-III|0| C |ENG|S|L |VS|S |Human Immunodeficiency Virus|0| C |ENG|S|L |VWS|S |Virus, Human Immunodeficiency|0| C |FRE|P|L |PF|S |HIV|3| C |FRE|S|L |PF|S |VIRUS IMMUNODEFICIENCE HUMAINE|3| C |GER|P|L |PF|S |HIV|3| C |GER|S|L |PF|S |Humanes T-Zell-lymphotropes Virus Typ III|3| other languagesGERMAN 66,381ENGLISH 1.462,202 Concept Names: 1.734,706 Each CUI (Concept Unique Identifier) is mapped to one out of 134 Semantic Types or TUI (Type Unique Identifier) Clozapine: C  Pharmacologic Substance: T121 MetaThesaurus, SemNet Semantic Types are organized in a Network through 54 Relations T121|T154|T047

© Paul Buitelaar: eJustice Presentation, July 15th, 2004

Token (with Part-of-Speech) German: Kreuzbandes English: ligaments Lemma (or Sequence of Lemmas - Decomposition) German: Faserknorpel  Faser + Knorpel English: ligament UMLS Concept Code and Semantic Type ligament : C _T030 MeSH Code A2.513 Semantic Relation (over a Pair of UMLS Concepts) C _T030 interconnects C _T065 Annotation & Indexing

© Paul Buitelaar: eJustice Presentation, July 15th, 2004 UMLS Semantic Network specifies 54 types of relations between 134 semantic types Pharmacologic Substance affects Cell Function Relations are generic and potentially false Therapeutic Procedure method_of Occupation,Discipline *discectomy method_of history Relations are ambiguous Therapeutic Procedure prevents Neoplastic Process Therapeutic Procedure complicates Neoplastic Process Therapeutic Procedure affects Neoplastic Process Therapeutic Procedure treats Neoplastic Process Relations

© Paul Buitelaar: eJustice Presentation, July 15th, 2004 Discontinuation of heparin is a simple and essential maneuvre, and anticoagulation has to be continued by alternative drugs. Example

© Paul Buitelaar: eJustice Presentation, July 15th, 2004 Terms:C Heparin C Blood coagulation tests C Pharmaceutical preparations Example: Terms/Concepts Discontinuation of heparin is a simple and essential maneuvre, and anticoagulation has to be continued by alternative drugs.

© Paul Buitelaar: eJustice Presentation, July 15th, 2004 Relations:C interacts_with C C analyses C C analyses C Example: Relations Terms:C Heparin C Blood coagulation tests C Pharmaceutical preparations Discontinuation of heparin is a simple and essential maneuvre, and anticoagulation has to be continued by alternative drugs.

© Paul Buitelaar: eJustice Presentation, July 15th, 2004 Conclusions MuchMore for the Legal Domain…  Resources Legal Domain Ontology with… …Large-scale Terminology for Multiple Languages, or if not available… …Large Legal Domain Corpora in Multiple Languages for Term Extraction… …and for Relation Extraction if Ontology Needs to be Constructed/Adapted  Tools Linguistic Analysis (PoS, Morphology, Term Grammars, etc.)… …for Multiple Languages… …Tuned to the Legal Domain… Information Retrieval Infrastructure, Interface Design, etc.