USCISIUSCISI SCEC Ontology Development Tom Russ Hans Chalupsky, Stefan Decker, Yolanda Gil, Jihie Kim, Varun Ratnakar University of Southern California.

Slides:



Advertisements
Similar presentations
Three-Step Database Design
Advertisements

Semantic Interoperability & Semantic Models: Introduction
1 UIM with DAML-S Service Description Team Members: Jean-Yves Ouellet Kevin Lam Yun Xu.
CS570 Artificial Intelligence Semantic Web & Ontology 2
Software Modeling SWE5441 Lecture 3 Eng. Mohammed Timraz
Describing Process Specifications and Structured Decisions Systems Analysis and Design, 7e Kendall & Kendall 9 © 2008 Pearson Prentice Hall.
So What Does it All Mean? Geospatial Semantics and Ontologies Dr Kristin Stock.
1 USC INFORMATION SCIENCES INSTITUTE Modeling and Using Simulation Code for SCEC/IT Yolanda Gil Varun Ratnakar Norm Tubman USC/Information Sciences Institute.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Semantic description of service behavior and automatic composition of services Oussama Kassem Zein Yvon Kermarrec ENST Bretagne France.
Knowledge Acquisitioning. Definition The transfer and transformation of potential problem solving expertise from some knowledge source to a program.
DCS Architecture Bob Krzaczek. Key Design Requirement Distilled from the DCS Mission statement and the results of the Conceptual Design Review (June 1999):
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
How can Computer Science contribute to Research Publishing?
Kendall & KendallCopyright © 2014 Pearson Education, Inc. Publishing as Prentice Hall 9 Kendall & Kendall Systems Analysis and Design, 9e Process Specifications.
Building Knowledge-Driven DSS and Mining Data
Web Service Architecture Part I- Overview and Models (based on W3C Working Group Note Frank.
Course Instructor: Aisha Azeem
Kendall & KendallCopyright © 2014 Pearson Education, Inc. Publishing as Prentice Hall 9 Kendall & Kendall Systems Analysis and Design, 9e Process Specifications.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
CASE Tools And Their Effect On Software Quality Peter Geddis – pxg07u.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Knowledge Mediation in the WWW based on Labelled DAGs with Attached Constraints Jutta Eusterbrock WebTechnology GmbH.
UML - Development Process 1 Software Development Process Using UML (2)
1 Yolanda Gil Information Sciences InstituteJanuary 10, 2010 Requirements for caBIG Infrastructure to Support Semantic Workflows Yolanda.
1 USC INFORMATION SCIENCES INSTITUTE Modeling and Using Simulation Code for SCEC/IT Yolanda Gil Jihie Kim Varun Ratnakar Marc Spraragen USC/Information.
Knowledge representation
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
* * 0 OWL-S: Ontology Web Language For Services Reyhan AYDOĞAN Emre YILMAZ 21/12/2005OWL-S: Ontology Web Language for Services.
Mobile Topic Maps for e-Learning John McDonald & Darina Dicheva Intelligent Information Systems Group Computer Science Department Winston-Salem State University,
Web Server Administration Web Services XML SOAP. Overview What are web services and what do they do? What is XML? What is SOAP? How are they all connected?
Data Management David Nathan & Peter Austin & Robert Munro.
A view-based approach for semantic service descriptions Carsten Jacob, Heiko Pfeffer, Stephan Steglich, Li Yan, and Ma Qifeng
Describing Process Specifications and Structured Decisions Systems Analysis and Design, 7e Kendall & Kendall 9 © 2008 Pearson Prentice Hall.
RELATIONAL FAULT TOLERANT INTERFACE TO HETEROGENEOUS DISTRIBUTED DATABASES Prof. Osama Abulnaja Afraa Khalifah
UT DALLAS Erik Jonsson School of Engineering & Computer Science FEARLESS engineering Semantic Web Services CS - 6V81 University of Texas at Dallas November.
1/26/2004TCSS545A Isabelle Bichindaritz1 Database Management Systems Design Methodology.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
LOGIC AND ONTOLOGY Both logic and ontology are important areas of philosophy covering large, diverse, and active research projects. These two areas overlap.
1 USC INFORMATION SCIENCES INSTITUTE Yolanda Gil Interactive Composition of Computational Pathways Jihie Kim Varun Ratnakar Students: Marc Spraragen (USC)
Semantic Web - an introduction By Daniel Wu (danielwujr)
Introduction to Server-Side Web Development Introduction to Server-Side Web Development using JSP and Web Services JSP and Web Services 18 th March 2005.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
1 USC INFORMATION SCIENCES INSTITUTE CAT: Composition Analysis Tool Interactive Composition of Computational Pathways Yolanda Gil Jihie Kim Varun Ratnakar.
Object-Oriented Software Engineering using Java, Patterns &UML. Presented by: E.S. Mbokane Department of System Development Faculty of ICT Tshwane University.
Chapter 6 – Architectural Design Lecture 1 1Chapter 6 Architectural design.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
The Use of Ontology Design Patterns for Metadata Semantics: Methods, Chances, and Limitations Gary Berg-Cross SOCoP Executive Secretary US RDA Advisory.
06/22/041 Data-Gathering Systems IRIS Stanford/ USGS UNAVCO JPL/UCSD Data Management Organizations PI’s, Groups, Centers, etc. Publications, Presentations,
A facilitator to discover and compose services Oussama Kassem Zein Yvon Kermarrec ENST Bretagne.
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Modeling Security-Relevant Data Semantics Xue Ying Chen Department of Computer Science.
Semantic Data Extraction for B2B Integration Syntactic-to-Semantic Middleware Bruno Silva 1, Jorge Cardoso 2 1 2
WonderWeb. Ontology Infrastructure for the Semantic Web. IST Project Review Meeting, 11 th March, WP2: Tools Raphael Volz Universität.
Enable Semantic Interoperability for Decision Support and Risk Management Presented by Dr. David Li Key Contributors: Dr. Ruixin Yang and Dr. John Qu.
© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license Quality Assurance, Ontology Engineering, and Semantic Interoperability.
XML and Distributed Applications By Quddus Chong Presentation for CS551 – Fall 2001.
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license Quality Assurance, Ontology Engineering, and Semantic Interoperability.
Topic 4: Distributed Objects Dr. Ayman Srour Faculty of Applied Engineering and Urban Planning University of Palestine.
Towards a framework for architectural design decision support
Web Service Modeling Ontology (WSMO)
Web Ontology Language for Service (OWL-S)
Multi-agent system for web services
Web Server Administration
Chapter 9 Web Services: JAX-RPC, WSDL, XML Schema, and SOAP
ONTOMERGE Ontology translations by merging ontologies Paper: Ontology Translation on the Semantic Web by Dejing Dou, Drew McDermott and Peishen Qi 2003.
Software Development Process Using UML Recap
Presentation transcript:

USCISIUSCISI SCEC Ontology Development Tom Russ Hans Chalupsky, Stefan Decker, Yolanda Gil, Jihie Kim, Varun Ratnakar University of Southern California Information Sciences Institute

USCISIUSCISI Outline Background SCEC Goals Ontology Basics Semantic Interoperability Examples Weather Seismology Building Computational Pathways Ontology Development SCEC Ontology Development Gene Ontology Development Fundamental Ontologies? Big Questions

Goals: SCEC/IT Project

USCISIUSCISI What is an Ontology? An Ontology is a framework for representing shared conceptualizations of knowledge An Ontology provides: Definitions for objects and relations in the domain Shared vocabulary and and common structure for modeling domain knowledge Domain model/theory that captures common knowledge about the domain

USCISIUSCISI Syntactic Interoperability Current technologies focus on syntactic interoperability Standardized protocols, ports (TCP/IP, FTP, HTTP, SOAP, etc.) Standardized data formats (properly lined up bits, arrays,…) Standardized description languages (XML, WSDL, …) Standardized remote invocation mechanisms (RMI, CORBA, …) Interoperation possible as long as exchanged messages/data are syntactically correct (something a compiler could check)

USCISIUSCISI Semantic Interoperability Claim: Full E-Science needs semantic interoperability Computer needs to understand what the bits mean (e.g., is this number a wavelength or a frequency of what wave in what context) Facilitates stronger semantic integrity or compatibility checks Facilitates integration (automatic transformations and translations) Facilitates more informed search engines

USCISIUSCISI Semantic Interoperability Story SCEC Java code for Community Velocity Model Inputs: longitude and latitude Output: Vs30 (m/s) Connection technology: Java serialization In other words: Ship the bits for two double precision floating point values through a network connection Make sure you send longitude first! – Non-standard convention for geography – Probably based on X-Y convention instead Better: More structured input Latitude=34.15 Longitude= Explicit identification of parameters

USCISIUSCISI Ontologizing a Domain such as “Weather”

Conditions for Joint Tasks (from: CJCSM A 9/13/96, p ) Identify Relevant Domain Concepts

USCISIUSCISI Weather Specification in English (f rom: CJCSM A 9/13/96, p ) C Weather Definition: current weather (next 24 hours). Descriptors: clear, partly cloudy, overcast, precipitating, stormy C Air Temperature Definition: atmospheric temperature at ground level Descriptors: Hot(> 85° F) Temperate(40° to 85° F) Cold (10° to 39° F) Very Cold(< 10° F)

USCISIUSCISI Formalizing Domain Concepts A knowledge-based system about “Weather” must know things like these: Terms hot, humid, windy... Definitions cold = (10° to 39° F) Relationships cold and windy may overlap cold and hot are disjoint cold and very cold are disjoint! Rules IF heavy rain lasts 2 days THEN muddy terrain and excessive runoff (probability.9)

USCISIUSCISI Earthquake Hazard Analog NEHRP Soil Types Soil TypeDescriptionVs (m/s)Rock Types AHard Rock> 1500Unweathered igneous intrusive BRock Volcanics, most Mesozoic bedrock, some Franciscan bedrock CSoft Rock Some Quarternary and Tertiary sands, sandstones and mudstones. Some Franciscan melange & serpentinite DStiff Soil Some Quarternary muds, sands, gravels, silts and mud ESoft Soil< 180 < 200 Water-saturated mud and artificial fill

USCISIUSCISI (deffunction source-hypocenter ((?s earthquake-source)) :-> (?h location) :documentation "The 3D point where the ruptured started.") (deffunction source-epicenter ((?s earthquake-source)) :-> (?e location) :documentation "The point on the earth's surface directly above the hypocenter" :axioms (=> (earthquake-source ?s) (and (= (latitude-of (source-hypocenter ?s)) (latitude-of (source-epicenter ?s))) (= (longitude-of (source-hypocenter ?s)) (longitude-of (source-epicenter ?s))) (= (depth-of (source-epicenter ?s)) (units 0 "m")))) PowerLoom: Hypocenter vs. Epicenter The epicenter is the point on the surface directly above the hypocenter. “Directly above”, more formally: The latitude and longitude of the epicenter and hypocenter are the same. The epicenter depth is zero.

USCISIUSCISI PowerLoom Knowledge representation & reasoning system Uses definitions specified in a formal logic First order predicate calculus Expressive: We can say what we need to Inference via logical deductions Support for units and dimensions Browsing tool: Ontosaurus

USCISIUSCISI Ontosaurus Diagrams and images aid domain familiarization Display of formal information and rules Navigation Tools and Control Panel Domain facts. Textual documentation

USCISIUSCISI Graphical View: Fault Hierarchy

USCISIUSCISI Plan: Building Computational Pathways Simple scenario to illustrate how a user would define computational pathways Behind the scenes, DOCKER uses descriptions of components, their I/O requirements and their constraints to: detect errors in user’s input suggest additional steps needed to make the pathway work make educated guesses about how components selected by the user may be connected to one another

USCISIUSCISI Compute PGA for an Address Using These Components Earthquake Forecast Model (USGS-02) Geocoder Fault-type Magnitude Vs30 Distance Community Velocity Model Address Lat/long Fault-type Magnitude Lat/longTime Span Lat/longVs30 Attenuation Relationship (Field-2000) PGA Distance Computation Lat/long1 Distance Lat/long2 Fault-type Magnitude Site Type Distance Attenuation Relationship (Campbell-02) PGA

USCISIUSCISI Some Data Paths Connect Easily Earthquake Forecast Model (USGS-02) Geocoder Fault-type Magnitude Vs30 Distance Community Velocity Model Address Lat/long Fault-type Magnitude Lat/long Time Span Lat/longVs30 Attenuation Relationship (Field-2000) PGA Distance Computation Lat/long1 Distance Lat/long2

USCISIUSCISI Others Require Transformation Earthquake Forecast Model (USGS-02) Geocoder Fault-type Magnitude Vs30 Distance Address Lat/long Fault-type Magnitude Lat/long Time Span Community Velocity Model Lat/longVs30 Attenuation Relationship (Field-2000) PGA Distance Computation Lat/long1 Distance Lat/long2

USCISIUSCISI Developing Ontologies

USCISIUSCISI SCEC Ontology Development Task-driven Particular application Modeled on domain inferences & reasoning Small team of Computer Scientists Seismology - Tom Russ Models - Jihie Kim, Varun Ratnakar, Tom Russ Small group of Domain Experts Ned Field and Tom Jordan Future Development and curation by domain experts Requires methodology Requires tools

Capture Inference in Ontology Ned Field’s markup of fault parameter data Computation and checking of properties Definitions of Terms

USCISIUSCISI The Gene Ontology (GO) Had a successful jumpstart Done by biologists, not knowledge engineers Developed by a wide, distributed community Focused on specific aspects of genomics Fly-base, yeast, mouse Used 24/7 from day 1 Accepted widely by the community Extended based on use requirements of a wide community Quite large (30-40K terms)

USCISIUSCISI Jumpstart of Go: Key Decisions (1) Limited scope limit domain, though it could have included many many more areas – not let anyone else in until they got somewhere – Added new groups incrementally (10) 3 related areas open (no licenses), use open standards Involve the community Had to develop own software control over own code KISS: keep it simple stupid – E.g., only two relations Transitivity

USCISIUSCISI Key Decisions (2) Use it from the beginning If you wait to have ontology finished before using it you’d never be there Errors would only be discovered through use Set things up so that you are OK when you have to fix those errors (entire chunks of ontology had to be entirely redone) Minimized change impacts by limiting most changes are to rels, which in practice does not impact the annotations Face-to-face meetings 3-4 times a year Satisfied a need for DB users that wanted to ask complex queries (1 query to all DBs) Establish migration path

USCISIUSCISI Key Decisions (3) Requests are resolved either: Immediately Over if can reach closure over 2-3 days – No voting, only consensus on agenda for next meeting Attribution was important Learned that from Flybase Both GO content and annotations are annotated with attribution Unique identifiers within GO The term can change as a lexical string, but no change in meaning and thus no change in identifier Can change defn, but not the GO string, then id changes Small number of relations

USCISIUSCISI Fundamental Ontologies What is out there? Not much. Ontolingua (Stanford University) has a number of small component ontologies – Designed as components – Not tied to applications DAML is working on fundmental physics ontologies (Jerry Hobbs, SRI International, ISI, Ken Forbus, others) – Time – Space We would like input from GEON!

USCISIUSCISI Some BIG Questions (from Gene Ontology Workshop) How do you get started? How to ensure the community will accept it (use it)? How do you (can you?) represent alternative views? What is the process to contribute to it? What is the process to make changes to it? What happens when there is an update? How is it implemented? What tools? How is it managed? Who does what, when, where, why?