Enhancing Quality of Retrieval Through Concept Edit History: EVS Update. Frank Hartel, Sherri De Coronado, Gilberto Fragoso. NCICB Jamboree, February 26, 2003.

Presentation transcript:

Slide 1: Enhancing Quality of Retrieval Through Concept Edit History: EVS Update
NCICB Jamboree, February 26, 2003
Frank Hartel, Sherri De Coronado, Gilberto Fragoso, Iris Guo, Kim Ong

Slide 2: Outline
Terminology development: concept creation, modification, split, merge, retirement
Edit history usage
TDE Ontylog editor extension
Next steps
Summary

Slide 3: Elementary Edit Actions in Terminology Development
[Figure: Versions 1 through 4 in sequence, each labeled with the elementary edit actions Create, Modify, Split, Merge, and Retire. Caption: evolution of versions/baseline over time.]
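The five elementary edit actions lend themselves to a small data model. The following is a minimal sketch, not the TDE schema: the `EditEvent` fields, the `edits_between` helper, and the concept codes are all hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List, Optional


class EditAction(Enum):
    """The five elementary edit actions named on the slide."""
    CREATE = "create"
    MODIFY = "modify"
    SPLIT = "split"
    MERGE = "merge"
    RETIRE = "retire"


@dataclass(frozen=True)
class EditEvent:
    """One logged edit (hypothetical layout, not the TDE history schema)."""
    version: int                     # version in which the edit was published
    action: EditAction
    concept: str                     # concept the action was applied to
    reference: Optional[str] = None  # related concept, e.g. the product of a split


# Hypothetical history covering four versions.
HISTORY: List[EditEvent] = [
    EditEvent(1, EditAction.CREATE, "RAS"),
    EditEvent(2, EditAction.SPLIT, "RAS", reference="HA_RAS"),
    EditEvent(2, EditAction.SPLIT, "RAS", reference="KI_RAS"),
    EditEvent(3, EditAction.CREATE, "N_RAS"),
    EditEvent(4, EditAction.MODIFY, "KI_RAS"),
]


def edits_between(history: List[EditEvent], start: int, end: int) -> List[EditEvent]:
    """Return events published after version `start`, up to and including `end`."""
    return [event for event in history if start < event.version <= end]


for event in edits_between(HISTORY, 1, 2):
    print(event.version, event.action.value, event.concept, event.reference)
```

Keeping the action type and the related concept on every event is what later allows a split to be told apart from a plain creation.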

Slide 4: Scientific Reasons for Concept Splits
The ras oncogene was discovered based on sequence homology (hybridization) to the v-onc gene of the Harvey strain of murine sarcoma virus.
Subsequently, it was discovered that there were multiple related ras genes, Ha-ras and Ki-ras.
Later, a new ras gene, N-ras, was found.

Slide 5: Scientific Reasons for Concept Merges
The BCL1 gene was discovered in the vicinity of a t(11;14) translocation involved in the malignant transformation of B cells.
The PRAD1 gene was found in parathyroid adenomas bearing chromosomal abnormalities.
CCND1 codes for one of a set of proteins, the cyclins, that regulate cell cycle progression.
The three names were later recognized as referring to the same gene, which is why the corresponding concepts are merged.

Slide 6: Concept Based Retrieval
[Figure labels: documents D1 and D2, indexing terms, concepts C1 and C2 used for retrieval, search engine, relevant documents, user.]

Slide 7: Edit History Usage
Documents are often indexed using different versions of a terminology.
Re-indexing documents to keep pace with changes made to the terminology is impractical and can be very costly.
Edit history can greatly enhance precision and recall.
[Figure labels: Thesaurus Versions 1 through 4, connected by new, retire, split, merge, and modify edits; pre-indexed documents R1 through R4; concepts used for retrieval; edit history; search engine.]
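One way to use edit history at query time, instead of re-indexing, is to expand a current concept with the older concepts it descends from. The sketch below assumes a simplified history of `(action, source, result)` tuples; the tuple layout, the `expand_query` name, and the concept codes are illustrative assumptions, not the EVS data model.

```python
from collections import defaultdict

# Hypothetical edit history as (action, source concept, resulting concept).
EDITS = [
    ("split", "RAS", "HA_RAS"),
    ("split", "RAS", "KI_RAS"),
    ("merge", "BCL1", "CCND1"),
    ("merge", "PRAD1", "CCND1"),
]


def expand_query(concept, edits):
    """Return the concept plus every older concept whose splits or merges lead to it."""
    predecessors = defaultdict(set)
    for action, source, result in edits:
        if action in ("split", "merge"):
            predecessors[result].add(source)

    expanded = {concept}
    pending = [concept]
    while pending:
        current = pending.pop()
        for older in predecessors[current]:
            if older not in expanded:
                expanded.add(older)
                pending.append(older)
    return expanded


print(sorted(expand_query("CCND1", EDITS)))
print(sorted(expand_query("HA_RAS", EDITS)))
```

Expanding across a merge adds documents indexed under the retired names. Expanding across a split also pulls in documents indexed under the broader pre-split concept, which helps recall but may cost some precision, so a real system might treat the two cases differently.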

Slide 8: Edit History Storage

Slide 9: Terminology Development Environment

Slide 10: Terminology Development Environment
Previously, only three types of edit action were logged: add, modify, and delete.
Concepts created through split actions were confounded with newly created concepts.
Concepts merged into other concepts were indistinguishable from retired concepts.
Failure to explicitly track merge and split edit actions may result in a low recall rate in information retrieval.
* Recall is the number of relevant documents retrieved as a fraction of all relevant documents.
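The recall definition in the footnote can be made concrete with a toy example. The index, document names, and relevance judgments here are invented for illustration; only the recall formula comes from the slide.

```python
def recall(retrieved, relevant):
    """Relevant documents retrieved, as a fraction of all relevant documents."""
    if not relevant:
        raise ValueError("recall is undefined when there are no relevant documents")
    return len(retrieved & relevant) / len(relevant)


def retrieve(concepts, index):
    """Collect every document indexed under any of the given concepts."""
    documents = set()
    for concept in concepts:
        documents |= index.get(concept, set())
    return documents


# Hypothetical index: documents were indexed under whichever name was current.
INDEX = {
    "CCND1": {"doc1", "doc2"},
    "BCL1": {"doc3"},
    "PRAD1": {"doc4"},
}
RELEVANT = {"doc1", "doc2", "doc3", "doc4"}

# If the merges were logged only as retirements, the query cannot reach the old names.
without_history = recall(retrieve({"CCND1"}, INDEX), RELEVANT)

# With merge events logged explicitly, the retired names can be added to the query.
with_history = recall(retrieve({"CCND1", "BCL1", "PRAD1"}, INDEX), RELEVANT)

print(without_history, with_history)
```

In this toy case, half of the relevant documents are missed when a merge cannot be distinguished from a retirement.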

Slide 11: Approach Taken to Extend TDE
Create a reusable concept edit tree Java bean.
Develop a user interface for processing split, merge, and retirement edit actions.
Log edit events in the TDE history database with clarity and precision.

Slide 12: Extend Ontylog Editor With Plug-Ins
Use the Concept Edit Tree widget to build plug-ins.

Slide 13: TDE Extension - Split Panel
The edit action is explicitly logged in the TDE History database as a split event.
A concept is created as a result of a split.
Roles and properties may be transferred from one concept to another using drag & drop.

Slide 14: TDE Extension - Merge Panel
The edit action is explicitly logged in the TDE History database as a merge event.
[Panel labels: Concept to stay, Concept to retire]
Non-redundant roles and properties are transferred from the retiring concept to the resultant merged concept.
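The transfer of non-redundant roles and properties during a merge can be sketched as a simple de-duplicating copy. This is an illustration only: the dictionary layout, the `merge_concepts` function, and the role and property names are assumptions, not the Ontylog editor's internal representation.

```python
def merge_concepts(staying, retiring):
    """Return a merged copy of `staying` with non-redundant items from `retiring`."""
    merged = {
        "name": staying["name"],
        "roles": list(staying["roles"]),
        "properties": list(staying["properties"]),
    }
    for field in ("roles", "properties"):
        for item in retiring[field]:
            # Copy only items the surviving concept does not already carry.
            if item not in merged[field]:
                merged[field].append(item)
    return merged


# Hypothetical concept definitions; role and property names are made up.
ccnd1 = {
    "name": "CCND1",
    "roles": [("gene_encodes", "Cyclin D1")],
    "properties": [("synonym", "Cyclin D1 gene")],
}
bcl1 = {
    "name": "BCL1",
    "roles": [("gene_encodes", "Cyclin D1"),
              ("gene_associated_with", "B-cell lymphoma")],
    "properties": [("synonym", "BCL1")],
}

merged = merge_concepts(ccnd1, bcl1)
print(merged["roles"])
print(merged["properties"])
```

The shared `gene_encodes` role appears once in the result, while the role and synonym carried only by the retiring concept are preserved on the surviving one.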

Slide 15: TDE Extension - Preretirement
[Panel label: Concept to retire]
Sub-concepts are re-treed.
Role relationships targeting (i.e., pointing to) the retiring concept are either removed or re-targeted.
A concept can be retired only if all preconditions are met.
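The preretirement preconditions can be read as two checks: no sub-concepts remain under the concept, and no role still targets it. The sketch below assumes a hypothetical in-memory representation (a child-to-parents map and a list of role triples); function names and concept codes are invented for illustration.

```python
def retirement_blockers(concept, parents, roles):
    """List the unmet preconditions that prevent `concept` from being retired.

    parents: dict mapping each concept to the set of its parent concepts
    roles:   list of (source concept, role name, target concept) triples
    """
    blockers = []
    children = sorted(child for child, ps in parents.items() if concept in ps)
    if children:
        blockers.append("sub-concepts still attached: " + ", ".join(children))
    incoming = sorted(source for source, _role, target in roles if target == concept)
    if incoming:
        blockers.append("roles still target the concept from: " + ", ".join(incoming))
    return blockers


def can_retire(concept, parents, roles):
    """A concept can be retired only if no precondition is violated."""
    return not retirement_blockers(concept, parents, roles)


# Hypothetical state before preretirement clean-up.
parents_before = {"HA_RAS": {"RAS"}, "KI_RAS": {"RAS"}, "RAS": {"ONCOGENE"}}
roles_before = [("RAS_PROTEIN", "encoded_by", "RAS")]

# After re-treeing the sub-concepts and removing the incoming role.
parents_after = {"HA_RAS": {"ONCOGENE"}, "KI_RAS": {"ONCOGENE"}, "RAS": {"ONCOGENE"}}
roles_after = []

print(retirement_blockers("RAS", parents_before, roles_before))
print(can_retire("RAS", parents_after, roles_after))
```

Returning the list of blockers, rather than a bare boolean, mirrors the idea of a preretirement step in which the modeler resolves each outstanding item before the retire action is allowed.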

Slide 16: TDE Extension - Retire Panel
The edit action is explicitly logged in the TDE History database as a retire event.
A non-editable tree shows concept definition information pertinent to the retiring concept.

Slide 17: Next Steps
Consolidate the edit history logged by individual modelers in the terminology development environment (TDE) into concept history data useful to Distributed Terminology System (DTS) users.

Slide 18: Next Steps
Extend caBIO and DTS Server capability to facilitate high quality information retrieval.
[Figure labels: End User Applications, caBIO.jar, DTS History API, DTS Extension, DTS Server, XMLRPC Client, XMLRPC Server, Edit history database, EVS, Repositories of Indexed Documents (to be developed), External Databases, Concepts used for retrieval.]

Slide 19: Summary
Tracking explicit edit actions in TDE is essential to terminology and concept based information retrieval.
We have successfully extended the TDE Ontylog editor to explicitly track split, merge, and retirement edit events.
Concept history data and supporting APIs will soon become available to DTS users and developers through caBIO.
caBIO: Cancer Bioinformatics Infrastructure Objects

Slide 20: EVS Team
Frank Hartel, Sherri De Coronado, Gilberto Fragoso, Margaret Haber, Larry Wright, Jim Oberthaler
Northrop Grumman, Inc.; Kevric Corporation; Aspen Inc.; Apelon, Inc.
Kim Ong, Iris Guo, Bob Dione

Slide 21: Contact
Dr. Francis W. Hartel
Center for Bioinformatics, National Cancer Institute
6116 Executive Blvd., Rockville, MD
Phone: (301) Fax: (301)