Semantic Web & Semantic Web Services: Applications in Healthcare and Scientific Research International IFIP Conference on Applications of Semantic Web.

Presentation transcript:

Semantic Web & Semantic Web Services: Applications in Healthcare and Scientific Research. International IFIP Conference on Applications of Semantic Web (IASW2005), Jyväskylä, Finland, August 26, 2005. Keynote: Part II. Amit Sheth, LSDIS Lab, Department of Computer Science, University of Georgia. Thanks to collaborators, partners (at CCRC and Athens Heart Center) and students. Special thanks to: Cartic Ramakrishnan, Satya S. Sahoo, Dr. William York, and Jon Lathem.

Active Semantic Document
A document (typically in XML) with:
- Lexical and semantic annotations (tied to ontologies)
- Actionable information (rules over semantic annotations)
Application: Active Semantic Patient Record for Cardiology Practice

Practice Ontology

Drug Ontology Hierarchy (showing is-a relationships)

Drug Ontology showing neighborhood of PrescriptionDrug concept

First version of Procedure/Diagnosis/ICD9/CPT Ontology
[Figure labels: maps to diagnosis, maps to procedure, specificity]

Active Semantic Doc with 3 Ontologies
- Referred doctor from Practice Ontology
- Lexical annotation
- ICD9 codes from Diagnosis Procedure Ontology

Active Semantic Doc with 3 Ontologies
- Drug Allergy
- Formulation Recommendation using Insurance Ontology
- Drug Interaction using Drug Ontology
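
The "rules over semantic annotations" idea can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the record layout, the element and attribute names, the concept identifiers (`drug:A`, `drug:B`, `ingredient:X`) and the two lookup tables are all hypothetical stand-ins for what the Drug Ontology would supply, and none of them carries clinical meaning.

```python
import xml.etree.ElementTree as ET

# Hypothetical annotated record: each element points at an ontology concept.
RECORD = """
<patientRecord>
  <allergy concept="ingredient:X"/>
  <prescription concept="drug:A"/>
  <prescription concept="drug:B"/>
</patientRecord>
"""

# Toy stand-ins for facts that would be read from a drug ontology.
INTERACTS = {frozenset({"drug:A", "drug:B"})}   # invented pair, not a clinical fact
CONTAINS = {"drug:B": {"ingredient:X"}}         # invented composition


def check_record(xml_text):
    """Apply two simple rules to the annotations and return a list of alerts."""
    root = ET.fromstring(xml_text)
    drugs = [p.get("concept") for p in root.findall("prescription")]
    allergies = {a.get("concept") for a in root.findall("allergy")}
    alerts = []
    # Rule 1: flag every pair of prescribed drugs known to interact.
    for i, first in enumerate(drugs):
        for second in drugs[i + 1:]:
            if frozenset({first, second}) in INTERACTS:
                alerts.append(("interaction", first, second))
    # Rule 2: flag drugs containing something the patient is allergic to.
    for drug in drugs:
        for ingredient in sorted(CONTAINS.get(drug, set()) & allergies):
            alerts.append(("allergy", drug, ingredient))
    return alerts


ALERTS = check_record(RECORD)
```

The rules read only the concept annotations, not the free text, which is what would make such a document "active" in the sense used on the slide.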

Explore neighborhood for drug Tasmar Explore: Drug Tasmar

Explore neighborhood for drug Tasmar
[Figure labels: belongs to group, brand / generic, classification, interaction]
Semantic browsing and querying: perform decision support (how many patients are using this class of drug, …)
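
A decision-support question such as "how many patients are using this class of drug" amounts to a two-step walk over relationships. The sketch below assumes a toy in-memory list of triples; the patients, the drugs other than Tasmar, the group names and the `takes` predicate are all hypothetical, and the group assigned to Tasmar is invented for the example.

```python
# Toy (subject, predicate, object) triples; every identifier is illustrative.
TRIPLES = [
    ("Tasmar", "belongs_to_group", "GroupG"),
    ("DrugB", "belongs_to_group", "GroupG"),
    ("DrugC", "belongs_to_group", "GroupH"),
    ("patient1", "takes", "Tasmar"),
    ("patient2", "takes", "DrugB"),
    ("patient2", "takes", "DrugC"),
    ("patient3", "takes", "DrugC"),
]


def subjects(predicate, obj):
    """Return every subject linked to obj by predicate."""
    return {s for s, p, o in TRIPLES if p == predicate and o == obj}


def patients_on_group(group):
    """Find the drugs in a group, then the patients taking any of them."""
    drugs = subjects("belongs_to_group", group)
    return {patient for drug in drugs for patient in subjects("takes", drug)}


PATIENTS = patients_on_group("GroupG")
```

A deployed system would more likely run such a query against an RDF store; the plain-Python version only shows the shape of the traversal.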

Bioinformatics Apps & Ontologies
GlycO: A domain ontology for glycan structures, glycan functions and enzymes (embodying knowledge of the structure and metabolisms of glycans)
- Contains 770 classes and 100+ properties that describe structural features of glycans; unique population strategy
- URL:
ProPreO: a comprehensive process ontology modeling experimental proteomics
- Contains 330 classes, 40,000+ instances
- Models three phases of experimental proteomics*: separation techniques, mass spectrometry, and data analysis
- URL:
Automatic semantic annotation of high throughput experimental data (in progress)
Semantic Web Process with WSDL-S for semantic annotations of Web Services
-> Glycomics project (funded by NCRR)

GlycO – A domain ontology for glycans

GlycO

Structural modeling issues in GlycO
- An extremely large number of glycans occur in nature
- But frequently there are only small differences in their structural properties
- Modeling all possible glycans would involve a significant number of redundant classes
- Redundancy often results in fatal complexities in maintenance and upgrade

GlycoTree: A Canonical Representation of N-Glycans
N. Takahashi and K. Kato, Trends in Glycosciences and Glycotechnology, 15:
[Figure: canonical tree of D-GlcpNAc and D-Manp residues joined by (1-2), (1-3), (1-4) and (1-6) linkages]
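
One way to read the canonical-tree idea against the redundancy concern is that each residue position is defined once and every glycan is described as a selection of those shared positions. The Python sketch below illustrates that reading only; the residue ids (`r1` to `r6`), the tree shape and the linkages are hypothetical and are not taken from GlycO or from the Takahashi and Kato tree.

```python
# Canonical tree: residue id -> (parent id, linkage to parent). Illustrative only.
CANONICAL = {
    "r1": (None, None),
    "r2": ("r1", "1-4"),
    "r3": ("r2", "1-4"),
    "r4": ("r3", "1-3"),
    "r5": ("r3", "1-6"),
    "r6": ("r4", "1-2"),
}


def is_valid_glycan(residues):
    """A glycan is a non-empty set of canonical residues that includes the
    parent of every residue it contains, so it forms a connected subtree."""
    chosen = set(residues)
    if not chosen or not chosen.issubset(CANONICAL):
        return False
    return all(
        CANONICAL[r][0] is None or CANONICAL[r][0] in chosen for r in chosen
    )


CORE = {"r1", "r2", "r3", "r4", "r5"}
EXTENDED = CORE | {"r6"}      # differs from CORE by a single residue
SHARED = CORE & EXTENDED      # positions described once, reused by both
```

Two glycans that differ by one residue then share all of their other positions instead of each requiring a separately modeled structure.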

A biosynthetic pathway
GNT-I attaches GlcNAc at position 2:
UDP-N-acetyl-D-glucosamine + alpha-D-Mannosyl-1,3-(R1)-beta-D-mannosyl-R2 -> UDP + N-Acetyl-beta-D-glucosaminyl-1,2-alpha-D-mannosyl-1,3-(R1)-beta-D-mannosyl-R2
GNT-V attaches GlcNAc at position 6:
UDP-N-acetyl-D-glucosamine + G00020 -> UDP + G00021
Labels: N-acetyl-glucosaminyl_transferase_V, N-glycan_beta_GlcNAc_9, N-glycan_alpha_man_4

Proteomics Process Ontology - ProPreO
A process ontology to capture the proteomics experimental lifecycle: Separation, Mass spectrometry, Analysis
340 classes with 200+ properties
Proteomics experimental data include:
a) Data provenance
b) Comparability of data, metadata (parameter settings for an HPLC run) and results
c) Finding implicit relationships between data sets using relations in the ontology, leading to indirect but critical interactions and perhaps to knowledge discovery
* (PEDRO UML schema)

N-Glycosylation Process (NGP)
[Figure: workflow diagram. Materials and data: Cell Culture, Glycoprotein Fraction, Glycopeptides Fraction (n), Peptide Fraction (n*m), ms data, ms/ms data, ms peaklist, ms/ms peaklist, Peptide list, N-dimensional array. Steps: extract, proteolysis, Separation technique I, PNGase, Separation technique II, Mass spectrometry, Data reduction, binning, Peptide identification, Glycopeptide identification and quantification, Signal integration, Data correlation]

Semantic Annotation of Scientific Data
ms/ms peaklist data
<parameter instrument="micromass_QTOF_2_quadropole_time_of_flight_mass_spectrometer" mode="ms/ms"/>
Annotated ms/ms peaklist data

Semantic Annotation of Scientific Data
Annotated ms/ms peaklist data
<parameter instrument="micromass_QTOF_2_quadropole_time_of_flight_mass_spectrometer" mode="ms/ms"/>
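
A consumer of the annotated peaklist could read the annotation back and check it against known ontology terms. The following is a minimal sketch, assuming the `<parameter>` element shown on the slide; the `KNOWN_INSTRUMENTS` set is a hypothetical stand-in for a lookup against ProPreO.

```python
import xml.etree.ElementTree as ET

# The annotation element from the slide, written as well-formed XML.
ANNOTATION = (
    '<parameter '
    'instrument="micromass_QTOF_2_quadropole_time_of_flight_mass_spectrometer" '
    'mode="ms/ms"/>'
)

# Hypothetical stand-in for the set of instrument concepts in the ontology.
KNOWN_INSTRUMENTS = {
    "micromass_QTOF_2_quadropole_time_of_flight_mass_spectrometer",
}


def read_annotation(xml_text):
    """Extract the annotation attributes and note whether the instrument
    value matches a known concept name."""
    element = ET.fromstring(xml_text)
    instrument = element.get("instrument")
    return {
        "instrument": instrument,
        "mode": element.get("mode"),
        "instrument_known": instrument in KNOWN_INSTRUMENTS,
    }


PARSED = read_annotation(ANNOTATION)
```

Because the attribute value is a concept name rather than free text, software can use it for comparison or discovery without parsing instrument descriptions.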

Beyond Provenance: Semantic Annotations
- Data provenance: information regarding the 'place of origin' of a data element
- Semantic annotation: mapping a data element to concepts that collaboratively define it and enable its interpretation
- Data provenance paves the path to repeatability of data generation, but it does not enable:
  - its (machine) interpretability
  - its computability (e.g., discovery)
Semantic annotations make these possible.

Ontology-mediated Proteomics Protocol
[Figure labels: Mass Spectrometer, RAW Files, Conversion To PKL, PKL Files (XML-based Format), Preprocessing, 'Clean' PKL Files, DB Search, RAW Results File, Post processing, Output (*.dat), Data Processing Application, Instrument DB, Storing]
ProPreO terms shown: Micromass_Q_TOF_ultima_quadrupole_time_of_flight_mass_spectrometer, Masslynx_Micromass_application, mass_spec_raw_data, Micromass_Q_TOF_micro_quadrupole_time_of_flight_ms_raw_data, produces_ms-ms_peak_list
All values of the produces ms-ms peaklist property are micromass pkl ms-ms peaklist
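
The statement that all values of the produces ms-ms peaklist property are micromass pkl ms-ms peaklists reads like a universal restriction on a property. The sketch below checks such a restriction over toy data in a closed-world way, which is an assumption of the example; an OWL reasoner would instead infer the type of each value. The individuals (`masslynx_run42`, `run42_peaklist` and so on) and the class name spelling `micromass_pkl_ms-ms_peaklist` are hypothetical.

```python
# Class of each individual; individuals are invented for the example.
TYPES = {
    "run42_peaklist": "micromass_pkl_ms-ms_peaklist",
    "run43_output": "mass_spec_raw_data",
}

# Property assertions as (subject, property, value).
ASSERTIONS = [
    ("masslynx_run42", "produces_ms-ms_peak_list", "run42_peaklist"),
    ("masslynx_run43", "produces_ms-ms_peak_list", "run43_output"),
]


def violations(prop, required_class):
    """List (subject, value) pairs where a value of prop is not in required_class."""
    return [
        (s, o)
        for s, p, o in ASSERTIONS
        if p == prop and TYPES.get(o) != required_class
    ]


BAD = violations("produces_ms-ms_peak_list", "micromass_pkl_ms-ms_peaklist")
```

Here the second assertion is flagged because its value is typed as raw data rather than as a peaklist.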

Service description using WSDL-S
- Formalize description and classification of Web Services using ProPreO concepts
Description of a Web Service using: Web Service Description Language
WSDL ModifyDB:
<wsdl:definitions targetNamespace="urn:ngp" ….. xmlns:xsd=" <schema targetNamespace="urn:ngp" xmlns=" …..
WSDL-S ModifyDB:
<wsdl:definitions targetNamespace="urn:ngp" …… xmlns:wssem=" xmlns:ProPreO=" > <schema targetNamespace="urn:ngp" xmlns=" …… <wsdl:message name="replaceCharacterRequest" wssem:modelReference="ProPreO#peptide_sequence">
Concepts defined in the ProPreO process ontology: data, sequence, peptide_sequence
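
To show how a tool might consume the `wssem:modelReference` annotation, the sketch below collects the ontology concept attached to each WSDL message. It rests on assumptions: the namespace URIs are missing from the slide text, so `WSSEM_NS` is a placeholder rather than the real WSDL-S namespace, and the `replaceCharacterResponse` message is invented to show an unannotated case.

```python
import xml.etree.ElementTree as ET

WSDL_NS = "http://schemas.xmlsoap.org/wsdl/"  # WSDL 1.1 namespace
WSSEM_NS = "urn:example:wssem"                # placeholder, not the slide's URI

# Cut-down service description; only the message name and modelReference
# value are taken from the slide.
DOC = f"""
<wsdl:definitions targetNamespace="urn:ngp"
    xmlns:wsdl="{WSDL_NS}" xmlns:wssem="{WSSEM_NS}">
  <wsdl:message name="replaceCharacterRequest"
      wssem:modelReference="ProPreO#peptide_sequence"/>
  <wsdl:message name="replaceCharacterResponse"/>
</wsdl:definitions>
"""


def model_references(xml_text):
    """Map each annotated message name to its ontology concept reference."""
    root = ET.fromstring(xml_text)
    refs = {}
    for message in root.findall(f"{{{WSDL_NS}}}message"):
        ref = message.get(f"{{{WSSEM_NS}}}modelReference")
        if ref is not None:
            refs[message.get("name")] = ref
    return refs


REFS = model_references(DOC)
```

A registry or composition tool could then match services on the referenced concept, here `peptide_sequence`, rather than on message names.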

Biological UDDI (BUDDI): WS Registry for Proteomics and Glycomics
- There are no current registries that use semantic classification of Web Services in glycoproteomics
- BUDDI classification is based on proteomics and glycomics classification; it is part of an integrated glycoproteomics Web portal called Stargate
- NGP to be published in BUDDI
- Can enable other systems such as myGrid to use NGP Web Services to build a glycomics workbench

Summary, Observations, Conclusions
- Ontology schema: relatively simple in business/industry, highly complex in science
- Ontology population: could have millions of assertions, or unique features when modeling complex life science domains
- Ontology population could be largely automated if access to high quality/curated data/knowledge is available; ontology population involves disambiguation and results in richer representation than extracted sources
- Ontology freshness (and validation: not just schema correctness but knowledge, i.e., how it reflects the changing world)

Summary, Observations, Conclusions
- Ontology types: (upper), (broad base/language support), (common sense), domain, task, process, …
- Much of the power of semantics is based on the knowledge that populates an ontology (schemas by themselves are of little value)
- Some applications: semantic search, semantic integration, semantic analytics, decision support and validation (e.g., error prevention in healthcare), knowledge discovery, process/pathway discovery, …

Advertisement
IJSWIS (International Journal for Semantic Web & Information Systems) welcomes not only research but also vision and application (with evaluation/validation) papers
More details on Industry Applications of SW:
on Scientific Applications of SW: