Helena F. Deus and Jonas S. Almeida


A method to propagate permissions in biomedical data using a semantic web framework Helena F. Deus and Jonas S. Almeida hdeus@mathbiol.org The University of Texas M. D. Anderson Cancer Center

History of the web
Web 1.0: Links -> Documents
Web 2.0: Links -> Data Structures -> Web Services
Web 3.0: Links -> Web Services -> Links -> Web Services -> Links -> Web Services ...

Evolution of data representation Nature Biotechnology. 2005 Vol 23 Nr 29

Data management in the life sciences
Diagram labels: Clinical/Medical data, MDAxxxx, Electronic Health Records, RDBMS. Life is good!

Heterogeneous data management
Diagram labels: Clinical/Medical data, MDAxxxx, RDBMS; Core facilities data: DNA Sequencing, Microarrays, Protein Arrays, Pulse Field Gel Electrophoresis. Data everywhere!

Semantic web of data: a set of best practices

A data pyramid (W3C)
Pyramid levels: Wisdom, Knowledge, Information, Data
Technology labels: OWL, OBO, RDF, SPARQL, XML, TEXT, Files

S3DB Core Model

Snapshots of interfaces using S3DB’s API (Application Programming Interface). These applications illustrate why semantic web designs can be particularly effective at enabling generic tools to help users explore data that document very specific and very complex relationships.

Snapshot A was taken from S3DB’s web interface, which is included in the downloadable package. This interface was developed to assist in managing the database model and is therefore centered on visualizing and manipulating the domain of discourse: its Collections of Items and the Rules that define how their relations are documented.

Snapshots B-D show a document management tool, S3DBdoc, freely available as a Bioinformatics Station module (see Figure 6). Navigation starts from the Project (C), moves to the Collection (B), and ends with editing the Statements about an Item (D). Snapshot B illustrates an intermediate step in the navigation, where the list of Items (in this case samples assayed by tissue arrays, for which there is clinical information about the donor) is being trimmed according to the properties of a distant entity, Age at Diagnosis, which is a property of the Clinical Information Collection associated with the sample that originated the array results. This interaction would have been difficult and computationally intensive to manage using a relational architecture.

The RDF-formatted query result produced by the API was also visualized using a commercial tool, Sentient Knowledge Explorer (IO-Informatics Inc), shown in snapshot E, and by Welkin (F), developed by the SIMILE digital interoperability project at the Massachusetts Institute of Technology. See text for discussion of the graphic representations produced by these tools. To protect patient confidentiality, some values in snapshots B and D are scrambled, and numeric sample and patient identifiers elsewhere are altered.

PLoS ONE. 2008 Aug 13;3(8):e2946

Example: TCGA data structure http://tcga.s3db.org

S3DB Rule http://tcga.s3db.org/R247 Sample ?? Patient blood Tissue Patient Sample tumor S3DB Statement http://tcga.s3db.org/S234 sampleX patientY R427
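One way to read the Rule and Statement on this slide is as a schema-level triple and an instance-level triple that points back to it. The following is a minimal sketch of that reading, not S3DB's actual data structures: the class and field names are hypothetical, the predicate is kept as the "??" placeholder shown on the slide, and the sketch assumes the Statement refers to the Rule at http://tcga.s3db.org/R247 (the slide also shows the label R427 next to the Statement).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    """Schema-level triple relating two Collections (hypothetical fields)."""
    uri: str
    subject: str    # name of the subject Collection
    predicate: str  # shown as "??" on the slide; left as an unknown placeholder
    obj: str        # name of the object Collection


@dataclass(frozen=True)
class Statement:
    """Instance-level triple relating an Item to a value under one Rule."""
    uri: str
    rule: Rule
    subject_item: str
    value: str

    def as_triple(self):
        # The Statement borrows its predicate from the Rule it instantiates.
        return (self.subject_item, self.rule.predicate, self.value)


# Assumption: the Statement on the slide instantiates this Rule.
rule = Rule("http://tcga.s3db.org/R247", "Sample", "??", "Patient")
stmt = Statement("http://tcga.s3db.org/S234", rule, "sampleX", "patientY")

print(stmt.as_triple())
```

Keeping the Rule as a separate object is what lets the domain of discourse (Sample, Patient) be edited independently of the instances (sampleX, patientY) that populate it.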

TCGA domain - instance PLoS ONE. 2008 Dec;3(12):e4076

SPARQL

Code portability and distributed data
Diagram labels: API, SPARQL

Permission management Markov Model

Permission propagation
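The slides do not spell out how permissions propagate, so the following is only an illustrative sketch of one simple scheme, nearest-ancestor inheritance, and not necessarily the authors' method (the previous slide mentions a Markov model). The containment hierarchy, the permission labels ("view", "edit", "none") and the function name are all assumptions made for the example.

```python
# Hypothetical containment hierarchy: child -> parent (None marks the root).
PARENT = {
    "project": None,
    "collection": "project",
    "rule": "project",
    "item": "collection",
    "statement": "item",
}


def effective_permission(entity, explicit, parent=PARENT, default="none"):
    """Return the permission of the nearest ancestor (or the entity itself)
    that has an explicitly assigned permission; fall back to `default`."""
    node = entity
    while node is not None:
        if node in explicit:
            return explicit[node]
        node = parent[node]
    return default


# Only two entities carry explicit permissions; the rest inherit.
explicit = {"project": "view", "item": "edit"}

for name in PARENT:
    print(name, effective_permission(name, explicit))
```

In this sketch a permission set on the Project reaches every entity beneath it unless a more specific assignment overrides it, which is one way data submitters could keep fine-grained control without annotating every Statement.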

Ontology layers, from most general to most specific:
Upper ontologies
Intermediate ontologies
Domain-specific ontologies (MGED and others): current entry level for computation
Experimental, evolving data models: proposed entry level for computation
Raw data

S3DB.ORG

What is S3DB?
It is a web service that manages semantic web content, distinguishing the domain of discourse from its instantiation.
It was configured specifically for the needs of Biomedical Informatics projects where:
Those who submit the data keep fine-tuned control over its access and use.
The data model is deployed over a core ontology that allows its editing.
It has a distributed deployment designed to deal with heterogeneous environments.

What is S3DB not?
It is not a client application.
It is not a “work in progress”: a SPARQL endpoint ensures that experimental data is not kept outside of the Linked Data Web until it matures.

In Conclusion
Dissolution of boundaries between data structures is a good thing...
But doing it without losing the role of each data element is even better.
Some level of explicit granularity in the data is necessary to implement a permission model.

Acknowledgements
Jonas S. Almeida, Kadir Akdemir, Miriã Coelho, Cintia Palú, Pablo Freire
The Integrative Bioinformatics Lab at the University of Texas MD Anderson Cancer Center (Houston, TX)
Instituto de Tecnologia Quimica e Biologica, Universidade Nova de Lisboa (Lisbon, Portugal)
http://s3db.org