Open Source Solutions for Tissue Banking Informatics Jules J. Berman, Ph.D., M.D. INFORMATICS FOR REPOSITORIES Wednesday, May 21, 2008 3:30 pm – 4:05 pm.

Slides:



Advertisements
Similar presentations
Microsoft ® Access ® 2010 Training Design the tables for a new database.
Advertisements

XSL XSLT and XPath 11-Apr-17.
Overview of Data Structures and Algorithms
Thane Kerner Silverchair. What is… The Semantic Web? A Semantic Data Layer? Semantic Tagging? Why add semantics to my content? How can I get semantic.
Chapter 1: The Database Environment
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
CS 571. Web services Web service: "a software system designed to support interoperable machine-to-machine interaction over a network“ – W3C In short,
CS 898N – Advanced World Wide Web Technologies Lecture 21: XML Chin-Chih Chang
DDI 3.0 Conceptual Model Chris Nelson. Why Have a Model Non syntactic representation of the business domain Useful for identifying common constructs –Identification,
Integration of Applications MIS3502: Application Integration and Evaluation Paul Weinberg Adapted from material by Arnold Kurtz, David.
Creating Web Pages Getting Started. Overview What Web Pages Are How Web Pages are Formatted Putting Graphics on Web Pages How Web Pages are Linked Linking.
Russell Taylor Lecturer in Computing & Business Studies.
The Basic Tools Presented by: Robert E., & Jonathan Chase.
MUSCLE WP9 E-Team Integration of structural and semantic models for multimedia metadata management Aims: (Semi-)automatic MM metadata specification process.
BIS310: Week 7 BIS310: Structured Analysis and Design Data Modeling and Database Design.
4/20/2017.
CCSA 221 Programming in C CHAPTER 2 SOME FUNDAMENTALS 1 ALHANOUF ALAMR.
EARTH SCIENCE MARKUP LANGUAGE “Define Once Use Anywhere” INFORMATION TECHNOLOGY AND SYSTEMS CENTER UNIVERSITY OF ALABAMA IN HUNTSVILLE.
Chapter 16 The World Wide Web. 2 The Web An infrastructure of information combined and the network software used to access it Web page A document that.
Practical RDF Chapter 1. RDF: An Introduction
GRITS Working with AVM Data Astronomy Visualization Metadata June 11th, 2010 Casey Rosenthal
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management Dave Salisbury ( )
USING PERL FOR CGI PROGRAMMING
The INTERNET how it works. the internet: defined So, what is it?
Integrating Live Plant Images with Other Types of Biodiversity Records Steve Baskauf Vanderbilt Dept. of Biological Sciences
Creational Patterns CSE301 University of Sunderland Harry R Erwin, PhD.
HTML Internet Basics & Beyond. What The Heck Is HTML? HTML is the language of web pages. In order to truly understand HTML, you need to know a little.
API Imaging Working Group Jules J. Berman and Ulysses J. Balis, Co-Chairs Laboratory Digital Imaging Project APIII, 2006 Vancouver, British Columbia Thursday,
De-Identification Jules J. Berman, Ph.D., M.D. Panel #: 1, March 8.
1 XML An Overview Roger Debreceny University of Hawai`i Skip White University of Delaware XBRL Workshop, August 2006.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Metadata for the GPII Liddy Nevile. DRD metadata.
Guide to Programming with Python Chapter One Getting Started: The Game Over Program.
(Spring 2015) Instructor: Craig Duckett Lecture 10: Tuesday, May 12, 2015 Mere Mortals Chap. 7 Summary, Team Work Time 1.
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
API Working Group Session – Open Discussion of Pathology Digital Imaging Standards Organizers: Jules J. Berman, Ph.D., M.D. Program Director for Pathology.
XML Extras Outline 1 - XML in 10 Points 2 - XML Family of Technologies 3 - XML is Modular 4 - RDF and Semantic Web 5- XML Example: UK GovTalk Group’s Schema.
Copyright OpenHelix. No use or reproduction without express written consent1.
Introducing the World Wide Web Internet- a structure made up of millions of interconnected computers whose users communicate with each other and share.
Construction Planning and Prerequisite
INTRODUCTION TO CSS. TOPICS TO BE DISCUSSED……….  Introduction Introduction  Features of CSS Features of CSS  Creating Style Sheet Creating Style Sheet.
ECE450 - Software Engineering II1 ECE450 – Software Engineering II Today: Introduction to Software Architecture.
Implementing an RDF Schema for Pathology Images, From the Association for Pathology Informatics Jules J. Berman, Ph.D., M.D. APIII, Pittsburgh, PA Monday,
Mining the Biomedical Research Literature Ken Baclawski.
Semantic Phyloinformatic Web Services Using the EvoInfo Stack Speaker: John Harney LSDIS Lab, Dept. of Computer Science, University of Georgia Mentor(s):
Computer Science, Algorithms, Abstractions, & Information CSC 2001.
Basics of Web Based Computing. The Architecture The user’s system A Web Server What’s inside? Server software Apache or other Resources to be accessible.
Software Reuse Course: # The Johns-Hopkins University Montgomery County Campus Fall 2000 Session 4 Lecture # 3 - September 28, 2004.
Ontology Evaluation, Metrics, and Metadata in NCBO BioPortal Natasha Noy Stanford University.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
21/1/ Analysis - Model of real-world situation - What ? System Design - Overall architecture (sub-systems) Object Design - Refinement of Design.
 XML derives its strength from a variety of supporting technologies.  Structure and data types: When using XML to exchange data among clients, partners,
Chapter 04 Semantic Web Application Architecture 23 November 2015 A Team 오혜성, 조형헌, 권윤, 신동준, 이인용.
Informatics Laboratory Digital Imaging Project APIII 2006 Vancouver, British Columbia Session F2, Friday, August 18, 2006, 10:30 A.M. to noon Jules J.
XML 1. Chapter 8 © 2013 Pearson Education, Inc. Publishing as Prentice Hall SAMPLE XML SCHEMA (XSD) 2 Schema is a record definition, analogous to the.
By Jeremy Burdette & Daniel Gottlieb. It is an architecture It is not a technology May not fit all businesses “Service” doesn’t mean Web Service It is.
Java Web Services Orca Knowledge Center – Web Service key concepts.
Experiences and Status
CaBig February 6, 2007 Jules Berman, Ph.D., M.D.
XML QUESTIONS AND ANSWERS
The Evils of Complexity
SVTRAININGS. SVTRAININGS Python Overview  Python is a high-level, interpreted, interactive and object-oriented scripting language. Python is designed.
Teaching slides Chapter 1.
Chapter 1: The Database Environment
Introduction to Programming Using C++
Semantic Markup for Semantic Web Tools:
CSIS110 - Introduction to Computer Science
MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management
Presentation transcript:

Open Source Solutions for Tissue Banking Informatics Jules J. Berman, Ph.D., M.D. INFORMATICS FOR REPOSITORIES Wednesday, May 21, :30 pm – 4:05 pm

Approaches to finding open source solutions 1. Generalize (don't specialize). Wherever possible, don't think of your tissue repository problems as being unique. Try to think of your problems as instances of very general informatics problems. In most cases, the same open source solutions that work for bioinformaticians, astronomers, and factory inventories will likely work for you

Approaches to finding open source solutions 2. Learn a popular open source programming language that is easy to learn and that is supplemented by an enthusiastic biomedical community Perl Python Ruby

Approaches to finding open source solutions 3. Use open source, unencumbered nomenclatures, codes, syntactic formats. Otherwise, can't share or post data through web MESH (standard, open source, free) UMLS (standard, encumbered) SNOMED (standard, encumbered) Neoplasm Classification (non-standard, open source, free, standard syntax – XML, RDF)

Approaches to finding open source solutions 4. Use an open source and general data syntax HTML (formatting and linking) XML (describing data) RDF (getting meaning from described data)

All data can be specified using RDF, developed by the W3C. RDF files are collections of statements expressed as data triples “Jules Berman” “blood glucose level” “85” “Mary Smith” “eye color” “brown” “Samuel Rice” “eye color” “blue” “Jules Berman” “eye color” “brown” When you bind a key/value pair to a specified object, you're moving from the realm of data structure (i.e., XML) into the realm of data meaning.

Medical file: “Jules Berman” “blood glucose level” “85” “Mary Smith” “eye color” “brown” “Samuel Rice” “eye color” “blue” “Jules Berman” “eye color” “brown” Merged Jules Berman database: “Jules Berman” “blood glucose level” “85” “Jules Berman” “eye color” “brown” “Jules Berman” “hat size” “9” Hat file: “Sally Frann” “hat size” “8” “Jules Berman” “hat size” “9” “Fred Garfield” “hat size” “9” “Fred Garfield” “hat_type” “bowler” RDF permits data to be merged between different files

Approaches Use open source utilities – not software applications (open source or otherwise)l Utilities are simple programs that do one type of job, very well. Often work from command-line (i.e., no GUI) Once you've mastered a dozen or so utilities, you can handle most informatics task that you'll come across. Applications are often complex and seldom provide the functionality you need (now or future).

Approaches Learn the algorithms for your discipline. Algorithms are process descriptions that work every time. Most informatics algorithms can be implemented in under ten lines of software code You can think of software applications as many algorithms working under a GUI If you really understand algorithms, you can make important contributions to your field.

Approaches De-emphasize standards. Most standards are difficult to understand, and there are many of them, often covering obscure domains. Many standards are just bad. Data kept in a standard today may be non- standard legacy data tomorrow. Unlike physical standards, standards are transformable (so why fuss over any one standard?). Standards can be encumbered

Specifications often a better solution than Standards Specifications are just descriptions of your data. A specification requires a common language for describing data (so that you and your computer can understand what it's trying to convey). Specifications give you enormous freedom to create and describe new and unconventional data objects. Usually done in RDF If you've specified your data well, you can port between standards when you need to.

Example: Pathology image annotation

Important descriptors of an image might include: File information Image capture information Image format information Specimen information Patient information Pathology information Region of interest information

JPEG is an image format that is used by millions of people in all types of professions, including the medical profession JPEG can now be used without worrying about IP issues You can put any information you want into the header of a JPEG image (including an RDF document) so that specified clinical/pathological information can be conveyed with the image Because images non-physical, it is usually easy to interconvert image formats

By annotating our images, we can ensure that the image conveys meaning and value By using RDF, we can ensure that the individual triples can be integrated with heterogeneous data sources beyond those of images. By using pre-existing international general standards for describing any kind of data, we attain interoperability and avoid the confusion and complexity that occurs whenever a new standard is created. See:

Would you like to write a Tissue Respository/Tissue Informatics book?