Tumor Profile Discovery and Tumor Bank Management with DORA

Adrian Driga 1, Russ Greiner 2, Kathryn Graham 1, 4, Sambasivarao Damaraju 1, 4, David Wishart 2, 3, John Mackey 1, 4, Carol Cass 1, 4

1 Cross Cancer Institute, Alberta Cancer Board; 2 Department of Computing Science, 3 Faculty of Pharmacy and Pharmaceutical Sciences, 4 Department of Oncology, University of Alberta, Edmonton

Main Features of DORA

Integration of molecular and clinical information. Finding clinically relevant tumor profiles requires the analysis of genetic and clinical information for large cohorts of patients. For every patient, microarray, SNP, and metabonomic technologies can generate massive amounts of data. DORA speeds up the analysis by seamlessly linking each patient's molecular data with the relevant clinical data. Integrated views of the patient data are analyzed statistically and with machine learning techniques in order to discover molecular profiles for clinical factors. It has been shown that gene expression profiles can be reliable predictors of treatment response, relapse, and disease-free survival, and that certain combinations of SNPs can indicate predisposition to cancer.

Data sharing and portability. Both the DORA database and its software can be shared with, or easily replicated at, other research centers. Researchers can access a centralized DORA database remotely or manage their own copy of DORA. In the latter scenario, data are shared via import/export software. Data sharing is particularly important for studies on rare tumor groups (e.g., brain, pancreas) because it allows researchers to accumulate a large enough number of patients from across the province or the country. Researchers can exchange molecular and clinical patient data while retaining ownership of the data that they have generated.

Scalability. DORA is designed so that new modules can be quickly integrated with the existing ones.
Currently, modules for microarray, SNP, and metabonomic data and clinical sections for breast, lung, gastric, and ovarian cancer are fully functional. A clinical section for brain/CNS cancer is still in the design phase and will be implemented soon.

DORA (Database for Online Retrieval and Analysis) is a web-accessible medical and laboratory information management system (LIMS), through which clinical, microarray, SNP, and metabonomic information from PolyomX-consented patients is stored, retrieved, managed, and analyzed. DORA is designed for data warehousing and has a flexible relational database architecture that can be readily scaled up to accommodate clinical data from new cancer types, or experimental data from new laboratory assays. PolyomX currently collects clinical and molecular data for four cancer types: breast, lung, ovarian, and gastric. DORA facilitates the generation of a cancer knowledge base and will help individualize cancer treatment by allowing researchers to identify patient-specific characteristics of a cancer at the molecular level.

DORA supports translational research by providing quick electronic access to patient information from the clinical and molecular domains. For example, class prediction analysis is performed using the signal intensity of the microarray spots and the values of a clinical factor that partitions a group of patients into two classes. Database queries retrieve this information and present it to the statistical analysis software in the appropriate format.

Confidentiality of the information collected in DORA is strictly maintained. Access to DORA is password-protected, and confidential information is encoded before being stored in the database. Access to the confidential information and modification of data are highly restricted. For security reasons, DORA is currently available only within the Cross Cancer Institute computer network.
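The class prediction retrieval described above can be illustrated with a minimal sketch. It is not DORA's code (the poster says DORA uses Perl and MySQL); it uses Python with an in-memory SQLite database as a stand-in, and the table names (expression, clinical), column names, and the function class_prediction_input are hypothetical. It joins intensity values with one clinical factor and reshapes the rows into a patient-by-gene matrix X and a two-class label vector y.

```python
import sqlite3


def class_prediction_input(conn, factor_name):
    """Return (patients, genes, X, y) for a clinical factor with two classes.

    Assumed (hypothetical) tables:
      expression(patient_id, gene_id, intensity)
      clinical(patient_id, factor, value)
    """
    rows = conn.execute(
        """
        SELECT e.patient_id, e.gene_id, e.intensity, c.value
        FROM expression AS e
        JOIN clinical AS c ON c.patient_id = e.patient_id
        WHERE c.factor = ?
        ORDER BY e.patient_id, e.gene_id
        """,
        (factor_name,),
    ).fetchall()

    genes = sorted({gene for _, gene, _, _ in rows})
    labels = {}    # patient_id -> clinical factor value
    profiles = {}  # patient_id -> {gene_id: intensity}
    for patient, gene, intensity, value in rows:
        labels[patient] = value
        profiles.setdefault(patient, {})[gene] = intensity

    classes = sorted(set(labels.values()))
    if len(classes) != 2:
        raise ValueError("the clinical factor must split patients into two classes")

    patients = sorted(profiles)
    # One row per patient, one column per gene; missing values become None.
    X = [[profiles[p].get(g) for g in genes] for p in patients]
    # Encode the two classes as 0 and 1, in sorted order of their values.
    y = [classes.index(labels[p]) for p in patients]
    return patients, genes, X, y


def demo_connection():
    """Build a tiny in-memory database with made-up illustrative values."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE expression (patient_id INTEGER, gene_id TEXT, intensity REAL);
        CREATE TABLE clinical (patient_id INTEGER, factor TEXT, value TEXT);
        INSERT INTO expression VALUES
            (1, 'G1', 2.0), (1, 'G2', 5.5),
            (2, 'G1', 7.1), (2, 'G2', 1.2),
            (3, 'G1', 6.8), (3, 'G2', 0.9);
        INSERT INTO clinical VALUES
            (1, 'relapse', 'no'), (2, 'relapse', 'yes'), (3, 'relapse', 'yes');
        """
    )
    return conn
```

Calling class_prediction_input(demo_connection(), "relapse") yields one intensity row per patient and a 0/1 label per patient, which is the shape most statistical and machine learning packages expect.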
Tumor banking information is managed through the Tumor Banking Database, a self-contained module of DORA. Please see the poster on the Tumor Banking Database for details.

DORA is implemented as a MySQL database and is made available to users via an Apache web server. The software that connects the web forms with the database is written in Perl and runs on the DORA server. The server on which DORA resides runs Red Hat Linux and is protected by a firewall. All the software that is needed to run a DORA server, i.e., Red Hat Linux, MySQL, and Perl, is freely available for non-commercial purposes. PolyomX has designed and implemented the software specific to DORA and can make this software available to ACB researchers.

Acknowledgements

The authors want to thank Drs. Brent Zanke, Tony Reiman, Tim Winton, Bryan Dicken, Michael Sawyer, Helen Steed, Katia Tonkin, and David Omahen for their help in designing the clinical modules of DORA, and Jennifer Listgarten for her help in designing the microarray module. PolyomX is supported by the Alberta Cancer Foundation and the Alberta Cancer Board.

DORA Modules, Schema, and Forms

Figure 1: Overview of Database Schema. Entities shown: Patient, Cancer Disease, Pathology (tissue), Stage/Progression, Treatment, Treatment Protocol, Lab Work (blood, urine), Microarray, SNP, and Metabonomics. The entities are connected by one-to-one and one-to-many links (M = Many).

Figure 2: Overview of Microarray (MA) Module Schema. Tables shown, with their main contents (links are one-to-many, M = Many; one link is labeled "generates"):
  MA Slide Group: tissue ID
  MA Slide Repeat: experiment details and parameters
  MA Slide Spot: position, intensity value
  MA Normalized Slide Spot: position, normalized intensity value
  MA Gene Aggregate Value: tissue ID, sequence ID, aggregate of the normalized intensity values for the sequence across all spots from the repeats in the group
  MA Slide Type: manufacturer, version
  Sequence Info: oligo/cDNA, gene of origin IDs
  MA Slide Spot Sequence: slide map (e.g., GAL)

Figure 3: Ovarian Cancer Pathology Form

Figure 4: Lung Cancer Stage/Progression Form
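The MA Gene Aggregate Value entry in Figure 2 describes a computation, which the following sketch illustrates. It is only a sketch under stated assumptions: the table and column names are hypothetical simplifications of the figure, the join keys (group_id, repeat_id, position) are assumed because the figure's links are not spelled out in the text, a single slide type is assumed so that spot position alone identifies the sequence, the aggregate is taken to be the mean (the poster does not name the aggregate function), and SQLite stands in for MySQL.

```python
import sqlite3

# Hypothetical, simplified versions of four tables from the MA module.
SCHEMA = """
CREATE TABLE ma_slide_group (group_id INTEGER PRIMARY KEY, tissue_id TEXT);
CREATE TABLE ma_slide_repeat (repeat_id INTEGER PRIMARY KEY, group_id INTEGER);
CREATE TABLE ma_normalized_slide_spot (
    repeat_id INTEGER, position INTEGER, normalized_intensity REAL);
CREATE TABLE ma_slide_spot_sequence (position INTEGER, sequence_id TEXT);
"""

# Aggregate (assumed: mean) of normalized intensities per tissue and sequence,
# taken across all spots from all repeats in the slide group.
AGGREGATE_SQL = """
SELECT g.tissue_id, q.sequence_id, AVG(n.normalized_intensity) AS aggregate_value
FROM ma_slide_group AS g
JOIN ma_slide_repeat AS r ON r.group_id = g.group_id
JOIN ma_normalized_slide_spot AS n ON n.repeat_id = r.repeat_id
JOIN ma_slide_spot_sequence AS q ON q.position = n.position
GROUP BY g.tissue_id, q.sequence_id
ORDER BY g.tissue_id, q.sequence_id
"""


def gene_aggregates(conn):
    """Return (tissue_id, sequence_id, aggregate_value) rows."""
    return conn.execute(AGGREGATE_SQL).fetchall()


def demo_connection():
    """One tissue, two repeat slides, three spots; values are made up."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.execute("INSERT INTO ma_slide_group VALUES (1, 'T1')")
    conn.executemany("INSERT INTO ma_slide_repeat VALUES (?, ?)", [(1, 1), (2, 1)])
    # Spots 1 and 2 carry sequence S1; spot 3 carries sequence S2.
    conn.executemany(
        "INSERT INTO ma_slide_spot_sequence VALUES (?, ?)",
        [(1, "S1"), (2, "S1"), (3, "S2")],
    )
    conn.executemany(
        "INSERT INTO ma_normalized_slide_spot VALUES (?, ?, ?)",
        [(1, 1, 1.0), (1, 2, 3.0), (1, 3, 10.0),
         (2, 1, 2.0), (2, 2, 2.0), (2, 3, 14.0)],
    )
    return conn
```

With the demo data, gene_aggregates(demo_connection()) returns one row per tissue and sequence; storing such rows in a separate table, as the figure suggests, avoids recomputing the aggregate for every analysis.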
Data Sharing Using DORA

Figure 5: Centralized Database Scenario (diagram: one DORA server and its data, reached by local and remote users over a wide area network). Users connect securely to a central instance of DORA and access molecular or clinical data according to their permissions. When a user creates a new patient record in the clinical, microarray, SNP, or metabonomics module, that user is marked as the owner of the record and is notified. All users can access all data as soon as the data are added to the database. However, when the central server is not accessible (e.g., the server is down for a software upgrade), no data are available. The center hosting the DORA server is responsible for database administration and software development.

Figure 6: Distributed (Federated) Database Scenario (diagram: DORA servers S1, S2, and S3, each with its own data, reached by local and remote users over a wide area network). Several DORA servers are available at different sites, and the databases have identical schemas. Users can connect to any of the servers, but add new records only to their local database. Integrated views of the data from several servers can be obtained via import/export tools for data exchange, or by running the same query against each database. The cost of administration and development is shared among the server hosts. With additional software, the schemas of the DORA databases do not need to be identical.
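Running the same query against each database in the federated scenario can be sketched as follows. This is an illustration under assumptions, not DORA's import/export software: in-memory SQLite databases stand in for the MySQL servers at sites S1 to S3, the patient table and the helper names (query_all_sites, make_site, demo) are hypothetical, and the schemas are taken to be identical across sites, as in Figure 6.

```python
import sqlite3


def query_all_sites(connections, sql, params=()):
    """Run one query against every site and tag each row with its site name.

    connections maps a site name to an open database connection; all
    databases are assumed to share the same schema.
    """
    merged = []
    for site in sorted(connections):
        for row in connections[site].execute(sql, params):
            merged.append((site,) + tuple(row))
    return merged


def make_site(patients):
    """Create one stand-in site database holding (patient_id, cancer_type) rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE patient (patient_id INTEGER, cancer_type TEXT)")
    conn.executemany("INSERT INTO patient VALUES (?, ?)", patients)
    return conn


def demo():
    """Pool brain tumor patients from three sites; the data are made up."""
    sites = {
        "S1": make_site([(1, "brain"), (2, "breast")]),
        "S2": make_site([(1, "brain")]),
        "S3": make_site([(1, "lung"), (2, "brain")]),
    }
    sql = ("SELECT patient_id FROM patient "
           "WHERE cancer_type = ? ORDER BY patient_id")
    return query_all_sites(sites, sql, ("brain",))
```

Tagging each row with its site keeps record ownership visible after the results are merged, and it prevents patient identifiers that repeat across sites from being confused with one another.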