Design, Construction and Early Use of the Biomedical Informatics Research Network July 2004 Dr. Philip Papadopoulos Program Director, Grid and Cluster.

Slides:



Advertisements
Similar presentations
NA-MIC National Alliance for Medical Image Computing National Alliance for Medical Image Computing: NAMIC Ron Kikinis, M.D.
Advertisements

ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
High Performance Computing Course Notes Grid Computing.
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CF21) IRNC Kick-Off Workshop July 13,
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Building on the BIRN Workshop BIRN Systems Architecture Overview Philip Papadopoulos – BIRN CC, Systems Architect.
Imaging, Medical Analysis and Grid Environments (IMAGE) June 3, 2015 Translating Imaging Science to the Emerging Grid Infrastructure Jeffrey S. Grethe.
Network Management Overview IACT 918 July 2004 Gene Awyzio SITACS University of Wollongong.
MS DB Proposal Scott Canaan B. Thomas Golisano College of Computing & Information Sciences.
New Approaches to GIS and Atlas Production Infrastructure for spatial data integration: across scales and projects Ilya Zaslavsky David Valentine San Diego.
Robust Tools for Archiving and Preserving Digital Data Joseph JaJa, Mike Smorul, and Mike McGann Institute for Advanced Computer Studies Department of.
SCHOOL OF INFORMATION UNIVERSITY OF MICHIGAN Biomedical Informatics Research Network.
Michael Marron, Ph.D., Director Division of Biomedical Technology National Center for Research Resources National Institutes of Health Department of Health.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
All Hands Meeting 2005 Morphometry BIRN - Overview - Scientific Achievements.
Computing in Atmospheric Sciences Workshop: 2003 Challenges of Cyberinfrastructure Alan Blatecky Executive Director San Diego Supercomputer Center.
Data Management Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
2004 NIH Building on the BIRN Bruce Rosen, MD PhD Randy Gollub, MD PhD Steve Pieper, PhD Morphometry BIRN.
2004 All Hands Meeting Morphometry BIRN Bruce Rosen, MD PhD Jorge Jovicich PhD Steve Pieper, PhD David Kennedy, PhD.
Data R&D Issues for GTL Data and Knowledge Systems San Diego Supercomputer Center University of California, San Diego Bertram Ludäscher
BIRN Update Carl Kesselman Professor of Industrial and Systems Engineering Information Sciences Institute Fellow Viterbi School of Engineering University.
Morphometry BIRN Bruce Rosen, M.D. Ph.D.. Scientific Goal Methods –Multi-site MRI calibration, acquisition –Integrate advanced image analysis and visualization.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
Biomedical Informatics Research Network BIRN Overview: Building the Biomedical Informatics Research Network October 9, nd Annual All Hands Meeting.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
Mark Ellisman, Ph.D. Professor of Neurosciences and Bioengineering Director, BIRN Coordinating Center Center for Research on Biological Systems University.
The Future of the iPlant Cyberinfrastructure: Coming Attractions.
2004 All Hands Meeting Morphometry BIRN: Milestones for 2005 Jorge Jovicich PhD Steve Pieper, PhD David Kennedy, PhD.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
BIRN Advantages in Morphometry  Standards for Data Management / Curation File Formats, Database Interfaces, User Interfaces  Uniform Acquisition and.
GCRC Meeting 2004 Introduction to the Grid and Security Philip Papadopoulos.
Introduction to Grid Computing Ed Seidel Max Planck Institute for Gravitational Physics
All Hands Meeting 2005 Morphometry BIRN - Overview - Scientific Achievements.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Authors: Ronnie Julio Cole David
The Biomedical Informatics Research Network Carl Kesselman BIRN Principal Investigator Professor of Industrial and Systems Engineering Information Sciences.
GRIDS Center Middleware Overview Sandra Redman Information Technology and Systems Center and Information Technology Research Center National Space Science.
GEON2 and OpenEarth Framework (OEF) Bradley Wallet School of Geology and Geophysics, University of Oklahoma
Cyberinfrastructure What is it? Russ Hobby Internet2 Joint Techs, 18 July 2007.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
All Hands Meeting 2005 AVI Update Morphometry BIRN Analysis, Visualization, and Interpretation.
Vicky Rowley Solution Architect BIRN Coordinating Center - University of California San Diego E-x-t-e-n-d-i-n-g Rocks: The Creation and Management of Grid.
Data Integration Progress. BIRN Data Integration Framework 2. Create conceptual links to a shared ontology 1. Create multimodal databases 3. Situate the.
2004 FBIRN Meeting FIRST BIRN: Challenges of Collaborative Multi- site, Multi-discipline Research Steven Potkin, MD Gary Glover, PhD Gregory McCarthy,
The OptIPuter Project Tom DeFanti, Jason Leigh, Maxine Brown, Tom Moher, Oliver Yu, Bob Grossman, Luc Renambot Electronic Visualization Laboratory, Department.
Morphometry BIRN Semi-Automated Shape Analysis (SASHA) JHU (CIS): M. F. Beg, C. Ceritoglu, A. Kolasny, M. I. Miller, R. Yashinski MGH (NMR): B. Fischl;
Jorge Jovicich, Ph.D. Massachusetts General Hospital - Harvard Medical School Biomedical Informatics Research Network Overview Testbeds Morphometry BIRN.
NeuroLOG ANR-06-TLOG-024 Software technologies for integration of process and data in medical imaging A transitional.
7. Grid Computing Systems and Resource Management
Cyberinfrastructure Overview Russ Hobby, Internet2 ECSU CI Days 4 January 2008.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
Fire Emissions Network Sept. 4, 2002 A white paper for the development of a NSF Digital Government Program proposal Stefan Falke Washington University.
All Hands Meeting 2003 mBIRN Scientific Session Maryann Martone October 8, 2003.
All Hands Meeting 2005 Semi-Automated Shape Analysis (SASHA)
Federating Standardized Clinical Brain Images Across Hospitals.
REQUIREMENTS GATHERING Moderators: M Miller Goals: To allow participants to provide feedback to the developers (BIRN-CC and test bed applications) of what.
By 5 pm Tuesday  Each group delivers their action plan/milestones to achieve current goals Phase I publication plan and public access Phase II—Intersite.
Biomedical Informatics Research Network The BIRN Architecture: An Overview Jeffrey S. Grethe, BIRN-CC 10/9/02 BIRN All Hands Meeting 2002.
Function BIRN The ability to find a subject who may have participated in multiple experiments and had multiple assessments done is a critical component.
NA-MIC National Alliance for Medical Image Computing UCSD / BIRN Coordinating Center NAMIC Group Site PI: Mark H. Ellisman Site Project.
All Hands Meeting 2004 Clinician’s Requirements for HID Query and Statistics Interface Christine Fennema-Notestine, Ph.D. David Kennedy, Ph.D.
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
All Hands Meeting 2005 BIRN-CC: Building, Maintaining and Maturing a National Information Infrastructure to Enable and Advance Biomedical Research.
BIRN: Where We Have Been, Where We are Going. Carl Kesselman BIRN Principal Investigator Professor of Industrial and Systems Engineering Information Sciences.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
Strategies for NIS Development
AVI Update Morphometry BIRN
Joseph JaJa, Mike Smorul, and Sangchul Song
Clinical and Translational Science Awards Program
Presentation transcript:

Design, Construction and Early Use of the Biomedical Informatics Research Network July 2004 Dr. Philip Papadopoulos Program Director, Grid and Cluster Computing San Diego Supercomputer Center University of California, San Diego

BIRN Overview BIRN – Biomedical Informatics Research Network –Funded by the National Institutes of Health –Focused on the data sharing needs of neuro-imaging scientists 17 Institutions 3 Test bed application groups Security, integrity, and tracking of data access very important –Well-defined software and hardware infrastructure that is replicated across sites –Challenges are not just technical Differing policies on across universities Sharing of data is new to the scientists

Agenda Overview of BIRN Some of the software/hardware details Initial results for grid-based science An incomplete set of challenges

BIRN is Team Science Applied to Stretch Goals A Big Challenge or Vision: A Big Challenge or Vision: “Enable new understanding of the healthy and diseased brain by linking data about macroscopic brain function to its molecular and cellular underpinnings” Taking practical steps toward a grand goal using cyberinfrastructure: Taking practical steps toward a grand goal using cyberinfrastructure: Federate geographically distributed brain data of the same & different types Federate geographically distributed brain data of the same & different types Accommodate requirements to collaboratively interact with shared databases of large-scale data, share methods, and computational resources Accommodate requirements to collaboratively interact with shared databases of large-scale data, share methods, and computational resources Scales of NS data from Maryann Martone

IT Infrastructure to hasten the derivation of new understanding and treatment of disease through use of distributed knowledge The BIRN Network

BIRN Today is … Three neuroscience test beds building on research projects –Mouse BIRN –Morph BIRN –Functional BIRN BIRN Coordinating Center (BIRN-CC) – IT hub for BIRN Major Activities include Integrating advanced biomedical imaging and clinical research centers in the US. Developing hardware and software infrastructure for managing distributed data: creation of data grids. Exploring data using “intelligent” query engines that can make inferences upon locating “interesting” data. Building bridges across tools and data formats. Changing the use pattern for research data from the individual laboratory/project to shared use

BIRN Project Coordination Internet 2 Functional Imaging BIRN Test-bed Human Morphometry BIRN Test-bed Mouse BIRN Test-bed BIRN Coordinating Center The BIRN-CC leads… the deployment and maintenance of a network infrastructure capable of quickly moving large amounts of data between BIRN sites across the country. the deployment and maintenance of a network infrastructure capable of quickly moving large amounts of data between BIRN sites across the country. the creation of a federation of databases pertaining to the BIRN scientific projects. the creation of a federation of databases pertaining to the BIRN scientific projects. the development and integration of software to refine, combine, compare, and analyze complex biomedical data. the development and integration of software to refine, combine, compare, and analyze complex biomedical data. and cultivates group activities to overcome cultural barriers to building a forum for collaborative research, co-authoring research papers, and sharing methods/tools/codes across institutions. and cultivates group activities to overcome cultural barriers to building a forum for collaborative research, co-authoring research papers, and sharing methods/tools/codes across institutions.

Basic Premise of BIRN If given access to larger data populations, scientists can –Investigate new scientific questions –Have a better statistical basis for testing hypothesis Working together –Improves the pace with which discoveries can be made Reduce redundant activities in labs

BIRN Forms a Virtual Data Grid Defines a Distributed Data Handling System Integrates Storage Resources in the BIRN network Integrates Access to Data, to Computational and Visualization Resources Acts as a Virtual Platform for Knowledge-based Data Integration Activities Provides a Uniform Interface to Users

Each BIRN Site Has Standard Hardware Controlled Software and Hardware configuration Software managed from the BIRN Coordinating Center OS and BIRN tool integration enabled by Rocks Cluster management Software Stack Components –Globus –Storage Resource Broker –Test bed application tools –Portal Technologies –Oracle Database –Data Mediation SW

Function BIRN: Integrated Data Query fMRI Are chronic, but not first-onset patients, associated with superior temporal gyrus dysfunction (MMN)? Integrated View Receptor Density ERP Web PubMed, Expasy Wrapper Structure Wrapper Clinical Wrapper Mediator

Function BIRN: Federated Imaging Databases Calibration, Integration from ½ dozen sites. First-ever normalization protocol for fMRI machines

Overall Goal: Develop capability to analyze and mine data acquired at multiple sites using processing and visualization tools developed at multiple sites Context: – Human Brain MR Based Morphometry Initial Applications: –Alzheimer’s, Depression, Aging Brain Participants: –BWH, MGH, Duke, UC Los Angeles, UC San Diego, Johns Hopkins, UC Irvine, Washington University Human Morphometry BIRN

Multi-site Structural MRI Data Acquisition & Calibration Methods: common acquisition protocol, distortion correction, evaluation by scanning human phantoms multiple times at all sites MGH (NMR): J. Jovicich, A. Dale, D. Greve, E. Haley BWH (SPL): S. Pieper UCI: D. Keator UCSD (fMRI): G. Brown Duke University (NIRL): J. MacFall Corrected Uncorrected Image intensity variability on same subject scanned at 4 sites Morphometry BIRN: Solving Issues in Distributed Data Acquisition Accomplishment: develop acquisition & calibration protocols that improve reproducibility, within- and across-sites

MIRIAD Project: Improving throughput Segmentation Duke BIRN-MIRIAD Item (semi-automated)(fully-automated) # of tissue classes3 (Fig1)23 (Fig2) Time for 200 brains400 hours1 hour Time for 200 lobe &250 hours all lobes (Fig3) and 27 regional analysis regions included above Improved computational capabilities 1 2 3

BIRN Portal: Launches Scientific Workflow 1. User Login In BIRN Portal, selects data and LONI settings 2. LONI Pipeline is launched from Portal 3. Results are automatically displayed in Slicer 3D

Mouse BIRN: Multiscale Data Mediation 1. Create databases at each site 2. Create conceptual links to a shared ontology 3. Situate the data in a common spatial framework 4. Use mediator to navigate and query across data sources

1)Established a data sharing infrastructure using the BIRN for multiscale investigations of animal models of human neurological disease Shared file collections using the Storage Resource Broker Developed common specimen preparation protocols Developed a set of shared analysis and visualization tools working through the BIRN portal 2)Developed a database federation as a data sharing mechanism and a persistent data archive Established independent databases at each site and populated them with mouse imaging data Mapped data to shared knowledge sources like the UMLS and atlas coordinate systems Created a virtual data federation through semantic and spatial mediation tools Accomplishments of Mouse BIRN

Purkinje neuron Registering My Data UMLS Spatial Registration

Human-Mouse Data Integration (Unanticipated New Science Questions) Query Atlas (3D Slicer) -Alex Joyner, Steve Pieper, Greg Brown, Nicole Aucoin

Key Systems Challenges Large-scale data is distributed on a National Scale –How do you easily locate what you want? –How do you translate it to what your SW tools understand? –Where do you analyze it? –How do you move it efficiently? –How do you secure it to properly limit and log access? The underlying software systems are complex –How effectively can this complexity be hidden? Software technology continually evolves and BIRN must adapt Goal: provide a systems “cookie-cutter” for adding new, secured, resources to form a federation

BIRN CC A View on BIRN Federated Data MRI Images Mouse DB-B EM Images Access Mouse DB-D Histology Access Mouse DB-C 2 Ph. Img Access Mouse DB-A EM Images ? Give me an index of all DAT- KO Striatum Images Federated data may be in a variety of representations databases image files simulation files flat text files

Key Software Systems Being Deployed Rocks Cluster Mgmt – BIRN Certificate Authority - MyProxy Globus Storage Resource Broker – Oracle Data Mediator – being developed by BIRN-CC ½ dozen specific applications Netscout Monitoring – Commercial tooling BIRN Portal BIRN

What have we learned Top-down –Works because of committed collaborators –Application drivers are critical to keeping focus Grid is deployed and used even when all SW was not available. –Hands on experience has taught us a great deal –A large fraction of grid software is still “fragile” Software packaging and availability is critical to making things practical Integration of networked resources and people have enabled new ways of doing research

Key Observations Computer scientists have to learn some new language to better understand needs Grids are new to scientists and it is natural for them to be skeptical Data sharing policy issues are quite troublesome –No uniform policy across institutions on how, but NIH has declared that all data taken with public (tax) money will eventually be public –Tracking use of human data is important Removing identifiers (like facial features in a full- skull MRI) is essential.

One Final Thought I have been involved in 4 large-scale scientific collaborations –BIRN –GEON (GeoSciences Network) –OptIPuter –Teragrid In all cases it has taken at least 18 months for the large projects to make their first significant steps as a group. –Is there something fundamental about large group creation for distributed projects that limits how quickly new results can be obtained? –Is there a way to shorten this “spin-up” time?