1 BioGrids in the US: Current status and future opportunities Craig A. Stewart 15 April 2004 Director, Research and Academic Computing Director,

Slides:



Advertisements
Similar presentations
April 19, 2015 CASC Meeting 7 Sep 2011 Campus Bridging Presentation.
Advertisements

What is Cyberinfrastructure?
Bill Barnett, Bob Flynn & Anurag Shankar Pervasive Technology Institute and University Information Technology Services, Indiana University CASC. September.
Data Gateways for Scientific Communities Birds of a Feather (BoF) Tuesday, June 10, 2008 Craig Stewart (Indiana University) Chris Jordan.
ESE Einführung in Software Engineering X. CHAPTER Prof. O. Nierstrasz Wintersemester 2005 / 2006.
1 Supplemental line if need be (example: Supported by the National Science Foundation) Delete if not needed. Supporting Polar Research with National Cyberinfrastructure.
Pti.iu.edu /jetstream Award # A national science & engineering cloud funded by the National Science Foundation Award #ACI Prepared for the.
An Introduction to the Open Science Data Cloud Heidi Alvarez Florida International University Robert L. Grossman University of Chicago Open Cloud Consortium.
© Trustees of Indiana University Released under Creative Commons 3.0 unported license; license terms on last slide. Rockhopper: Penguin on Demand at Indiana.
INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 Implementing advanced IT facilities for the Indiana Genomics Initiative Craig A. Stewart
FutureGrid: an experimental, high-performance grid testbed Craig Stewart Executive Director, Pervasive Technology Institute Indiana University
Campus Bridging: What is it and why is it important? Barbara Hallock – Senior Systems Analyst, Campus Bridging and Research Infrastructure.
Statewide IT Conference, Bloomington IN (October 7 th, 2014) The National Center for Genome Analysis Support, IU and You! Carrie Ganote (Bioinformatics.
Next Generation Cyberinfrastructures for Next Generation Sequencing and Genome Science AAMC 2013 Information Technology in Academic Medicine Conference.
Information technology, collaboration, and achieving IU ’ s research goals Craig A. Stewart 13 November 2003 Director, Research and Academic.
Craig Stewart 23 July 2009 Cyberinfrastructure in research, education, and workforce development.
INDIANAUNIVERSITYINDIANAUNIVERSITY January 2002 INGEN's advanced IT facilities Craig A. Stewart
CI Days: Planning Your Campus Cyberinfrastructure Strategy Russ Hobby, Internet2 Internet2 Member Meeting 9 October 2007.
Goodbye from Indianapolis, IUPUI, and Craig A. Stewart Executive Director, Pervasive Technology Institute Associate Dean, Research Technologies Indiana.
High Performance Computing for University Medical Research: A Successful Implementation Dr. Craig A. Stewart, Ph.D. Director, Research and.
Big Red II & Supporting Infrastructure Craig A. Stewart, Matthew R. Link, David Y Hancock Presented at IUPUI Faculty Council Information Technology Subcommittee.
I-Light: A Network for Collaboration between Indiana University and Purdue University Craig Stewart Associate Vice President Gary Bertoline Associate Vice.
Genomics, Transcriptomics, and Proteomics: Engaging Biologists Richard LeDuc Manager, NCGAS eScience, Chicago 10/8/2012.
The National Center for Genome Analysis Support as a Model Virtual Resource for Biologists Internet2 Network Infrastructure for the Life Sciences Focused.
Leveraging the National Cyberinfrastructure for Top Down Mass Spectrometry Richard LeDuc.
XSEDE12 Closing Remarks Craig Stewart XSEDE12 General Chair Executive Director, Indiana University Pervasive Technology Institute.
September 6, 2013 A HUBzero Extension for Automated Tagging Jim Mullen Advanced Biomedical IT Core Indiana University.
© Trustees of Indiana University Released under Creative Commons 3.0 unported license; license terms on last slide. The IQ-Table & Collection Viewer A.
RNA-Seq 2013, Boston MA, 6/20/2013 Optimizing the National Cyberinfrastructure for Lower Bioinformatic Costs: Making the Most of Resources for Publicly.
What is Cyberinfrastructure? Russ Hobby, Internet2 Clemson University CI Days 20 May 2008.
Pti.iu.edu /jetstream Award # funded by the National Science Foundation Award #ACI Jetstream - A self-provisioned, scalable science and.
July 18, 2012 Campus Bridging Security Challenges from “Panel: Security for Science Gateways and Campus Bridging”
©2013 Core Knowledge Foundation. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.
Making Campus Cyberinfrastructure Work for Your Campus Guy Almes Patrick Dreher Craig Stewart Dir. Academy for Dir. Advanced Computing Associate Dean Advanced.
© 2015 Core Knowledge Foundation. This work is licensed under a Creative Commons Attribution- NonCommercial-ShareAlike 3.0 Unported License.
Pti.iu.edu /jetstream Award # funded by the National Science Foundation Award #ACI Jetstream Overview – XSEDE ’15 Panel - New and emerging.
INDIANAUNIVERSITYINDIANAUNIVERSITY 1 Parallel implementation and performance of fastDNAml - a program for maximum likelihood phylogenetic inference Craig.
Using Prior Knowledge to Improve Scoring in High-Throughput Top-Down Proteomics Experiments Rich LeDuc Le-Shin Wu.
Research Computing Archived Presentation Title:Indiana Economic Development From Indiana Economic Development Corporation to Indiana and Purdue.
INDIANAUNIVERSITYINDIANAUNIVERSITY Spring 2000 Indiana University Information Technology University Information Technology Services Please cite as: Stewart,
November 18, 2015 Quarterly Meeting 30Aug2011 – 1Sep2011 Campus Bridging Presentation.
February 27, 2007 University Information Technology Services Research Computing Craig A. Stewart Associate Vice President, Research Computing Chief Operating.
Cyberinfrastructure What is it? Russ Hobby Internet2 Joint Techs, 18 July 2007.
Clinical Research Informatics at the University of Michigan Daniel Clauw M.D. Professor of Medicine, Division of Rheumatology Assistant Dean for Clinical.
1 Global Analysis of Arthropod Evolution – a successful grid project Craig A. Stewart, Rainer Keller, Matthias Hess, Uwe Woessner, Martin Aumüller, Matthias.
Leveraging the InCommon Federation to access the NSF TeraGrid Jim Basney Senior Research Scientist National Center for Supercomputing Applications University.
UITS Research Technologies – Services Available to Regenstrief Institute 13 Oct 2015 Craig Stewart ORCID ID Executive Director, Indiana.
A national science & engineering cloud funded by the National Science Foundation Award #ACI Craig Stewart ORCID ID Jetstream.
Recent key achievements in research computing at IU Craig Stewart Associate Vice President, Research & Academic Computing Chief Operating Officer, Pervasive.
© Trustees of Indiana University Released under Creative Commons 3.0 unported license; license terms on last slide. Update on EAGER: Best Practices and.
Award # funded by the National Science Foundation Award #ACI Jetstream: A Distributed Cloud Infrastructure for.
Jetstream: A new national research and education cloud Jeremy Fischer ORCID Senior Technical Advisor, Collaboration.
A national science & engineering cloud funded by the National Science Foundation Award #ACI Craig Stewart ORCID ID Jetstream.
1 A national science & engineering cloud funded by the National Science Foundation Award #ACI Craig Stewart ORCID ID Jetstream.
Cyberinfrastructure Overview Russ Hobby, Internet2 ECSU CI Days 4 January 2008.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
© Trustees of Indiana University Released under Creative Commons 3.0 unported license; license terms on last slide. Informatics Tools at the Indiana CTSI.
EngageNY.org ©2012 Core Knowledge Foundation. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.
Jetstream Overview Jetstream: A national research and education cloud Jeremy Fischer ORCID Senior Technical Advisor,
PAUL STACEY Except where otherwise noted these materials are licensed under a Creative Commons Attribution 3.0 (CC BY)CC BY Open Licensing Requirements.
© 2015 Core Knowledge Foundation. This work is licensed under a Creative Commons Attribution- NonCommercial-ShareAlike 3.0 Unported License.
1 Campus Bridging: What is it and why is it important? Barbara Hallock – Senior Systems Analyst, Campus Bridging and Research Infrastructure.
Jetstream: A national research and education cloud Jeremy Fischer ORCID Senior Technical Advisor, Collaboration and.
Research & Academic Computing Indiana University Statewide IT Conference 11 September 2003 Indianapolis IN.
Semantic Web - caBIG Abstract: 21st century biomedical research is driven by massive amounts of data: automated technologies generate hundreds of.
Matt Link Associate Vice President (Acting) Director, Systems
Methodology Overview 2 basics in user studies Lecture /slide deck produced by Saul Greenberg, University of Calgary, Canada Notice: some material in this.
Research and Academic Computing Division
Elliptic Partial Differential Equations – Direct Method
Presentation transcript:

1 BioGrids in the US: Current status and future opportunities Craig A. Stewart 15 April 2004 Director, Research and Academic Computing Director, Information Technology Core, Indiana Genomics Initiative

License Terms Please cite this presentation as: Stewart, C.A. BioGrids in the US: Current status and future opportunities Presentation. Presented at: International School on Physics and Industry workshop on Particle Accelerators and Detectors: from Physics to Medicine (Ettore Majorana Foundation and Center for Scientific Culture, Erice, Italy, 15 Apr 2005). Available from: Portions of this document that originated from sources outside IU are shown here and used by permission or under licenses indicated within this document. Items indicated with a © are under copyright and used here with permission. Such items may not be reused without permission from the holder of copyright except where license terms noted on a slide permit reuse. Except where otherwise noted, the contents of this presentation are copyright 2004 by the Trustees of Indiana University. This content is released under the Creative Commons Attribution 3.0 Unported license ( This license includes the following terms: You are free to share – to copy, distribute and transmit the work and to remix – to adapt the work under the following conditions: attribution – you must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work). For any reuse or distribution, you must make clear to others the license terms of this work.

3 What is a grid? A grid is a system including computational resources data storage resources visualization resources specialized instruments tied together by high-performance networks Why grids? –Transcend limits of location –Use resources that would otherwise not be accessible –To do things that would otherwise not be possible

4 Types of grids By area of focus: Collaboration grid Computational grid –Supercomputer grids –Cycle scavenging Data grids Hybrid grids Not included as part of this classification system: openness of software or organizational structure

5 Computational Grid: TeraGrid US key national grid effort Based on Globus infrastructure Attempts to solve grid technology challenges in a very general fashion Currently 9 sites Little application thus far specifically in the area of biology Construction project: first we build it, then…

6 Special purpose Computational Grid: IU/HLRS 2003 HPC Challenge Global analysis of Arthropod evolution One application: fastDNAml 8 types of systems; 641 processors; 6 continents 200 trees analyzed

7 Cycle scavenging Computational Grids Folding at home ( group/pandegroup/ folding/) Fight AIDS at home (fightaidsathome. scripps.edu/) ( research.net/) pandegroup/folding/results.html

8 Data grids in biology Research data grids –Centralized Life Sciences Data Service –Teragrid ( Research and clinical data grids –SPIN (Shared Pathology Informatics Network) –Central Indiana Hospitals

9 Centralized Life Sciences Data Service at Indiana University Goal: transparent and integrated access to multiple data sources Federated database approach focuses on establishing glue between existing databases “ Private ” databases stay where they are – under local control “ Public ” databases may be replicated locally for performance Queries are entered as standard SQL

10 NR EST Swiss prot BLAST Data sources BLAST engine CLSD Engine (IBM II) LIGAN D BIND ENZY ME dbSN P Public data sources MS SQL Server IUSM workgroup databases Custom Web Applicati on Portal

11 CLSD: Finding Genes Queries multiple databases, linking expression data (local and remote) and location data Built by research lab in IUSM Portal, built with CLSD as a grid back end Hereditary Diseases and Family Studies Division, Dept. of Medical and Molecular Genetics, IU School of Medicine. Supported in part by NIH R01 NS37167.

12 Understanding Microarray Data The Microarray Data Portal was created by the Center for Medical Genomics at IU School of Medicine. Supported in part by the 21st Century Research & Technology Fund and the Indiana Genomics Initiative. The Indiana Genomics Initiative is supported in part by a grant from the Lilly Foundation, Inc.

13 Clinical data grids in Indiana SPIN (Shared Pathology Informatics Network) –Distributed database of anonymized data about pathology specimens provides –Data in compliance with US privacy regulations –SPIN software runs at participating institutions Regenstrief Institute –From data vaults to data grids –Hundreds of millions of patient records –Clinical service grid serving central Indiana hospitals

14 Semantic requirements for BioData Grids Interoperability of nomenclature and metadata a critical challenge! “ A biologist would rather use another biologist ’ s toothbrush than another biologist ’ s terminology ” – Thomas Kaufman Consistent semantics are required! Example projects: –GO: Gene Ontology –SBML: Systems Biology Markup Language –MAGE-ML: MicroArray Gene Expression Markup Language –SNOMED – CT: SNOMED Clinical Terms

15 Hybrid Grids SCrAPS –Advanced Photon Source at Argonne National Laboratories –“ Better than being there ” functionality –Real time integration of remote instruments, collaboration, computation, and visualization –Near real time data movement BIRN –Key NIH funded biogrid –Includes data, computation, visualization Encyclopedia of Life eDiamond

16 Where are we today? By area of focus: Collaboration grid Computational grid –Supercomputer grids –Cycle scavenging Data grids Hybrid grids By status: Very general construction projects Handcrafted grid solutions Special projects (heroic efforts involved) Ongoing production services

17 Looking Ahead Access to computing power via grids is still largely experimental Access to data via grids has transformed biomedical research and is transforming clinical practice Access to instruments is still experimental Great opportunities to advance biomedical research through use of grids Biology is different –Data is always collected somewhere –Affinities between grid structure and future software structure Sometimes grids are not the answer

18 Acknowledgments This research was supported in part by the Indiana Genomics Initiative. The Indiana Genomics Initiative of Indiana University is supported in part by Lilly Endowment Inc. This work was supported in part by Shared University Research grants from IBM, Inc. to Indiana University, and in particular by IU’s relationship with IBM as an IBM Life Sciences Institute of Innovation. This material is based upon work supported by the National Science Foundation under Grant No and Grant No. CDA Any opinions, findings and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation (NSF). UITS staff: Mary Papakhian, Stephen Simms, Richard Repasky, Matt Link, John Samuel, Eric Wernert, Anurag Shankar, Andrew Arenson, John Herrin, Malinda Lingwall, W. Les Teach

19 Thank you! Further information available at: