INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 Implementing advanced IT facilities for the Indiana Genomics Initiative Craig A. Stewart

Slides:



Advertisements
Similar presentations
Data Publishing Service Indiana University Stacy Kowalczyk April 9, 2010.
Advertisements

Pervasive Technology Institute overview PTI RT All Hands Meeting 22 May 2009 Craig Stewart Executive Director, PTI; Associate Dean, Research Technologies.
April 19, 2015 CASC Meeting 7 Sep 2011 Campus Bridging Presentation.
What is Cyberinfrastructure?
Bill Barnett, Bob Flynn & Anurag Shankar Pervasive Technology Institute and University Information Technology Services, Indiana University CASC. September.
Data Gateways for Scientific Communities Birds of a Feather (BoF) Tuesday, June 10, 2008 Craig Stewart (Indiana University) Chris Jordan.
Status of IU’s E10000 and recent activities with Sun Sun NDA Visit June Craig Stewart, Ph.D. Please cite as: Stewart, C.A. Status.
Bindley Bioscience Center Vision: Nurture interactive communication and interdisciplinary discovery with flexible laboratory project spaces and an open.
1 Supplemental line if need be (example: Supported by the National Science Foundation) Delete if not needed. Supporting Polar Research with National Cyberinfrastructure.
© The Trustees of Indiana University Centralize Research Computing to Drive Innovation…Really Thomas J. Hacker Research & Academic Computing University.
INDIANAUNIVERSITYINDIANAUNIVERSITY 1 Getting More for Less: A Software Distribution Model John V. Samuel, Craig A. Stewart, and Kevin J. Wilhite University.
Pti.iu.edu /jetstream Award # A national science & engineering cloud funded by the National Science Foundation Award #ACI Prepared for the.
© Trustees of Indiana University Released under Creative Commons 3.0 unported license; license terms on last slide. Rockhopper: Penguin on Demand at Indiana.
1 IBM – IU Recent Research Activities and Collaboration Opportunities Craig Stewart Associate Vice President for Research & Academic Computing (Acting)
Research & Academic IU Bradley C. Wheeler Associate Vice President & Dean Office of the VP for Information Technology & CIO
Computational Biology: Data, computation, and visualization Dr. Craig A. Stewart & Dr. Eric Wernert 7 August 2003.
FutureGrid: an experimental, high-performance grid testbed Craig Stewart Executive Director, Pervasive Technology Institute Indiana University
Campus Bridging: What is it and why is it important? Barbara Hallock – Senior Systems Analyst, Campus Bridging and Research Infrastructure.
Statewide IT Conference, Bloomington IN (October 7 th, 2014) The National Center for Genome Analysis Support, IU and You! Carrie Ganote (Bioinformatics.
Delivering a New Desktop and Application Deployment Strategy Indiana University and the New Emerging Personal Computing Model Duane Schau
Next Generation Cyberinfrastructures for Next Generation Sequencing and Genome Science AAMC 2013 Information Technology in Academic Medicine Conference.
Research & Academic Computing Bradley C. Wheeler Associate Vice President & Dean.
Information technology, collaboration, and achieving IU ’ s research goals Craig A. Stewart 13 November 2003 Director, Research and Academic.
Craig Stewart 23 July 2009 Cyberinfrastructure in research, education, and workforce development.
INDIANAUNIVERSITYINDIANAUNIVERSITY January 2002 INGEN's advanced IT facilities Craig A. Stewart
Barbara Sims, Co-Director National SISEP Center FPG Child Development Center University of North Carolina at Chapel Hill Greensboro.NC March 20, 2013 Implementation.
PCGRID ‘08 Workshop, Miami, FL April 18, 2008 Preston Smith Implementing an Industrial-Strength Academic Cyberinfrastructure at Purdue University.
Goodbye from Indianapolis, IUPUI, and Craig A. Stewart Executive Director, Pervasive Technology Institute Associate Dean, Research Technologies Indiana.
High Performance Computing for University Medical Research: A Successful Implementation Dr. Craig A. Stewart, Ph.D. Director, Research and.
Big Red II & Supporting Infrastructure Craig A. Stewart, Matthew R. Link, David Y Hancock Presented at IUPUI Faculty Council Information Technology Subcommittee.
I-Light: A Network for Collaboration between Indiana University and Purdue University Craig Stewart Associate Vice President Gary Bertoline Associate Vice.
Genomics, Transcriptomics, and Proteomics: Engaging Biologists Richard LeDuc Manager, NCGAS eScience, Chicago 10/8/2012.
The National Center for Genome Analysis Support as a Model Virtual Resource for Biologists Internet2 Network Infrastructure for the Life Sciences Focused.
Leveraging the National Cyberinfrastructure for Top Down Mass Spectrometry Richard LeDuc.
September 6, 2013 A HUBzero Extension for Automated Tagging Jim Mullen Advanced Biomedical IT Core Indiana University.
© Trustees of Indiana University Released under Creative Commons 3.0 unported license; license terms on last slide. The IQ-Table & Collection Viewer A.
The Research Computing Center Nicholas Labello
RNA-Seq 2013, Boston MA, 6/20/2013 Optimizing the National Cyberinfrastructure for Lower Bioinformatic Costs: Making the Most of Resources for Publicly.
ACCELERATING CLINICAL AND TRANSLATIONAL RESEARCH
1 BioGrids in the US: Current status and future opportunities Craig A. Stewart 15 April 2004 Director, Research and Academic Computing Director,
Pti.iu.edu /jetstream Award # funded by the National Science Foundation Award #ACI Jetstream - A self-provisioned, scalable science and.
July 18, 2012 Campus Bridging Security Challenges from “Panel: Security for Science Gateways and Campus Bridging”
©2013 Core Knowledge Foundation. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.
Get Creative: Get Connected Tippi Clayborne EDUC 7102 Walden University.
Pti.iu.edu /jetstream Award # funded by the National Science Foundation Award #ACI Jetstream Overview – XSEDE ’15 Panel - New and emerging.
INDIANAUNIVERSITYINDIANAUNIVERSITY 1 Parallel implementation and performance of fastDNAml - a program for maximum likelihood phylogenetic inference Craig.
Using Prior Knowledge to Improve Scoring in High-Throughput Top-Down Proteomics Experiments Rich LeDuc Le-Shin Wu.
Research Computing Archived Presentation Title:Indiana Economic Development From Indiana Economic Development Corporation to Indiana and Purdue.
INDIANAUNIVERSITYINDIANAUNIVERSITY Spring 2000 Indiana University Information Technology University Information Technology Services Please cite as: Stewart,
November 18, 2015 Quarterly Meeting 30Aug2011 – 1Sep2011 Campus Bridging Presentation.
February 27, 2007 University Information Technology Services Research Computing Craig A. Stewart Associate Vice President, Research Computing Chief Operating.
UITS Research Technologies – Services Available to Regenstrief Institute 13 Oct 2015 Craig Stewart ORCID ID Executive Director, Indiana.
1 Supplemental line if need be (example: Supported by the National Science Foundation) Delete if not needed. Grand Challenges Discussion 7 Oct 2015 Craig.
A national science & engineering cloud funded by the National Science Foundation Award #ACI Craig Stewart ORCID ID Jetstream.
Recent key achievements in research computing at IU Craig Stewart Associate Vice President, Research & Academic Computing Chief Operating Officer, Pervasive.
© Trustees of Indiana University Released under Creative Commons 3.0 unported license; license terms on last slide. Update on EAGER: Best Practices and.
Award # funded by the National Science Foundation Award #ACI Jetstream: A Distributed Cloud Infrastructure for.
Jetstream: A new national research and education cloud Jeremy Fischer ORCID Senior Technical Advisor, Collaboration.
Jonathan Carroll-Nellenback.
A national science & engineering cloud funded by the National Science Foundation Award #ACI Craig Stewart ORCID ID Jetstream.
1 A national science & engineering cloud funded by the National Science Foundation Award #ACI Craig Stewart ORCID ID Jetstream.
© Trustees of Indiana University Released under Creative Commons 3.0 unported license; license terms on last slide. Informatics Tools at the Indiana CTSI.
Indiana University - IBM Visit IT at IU. n Please cite as: Stewart, C.A IU. Presentation. Presented at IBM T.J. Watson Research Center, Feb.
Jetstream Overview Jetstream: A national research and education cloud Jeremy Fischer ORCID Senior Technical Advisor,
Research & Academic Computing Indiana University Statewide IT Conference 11 September 2003 Indianapolis IN.
A Path to the Community Cloud Making Above Campuses Services a Reality
Matt Link Associate Vice President (Acting) Director, Systems
funded by the National Science Foundation Award #ACI
Research and Academic Computing Division
Presentation transcript:

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 Implementing advanced IT facilities for the Indiana Genomics Initiative Craig A. Stewart meeting April 23-24, 2002, HPC User Forum meeting, Santa Fe, New Mexico

INDIANAUNIVERSITYINDIANAUNIVERSITY License terms Please cite as: Stewart, C.A. Implementing advanced IT facilities for the Indiana Genomics Initiative Presentation. Presented at: HPC User Forum (Santa Fe, New Mexico, 23 Apr 2002). Available from: Except where otherwise noted, by inclusion of a source url or some other note, the contents of this presentation are © by the Trustees of Indiana University. This content is released under the Creative Commons Attribution 3.0 Unported license ( This license includes the following terms: You are free to share – to copy, distribute and transmit the work and to remix – to adapt the work under the following conditions: attribution – you must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work). For any reuse or distribution, you must make clear to others the license terms of this work. 2

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 Indiana University ’ s Goals IT Goal: “ To be a leader in absolute terms in information technology. ” IU president Myles Brand, 1996 Goals of the Indiana Genomics Initiative: To advance understanding of life ’ s processes, develop new therapies for human diseases, improve the quality of human health in Indiana, and enhance the strength of the central Indiana high-tech economy

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 IU in a nutshell Founded in 1820 $2B Annual Budget 8 campuses Campuses well connected; esp. IUB, IUPUI, and Purdue ’ s campus at W. Lafayette connected by I-light IU Operates TransPAC, GlobalNOC

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 in a nutshell Academic programs in IT through computer science, library and information sciences, engineering and technology, and most notably through new School of Informatics CIO: Vice President Michael A. McRobbie ~$100M annual budget Technology services offered university-wide pervasivetechnologylabs

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 School of Medicine in a nutshell 2 nd largest School of Medicine in the US IU Cancer Center nationally recognized leader Regenstrief Institute longstanding leader in medical informatics National leader in optical and tomographic imaging Longstanding leader in genetically influenced diseases including Huntington ’ s (Conneally), alcoholism (Li); currently lead institution in national study of bipolar disorder

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 INGEN Created by $105 M grant from the Lilly Endowment to Indiana University Involves IU School of Medicine (IUPUI), Departments of Biology and Chemistry (IUB), Center for Genomics and Proteomics (IUB), and University Information Technology Services Comprised of “ Programs ” (central research areas) and “ Cores ” (supporting units that are also generally research areas)

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002

INDIANAUNIVERSITYINDIANAUNIVERSITY IT and INGEN INGEN ’ s IT core is a critical part of the infrastructure for the initiative as a whole –Networking (using I-light facility) –Supercomputing –Massive Data Storage –Visualization –Support IT is one of the paths by which INGEN should enhance the Indiana Economy

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 Supercomputing - Oct 17 IU/IBM announcement IU tripled the capacity of its IBM SP, to > 1 TFLOPS (a trillion mathematical operations per second). IU ’ s SP is very large when considered within the set of supercomputers owned by individual universities Large part of this acquisition made possible via funding from INGEN IU and IBM also announced a partnership in developing new supercomputer applications for the life sciences

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 Photo: Tyagan Miller. May be reused by IU for noncommercial purposes. To license for commercial use, contact the photographer

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 Sun E10000 IU is a Sun “ Center of Excellence ” and is pursuing collaborative research with Sun in the area of Chemical Informatics Photo: Tyagan Miller. May be reused by IU for noncommercial purposes. To license for commercial use, contact the photographer

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 AVIDD Analysis & Visualization of Instrument-Driven Data Large, distributed Intel-compatible Linux cluster Distributed data storage/data staging Distributed visualization Education a key component of this initiative – distributed education (IUB, IUPUI, IUN) taught via Access Grids at advanced undergrad/beginning grad level

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 Massive Data Storage IU has a large massive data storage system based on IBM and STK tape robotic systems. IU ’ s massive data storage system is based on HPSS (High Performance Storage System) which provides for excellent security. >300 TB current capacity Mirrored storage in Indianapolis and Bloomington should provide safety in data storage IU was first installation to implement remote HPSS movers over long haul networks

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 Photo: Tyagan Miller. May be reused by IU for noncommercial purposes. To license for commercial use, contact the photographer

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 Advanced Visualization UITS, IU School of Medicine, and IUPUI Computer & Information Science have already collaborated to create 3- DIVE (3-D Interactive Volume Explorer) CAVE Immersadesk IU-designed passive 3D environments (4 ’ sq screen, 5 ’ sq footprint)

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 Accomplishments & Challenges Past accomplishments –fastDNAml –3DIVE Challenges –Broader engagement with life scientists –Data heterogeneity –New application areas

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 fastDNAml

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 Building Phylogenetic Trees Goal: an objective means by which phylogenetic trees can be estimated in tolerable amounts of wall-clock time, producing phylogenetic trees with measures of their uncertainty

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 Why is tree-building a HPC problem? The number of bifurcating unrooted trees for n taxa is (2n-5)!/ {2 n-3 (n-3)!} For 100 taxa the number of possible trees is ~10 182

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 fastDNAml Developed by Gary Olsen Derived from Felsensteins ’ s PHYLIP programs One of the more commonly used ML methods The first phylogenetic software implemented in a parallel program (at Argonne National Laboratory, using P4 libraries) Olsen, G.J.,et al fastDNAml: a tool for construction of phylogenetic trees of DNA sequences using maximum likelihood. Computer Applications in Biosciences 10: MPI version available from IU now (development supported by IBM SUR grant)

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 Performance of fastDNAml

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 Current projects Data integration Gamma knife Pedigree analysis PET scan analysis Protein families AMASS – shotgun sequence assembly Data, data, data

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 HPC and life sciences HPC hardware and software market set to dramatically expand thanks to life sciences HPC and life sciences communities don ’ t share common language Biomedical researchers are no more conservative than anyone else Biomedical researchers not alone in creating bad code Both communities have lots to offer each other, but it seems at present up to the HPC community to reach out (when was the last time an astronomer saved your life?) HPC community has been slow to take advantage of opportunities offered via collaboration with life scientists This will be like the dot-com bust – sort of. The key question is: how great will be the similarities?

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 Challenges: creating collaborations with life scientists Need to challenge “ I can do it on my desktop ” mentality when appropriate Go for the low hanging fruit Remember that physics, astronomy, and other traditional HPC codes have a head start of many years Need to recognize the complexity of the life sciences

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 Current IU Really clever batch scripts…. then portals Appropriate documentation Door to door consulting Proof of concept projects Contributions to open source/community code efforts

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 Keys to success in IU Long history of openness, diversity in HPC uses Accountability and service philosophy Supercomputing time and programming support baseline services Central computing center staff hired from several disciplines (including biology) Computer scientists who actually care about applications History and a certain amount of luck

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 Summary IU has thus far been very successful in implementing advanced IT infrastructure for life scientists Reaching out has been essential to formation of partnerships Industry partnerships have been essential to success So far, so good……

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 Acknowledgements IBM research relationships & SUR grants Sun and Center of Excellence relationships Compaq relationship Computer scientists at IU (esp. Randall Bramley, Dennis Gannon, Shaoifen Fang) State of Indiana Lilly Endowment

INDIANAUNIVERSITYINDIANAUNIVERSITY April 2002 Important URLs University Information Technology Services: UITS Research & Academic Computing Division InGen IT Core: IU Teraflop SP announcement: it.iu.edu