DATA-CENTRIC COMPUTING, SCIENCE GATEWAYS, AND THE TERAGRID Kurt A. Seiffert April 2008.

Slides:



Advertisements
Similar presentations
Data Publishing Service Indiana University Stacy Kowalczyk April 9, 2010.
Advertisements

Xsede eXtreme Science and Engineering Discovery Environment Ron Perrott University of Oxford 1.
1 US activities and strategy :NSF Ron Perrott. 2 TeraGrid An instrument that delivers high-end IT resources/services –a computational facility – over.
Data Gateways for Scientific Communities Birds of a Feather (BoF) Tuesday, June 10, 2008 Craig Stewart (Indiana University) Chris Jordan.
April 2009 OSG Grid School - RDU 1 Open Science Grid John McGee – Renaissance Computing Institute University of North Carolina, Chapel.
© The Trustees of Indiana University Centralize Research Computing to Drive Innovation…Really Thomas J. Hacker Research & Academic Computing University.
SALSASALSASALSASALSA Digital Science Center June 25, 2010, IIT Geoffrey Fox Judy Qiu School.
An Introduction to the Open Science Data Cloud Heidi Alvarez Florida International University Robert L. Grossman University of Chicago Open Cloud Consortium.
Core Services I & II David Hart Area Director, UFP/CS TeraGrid Quarterly Meeting December 2008.
Research & Academic IU Bradley C. Wheeler Associate Vice President & Dean Office of the VP for Information Technology & CIO
TeraGrid Gateway User Concept – Supporting Users V. E. Lynch, M. L. Chen, J. W. Cobb, J. A. Kohl, S. D. Miller, S. S. Vazhkudai Oak Ridge National Laboratory.
Big Red, the Data Capacitor, and the future (clouds) Craig A. Stewart 2 March 2008.
The Creation of a Big Data Analysis Environment for Undergraduates in SUNY Presented by Jim Greenberg SUNY Oneonta on behalf of the SUNY wide team.
The TeraGrid: An essential tool for 21st century science Craig Stewart, Associate Dean, Research Technologies Chief Operating Officer, Pervasive Technology.
FutureGrid: an experimental, high-performance grid testbed Craig Stewart Executive Director, Pervasive Technology Institute Indiana University
Statewide IT Conference, Bloomington IN (October 7 th, 2014) The National Center for Genome Analysis Support, IU and You! Carrie Ganote (Bioinformatics.
Research & Academic Computing Bradley C. Wheeler Associate Vice President & Dean.
Open Science Grid For CI-Days Internet2: Fall Member Meeting, 2007 John McGee – OSG Engagement Manager Renaissance Computing Institute.
CI Days: Planning Your Campus Cyberinfrastructure Strategy Russ Hobby, Internet2 Internet2 Member Meeting 9 October 2007.
Partnerships and Broadening Participation Dr. Nathaniel G. Pitts Director, Office of Integrative Activities May 18, 2004 Center.
The TeraGrid David Hart Indiana University AAAS’09, FEBRUARY 13, 2009.
PolarGrid Geoffrey Fox (PI) Indiana University Associate Dean for Graduate Studies and Research, School of Informatics and Computing, Indiana University.
PCGRID ‘08 Workshop, Miami, FL April 18, 2008 Preston Smith Implementing an Industrial-Strength Academic Cyberinfrastructure at Purdue University.
Implementation and experience with Big Red (a 30.7 TFLOPS IBM BladeCenter cluster), the Data Capacitor, and HPSS Craig A. Stewart 13 November.
Big Red II & Supporting Infrastructure Craig A. Stewart, Matthew R. Link, David Y Hancock Presented at IUPUI Faculty Council Information Technology Subcommittee.
August 2007 Advancing Scientific Discovery through TeraGrid Adapted from S. Lathrop’s talk in SC’07
SAN DIEGO SUPERCOMPUTER CENTER NUCRI Advisory Board Meeting November 9, 2006 Science Gateways on the TeraGrid Nancy Wilkins-Diehr TeraGrid Area Director.
TeraGrid Overview Cyberinfrastructure Days Internet2 10/9/07 Mark Sheddon Resource Provider Principal Investigator San Diego Supercomputer Center
BAAC – Strategic Needs of the MU Libraries 1. 2 Outline  Overview of the MU Libraries  Current Status of the Libraries  The Case for Library Support.
Open Science Grid For CI-Days Elizabeth City State University Jan-2008 John McGee – OSG Engagement Manager Manager, Cyberinfrastructure.
RNA-Seq 2013, Boston MA, 6/20/2013 Optimizing the National Cyberinfrastructure for Lower Bioinformatic Costs: Making the Most of Resources for Publicly.
Implementation and experience with Big Red (a 30.7 TFLOPS IBM BladeCenter cluster), the Data Capacitor, and HPSS Craig A. Stewart 1 November.
What is Cyberinfrastructure? Russ Hobby, Internet2 Clemson University CI Days 20 May 2008.
Research and Educational Networking and Cyberinfrastructure Russ Hobby, Internet2 Dan Updegrove, NLR University of Kentucky CI Days 22 February 2010.
Implementing a Data Publishing Service via DSpace Jon W. Dunn, Randall Floyd, Garett Montanez, Kurt Seiffert May 20, 2009.
July 18, 2012 Campus Bridging Security Challenges from “Panel: Security for Science Gateways and Campus Bridging”
Future Grid Future Grid All Hands Meeting Introduction Indianapolis October Geoffrey Fox
Pti.iu.edu /jetstream Award # funded by the National Science Foundation Award #ACI Jetstream Overview – XSEDE ’15 Panel - New and emerging.
Research Computing Archived Presentation Title:Indiana Economic Development From Indiana Economic Development Corporation to Indiana and Purdue.
National Center for Supercomputing Applications Barbara S. Minsker, Ph.D. Associate Professor National Center for Supercomputing Applications and Department.
February 27, 2007 University Information Technology Services Research Computing Craig A. Stewart Associate Vice President, Research Computing Chief Operating.
Cyberinfrastructure What is it? Russ Hobby Internet2 Joint Techs, 18 July 2007.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
UITS Research Technologies – Services Available to Regenstrief Institute 13 Oct 2015 Craig Stewart ORCID ID Executive Director, Indiana.
Recent key achievements in research computing at IU Craig Stewart Associate Vice President, Research & Academic Computing Chief Operating Officer, Pervasive.
TeraGrid Quarterly Meeting Arlington, VA Sep 6-7, 2007 NCSA RP Status Report.
Award # funded by the National Science Foundation Award #ACI Jetstream: A Distributed Cloud Infrastructure for.
Jetstream: A new national research and education cloud Jeremy Fischer ORCID Senior Technical Advisor, Collaboration.
1 NSF/TeraGrid Science Advisory Board Meeting July 19-20, San Diego, CA Brief TeraGrid Overview and Expectations of Science Advisory Board John Towns TeraGrid.
TeraGrid Gateway User Concept – Supporting Users V. E. Lynch, M. L. Chen, J. W. Cobb, J. A. Kohl, S. D. Miller, S. S. Vazhkudai Oak Ridge National Laboratory.
Pti.iu.edu/sc14 The National Center for Genome Analysis Support Supercomputing 2014 November 17-21, 2014.
Sergiu April 2006June 2006 Overview of TeraGrid Resources and Services Sergiu Sanielevici, TeraGrid Area Director for User.
Cyberinfrastructure Overview Russ Hobby, Internet2 ECSU CI Days 4 January 2008.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
Bio-IT World Conference and Expo ‘12, April 25, 2012 A Nation-Wide Area Networked File System for Very Large Scientific Data William K. Barnett, Ph.D.
SALSASALSASALSASALSA Digital Science Center February 12, 2010, Bloomington Geoffrey Fox Judy Qiu
NICS Update Bruce Loftis 16 December National Institute for Computational Sciences University of Tennessee and ORNL partnership  NICS is the 2.
IU Site Update TeraGrid Round Table Craig Stewart, Stephen Simms, Kurt Seiffert November 4, 2010.
National Science Foundation Blue Ribbon Panel on Cyberinfrastructure Summary for the OSCER Symposium 13 September 2002.
OKLAHOMA Supercomputing Symposium 2011 University of Oklahoma October 11, 2011 James Wicksted, RII Project Director Associate Director, Oklahoma EPSCoR.
All Hands Meeting 2005 BIRN-CC: Building, Maintaining and Maturing a National Information Infrastructure to Enable and Advance Biomedical Research.
TG ’08, June 9-13, State of TeraGrid John Towns Co-Chair, TeraGrid Forum Director, Persistent Infrastructure National Center for Supercomputing.
1 Open Science Grid: Project Statement & Vision Transform compute and data intensive science through a cross- domain self-managed national distributed.
SIU Information Technology April 28, 2015 Research Computing and Cyberinfrastructure.
Creating Grid Resources for Undergraduate Coursework John N. Huffman Brown University Richard Repasky Indiana University Joseph Rinkovsky Indiana University.
TeraGrid’s Process for Meeting User Needs. Jay Boisseau, Texas Advanced Computing Center Dennis Gannon, Indiana University Ralph Roskies, University of.
INTRODUCTION TO XSEDE. INTRODUCTION  Extreme Science and Engineering Discovery Environment (XSEDE)  “most advanced, powerful, and robust collection.
Deploying Regional Grids Creates Interaction, Ideas, and Integration
Dr. Craig A. Stewart Orcid ID:
PolarGrid and FutureGrid
Presentation transcript:

DATA-CENTRIC COMPUTING, SCIENCE GATEWAYS, AND THE TERAGRID Kurt A. Seiffert April 2008

Outline Presentation What is the TeraGrid Indiana University’s data-centric computing focus –HPSS –Lustre –Data collections Science Gateways Bringing it all together

What is the TeraGrid? An instrument (cyberinfrastructure) that delivers high-end IT resources - storage, computation, visualization, and data/service hosting - almost all of which are UNIX-based under the covers; some hidden by Web interfaces –A data storage and management facility: over 20 Petabytes of storage (disk and tape), over 100 scientific data collections –A computational facility - over 750 TFLOPS in parallel computing systems and growing –(Sometimes) an intuitive way to do very complex tasks, via Science Gateways, or get data via data services A service: help desk and consulting, Advanced Support for TeraGrid Applications (ASTA), education and training events and resources The largest individual cyberinfrastructure facility funded by the NSF, which supports the national science and engineering research community Allocated via peer review (and without double jeopardy)

TeraGrid: 11 Resource Partners, 1 Instrument October 4, 2015

HPSS Configuration IUB Subsystem IUPUI Subsystem Research Network Bloomington Users Indianapolis Users HPSS Movers HPSS Movers Research Network TCP/IP Wide Area Network FC SAN IUB Campus Network IUPUI Campus Network Disk ArraysTape LibraryDisk ArraysTape Library HPSS Core Servers

What’s A Data Capacitor Really? 12 pairs Dell PowerEdge 2950 –2 x 3.0 GHz Dual Core Xeon –Myrinet 10G Ethernet –Dual port Qlogic 2432 HBA (4 x FC) –2.6 Kernel (RHEL 4) 6 DDN S2A9550 Controllers –Over 2.4 GB/sec measured throughput each –535 Terabytes of spinning SATA disk

Bandwidth Challenge Annual Event at SC Conference in November –This year’s venue - Reno, Nevada This Year’s Theme - “Serving as a Model” –Can others do what you’re doing? Criteria for Judging –Did you fill a single 10 Gigabit connection? –How are you supporting science? –Did you use your production network?

The Challenge: Five Applications Simultaneously Acquisition and Visualization –Live Instrument Data Chemistry –Rare Archival Material Humanities Acquisition, Analysis, and Visualization –Trace Data Computer Science –Simulation Data Life Science High Energy Physics

Bandwidth Challenge Configuration

Digitization of “SarvamoolaGranthas” SarvamoolaGranthas – teachings of ShriMadhvacharya ( ) a great Indian Philosopher, proponent of Dvaita Philosophy SarvamoolaGranthas is a collection of works with commentaries on various important scriptures such Vedas, Upanishads, Itihasas, Puranas, Tantras and Prakaranas All of the original manuscripts of the Sarvamoolagranthas were incised on palm leaves Mathas or Monasteries –Keepers of Palm Leaf Manuscripts Shri Madhvacharya

Digitization of “Sarvamoola Granthas” Post processed images of the palm leaves Sample images of the palm leaf of Sarvamoola granthas illustrating the performance of the image processing algorithms. (a) Stitched 8 bit grayscale image without normalization and contrast enhancement, (b) Final image after contrast enhancement

MutDB (

Science Gateways A Science Gateway is a domain-specific computing environment, typically accessed via the Web, that provides a scientific community with end-to-end support for a particular scientific workflow Science Gateways are distinguished from Web portals ( in that portals “present information from diverse sources in a unified way.” Hides complexity (pay no attention to the grid behind the curtain…)

LEAD ( October 4, 2015

LEAD (portal.leadproject.org) Simple enough an undergraduate can use it! National Center for Supercomputing Applications (NCSA) and IU teamed up to support WxChallenge weather forecast competition. 64 teams, 1000 students, ~16,000 CPU hours on Big Red

Purdue’s NanoHUB (

But you don’t care - TeraGrid Architecture Compute Service Viz Service Data Service Network, Accounting, … RP 1 RP 3 RP 2 TeraGrid Infrastructure (Accounting, Network, Authorization,…) Science Gateways User Portal

Acknowledgements IU’s involvement as a TeraGrid Resource Partner is supported in part by the National Science Foundation under Grants No. ACI l, OCI , OCI , and OCI The IU Data Capacitor is supported in part by the National Science Foundation under Grant No. CNS The Grid Infrastructure Group management of the TeraGrid, and Dane Skow's leadership thereof, is funded by NSF grant Purdue’s involvement as a TeraGrid Resource Partner is supported in part by the National Science Foundation under Grant No. OCI This research was supported in part by the Pervasive Technology Labs and the Indiana METACyt Initiative. Both Indiana University initiatives are supported by the Lilly Endowment, Inc. This work was supported in part by Shared University Research grants from IBM, Inc. to Indiana University. The LEAD portal is developed under the leadership of IU Professors Dr. Dennis Gannon and Dr. Beth Plale, and supported by NSF grant Marcus Christie and SurreshMarru of the Extreme! Computing Lab contributed the LEAD graphics The ChemBioGrid Portal is developed under the leadership of IU Professor Dr. Geoffrey C. Fox and Dr. Marlon Pierce and funded via the Pervasive Technology Labs (supported by the Lilly Endowment, Inc.) and the National Institutes of Health grant P20 HG Many of the ideas presented in this talk were developed under a Fulbright Senior Scholar’s award to Stewart, funded by the US Department of State and the TechnischeUniversitaet Dresden. Any opinions, findings and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation (NSF), National Institutes of Health (NIH), Lilly Endowment, Inc., or any other funding agency. This work is made possible by the dedicated efforts of the expert staff of the Research Technologies Division of University Information Technology Services, the faculty and staff of the Pervasive Technology Labs, and the staff of UITS generally. Steve Simms, Erik Cornet, Mike Lowe, Scott Tiege, Michael Grobe, and Malinda Lingwall helped with this presentation. Thanks to the faculty and staff with whom we collaborate locally at IU and globally (within the US via the TeraGrid, and internationally via collaboration with TechnischeUniversitaet Dresden)