The Virtual Grid Application Development Software (VGrADS) Project Overview Ken Kennedy Center for High Performance Software Rice University

Slides:



Advertisements
Similar presentations
-Grids and the OptIPuter Software Architecture Andrew A. Chien Director, Center for Networked Systems SAIC Chair Professor, Computer Science and Engineering.
Advertisements

DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
Technology Drivers Traditional HPC application drivers – OS noise, resource monitoring and management, memory footprint – Complexity of resources to be.
The Anatomy of the Grid: An Integrated View of Grid Architecture Carl Kesselman USC/Information Sciences Institute Ian Foster, Steve Tuecke Argonne National.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Computer Science Department 1 Load Balancing and Grid Computing David Finkel Computer Science Department Worcester Polytechnic Institute.
AppLeS, NWS and the IPG Fran Berman UCSD and NPACI Rich Wolski UCSD, U. Tenn. and NPACI This presentation will probably involve audience discussion, which.
Achieving Application Performance on the Computational Grid Francine Berman This presentation will probably involve audience discussion, which will create.
AppLeS / Network Weather Service IPG Pilot Project FY’98 Francine Berman U. C. San Diego and NPACI Rich Wolski U.C. San Diego, NPACI and U. of Tennessee.
Workload Management Massimo Sgaravatto INFN Padova.
NGNS Program Managers Richard Carlson Thomas Ndousse ASCAC meeting 11/21/2014 Next Generation Networking for Science Program Update.
Introduction to Grid Computing Ann Chervenak Carl Kesselman And the members of the Globus Team.
Annual SERC Research Review - Student Presentation, October 5-6, Extending Model Based System Engineering to Utilize 3D Virtual Environments Peter.
US NITRD LSN-MAGIC Coordinating Team – Organization and Goals Richard Carlson NGNS Program Manager, Research Division, Office of Advanced Scientific Computing.
4.x Performance Technology drivers – Exascale systems will consist of complex configurations with a huge number of potentially heterogeneous components.
Center for Research on Multicore Computing (CRMC) Overview Ken Kennedy Rice University
Overview of the Course Copyright 2003, Keith D. Cooper, Ken Kennedy & Linda Torczon, all rights reserved. Students enrolled in Comp 412 at Rice University.
Cluster Reliability Project ISIS Vanderbilt University.
EMI INFSO-RI SA2 - Quality Assurance Alberto Aimar (CERN) SA2 Leader EMI First EC Review 22 June 2011, Brussels.
Through the development of advanced middleware, Grid computing has evolved to a mature technology in which scientists and researchers can leverage to gain.
Programming Models & Runtime Systems Breakout Report MICS PI Meeting, June 27, 2002.
A Proposal of Application Failure Detection and Recovery in the Grid Marian Bubak 1,2, Tomasz Szepieniec 2, Marcin Radecki 2 1 Institute of Computer Science,
TMC BioGrid A GCC Consortium Ken Kennedy Center for High Performance Software Research (HiPerSoft) Rice University
4.2.1 Programming Models Technology drivers – Node count, scale of parallelism within the node – Heterogeneity – Complex memory hierarchies – Failure rates.
Development Timelines Ken Kennedy Andrew Chien Keith Cooper Ian Foster John Mellor-Curmmey Dan Reed.
1 Logistical Computing and Internetworking: Middleware for the Use of Storage in Communication Micah Beck Jack Dongarra Terry Moore James Plank University.
Virtual Data Grid Architecture Ewa Deelman, Ian Foster, Carl Kesselman, Miron Livny.
1 4/23/2007 Introduction to Grid computing Sunil Avutu Graduate Student Dept.of Computer Science.
Software Support for High Performance Problem Solving on the Grid An Overview of the GrADS Project Sponsored by NSF NGS Ken Kennedy Center for High Performance.
High Performance Computing on the Grid: Is It for You? With a Discussion of Help on the Way (the GrADS Project) Ken Kennedy Center for High Performance.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
The GriPhyN Planning Process All-Hands Meeting ISI 15 October 2001.
GRIDS Center Middleware Overview Sandra Redman Information Technology and Systems Center and Information Technology Research Center National Space Science.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
SDN Management Layer DESIGN REQUIREMENTS AND FUTURE DIRECTION NO OF SLIDES : 26 1.
7. Grid Computing Systems and Resource Management
Lawrence H. Landweber National Science Foundation SC2003 November 20, 2003
VgES Version 0.7 Release Overview UCSD VGrADS Team Andrew A. Chien, Henri Casanova, Yang-suk Kee, Jerry Chou, Dionysis Logothetis, Richard.
Building PetaScale Applications and Tools on the TeraGrid Workshop December 11-12, 2007 Scott Lathrop and Sergiu Sanielevici.
Report on VGrADS EOT Activities VGrADS Workshop, February 24, 2006 Michael Sirois Center for Excellence and Equity in Education (CEEE)
VGrADS Programming Tools Research: Vision and Overview Ken Kennedy Center for High Performance Software Rice University
- DAG Scheduling with Reliability - - GridSolve - - Fault Tolerance In Open MPI - Asim YarKhan, Zhiao Shi, Jack Dongarra VGrADS Workshop April 2007.
VGrADS and GridSolve Asim YarKhan Jack Dongarra, Zhiao Shi, Fengguang Song Innovative Computing Laboratory University of Tennessee VGrADS Workshop – September.
VGES Demonstrations Andrew A. Chien, Henri Casanova, Yang-suk Kee, Richard Huang, Dionysis Logothetis, and Jerry Chou CSE, SDSC, and CNS University of.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
Ian Bird, CERN WLCG Project Leader Amsterdam, 24 th January 2012.
Distributed Systems Architectures. Topics covered l Client-server architectures l Distributed object architectures l Inter-organisational computing.
VGrADS Education, Outreach and Training Activities: Unlocking the Doors of Opportunity Keith Cooper With thanks to Theresa Chatman
Workload Management Workpackage
Strategies for NIS Development
Clouds , Grids and Clusters
Education, Outreach, and Training
Center for High Performance Software
EMAN, Scheduling, Performance Prediction, and Virtual Grids
LEAD-VGrADS Day 1 Notes.
Center for High Performance Software
Resource Characterization
FET Plans FET - Proactive 1.
VGrADS Execution System or “the Virtual Grid”
Abstract Machine Layer Research in VGrADS
VGrADS EOT (Education, Outreach and Training) Plan
Theresa Chatman, (presenter), Michael Sirois Richard Tapia
Overview – Guide to Developing Safety Improvement Plan
Overview of the Course Copyright 2003, Keith D. Cooper, Ken Kennedy & Linda Torczon, all rights reserved. Students enrolled in Comp 412 at Rice University.
Connecting the European Grid Infrastructure to Research Communities
Overview – Guide to Developing Safety Improvement Plan
H2020 SCC Smart Cities & Communities Info Day Horizon 2020
R, Zhang, A. Chien, A. Mandal, C. Koelbel,
From Use Cases to Implementation
Presentation transcript:

The Virtual Grid Application Development Software (VGrADS) Project Overview Ken Kennedy Center for High Performance Software Rice University

Supplementary Materials Packets —Agenda —Participant bios and pictures —Slides —Poster abstracts —Annual report for 2005 Web Site —Linked agenda —Full participant bios —Electronic (full-page) slides —Publications —Annual reports —PDFs of Posters

Graphical Agenda Overview Kennedy Virtual Grid Chien, Casanova Programming Tools Kennedy Driving Applications Koelbel, Reed EOT Cooper Selected Topics Dongarra, Wolski Grad Student Posters Progressive Revelation

The VGrADS Team VGrADS is an NSF-funded Information Technology Research project Keith Cooper Ken Kennedy Charles Koelbel Richard Tapia Linda Torczon Rich Wolski Fran Berman Andrew Chien Henri Casanova Carl Kesselman Lennart Johnsson Dan Reed Jack Dongarra Plus many graduate students, postdocs, and technical staff!

VGrADS Vision: National Distributed Problem Solving Where We Want To Be —Transparent Grid computing –Submit job –Find & schedule resources –Execute efficiently Where We Are —Low-level hand development What Do We Need? —A more abstract view of the Grid –Each developer sees a specialized “virtual grid” —Simplified programming models built on the abstract view –Permit the application developer to focus on the problem Database Supercomputer

Config- urable Object Program Execution Environment Program Preparation System Performance Feedback Whole- Program Compiler Libraries Source Appli- cation Software Components Binder Performance Problem Real-time Performance Monitor Resource Negotiator Scheduler Grid Runtime System Negotiation The Original GrADS Vision

Lessons from GrADS Mapping and Scheduling for MPI Jobs is Hard –Although we were able to do some interesting experiments Performance Model Construction is Hard —Hybrid static/dynamic schemes are best —Difficult for application developers to do by hand Heterogeneity is Hard —We completely revised the launching mechanisms to support this —Good scheduling is critical Rescheduling/Migration is Hard —Requires application collaboration (generalized checkpointing) —Requires performance modeling to determine profitability Scaling to Large Grids is Hard —Scheduling becomes expensive

VGrADS Virtual Grid Hierarchy

VGrADS Project Overview Virtual Grid Abstraction and Virtual Grid Execution System (vgES) —Application-level resource abstraction separates concerns —Permits true scalability and control of resources —Virtual Grid Execution System enables simple resource management –Supports fault tolerance, reasoning about app behavior, rescheduling Tools for Application Development —Abstract programming interfaces —Easy application scheduling, launch, and management –Workflow graphs and tightly-coupled computations Support for Fault Tolerance and Rescheduling/Migration —Collaboration between application and vgES Research Driven by Real Application Needs —EMAN, LEAD, GridSAT, EOL

Architecture: vgES implements Virtual Grid Application vgES APIs vgMON vgDL Information Services Resource Managers vgLAUNCH vgFAB VG DVCW vgAgent Grid Resources

Virtual Grid Approach Separation of Concerns —Application Planning and Management —Complex Grid Resource Environment Mgmt => vgDL and Virtual Grid Scalable Selection and Binding —Large Resource Pools —Competitive, Dynamic Environments => Finding and Binding Application-Driven Resource Management —Application-Level Abstraction —Grid Information => Virtual Grid Explicit Resource Abstraction

Programming Tools Focus: Automating critical application-development steps: Initiating and managing application execution —Optimize and launch application on heterogeneous resources —Support for fault tolerance and rescheduling/migration Scheduling application workflows —Whole-workflow scheduling using performance models Constructing performance models —Automatically from application binaries –Cross-platform modeling Building workflow graphs from high-level scripts —Examples: Python (EMAN), OGRE (LEAD), Matlab

VGrADS Applications Philosophy: Computer Science Research Driven by Applications EMAN —Contruction of 3-D models from 2-D electron micrographs –Talk by Charles Koelbel, Posters by Anirban Mandal and Bo Liu and by Gabriel Marin LEAD —Collection of applications for real-time weather prediction, sponsored by NSF ITR Large –Talk by Dan Reed, Poster by Emma Buneci GridSAT —Satisfiability on the Grid EOL (Encyclopedia of Life) —Reduced emphasis due to software issues

VGrADS Education Graduate Education —Graduate student support: 23 students –Rice: 7; UCSB: 1; UCSD: 5; UH: 2; UNC: 4; USC: 1; UTK: 3 —Cross-institution graduate student exchange —Grid courses –UCSD (Chien) and UCSB (Wolski) Undergraduate Education —Internet and Grid Architectures Course –Under development Underrepresented Groups —Support for AGEP program —Activities at Tapia and Hopper Conferences —CS-CAMP

Outreach Web Site – Conferences and Talks —CCGrid and HPDC papers —Invited addresses Application Collaborations Participation at SC Conferences —Demonstrations of two applications

VGrADS Demos at SC04 EMAN - Electron Microscopy Analysis [Rice, Houston] —3D reconstruction of particles from electron micrographs —Workflow scheduling and performance prediction to optimize mapping GridSAT - Boolean Satisfiability [UCSB] —Classic NP-complete problem useful in circuit design and verification —Performance-based dynamic resource allocation and scheduling EMAN Refinement Process

Leverage NMI —Community infrastructure that can be counted on —Vehicle for deployment of successful research —Outlet for technology State of Texas Support —LEARN: –Support for statewide networking –33 institutions $7.5 million over two years —TIGRE –Application driven software stack –$2.5 million for two years shared by 5 institutions –Rice and UH both participants Houston Research and Education Network —Rice, UH, Texas Med Center, NLR (lit in May) –Abovenet contribution $3.4 million

Management Value of Virtual Organization —No one site could take on topic of this breadth —Project vision requires an integrated approach –Thinking about broad range of issues from the beginning Management —Executive Committee —Technical Working Group Strategy for Collaboration —Workshops, Telecons –Overcome project management challenges through interactions —Grad student exchange

Research vs Development Original Proposal —Research —Significant investment in software infrastructure development —Application collaborations (funded staff members) Revised Statement of Work —Focus on research (guidance from NSF) Situation Today —To conduct research we are building prototype software —Leveraging NMI and Texas —Application collaborations still drive research

Annual Budget

VGrADS Summary: A Holistic Approach What justifies a Large ITR? —Community: no one institution covers everything —Unified project vision —Shared infrastructure —Integration: would not happen without a unified project VGrADS —Built on GrADS insights and experiences –community of leading researchers who work together effectively –broad coverage of requisite topics —Vision for extremely simple application development interface –grid virtualization to hide complexity —Shared software stack and testbed –vgES toolkit, policies and application drivers —Many interrelated layers require and benefit from the integrated effort –program tools, provisioning, scheduling, measurement –prediction, fault tolerance, infrastructure, applications

Annual Budget by Institution