Science Gateways Update Nancy Wilkins-Diehr Science Gateways Area Director Quarterly Meeting, September 6-7, 2007.



Today’s Presentation
Q4FY07
  – Progress on top five goals
  – Activities
Gateway Highlights
Q1FY08
  – Top five goals

Top 5 Goals for Q4FY07
1. Finalize community account policy; provide web interface to account details for TG security staff
2. Identify documentation staff member for gateways and begin work turning primer into documentation
3. Package TG07 tutorial materials for online access
4. Clarify and address gateway comments from February review
   - check-out procedure before gateway goes production
   - collection of usage info, including user and discipline-based demographics, from each gateway
   - provide better user documentation for each SGW, online and on-site training for the SGWs, and more widespread promotion of the SGWs to potential user communities
   - describe how impact is measured, how scopes and objectives are defined

Q4FY07 Activities
July
  – Gateway involvement in scheduling-wg kickoff
      Neutron Science, GISolve, nanoHUB to participate in metascheduling testbed
  – Discussion of log collection with security-wg
  – Presentation and Q&A with Von on attribute-based authentication
  – Finalized gateway xRAC proposal tips
      Prepared with assistance from xRAC reviewers
  – Online demo of NWS-QBETS web service
August
  – Gateway list archives available from public page
  – TG07 tutorial slides up
      Work continues on audio presentations
  – Data-wg joins call to solicit advice on priorities that will meet gateway needs
  – Jim Basney presents on certificate management work
September

Gateway Activities and Futures
Recent/Current Activities
  – “How to Write a Winning Gateway Proposal”
  – “Building Blocks for Science Gateways”, TG07
  – Community account portal
  – Return of the original gateway RAT
      Gateway Survey II
  – Gateway transitions
      New projects, new services
Futures
  – Gateway hosting
  – Interactive access
  – Application hosting
  – Web 2.0 capabilities
  – U Mich Gateway workshop results

How to Write a Winning Gateway Proposal
Completed June 2007
Community usage is an approved usage model on NSF-funded resources
  – Want successful Gateway PIs
Current proposal instructions not tailored to gateway usage
Developed augmented proposal instructions in conjunction with xRAC reviewers
  – Describe algorithms and codes to be run, but also
      Classes of activities that will be enabled
      Target audience
      Typical use cases
      Criteria for success
      Gateway management, user tracking

Building Blocks for Science Gateways
Captured for asynchronous learning
“Build a Gateway in an Afternoon” presented at TG07
Two components
  – Accounts on web server at U Iowa
  – TG accounts
Very simple framework to do the basics
  – Compute jobs, data transfer, visualization
Complete instructions provided on how to recreate this at home
If and how to provide continued access
  – DAC requests to get accounts necessary to work through asynchronous training materials?
  – Continued access to a web server?

Community Account Portal
Under development
Designed for security-wg
Password-protected internal site
Type in community account username and bring up associated info
Write permission for security staffers to add info about how and when these accounts have been secured at each site
Repository for Gateway logs
  – Automatic mechanism and location for gateways to send logs
  – Notification when logs are stale
Perhaps recent usage data
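The "notification when logs are stale" item could be sketched roughly as follows. This is only an illustration, not the portal's design: the seven-day threshold, the gateway names, and the function name `find_stale_gateways` are all assumptions made for the example.

```python
from datetime import datetime, timedelta

# Hypothetical threshold: treat a gateway's logs as stale after 7 days of silence.
STALE_AFTER = timedelta(days=7)


def find_stale_gateways(last_received, now, stale_after=STALE_AFTER):
    """Return the sorted names of gateways whose newest log is older than stale_after.

    last_received maps a gateway name to the datetime its latest log arrived.
    """
    return sorted(
        name
        for name, received in last_received.items()
        if now - received > stale_after
    )


if __name__ == "__main__":
    # Made-up gateway names and dates, for illustration only.
    now = datetime(2007, 9, 6)
    last_received = {
        "gateway-a": datetime(2007, 9, 5),
        "gateway-b": datetime(2007, 8, 1),
    }
    for name in find_stale_gateways(last_received, now):
        print(f"Logs from {name} are stale")
```

A real deployment would also need to decide who is notified and how often; the slide does not say.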

Return of the Gateway RAT
Gateway Survey II planned for Fall 2007
Original gateway RAT determined priorities for the last 2+ years
  – Time to check in again and resynch
Draft developed with Doru Marcusiu
Content will be developed with input across TG

Q1FY08 (Oct-Dec) Top 5ish
1. Move dangerously close to routine, production use of TG
2. GRAM and GridFTP monitoring

Gateways Transition
Most “original gateways” into production by 1/31/08
New projects
  – Take incubation, matchmaking
      CReSIS, SIDgrid, Navajo Tech
  – Requests through peer review
      CIG, June 2006
Gateway use in education
  – TG07 student competition
  – EOT supplement, participation in SC07 education program
TeraGrid Gateways as a community builder
  – Not just a group of funded projects
  – Provides a unique forum for developers to share experiences
      U Mich Gateway workshop reinforces the need for this forum
  – Interest in calls and archives from outside the project
      Detailed gateway analysis by caBIG

Increase in General Services
Now need to clearly organize what’s being offered
  – Looking at skeleton gateway (SimpleGrid) and web service registry as organizing principles
General web services
  – Continually watch new technologies
      » REST
      » Web 2.0
GRAM audit components
  – System for capturing per-job usage and attributing it to individual gateways
Attribute-based authentication
Credential management
  – Takes coordination, group input
Increased RP gateway activity
  – IU team led by Marlon Pierce
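The idea behind the GRAM audit item, capturing usage per job and rolling it up per gateway, can be illustrated with a small sketch. It does not use or describe the actual GRAM audit schema; the record layout `(gateway, job_id, cpu_hours)` and the function name `usage_by_gateway` are assumptions for the example.

```python
from collections import defaultdict


def usage_by_gateway(job_records):
    """Sum CPU hours per gateway.

    job_records is an iterable of (gateway, job_id, cpu_hours) tuples;
    this layout is hypothetical, not the GRAM audit format.
    """
    totals = defaultdict(float)
    for gateway, _job_id, cpu_hours in job_records:
        totals[gateway] += cpu_hours
    return dict(totals)


if __name__ == "__main__":
    # Made-up records, for illustration only.
    records = [
        ("gateway-a", "job-1", 2.0),
        ("gateway-b", "job-2", 1.5),
        ("gateway-a", "job-3", 0.5),
    ]
    for gateway, hours in sorted(usage_by_gateway(records).items()):
        print(gateway, hours)
```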

Attribute-based Authorization
Allow Science Gateways to push attributes to TeraGrid RP sites with credential
  – E.g. client identifier, VO role, client IP address
      Could be passed from user’s IdP or generated locally
Development on attribute push code nearing completion
  – GridShib for GT Tech Preview and GridShib-SAML-Tools have been released
RP can authorize/de-authorize based on attributes
Demonstrate integration with GISolve-based Science Gateway
Deploying attribute auditing/authorization code onto NCSA
Met with Security and Science Gateways WGs and documented initial use cases
  – AA_Testbed:Simple_Science_Gateway_with_Attributes
Source: Von Welch
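As a rough illustration of an RP authorizing or de-authorizing based on pushed attributes, the sketch below checks a request's attributes against site-local deny lists. It is not GridShib code and does not reflect its API; the attribute keys `gateway_id` and `client_ip` and the function name `authorize` are assumptions for the example.

```python
def authorize(attributes, denied_gateways=frozenset(), denied_ips=frozenset()):
    """Return True unless an attribute pushed with the credential is on a deny list.

    attributes is a dict of values that arrived with the credential; the keys
    "gateway_id" and "client_ip" are hypothetical names chosen for this sketch.
    """
    if attributes.get("gateway_id") in denied_gateways:
        return False
    if attributes.get("client_ip") in denied_ips:
        return False
    return True


if __name__ == "__main__":
    # Made-up request attributes; 192.0.2.0/24 is a documentation address range.
    request = {"gateway_id": "gateway-a", "vo_role": "user", "client_ip": "192.0.2.10"}
    print(authorize(request))
    print(authorize(request, denied_ips={"192.0.2.10"}))
```

The point of pushing attributes is visible here: a site can block one misbehaving end user's address without disabling the whole community account.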

Forward-looking capabilities
Gateway hosting
  – Is it our mission? Is it worth the investment?
  – Provides
      Quick start for those building gateways
      Access to TG07 Building Blocks Tutorial
      Direct access to TG-hosted data
  – Same challenges as data collections
  – Could be implemented by CTSS gateway kit and hardware
      SSH client and server
      Globus Toolkit client tools
      Java J2SE 1.5
      Ant
      Apache httpd
      Tomcat
      GridSphere (customized)
      MySQL server
      SimpleGrid
      TeraGrid CA configuration

Forward-looking capabilities
Interactive Access
  – Queue waits are a priority for all users, especially gateways
  – Metascheduling to identify free cycles
      Neutron Science, GISolve, nanoHUB participating in metascheduling testbed
  – On-demand capabilities
      SPRUCE into production in the spring
  – Virtual machines?
Application hosting
  – Ability to provide direct access to applications
  – RENCI interested in providing bio app access
  – PGRADE/GEMLCA, AHE?
REST and Web 2.0
  – User-generated content and user/third-party integration of services/capabilities
      See some early science applications like openwetware.org, myexperiment.org
  – JP and Lee already looking at publishing TG information services with REST front-end

Build on Results from Ann Zimmerman’s Gateway Developer Workshop
Community-driven, participatory planning process for the future of TeraGrid
“What would TeraGrid be if it met the needs of your science gateway perfectly?”
Many themes relevant for all users, not just gateways
Gateway developers value a venue for interacting with other developers
  – TeraGrid has been good at providing this
17 Gateways at workshop
  – 7 domains represented
  – Most funded in part by the NSF

Top Solutions
Develop gateway framework templates built upon toolkits which may already exist (8 votes across all criteria)
Peering with NLR (National Lambda Rail), Internet2, etc. (8)
Have common scheduling of jobs across different TeraGrid sites (7)
Take meta-scheduling seriously, not as a future dream; allocate funding for development (11)
Training, education, workshops, generalized & standardized basic services, documentation (23 for 7 items)
Do not invest $200M into a single machine (15)
  – $100M in a capacity machine
  – $100M in “content”: middleware, interfaces, and end-user applications
Reliably performing global file system with a fast local I/O (7)
Standardize certificate-based authentication/authorization (10)
End-to-end support for Virtual Organizations (9)
Source: Ann Zimmerman


Key Issues
Support interaction and cross-fertilization among Science Gateway development communities
  – Sharing code and successful solutions
  – Financial and professional support for developing gateways
Reduce hurdles that make using and building on TeraGrid difficult
  – Reliability and tracking of upgrades
  – Length of development cycle
  – Bureaucracy
Source: Ann Zimmerman

Prominent Needs
Basic services that gateways can use instead of creating their own.
Templates and standardized systems to save developers the time of recreating things that others have already built.
Standardization that would make TeraGrid a real grid that could support the effective use of allocations and meta-scheduling.
Operating more effectively as a community in order to better support the education and development needs of gateway developers.
Source: Ann Zimmerman

Thank You

SPRUCE Jul-Sep 07 Accomplishments
Accomplished Milestones
  – Ported software to work with SGE on SDSC OnDemand cluster
  – Integration of SPRUCE with Condor in progress
  – Testbed WS-GRAM support complete at UC/ANL
  – Implementing Q/A backend using Inca
  – Prototype system for overall job turnaround time predictor nearing completion
  – Participating in scheduling-wg to evaluate the software
Science Impact
  – Most of the work this quarter was to port and extend the system to work with different capabilities such as Condor, WS-GRAM, and SGE. We hope to finish this work by next quarter.

SPRUCE Oct-Dec 07 Plans
Road Map
  – Finish all the tasks from previous quarter
  – Collaborate with HARC to integrate advance reservations
  – Work on policy encoding lookup table for various projects per resource
  – Upgrade the documentation to provide starter guides at each site
  – Work with other applications to add SPRUCE support
  – Prepare for SC07
Science Impact
  – The focus will be on finishing the current collaboration projects and putting them under test. We also want to concentrate on further documentation and on consolidating the SPRUCE user and resource provider base. SC07 demos and talks will concentrate on this aspect too.

Caltech TeraGrid Science Gateways
Status July-September 2007
Plans October-December 2007
Julian Bunn, Matthew Graham, Conrad Steenberg, Roy Williams

Caltech TeraGrid Science Gateways
Accomplishments
  – Regression test for NESSSI mosaicking mashup services
      Test each backend survey, fix details of XML and protocol
  – Experimenting with NESSSI services on Amazon.com “elastic computing cloud”
      Tradeoffs between TeraGrid and commercial resource provider
  – Rebuilt Clarens server on a /gpfs-wan headnode
      Getting the code verified and authorized with SDSC/TeraGrid admins
  – New Clarens release
      Improved speed and reliability. New database implementation
      NESSSI addons packaged as an RPM
  – Two-day visit to Caltech by Rick Wagner for collaboration on ENZO cosmology portal
  – ROOTlet client code completely rewritten: now uses same protocol as for NaradaBrokering (NB)
      Improves stability, allows extensions more easily, reduces dependencies, uses https protocol
  – ROOTlet server now issues publish/subscribe status messages to NB
  – New leveraged funding for ROOTlets from DOE’s STTR program
  – First version of Grid-enabled StatPatternRecognition application, for deployment as a TeraGrid/Clarens service
      Over 20 powerful classifiers, including bagged decision trees, neural net, etc.

Caltech TeraGrid Science Gateways
Plans
  – Add NB module to ROOTlet client
  – Tests of ROOTlets with TeraGrid batch queues
  – Present this work, and the role of TeraGrid, at “Computing in High Energy Physics” conference, Victoria, BC in September
  – Develop demonstration ROOTlet analysis for SC07 in Reno: will use TeraGrid
  – Develop polished interface to SPR as a new Clarens service

GISolve Q4FY07 Accomplishments Highlights
Background
  – Geographic Information Science (GIScience) is an interdisciplinary field involving geography and other social sciences, computer science, geodesy, information sciences, and statistics; it studies generic issues in the development and use of geographic information systems (GIS) technologies.
Milestones
  – Regular number of users: approximately 50
  – Developed and released the SimpleGrid toolkit and its preliminary documentation by leveraging GISolve experience
      Support the education and training of TeraGrid Science Gateway technologies
      Help bring up new TeraGrid Science Gateways
      Help test and integrate attribute-based authorization solutions for TeraGrid Science Gateways
  – Education
      The following course (24 students) at the University of Illinois at Urbana-Champaign is using GISolve
        – Advanced Geographic Information Systems (undergraduate and graduate)
Impact on science
  – Produced the following research publications
      Wang, S., and Armstrong, M. P. “A Spatial Computational Domain Theory.” International Journal of Geographical Information Science, under revision

GISolve Q1FY08 Plans
Milestones
  – Deploy a new visualization service to support analysis steering
  – Produce a paper based on SimpleGrid experience
  – Explore new applications
      Land use and management for environmental sustainability
      Anthropology
  – Continue to develop and support the SimpleGrid toolkit
Describe impact on science
  – GISolve enables computationally intensive geographic analyses and supports collaborative scientific investigations that rely on geospatial information.
  – GISolve represents next-generation advanced Web-GIS that allows a large number of users to collaborate and share the workflows of geographic analyses.
A public health study using a spatial interpolation method in GISolve

IU GIG Quarterly Gateway Highlights
Q3FY07 Accomplishments
  – LEAD & NOAA scientists used LEAD Gateway to run on-demand forecasts as part of the Storm Prediction Center’s Spring Experiments
  – Continued working with Globus developers & TG RP staff in testing and debugging the latest versions of GRAM and GridFTP, which will be rolled out as part of CTSS V4
  – Interactions with SIDGrid Project & potential Genomics Gateway
  – Designing generic Resource Selection & Data Movement Web Services useful for multiple Gateways
Q4FY07 Plans
  – Working with Weather Challenge on using LEAD Portal to enable participation by undergraduate students from 67 universities
  – Continue to work with TG RP team to ensure stability of GRAM and GridFTP with intended LEAD Gateway usage of half a million SUs in Fall 07
  – Help new Gateways transition to using high-end TG Data and Compute Resources

TeraGrid Visualization Gateway Highlights
Q4FY07 Accomplishments
  – Dynamic Accounts for community users
      Hardening of these capabilities; hope to move to production this quarter, depending on hardware/software availability
  – Visibility on main TeraGrid Website
      Rolled out first round of Visualization documentation on main TG website, including pointer to the Visualization Gateway
Q1FY08 Plans
  – Additional visualization services
      TeraDRE portlet: collaborating with Purdue to get this capability into production on the Visualization Gateway during Q1FY08
      Exploring possible VMD service to visualize NAMD simulations (collaborating with Indiana)
  – SC07 Demonstration
~40 users have logged into the Visualization Gateway
  – 25 TeraGrid Users
  – 10 Training accounts during TG07
  – 5 community users

Quarterly RENCI Gateway Highlights
FY07 Q3 Accomplishments
  – Continued support of the production Bioportal
  – Outreach: three community presentations about the RENCI Gateway and the general Gateway program
  – Continued integration of the Web Service hosting infrastructure at RENCI for BioScientists in support of the workflow infrastructure, which uses the TeraGrid on the back-end
  – Continued discussions with a potential new Science Gateway: iFold
  – Discussions with consumers of the TeraGrid back-ended BioScience web services
  – Began work on log file parsing for submission to TeraGrid
  – Began examination of myCluster for large numbers of serial jobs on TG resources

Quarterly RENCI Gateway Highlights
FY07 Q4 Plans
  – Complete log file submission mechanism
  – Prototype sending all necessary info with job to eliminate need for log file submissions
  – Complete myCluster integration for large numbers of serial jobs
  – Develop and implement API for gateway user/job investigation
  – Maintain the workflow infrastructure integration and continue to develop specific workflows by engaging directly with key BioScientists
  – Work directly with one community to assist them in becoming a self-sufficient TeraGrid Science Gateway; identify and develop relationships for a second community to begin assisting them in becoming a Gateway during FY07