Science Gateways
Marlon Pierce, Science Gateway Group, Indiana University

What I Want to Accomplish
Invite you to participate in the Science Gateway Institute
– NSF S2I2 planning grant
Invite you to participate in Apache Airavata
– Open community software for building scientific workflows for gateways

What Are Science Gateways?
Web-based user interfaces and services that provide a science-centric view of cyberinfrastructure.
Often centered around running science applications and workflows on grids and clouds.
But can also be data-centric, information-centric
– Earth System Grid
– eBird and citizen science portals
Or community-centric
– Such as many HUBzero hubs.

Gateways Support Science
Gateway (Domain): Metrics
NanoHUB (Nanotechnology): Has supported nanotechnology simulations and data sharing among more than 250,000 users since 2000 and has been cited in more than 900 publications.
CIPRES (Bioinformatics): Made it possible for more than 6,000 biologists to run phylogenetic analyses on XSEDE computing resources over the past 3 years and has enabled more than 475 publications over that period.
UltraScan (Biophysics): Supported the data analysis needs of over 120 active scientists and has contributed to over 60 publications during the last 3 years.
GridChem (Chemistry): Provided access to computational chemistry tools for more than 800 users, enabling 47 publications between 2007 and [end year missing in source].

Science Gateways Institute

NSF vision for cyberinfrastructure in the 21st century
Software is critical to today’s scientific advances
Science is all about connections
– Instruments, sensor networks, HPC facilities, campus laboratories, visualization facilities, data stores
– Connections are often made through software
  A critical, but often overlooked component

Software vision implemented in 2010
Software Infrastructure for Sustained Innovation (SI2) program
Scientific Software Elements (SSE)
– Small groups create software that advances one or more areas
Scientific Software Integration (SSI)
– Larger interdisciplinary teams, software frameworks
Scientific Software Innovation Institutes (S2I2)

Institutes: Long-term hubs of excellence
Serve a research community of substantial size and disciplinary breadth
Expertise, processes, architectures, resources and implementation mechanisms to transform research practices and productivity
Support, outreach, workforce development, proactive approach to diversity
Pathways to community involvement

The Science Gateway Institute Partners
Project Leadership
– Nancy Wilkins-Diehr, SDSC
Community Workshop Organization
– Katherine Lawrence, University of Michigan
Workforce Development
– Linda Hayden, ECSU
Gateway Providers
– iPlant: Dan Stanzione, Rion Dooley
– HUBzero: Michael McLennan, Michael Zentner
– Apache Airavata: Marlon Pierce, Suresh Marru

Millions of dollars are spent on gateways, but developers face several challenges:
They often work in isolation even though development can be quite similar across domain areas.
They need to bridge cyberinfrastructure: locally, campus-wide, nationally, and sometimes internationally.
They need foundational building blocks so they can focus on higher-level, grand-challenge functionality.
They struggle to secure sustainable funding because gateways span the worlds of research and infrastructure.

Incubator Service
Assist with the entire lifecycle of a gateway:
Business plan development and review
Development environment, consulting, documentation and software recommendations
Software repositories
Software engineering facilities
Software assessment services
– like Open Source Software Advisory Service, Apache assessment service, Software Sustainability Institute (UK)
Build-and-test facilities
Hosting service
Offering gateways expertise in the following areas:
– Usability assessment
– Licensing
– Sustainability
– Project management
– Security

Apache Airavata: Software for Scientific Workflows

What Is Apache Airavata?
Science Gateway software system to
– Compose, manage, execute, and monitor distributed, computational workflows
– Wrap legacy command line scientific applications with Web services
– Run jobs on computational resources ranging from local resources to computational grids and clouds
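The idea of composing, executing, and monitoring a workflow of wrapped command-line applications can be illustrated with a small sketch. This is not Apache Airavata code and does not use its API: the names `wrap_command` and `run_workflow`, the status strings, and the two one-line stand-in "applications" are all hypothetical, chosen only to show the pattern on a single local machine (Python 3.9+ assumed for `graphlib`).

```python
import subprocess
import sys
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def wrap_command(argv):
    """Wrap a command-line program as a callable that returns its stdout.

    Hypothetical helper for illustration; not part of Apache Airavata.
    """
    def run(*inputs):
        # Outputs of upstream steps are passed as extra command-line arguments.
        result = subprocess.run(
            [*argv, *inputs], capture_output=True, text=True, check=True
        )
        return result.stdout.strip()
    return run


def run_workflow(steps, dependencies):
    """Execute steps in dependency order and record each step's status.

    steps:        {name: callable}
    dependencies: {name: [names of upstream steps]}
    """
    status = {name: "PENDING" for name in steps}
    outputs = {}
    for name in TopologicalSorter(dependencies).static_order():
        upstream = [outputs[d] for d in dependencies.get(name, [])]
        status[name] = "RUNNING"
        try:
            outputs[name] = steps[name](*upstream)
            status[name] = "DONE"
        except subprocess.CalledProcessError:
            status[name] = "FAILED"
            break  # steps downstream of a failure stay PENDING
    return status, outputs


# Two stand-in "legacy applications", each a one-line Python command.
generate = wrap_command([sys.executable, "-c", "print(21)"])
double = wrap_command(
    [sys.executable, "-c", "import sys; print(int(sys.argv[1]) * 2)"]
)

status, outputs = run_workflow(
    steps={"generate": generate, "double": double},
    dependencies={"generate": [], "double": ["generate"]},
)
print(status, outputs)
```

A real gateway would replace the local `subprocess` call with job submission to remote grid or cloud resources and would expose the status table through a service interface; the sketch only shows the dependency-ordered execution and monitoring idea.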

Apache Airavata Architecture
[Architecture diagram. Component labels: Workflow Interpreter, Application Factory, Message Box, Registry, Apache Airavata API. Other labels: End Users, Gateway Developer, Scientific Application, Core Developer, Computational Resources.]

Apache Airavata in Action
Domain: Description
Astronomy: Image processing pipeline for One Degree Imager instrument on XSEDE
Astrophysics: Supporting workflow of Dark Energy Survey simulations working group on XSEDE
Bioinformatics: Supported workflow executions on Amazon EC2 for BioVLAB project
Biophysics: Manage large scale data analysis of analytical ultracentrifugation experiments on XSEDE and campus resources
Computational Chemistry: Manage workflows to support computational chemistry parameter studies for ParamChem.org on XSEDE
Nuclear Physics: Workflows for nuclear structure calculations using Leadership Class Configuration Interaction (LCCI) computations on DOE resources

Cyberinfrastructure: How Open is Open Source Software?
Open source licensing
Open standards
Open codes (GitHub, SourceForge, Google Code, etc.)
What’s missing?
We also need open governance

Open Community Software and Governance
Open source projects need diversity, governance.
– Reproducibility
– Sustainability
Incentives for projects to diversify their developer base.
Govern
– Software releases
– Contributions
– Credit sharing
– Members are added
– Project direction decisions
– IP, legal issues
Our approach: Apache Software Foundation
Collaborate
Compete

More Information
Science Gateway Institute:
– Nancy Wilkins-Diehr, PI
Contact me:
Apache Airavata:
You can contribute to Apache Airavata!
Join the mailing list:
YouTube presentation on Apache and NSF Cyberinfrastructure:

science gateway /sī′ əns gāt′ wā′/ n.
1. an online community space for science and engineering research and education.
2. a Web-based resource for accessing data, software, computing services, and equipment specific to the needs of a science or engineering discipline.
We are building an institute to serve you, and others like you, with resources, services, experts, and ideas for creating and sustaining science gateways.
Sign up to join the conversation:
Are you building gateways that serve your science discipline?
Do you wish you could connect with and learn from others who are doing the same thing?

NSF CI Advisory Committee commissions 6 task forces
Software task force recommends to NSF:
1. Multi-level, long term support (individual, team, institute)
2. Responsibility for verification, validation, reproducibility
3. Consistent policy on open source
4. Collaborations across divisions, agencies and industry
5. Use of ACCI to obtain community input on priorities

Figure 1. High-level architecture of software offerings and value-added services provided by the institute.

Science Gateways: Enabling & Democratizing Scientific Research
Knowledge and Expertise
Computational Resources
Scientific Instruments
Algorithms and Models
Archived Data and Metadata
Advanced Science Tools

Science Gateway Institute conceptualization award in 2012
Simultaneous NSF study identifies limitations to short-lived science portals or gateways
Characteristics of short funding cycles
– Build exciting prototypes with input from scientists
– Work with early adopters to extend capabilities
– Tools are publicized, more scientists interested
– Funding ends
– Scientists who invested their time to use new tools are disillusioned
  Less likely to try something new again
– Start again on new short-term project
Need to break this cycle and fund for long-term success

Gateway-Building Support
Institute staff assigned to a project for months, up to a year
– Assist with gateway development or implementation of advanced features
  Workflows, fault tolerance, sensor feeds, HPC simulations
– Teach research teams what it takes to build, enhance, operate, and maintain gateways after support ends
– Peer-reviewed request process open to all

Gateway Forum
Gathering place for scientific web developers across NSF directorates, agencies, and international boundaries
Social forums, white papers, blogs, testimonials and user stories
Annual conference
Broad and engaging symposium series
Gateway training program
– Synchronous and asynchronous, video tutorials
– Best practices, case studies
Showcase of successful projects
Environment that enables continuous community feedback

Gateway Framework
Modular, layered approach
– Supports community contributions
– Grocery store approach allows developers to pick and choose the components they need
Tiered architecture
1. Value-added services
  Publication channel for delivering content to a wider audience
  Information repositories for good design practices
  Information/code samples for best practices in user-interface and user-experience design
2. Core web framework which includes hosted site creation and content management
3. Platform API to provide a cohesive set of RESTful web services upon which the previous two layers rely
4. Systems layer where the hardware and low-level middleware reside
  Clouds and cloud services, HPC systems, grid middleware, data warehouses, databases, instrumentation, and distributed data stores
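The tiered architecture can be sketched in a few lines to show how the upper two layers depend only on the platform API. This is an illustrative sketch, not the institute's framework: every class, route, and component name (`SystemsLayer`, `PlatformAPI`, `GatewaySite`, `job_list_widget`, the `/jobs` path) is an assumption made for the example, and the "REST-style" routing is simulated in memory rather than served over HTTP.

```python
class SystemsLayer:
    """Layer 4 stand-in: an in-memory 'resource' that only records jobs."""

    def __init__(self):
        self.jobs = {}

    def submit(self, application, inputs):
        job_id = f"job-{len(self.jobs) + 1}"
        self.jobs[job_id] = {"application": application, "inputs": inputs}
        return job_id


class PlatformAPI:
    """Layer 3 stand-in: REST-style (method, path) routing over layer 4."""

    def __init__(self, systems):
        self.systems = systems
        self.routes = {
            ("POST", "/jobs"): self.create_job,
            ("GET", "/jobs"): self.list_jobs,
        }

    def request(self, method, path, body=None):
        handler = self.routes.get((method, path))
        if handler is None:
            return 404, {"error": "not found"}
        return handler(body or {})

    def create_job(self, body):
        job_id = self.systems.submit(body["application"], body.get("inputs", {}))
        return 201, {"id": job_id}

    def list_jobs(self, body):
        return 200, {"jobs": sorted(self.systems.jobs)}


class GatewaySite:
    """Layer 2 stand-in: a site assembled from the components a developer picks."""

    def __init__(self, api, components):
        self.api = api
        self.components = dict(components)

    def render(self):
        # Each component talks only to the platform API, never to layer 4.
        return {name: view(self.api) for name, view in self.components.items()}


def job_list_widget(api):
    """Layer 1 stand-in: an optional value-added component."""
    _, body = api.request("GET", "/jobs")
    return f"{len(body['jobs'])} job(s): {', '.join(body['jobs'])}"


api = PlatformAPI(SystemsLayer())
created = api.request("POST", "/jobs", {"application": "demo-app"})
site = GatewaySite(api, {"job_list": job_list_widget})
page = site.render()
print(created, page)
```

Passing components as a plain dictionary mirrors the "grocery store" idea: a developer includes only the pieces the gateway needs, and swapping the systems layer does not touch the site or its components.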

Workforce Development
Terrific opportunities for students and IT professionals
– Much science gateway development currently done by campus IT
Gateway building training
– Web development is a natural interest area for students
  Very visual, see results of programming instantly
– Builds cross-disciplinary communication skills
  Talk to scientists, construct a gateway that meets their needs
– Utilize existing programming opportunities such as Google Summer of Code
Opportunities to proactively address diversity

Community engagement activities in conceptualization grant
One-on-one interviews with community leaders
Group-based data collection
– Focus groups, BOFs, workshops
– Broad online surveys
Social-feedback services
– Get Satisfaction, UserVoice, HUBzero
Continued events in the full institute to stay in touch with the community
– Annual conference
– Rolling 5-minute polls