1 NGVLA WORKSHOP – DECEMBER 8, 2015 – NRAO SOCORRO, NM Name of Meeting Location Date - Change in Slide Master Computing for ngVLA: Lessons from LSST Jeffrey.

Slides:



Advertisements
Similar presentations
CS487 Software Engineering Omar Aldawud
Advertisements

Sponsored by the National Science Foundation Project PrimoGENI Spiral 2 Year-end Project Review Florida International University PI: Jason Liu;
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
DCS Architecture Bob Krzaczek. Key Design Requirement Distilled from the DCS Mission statement and the results of the Conceptual Design Review (June 1999):
Building a Framework for Data Preservation of Large-Scale Astronomical Data ADASS London, UK September 23-26, 2007 Jeffrey Kantor (LSST Corporation), Ray.
1 HOT-WIRING THE TRANSIENT UNIVERSE | SANTA BARBARA, CA | MAY 12-15, 201 Name of Meeting Location Date - Change in Slide Master LSST Alert Production Pipelines.
Introduction to LSST Data Management Jeffrey Kantor Data Management Project Manager.
Gaining Support for a Sustainable Agile Transformation Dennis Stevens, VP Enterprise Engagements LeadingAgile November 12, 2013.
Process: A Generic View
CXC Implementing 2007 NRC Portals of the Universe Report Chandra X-ray Center Recommended Best Practices Roger Brissenden and Belinda Wilkes 25 April 2012.
Hunt for Molecules, Paris, 2005-Sep-20 Software Development for ALMA Robert LUCAS IRAM Grenoble France.
Chapter 2 The process Process, Methods, and Tools
LSST Scheduler status Francisco Delgado Sr. Software Engineer Telescope & Site.
DESC mtg U Penn June, 2012 Computing Infrastructure Computing Parallel Session R.Dubois
DCS Overview MCS/DCS Technical Interchange Meeting August, 2000.
National Center for Supercomputing Applications Observational Astronomy NCSA projects radio astronomy: CARMA & SKA optical astronomy: DES & LSST access:
1 New Frontiers with LSST: leveraging world facilities Tony Tyson Director, LSST Project University of California, Davis Science with the 8-10 m telescopes.
1 Radio Astronomy in the LSST Era – NRAO, Charlottesville, VA – May 6-8 th LSST Survey Data Products Mario Juric LSST Data Management Project Scientist.
Chapter 2 Process: A Generic View
Spitzer Space Telescope Lisa Storrie-Lombardi Spitzer Science Center, Manager & Asst. Director for Community Affairs Implementing Portals of the Universe:
EPE Release 2 IOC Review August 7, 2012 Ocean Observatories Initiative OOI EPE Release 2 Initial Operating Capability Review System Development Overview.
1 Chapter 5 Software Engineering Practice. 2 What is “Practice”? Practice is a broad array of concepts, principles, methods, and tools that you must consider.
1 Hot-Wiring the Transient Universe Santa Barbara CA May 12, 2015 LSST + Tony Tyson UC Davis LSST Chief Scientist.
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative OOI Cyberinfrastructure Architecture Overview Michael Meisinger Life Cycle Architecture Review.
Doug Tody E2E Perspective EVLA Advisory Committee Meeting December 14-15, 2004 EVLA Software E2E Perspective.
Using the NSA Presentation to NOAO Users Committee October 5, 2005.
CRISP & SKA WP19 Status. Overview Staffing SKA Preconstruction phase Tiered Data Delivery Infrastructure Prototype deployment.
N. RadziwillEVLA NSF Mid-Project Report May 11-12, 2006 NRAO End to End (e2e) Operations Division Nicole M. Radziwill.
T Project Review WellIT PP Iteration
DC2 Post-Mortem/DC3 Scoping February 5 - 6, 2008 DC3 Goals and Objectives Jeff Kantor DM System Manager Tim Axelrod DM System Scientist.
Test and Integration Robyn Allsman LSST Corp DC3 Applications Design Workshop IPAC August , 2008.
1 Computing Challenges for the Square Kilometre Array Mathai Joseph & Harrick Vin Tata Research Development & Design Centre Pune, India CHEP Mumbai 16.
Quality Assurance Benchmark Datasets and Processing David Nidever SQuaRE Science Lead.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
LSST Scheduler construction plan Francisco Delgado Sr. Software Engineer Telescope & Site.
Common Archive Observation Model (CAOM) What is it and why does JWST care?
1 CORE DATA PROCESSING SOFTWARE PLAN REVIEW | SEATTLE, WA | SEPTEMBER 19-20, 2013 Name of Meeting Location Date - Change in Slide Master Data Release Processing.
What the Data Products Program Offers Users Todd Boroson Dick Shaw Presentation to NOAO Users Committee October 23, 2003.
SAGE meeting Socorro, May 22-23, 2007 EVLA Science Operations: the Array Science Center Claire Chandler NRAO/Socorro.
LSST VAO Meeting March 24, 2011 Tucson, AZ. Headquarters Site Headquarters Facility Observatory Management Science Operations Education and Public Outreach.
TeraGrid Gateway User Concept – Supporting Users V. E. Lynch, M. L. Chen, J. W. Cobb, J. A. Kohl, S. D. Miller, S. S. Vazhkudai Oak Ridge National Laboratory.
The Large Synoptic Survey Telescope Project Bob Mann LSST:UK Project Leader Wide-Field Astronomy Unit, Edinburgh.
Ray Plante for the DES Collaboration BIRP Meeting August 12, 2004 Tucson Fermilab, U Illinois, U Chicago, LBNL, CTIO/NOAO DES Data Management Ray Plante.
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
Proposal: staged delivery of Scheduler and OpSim V1 (2016) meet most of the SRD requirements – Deliver a system that can be extended with an improved scheduler.
N. RadziwillEVLA Advisory Committee Meeting May 8-9, 2006 NRAO End to End (e2e) Operations Division Nicole M. Radziwill.
1 LSST Town Hall 227 th meeting of the AAS 1/7/2016 Pat Eliason, LSSTC Executive Office Pat Osmer, LSSTC Senior Advisor.
Software Development Process CS 360 Lecture 3. Software Process The software process is a structured set of activities required to develop a software.
Frazer OwenNSF EVLA Mid-Project Review May 11-12, Transition to EVLA
1 OSG All Hands SLAC April 7-9, 2014 LSST Data and Computing - Status and Plans Dominique Boutigny CNRS/IN2P3 and SLAC OSG All Hands meeting April 7-9,
ENEA GRID & JPNM WEB PORTAL to create a collaborative development environment Dr. Simonetta Pagnutti JPNM – SP4 Meeting Edinburgh – June 3rd, 2013 Italian.
LSST CORPORATION Patricia Eliason LSSTC Executive Officer Belgrade, Serbia 2016.
TeraGrid’s Process for Meeting User Needs. Jay Boisseau, Texas Advanced Computing Center Dennis Gannon, Indiana University Ralph Roskies, University of.
1 AMCL July 17, 2015 Agile software development in an earned value world: a survival guide AS16 - AS Jeffrey Kantor LSST Data Management Project.
1 OBSERVATORY CONTROL SYSTEM (OCS) FRANCISCO DELGADO OCS CAM.
T. Axelrod, NASA Asteroid Grand Challenge, Houston, Oct 1, 2013 Improving NEO Discovery Efficiency With Citizen Science Tim Axelrod LSST EPO Scientist.
LSST Commissioning Overview and Data Plan Charles (Chuck) Claver Beth Willman LSST System Scientist LSST Deputy Director SAC Meeting.
Software Engineering cosc 4359 Spring 2017.
Bob Jones EGEE Technical Director
LSST Data Management System Science Team Meeting Mario Juric, University of Washington LSST Data Management Subsystem Scientist Joint Technical Meeting.
From manual test shop to fully automated test coverage: A How-To session to speed up your journey Jayshree Bhakta ITHAKA/JSTOR.
Software Project Configuration Management
From LSE-30: Observatory System Spec.
LSST Commissioning Overview and Data Plan Charles (Chuck) Claver Beth Willman LSST System Scientist LSST Deputy Director SAC Meeting.
Optical Survey Astronomy DATA at NCSA
What are DM folks working on?
Click to add title Planning for LSST Verification George Angeli LSST All Hands Meeting Tucson August 15, 2016.
Mark McKinnon EVLA Project Manager
Observatory Science Operations
Observatory Science Operations
Presentation transcript:

1 NGVLA WORKSHOP – DECEMBER 8, 2015 – NRAO SOCORRO, NM Name of Meeting Location Date - Change in Slide Master Computing for ngVLA: Lessons from LSST Jeffrey Kantor, LSST Data Management Project Manager

2 NGVLA WORKSHOP – DECEMBER 8, 2015 – NRAO SOCORRO, NM A Dedicated Survey Telescope − A wide (half the sky), deep (24.5/27.5 mag), fast (image the sky once every 3 days) survey telescope. Beginning in 2022, it will repeatedly image the sky for 10 years. − The LSST is an integrated survey system. The Observatory, Telescope, Camera and Data Management system are all built to support the LSST survey. There’s no PI mode, proposals, or time. − The ultimate deliverable of LSST is not the telescope, nor the instruments; it is the fully reduced data. All science will be come from survey catalogs and images Telescope  Images  Catalogs

3 NGVLA WORKSHOP – DECEMBER 8, 2015 – NRAO SOCORRO, NM Open Data, Open Source: A Community Resource − LSST data, including images and catalogs, will be available with no proprietary period to the astronomical community of the United States, Chile, and International Partners − Alerts to variable sources (“transient alerts”) will be available world-wide within 60 seconds, using standard protocols − LSST data processing stack will be free software (licensed under the GPL, v3-or-later) − All science will be done by the community (not the Project!), using LSST’s data products.

4 NGVLA WORKSHOP – DECEMBER 8, 2015 – NRAO SOCORRO, NM LSST From a Scientist’s Perspective − A stream of ~10 million time-domain events per night, detected and transmitted to event distribution networks within 60 seconds of observation. − A catalog of orbits for ~6 million bodies in the Solar System. − A catalog of ~37 billion objects (20B galaxies, 17B stars), ~7 trillion observations (“sources”), and ~30 trillion measurements (“forced sources”), produced annually, accessible through online databases. − Deep co-added images. − Services and computing resources at the Data Access Centers to enable user-specified custom processing and analysis. − Software and APIs enabling development of analysis codes. Level 3 Level 1 Level 2

5 NGVLA WORKSHOP – DECEMBER 8, 2015 – NRAO SOCORRO, NM HQ Site Science Operations Observatory Management Education and Public Outreach HQ Site Science Operations Observatory Management Education and Public Outreach Summit and Base Sites Telescope and Camera Data Acquisition Crosstalk Correction Long-term storage (copy 1) Chilean Data Access Center Summit and Base Sites Telescope and Camera Data Acquisition Crosstalk Correction Long-term storage (copy 1) Chilean Data Access Center Dedicated Long Haul Networks Two redundant 100 Gbit links from Santiago to Florida (existing fiber) Additional 100 Gbit link (spectrum on new fiber) from Santiago – Florida (Chile and US national links not shown) Dedicated Long Haul Networks Two redundant 100 Gbit links from Santiago to Florida (existing fiber) Additional 100 Gbit link (spectrum on new fiber) from Santiago – Florida (Chile and US national links not shown) Satellite Processing Center CC-IN2P3, Lyon, France Data Release Production (50%) French DAC Satellite Processing Center CC-IN2P3, Lyon, France Data Release Production (50%) French DAC Archive Site Archive Center Alert Production Data Release Production (50%) Calibration Products Production EPO Infrastructure Long-term Storage (copy 2) Data Access Center Data Access and User Services Archive Site Archive Center Alert Production Data Release Production (50%) Calibration Products Production EPO Infrastructure Long-term Storage (copy 2) Data Access Center Data Access and User Services

6 NGVLA WORKSHOP – DECEMBER 8, 2015 – NRAO SOCORRO, NM Infrastructure: Petascale Computing, Gbit Networks Long Haul Networks to transport data from Chile to the U.S. 200 Gbps from Summit to La Serena (new fiber) 2x100 Gbit (minimum) for La Serena to Champaign, IL (protected, existing fiber) Archive Site and U.S. Data Access Center NCSA, Champaign, IL Archive Site and U.S. Data Access Center NCSA, Champaign, IL Base Site and Chilean Data Access Center La Serena, Chile Base Site and Chilean Data Access Center La Serena, Chile The computing cluster at the LSST Archive (at NCSA) will run the processing pipelines. Single-user, single-application, dedicated data center Process images in real-time to detect changes in the sky Produce annual data releases

7 NGVLA WORKSHOP – DECEMBER 8, 2015 – NRAO SOCORRO, NM Science Pipelines: Scientific Core of LSST DM − Science Pipelines carry core scientific algorithms that process or analyze raw LSST data to generate output Data Products − N.b. also referred to as “Applications” − Variety of processing Image processing Measurement of source properties Associating sources across space and time, e.g. for tracking solar system objects Science Pipelines

8 NGVLA WORKSHOP – DECEMBER 8, 2015 – NRAO SOCORRO, NM Middleware Layer: Isolating Hardware, Orchestrating Software Enabling execution of science pipelines on hundreds of thousands of cores. Frameworks to construct pipelines out of basic algorithmic components Orchestration of execution on thousands of cores Control and monitoring of the whole DM System Isolating the science pipelines from details of underlying hardware Services used by applications to access/produce data and communicate "Common denominator" interfaces handle changing underlying technologies

9 NGVLA WORKSHOP – DECEMBER 8, 2015 – NRAO SOCORRO, NM Database and Science UI: Delivering to Users Massively parallel, distributed, fault-tolerant relational database. To be built on existing, robust, well- understood, technologies (MySQL and xrootd) Commodity hardware, open source Advanced prototype in existence (qserv) Science User Interface to enable the access to and analysis of LSST data Web and machine interfaces to LSST databases Visualization and analysis capabilities

10 NGVLA WORKSHOP – DECEMBER 8, 2015 – NRAO SOCORRO, NM Distributed Team, SQuaREd Infrastructure Middleware Core Algorithms (“Apps”) Database UI Science QA & Reliability Engineering (SQuaRE)

11 NGVLA WORKSHOP – DECEMBER 8, 2015 – NRAO SOCORRO, NM Tools and Processes − Mailing lists, wikis, Confluence, telecons, videocons, … − Github − Jenkins − JIRA planning and Earned Value − Licensing and contributions − Investigating Slack and Discourse 11

12 NGVLA WORKSHOP – DECEMBER 8, 2015 – NRAO SOCORRO, NM Data Management Planning Process/Levels Detailed 6-month plan Developed from Agile Process (Epics, Stories) Individual resource assignments ~100 activities per cycle Granularity of activities: 1 – 3 months Granularity of steps within activities: 2 – 20 days Detailed 6-month plan Developed from Agile Process (Epics, Stories) Individual resource assignments ~100 activities per cycle Granularity of activities: 1 – 3 months Granularity of steps within activities: 2 – 20 days Project Risks & Milestones DM System Requirements and Roadmap PMCS Release Plan DM Roadmap (LDM-240) High-level picture Granularity: 12 months DM Roadmap (LDM-240) High-level picture Granularity: 12 months Institutional-level resource assignments Budgeting Earned-value management Basis of monthly reports Granularity: months 2 x 6-month development/release cycle per year activities for each cycle Institutional-level resource assignments Budgeting Earned-value management Basis of monthly reports Granularity: months 2 x 6-month development/release cycle per year activities for each cycle LSST System Requirements and Operations Plans

13 NGVLA WORKSHOP – DECEMBER 8, 2015 – NRAO SOCORRO, NM LSST Software Development Roadmap (LDM-240)  Feature Implementation   Performance Improvement  Major axes are 12-month cycles x WBS Cells represent milestones/major results Cells to left are predecessors to cells to the right Major axes are 12-month cycles x WBS Cells represent milestones/major results Cells to left are predecessors to cells to the right

14 NGVLA WORKSHOP – DECEMBER 8, 2015 – NRAO SOCORRO, NM LDM-240 DM Key Performance Metrics Major axes are 12-month cycles x metric Cells represent planned achievement of metric Progressively increasing performance from to left to right SRD minimum spec is met first, then design spec, then (if applicable) stretch Major axes are 12-month cycles x metric Cells represent planned achievement of metric Progressively increasing performance from to left to right SRD minimum spec is met first, then design spec, then (if applicable) stretch

15 NGVLA WORKSHOP – DECEMBER 8, 2015 – NRAO SOCORRO, NM EV Data Flow Institution Invoice and WBS/Account Breakout

16 NGVLA WORKSHOP – DECEMBER 8, 2015 – NRAO SOCORRO, NM Roadmap − 2016: Finish end-to-end, start refining components, developer multipliers, run easily on precursor data − 2017: Reprocess DES, continuous simulated Alert Production − 2018: Ready for ComCam − 2019: Ready for full Camera − 2020: Ready for Science Verification 16

17 NGVLA WORKSHOP – DECEMBER 8, 2015 – NRAO SOCORRO, NM Team CultureTeam Culture (emphasis added, truncated) The LSST DM community includes LSST-paid staff at multiple institutions and external contributors from around the world. We have built a strong team that collaborates well. As we hire new people, we want to make sure we are maintaining a healthy, supportive, productive culture. While culture is best transmitted by daily example, having some formal standards for conduct can aid newcomers and reinforce good patterns. Since LSST is an AURA center, the AURA standards of workplace conduct provide a starting point. As stated there, we in DM dedicate ourselves to fostering a civil and inclusive community characterized by mutual respect for the contributions of all individuals. As a community, we embrace the values in that document, in addition to any local institutional standards. 17

18 NGVLA WORKSHOP – DECEMBER 8, 2015 – NRAO SOCORRO, NM Lessons Learned − AURA/LSST is in many ways a completely new entity, with inherent start-up activities − Hiring has been slow, particularly for lead roles − Team is not yet developing at maximum productivity − It is hard to mesh scientists, managers, and engineers on one team − Algorithmic progress is hard to measure and even harder to predict − Architectures are evolving rapidly: e.g. Cloud, AstroPy, Hadoop ecosystem − It is hard to balance technical priorities, trading off between performance/scale versus adaptability and usability − Reorganize work with delayed staffing, relieve current leads of less-critical tasks − Focus on developer multipliers − Establish team culture and norms − Focus on "end-to-end" system − Document design path and replan − Track new architectures, maintain portability − Make sure we don’t accumulate too much “technical debt” 18