NOAO Brown Bag Tucson, AZ March 11, 2008 Jeff Kantor LSST Corporation Requirements Flowdown with LSST SysML and UML Models.

Slides:



Advertisements
Similar presentations
The Australian Virtual Observatory e-Science Meeting School of Physics, March 2003 David Barnes.
Advertisements

CLEARSPACE Digital Document Archiving system INTRODUCTION Digital Document Archiving is the process of capturing paper documents through scanning and.
C6 Databases.
Astronomy of the Next Decade: From Photons to Petabytes R. Chris Smith AURA Observatory in Chile CTIO/Gemini/SOAR/LSST.
NOAO Brown Bag May 13, 2008 Tucson, AZ 1 Data Management Middleware NOAO Brown Bag Tucson, AZ May 13, 2008 Jeff Kantor LSST Corporation.
Managing Data Resources
Jeff Kantor LSST Data Management Systems Manager LSST Corporation Institute for Astronomy University of Hawaii Honolulu, Hawaii June 19, 2008 LSST Data.
Building a Framework for Data Preservation of Large-Scale Astronomical Data ADASS London, UK September 23-26, 2007 Jeffrey Kantor (LSST Corporation), Ray.
Astrophysics, Biology, Climate, Combustion, Fusion, Nanoscience Working Group on Simulation-Driven Applications 10 CS, 10 Sim, 1 VR.
Panel Summary Andrew Hanushevsky Stanford Linear Accelerator Center Stanford University XLDB 23-October-07.
1 Exploring Data Reliability Tradeoffs in Replicated Storage Systems NetSysLab The University of British Columbia Abdullah Gharaibeh Matei Ripeanu.
1 HOT-WIRING THE TRANSIENT UNIVERSE | SANTA BARBARA, CA | MAY 12-15, 201 Name of Meeting Location Date - Change in Slide Master LSST Alert Production Pipelines.
Introduction to LSST Data Management Jeffrey Kantor Data Management Project Manager.
Class 6 Data and Business MIS 2000 Updated: September 2012.
Class 3 Data and Business MIS 2000 Updated: January 2014.
Computing in Atmospheric Sciences Workshop: 2003 Challenges of Cyberinfrastructure Alan Blatecky Executive Director San Diego Supercomputer Center.
Scientific Data Infrastructure in CAS Dr. Jianhui Scientific Data Center Computer Network Information Center Chinese Academy of Sciences.
Commissioning the NOAO Data Management System Howard H. Lanning, Rob Seaman, Chris Smith (National Optical Astronomy Observatory, Data Products Program)
The Worlds of Database Systems Chapter 1. Database Management Systems (DBMS) DBMS: Powerful tool for creating and managing large amounts of data efficiently.
Database Design – Lecture 16
Global NetWatch Copyright © 2003 Global NetWatch, Inc. Factors Affecting Web Performance Getting Maximum Performance Out Of Your Web Server.
Peking University Astronomy Symposium 10/17/2010 Large Synoptic Survey Telescope.
National Center for Supercomputing Applications Observational Astronomy NCSA projects radio astronomy: CARMA & SKA optical astronomy: DES & LSST access:
1 New Frontiers with LSST: leveraging world facilities Tony Tyson Director, LSST Project University of California, Davis Science with the 8-10 m telescopes.
1 Radio Astronomy in the LSST Era – NRAO, Charlottesville, VA – May 6-8 th LSST Survey Data Products Mario Juric LSST Data Management Project Scientist.
MASSACHUSETTS INSTITUTE OF TECHNOLOGY NASA GODDARD SPACE FLIGHT CENTER ORBITAL SCIENCES CORPORATION NASA AMES RESEARCH CENTER SPACE TELESCOPE SCIENCE INSTITUTE.
1 Introduction to Database Systems. 2 Database and Database System / A database is a shared collection of logically related data designed to meet the.
Data Management Subsystem Jeff Valenti (STScI). DMS Context PRDS - Project Reference Database PPS - Proposal and Planning OSS - Operations Scripts FOS.
Introduction to Hadoop and HDFS
1 Hot-Wiring the Transient Universe Santa Barbara CA May 12, 2015 LSST + Tony Tyson UC Davis LSST Chief Scientist.
MWA Data Capture and Archiving Dave Pallot MWA Conference Melbourne Australia 7 th December 2011.
DATABASE MANAGEMENT SYSTEMS IN DATA INTENSIVE ENVIRONMENNTS Leon Guzenda Chief Technology Officer.
Professor Michael J. Losacco CIS 1110 – Using Computers Database Management Chapter 9.
1 Jacek Becla, XLDB-Europe, CERN, May 2013 LSST Database Jacek Becla.
LSST: Preparing for the Data Avalanche through Partitioning, Parallelization, and Provenance Kirk Borne (Perot Systems Corporation / NASA GSFC and George.
C6 Databases. 2 Traditional file environment Data Redundancy and Inconsistency: –Data redundancy: The presence of duplicate data in multiple data files.
The Worlds of Database Systems From: Ch. 1 of A First Course in Database Systems, by J. D. Pullman and H. Widom.
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
1 FINAL DESIGN REVIEW | TUCSON, AZ | OCTOBER 21-25, 2013 Name of Meeting Location Date - Change in Slide Master Title of Presentation Andrew Connolly LSST.
DC2 Post-Mortem/DC3 Scoping February 5 - 6, 2008 DC3 Goals and Objectives Jeff Kantor DM System Manager Tim Axelrod DM System Scientist.
4 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved. Computer Software Chapter 4.
Research Networks and Astronomy Richard Schilizzi Joint Institute for VLBI in Europe
EScience May 2007 From Photons to Petabytes: Astronomy in the Era of Large Scale Surveys and Virtual Observatories R. Chris Smith NOAO/CTIO, LSST.
1 Computing Challenges for the Square Kilometre Array Mathai Joseph & Harrick Vin Tata Research Development & Design Centre Pune, India CHEP Mumbai 16.
Enabling Reuse-Based Software Development of Large-Scale Systems IEEE Transactions on Software Engineering, Volume 31, Issue 6, June 2005 Richard W. Selby,
Oracle's Distributed Database Bora Yasa. Definition A Distributed Database is a set of databases stored on multiple computers at different locations and.
Chapter 3 Databases and Data Warehouses: Building Business Intelligence Copyright © 2010 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
1 CORE DATA PROCESSING SOFTWARE PLAN REVIEW | SEATTLE, WA | SEPTEMBER 19-20, 2013 Name of Meeting Location Date - Change in Slide Master Data Release Processing.
Astronomy, Petabytes, and MySQL MySQL Conference Santa Clara, CA April 16, 2008 Kian-Tat Lim Stanford Linear Accelerator Center.
CIS/SUSL1 Fundamentals of DBMS S.V. Priyan Head/Department of Computing & Information Systems.
Peter Lee Head, Computer Science Department Carnegie Mellon University.
LSST VAO Meeting March 24, 2011 Tucson, AZ. Headquarters Site Headquarters Facility Observatory Management Science Operations Education and Public Outreach.
1 NGVLA WORKSHOP – DECEMBER 8, 2015 – NRAO SOCORRO, NM Name of Meeting Location Date - Change in Slide Master Computing for ngVLA: Lessons from LSST Jeffrey.
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
Fall CSE330/CIS550: Introduction to Database Management Systems Prof. Susan Davidson Office: 278 Moore Office hours: TTh
The LSST Data Processing Software Stack Tim Jenness (LSST Tucson) for the LSST Data Management Team Abstract The Large Synoptic Survey Telescope (LSST)
HERY H AZWIR Syllabus Introduction to Computer & IT.
Mountaintop Software for the Dark Energy Camera Jon Thaler 1, T. Abbott 2, I. Karliner 1, T. Qian 1, K. Honscheid 3, W. Merritt 4, L. Buckley-Geer 4 1.
TM 8-1 Copyright © 1999 Addison Wesley Longman, Inc. Client/Server and Middleware.
Class 3 Data and Business MIS 2000 Updated: Jan
The Importance of IS Management Chapter 1 Information Systems Management In Practice 5E McNurlin & Sprague.
Tackling I/O Issues 1 David Race 16 March 2010.
Software System Performance CS 560. Performance of computer systems In most computer systems:  The cost of people (development) is much greater than.
Smart Grid Big Data: Automating Analysis of Distribution Systems Steve Pascoe Manager Business Development E&O - NISC.
© Chinese University, CSE Dept. Software Engineering / Software Engineering Topic 1: Software Engineering: A Preview Your Name: ____________________.
LSST Commissioning Overview and Data Plan Charles (Chuck) Claver Beth Willman LSST System Scientist LSST Deputy Director SAC Meeting.
What to Expect at the LSST Archive: The LSST Science Platform Mario Juric, University of Washington LSST Data Management Subsystem Scientist for the.
LSST Commissioning Overview and Data Plan Charles (Chuck) Claver Beth Willman LSST System Scientist LSST Deputy Director SAC Meeting.
Optical Survey Astronomy DATA at NCSA
Click to add title Planning for LSST Verification George Angeli LSST All Hands Meeting Tucson August 15, 2016.
Presentation transcript:

NOAO Brown Bag Tucson, AZ March 11, 2008 Jeff Kantor LSST Corporation Requirements Flowdown with LSST SysML and UML Models

NOAO Brown Bag March 11, 2008 Tucson, AZ 2 Presentation Outline LSST Data Management introduction Requirements flow-down Enterprise Architect SysML/UML demonstration

NOAO Brown Bag March 11, 2008 Tucson, AZ 3 Data Management is a distributed system that leverages world-class facilities and cyber-infrastructure Long-Haul Communications Chile - U.S. & w/in U.S. 2.5 Gbps avg, 10 Gbps peak Archive Center NCSA, Champaign, IL 100 to 250 TFLOPS, 75 PB Data Access Centers U.S. (2) and Chile (1) 45 TFLOPS, 87 PB Mountain Summit/Base Facility Cerro Pachon, La Serena, Chile 25 TFLOPS, 150 TB 1 TFLOPS = 10^12 floating point operations/second 1 PB = 2^50 bytes or ~10^15 bytes

NOAO Brown Bag March 11, 2008 Tucson, AZ 4 LSST Data Management provides a unique national resource for research & education Astronomy and astrophysics –Scale and depth of LSST database is unprecedented in astronomy Provides calibrated databases for frontier science Breaks new ground with combination of depth, width, epochs/field Enables science that cannot be anticipated today Cyber-infrastructure and computer science –Requires multi-disciplinary approach to solving challenges Massively parallel image data processing Peta-scale data ingest and data access Efficient scientific and quality analysis of peta-scale data

NOAO Brown Bag March 11, 2008 Tucson, AZ 5 DM system complexity exists but overall is tractable Complexities we have to deal with in DM –Very high data volumes (transfer, ingest, and especially query) –Advances in scale of algorithms for photometry, astrometry, PSF estimation, moving object detection, shape measurement of faint galaxies –Provenance recording and reprocessing –Evolution of algorithms and technology Complexities we DON’T have to deal with in DM –Tens of thousands of simultaneous users (e.g. online stores) –Fusion of remote sensing data from many sources (e.g. earthquake prediction systems) –Millisecond or faster time constraints (e.g. flight control systems) –Very deeply nested multi-level transactions (e.g. banking OLTP systems) –Severe operating environment-driven hardware limitations (e.g. space- borne instruments) –Processing that is highly coupled across entire data set with large amount of inter-process communication (e.g. geophysics 3D Kirchhoff migration)

NOAO Brown Bag March 11, 2008 Tucson, AZ 6 Performance - Nightly processing timeline for a visit meets alert latency requirement Exposure 1 Exposure 2 Shutter close Time (sec) Readout complete Transfer to Base complete Image Processing/ Detection complete Association complete Alert generate complete Shutter close Readout complete Transfer to Base complete Image Processing/ Detection complete 2s6s20s Exposure begins 15s T0 - Start of 60 second latency timer 3s6s20s10s T0 + 51s 2s Exposure begins 15s

NOAO Brown Bag March 11, 2008 Tucson, AZ 7 Archive Center Base Data Access Center Archive Center Trend Line Computing needs show moderate growth

NOAO Brown Bag March 11, 2008 Tucson, AZ 8 Database Volumes Detailed spreadsheet-based analysis done Expecting: –6 petabytes of data, 14 petabytes data+indexes –all tables: ~16 trillion rows (16x10 12 ) –largest table: 3 trillion rows (3x10 12 )

NOAO Brown Bag March 11, 2008 Tucson, AZ 9 Cerro Pachon La Serena Long-haul communications are feasible Over 2 terabytes/second dark fiber capacity available Only new fiber is Cerro Pachon to La Serena (~100 km) 2.4 gigabits/second needed from La Serena to Champaign, IL Quotes from carriers include 10 gigabit/second burst for failure recovery Specified availability is 98% Clear channel, protected circuits

NOAO Brown Bag March 11, 2008 Tucson, AZ 10 Science Requirements Document Telescope, Camera, Survey Reference Designs Data Management Requirements Data Management Design Complete, traceable flow-down from science to system, to data management subsystem Allocation Traceability DMS Sizing Models: Processing, Storage, Communications Allocation Traceability

NOAO Brown Bag March 11, 2008 Tucson, AZ 11 Science Requirements Document System Requirements, Telescope, Camera, Survey Reference Designs Key Specified and Derived Requirements The mission, instrument design, observing cadence and observatory operational requirements drive the DM requirements Data Management Requirements Data Management Design Data Products Images Catalogs Alerts Quality and Performance Statistics Algorithms/Pipelines Astrometric/Photometric Calibration Source Detection Source - Object Association Moving Object Detection/Orbit Matching Alert Processing Deep Detection Calibration Classification Architectural Scalabiity Reliability/Availability Evolution Science and system requirements flow-down using SysML Allocation Traceability System Modeling Language (SysML)

NOAO Brown Bag March 11, 2008 Tucson, AZ 12 DMS Sizing Models Computational Requirements Processing Sustained & peak processing analyzed Tradeoffs considered: Store vs. recompute Types of parallelism Reliability vs cost Storage Sustained & peak I/O rates and storage needs analyzed Tradeoffs considered: Store vs. recompute DBMS vs File System Multi-dimensional access Reliability vs cost Communications Sustained & peak bandwidth analyzed Tradeoffs considered: Transfer and process vs process and transfer Media transfer vs network Reliability vs cost Data Management Requirements Data Management Design Storage and Input/Output Requirements Data Transfer, Replication, And Access Requirements Performance requirements analyzed & feasible

NOAO Brown Bag March 11, 2008 Tucson, AZ 13 Systems Engineering Model for Requirements Flowdown, Traceability & Configuration Control SysML = System Modeling Language SE Model Operational Model Structural/Component Model Requirements Model Other Engineering Analysis Models Performance & Constraints Model FEMAP NX Nastran

NOAO Brown Bag March 11, 2008 Tucson, AZ 14 Requirements Flow SRD Telescope Site Req. Camera Req. Data Management Req. Functional Req. (FPRD) Operational Req. (OCDD) Interface Requirements Outside Constraints System Requirements SysML Model DM Subsystems Req. UML Model T&S Subsystems Req. SysML Model Camera Subsystems Req. LSST Board & Science Council Project Office Change Control Board Project Office Change Control Board Subsystem Group

NOAO Brown Bag March 11, 2008 Tucson, AZ 15 Requirements Hierarchy

NOAO Brown Bag March 11, 2008 Tucson, AZ 16 Rigorous process for software engineering based on wide industry experience (Iconix) Algorithm/Pipeline Data Product Prototypes UML Models Image courtesy of Iconix Software Engineering, Inc. Unauthorized use not permitted. Unified Modeling Language (UML)

NOAO Brown Bag March 11, 2008 Tucson, AZ 17 Demo of Enterprise Architect Tool for SysML and UML System Engineering (SysML model is in “DM SysML.pdf”) Science Requirements DM Functional/Performance Requirements Use Cases Software Engineering (UML model is in “DM UML.pdf”) Use Cases/Robustness Diagrams Class Diagrams/Sequence Diagrams Code