Astronomy, Petabytes, and MySQL MySQL Conference Santa Clara, CA April 16, 2008 Kian-Tat Lim Stanford Linear Accelerator Center
MySQL Conference April 16, 2008 Santa Clara, CA / 47 2 Outline LSST LSST Database LSST Database + MySQL
MySQL Conference April 16, 2008 Santa Clara, CA / 47 3 LSST What Is It? Why Build It?
MySQL Conference April 16, 2008 Santa Clara, CA / 47 4 LSST What Is It? Why Build It?
MySQL Conference April 16, 2008 Santa Clara, CA / 47 5 Telescope Proposed telescope to be built in Chile
MySQL Conference April 16, 2008 Santa Clara, CA / 47 6 Large 3.2 gigapixel camera 8.4 meter diameter mirror
MySQL Conference April 16, 2008 Santa Clara, CA / 47 7 Synoptic Survey Wide Deep Fast
MySQL Conference April 16, 2008 Santa Clara, CA / 47 8 LSST What Is It? Why Build It?
MySQL Conference April 16, 2008 Santa Clara, CA / 47 9 Dark Matter and Energy Photo: J. A. Tyson, W. Colley, E. L. Turner, and NASA
MySQL Conference April 16, 2008 Santa Clara, CA / Variable Objects
MySQL Conference April 16, 2008 Santa Clara, CA / Transient Objects
MySQL Conference April 16, 2008 Santa Clara, CA / Moving Objects Photo: D. Roddy, Lunar and Planetary Institute
MySQL Conference April 16, 2008 Santa Clara, CA / LSST Database What’s In It? How Big? How Often? What Queries? Unusual Needs
MySQL Conference April 16, 2008 Santa Clara, CA / LSST Database What’s In It? How Big? How Often? What Queries? Unusual Needs
MySQL Conference April 16, 2008 Santa Clara, CA / Database: Components Image Metadata Moving Objects Catalog Object Catalog Source Catalog Difference Image Source Catalog Provenance Statistics Summaries Calibration Engineering and Facility Database
MySQL Conference April 16, 2008 Santa Clara, CA / Astronomical Objects Image Metadata Moving Objects Catalog Object Catalog Source Catalog Difference Image Source Catalog Provenance Statistics Summaries Calibration Engineering and Facility Database
MySQL Conference April 16, 2008 Santa Clara, CA / Sources Image Metadata Moving Objects Catalog Object Catalog Source Catalog Difference Image Source Catalog Provenance Statistics Summaries Calibration Engineering and Facility Database
MySQL Conference April 16, 2008 Santa Clara, CA / Changes Image Metadata Moving Objects Catalog Object Catalog Source Catalog Difference Image Source Catalog Provenance Statistics Summaries Calibration Engineering and Facility Database
MySQL Conference April 16, 2008 Santa Clara, CA / Image Metadata Moving Objects Catalog Object Catalog Source Catalog Difference Image Source Catalog Provenance Statistics Summaries Calibration Engineering and Facility Database
MySQL Conference April 16, 2008 Santa Clara, CA / Calibration and Facility Image Metadata Moving Objects Catalog Object Catalog Source Catalog Difference Image Source Catalog Provenance Statistics Summaries Calibration Engineering and Facility Database
MySQL Conference April 16, 2008 Santa Clara, CA / LSST Database What’s In It? How Big? How Often? What Queries? Unusual Needs
MySQL Conference April 16, 2008 Santa Clara, CA / Sagans of Rows 49 billion objects 2.8 trillion sources
MySQL Conference April 16, 2008 Santa Clara, CA / Lots of Columns 308 columns for objects 56 columns for sources (for now)
MySQL Conference April 16, 2008 Santa Clara, CA / Database Size Grows to >14 PB
MySQL Conference April 16, 2008 Santa Clara, CA / LSST Database What’s In It? How Big? How Often? What Queries? Unusual Needs
MySQL Conference April 16, 2008 Santa Clara, CA / Frequency Nightly updates Semi-annual data releases
MySQL Conference April 16, 2008 Santa Clara, CA / LSST Database What’s In It? How Big? How Often? What Queries? Unusual Needs
MySQL Conference April 16, 2008 Santa Clara, CA / Queries All about an object All objects meeting criteria All objects near objects meeting criteria All objects with interesting time series All pairs of objects with similar time series
MySQL Conference April 16, 2008 Santa Clara, CA / LSST Database What’s In It? How Big? How Often? What Queries? Unusual Needs
MySQL Conference April 16, 2008 Santa Clara, CA / Unusual Needs Flexibility Provenance
MySQL Conference April 16, 2008 Santa Clara, CA / LSST Database + MySQL Why MySQL? Scalability? Performance?
MySQL Conference April 16, 2008 Santa Clara, CA / LSST Database + MySQL Why MySQL? Scalability? Performance?
MySQL Conference April 16, 2008 Santa Clara, CA / MySQL Relational database management system
MySQL Conference April 16, 2008 Santa Clara, CA / Open Source Vibrant community Strong company support
MySQL Conference April 16, 2008 Santa Clara, CA / Hardware Runs on commodity hardware
MySQL Conference April 16, 2008 Santa Clara, CA / In-Memory Tables Needed for near-real-time processing
MySQL Conference April 16, 2008 Santa Clara, CA / LSST Database + MySQL Why MySQL? Scalability? Performance?
MySQL Conference April 16, 2008 Santa Clara, CA / “MySQL Grid”
MySQL Conference April 16, 2008 Santa Clara, CA / Partitioning Large tables partitioned spatially
MySQL Conference April 16, 2008 Santa Clara, CA / Replication Dimension tables likely replicated
MySQL Conference April 16, 2008 Santa Clara, CA / Needs: Distributor/Combiner LSST will build prototype Need long-term support
MySQL Conference April 16, 2008 Santa Clara, CA / LSST Database + MySQL Why MySQL? Scalability? Performance?
MySQL Conference April 16, 2008 Santa Clara, CA / Per-Column Indexing 2X data size
MySQL Conference April 16, 2008 Santa Clara, CA / Needs: Optimizer Efficient use of multiple (20- 30) indexes
MySQL Conference April 16, 2008 Santa Clara, CA / Needs: Indexes Bitmap/compressed indexes
MySQL Conference April 16, 2008 Santa Clara, CA / Needs: Storage Engine “Shared scan” for long- running full-table queries
MySQL Conference April 16, 2008 Santa Clara, CA / Summary Building a petabyte DB MySQL can be a core component