GSSC LAT Data Server Overview
Tom Stephens, GSSC Database Lead
LAT Data Server Workshop, Jan 13-14, 2005


Outline
• Definitions
• Requirements
• Design Goals
• Overall System Architecture
• Implementation Details
• Benchmarks

Database Definitions
• Photon Database (D1ph) – Database that holds all LAT events considered photons and that were used to construct the IRFs. This is the primary science database.
• Event Database (D1ev) – Database that holds (possibly) all reconstructed LAT Events, both photons and particles.
• Pointing and Livetime History Database (D2) – Database that holds spacecraft attitude, position and instrument status information in 30 sec intervals.

D1 Search Definitions
• “Standard” Search
  – 15° radius circle or 30° x 30° box on the sky for a time period of one year (LESDR )
  – For photon database this is MBytes of data depending on sky position
• “Large” Search
  – Photon database: Search that would return more than 2 GBytes of data per year of observation (LESDR )
  – Event database: Search that would return more than 20 GBytes of data (LESDR )
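The circular form of the standard search amounts to an angular-separation test on the sky. The following is a minimal sketch, not the GSSC implementation: the function names and the use of RA/Dec in degrees are assumptions made for illustration, and the 30° x 30° box variant is left out because the slides do not say how the box is defined on the sphere.

```python
import math

def angular_separation_deg(ra1, dec1, ra2, dec2):
    """Great-circle separation in degrees between two sky positions (haversine formula)."""
    ra1, dec1, ra2, dec2 = map(math.radians, (ra1, dec1, ra2, dec2))
    a = (math.sin((dec2 - dec1) / 2) ** 2
         + math.cos(dec1) * math.cos(dec2) * math.sin((ra2 - ra1) / 2) ** 2)
    # Clamp to 1.0 to guard against rounding just above the asin domain.
    return math.degrees(2 * math.asin(math.sqrt(min(1.0, a))))

def in_standard_circle(ra, dec, center_ra, center_dec, radius_deg=15.0):
    """True if (ra, dec) lies within the search circle (hypothetical helper)."""
    return angular_separation_deg(ra, dec, center_ra, center_dec) <= radius_deg
```

A photon at (RA, Dec) = (0°, 10°) would pass a standard circle centered on (0°, 0°), while one at (20°, 0°) would not.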

D1 Database Design Requirements
• Search Parameters
  – Search on values that are real or integer numbers, Booleans, dates and times. (LESDR )
  – Times searchable to microsecond precision (LESDR )
  – 2-D positions on sphere (LESDR )
  – Data quality (LESDR )
• The database must be remotely accessible. (LESDR )
• Portability – must not be tied to a single architecture or software system. (LESDR )
• HEASARC Compatibility
  – Database will be turned over to HEASARC at the end of mission (LESDR )
  – Must not require excessive effort (>1 FTE) to maintain. (LESDR )

Photon Database Performance Requirements
• Derived from statistics of current satellite data archives
• Search Speeds
  – Standard Search – Data returned within 30 minutes per year of data searched. (LESDR )
  – Standard Search with additional sub-selections – All data returned within 45 minutes per year of data searched. (LESDR )
  – Large Search – All data returned within 3 days. Allows for processing during off peak hours. (LESDR )
• Number of Requests
  – Must perform up to 60 standard searches a day. (LESDR )
• Data Ingest
  – Ingest of new data must be complete within 10 minutes for a 5 hour observation data set (LESDR )
  – Ingest of reprocessed data may interrupt database access for no more than 60 minutes for a 5 hour observation data set. (LESDR )
• Database Restoration
  – Must be able to restore database after a crash in <3 days per year of data (LESDR )

Event Database Performance Requirements
• Search Speeds
  – Standard Search – All data returned within 10 hours per year of data searched. (LESDR )
  – Standard Search with additional sub-selections – All data returned within 15 hours per year of data searched. (LESDR )
  – Large Search – All data returned within 7 days. (LESDR )
• Number of Requests
  – Must be able to perform up to 1 standard search a day. (LESDR )
• Data Ingest
  – Ingest of new data must be complete within 100 minutes for a 5 hour observation data set. (LESDR )
  – Ingest of reprocessed data may interrupt database access for no more than 10 hours for a 5 hour observation data set. (LESDR )
• Database Restoration
  – Must be able to restore database after a crash in <1 week per year of data (LESDR )
• Requirements are generous and design goals provide better performance

D2 Database Design Requirements
• Search Speed
  – Retrieve 6 months of consecutive data (~50 MBytes) in 1 minute (SAEDR )
• Number of searches
  – Must be able to handle >1500 searches a day (SAEDR )
• Data Ingest
  – Ingest of new data (5 hours of spacecraft operation) in 1 minute (SAEDR )
  – Ingest of reprocessed data (5 hour period) in 5 minutes (SAEDR )
• Database Restoration
  – Must be able to restore database after a crash in <1 day (SAEDR )

Database Design Goals
| Operation | Design Requirement | Design Goals | Current Performance |
| Standard D1 photon search – 1 year of data | 30 min | 1 min | ~40 sec |
| Standard D1 event search – 1 year of data | 10 hrs | 30 min | N/T |
| D2 search – 6 months of data | 60 sec | | 7 sec |
| D1 photon ingest, new data – 5 hours of data | 10 min | 2 min | 0.5-5 min |
| D1 event ingest, new data – 5 hours of data | 100 min | 20 min | N/T |
| D1 photon ingest, reprocessed data – 5 hours of data | 60 min | 12 min | N/T |
| D1 event ingest, reprocessed data – 5 hours of data | 10 hrs | 2 hrs | N/T |
| D2 Ingest, new data – 5 hours of data | 1 min | | 10 sec |
| D2 Ingest, reprocessed data – 5 hours of data | 5 min | 1 min | N/T |
| D1 photon Data Restoration – year of data | 3 days | 3 hrs | 10 min |
| D1 event Data Restoration – year of data | 7 days | 3 days | N/T |
| D2 Data Restoration – entire database | 1 day | 1 hr | 10 min |
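To put the margins in the design-goal figures into numbers, the short sketch below converts three rows with all three values present to seconds and computes how many times faster the goal and the current performance are than the requirement. The row labels, the helper names and the treatment of "~40 sec" as exactly 40 seconds are illustrative assumptions.

```python
# Seconds per unit, used to put every entry on a common scale.
UNIT_SECONDS = {"sec": 1, "min": 60, "hr": 3600, "day": 86400}

def seconds(value, unit):
    """Convert a (value, unit) pair to seconds."""
    return value * UNIT_SECONDS[unit]

# (requirement, design goal, current performance), taken from the slide.
ROWS = {
    "D1 photon search, 1 year": ((30, "min"), (1, "min"), (40, "sec")),  # ~40 sec, approximate
    "D1 photon restoration, 1 year": ((3, "day"), (3, "hr"), (10, "min")),
    "D2 restoration, entire database": ((1, "day"), (1, "hr"), (10, "min")),
}

def headroom(row):
    """Return (requirement / goal, requirement / current) as speed-up factors."""
    requirement, goal, current = (seconds(*entry) for entry in row)
    return requirement / goal, requirement / current

for name, row in ROWS.items():
    goal_factor, current_factor = headroom(row)
    print(f"{name}: goal {goal_factor:.0f}x, current {current_factor:.0f}x faster than required")
```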

D1/D2 Database System Design
[Diagram. Components shown: BROWSE, Custom Web Interface, GSSC Internal Tools, Queue Manager, MySQL Database, Ingest System, Event Database, Photon Database, Pointing and Livetime History Database]

D1 Photon Database Design
[Diagram. Components shown: Control Process; Photon Data (Master Copy); multiple Search Processes, each with its own Photon Data. Labels on the connections: Query Request, Query Parameters, Files to Search, Selected Data, Search Results]
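The control/search split in this design can be illustrated with a small fan-out sketch. Everything here is an assumption made for illustration: a thread pool stands in for the separate search nodes, the file list is divided round-robin, photons are plain dictionaries with a `time` key, and `search_files` and `control_process` are hypothetical names rather than GSSC code.

```python
from concurrent.futures import ThreadPoolExecutor

def search_files(files, query, storage):
    """Hypothetical search process: scan the assigned files and keep matching photons."""
    selected = []
    for name in files:
        for photon in storage[name]:
            if query["t_min"] <= photon["time"] < query["t_max"]:
                selected.append(photon)
    return selected

def control_process(query, files_to_search, storage, n_search=5):
    """Hypothetical control process: hand out files, then merge the selected data."""
    # Round-robin split of the file list across the search processes.
    chunks = [files_to_search[i::n_search] for i in range(n_search)]
    with ThreadPoolExecutor(max_workers=n_search) as pool:
        partial = pool.map(search_files, chunks,
                           [query] * n_search, [storage] * n_search)
        results = [photon for part in partial for photon in part]
    # Return a single time-ordered result set to the requester.
    return sorted(results, key=lambda photon: photon["time"])
```

Because every search node holds a full copy of the photon data, the control process only needs to send query parameters and file names, not the data itself.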

Photon Database Internal Storage
• All data is in HEASARC compatible FITS files
• Each node (control and search) has a complete copy of the photon data.
  – Fast data access from internal disk
  – Multiple backups in case of failure of a single data disk
• Data broken into sky regions and time periods in internal data files
• Hierarchical Triangular Mesh (HTM) used to define regions
  – Developed for Sloan Digital Sky Survey at Johns Hopkins
  – Recursively divides sky into spherical triangles
• Conducted trade study to determine optimal combination of HTM pixelization level and time binning
  – Best time of ~39 sec was level 3 pixelization (512 sky regions) with 2 month time bins
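The figure of 512 regions follows from the HTM construction: the sphere starts as eight root triangles (four northern, four southern), and each subdivision splits every triangle into four. The sketch below assumes that "level" counts subdivisions of the root triangles, which reproduces the 512 regions quoted above, and uses the usual HTM naming of a root (`N0`-`N3`, `S0`-`S3`) followed by one digit per subdivision. The function names and the files-per-year estimate are illustrative assumptions.

```python
from itertools import product

# The eight root triangles of the HTM: four northern, four southern.
ROOTS = ["N0", "N1", "N2", "N3", "S0", "S1", "S2", "S3"]

def htm_region_count(level):
    """Number of spherical triangles after `level` subdivisions of the roots."""
    return 8 * 4 ** level

def htm_region_names(level):
    """All region names at a level: a root followed by `level` child digits (0-3)."""
    return [root + "".join(digits)
            for root in ROOTS
            for digits in product("0123", repeat=level)]

names = htm_region_names(3)
print(htm_region_count(3), len(names))

# Assumption: one file per region per 2-month bin gives an upper bound per year.
files_per_year = htm_region_count(3) * (12 // 2)
```

Under these assumptions a year of data is spread over at most 3072 internal files, and a name such as `N3321` identifies one level 3 region.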

File Metadata Database I
• Currently 3 database tables (will eventually be 7)
• Ingest_data – version information for database
  – Database name – (Photon, Event, Spacecraft)
  – Start time of current data file (Mission Elapsed Time seconds)
  – Current file version – incremented if reprocessed data is received for this file, reset to 0 upon creation of a new file
  – Database version – incremented every time reprocessed data is received. Will allow “roll-back” to an earlier version of the database if necessary
• Photon_file_comp – what composes the data files
  – Filename base – A stub that contains the file data start time and version number of the set of data files the input data was added to.
  – Input filename – The name of the data file that was ingested
  – Ingest date – The date the file was added to the data set.

File Metadata Database II
• Photon_file_data – What is in the actual files
  – Filename – The name of the internal data file
  – Date modified – The date and time the file was last modified
  – N_photons – The number of photons in the data file
  – startTime – The start time of the data file
  – stopTime – The end time of the data file
  – First_DB_version – The first database version the file is valid in
  – Last_DB_version – The last database version the file is valid in
  – HTMpixel – The HTM pixel the file corresponds to.

Sample photon_file_data Entries
| filename | modified | n_photons | start_time | stop_time | f_DB_ver | l_DB_ver | HTM_pix |
| N3321_ _V01.fits | :01:22 | | | | 1 | 1 | N3321 |
| N3321_ _V01.fits | :23:06 | | | | 1 | 1 | N3321 |
| N3321_ _V01.fits | :15:23 | | | | 1 | 1 | N3321 |
| N3321_ _V01.fits | :52:05 | | | | 1 | 1 | N3321 |
| N3321_ _V01.fits | :51:22 | | | | 1 | 1 | N3321 |
| N3321_ _V01.fits | :26:23 | | | | 1 | 1 | N3321 |
| N3321_ _V01.fits | :40:23 | 7789 | | | 1 | 1 | N3321 |
| S3321_ _V01.fits | :00:30 | | | | 1 | 1 | S3321 |
| S3321_ _V01.fits | :22:15 | | | | 1 | 1 | S3321 |
| S3321_ _V01.fits | :14:30 | | | | 1 | 1 | S3321 |
| S3321_ _V01.fits | :51:10 | | | | 1 | 1 | S3321 |
| S3321_ _V01.fits | :50:24 | | | | 1 | 1 | S3321 |
| S3321_ _V01.fits | :25:17 | | | | 1 | 1 | S3321 |
| S3321_ _V01.fits | :40:20 | 2239 | | | 1 | 1 | S3321 |
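One way to see how this table could drive a search is a query that picks the files overlapping a time range, for a set of HTM pixels, at a given database version. The sketch below is an illustration only: it uses Python's built-in `sqlite3` in place of the MySQL database named in the slides, the column types are assumptions, the `files_to_search` helper is hypothetical, and the sample rows (including the file names and times) are made up.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Column names follow the sample listing; the types are assumptions.
conn.execute("""
    CREATE TABLE photon_file_data (
        filename   TEXT PRIMARY KEY,
        modified   TEXT,
        n_photons  INTEGER,
        start_time REAL,     -- Mission Elapsed Time, seconds
        stop_time  REAL,
        f_DB_ver   INTEGER,  -- first database version the file is valid in
        l_DB_ver   INTEGER,  -- last database version the file is valid in
        HTM_pix    TEXT
    )""")

# Hypothetical rows, for illustration only.
conn.executemany("INSERT INTO photon_file_data VALUES (?,?,?,?,?,?,?,?)", [
    ("N3321_0_V01.fits", "2005-01-01 00:00:00", 100, 0.0, 5.0e6, 1, 1, "N3321"),
    ("N3321_0_V02.fits", "2005-01-02 00:00:00", 110, 0.0, 5.0e6, 2, 2, "N3321"),
    ("S3321_0_V01.fits", "2005-01-01 00:00:00", 50, 0.0, 5.0e6, 1, 2, "S3321"),
    ("N3321_5000000_V01.fits", "2005-01-03 00:00:00", 80, 5.0e6, 1.0e7, 1, 2, "N3321"),
])

def files_to_search(conn, pixels, t_min, t_max, db_version):
    """Files that overlap [t_min, t_max) in the given pixels at one database version."""
    marks = ",".join("?" * len(pixels))
    sql = (f"SELECT filename FROM photon_file_data "
           f"WHERE HTM_pix IN ({marks}) "
           "AND start_time < ? AND stop_time > ? "
           "AND f_DB_ver <= ? AND l_DB_ver >= ? "
           "ORDER BY filename")
    rows = conn.execute(sql, [*pixels, t_max, t_min, db_version, db_version])
    return [row[0] for row in rows]
```

Filtering on the first and last valid database version is what would make the "roll-back" described earlier possible: asking for version 1 returns the original file, asking for version 2 returns its reprocessed replacement.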

[Figure: Screenshots – Search Page]

[Figure: Screenshots – Query Submitted]

[Figure: Screenshots – Results Page]

[Figure: Ingest Performance]

[Figure: Search Performance]