LSC Segment Database Duncan Brown Caltech LIGO-G060176-00-Z.

Slides:



Advertisements
Similar presentations
MQ Series Cross Platform Dominant Messaging sw – 70% of market Messaging API same on all platforms Guaranteed one-time delivery Two-Phase Commit Wide EAI.
Advertisements

11/12/2003LIGO Document G Z1 Data reduction for S3 I Leonor (UOregon), P Charlton (CIT), S Anderson (CIT), K Bayer (MIT), M Foster (PSU), S Grunewald.
TDPS Wireless v Enhancements E1 - Multi load E2 - Driver time scheduler.
DataMigrator 7.7 in Real Time
G O B E Y O N D C O N V E N T I O N WORF: Developing DB2 UDB based Web Services on a Websphere Application Server Kris Van Thillo, ABIS Training & Consulting.
--What is a Database--1 What is a database What is a Database.
15 Chapter 15 Web Database Development Database Systems: Design, Implementation, and Management, Fifth Edition, Rob and Coronel.
Fast Track to ColdFusion 9. Getting Started with ColdFusion Understanding Dynamic Web Pages ColdFusion Benchmark Introducing the ColdFusion Language Introducing.
Hands-On Microsoft Windows Server 2003 Networking Chapter 7 Windows Internet Naming Service.
Chapter 7 Managing Data Sources. ASP.NET 2.0, Third Edition2.
Passage Three Introduction to Microsoft SQL Server 2000.
Slide 1 of 9 Presenting 24x7 Scheduler The art of computer automation Press PageDown key or click to advance.
Query Processing in Mobile Databases
 2000 Deitel & Associates, Inc. All rights reserved. Chapter 24 – Web Servers (PWS, IIS, Apache, Jigsaw) Outline 24.1Introduction 24.2Microsoft Personal.
Discussion and conclusion The OGC SOS describes a global standard for storing and recalling sensor data and the associated metadata. The standard covers.
The ATLAS Production System. The Architecture ATLAS Production Database Eowyn Lexor Lexor-CondorG Oracle SQL queries Dulcinea NorduGrid Panda OSGLCG The.
1. When things go wrong: how to find SQL error Sveta Smirnova Principle Technical Support Engineer, Oracle.
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
Software Engineering 2003 Jyrki Nummenmaa 1 CASE Tools CASE = Computer-Aided Software Engineering A set of tools to (optimally) assist in each.
1 Overview of Databases. 2 Content Databases Example: Access Structure Query language (SQL)
Python MySQL Database Access
CCP4(i) Database Development Wanjuan (Wendy) Yang CCP4 Annual developer’s meeting March 28, 2006 York.
Marianne BargiottiBK Workshop – CERN - 6/12/ Bookkeeping Meta Data catalogue: present status Marianne Bargiotti CERN.
XA System-Link A Powerful Development Tool
CVSQL 2 The Revenge of the SQL. The present Read-only access to CVS repository logs Language is a subset of SQL XML interface for returning results Built-in.
IT 456 Seminar 5 Dr Jeffrey A Robinson. Overview of Course Week 1 – Introduction Week 2 – Installation of SQL and management Tools Week 3 - Creating and.
Application code Registry 1 Alignment of R-GMA with developments in the Open Grid Services Architecture (OGSA) is advancing. The existing Servlets and.
Bookkeeping Tutorial. Bookkeeping & Monitoring Tutorial2 Bookkeeping content  Contains records of all “jobs” and all “files” that are created by production.
Module 6: Implementing SQL Server Replication in an Enterprise Environment.
What’s new in Kentico CMS 5.0 Michal Neuwirth Product Manager Kentico Software.
1 Chapter Overview Performing Configuration Tasks Setting Up Additional Features Performing Maintenance Tasks.
4/5/2007Data handling and transfer in the LHCb experiment1 Data handling and transfer in the LHCb experiment RT NPSS Real Time 2007 FNAL - 4 th May 2007.
A Brief Documentation.  Provides basic information about connection, server, and client.
Guide to Linux Installation and Administration, 2e1 Chapter 11 Using Advanced Administration Techniques.
DDM Monitoring David Cameron Pedro Salgado Ricardo Rocha.
 Replication is the process of copying database information  Replication is used for:  Backing up your database  Migrating to a new server  Mirroring.
INTRODUCTION TO DBS Database: a collection of data describing the activities of one or more related organizations DBMS: software designed to assist in.
CMap Version 0.16 Ben Faga. CMap CMap Version 0.16 Bug fixes and code optimizations More intuitive menu system Asynchronous loading of comparative map.
Jan 12, 2009LIGO-G Z1 DMT and NDS2 John Zweizig LIGO/Caltech Ligo PAC, Caltech, Jan 12, 2009.
ClearQuest XML Server with ClearCase Integration Northwest Rational User’s Group February 22, 2007 Frank Scholz Casey Stewart
SAM Sensors & Tests Judit Novak CERN IT/GD SAM Review I. 21. May 2007, CERN.
11/12/2003LIGO-G Z1 Data reduction for S3 P Charlton (CIT), I Leonor (UOregon), S Anderson (CIT), K Bayer (MIT), M Foster (PSU), S Grunewald (AEI),
Bookkeeping Tutorial. 2 Bookkeeping content  Contains records of all “jobs” and all “files” that are produced by production jobs  Job:  In fact technically.
Database Connectivity and Server-Side Scripting Chapter 12.
Linux Operations and Administration
CP476 Internet Computing Perl CGI and MySql 1 Relational Databases –A database is a collection of data organized to allow relatively easy access for retrievals,
May 29, 2006 GWADW, Elba, May 27 - June 21 LIGO-G0200XX-00-M Data Quality Monitoring at LIGO John Zweizig LIGO / Caltech.
ViaSQL Technical Overview. Viaserv, Inc. 2 ViaSQL Support for S/390 n Originally a VSE product n OS/390 version released in 1999 n Identical features.
SPI NIGHTLIES Alex Hodgkins. SPI nightlies  Build and test various software projects each night  Provide a nightlies summary page that displays all.
Hyperion Artifact Life Cycle Management Agenda  Overview  Demo  Tips & Tricks  Takeaways  Queries.
1 Information Retrieval and Use De-normalisation and Distributed database systems Geoff Leese September 2008, revised October 2009.
C Copyright © 2006, Oracle. All rights reserved. Integrating with Oracle Streams.
Introduction to Core Database Concepts Getting started with Databases and Structure Query Language (SQL)
Tutorial on Science Gateways, Roma, Catania Science Gateway Framework Motivations, architecture, features Riccardo Rotondo.
Can you do this in SmarTeam?
© SCRIBE SOFTWARE CORPORATION 2008 Tips and Tricks for Working with Scribe Insight Trace Files.
Cofax Scalability Document Version Scaling Cofax in General The scalability of Cofax is directly related to the system software, hardware and network.
ISC321 Database Systems I Chapter 2: Overview of Database Languages and Architectures Fall 2015 Dr. Abdullah Almutairi.
SSIS ETL Data Resource Management. Create an ETL package using a wizard database server to database server The business goal of this ETL package is to.
1 Copyright © 2005, Oracle. All rights reserved. Oracle Database Administration: Overview.
MQ Series Cross Platform Dominant Messaging sw – 70% of market
2nd year Computer Science & Engineer
Cameron Blashka | Informer Implementation Specialist
z/Ware 2.0 Technical Overview
A Technical Overview of Microsoft® SQL Server™ 2005 High Availability Beta 2 Matthew Stephen IT Pro Evangelist (SQL Server)
POOL File Catalog: Design & Status
PHP / MySQL Introduction
Tiers vs. Layers.
Time Gathering Systems Secure Data Collection for IBM System i Server
MQ Series Cross Platform Dominant Messaging sw – 70% of market
Presentation transcript:

LSC Segment Database Duncan Brown Caltech LIGO-G Z

4/5/2003LIGO-G Z2 Goals of Segment Database Make interferometer state information available rapidly for S5 searches Allow automated insertion of DQ flags from DMT Reduce latency of releasing human generated data quality information Replicate DQ information between Caltech and Observatories Provide simple interface for accessing data that integrates with existing tools

4/5/2003LIGO-G Z3 Architecture Use IBM DB2 database for underlying engine »Reliable, well tested, runs on Solaris and Linux Write LSC specific client/servers in Python »Good interface between Python and DB2 (mxODBC) Base table design on existing database tables »Table design modified for using in segment database Use IBM Q-Replication for replication »Straightforward to set up a 3 element peer-to-peer network

4/5/2003LIGO-G Z4 Database Implementation Table designed based on existing table design »All inserts must have a process table »The segment_definer table describes a segment »The segment_def_map tables maps segment definitions to intervals »The segment table contains the start and end times Also have an LFN table and a grid_cert table »Track which frame file state information came from using LFNs »Track who inserted data with grid_cert table Otherwise straightforward DB2 database »DB2 instance created under the user ldbd on gateway machines

4/5/2003LIGO-G Z5 Replication Use IBM Q-Replication to set up peer-to-peer replication WebSphere MQ messages queues set up between CIT, LHO and LLO. At each site: »Two transmit queues (e.g. LHO_TO_LLO, LHO_TO_CIT) »Two receive queues (e.g. LLO_TO_LHO, CIT_TO_LHO) »Plus a couple of control queues Each site runs a capture server and an apply server »Capture sever pushes all incoming transactions into xmit q »Apply server gets transactions from recv q and applies to database Capture and apply servers store state in ctrl tables in database »Set up when database was created

4/5/2003LIGO-G Z6 Direct Clients Direct clients run on gateway machines and connect directly to the database. »Can be run as user ldbd or grid segpagegen »Creates nightly web pages containing segment dump »Segment dump written to ascii web pages for other tools to grab »Client program in glue/sbin/segpagegen installed on gateway »Run from ldbd user’s crontab at 1600 UTC daily –Runs from shell script that sets up environment statedb.py »Python module to publsh IFO state information into database »Used by Ben’s publishing scripts »Also used by publishstatefromfile and bulkpublishstate scripts

4/5/2003LIGO-G Z7 Grid Servers Implemented as python modules and run by ldbdd program »ldbdd provides common infrastructure (logging, gsi socket server, etc.) »Loads one of the two module below to handle connections LSCsegFindServer.py »Receives segment query on socket from client »Constructs SQL and executes it to get results from database »Returns output to client for display/writing to file »distinct() method –Returns available segments (with meanings) »segmentFindWithMetadata_vx() –Accepts query based on interferometer, type, start time, end time –Multiple types are unioned –Multiple IFOs are intersected

4/5/2003LIGO-G Z8 Grid Servers (continued) LDBDServer.py »Provides map between LIGO_LW XML and database »Reads configuration from /export/ldbd/etc/ldbdserver.ini »Reads database table design from DB2 on startup »query() –Executes SQL query on database and returns results as XML »insert() –Takes LIGO_LW XML that complies with table design and inserts it »insertmap() –Same as insert() but adds an LFN -> PFN map to an RLS server »Insertdmt() –Same as insert() but handles XML files containing existing segment_definer rows and updates process table with current end time of DMT process, if it already exists in the database »All insert methods capture the users DN and insert it in the grid_cert table

4/5/2003LIGO-G Z9 Grid Clients LSCsegFind.py »Straightforward interface to LSCsegFindServer »API directly by onasysd Command line version is LSCsegFind LDBDClient.py »Straightforward LDBDServer client for query(), insert() and insertmap() »API used directly by dmtdq_seg_insert and LSCdqInsert Command line version is ldbdc dmtdq_seg_insert »Used by DMT to insert XML containing online DQ »Calls LDBDClient.py with insertdmt() method »Deletes XML files on successful insert LSCdqInsert »Takes a list of GPS start/stop times, constructs XML and sends it to LDBDServer for insert into database

4/5/2003LIGO-G Z10 More on Table Design All tables have a creator_db column for replication »LHO 1, LLO 2, CIT 3 segment_definer is constrained to have unique rows for (run,ifos,name,version) »e.g. S5,H1,DUST,1 segment_def_map table links definitions to segments »(creator_db,segment_def_id) mapped to (creator_db,segment_db) segment table contains [gpsstart,gpsend) »Each segment has an active column: 0 for off, 1 for on »Science segments also populate segnum column with the science segment number

4/5/2003LIGO-G Z11 Problems encountered during S5 Replication shuts off if: »Large clock skew between two sites –Fix clock skew and restart replication »Conflict between tables –Rare, but could happen if user inserts DQ segments at LHO and the immediately inserts the same data at LLO before original insert can be replicated from LHO to LLO –Solution is to delete conflicting insert by deleting process table and restart replication »Replication message goes missing –Very rare: only happens if bad crash on server (e.g. raid crash at CIT) –Fix is to re-sync tables and restart replication Python severs have died a couple of times during S5 »Just restart, diagnosing error in /export/ldbd/var/log if possible

4/5/2003LIGO-G Z12 To Do List Extend LSCsegFind.py API to query active flag for segments »Would allow users to take intersections with “DQ flag not active” On site backups / deal with DB log files. »Currently the databases are not backed up. We rely on replication being working to ensure that a copy of the contents of the database are available, if these is a catastrophic failure at a site. Database should really be backup up locally as well. Cancerous log files should be dealt with. Status lights for the severs at CIT. »The status of the LSCsegFind LDBD and Trigger servers at CIT should be available on the status pages. Status for segment web dumps. »It would be nice to have status light which indicates if the latest segment web dump is older that 27 hours, which would indicate wether the cron job that creates these pages is functioning or not. Better status information for replication. »A red/green light for the Q replication processes would be very useful.