LHCb File-Metadata: Bookkeeping Carmine Cioffi Department of Physics, Oxford University UK Metadata Workshop Oxford, 04 July 2006.


UK Metadata Workshop Oxford 4 July 2006 LHCb Metadata

Outline
File-Metadata.
The metadata’s Logical Data Model.
LHCb Services.
DB schema.
Some Numbers.
Conclusions.

File-Metadata
LHCb stores only data that describes and gives information about data files (File-Metadata).
The File-Metadata are divided into two groups:
–Job provenance: information about how a data file was created.
–Bookkeeping: all the information about Files and Jobs.

The Metadata’s Logical Data Model

The Metadata’s Logical Data Model
The main subjects of the Logical Data Model are the Jobs and the Files.
Files and Jobs each have their own distinct set of metadata.
Within the file-related metadata, LHCb physicists usually see a subset of information called Quality.
The relations between jobs and files are of type input or output.
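The logical model described on this slide can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the class names, the field names (`lfn`, `metadata`, `quality`), the `provenance` helper and the example values are all hypothetical and are not taken from the LHCb schema.

```python
from dataclasses import dataclass, field
from enum import Enum


class LinkType(Enum):
    """The two relation types between a job and a file."""
    INPUT = "input"
    OUTPUT = "output"


@dataclass
class File:
    lfn: str                                       # hypothetical identifier
    metadata: dict = field(default_factory=dict)   # file-specific metadata
    quality: dict = field(default_factory=dict)    # the "Quality" subset


@dataclass
class Job:
    job_id: int
    metadata: dict = field(default_factory=dict)   # job-specific metadata
    links: list = field(default_factory=list)      # (LinkType, File) pairs

    def add_file(self, file: File, link_type: LinkType) -> None:
        self.links.append((link_type, file))

    def files(self, link_type: LinkType) -> list:
        return [f for t, f in self.links if t is link_type]


def provenance(file: File, jobs: list) -> list:
    """Return the jobs that list `file` as an output."""
    return [j for j in jobs if file in j.files(LinkType.OUTPUT)]


# Usage with made-up values: one job reads a raw file and writes a DST file.
raw = File(lfn="/example/raw/0001.raw")
dst = File(lfn="/example/dst/0001.dst", quality={"flag": "OK"})
reco = Job(job_id=1, metadata={"program": "hypothetical-reco"})
reco.add_file(raw, LinkType.INPUT)
reco.add_file(dst, LinkType.OUTPUT)
producers = provenance(dst, [reco])
```

Keeping the input/output type on the link rather than on the file lets the same file be the output of one job and the input of another, which is what makes provenance chains possible.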

LHCb services
File Catalog service:
–Provides information about the associations between logical file names (LFN) and physical file names (PFN).
Servlet service:
–Allows datasets to be selected from a web browser, based on their history (job provenance).
XML-RPC service:
–Provides access to, and modification of, the metadata.
–Allows GANGA to access Bookkeeping data.
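To illustrate what an XML-RPC exchange with a bookkeeping-style service looks like on the wire, the sketch below uses Python's standard `xmlrpc.client` module to encode and decode a call without contacting any server. The method name `selectFiles`, the query keys and the returned file names are hypothetical; the slides do not list the actual methods exposed by the Bookkeeping service.

```python
import xmlrpc.client

# Hypothetical query: the real Bookkeeping method names and arguments are
# not given in the slides.
query = {"EventType": "hypothetical-type", "Quality": "OK"}

# Client side: marshal the method call into the XML-RPC request body.
request_xml = xmlrpc.client.dumps((query,), methodname="selectFiles")

# Server side: unmarshal the request into parameters and a method name.
params, method = xmlrpc.client.loads(request_xml)

# Server side: marshal a (made-up) list of file names as the response.
response_xml = xmlrpc.client.dumps(
    (["/example/dst/0001.dst", "/example/dst/0002.dst"],),
    methodresponse=True,
)

# Client side: unmarshal the response; loads() returns (params, methodname).
(files,), _ = xmlrpc.client.loads(response_xml)
```

Against a live service, a client would normally let `xmlrpc.client.ServerProxy` perform these steps over HTTP; the endpoint address is deployment-specific and is not stated in the slides.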

LHCb services architecture
[Diagram; component labels: Oracle DB, JDBC Driver, BK Service, BookkeepingSvc, BookkeepingQuery, Jython Server, XML-RPC, Tomcat, Servlet, GANGA application, Web Browser, lbnts3; access labels: Write, Read/Write, Read]

LHCb services architecture
[Diagram; component labels: XML-RPC, Bookkeeping, ODBC, File Catalog, and four Pool.xml / Gaudi.xml pairs]

DB Schema
The two schemas:
–Warehouse Schema.
–View Schema.
The Warehouse schema is used to store bookkeeping information.
The View schema is used to store Job Provenance information.
Views are specialized to provide the best performance to the services that access them.

ER DB Schema

DB Schema
Warehouse DB schema: [diagram]

DB Schema
View DB schema: [diagram]

Updating the Views
If there are only a few changes in the Warehouse DB, the views are simply updated; otherwise they are regenerated.
This process runs periodically (every night) or on demand.
[Diagram; labels: Warehouse DB, SQL script, View]

Some Numbers
LHCb is using ORACLE 10g technology for its DB:
–4-node cluster with RedHat Enterprise Server 3
–running Oracle Real Application Clusters
–connected via fibre channel switches to a Storage Area Network
–composed of storage arrays with 16 SATA disks of 300GB each
–each node has two 2.8GHz Xeon processors and 4GB of memory
The DB contains ~45GB of data:
–shared between real data and indexing tables
–~6.5M job rows
–~18M file rows
–~200M rows in parameters.
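A few rough ratios follow directly from the figures on this slide. The sketch below is back-of-the-envelope arithmetic only: it assumes decimal units (1GB = 10^9 bytes), treats the three quoted row counts as if they were the whole database, and ignores the split between data and index storage, so the results are order-of-magnitude indications rather than measurements.

```python
# Figures quoted on the slide (all approximate).
JOB_ROWS = 6.5e6
FILE_ROWS = 18e6
PARAMETER_ROWS = 200e6
DB_BYTES = 45e9            # ~45GB, assuming 1GB = 1e9 bytes
DISKS_PER_ARRAY = 16
DISK_GB = 300

# Average number of file rows per job row.
files_per_job = FILE_ROWS / JOB_ROWS

# Average footprint per row, counting data and indexes together and
# assuming the three quoted tables account for all of the storage.
total_rows = JOB_ROWS + FILE_ROWS + PARAMETER_ROWS
bytes_per_row = DB_BYTES / total_rows

# Raw (unformatted, non-redundant) capacity of one storage array in TB.
raw_tb_per_array = DISKS_PER_ARRAY * DISK_GB / 1000

print(f"files per job:        {files_per_job:.1f}")
print(f"bytes per row:        {bytes_per_row:.0f}")
print(f"raw TB per array:     {raw_tb_per_array:.1f}")
```

On these assumptions the parameters table dominates the row count, and a single storage array offers roughly a hundred times the raw capacity the database currently uses.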

Conclusions
LHCb deals only with File-Metadata.
File-Metadata are divided into two logical sets:
–Job Provenance.
–Bookkeeping.
The DB is characterized by two schemas:
–Warehouse schema.
–View schema.
The View schema is specialized for specific uses.