Alexandre Vaniachine (ANL) LCG PEB Applications Area Meeting November 20, 2002 Alexandre Vaniachine (ANL) MySQL Service Plans and Needs in ATLAS.

Slides:



Advertisements
Similar presentations
Metadata Progress GridPP18 20 March 2007 Mike Kenyon.
Advertisements

Forschungszentrum Karlsruhe in der Helmholtz-Gemeinschaft Torsten Antoni – LCG Operations Workshop, CERN 02-04/11/04 Global Grid User Support - GGUS -
Database Architectures and the Web
4/2/2002HEP Globus Testing Request - Jae Yu x Participating in Globus Test-bed Activity for DØGrid UTA HEP group is playing a leading role in establishing.
RLS Production Services Maria Girone PPARC-LCG, CERN LCG-POOL and IT-DB Physics Services 10 th GridPP Meeting, CERN, 3 rd June What is the RLS -
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
1 Rhode Island Transportation Information System. (RITIS) Spring, 2000.
YNG Solutions Website Usability Review Prepared by Josepha Rood December 19, 2008.
Magda – Manager for grid-based data Wensheng Deng Physics Applications Software group Brookhaven National Laboratory.
SaaS, PaaS & TaaS By: Raza Usmani
Database Infrastructure Major Current Projects –CDF Connection Metering, codegen rewrite, hep w/ TRGSim++ – Dennis –CDF DB Client Monitor Server and MySQL.
ORACLE APPLICATION SERVER BY PHANINDER SURAPANENI CIS 764.
“ Does Cloud Computing Offer a Viable Option for the Control of Statistical Data: How Safe Are Clouds” Federal Committee for Statistical Methodology (FCSM)
IT – DBMS Concepts Relational Database Theory.
LCG Milestones for Deployment, Fabric, & Grid Technology Ian Bird LCG Deployment Area Manager PEB 3-Dec-2002.
Abstract The automated multi-platform software nightly build system is a major component in the ATLAS collaborative software organization, validation and.
Oracle8 JDBC Drivers Section 2. Common Features of Oracle JDBC Drivers The server-side and client-side Oracle JDBC drivers provide the same basic functionality.
J.T Moscicki CERN LCG - Software Process & Infrastructure1 SPI Software Process & Infrastructure for LCG Software Packaging and Distribution LCG Application.
RLS Tier-1 Deployment James Casey, PPARC-LCG Fellow, CERN 10 th GridPP Meeting, CERN, 3 rd June 2004.
Service Computation 2010November 21-26, Lisbon.
DOSAR Workshop, Sao Paulo, Brazil, September 16-17, 2005 LCG Tier 2 and DOSAR Pat Skubic OU.
Databases E. Leonardi, P. Valente. Conditions DB Conditions=Dynamic parameters non-event time-varying Conditions database (CondDB) General definition:
Team Members Team Members Tim Geiger Joe Hunsaker Kevin Kocher David May Advisor Dr. Juliet Hurtig November 8, 2001.
Production Tools in ATLAS RWL Jones GridPP EB 24 th June 2003.
CERN Physics Database Services and Plans Maria Girone, CERN-IT
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
GGUS at PEB – –- page 1 LCG Klaus-Peter Mickel, GridKa Karlsruhe LCG-PEB-Meeting ( ) The Global Grid User Support Model (Report of GDB.
CERN - IT Department CH-1211 Genève 23 Switzerland t COOL Conditions Database for the LHC Experiments Development and Deployment Status Andrea.
CERN-IT Oracle Database Physics Services Maria Girone, IT-DB 13 December 2004.
EGEE User Forum Data Management session Development of gLite Web Service Based Security Components for the ATLAS Metadata Interface Thomas Doherty GridPP.
OSG Integration Activity Report Rob Gardner Leigh Grundhoefer OSG Technical Meeting UCSD Dec 16, 2004.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
System Center Lesson 4: Overview of System Center 2012 Components System Center 2012 Private Cloud Components VMM Overview App Controller Overview.
Managing and Monitoring the Microsoft Application Platform Damir Bersinic Ruth Morton IT Pro Advisor Microsoft Canada
Testing and integrating the WLCG/EGEE middleware in the LHC computing Simone Campana, Alessandro Di Girolamo, Elisa Lanciotti, Nicolò Magini, Patricia.
Drupal Service: Infrastructure Update 2 Marek Salwerowicz Sergio Fernandez ENTICE Meeting
Features Of SQL Server 2000: 1. Internet Integration: SQL Server 2000 works with other products to form a stable and secure data store for internet and.
12 February 2004 ATLAS presentation to LCG PEB 1 Why ATLAS needs MySQL  For software developed by the ATLAS offline group, policy is to avoid dependencies.
ATLAS Database Access Library Local Area LCG3D Meeting Fermilab, Batavia, USA October 21, 2004 Alexandre Vaniachine (ANL)
David Foster LCG Project 12-March-02 Fabric Automation The Challenge of LHC Scale Fabrics LHC Computing Grid Workshop David Foster 12 th March 2002.
Magda Distributed Data Manager Prototype Torre Wenaus BNL September 2001.
Maria Girone CERN - IT Tier0 plans and security and backup policy proposals Maria Girone, CERN IT-PSS.
Grid Technologies for Distributed Database Services 3D Project Meeting CERN, May 19, 2005 A. Vaniachine (ANL)
CERN - IT Department CH-1211 Genève 23 Switzerland t Operating systems and Information Services OIS Proposed Drupal Service Definition IT-OIS.
1 A Scalable Distributed Data Management System for ATLAS David Cameron CERN CHEP 2006 Mumbai, India.
ATLAS Distributed Analysis Dietrich Liko IT/GD. Overview  Some problems trying to analyze Rome data on the grid Basics Metadata Data  Activities AMI.
Site Services and Policies Summary Dirk Düllmann, CERN IT More details at
Status of tests in the LCG 3D database testbed Eva Dafonte Pérez LCG Database Deployment and Persistency Workshop.
Database Access Patterns in ATLAS Computing Model G. Gieraltowski, J. Cranshaw, K. Karr, D. Malon, A. Vaniachine ANL P, Nevski, Yu. Smirnov, T. Wenaus.
LHCC Referees Meeting – 28 June LCG-2 Data Management Planning Ian Bird LHCC Referees Meeting 28 th June 2004.
INFSO-RI Enabling Grids for E-sciencE File Transfer Software and Service SC3 Gavin McCance – JRA1 Data Management Cluster Service.
What is Cloud Computing 1. Cloud computing is a service that helps you to perform the tasks over the Internet. The users can access resources as they.
Towards Dynamic Database Deployment LCG 3D Meeting November 24, 2005 CERN, Geneva, Switzerland Alexandre Vaniachine (ANL)
Unit 3 Virtualization.
Jean-Philippe Baud, IT-GD, CERN November 2007
Bob Jones EGEE Technical Director
By: Raza Usmani SaaS, PaaS & TaaS By: Raza Usmani
Regional Operations Centres Core infrastructure Centres
LCG 3D Distributed Deployment of Databases
Introduction to PHP FdSc Module 109 Server side scripting and
Ian Bird GDB Meeting CERN 9 September 2003
3D Application Tests Application test proposals
Database Readiness Workshop Intro & Goals
GGUS Partnership between FZK and ASCC
An example of a pilot project as part of USP course:
2018 Valid Amazon AWS-Solution-Architect-Associate Exam Study Guide - AWS-Solution-Architect-Associate Questions Answers Realexamdumps.com
Chapter 2 Database Environment Pearson Education © 2009.
Introduction to Databases Transparencies
Database Environment Transparencies
Production Manager Tools (New Architecture)
Presentation transcript:

Alexandre Vaniachine (ANL) LCG PEB Applications Area Meeting November 20, 2002 Alexandre Vaniachine (ANL) MySQL Service Plans and Needs in ATLAS

Alexandre Vaniachine (ANL) Database Needs Current ATLAS software needs for relational databases are coming from three domains Data Challenges (support of ATLAS Grid tools): o Metadata catalog:AMI o Replica catalog:Magda o Virtual Data Catalog:VDC o Grid portal:ATCOM Offline Software (framework services, simulations) o primary numbers, conditions On-line/Testbeam o LAr, tilecal,…

Alexandre Vaniachine (ANL) Observations ATLAS: Three domains – three different schedules and priorities lists Same database technology was deployed by different software developers groups everywhere ATLAS gained experience in use of MySQL LHC/EDG: Other LHC experiments are using MySQL for many different needs LCG persistency project POOL selected MySQL for implementation of FileCatalog and Collections EDG project uses MySQL for Replica Location Service

Alexandre Vaniachine (ANL) Considerations Oracle is used in some LHC experiments for certain needs Oracle support is available from IT/DB In a comprehensive benchmark of the latest available versions of five server databases done by the eWEEK Labs/PC Labs both Oracle 9i and MySQL had the best performance and scalability. Both Oracle and MySQL come on top of other databases (IBM DB2, Microsoft SQL Server and Sybase ACE) scaling up to a 1000 simultaneous connections, with Oracle 9i just very slightly ahead of MySQL for most of the run. LCG persistency project POOL plan to use both Oracle and MySQL (BlueprintPlan.xls)

Alexandre Vaniachine (ANL) Experience MySQL functionality is sufficient Users would like faster database response Single server model is not convenient: o When developed databases started to be used in production this prevented server shutdown for maintenance, upgrades o Security upgrades done without preliminary studies of their potential side effects resulted in service disruption for many users Current authorization procedures are inadequate: o clear-text database passwords are used in scripts or distributed via s

Alexandre Vaniachine (ANL) Outcome MySQL server was deployed at CERN for evaluation (thanks to Gordon Lee from IT/DB and RD Shaffer) MySQL services working group was established: Christian Arnault, Nectarios Benekos, Jerome Fulachier, Jorge Lima, Antoine Perus, Alex Undrus, Sasha Vaniachine People who want to participate are welcome to join Group will define the requirements for MySQL services, and propose a phased deployment plan These requirements will provide input to the LCG infrastructure effort During discussion with MySQL services group members and a meeting at LAL on October 11 a proposal for a multi-tier server model was developed

Alexandre Vaniachine (ANL) T hree Tiers for Database Services Production servers o deployed for use in production data processing o stable and tested server versions are used o made highly secured and reliable (backed up, archived, replicated, protected) Software development servers o used to develop databases and test the software before its deployment on the production servers o validated and tested server version is used o used during release tests o used for tutorials, demos Database development servers o needed for evaluation of the new server releases o needed for testing effects of the new security upgrades o needed for testing effects of the operating system upgrades o may reside on the same hardware as the development server, although this will prevent evaluation of the new operating systems and the effects of network security upgrades

Alexandre Vaniachine (ANL) Four User Roles Access rights will be granted according to the roles: Administrators o manage the installation of database servers o give access rights to other users Developers o develop database for some software application, e.g. online conditions database, calibration database, tag collector database etc. o have full access rights to that database on the server Data writers o these are user applications or scripts that fill the data to the database o have only the rights to add the data, not to replace or delete Data readers o these are user applications or scripts that read data from the database o no password is needed for read data access It is expected that only a limited number of users will have administrative rights to on the server For database development servers, more users may have full access rights Long-terms solutions for the data writer password management issues have to be investigated

Alexandre Vaniachine (ANL) Support Issues We think that for production servers, CERN resources should be first considered However, considering that this solution is submitted to quite wide preconditions (relationships with computing plans, COCOTIME, etc.) it is probably wise to consider short term solutions - using private Atlas resources at CERN (atlasdev1) - using offered resources from external sites (BNL, Grenoble are current examples) In case other computing centers would offer similar services, it may be appropriate to undertake agreements on the level of service (Similarly to software agreements)

Alexandre Vaniachine (ANL) Web Services One central aspect of database services is the web access to those services. These include management services (such as phpMyAdmin) or Web applications accessing the databases (e.g. the NOVA browser). It is noticed that the providing Web access to databases is a different issue than managing the database server and that the Web server where applications run may be different than the SQL server. The constraints are different (different technologies are required, such as php, Java, etc.) It is expected that several servers will provide at least a php resource So far we have CERN home service and a possibility with the LCG web server (not operational these days). We need strong support for these servers, since it's likely that Web services may become part of normal software usage

Alexandre Vaniachine (ANL) Short-term Solutions Production servers Master server - atlasdev1.cern.ch Replica server 1 - db2.usatlas.bnl.gov (to be deployed) Data backup and archiving by BNL Tier 1 Facility Replica server for AMI database - isngrid1.in2p3.fr Data backup support by Grenoble Web access service provided by BNL and Grenoble Databases to be deployed on production servers: AMI, Magda, VDC, NOVA Development servers Master server - db1.usatlas.bnl.gov Replica server - atlassw1.phy.bnl.gov Web access service - atlassw1.phy.bnl.gov back-up support by BNL Tier 1 Facility Evaluation server lxshare070d.cern.ch (provided by LCG POOL project)

Alexandre Vaniachine (ANL) Future Developments Blueprint RTAG recommendation (laptop user case) => MySQL Embedded Server Technology Deployment model should provide for future Grid integration issues (OGSA, UDDI, Grid database services architecture) => MySQL X509 Certificate Authorization Technology Grid example by Kate Keahey

Alexandre Vaniachine (ANL) Long-term Evolution The three ATLAS software database clients: o Data Challenges (AMI, Magda, ATCOM, VDC) o Core Software (primary numbers, conditions) o On-line/Testbeam In the long run they should be supported by different sets of servers Extreme range of requirements for on-line database and the end-user laptop Need for two more tiers?  Ultimate robustness for the Tier O (on-line DB)  Least cost for the Tier 4 (end-user laptop)

Alexandre Vaniachine (ANL) Oracle Deployment Can we have an Oracle on every user laptop? Can we have embedded Oracle? Can we have Oracle Grid certificate authorization? These issues call for two technologies in a multi-tier model (thanks to Tom LeCompte) Technologies ranging from Oracle on a production tier server (ultimately robust service level) to MySQL on an end-user laptop ("do-it-yourself" service level)

Alexandre Vaniachine (ANL) Potential IT/DB Roles Provide support for production-level tiers of database servers Develop and implement strategies for fault-tolerant database services data backup and archiving crash recoveries Give advice for evaluation tier: MySQL versions, optimizations, hardware platforms Investigate value of purchasing support from MySQL AB Partnership agreement with MySQL AB to develop code providing better support for binary data delivery (Conditions DB) binary data transfer through TCP/IP sockets direct client access to data (a la Objectivity or some RDBMS) support for non-X509 certificates (grid proxies) …