Oracle Clustering and Replication Technologies UK Metadata Workshop - Oxford Barbara Martelli Gianluca Peco.

Slides:



Advertisements
Similar presentations
Tom Hamilton – America’s Channel Database CSE
Advertisements

Database Architectures and the Web
IWR Ideen werden Realität Forschungszentrum Karlsruhe in der Helmholtz-Gemeinschaft Institut für Wissenschaftliches Rechnen Status of Database Services.
DB server limits (process/sessions) Carlos Fernando Gamboa, BNL Andrew Wong, TRIUMF WLCG Collaboration Workshop, CERN Geneva, April 2008.
High Availability Group 08: Võ Đức Vĩnh Nguyễn Quang Vũ
Oracle Clustering and Replication Technologies CCR Workshop - Otranto Barbara Martelli Gianluca Peco.
GGF Toronto Spitfire A Relational DB Service for the Grid Peter Z. Kunszt European DataGrid Data Management CERN Database Group.
Module 14: Scalability and High Availability. Overview Key high availability features available in Oracle and SQL Server Key scalability features available.
Hardening Linux for Enterprise Applications Peter Knaggs & Xiaoping Li Oracle Corporation Sunil Mahale Network Appliance Session id:
1 RAL Status and Plans Carmine Cioffi Database Administrator and Developer 3D Workshop, CERN, November 2009.
BNL Oracle database services status and future plans Carlos Fernando Gamboa RACF Facility Brookhaven National Laboratory, US Distributed Database Operations.
1 Copyright © 2009, Oracle. All rights reserved. Exploring the Oracle Database Architecture.
Database Services for Physics at CERN with Oracle 10g RAC HEPiX - April 4th 2006, Rome Luca Canali, CERN.
ASGC 1 ASGC Site Status 3D CERN. ASGC 2 Outlines Current activity Hardware and software specifications Configuration issues and experience.
Oracle on Windows Server Introduction to Oracle10g on Microsoft Windows Server.
Oracle10g RAC Service Architecture Overview of Real Application Cluster Ready Services, Nodeapps, and User Defined Services.
Clustering  Types of Clustering. Objectives At the end of this module the student will understand the following tasks and concepts. What clustering is.
Data Replication with Advanced Replication & Oracle Streams John Abrahams Technology Sales Consultant Oracle Nederland.
By Lecturer / Aisha Dawood 1.  You can control the number of dispatcher processes in the instance. Unlike the number of shared servers, the number of.
Heterogeneous Database Replication Gianni Pucciani LCG Database Deployment and Persistency Workshop CERN October 2005 A.Domenici
Middleware for FIs Apeego House 4B, Tardeo Rd. Mumbai Tel: Fax:
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
Achieving Scalability, Performance and Availability on Linux with Oracle 9iR2-RAC Grant McAlister Senior Database Engineer Amazon.com Paper
VICTORIA UNIVERSITY OF WELLINGTON Te Whare Wananga o te Upoko o te Ika a Maui SWEN 432 Advanced Database Design and Implementation MongoDB Architecture.
Page 1. Data Integration Using Oracle Streams A Case Study Session #:
LFC Replication Tests LCG 3D Workshop Barbara Martelli.
Mark E. Fuller Senior Principal Instructor Oracle University Oracle Corporation.
VMware vSphere Configuration and Management v6
Forschungszentrum Karlsruhe in der Helmholtz-Gemeinschaft Implementation of a reliable and expandable on-line storage for compute clusters Jos van Wezel.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Implementation and performance analysis of.
CERN Database Services for the LHC Computing Grid Maria Girone, CERN.
CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.
Oracle Database Architecture By Ayesha Manzer. Automatic Storage Management Spreads database data across all disks Creates and maintains a storage grid.
CERN - IT Department CH-1211 Genève 23 Switzerland t High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN,
LHCb File-Metadata: Bookkeeping Carmine Cioffi Department of Physics, Oxford University UK Metadata Workshop Oxford, 04 July 2006.
HyperKVS Group Meeting Oracle Streams Dr. Volker Kuhr.
Oracle9i Performance Tuning Chapter 11 Advanced Tuning Topics.
BNL Oracle database services status and future plans Carlos Fernando Gamboa, John DeStefano, Dantong Yu Grid Group, RACF Facility Brookhaven National Lab,
Maria Girone CERN - IT Tier0 plans and security and backup policy proposals Maria Girone, CERN IT-PSS.
CNAF Database Service Barbara Martelli CNAF-INFN Elisabetta Vilucchi CNAF-INFN Simone Dalla Fina INFN-Padua.
Database CNAF Barbara Martelli Rome, April 4 st 2006.
Status of tests in the LCG 3D database testbed Eva Dafonte Pérez LCG Database Deployment and Persistency Workshop.
AMGA-Bookkeeping Carmine Cioffi Department of Physics, Oxford University UK Metadata Workshop Oxford, 05 July 2006.
Replicazione e QoS nella gestione di database grid-oriented Barbara Martelli INFN - CNAF.
C Copyright © 2006, Oracle. All rights reserved. Integrating with Oracle Streams.
Deployment and operations of a high availability infrastructure for relational databases in a heterogeneous Tier 1 workload environment Carlos Fernando.
An Introduction to GPFS
Marcin Bogusz CERN, PH-CMG WLCG Collaboration Workshop CMS online/offline replication Online/offline replication via Oracle Streams WLCG Collaboration.
WLCG Collaboration Workshop CMS online/offline replication
Jean-Philippe Baud, IT-GD, CERN November 2007
Maria Girone, CERN – IT, Data Management Group
Database Services Katarzyna Dziedziniewicz-Wojcik On behalf of IT-DB.
Database Architectures and the Web
IT-DB Physics Services Planning for LHC start-up
LCG 3D Distributed Deployment of Databases
Distributed Network Traffic Feature Extraction for a Real-time IDS
LCG 3D and Oracle Cluster
Oracle 11g Real Application Clusters Advanced Administration
Scalable Database Services for Physics: Oracle 10g RAC on Linux
Database Architectures and the Web
Introduction to Networks
Introduction of Week 6 Assignment Discussion
TCB 9 RK/MSFT Benchmark Results
Oracle Storage Performance Studies
Case studies – Atlas and PVSS Oracle archiver
ASM-based storage to scale out the Database Services for Physics
Database Services for CERN Deployment and Monitoring
Oracle Streams Performance
Scalable Database Services for Physics: Oracle 10g RAC on Linux
High-Performance Storage System for the LHCb Experiment
Presentation transcript:

Oracle Clustering and Replication Technologies UK Metadata Workshop - Oxford Barbara Martelli Gianluca Peco

5 July, 2006UK Metadata Workshop, Oxford2 Overview  Oracle server architecture  Oracle Real Application clusters architecture and tests  Oracle Streams technology  LFC replcation tests done with LHCb team

5 July, 2006UK Metadata Workshop, Oxford3 Oracle Database Architecture The Oracle Server architecture can be divided in three categories:  User-related processes User Process Server Process  Logical memory structures that are collectively called an Oracle instance  Physical file structures that are collectively called a database Database

5 July, 2006UK Metadata Workshop, Oxford4

5 July, 2006UK Metadata Workshop, Oxford5 Instance Database

5 July, 2006UK Metadata Workshop, Oxford6 Oracle Real Application Cluster  The Oracle Real Application Cluster technology allows to share a database amongst several servers  All datafiles, control files, PFILEs, and redo log files in RAC environments must reside on cluster- aware shared disks so that all of the cluster database instances can access them.  RAC aims to provide highly available, fault tolerant and scalable database services Network shared disks (Cluster Filesystem) Database servers

5 July, 2006UK Metadata Workshop, Oxford7 RAC testbed ORA-RAC-01 ORA-RAC-02 ORA-RAC-03 ORA-RAC-04 IBM FAStT900 FC RAID Controller Fiber Channel Sw GigaSw2 Clients GigaSw1 Private network for interconnect traffic Public and VIP Network Interface 4 x Dual Xeon 2.8 GHz 4 GB RAM Red Hat Enterprise 4 on RAID-1 disks 2 x Intel PRO1000 NICs 1 QLogic 2312 FC HBA with 2 x 2Gb/s links Clients Disk I/O traffic 1.2 TB RAID-5 disk array formatted with OCFS2

RAC Test AS3AP 1-4 nodes Select Query 1GB cache

5 July, 2006UK Metadata Workshop, Oxford9 Overview  Summarize the main plans  Explain the long-term course to follow Select Query 8GB no db cache RAC Test AS3AP 1-4 nodes

5 July, 2006UK Metadata Workshop, Oxford10 RAC Test OLTP nodes With OLTP applications, system scalability is lower, we argue there is a disk subsystem bottleneck 1 node 2 nodes 4 nodes

5 July, 2006UK Metadata Workshop, Oxford11 RAC Test OLTP 4 nodes TransactionPerMinute workload OLTP O_DIRECT enabled ASYNC_IO enabled TransactionPerMinute workload OLTP O_DIRECT Disabled ASYNC_IO Disabled

5 July, 2006UK Metadata Workshop, Oxford12 Oracle Streams CAPTURE:  Streams captures events Implicitly: log-based capture of DML and DDL Explicitly: Direct enqueue of user messages PROPAGATION:  Captured events are published in the staging area  The staging area has the following characteristics: Implemented as a queue Messages remain in staging area until consumed by all subscribers  Other staging areas can subscribe to events in same database or in a remote database  Events can be routed through a series of staging areas  Transformations can be performed as events enter, leave or propagate between staging areas Consumption PropagateCapture

5 July, 2006UK Metadata Workshop, Oxford13 Oracle Streams Comsumption:  Staged events are consumed by subscribers Implicitly: Apply Process  Default Apply  User-Defined Apply Explicitly: Application dequeue via API (C++, Java…)  The default apply engine will directly apply the DML or DDL represented in the LCR apply to local Oracle table apply via DB Link to remote Oracle table  Automatic conflict detection with optional resolution unresolved conflicts placed in exception queue  Rule based configuration: expressed as “WHERE” clause Consumption PropagateCapture

5 July, 2006UK Metadata Workshop, Oxford14 Streams Replication Example table1 Update table1 set field1=‘value3’ where table1id=‘id1’; Redo Log Capture table1 table1id |field1|.. id1 | value3 |… id2 | value2 |... Apply Queue LCRs Queue LCRs Propagation ACK Source Node Destination Node User executes an update statement at source node: update table1 set field1= ‘id3’ where table1id = ‘id1’;

5 July, 2006UK Metadata Workshop, Oxford15

5 July, 2006UK Metadata Workshop, Oxford16 Oracle Streams in 3D  The Oracle streams allows connecting single tables or complete schemas in different databases and keeping them up to date at Real Time.

5 July, 2006UK Metadata Workshop, Oxford17

5 July, 2006UK Metadata Workshop, Oxford18

5 July, 2006UK Metadata Workshop, Oxford19

5 July, 2006UK Metadata Workshop, Oxford20

5 July, 2006UK Metadata Workshop, Oxford21 LFC Replication testbed  40 lfc clients, 40 lfc daemons threads, streams pool.  Client’s actions Control if LFN exists into the database  Select from cns_file_metadata If yes -> add a sfn for that lfn  Insert sfn into cns_file_replica If not -> add both lfn and sfn  Insert lfn into cns_file_metadata  Insert sfn into cns_file_replica For each lfn 3 sfn are inserted

5 July, 2006UK Metadata Workshop, Oxford22 LFC Master HW Configuration Gigabit Switch Private LHCB link rac-lhcb-01 rac-lhcb-02 Dell 224F 14 x 73GB disks ASM Dual Xeon 3,2GHz,4GB memory 2nodes-RAC on Oracle 10gR2 RHEL 4 kernel ELsmp 14 Fibre Channel disks (73GB each) HBA Qlogic Qla2340 – Brocade FC Switch Disk storage managed with Oracle ASM (striping and mirroring)

5 July, 2006UK Metadata Workshop, Oxford23 LFC Slave Configuration  LFC Read only replica Dual Xeon 2.4, 2GB RAM Oracle 10gR2 (oracle RAC but used as single instance) RHEL 3 kernel x 250GB disks in RAID 5 HBA Qlogic Qla2340 – Brocade FC Switch Disk storage formatted with OCFS2

5 July, 2006UK Metadata Workshop, Oxford24 Performance About 75 transactions per second on each cluster node. Inserted and replicated 1700k entries in 4 hours (118 insert per second). Almost real-time replica with Oracle Streams without significant delays (<< 1s).

5 July, 2006UK Metadata Workshop, Oxford25 CPU load on cluster nodes is far from being saturated.

5 July, 2006UK Metadata Workshop, Oxford26 Conclusions and Future Plans  RAC technology is a good solution for scalability at DB server level. Some work is needed to tune the installation and optimize performance for a particular application. Moreover a reliable and scalable storage subsystem is needed.  Streams based replication is a good solution for scalability at “grid level”, a reliable DB infrastructure has to be distributed across many sites.  First LFC replication test results demonstrate that Streams is an interesting solution for real-time master/slave replication.  VOMS replication tests in the very near future.  Many thanks to Vincenzo Vagnoni, Eva da Fonte Perez.