Oracle Clustering and Replication Technologies CCR Workshop - Otranto Barbara Martelli Gianluca Peco.

Slides:



Advertisements
Similar presentations
Tom Hamilton – America’s Channel Database CSE
Advertisements

Database Architectures and the Web
IWR Ideen werden Realität Forschungszentrum Karlsruhe in der Helmholtz-Gemeinschaft Institut für Wissenschaftliches Rechnen Status of Database Services.
DB server limits (process/sessions) Carlos Fernando Gamboa, BNL Andrew Wong, TRIUMF WLCG Collaboration Workshop, CERN Geneva, April 2008.
High Availability Group 08: Võ Đức Vĩnh Nguyễn Quang Vũ
GGF Toronto Spitfire A Relational DB Service for the Grid Peter Z. Kunszt European DataGrid Data Management CERN Database Group.
Module 14: Scalability and High Availability. Overview Key high availability features available in Oracle and SQL Server Key scalability features available.
Hardening Linux for Enterprise Applications Peter Knaggs & Xiaoping Li Oracle Corporation Sunil Mahale Network Appliance Session id:
1 RAL Status and Plans Carmine Cioffi Database Administrator and Developer 3D Workshop, CERN, November 2009.
BNL Oracle database services status and future plans Carlos Fernando Gamboa RACF Facility Brookhaven National Laboratory, US Distributed Database Operations.
1 Copyright © 2009, Oracle. All rights reserved. Exploring the Oracle Database Architecture.
Database Services for Physics at CERN with Oracle 10g RAC HEPiX - April 4th 2006, Rome Luca Canali, CERN.
ASGC 1 ASGC Site Status 3D CERN. ASGC 2 Outlines Current activity Hardware and software specifications Configuration issues and experience.
Oracle on Windows Server Introduction to Oracle10g on Microsoft Windows Server.
Oracle10g RAC Service Architecture Overview of Real Application Cluster Ready Services, Nodeapps, and User Defined Services.
Clustering  Types of Clustering. Objectives At the end of this module the student will understand the following tasks and concepts. What clustering is.
Oracle Streams--Simplifying Information Sharing in Oracle10 g Patricia McElroy Product Manager Oracle Corporation Session id:
Data Replication with Advanced Replication & Oracle Streams John Abrahams Technology Sales Consultant Oracle Nederland.
ORACLE RAC. Definition Oracle Real Application Cluster (RAC) is a cluster system at the application level. It uses shared disk architecture that provides.
By Lecturer / Aisha Dawood 1.  You can control the number of dispatcher processes in the instance. Unlike the number of shared servers, the number of.
CERN - IT Department CH-1211 Genève 23 Switzerland t Tier0 database extensions and multi-core/64 bit studies Maria Girone, CERN IT-PSS LCG.
Indiana University’s Name for its Sakai Implementation Oncourse CL (Collaborative Learning) Active Users = 112,341 Sites.
Heterogeneous Database Replication Gianni Pucciani LCG Database Deployment and Persistency Workshop CERN October 2005 A.Domenici
Achieving Scalability, Performance and Availability on Linux with Oracle 9iR2-RAC Grant McAlister Senior Database Engineer Amazon.com Paper
Fast Crash Recovery in RAMCloud. Motivation The role of DRAM has been increasing – Facebook used 150TB of DRAM For 200TB of disk storage However, there.
Page 1. Data Integration Using Oracle Streams A Case Study Session #:
LFC Replication Tests LCG 3D Workshop Barbara Martelli.
Oracle® Streams for Near Real Time Asynchronous Replication Nimar S. Arora Oracle USA.
Mark E. Fuller Senior Principal Instructor Oracle University Oracle Corporation.
VMware vSphere Configuration and Management v6
Forschungszentrum Karlsruhe in der Helmholtz-Gemeinschaft Implementation of a reliable and expandable on-line storage for compute clusters Jos van Wezel.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Implementation and performance analysis of.
CERN Database Services for the LHC Computing Grid Maria Girone, CERN.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Site Architecture Resource Center Deployment Considerations MIMOS EGEE Tutorial.
Oracle Database Architecture By Ayesha Manzer. Automatic Storage Management Spreads database data across all disks Creates and maintains a storage grid.
CERN - IT Department CH-1211 Genève 23 Switzerland t High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN,
LHCb File-Metadata: Bookkeeping Carmine Cioffi Department of Physics, Oxford University UK Metadata Workshop Oxford, 04 July 2006.
HyperKVS Group Meeting Oracle Streams Dr. Volker Kuhr.
Oracle9i Performance Tuning Chapter 11 Advanced Tuning Topics.
BNL Oracle database services status and future plans Carlos Fernando Gamboa, John DeStefano, Dantong Yu Grid Group, RACF Facility Brookhaven National Lab,
Maria Girone CERN - IT Tier0 plans and security and backup policy proposals Maria Girone, CERN IT-PSS.
Scalable data access with Impala Zbigniew Baranowski Maciej Grzybek Daniel Lanza Garcia Kacper Surdy.
CNAF Database Service Barbara Martelli CNAF-INFN Elisabetta Vilucchi CNAF-INFN Simone Dalla Fina INFN-Padua.
Database CNAF Barbara Martelli Rome, April 4 st 2006.
Status of tests in the LCG 3D database testbed Eva Dafonte Pérez LCG Database Deployment and Persistency Workshop.
AMGA-Bookkeeping Carmine Cioffi Department of Physics, Oxford University UK Metadata Workshop Oxford, 05 July 2006.
Replicazione e QoS nella gestione di database grid-oriented Barbara Martelli INFN - CNAF.
C Copyright © 2006, Oracle. All rights reserved. Integrating with Oracle Streams.
Making Sense of Service Broker Inside the Black Box.
Deployment and operations of a high availability infrastructure for relational databases in a heterogeneous Tier 1 workload environment Carlos Fernando.
An Introduction to GPFS
ISC321 Database Systems I Chapter 2: Overview of Database Languages and Architectures Fall 2015 Dr. Abdullah Almutairi.
Oracle Clustering and Replication Technologies UK Metadata Workshop - Oxford Barbara Martelli Gianluca Peco.
Marcin Bogusz CERN, PH-CMG WLCG Collaboration Workshop CMS online/offline replication Online/offline replication via Oracle Streams WLCG Collaboration.
WLCG Collaboration Workshop CMS online/offline replication
Jean-Philippe Baud, IT-GD, CERN November 2007
Maria Girone, CERN – IT, Data Management Group
Database Services Katarzyna Dziedziniewicz-Wojcik On behalf of IT-DB.
LCG 3D Distributed Deployment of Databases
Distributed Network Traffic Feature Extraction for a Real-time IDS
LCG 3D and Oracle Cluster
Scalable Database Services for Physics: Oracle 10g RAC on Linux
TCB 9 RK/MSFT Benchmark Results
Oracle Storage Performance Studies
ASM-based storage to scale out the Database Services for Physics
Database Services for CERN Deployment and Monitoring
Oracle Streams Performance
Oracle10g RAC Service Architecture
Scalable Database Services for Physics: Oracle 10g RAC on Linux
High-Performance Storage System for the LHCb Experiment
Presentation transcript:

Oracle Clustering and Replication Technologies CCR Workshop - Otranto Barbara Martelli Gianluca Peco

8 June, 2006CCR Workshop, Otranto2 Oracle Database Architecture The Oracle Server architecture can be described in three categories:  User-related processes User Process Server Process  Logical memory structures that are collectively called an Oracle instance  Physical file structures that are collectively called a database Database

8 June, 2006CCR Workshop, Otranto3

8 June, 2006CCR Workshop, Otranto4 Instance Database

8 June, 2006CCR Workshop, Otranto5 Oracle Real Application Cluster  The Oracle Real Application Cluster technology allows to share a database amongst several servers  All datafiles, control files, PFILEs, and redo log files in RAC environments must reside on cluster- aware shared disks so that all of the cluster database instances can access them.  RAC aims to provide highly available, fault tolerant and scalable database services Network shared disks (Cluster Filesystem) Database servers

8 June, 2006CCR Workshop, Otranto6 RAC testbed ORA-RAC-01 ORA-RAC-02 ORA-RAC-03 ORA-RAC-04 IBM FAStT900 FC RAID Controller Fiber Channel Sw GigaSw2 Clients GigaSw1 Private network for interconnect traffic Public and VIP Network Interface 4 x Dual Xeon 2.8 GHz 4 GB RAM Red Hat Enterprise 4 on RAID-1 disks 2 x Intel PRO1000 NICs 1 QLogic 2312 FC HBA with 2 x 2Gb/s links Clients Disk I/O traffic 1.2 TB RAID-5 disk array formatted with OCFS2

RAC Test AS3AP 1-4 nodes Select Query 1GB cache

Overview  Summarize the main plans  Explain the long-term course to follow Select Query 8GB no db cache RAC Test AS3AP 1-4 nodes

RAC Test OLTP nodes With OLTP applications, system scalability is lower, we argue there is a disk subsystem bottleneck 1 node 2 nodes 4 nodes

RAC Test OLTP 4 nodes TransactionPerMinute workload OLTP O_DIRECT enabled ASYNC_IO enabled TransactionPerMinute workload OLTP O_DIRECT Disabled ASYNC_IO Disabled

8 June, 2006CCR Workshop, Otranto11 Oracle Streams CAPTURE:  Streams captures events Implicitly: log-based capture of DML and DDL Explicitly: Direct enqueue of user messages PROPAGATION:  Captured events are published in the staging area  Streams publishes captured events into a staging area Implemented as a queue Messages remain in staging area until consumed by all subscribers  Other staging areas can subscribe to events in same database or in a remote database  Events can be routed through a series of staging areas  Transformations can be performed as events enter, leave or propagate between staging areas Consumption PropagateCapture

8 June, 2006CCR Workshop, Otranto12 Oracle Streams Comsumption:  Staged events are consumed by subscribers Implicitly: Apply Process  Default Apply  User-Defined Apply Explicitly: Application dequeue via API (C++, Java…)  The default apply engine will directly apply the DML or DDL represented in the LCR apply to local Oracle table apply via DB Link to non-Oracle table  Automatic conflict detection with optional resolution unresolved conflicts placed in exception queue  Rule based configuration: expressed as “WHERE” clause Consumption PropagateCapture

8 June, 2006CCR Workshop, Otranto13 Streams Replication Example table1 Update table1 set field1=‘value3’ where table1id=‘id1’; Redo Log Capture table1 table1id |field1|.. id1 | value3 |… id2 | value2 |... Apply Queue LCRs Queue LCRs Propagation ACK Source Node Destination Node User executes an update statement at source node: update table1 set field1= ‘id3’ where table1id = ‘id1’;

8 June, 2006CCR Workshop, Otranto14

8 June, 2006CCR Workshop, Otranto15 Oracle Streams in 3D  The Oracle streams allows connecting single tables or complete schemas in different databases and keeping them up to date at Real Time.

8 June, 2006CCR Workshop, Otranto16

8 June, 2006CCR Workshop, Otranto17

8 June, 2006CCR Workshop, Otranto18

8 June, 2006CCR Workshop, Otranto19

8 June, 2006CCR Workshop, Otranto20 LFC Replication testbed  40 lfc clients, 40 lfc daemons threads, streams pool.  Client’s actions Control if LFN exists into the database  Select from cns_file_metadata If yes -> add a sfn for that lfn  Insert sfn into cns_file_replica If not -> add both lfn and sfn  Insert lfn into cns_file_metadata  Insert sfn into cns_file_replica For each lfn 3 sfn are inserted

8 June, 2006CCR Workshop, Otranto21 LFC Master HW Configuration Gigabit Switch Private LHCB link rac-lhcb-01 rac-lhcb-02 Dell 224F 14 x 73GB disks ASM Dual Xeon 3,2GHz,4GB memory 2nodes-RAC on Oracle 10gR2 RHEL 4 kernel ELsmp 14 Fibre Channel disks (73GB each) HBA Qlogic Qla2340 – Brocade FC Switch Disk storage managed with Oracle ASM (striping and mirroring)

8 June, 2006CCR Workshop, Otranto22 LFC Slave Configuration  LFC Read only replica Dual Xeon 2.4, 2GB RAM Oracle 10gR2 (oracle RAC but used as single instance) RHEL 3 kernel x 250GB disks in RAID 5 HBA Qlogic Qla2340 – Brocade FC Switch Disk storage formatted with OCFS2

8 June, 2006CCR Workshop, Otranto23 Performance About 75 transactions per second on each cluster node. Inserted and replicated 1700k entries in 4 hours (118 insert per second). Almost real-time replica with Oracle Streams without significant delays (<< 1s).

8 June, 2006CCR Workshop, Otranto24 CPU load on cluster nodes is far from being saturated.

8 June, 2006CCR Workshop, Otranto25 Conclusions and Future Plans  RAC technology is a good solution for scalability at DB server level. Some work is needed to tune the installation and optimize performance for a particular application. Moreover a reliable and scalable storage subsystem is needed.  Streams based replication is a good solution for scalability at “grid level”, a reliable DB infrastructure has to be distributed across many sites.  First LFC replication test results demonstrate that Streams is an interesting solution for real-time master/slave replication.  VOMS replication tests in the very near future.  Many thanks to Vincenzo Vagnoni, Eva da Fonte Perez.