Replication solutions for Oracle database 11g Zbigniew Baranowski.

Slides:



Advertisements
Similar presentations
Forward Data Cache Integration Pattern
Advertisements

Replication for Availability & Durability with MySQL and Amazon RDS Grant McAlister.
Gareth Ellis Senior Solutions Consultant Session 5a Key and PIN Management.
Tom Hamilton – America’s Channel Database CSE
13 Copyright © 2005, Oracle. All rights reserved. Monitoring and Improving Performance.
Go-Faster Consultancy Ltd.1 Experiences of Global Temporary Tables in Oracle 8.1 David Kurtz Go-Faster Consultancy Ltd.
Forschungszentrum Karlsruhe in der Helmholtz-Gemeinschaft Jos van Wezel Doris Ressmann GridKa, Karlsruhe TSM as tape storage backend for disk pool managers.
1 Sizing the Streaming Media Cluster Solution for a Given Workload Lucy Cherkasova and Wenting Tang HPLabs.
Capacity Planning in a Virtual Environment
Virtual Switching Without a Hypervisor for a More Secure Cloud Xin Jin Princeton University Joint work with Eric Keller(UPenn) and Jennifer Rexford(Princeton)
Contents 2 Engagement Overview Migrating to Hyper-V from VMware Consider if time allows.
Bottoms Up Factoring. Start with the X-box 3-9 Product Sum
<Insert Picture Here>
1 Dbvisit Software Ltd. Dbvisit Overview. 2www.dbvisit.com Agenda Dbvisit Software Ltd. Overview Dbvisit Standby Dbvisit Replicate Customer Successes.
GoldenGate Monitoring and Troubleshooting
Oracle Clustering and Replication Technologies CCR Workshop - Otranto Barbara Martelli Gianluca Peco.
Replication Technologies at WLCG Lorena Lobato Pardavila CERN IT Department – DB Group JINR/CERN Grid and Management Information Systems, Dubna (Russia)
©Company confidential 1 Performance Testing for TM & D – An Overview.
CERN IT Department CH-1211 Genève 23 Switzerland t Streams new features in 11g Zbigniew Baranowski.
Next Generation GoldenGate(12c) vs. Streams for Physics Data Zbigniew Baranowski - CERN UKOUG 2013.
MOVE-4: Upgrading Your Database to OpenEdge® 10 Gus Björklund Wizard, Vice President Technology.
Oracle Recovery Manager (RMAN) 10g : Reloaded
SANPoint Foundation Suite HA Robert Soderbery Sr. Director, Product Management VERITAS Software Corporation.
LCG 3D StatusDirk Duellmann1 LCG 3D Throughput Tests Scheduled for May - extended until end of June –Use the production database clusters at tier 1 and.
CERN - IT Department CH-1211 Genève 23 Switzerland t The High Performance Archiver for the LHC Experiments Manuel Gonzalez Berges CERN, Geneva.
Oracle on Windows Server Introduction to Oracle10g on Microsoft Windows Server.
Profiling Grid Data Transfer Protocols and Servers George Kola, Tevfik Kosar and Miron Livny University of Wisconsin-Madison USA.
ORACLE GOLDENGATE AT CERN
Oracle Advanced Compression – Reduce Storage, Reduce Costs, Increase Performance Session: S Gregg Christman -- Senior Product Manager Vineet Marwah.
Selling the Storage Edition for Oracle November 2000.
Daniela Anzellotti Alessandro De Salvo Barbara Martelli Lorenzo Rinaldi.
1 Data Guard. 2 Data Guard Reasons for Deployment  Site Failures  Power failure  Air conditioning failure  Flooding  Fire  Storm damage  Hurricane.
1 Chapter 14 DML Tuning. 2 DML Performance Fundamentals DML Performance is affected by: – Efficiency of WHERE clause – Amount of index maintenance – Referential.
CERN Physics Database Services and Plans Maria Girone, CERN-IT
LFC Replication Tests LCG 3D Workshop Barbara Martelli.
Oracle® Streams for Near Real Time Asynchronous Replication Nimar S. Arora Oracle USA.
CERN - IT Department CH-1211 Genève 23 Switzerland t Oracle Real Application Clusters (RAC) Techniques for implementing & running robust.
Marcin Blaszczyk, Zbigniew Baranowski – CERN Outline Overview & Architecture Use Cases for Our experience with ADG and lessons learned Conclusions.
CERN-IT Oracle Database Physics Services Maria Girone, IT-DB 13 December 2004.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Implementation and performance analysis of.
High Availability in DB2 Nishant Sinha
CERN - IT Department CH-1211 Genève 23 Switzerland t High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN,
CERN IT Department CH-1211 Genève 23 Switzerland t Streams Service Review and Outlook Distributed Database Workshop PIC, 20th April 2009.
Database Competence Centre openlab Major Review Meeting nd February 2012 Maaike Limper Zbigniew Baranowski Luigi Gallerani Mariusz Piorkowski Anton.
Oracle for Physics Services and Support Levels Maria Girone, IT-ADC 24 January 2005.
CERN IT Department CH-1211 Geneva 23 Switzerland t WLCG Operation Coordination Luca Canali (for IT-DB) Oracle Upgrades.
Emil Pilecki Credit: Luca Canali, Marcin Blaszczyk, Steffen Pade.
CERN IT Department CH-1211 Geneva 23 Switzerland t Eva Dafonte Perez IT-DB Database Replication, Backup and Archiving.
11-Nov Distr. DB Operations workshop - November 2008 The PVSS Oracle DB Archive in ATLAS ( life cycle of the data ) Gancho Dimitrov (LBNL)
Maria Girone CERN - IT Tier0 plans and security and backup policy proposals Maria Girone, CERN IT-PSS.
Database CNAF Barbara Martelli Rome, April 4 st 2006.
Status of tests in the LCG 3D database testbed Eva Dafonte Pérez LCG Database Deployment and Persistency Workshop.
WLCG critical services update Andrea Sciabà WLCG operations coordination meeting December 18, 2014.
Markus Frank (CERN) & Albert Puig (UB).  An opportunity (Motivation)  Adopted approach  Implementation specifics  Status  Conclusions 2.
Meeting with University of Malta| CERN, May 18, 2015 | Predrag Buncic ALICE Computing in Run 2+ P. Buncic 1.
REMINDER Check in on the COLLABORATE mobile app Best Practices for Oracle on VMware - Deep Dive Darryl Smith Chief Database Architect Distinguished Engineer.
Cofax Scalability Document Version Scaling Cofax in General The scalability of Cofax is directly related to the system software, hardware and network.
CERN IT Department CH-1211 Genève 23 Switzerland t Load testing & benchmarks on Oracle RAC Romain Basset – IT PSS DP.
1 PVSS Oracle scalability Target = changes per second (tested with 160k) changes per client 5 nodes RAC NAS 3040, each with one.
Oracle Clustering and Replication Technologies UK Metadata Workshop - Oxford Barbara Martelli Gianluca Peco.
Marcin Bogusz CERN, PH-CMG WLCG Collaboration Workshop CMS online/offline replication Online/offline replication via Oracle Streams WLCG Collaboration.
WLCG Collaboration Workshop CMS online/offline replication
Replication using Oracle Streams at CERN
IT-DB Physics Services Planning for LHC start-up
Evolution of Data(base) Replication Technologies for WLCG
Lead SQL BankofAmerica Blog: SQLHarry.com
Database Services at CERN Status Update
A Technical Overview of Microsoft® SQL Server™ 2005 High Availability Beta 2 Matthew Stephen IT Pro Evangelist (SQL Server)
Windows Azure Migrating SQL Server Workloads
Oracle Streams Performance
Presentation transcript:

Replication solutions for Oracle database 11g Zbigniew Baranowski

Agenda Testing of replication solutions with real production workload Preparing the environment Comparison of results Conclusions 11g replication deployment for experiments databases Proposals of changes for 2012 Deployment strategy

Motivation Validation of technologies with real production data Generic tests rarely uncover bugs and problems Confidence when moving to production Tests have no impact on production environment! Measurable and valuable comparison of available replication solutions Exactly the same workload HW&SW configuration equal to production Isolated test environment

Setting up the environment Source Host Target Host TEST11G STRM11G TEST11G standby STREAMS GOLDEN GATE DATAGUARD

Setting up the environment STREAMS Redo log transport Source Host Target HostTEST11G STRM11G TEST11G standby STREAMS GOLDEN GATE DATAGUARD STREAMS

Performance Monitoring Streams 11g STRMMON – compatible with 11g GoldenGate Current out of the box monitoring does not offer much Custom scripts has been written DataGuard 11g STRMMON has been extended to offer performance history of standby database

Workload description Replication software configuration Default SW configuration Workload set #1 LHC file catalogue (LHCb) Data window of 13 days (28/05 – 10/06) 15GB of redo logs generated on source ~4M of operations (statements) 1,5M of inserts 1,5M of updates 1M of deletes 7M of row changes

Results for workload #1 Dataguard: Shipping database redo log files over the network to the target db

Performance results with workload #1 Database writer (DBW) process was a bottleneck due to CPU (100% on one core) Database writer (DBW) process was a bottleneck due to CPU (100% on one core) Random access to database data files was a bottleneck Random access to data files was a bottleneck Shipping database redo log files over the network to the target db Log mining of redo logs is very efficient Slow down due to big transaction processing

Resource utilization by workload#1 Almost no load on source Insignificant load on target Almost no load on source Insignificant load on target I/O (reads) only for shipping redo logs over the network No writes on the source I/O (reads) only for shipping redo logs over the network No writes on the source Besides redo download quite significant load of I/O system (reads and writes) Almost no load on source Some load on target (apply paralellism) Almost no load on source Some load on target (apply paralellism) No writes on source. Quite a lot of reads (logmining) No writes on source. Quite a lot of reads (logmining) Small amount of reads. High writes rato (~20MB/s) Small amount of reads. High writes rato (~20MB/s) Insignificant load on source and target High reads rates during logmining Modrate I/O reates

Workload description Replication software configuration DataGuard: 2x database writers GoldenGate: SQLBatch optimization enabled Workload set #2 LHC file catalogue (LHCb) Data window of 13 days (10/06 – 23/06) 17GB of redo logs generated on source ~6M of operations (statements) 2,5M of inserts 2M of updates 1,5M of deletes transactions in total ( ~ 10M of row changes)

Results for workload #2 BatchSQL disabled

Performance results with workload #2 BatchSQL disabled Database writers (DBW) process were still a bottleneck due to CPU (100% on one core) Database writers (DBW) process were still a bottleneck due to CPU (100% on one core) Random access to database data files was a bottleneck Random access to database data files

Resource utilization by workload#2 Without BatchSQL load is lower and performance better Target load increased by 1 but performance did not improve

Conclusions DataGuard was the fastest technology Streams were slower by 1.5 GoldenGate was slower by 2 Optimization of DG did not bring significant effects BatchSQL optimization brought down GG performance by 3 introducing overhead additionally No more SW optimization from Streams and Goldengate

Conclusions Resource utilization Insignificant load on source system (all) DataGuard while using log buffers does not need to read anything from disks on source db DataGuard writes data to replica most efficient (lowest write rates) Streams introduce highest load on target system (parallel composition of data changes) Streams required a lot of reads on source (~15MB/s) system but less on target (use of buffered queues)

Bug discoverd Streams DDL replication incomatibiities between RDBMS and STREAMS DDLDDL DDLDDL DDLDDL DDLDDL DDLDDL DDLDDL X X DDLDDL DDLDDL

Replication plans for 2012 No changes (Streams11g) ATLAS (foreseen in 2013) LHCb COMPASS Streams replacement with ADG CMS ALICE

CMS overview Databases Online (CMSONR) Size 3486 GB 588 schemas Offline (CMSR) Size 3459GB Replications 3 streamings (conditions, pvss and misc) 60% of CMSONR database size is replicated 92 schemas, 2078 GB many DDL updates many cross schema dependencies high workload - around 500 LCR/s (periodical data delivery latency on PVSS replication)

Safe streams replacement with ADG for CMS Redo Transport STREAMS PVSS STREAMS COND STREAMS MISC Redo Transport Other OFFLINE readers sessions PVSS, COND, MISC readers sessions Everybody is happy with ADG – we can drop streams - PVSS, COND replica schemas can be dropped. - CMSR RAC can be reduced - some storage space can be reclaim (also from STANDBY)

ALICE overview Databases Online (ALIONR) size: 2788GB 97 schemas Offline (PDBR) Size 1174GB Replications PVSS 2 schemas,431 GB (15% of ALIONR), avg workload 50 LCRs/s Replica size on PDBR: 349 GB (30% of db size)

Safe streams replacement with ADG for ALICE Redo Transport STREAMS PVSS PVSS readers sessions Everybody is happy with ADG – we can drop streams - PVSS replica schemas can be dropped. - some storage space can be reclaim on PDBR

Future plans Switchover and failover tests in double standby database environment Validation of active standby with CMS applications GodenGate 11.2 beta testing?