CERN IT Department CH-1211 Genève 23 Switzerland www.cern.ch/it Possible Service Upgrade Jacek Wojcieszuk, CERN/IT-DM Distributed Database Operations.

Presentation transcript:

CERN IT Department CH-1211 Genève 23 Switzerland
Possible Service Upgrade
Jacek Wojcieszuk, CERN/IT-DM
Distributed Database Operations Workshop, April 20th, 2009

Outline
  Hardware
    – CPU/Memory
    – Storage
  OS
    – RHEL 5
  Oracle Software
    – Data Guard
    – Oracle 11g
  Procedures
    – Backup&Recovery
    – Data Lifecycle Management

Hardware
  Although CPU clock speeds are no longer increasing, new processors continue to offer better performance
    – More cores
    – Improved microarchitectures
    – Faster and bigger caches
  Intel Nehalem
    – Expected to be up to twice as fast as the currently used CPUs: clossons-silly-little-benchmark-is-silly-fast-on-nehalem/
    – To be released this year
    – Very promising for Oracle customers paying per-core licenses, although Oracle licensing is not clear yet

More on hardware
  Memory
    – A critical factor for the performance of Oracle RDBMS
    – Growing databases require bigger caches
        To keep IOPS reasonable
  Storage
    – 10k RPM disks are getting cheaper
        WD Raptor: ~140 IOPS, ~$300
        Capacity grows as well
    – Bigger SATA disks
        Up to 2 TB
        Making on-disk backups cheaper and cheaper
    – New promising interconnect technologies: iSCSI (1 Gb or 10 Gb)
    – See Luca’s talk for more details

Latest hardware acquisition
  RAC7
    – Ordered last year, will be put in production in the coming weeks
    – 20 blade servers split into 2 chassis
        Dual quad-core, 16 GB of RAM
    – 32 x 12-bay disk arrays equipped with WD Raptor disks (10k RPM, 300 GB)
    – 12 x 12-bay disk arrays equipped with 1 TB SATA disks (7.2k RPM)
    – Will host the CMSR, LCGR, LHCBR and Downstream Capture databases
  [Diagram labels: SAN, datafiles, on-disk backup]
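The array counts above translate into raw aggregate figures with simple arithmetic. The Python sketch below is only an illustration: the helper name `array_totals` is hypothetical, the ~140 IOPS per WD Raptor disk is the figure quoted on the storage slide, and the results are raw totals before any mirroring, RAID or spare-disk overhead.

```python
from typing import Optional, Tuple

def array_totals(arrays: int, bays: int, disk_gb: int,
                 disk_iops: Optional[int] = None) -> Tuple[int, float, Optional[int]]:
    """Return (disk count, raw capacity in decimal TB, raw random IOPS or None)."""
    disks = arrays * bays
    capacity_tb = disks * disk_gb / 1000      # 1 TB taken as 1000 GB
    iops = disks * disk_iops if disk_iops is not None else None
    return disks, capacity_tb, iops

# 32 x 12-bay arrays of 300 GB WD Raptor disks, ~140 IOPS each (slide figures)
raptor = array_totals(32, 12, 300, 140)
# 12 x 12-bay arrays of 1 TB SATA disks (no IOPS figure given on the slides)
sata = array_totals(12, 12, 1000)

print(raptor)  # (384, 115.2, 53760)
print(sata)    # (144, 144.0, None)
```

Usable capacity and IOPS will be lower than these raw numbers once redundancy is accounted for.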

RAC7

Next hardware acquisition
  RAC8 and RAC9
    – Replacement for RAC3 and RAC4
    – Should be put in production in late autumn 2009
    – 40 blade servers grouped into 4 chassis
        Nehalem CPUs
        At least 24 GB of RAM
    – 60 x 12-bay disk arrays
        Big SATA disks (2 TB most likely)
        iSCSI solutions considered

Operating System
  RHEL 5 seems to be mature enough to be used for production
    – 3rd update already released
  A few minor improvements:
    – Bigger file systems allowed
    – Improved TCP/IP stack (important for iSCSI)
    – Improved (?) DevMapper management
  Red Hat ends support for RHEL 4 in 2012
  Some features of Oracle 11gR2 are not supported on RHEL 4
  At least 2 possible migration scenarios:
    – Rolling upgrade: node by node
    – Data Guard and switchover

Data Guard Physical Standby Database
  Mature and stable technology
  The most reliable Oracle recovery solution
    – Simplifies and speeds up both full database recovery and the handling of human errors
    – Less error-prone than RMAN-based recovery
    – Very important for the on-line databases of CMS, LHCb and ALICE, which are not backed up to tape
  In conjunction with flashback logging, it allows various application tests to be run
  A lot of experience gathered in past years:
    – Data Guard for migrations
    – Pilot setup during the 2008 run
  Plans to deploy standby databases for the majority of production systems

Data Guard Physical Standby Database
  [Diagram: primary RAC database with ASM; data changes sent over WAN/intranet; physical standby RAC database with ASM]

Data Guard - configuration details
  Standby databases configured as RAC
    – Servers sized to handle a moderate application load
    – Enough disk space to fit all the data and a few days of archived redo logs
  Identical Oracle RDBMS version on primary and standby
    – The same patch level
  Asynchronous data transmission with the LGWR process
    – Standby redo logs necessary
    – Only the last few transactions can be lost
  Shipped redo data applied with a 24-hour delay
  Flashback logging enabled with 48 hours of retention
  Data Guard Broker
    – To simplify administration and monitoring
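One way to read the 24-hour apply delay and the 48-hour flashback retention above is as successive recovery windows for a human error. The Python sketch below is an illustration only: the function name, the decision rule and the option labels are assumptions made here, not a procedure described in the talk.

```python
from datetime import timedelta

# Values taken from the configuration slide
APPLY_DELAY = timedelta(hours=24)          # redo applied on the standby 24 h late
FLASHBACK_RETENTION = timedelta(hours=48)  # flashback logs kept for 48 h

def recovery_option(age_of_error: timedelta) -> str:
    """Suggest a recovery path from the time elapsed since the error was made.

    Hypothetical rule: prefer the delayed standby while the erroneous change
    has not yet been applied there, then flashback while the logs still cover
    the error, and fall back to a tape restore afterwards.
    """
    if age_of_error < timedelta(0):
        raise ValueError("age_of_error must not be negative")
    if age_of_error < APPLY_DELAY:
        return "delayed standby"
    if age_of_error < FLASHBACK_RETENTION:
        return "flashback"
    return "tape restore"

for hours in (3, 30, 72):
    print(hours, "h ->", recovery_option(timedelta(hours=hours)))
```

Under these assumed rules, errors noticed after more than 48 hours fall outside both windows, which is the case tape backups remain necessary for.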

Oracle 11g
  Many interesting features:
    – ASM fast mirror resync
        A short unavailability of one side of the ASM mirror will not result in disk eviction
    – Active Data Guard
        The standby database can be continuously used for query/reporting activity
    – Real Application Testing (Database Replay)
        Allows replaying a captured workload on demand
        Potentially very useful for testing and optimization

Oracle 11g
  Other interesting 11g features:
    – SQL Plan Management
        Prevents Oracle from switching to a worse execution plan
    – Oracle SecureFiles (LOBs)
        Improved LOB performance
        Compression and encryption
    – Interval-based partitioning
  Even more features in 11gR2
    – Hopefully will be released in autumn
  We plan to go straight to 11gR2
    – In production after the first patchset
    – On integration probably several months earlier

Backup&Recovery
  Even with standby databases in place, tape backups will remain the key element of the backup&recovery strategy
    – Certain types of failure require a restore from tape:
        Disasters
        Human errors discovered a long time after they were made
  Backing up to tape and restoring those backups is a challenging task as the databases get bigger and bigger

Current tape backup architecture at CERN
  Traditionally at CERN, tape backups are sent over a general-purpose network to a media management server:
    – This limits backup/recovery speed to ~100 MB/s
    – Backup/restore of a 10 TB database takes almost 30 hours!
    – At the same time, tape drives can archive data at 160 MB/s (compressed)
  [Diagram: Database, TSM Server, Tape drives; RMAN backups and metadata sent over a 1 Gb network]
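The "almost 30 hours" figure follows directly from the database size and the network-limited rate. A minimal Python check, assuming decimal units (1 TB = 1,000,000 MB) and a perfectly sustained transfer rate; the helper name `transfer_hours` is hypothetical:

```python
def transfer_hours(size_tb: float, rate_mb_s: float) -> float:
    """Hours needed to move size_tb terabytes at a sustained rate in MB/s."""
    size_mb = size_tb * 1_000_000   # decimal units: 1 TB = 1,000,000 MB
    return size_mb / rate_mb_s / 3600

print(round(transfer_hours(10, 100), 1))  # 27.8 hours at the ~100 MB/s network limit
print(round(transfer_hours(10, 160), 1))  # 17.4 hours if a 160 MB/s tape drive were the limit
```

Real backups add overhead on top of this lower bound, so the network is the clear bottleneck in this architecture.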

LAN-free tape backups
  Tivoli Storage Manager supports so-called LAN-free backup
  When using a LAN-free configuration:
    – Backup data flows to the tape drives directly over the SAN
    – Media Management Server used only to register backups
    – Very good performance observed during tests (see next slide)
  [Diagram: Database, TSM Server, Tape drives; RMAN backups sent to the tape drives, metadata sent to the TSM server over a 1 Gb network]

LAN-free tape backups - tests
  1 TB test database with contents very similar to one of the production DBs
  Different TSM configurations:
    – TCP and Shared Memory mode
  Backups taken using 1 or 2 streams

                 TCP         Shared mem
    1 stream     198 MB/s    231 MB/s
    2 streams    361 MB/s    402 MB/s

  Restore tests done using 1 stream only
    – Performance of a test with 2 streams affected by Oracle software issues (bug )

                 TCP         Shared mem
    1 stream     150 MB/s    158 MB/s
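The backup figures above show how well throughput scales when going from 1 to 2 streams. The Python sketch below derives the speedup and per-stream efficiency from the measured values only; the dictionary layout and variable names are illustrative choices.

```python
# Measured LAN-free backup throughput in MB/s, keyed by TSM mode and stream count
backup_mb_s = {
    "TCP":        {1: 198, 2: 361},
    "Shared mem": {1: 231, 2: 402},
}

scaling = {}
for mode, rates in backup_mb_s.items():
    speedup = rates[2] / rates[1]   # gain from adding the second stream
    efficiency = speedup / 2        # 1.0 would be perfect linear scaling
    scaling[mode] = (round(speedup, 2), round(efficiency, 2))
    print(f"{mode}: speedup {speedup:.2f}x, efficiency {efficiency:.0%}")

# TCP: speedup 1.82x, efficiency 91%
# Shared mem: speedup 1.74x, efficiency 87%
```

Both modes scale close to linearly with two streams, and even a single stream is about twice as fast as the ~100 MB/s limit of the network-based setup.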

LAN-free backup alternatives
  Using a disk pool instead of tape drives
  10 Gb network for backups
    – Will be tested in the next few weeks
  [Diagram: Database, TSM Server, Tape drives; RMAN backups and metadata sent over a 10 Gb network]

A disk pool instead of tapes - test
  Two tested configurations:
    – SAN-attached storage with a file system
    – SAN-attached storage with ASM
    – 1 disk array, 2 x 8 disk RAID 5
    – Test performed with 4 streams
  [Diagram: Database sends RMAN backups over the Storage Area Network to remote storage]

  RAID 5                  ext3        ASM
    4 streams (backup)    235 MB/s    369 MB/s

Summary
  No revolutionary changes this year
    – Move to RHEL 5 should be quite smooth
    – Migration to 11gR2 will happen next year at the earliest
  We are ready for data growth
    – Enough hardware resources allocated
  Many procedure improvements
    – Backup&Recovery
    – Dealing with big data volumes
    – Security

Q&A
  Thank You