D0 Taking Stock1 By Anil Kumar CD/CSS/DSG June 06, 2005.

Slides:



Advertisements
Similar presentations
ITEC474 INTRODUCTION.
Advertisements

2 Copyright © 2005, Oracle. All rights reserved. Installing the Oracle Database Software.
Do MUCH More with Less Presented by: Jon Farley 2W Technologies.
F Fermilab Database Experience in Run II Fermilab Run II Database Requirements Online databases are maintained at each experiment and are critical for.
9 Copyright © Oracle Corporation, All rights reserved. Oracle Recovery Manager Overview and Configuration.
Agenda  Overview  Configuring the database for basic Backup and Recovery  Backing up your database  Restore and Recovery Operations  Managing your.
Virtual Network Servers. What is a Server? 1. A software application that provides a specific one or more services to other computers  Example: Apache.
CERN IT Department CH-1211 Geneva 23 Switzerland t CERN IT Department CH-1211 Geneva 23 Switzerland t
Director Product Management
Backup Rationalisation Reorganisation of the CERN Computer Centre Backups David Asbury IT/DS Friday 6 December 2002.
Introduction to Oracle Backup and Recovery
Backup & Recovery Concepts for Oracle Database
CERN IT Department CH-1211 Genève 23 Switzerland t Next generation of virtual infrastructure with Hyper-V Michal Kwiatek, Juraj Sucik, Rafal.
Castor F2F Meeting Barbara Martelli Castor Database CNAF.
Online Databases Status Oracle 8i Servers, release Platform: Compaq Tru64 UNIX V5.1 (Rev. 732) Production DB d0onprd, on d0olc, 64 users, 34Gb.
Acceleratio Ltd. is a software development company based in Zagreb, Croatia, founded in We create innovative software solutions for SharePoint,
Best Implementation Practices for Discoverer April Sims OCP 8i 9i.
LAN / WAN Business Proposal. What is a LAN or WAN? A LAN is a Local Area Network it usually connects all computers in one building or several building.
Day 10 Hardware Fault Tolerance RAID. High availability All servers should be on UPSs –2 Types Smart UPS –Serial cable connects from UPS to computer.
D0 DB Taking Stock ‘10 1 By Anil Garg – Database Services June 17, 2010.
The Mass Storage System at JLAB - Today and Tomorrow Andy Kowalski.
Bob Thome, Senior Director of Product Management, Oracle SIMPLIFYING YOUR HIGH AVAILABILITY DATABASE.
Operating in a SAN Environment March 19, 2002 Chuck Kinne AT&T Labs Technology Consultant.
Fermilab Oct 17, 2005Database Services at LCG Tier sites - FNAL1 FNAL Site Update By Anil Kumar & Julie Trumbo CD/CSS/DSG FNAL LCG Database.
Online Database Support Experiences Diana Bonham, Dennis Box, Anil Kumar, Julie Trumbo, Nelly Stanfield.
D0 Taking Stock1 By Anil Kumar CD/CSS/DSG July 10, 2006.
11 Copyright © Oracle Corporation, All rights reserved. RMAN Backups.
15 Copyright © 2005, Oracle. All rights reserved. Performing Database Backups.
Chapter 8 Implementing Disaster Recovery and High Availability Hands-On Virtual Computing.
Backup & Recovery Backup and Recovery Strategies on Windows Server 2003.
Technical Details – SAN PHARMA SFA. Front End / Back End Details  ASP  ASP.net  XML  JAVA Script  DHTML  MS SQL SERVER.
CDF Taking Stock ‘08 1 By Anil Kumar CD/LSCS/DBI/DBA July 16, 2008.
CERN - IT Department CH-1211 Genève 23 Switzerland t Tier0 database extensions and multi-core/64 bit studies Maria Girone, CERN IT-PSS LCG.
Databases March 14, /14/2003Implementation Review2 Goals for Database Architecture Changes Simplify hardware architecture Improve performance Improve.
15 Copyright © 2007, Oracle. All rights reserved. Performing Database Backups.
A Guide to Oracle9i1 Database Instance startup and shutdown.
Tape logging- SAM perspective Doug Benjamin (for the CDF Offline data handling group)
SLAC Site Report Chuck Boeheim Assistant Director, SLAC Computing Services.
06/22/2005CDF Taking Stock CDF Taking Stock By Anil Kumar CD/CSS/DSG June 22, 2005.
CD FY10 Budget and Tactical Plan Review FY10 Tactical Plans for Database Services Nelly Stanfield October 7, 2009 Database Services3425-v1.
CERN - IT Department CH-1211 Genève 23 Switzerland t Oracle Real Application Clusters (RAC) Techniques for implementing & running robust.
(WINDOWS PLATFORM - ITI310 – S15)
CDF DB Taking Stock ‘10 1 By Anil Garg – Database Services Aug 18, 2010.
The Million Point PI System – PI Server 3.4 The Million Point PI System PI Server 3.4 Jon Peterson Rulik Perla Denis Vacher.
CERN Database Services for the LHC Computing Grid Maria Girone, CERN.
1 D0 Taking Stock By Anil Kumar CD/LSCS/DBI/DBA June 11, 2007.
CERN - IT Department CH-1211 Genève 23 Switzerland t High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN,
Reliability of KLOE Computing Paolo Santangelo for the KLOE Collaboration INFN LNF Commissione Scientifica Nazionale 1 Roma, 13 Ottobre 2003.
ORACLE & VLDB Nilo Segura IT/DB - CERN. VLDB The real world is in the Tb range (British Telecom - 80Tb using Sun+Oracle) Data consolidated from different.
Maria Girone CERN - IT Tier0 plans and security and backup policy proposals Maria Girone, CERN IT-PSS.
CNAF Database Service Barbara Martelli CNAF-INFN Elisabetta Vilucchi CNAF-INFN Simone Dalla Fina INFN-Padua.
BNL dCache Status and Plan CHEP07: September 2-7, 2007 Zhenping (Jane) Liu for the BNL RACF Storage Group.
19 Copyright © 2004, Oracle. All rights reserved. Database Backups.
Virtual Server Server Self Service Center (S3C) JI July.
Office of Administration Enterprise Server Farm Managed Services August 2004 Briefing.
March, Database Projects J.Trumbo CSS-DSG May,
Extending Auto-Tiering to the Cloud For additional, on-demand, offsite storage resources 1.
Configuring SQL Server for a successful SharePoint Server Deployment Haaron Gonzalez Solution Architect & Consultant Microsoft MVP SharePoint Server
ETL Validator Deployment Options
Monitoring Storage Systems for Oracle Enterprise Manager 12c
Backup & Recovery of Physics Databases
Building a Virtual Infrastructure
By Anil Kumar CD/CSS/DSG June 06, 2005
Database Services at Fermilab
Monitoring Storage Systems for Oracle Enterprise Manager 12c
Oracle Database Monitoring and beyond
Upgrading to Microsoft SQL Server 2014
Backup Monitoring – EMC NetWorker
Backup Monitoring – EMC NetWorker
PerformanceBridge Application Suite and Practice 2.0 IT Specifications
Presentation transcript:

D0 Taking Stock1 By Anil Kumar CD/CSS/DSG June 06, 2005

D0 Taking Stock2 Production/Integration Infrastructure 8 900MHz CPU 16Gb RAM The machine has a Clariion 4500 hardware raid array with 80 drives. Oracle Server (64 bit) on Solaris bit. Load Avg 2 –3 CPU usage < 50%, Memory Free : 61% Average Db Response time/execute ~ secs Uptime excluding schedule down times % Uptime (based on 420 min of total db unavailability) since Nov 15, 2004 System Performance : Db Performance Charts :

D0 Taking Stock3 D0 offline development Infrastructure MHz, 4GB of RAM OS and Oracle Version same as of int/prd. 64bit OS and 64 bit Oracle Load Avg 1-2, CPU usage 10-15%, Mem Free 7.8% System Performance URL

D0 Taking Stock4 D0 Calib Servers Deployment Infrastructure Sun Solaris Linux PC for Analysis Farm Linux PC Failover Linux PC for Reconstructed Farm Note : There was 1 failure for User Servers and Farm Servers since Nov 15, 2005

D0 Taking Stock5 Space Usage

D0 Taking Stock6 Space Usage Summary D0ofprd1 786 GB used. d0ofint1 77GB used. 800Gb is available for use for int and production. d0ofdev1 82 GB Used 190GB is available for use.

D0 Taking Stock7 Capacity Planning Next three years expected Growth 825Gb. SAM growth 250Gb/year and other apps 25Gb/year. This exclude Luminosity DB We have around 800Gb available. Should start planning to upgrade Disk Capacity Next Year. Luminosity growth is 125Gb/year. Sun v40z machine with a Sun StorEdge 3310 scsi disk array w/ tb & 2 Ultra160 raid controllers. URL for Capacity Planning : css.fnal.gov/dsg/internal/d0_ofl_dbs/D0_database_servers(sun)/d0o ra_index_page/d0ora2_d0ofprd/d0ora2_disk_planning.htm

D0 Taking Stock8 Accomplishments Upgraded D0 offline databases to Quarterly Database Security Up-to-date Tested Complete Database Recovery of d0ofprd1 database. It took 4 hours. This assumes hardware is already configured and Backup files are available on disk. Moving d0 offline to a standardized backup recovery method using a san and enstore. Parallel testing of san as backup media for development and production instances going well. Luminosity db deployed 9i and 10g versions on loaner CDF machine

D0 Taking Stock9 Monitoring And Data Modeling Tools Monitoring Tools : dbatool/toolman To monitor the space usage, users, SQL, tempspace, sniping of inactive sessions, auto start of Listener, IA, estimate table/Index stats OEM (Oracle Enterprise Manager) - DB Monitoring tool/ Monthly charts posted on web Db Performance Charts : cdserver.fnal.gov/cd_public/css/dsg/db_stats/data/db_stats.html The url for the ganglia charts (monitoring tools) is: Data Modeling Tool : Oracle Designer is used for Data Modeling and space estimates.

D0 Taking Stock10 Back-up/Recovery D0ofprd1 - Daily, 7 days of archives, one always on DISK - Bi-weekly backup of READ ONLY tablespaces - Allocated 1179GB Used 755GB, Tape Daily, RMAN Back-up time - > 5 Hrs 45Min ( 3 Hrs Excl READ ONLY + 2 Hrs 45 Exclude READ ONLY ) No Export Tape Rotation : 1 Week for Daily backups and 2 months for Read Only backups. D0ofint1 Once a week on SAN D0ofdev1 - Daily 3 days of archives Sat on DISK otherwise on SAN -Allocated 100GB, used 58GB, Daily Tape Backup RMAN Backup time -> 1.5 Hr. Tape Rotation : 2 Months. - backup strategy for d0of lum boxes will be the SAN centralize strategy

D0 Taking Stock11 RMAN Backup on SAN Inexpensive, large disk array can accommodate growing RMAN backups Fast & reliable backup and recovery 24 x 7 and 8 x 5 support tiers available Can serve various O/S platforms Briefing on the database backup/recovery standardization on june 16, it will discuss the san testing in more detail Multiplexing of archives to local disk and SAN

D0 Taking Stock12 RMAN to SAN test case on d0ofdev1 d0ofdev1 RMANs to SAN since Nov. ’04 Two 1TB SAN mount points available Keep 2 alternating days of RMANs on SAN, once/week to local backup disk RMAN validation to determine backup file integrity One validation failure since Nov. ’04 Recoveries from SAN were all successful

D0 Taking Stock13 Production backups to SAN Initial problems encountered due to incompatible PCI cards – solved now Two 1TB SAN mount points in use 2 daily backups – one to SAN, one to local backup disk Always 2 backups on disk, plus X200 tape library backup of RMAN from local disk Read-only portion of database backed up twice/month to local backup disk

D0 Taking Stock14 SAN issues Current SAN is not 24 x 7 support IDE disks are not as reliable as other, more expensive disks are Purchasing 24 x 7 SAN requires licensing and changes to O/S to be able to use it Firewall issues (CDF & D0 online)

D0 Taking Stock15 SAM Schema Production Deployments : - Autodestination Sub-System of SAM schema - Indexes on Param Values Deployed in production. - Data Types correction cut. - Indexes for Volumes to be deployed on 06/07/05 - Partition Cut to be deployed on 06/07/05 Work-in-progress - Request Sub System of SAM Schema. Cut in Mini-sam. Upgrade to Mini SAM as SAM Schema Evolved. -> This facilitate individual developers to have copy of SAM metadata and seed data available for server software rewrite if needed. Mini-SAM in Postgres. Initiative to move towards free ware Databases for SAM Proof of product not complete, requires testing with a dbserver from the sam development team 1.61B events in 32 Partitions. Now Avg 1 partition/ 3 running weeks Partitions Rollover dates URL :

D0 Taking Stock16 What’s Next ? Deploy san/enstore backup recovery plan. ( TESTING OF SAN on d0ofprd1 is work-in-progress) Deployment of Lum Db in production. 10g. Possible Upgrade to 10g to d0ofprd1 due to enhanced feature of incremental database backups. Upgrade OEM to 10g Rewrite of dbatools/toolman for enhanced features of monitoring and 10g support. SAM Schema Deployment for SAM Request System. Testing of postgres mini sam for proof of product.

D0 Taking Stock17 Concerns Backups will get bigger. So backup of VLDB Speaker Bureau application to be moved to production ASAP. It is on dev being used in production mode. SAM Servers on Linux ? Not Enough Space for Integration db to do full refresh of SAM. Single point of failures with D0 offline database. future of the aging clarion array must be addressed in next budget.