Turning Practice into Perfect Implementing Fathom 2.0 Adam Backman White Star Software

Slides:



Advertisements
Similar presentations
Computer-System Structures Er.Harsimran Singh
Advertisements

Chapter 9. Performance Management Enterprise wide endeavor Research and ascertain all performance problems – not just DBMS Five factors influence DB performance.
J. Espen Stokke Professional Services Manager.
Skyward Disaster Recovery Options
SQL Server Disaster Recovery Chris Shaw Sr. SQL Server DBA, Xtivia Inc.
Protect Your Business and Simplify IT with Symantec and VMware Presenter, Title, Company Date.
Recovery Planning A Holistic View Adam Backman, President White Star Software
REDUNDANT ARRAY OF INEXPENSIVE DISCS RAID. What is RAID ? RAID is an acronym for Redundant Array of Independent Drives (or Disks), also known as Redundant.
1 Storage Today Victor Hatridge – CIO Nashville Electric Service (615)
High Availability Group 08: Võ Đức Vĩnh Nguyễn Quang Vũ
Skyward Server Management Options Mike Bianco. Agenda: Managed Services Overview OpenEdge Management / OpenEdge Explorer OpenEdge Managed Demo.
Reliability Week 11 - Lecture 2. What do we mean by reliability? Correctness – system/application does what it has to do correctly. Availability – Be.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 11: Monitoring Server Performance.
Concepts of Database Management Seventh Edition
Maintaining and Updating Windows Server 2008
Barracuda Networks Confidential1 Barracuda Backup Service Integrated Local & Offsite Data Backup.
 Network Management  Network Administrators Jobs  Reasons for using Network Management Systems  Analysing Network Data  Points that must be taken.
Simplify your Job – Automatic Storage Management Angelo Session id:
November 2009 Network Disaster Recovery October 2014.
SQL Server 2008 Implementation and Maintenance Chapter 7: Performing Backups and Restores.
Chapter 10 : Designing a SQL Server 2005 Solution for High Availability MCITP Administrator: Microsoft SQL Server 2005 Database Server Infrastructure Design.
CS 352 : Computer Organization and Design University of Wisconsin-Eau Claire Dan Ernst Storage Systems.
Database Storage Considerations Adam Backman White Star Software DB-05:
Data Center Infrastructure
Maintaining a Microsoft SQL Server 2008 Database SQLServer-Training.com.
Business Continuity and Disaster Recovery Chapter 8 Part 2 Pages 914 to 945.
Concepts of Database Management Sixth Edition
Guide to Linux Installation and Administration, 2e 1 Chapter 9 Preparing for Emergencies.
Top Performance Enhancers Top Performance Killers in Progress Dan Foreman Progress Expert
15 Copyright © 2005, Oracle. All rights reserved. Performing Database Backups.
Concepts of Database Management Eighth Edition
Module 7: Fundamentals of Administering Windows Server 2008.
Top 10 Performance Hints Adam Backman White Star Software
CHAPTER 2: COMPUTER-SYSTEM STRUCTURES Computer system operation Computer system operation I/O structure I/O structure Storage structure Storage structure.
© Pearson Education Limited, Chapter 16 Physical Database Design – Step 7 (Monitor and Tune the Operational System) Transparencies.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment, Enhanced Chapter 11: Monitoring Server Performance.
NOAA WEBShop A low-cost standby system for an OAR-wide budgeting application Eugene F. Burger (NOAA/PMEL/JISAO) NOAA WebShop July Philadelphia.
Databases Lesson 5.
Service Transition & Planning Service Validation & Testing
©2006 Merge eMed. All Rights Reserved. Energize Your Workflow 2006 User Group Meeting May 7-9, 2006 Disaster Recovery Michael Leonard.
15 Copyright © 2007, Oracle. All rights reserved. Performing Database Backups.
MCTS Guide to Microsoft Windows Vista Chapter 4 Managing Disks.
Creating a complete recovery solution Adam Backman Partner, White Star Software.
Mark A. Magumba Storage Management. What is storage An electronic place where computer may store data and instructions for retrieval The objective of.
Overview Managing a DHCP Database Monitoring DHCP
A Brief Documentation.  Provides basic information about connection, server, and client.
INFORMATION MANAGEMENT Unit 2 SO 4 Explain the advantages of using a database approach compared to using traditional file processing; Advantages including.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment, Enhanced Chapter 11: Monitoring Server Performance.
COMP-1: OpenEdge® Management Overview and Demo Principal Technical Support Engineer Cyril E. Gleiman.
Serverless Network File Systems Overview by Joseph Thompson.
Business Data Communications, Fourth Edition Chapter 11: Network Management.
1 OpenEdge Management Jarmo Nieminen Senior Technical Account Manager
VMware vSphere Configuration and Management v6
Using Progress® Analytical Tools Adam Backman White Star Software DONE-05:
Network management Network management refers to the activities, methods, procedures, and tools that pertain to the operation, administration, maintenance,
Component 8/Unit 9aHealth IT Workforce Curriculum Version 1.0 Fall Installation and Maintenance of Health IT Systems Unit 9a Creating Fault Tolerant.
TRUE CANADIAN CLOUD Cloud Experts since The ORION Nebula Ecosystem.
Hands-On Microsoft Windows Server 2008 Chapter 7 Configuring and Managing Data Storage.
Oracle Standby Implementation Tantra Invedy. Standby Database Introduction Fail over Solution Disaster Recovery Solution if remote Ease of implementation.
A Solution for Maintaining File Integrity within an Online Data Archive Dan Scholes PDS Geosciences Node Washington University 1.
Monitoring Windows Server 2012
Adam Backman Chief Cat Wrangler – White Star Software
Data Center Infrastructure
Maximum Availability Architecture Enterprise Technology Centre.
Lec3: Network Management
Introduction of Week 6 Assignment Discussion
Upgrading to Microsoft SQL Server 2014
Backup Monitoring – EMC NetWorker
Lec1: Introduction to Network Management
Presentation transcript:

Turning Practice into Perfect Implementing Fathom 2.0 Adam Backman White Star Software

Notes ä All of the information covered in this presentation is covered in a new portion of the Progress documentation called: OpenEdge Revealed Mastering the Progress Database with Fathom ä A system works as a whole and not as a sum of its parts so the presentation is written the same way. With this in mind please hold your questions until the end

Presentation Goal ä Explain how to use Fathom ä Implementing best practices ä Not here to teach in-depth System Administration ä Point you to other presentations for more in- depth information

The Operation’s Challenge ä Constant reactive mode ä Manual processes ä Poor reporting ä Unplanned downtime ä Cannot plan for growth ä Poor System Performance Resulting In: n n Unpredictable Operations n n Exposure to Errors n n Incomplete Information n n Frustrated End Users n n Frustrated Administrators

Goals of a Well Maintained System ä Resiliency – The ability to recover ä Availability – Provide maximum uptime ä Performance – Consistency despite system load Fathom can help achieve these goals

Roadmap ä What are best practices? ä What is Fathom? ä Providing a resilient system ä Making your system highly available ä Providing consistent performance

What are Best Practices? ä Defined processes to follow ä Consistent verifiable outcome ä End result – well maintained system

Defined Process to Follow ä Must have clear goals ä Functional ä Business ä Document where you are now and how you are going to achieve your goals Munich Paris London Amsterdam Prague Paris

Defined Verifiable Outcome ä Know what you expect ä Know what you are getting ä Test completely prior to implementation ä Unit testing ä End-to-end testing

Well Maintained System ä Ability to support 24 hour operations with only scheduled outages for upgrades and maintenance ä Ability to recover from disaster with little or no data loss and minimal interruption to operations ä Ability to support the changing needs of the business with little or no performance degradation during times of heavy processing

Roadmap ä What are best practices? ä What is Fathom? ä Providing a resilient system ä Making your system highly available ä Providing consistent performance

What is Fathom? ä Java-based management console and agent ä Management console ä Provides interface to the agent ä Provides an interface to the Fathom Database ä Allows for definition of alerts ä Fathom agent ä Collector of operating system resource information ä Collector of Progress database management information

Dictionary ä Resource – Anything Fathom can monitor or trend ä Schedule – A defined timeframe when a resource is available for monitoring, alerting, and trending ä Poll – The process of gathering information about a resource ä Rule – a performance requirement that can be evaluated ä Alert – A response to a rule being broken ä Action – A process to be performed in response to an alert ä Trending – The process of storing performance and audit data in the Fathom Trend Database ä Monitoring – Performing polling, evaluating rules, generating alerts, executing actions, and trending of a resource within a scheduled timeframe.

Progress Fathom Architecture Fathom DB Fathom DB ProductionDBProductionDB Memory Disk CP U Net* Log File System

Fathom Architecture Multiple Sites Fathom DB Fathom DB Fathom DB Fathom DB

Fathom Architecture Monitor Locally\Trend Remotely Fathom DB Fathom DB

Fathom Architecture 2.0 Monitor/Trend Database Remotely DB Agent Fathom DB Fathom DB Fathom DB Fathom DB

DB Agent Fathom Architecture 2.0 Monitor & Trend Anywhere Fathom DB Fathom DB Fathom

Fathom Architecture Manage from One Browser Fathom DB Fathom DB DB Agent Fathom DB Fathom DB Fathom DB Fathom DB

Roadmap ä What are best practices? ä What is Fathom and how does it work? ä Providing a resilient system ä Making your system highly available ä Providing consistent performance

Resiliency ä Redundancy ä Developing an effective recovery plan ä Monitoring for problem avoidance

Redundancy ä Disk ä RAID ä Raid Levels ä Dos and Don’ts ä After imaging ä Memory

RAID ä Redundant Array of Inexpensive Disks Patterson, Gibson and Katz at the University of California Berkeley (1987) ä Common RAID Levels ä RAID 0 – striping ä RAID 1 – mirroring ä RAID 10 or 0+1 – Striped with mirrors ä RAID 5 – Striped with calculated parity

RAID: Dos and Don’ts ä Do: ä Use RAID 10 for randomized storage ä Use RAID 1 for sequential storage ä Use RAID 5 for READ-ONLY data ä Don’t ä Use RAID 5 for OLTP ä Use RAID 0 for data storage

Memory Interleaving Memory interleaving works like RAID 0 for memory. While there are significant potential performance gains from interleaving memory you run the risk of having one faulty memory chip bring down your application.

Resiliency: Recovery Planning ä Who in involved in the process? ä What gets backed up? ä Where do we backup up our data ä Where do we store the physical backup? ä When do we do a backup? ä Why do a backup at all? ä How can Fathom help?

Who is Involved in Recovery Planning? ä Technical people ä They understand what is possible ä Business people ä They understand what is needed and the cost of downtime ä Management ä They understand where the business is headed and what can be afforded

What is Included on the Backup? ä More than just a database backup ä Database ä Application ä Other Files ä Physical backup ä Secondary machine room ä Additional Hardware ä Infrastructure

Where Do We Backup To? ä Capacity – How much do you need to store? ä Removable – To allow off-site archival ä Reliable – It must work every time ä Compatible – Keeps your options open

Where to Store your Backup? ä Formal service ä 24 hour access ä Secure ä Highly disaster resistant ä Separate location (different building) ä Inexpensive ä Greater need for planning (access, security, disaster, etc.)

When to do a Backup? ä As often as practical ä A once a day backup will cause you to loose up to 24 hour of processing in the worst case ä Fill in with after imaging ä Store AI on different disk ä Archive AI files throughout the day ä Keep warm standby to reduce downtime

Why do a Backup? ä Reduce data loss ä Build user confidence ä Keep your job

How Can Fathom Help? Scheduling ä Consistent schedule that is not forgotten ä Pro-active notification if there is a problem ä Fathom 2.0 Job Templates

How Can Fathom Help? Reporting ä Processing time is captured ä Historical trend report of backup ä Audit trail

Resiliency: Problem Avoidance ä Common problem areas: ä Disk full problems ä Database extents filling fast

Fathom: Disk Monitoring ä Disk view ä Monitoring disks other than database ä Graphical view of what disks look like

Roadmap ä What are best practices? ä What is Fathom and how does it work? ä Providing a resilient system ä Making your system highly available ä Providing consistent performance

Availability ä Reducing the impact of unplanned events ä Planning for system growth ä Reducing impact of change to the user ä Scheduling Online Utilities

Planning for System Growth ä Trending allows for patterns to be viewed and acted upon ä Trending allows for operational thresholds to be established ä Trending allows for advanced planning so maintenance can be scheduled when convenient for the business

Fathom: Disk Trending ä Correlating database and disk trends ä Month by Month, Week by Week or Day by Day it is your choice ä Fill rates and activity of each disk

Fathom: Storage Area Trending ä Fill rate ä Activity by area ä This information can show a need to spread data even further

Fathom: Table and Index Trending – database analysis ä Predicting table growth ä Predicting Index growth ä Index compaction rates can be monitored and actions can be taken if the compaction drops below a certain level ä Utilization of each table and index can also be tracked and viewed in other areas of fathom

Fathom: Memory Trending ä Focus on paging and swapping rather than utilization ä This is currently a weak area within the Fathom product

Fathom: CPU Trending ä Look at Idle ä Look at the ratio between User and System ä High system time can indicate an incorrect value for –spin or High paging or swapping

Roadmap ä What are best practices? ä What is Fathom and how does it work? ä Providing a resilient system ä Making your system highly available ä Providing consistent performance

Performance ä Performance is relative ä Fast is overrated ä Fathom can help find tough problems

Performance is Relative ä What is a baseline? ä Determining your baselines ä How Fathom can help ä Important indicators ä Who is your canary?

Determining your baseline ä Good baseline guidelines ä Often accessed portions on the application ä High customer impact ä End to End (Time to enter an order) ä Bad baseline ä Year-end process ä Management reporting (in most cases) ä Little used portions of the application

Components of Performance ä Network ä Disk ä Memory ä CPU

Issues: Network ä Check your network capacity BEFORE adding any additional applications ä Baseline response times with Fathom ä Routed vs. switched networks ä Location of Progress files ä Program Libraries

Issues: Disk ä Storage capacity vs. throughput capacity ä Remember your RAID levels ä Location of data

Issues: Memory ä Memory acts as a buffer between the user processes and disk ä Use memory for the common good ä Increase broker memory first ä Increase client memory (-Bt, …) ä Then get creative

Issues: CPU ä Good CPU usage vs. Bad CPU Usage ä The –spin parameter ä Have a CPU problem? Look at your disks

Monitoring Performance ä Spot checks ä My Fathom Views ä Trend reporting ä Getting out of the Forest

Conclusion ä Start slow ä Remember your goals ä Resiliency ä Availability ä Performance ä Consider the cost/benefit before adding monitoring or trending to a resource

Questions