Modeling Retail a Major Telecom Company Predictive Analysis in a Multi-Tier Infrastructure John Slobodnik October 21, 2008 CMG Canada.

Slides:



Advertisements
Similar presentations
Tales from the Lab: Experiences and Methodology Demand Technology User Group December 5, 2005 Ellen Friedman SRM Associates, Ltd.
Advertisements

Testing Relational Database
Case Study: Photo.net March 20, What is photo.net? An online learning community for amateur and professional photographers 90,000 registered users.
Guide to data collection A look at why now… and why us.
10. 5: Model Solution Model Interpretation 10
This material is approved for public release. Distribution is limited by the Software Engineering Institute to attendees. Sponsored by the U.S. Department.
Pankaj Kumar Qinglan Zhang Sagar Davasam Sowjanya Puligadda Wei Liu
Software Delivery. Software Delivery Management  Managing Requirements and Changes  Managing Resources  Managing Configuration  Managing Defects 
DataBase Administration Scheduling jobs Backing up and restoring Performing basic defragmentation and index rebuilding Using alerts Archiving.
Enter Date in Title Master Workload Management HBC Case Study IRMAC, January 2008 Shelley Perrior -DBA team lead.
©202 BMC Software, Inc. All Rights Reserved. Server Consolidation Eric D. Ho Advisory Software Consultant BMC Software, Inc. March 20, 2002.
Performance Engineering Methodology Chapter 4. Performance Engineering Performance engineering analyzes the expected performance characteristics of a.
1 CSSE 477 – A bit more on Performance Steve Chenoweth Friday, 9/9/11 Week 1, Day 2 Right – Googling for “Performance” gets you everything from Lady Gaga.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 11: Monitoring Server Performance.
Chapter 14 Chapter 14: Server Monitoring and Optimization.
Measuring Performance Chapter 12 CSE807. Performance Measurement To assist in guaranteeing Service Level Agreements For capacity planning For troubleshooting.
1 Software Testing and Quality Assurance Lecture 40 – Software Quality Assurance.
MCTS Guide to Microsoft Windows Server 2008 Network Infrastructure Configuration Chapter 11 Managing and Monitoring a Windows Server 2008 Network.
Proper Care and Feeding of your SQL MDB -Recommendations for General MDB Maintenance -Read the notes on the foils! -Revised October
What Can You do With BTM? Business Transaction Management touches the following disciplines:  Performance Management  Application Management  Capacity.
NovaBACKUP 10 xSP Technical Training By: Nathan Fouarge
Load Test Planning Especially with HP LoadRunner >>>>>>>>>>>>>>>>>>>>>>
Copyright © 2007 Quest Software The Changing Role of SQL Server DBA’s Bryan Oliver SQL Server Domain Expert Quest Software.
November 2009 Network Disaster Recovery October 2014.
Chapter 17: Watching Your System BAI617. Chapter Topics Working With Event Viewer Performance Monitor Resource Monitor.
11 The Ultimate Upgrade Nicholas Garcia Bell Helicopter Textron.
Module 18 Monitoring SQL Server 2008 R2. Module Overview Monitoring Activity Capturing and Managing Performance Data Analyzing Collected Performance Data.
HBar OR Reader Documentation A copy of the PowerPoint Viewer is shipped with the HBar OR Reader on the HBar Official Records [OR] CD. The PowerPoint Viewer.
Introduction to HP LoadRunner Getting Familiar with LoadRunner >>>>>>>>>>>>>>>>>>>>>>
ACS1803 Lecture Outline 2 DATA MANAGEMENT CONCEPTS Text, Ch. 3 How do we store data (numeric and character records) in a computer so that we can optimize.
Introduction Optimizing Application Performance with Pinpoint Accuracy What every IT Executive, Administrator & Developer Needs to Know.
Performance Concepts Mark A. Magumba. Introduction Research done on 1058 correspondents in 2006 found that 75% OF them would not return to a website that.
August 01, 2008 Performance Modeling John Meisenbacher, MasterCard Worldwide.
1 Introduction to Database Systems. 2 Database and Database System / A database is a shared collection of logically related data designed to meet the.
Maintaining File Services. Shadow Copies of Shared Folders Automatically retains copies of files on a server from specific points in time Prevents administrators.
Copyrighted material John Tullis 10/6/2015 page 1 Performance: WebSphere Commerce John Tullis DePaul Instructor
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment, Enhanced Chapter 11: Monitoring Server Performance.
Software Performance Testing Based on Workload Characterization Elaine Weyuker Alberto Avritzer Joe Kondek Danielle Liu AT&T Labs.
Computer Emergency Notification System (CENS)
Maintaining Large Vista Installations Amy Edwards, Ezra Freelove, & George Hernandez July 12, 2007.
The ProactiveWatch Monitoring Service. Are These Problems For You? Your business gets disrupted when your IT environment has issues Your employee and.
Maintaining Large Vista Installations Amy Edwards, Ezra Freelove, & George Hernandez July 12, 2007.
1 All Powder Board and Ski Oracle 9i Workbook Chapter 9: Database Administration Jerry Post Copyright © 2003.
Downloading and Installing Autodesk Revit 2016
What is Sure Stats? Sure Stats is an add-on for SAP that provides Organizations with detailed Statistical Information about how their SAP system is being.
1 Challenges in Scaling E-Business Sites  Menascé and Almeida. All Rights Reserved. Daniel A. Menascé Department of Computer Science George Mason.
Graphing and statistics with Cacti AfNOG 11, Kigali/Rwanda.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment, Enhanced Chapter 11: Monitoring Server Performance.
Downloading and Installing Autodesk Inventor Professional 2015 This is a 4 step process 1.Register with the Autodesk Student Community 2.Downloading the.
Software Architecture in Practice Practical Exercise in Performance Engineering.
Chapter 13: LAN Maintenance. Documentation Document your LAN so that you have a record of equipment location and configuration. Documentation should include.
How We Use Our PerfCap Tool How Do You? CMG Canada Oct. 27, 2015 John Slobodnik Performance and Capacity Planner.
Module 12: Configuring and Managing Storage Technologies
Capacity Planning - Managing the hardware resources for your servers.
SPI NIGHTLIES Alex Hodgkins. SPI nightlies  Build and test various software projects each night  Provide a nightlies summary page that displays all.
FTS monitoring work WLCG service reliability workshop November 2007 Alexander Uzhinskiy Andrey Nechaevskiy.
If you have a transaction processing system, John Meisenbacher
AA202: Performance Enhancers for Laserfiche Connie Anderson, Technical Writer.
OPERATING SYSTEMS (OS) By the end of this lesson you will be able to explain: 1. What an OS is 2. The relationship between the OS & application programs.
1 Presented by: Val Pennell, Test Tool Manager Date: March 9, 2004 Software Testing Tools – Load Testing.
Software Architecture in Practice
Management Reporter Tips and Tricks.
2018 Real Dell EMC E Exam Questions Killtest
Automation in IMS Can it help the shrinking talent pool
Database hidden disasters…
Backup Monitoring – EMC NetWorker
Backup Monitoring – EMC NetWorker
PerformanceBridge Application Suite and Practice 2.0 IT Specifications
Fernando Jacinto Alvarez
Presentation transcript:

Modeling Retail a Major Telecom Company Predictive Analysis in a Multi-Tier Infrastructure John Slobodnik October 21, 2008 CMG Canada

Preparation for Modeling  Get an application infrastructure diagram.  Turn on Solaris Process Accounting.  Install TeamQuest Manager.  Install TeamQuest View.  Gather Key Performance Indicator.  Perform Workload Characterization  Perform predictive analysis using TeamQuest Model.

Infrastructure Diagram It is important to get this diagram to understand the infrastructure that this multi-tier application resides on. Typically, an application support team is responsible for keeping these diagrams up-to-date.

Infrastructure Diagram

Turn on Solaris Process Accounting Turn on Solaris Process accounting.  Minimal additional CPU overhead since the data is already collected.  Allows short-running tasks to be captured for workload characterization. Normally tasks <0.5 seconds get grouped. Certain applications with thousands of short tasks are prime candidates for this extra level of accuracy.

Install TeamQuest Manager Install TeamQuest Manager on at least one server from each tier of the application architecture. At least one agent was installed in each of 4 tiers Customize the TQ database on each server. Changed retention of 10 minute data to 2 weeks. Changed retention of 1 minute data to 1 week. Deactivated reductions. Requires Process Accounting turned on. Keep process information for 7 days. Created a silent install script to install the agent and customize the database. Create a script to customize the database (using tqdbu) with the settings specified in the previous bullet. Record the silent install script Syntax “install.sh –r silentinstallscriptnamehere tqmgr”

Install TeamQuest Manager Create a specifications file backup for each TQ database daily. Makes rebuilding the DB, in case of disaster, easier. The command to create a specifications file called “productionDBspec” is: teamquesthomedirectory/bin/tqdbu –o productionDBspec The command to use the specifications file to recreate a new database is: teamquesthomedirectory/bin/tqdbu –c productionDBspec Put disk free space monitoring in place. With process accounting on a lot of data was gathered on our Oracle server. There was barely enough space to keep a week’s worth of data in the existing filesystem. Alerts us when there is <20% free space in the filesystem used by the TQ DB.

Customize TQ Database

Install TeamQuest View TQ View was used to ensure consistent performance across each server. –This tells us that the workload is consistent and reliable to use for modeling. –Data for whole week was analyzed to come up with the best time frame to use for modeling.

TeamQuest View Data Analysis

Gather Key Performance Indicator We asked the business what their key performance indicator (or main business driver metric) was. They were tracking these sales numbers by hour in an Oracle database. – Using a customized SQL query. – Which you can turn into a “custom metric” and create historical reports against.

Gather Key Performance Indicator

Workload Characterization Purpose: To uniquely identify application-related work that runs on each server. A pre-requisite for modeling. Used TeamQuest View to list all processes that run on each server. Identified processes into unique workloads. –This is the most labor-intensive part of the whole exercise (can take days or weeks depending upon level of co-operation). Requires co-operation of the application experts to help identify processes which belong to their application. –Try to keep the number of workloads to as small a number as possible. Our goal was to create 2 workloads per server, one for the application- related work and OTHER. Define the workload definitions using TeamQuest Manager. On each server we created a new “Workload Set” containing a new “Workload” definition which uniquely identifies application-related activity. –Left the default “Example” workload set alone. “Login =“ uniquely identified application-related work on our Web Services, authentication, WebLogic, and Oracle servers.

Workload Characterization

Using TeamQuest Model The most important decision to make for modeling is “What timeframe do I use to base my model upon?”. The answer varies upon the peak usage time of the application from both a system resource and business sales perspective. I use a combination of busiest CPU, I/O and sales to come up with the timeframe to use. This has worked successfully for me using a 1-hour timeframe to base my modelling upon (5 hour timeframe as well). Stay away from “problem” times. Then we apply a growth percentage to that timeframe which equates to what the business said the estimated peak volume would be at their busiest time of year. We frame the growth % (LT & GT 50%). If the model did not show any weakness in the infrastructure at 50% growth we created another model with enough growth applied to find a weakness.

Using TeamQuest Model Outcome: We have successfully identified the need for an additional Oracle node in the infrastructure. Other outcomes have been: Your infrastructure is sufficient to make it through this years peak period, however, once the growth from the current state hits 300% then the Web Services tier will be the bottleneck, addition of 2 additional servers of the same build is recommended prior to that time.

Select data to build the Model Select “Generate Input File” servername

Select data to build the Model Fill out time and date and click “Next” servername

Select data to build the Model Confirm Workload Set, click “Next” servername

Select data to build the Model Click “Create Model Input File” servername

Select data to build the Model “Save” the file servername

Select data to build the Model Choose a filename then save.

Select data to build the Model Confirmation servername

TQ Model - Assumptions TeamQuest was not installed on all the systems in the environment, so in absence of that data we assume : External webservers – The 4 Sun servers are load balanced. WebLogic tier – The 3 Sun servers are load balanced. The 2 Sun WebLogic instances performs twice the work as a single WebLogic instance on the larger Sun server. Applications such as iPlanet, WebLogic, and Oracle are well instrumented. The orders are coming from the External Webserver.

TQ Model - Findings, Recommendations & Results Findings for multi-tier application environment: The number of orders on mm/dd/yyyy from noon until 5 pm was n. At 300% growth or nn orders from noon till 5 pm, the CPU in the UNIX web services iPlanet tier is maxed and the response time is significantly higher than for n orders, i.e % higher. Recommendations: Add 2 additional nodes to the external web tier Plan to add the additional servers in Results: TeamQuest time spent on Model = less than 2 hours

TQ View – CPU Utilization CPU utilization of all the systems: One day does not stand out as looking any different than any other day for CPU & I/O. So, we chose the afternoon mm/dd/yyyy, 12:00- 17:00. We divided the work between application & non-application (workloads).

TeamQuest Model – Systems/Tier

TeamQuest Model –Response Time with 300% growth applied

TQ Model – CPU Utilization by Workload

Active Resource utilization on web tier

Active Resource Utilization on Web tier

Components of Response–3 DB nodes

What if we add 2 servers to the external web server tier?

What if we model the external web server on its own?

Frequency of Modeling During peak time of year for the application and 6 months later (at a minimum). Prior to and after any major hardware changes to the infrastructure. After any major software changes to the infrastructure. This can be changes to the application code. Can also be vendor software version change. New version of WebLogic. New OS level. Latest version of Oracle These happen more frequently, it is not realistic (in my life) to re-do the exercise monthly.

John Slobodnik Performance & Capacity Planner Infrastructure & Technology CGI (905) ext Mobile: (416)