Monitoring Latency Sensitive Enterprise Applications on the Cloud Shankar Narayanan Ashiwan Sivakumar.

Slides:



Advertisements
Similar presentations
MQ Series Cross Platform Dominant Messaging sw – 70% of market Messaging API same on all platforms Guaranteed one-time delivery Two-Phase Commit Wide EAI.
Advertisements

Implementing Tableau Server in an Enterprise Environment
Complete Event Log Viewing, Monitoring and Management.
Windows IT Pro magazine Datacenter solution with lower infrastructure costs and OPEX savings from increased operational efficiencies. Datacenter.
System Center 2012 R2 Overview
Multi-Mode Survey Management An Approach to Addressing its Challenges
Futures – Alpha Cloud Deployment and Application Management.
Mohammad Hajjat Purdue University Joint work with: Shankar P N (Purdue), David Maltz (Microsoft), Sanjay Rao (Purdue) and Kunwadee Sripanidkulchai (NECTEC.
Closer to the Cloud - A Case for Emulating Cloud Dynamics by Controlling the Environment Ashiwan Sivakumar Shankaranarayanan P N Sanjay Rao School of Electrical.
Dynamically Scaling Applications in the Cloud Presented by Paul.
Microsoft Ignite /16/2017 2:42 PM
Nikolay Tomitov Technical Trainer SoftAcad.bg.  What are Amazon Web services (AWS) ?  What’s cool when developing with AWS ?  Architecture of AWS 
Building Offline/Cache Mode Web Apps Using Sync Framework Mike Clark Group Manager Cloud Data Services Team
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
Introduction to the Enterprise Library. Sounds familiar? Writing a component to encapsulate data access Building a component that allows you to log errors.
Cloud Computing. What is Cloud Computing? Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable.
1 NETE4631 Using Google Web Services and Using Microsoft Cloud Services Lecture Notes #7.
Mostafa Abdollahi Mazandaran University Of Science And Technology January 2011.
Windows Azure Conference 2014 Deploy your Java workloads on Windows Azure.
A Framework for Elastic Execution of Existing MPI Programs Aarthi Raveendran Tekin Bicer Gagan Agrawal 1.
1 The Fast(est) Path to Building a Private/Hybrid Cloud October 25th, 2011 Paul Mourani RightScale.
A Framework for Elastic Execution of Existing MPI Programs Aarthi Raveendran Graduate Student Department Of CSE 1.
From Virtualization Management to Private Cloud with SCVMM 2012 Dan Stolts Sr. IT Pro Evangelist Microsoft Corporation
Managing the Oracle Application Server with Oracle Enterprise Manager 10g.
Introduction to the Adapter Server Rob Mace June, 2008.
Microsoft Management Seminar Series SMS 2003 Change Management.
Windows Azure Virtual Machines Anton Boyko. A Continuous Offering From Private to Public Cloud.
North America Europe Asia Pacific Data centers.
ECS. Overview Features Architecture Benefits Reports.
Zvezdan Pavković. Storage Non-Persistent Storage Persistent Storage Easily add additional storage. Networking Internal and Input Endpoints configured.
Cloud Computing is a Nebulous Subject Or how I learned to love VDF on Amazon.
 Mike Martin  Architect  MEET Member  Crew Member of Azug  Windows Azure Insider  Windows Azure MVP  
Performance Testing Test Complete. Performance testing and its sub categories Performance testing is performed, to determine how fast some aspect of a.
+ Logentries Is a Real-Time Log Analytics Service for Aggregating, Analyzing, and Alerting on Log Data from Microsoft Azure Apps and Systems MICROSOFT.
General requirements for BES III offline & EF selection software Weidong Li.
Technology Drill Down: Windows Azure Platform Eric Nelson | ISV Application Architect | Microsoft UK |
EJB Enterprise Java Beans JAVA Enterprise Edition
(re)-Architecting cloud applications on the windows Azure platform CLAEYS Kurt Technology Solution Professional Microsoft EMEA.
Building web applications with the Windows Azure Platform Ido Flatow | Senior Architect | Sela | This session.
Windows Azure and iOS Chris Risner Windows Azure Technical Evangelist Microsoft
The best of WF 4.0 and AppFabric Damir Dobric MVP-Connected System Developer Microsoft Connected System Division Advisor Visual Studio Inner Circle member.
 Cloud Computing technology basics Platform Evolution Advantages  Microsoft Windows Azure technology basics Windows Azure – A Lap around the platform.
WINDOWS AZURE AND THE HYBRID CLOUD. Hybrid Concepts and Cloud Services.
Cofax Scalability Document Version Scaling Cofax in General The scalability of Cofax is directly related to the system software, hardware and network.
Implement Storage Implement Blobs and Azure Files Manage Access Configure Diagnostics, Monitoring & Analytics Implement SQL Databases Implement Recovery.
Fault – Tolerant Distributed Multimedia Streaming Web Application By Nirvan Sagar – Srishti Ganjoo – Syed Shahbaaz Safir
Energy Management Solution
IT06 – HAVE YOUR OWN DYNAMICS NAV TEST ENVIRONMENT IN 90 MINUTES
Business Continuity & Disaster Recovery
Scalable Web Apps Target this solution to brand leaders responsible for customer engagement and roll-out of global marketing campaigns. Implement scenarios.
Platform as a Service.
Couchbase Server is a NoSQL Database with a SQL-Based Query Language
Nimble Streamer Helps Media Content Providers Create Streaming Networks Cost-Effectively and Easily by Utilizing Azure’s Worldwide Scalability MICROSOFT.
Energy Management Solution
Scalable Web Apps Target this solution to brand leaders responsible for customer engagement and roll-out of global marketing campaigns. Implement scenarios.
Exploring Azure Event Grid
Business Continuity & Disaster Recovery
Amazon AWS Solution Architect Associate Exam Dumps For Full Exam Info Visit This Link:
Rocky Mountain CMG Spring? ‘09 Forum
Logsign All-In-One Security Information and Event Management (SIEM) Solution Built on Azure Improves Security & Business Continuity MICROSOFT AZURE APP.
Data Security for Microsoft Azure
Outline Virtualization Cloud Computing Microsoft Azure Platform
AWS Cloud Computing Masaki.
Saranya Sriram Developer Evangelist | Microsoft
Technical Capabilities
5 Azure Services Every .NET Developer Needs to Know
Windows Azure Hybrid Architectures and Patterns
ZORAN BARAC DATA ARCHITECT at CIN7
Microsoft Azure Services Platform
06 | SQL Server and the Cloud
Presentation transcript:

Monitoring Latency Sensitive Enterprise Applications on the Cloud Shankar Narayanan Ashiwan Sivakumar

Enterprise Applications (EA) Stock Trader Benchmark Application 2 Data Base (DB) Business Service (BS)Front End (FE) Configuration Service (CS) Order Processing Service (OS)

EA as Services 3 FE Users FE BS OS DB Load Balancers Service Endpoints

EA Characteristics 4 Notice: Dynamic and distributed nature of cloud deployments. Reducing user observed latency is the goal – Monitor this ! EA propertyRelevant cloud characteristic ScalabilityDynamic deployment sizes Availabilitygeo-redundancy EconomicsPay-as-you-use ElasticityDecoupled services Low latencyDeploy closer to user groups UtilizationLoad balancing

Performance Variation: Time Series and CDF of DB Latency 5 - data snapshot worth 4 hours across both the days

Monitoring Framework – Design Goals  Resilience: Less sensitive to cloud variability  Scalability: Capable of scaling with component instances  Portability: Easy to integrate with applications  Flexibility: Multiple levels of measurement  User level latency  Component level isolation  Efficiency: Fast and accurate measurements 6

7 Why is Monitoring Hard  Dynamic environment – number of components change  Distributed deployment - needs a collection framework  Variable request path – different choice of components Existing monitoring tools Do not support service oriented architectures Too detailed Not scalable Remember: user observed latency is our goal Abstract away un-necessary details !

Measuring End-points – Existing Tools FE BSDB Users HTTP Request SOAP Response HTTP Response MySQL Replies 8 Aggregate !!

Measurement Model T i,i+1 C i + 1 C i + 2 C i T i-1,i T i,i+1 T i+1,i+2 T’ i+1,i+2 T i+1,i+2 T i,i+2 T’ i,i+2 T’’ i,i+2 T i,i+2 T’ i,i+1 T i,i+1 T i+1,i+2 T’’’ i,i+2 T i,i+2 T’’’’ i,i+2 T i,i+2  CL i = Component latency of i th component  LL i,i+1 = Link latency across components i, i+1  N = No of components C i communicates with  nj = No of calls made by C i to each of the j components 9

Notification Q Instrumented application component Log server (local) Raw log Storage (local) Global collector Instrumented application component Log server (local) Raw log Storage (local) Aggregated log Monitoring Framework Architecture 10

Outline 11 Monitoring tool – Collection framework – Instrumentation framework

The Collection Framework  Each component writes to local storage  Front-end sends “done” message to local queue  Queues: decouple producer, consumer entities  Storage: persistence, no limit on size  Both: scalable, robust 12 Question: Why this a right model ? When in doubt, measure!

Alternative Model 13  All components write to queue  Collection framework de-queues  Forms a P2P network to collate the data

Experiments on Azure and EC2 Experiments evaluating performance of storage and queues. Real cloud deployments (Microsoft Azure, Amazon AWS) Extensive measurements from all data-centers US (East/West/North/South) Europe (West/Central) Asia (East/South East) 14

Performance of Storage and Queues 15 Microsoft AzureAmazon AWS Measurements made in all 12 datacenter regions (Azure and AWS) Experiment length (24 – 26 hours) Approx 100,000 requests to storage 16,000 requests to the queues Write Q Read Q Write Q Write Store

Outline 16 Monitoring tool – Collection framework – Instrumentation framework

17 Instrumentation Framework - Goals Minimize coding effort and intervention Measure latency at the granularity of user request Automate instrumentation as much as possible Generate minimal measurement parameters

Comparison of Existing Tools 18

Instrumentation Framework Instrumented Application Component Original Application Component Aspects Specification for the application end- points (X-trace: log events) Measurement metric specification (X-trace: meta-data) Log Format specifications 19

Experiment Set-up 20 Deployed two similar benchmark applications DayTrader - Amazon AWS StockTrader - Windows Azure (prior work) Deployed the collection framework on AWS and Azure. User sessions and request patterns from DaCapo benchmark suite. Instrumentation: Automated using aspects – DayTrader (AWS) Custom coded - DayTrader and StockTrader

Aggregation Benefit: DayTrader 21 User request type Storage writes without aggregation Storage writes with aggregation FEBSFEBS Login3511 Portfolio10 11 Update profile4511 Home2211 Buy1711 Sell1811 Account3311 Total User sessions : 20, 1 every 10 seconds Results shown for a random user from DaCapo 78% writes reduced in above case transactions benefits

Aggregation Benefit: MedRec Application Suite 22 ApplicationStorage writes without aggregation Storage writes with aggregation FEBSFEBS MedRec App4811 Physician App81511 Admin App2511 Storage writes reduced by at least 50% from FE, 80% from BS

Instrumentation Benefit 23 Category Code (# of files) Handcrafted Code (# of files) X-Trace with Aspect same15250 (88)15250 (92) modified593 (74)465 (70) added878 (0)166 (2) automatable0 (0)166 (2) FE component code : automatable using aspects with x-trace Cross component calls : x-trace object passed as parameter New lines of code reduced by ~80% SLOC reduced by ~20% Aspects can be automated

Future Work 24 Scaling the framework Application scale to Framework scale ratio Per Datacenter ? Per VM ? Varies per cloud provider ? Impact of these design decisions on the sensitivity of the framework

Conclusions 25 Architectural benefits: Generic across - application, # of components, access patterns Scalable – decoupled entities Aggregation benefits: N writes to storage becomes one write Log server offloads work from application Instrumentation benefits: Easy to integrate with application New lines of code reduced by ~80% SLOC reduced by ~20%

26 Q & A

Back up slides 27

Azure Blob Read and Write Latency Blob read-write at least msec 28

Azure Queue Read and Write Latency Queue read costly, write comparable to blob 29

30 SQL Azure Performance Issue Snapshot (6 Days)