© Lindsay Bradford1 Scaling Dynamic Web Content Provision Using Elapsed-Time- Based Content Degradation Lindsay Bradford, Stephen Milliner and Marlon Dumas Contact: Centre for Information Technology Innovation, Queensland University of Technology WISE 2004
© Lindsay Bradford2 Overview Motivation Background Our Content Degradation Prototype Experiments and Results Ongoing and Future Work
© Lindsay Bradford3 Motivation: Scalability Matters Users expect “service on demand” from the Web - Bhatti et.al Dynamic web content: On the increase – Barford et.al Much harder to scale than static content – Stading et.al Consider: Web Services (SOAP, WSDL, etc), easier to build clients with much smaller “think times”, greater strain on scalability.
© Lindsay Bradford4 Scaling Dynamic Web Content Our focus is on dynamic web content scalability at a single server: Desire: Minimise TOC (Total Cost of Ownership). Single-Server Scalability = Bottleneck Reduction. Resource Bottleneck for: Static Web Content (HTML files, etc) = Bandwidth Dynamic Web Content (JSP, ASP, PHP, etc) = CPU, Stading, Padmanablan, Challenger. User-perceived QoS ≈ elapsed-time, not CPU.
© Lindsay Bradford5 Background: Dynamic Content Degradation: Deliver degraded version of content under load. Aim: deliver “acceptable” version to an end user that is cheaper to deliver than original. Static web images – Abdelzaher and Bhatti, Multimedia - Chandra et al. Degrading content via a distributed, tiered network - Chen and Iyengar. As opposed to: Dynamic Web Content Caching: Has limited applicability. Resource Management: Can deny access to unlucky majority.
© Lindsay Bradford6 Example:
© Lindsay Bradford7 Guiding Heuristics: Pick content that will respond within human acceptable elapsed-time. (< 1 second, based on HCI research - Ramsay et.al, Bhatti et.al, Bouch et.al) Prefer more costly content to less costly where possible. Balance content generation time against target response time. Deliberately limit scope to “Application Programmer” perspective. No modification of supporting technologies (App Servers, etc). What could developers do right now? What limits exist?
© Lindsay Bradford8 Our Content Degradation Prototype: A number of approaches to generating same web content (same URI). An Approach Selector to pick which approach is most appropriate. Two parameters: lower-time-limit and upper-time-limit. Parameters act as elapsed-time thresholds. Cross a threshold, choose a different approach. Slowest content generating approach called a baseline.
© Lindsay Bradford9 Our Prototype(2): Contains memory of last n (20 for experiments) elapsed-times for response generation per approach. Longer the memory, the more pessimistic selector becomes about the future. “Current worst elapsed time” of an approach crossing one of the time-limit bounds triggers approach change.
© Lindsay Bradford10 Our Prototype (3): Approach Selector Design
© Lindsay Bradford11 Approach Selector Implementation: Unmodified Apache Tomcat Approach Selector as a ``ServletFilter’’ Approaches as Servlets: 4 instances of a floating-point division servlet, configured to 100, 500, 1000 and 3000 loops. 4 instances of a memory-intensive lookup servlet with same loop numbers as above. 3000 loop servlets are baseline approaches.
© Lindsay Bradford12 The Test Traffic Patterns: Response is adequate if <= 1 second elapsed-time between request sent & response received at client. –Steady – Server under constant load –Periodic – Server alternates between 1 minute of load and 1 minute of no requests.
© Lindsay Bradford13 Periodic Pattern Latency:
© Lindsay Bradford14 Periodic Pattern Throughput:
© Lindsay Bradford15 Steady Pattern Latency:
© Lindsay Bradford16 Steady Pattern Throughput:
© Lindsay Bradford17 Periodic Limit Variance: Latency
© Lindsay Bradford18 Periodic Limit Variance: Throughput
© Lindsay Bradford19 Steady Limit Variance: Latency
© Lindsay Bradford20 Steady Limit Variance: Throughput
© Lindsay Bradford21 Results using Approach Selector: Benefit of Approach Selector outweighs overhead. Significant scalability recorded against our one second (user expected) response time target. Approach Selector Parameters: lower-time-limit matters. Algorithm is far less sensitive to upper-time-limit variations. Must pessimistically configure the Approach Selector to partially compensate for the elapsed- time missed by the Approach Selector.
© Lindsay Bradford22 Future Work: Approach Selector New I/O (Database simulation) servlet Altering the Approach Selection Heuristic Automatic lower-time-limit variance. Automatic variance of remembered response times. Guidelines for automated service adaptation to request traffic.
© Lindsay Bradford23 Future Work: Architecture Want to answer : “How can we minimise the overhead of the application development process WRT content degradation?” Avoid JIC (Just in case) degradation. Explore JIT (Just in time). Declarative approach, mark-up code that degrades with alternatives Manual, fine-grained behaviour alteration at run-time (exploring now – adaptable architecture based on generative communications). Servlet engine architecture (and specification) too limiting. We desire a more adaptive architecture: Ability to dynamically alter supporting architecture and its configuration (thread limits, etc) Ability to manually modify behaviour as load changes. Individual (running) object replacement; objects interacting that know nothing of each other.
© Lindsay Bradford24 Finish. Questions? Comments?