Download presentation
Presentation is loading. Please wait.
Published bySilke de Croon Modified over 6 years ago
1
Admission Control and Request Scheduling in E-Commerce Web Sites
Sameh Elnikety, EPFL Erich Nahum, IBM Watson John Tracey, IBM Watson Willy Zwaenepoel, EPFL
2
Dynamic Content Is Important
1 2 3
3
Generating Dynamic Content
Web Server Dynamic Content Generator Database Server http Several components Examples of request types
4
Objective: Stable Performance
Ideal Throughput Actual Underload Saturation overload Sustain peak throughput Load Prevent overload (admission control) Sustain peak throughput in overload region
5
Objective: Improve Response Time
Excess requests are queued Improve response time (request scheduling) Sustain peak throughput
6
Main Idea: Gatekeeper Proxy
Web Server Dynamic Content Generator Gate Keeper Database Server http Transparent Intercept requests to the bottleneck Maintains measurement-based estimates Performs: Admission control Request scheduling Transparently control how requests are admitted (non-invasive)
7
Admission Control Sustain peak throughput during overload
Amount of work required by each request Capacity of the system
8
Estimating Work by Request Type
Web Server Dynamic Content Generator Gate Keeper Database Server http Key Observations: Finite number of request types Requests of same type take same execution time Different requests differ greatly in execution time Online measurements Gatekeeper maintains per-request estimates
9
Service Time Distributions
To drive these points home let us look at this graph Request type is key! Online measurement
10
Estimating System Capacity
Request load = Execution time (work units of required) Database capacity = max # work units before overload To determine system capacity Binary search Offline Unit approximates resource usage Oblivious to the bottleneck resource
11
Admission Control - Example
req type units R1 5 R2 500 R3 300 … R2 R3 700 1 R1 6 Table: maintain for each type, online estimates of service time (moving average) R1 R2 R3 695 2 R3 195 3 R1 R2 R3 200 4 R2
12
Request Scheduling - Example
(0+500) + (500+10) = 505 (0+10) + (10+500) = 260 10 500 500 10
13
Request Scheduling Reduce average response time
Use shortest job first (SJF) policy Reorder requests in admission queue No preemption
14
Large Variability: TPC-W Requests
95% require < 1000 ms Scheduling has high impact Large variability in many internet workloads
15
Request Scheduling: Aging
Prevent starvation Limit the delay due to scheduling Limit is X times “expected service time” 1 1 1 Big 1 Some graphics in here delay due to sched.
16
Outline Motivation & Background The Gatekeeper Proxy
Experimental Environment Software & Hardware Metrics & Methodology Results Summary and Conclusions
17
TPC-W Benchmark Transaction Processing Council (TPC-W)
Workload generator Models a large e-commerce site: Online bookstore Searching, browsing, buying, registration, … Persistent data Static images on web server All others on back-end database
18
TPC-W Benchmark - Snapshot
Image Promotion (ad) Shopping Cart Next Interaction
19
TPC-W: Interactions 14 interactions, e.g.: Scale
Home (read-only query) Best sellers (complex) Secure payment (ssl) Shopping cart (update query) Scale 10,000 items 288,000 clients 350 MB database (fits in main memory) 183 MB images (in file system of web server)
20
Software Web Server Dynamic Content Generator Gate Keeper Database
http GateKeeper Implemented in Java (JDBC driver) Web Server Apache App Server Jakarta Tomcat 3.2.4 Database MySQL OS RedHat 7.2 Linux
21
Hardware Apache Tomcat MySQL http sql CPU AMD Athlon 1.33 GHz Memory
GateKeeper MySQL http sql CPU AMD Athlon 1.33 GHz Memory 768 MB Disk 60 GB, 12 msec, 5400 rpm Network 100 Mbps Ethernet
22
Emulated Clients Emulated Clients Apache Tomcat MySQL http sql
GateKeeper MySQL http sql Client emulator Session duration Think time Markov model Load is a function of the number of clients
23
Experiments Performance Metrics Methodology
Throughput (interactions/minute) Response time (msec, submission to completion) Examine each as a function of load (# of clients) Methodology Average of 5 runs 100 second warm-up 600 second measurement
24
Admission Control - Throughput
1 db processes: 49 used mem: 275 MB 2 db processes: 233 used mem: 450 MB 3 db processes: 345 used mem: 509 MB
25
Request Scheduling - Response Time
26
Request Scheduling - Analysis
Response time = Waiting time + Execution (service) time Large variability Many short requests Few very large requests
27
Request Scheduling - Explanation
Big effect on waiting times Average job 9000 200 430
28
Request Scheduling - Explanation
There are many small requests, whose waiting time is affected greatly Few large requests, big effect, but relatively small Net result is … Short job Average job Long job 8000 100 9000 200 13000 16000 400 430 4800
29
Request Scheduling - Fairness
Response time = Waiting time + Execution (service) time Fairness trade-off FIFO Fair: all wait for same amount of time SJF Unfair: favors short requests Better average response time Control waiting time Trade-off
30
Aging: Prevent Starvation
1 3 5 ∞
31
In The Paper More results Related work Different bottleneck
(database lock contention) Online vs. offline measurements DB2 Related work Most other methods are invasive
32
Summary Presented the Gatekeeper proxy Admission control
Transparent (non-invasive) Intercept requests Online measurements Admission control Consistent performance during overload Improves throughput 10 % Request scheduling using SJF Improves response time 14 times Penalizes long jobs only 13 % Aging controls penalty
33
Thank You!
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.