Database Replication Policies for Dynamic Content Applications Gokul Soundararajan, Cristiana Amza, Ashvin Goel University of Toronto EuroSys 2006: Leuven,

Slides:



Advertisements
Similar presentations
Full-System Timing-First Simulation Carl J. Mauer Mark D. Hill and David A. Wood Computer Sciences Department University of Wisconsin—Madison.
Advertisements

Exploiting Distributed Version Concurrency in a Transactional Memory Cluster Kaloian Manassiev, Madalin Mihailescu and Cristiana Amza University of Toronto,
The State of the Art in Distributed Query Processing by Donald Kossmann Presented by Chris Gianfrancesco.
1 Cheriton School of Computer Science 2 Department of Computer Science RemusDB: Transparent High Availability for Database Systems Umar Farooq Minhas 1,
1 HYRISE – A Main Memory Hybrid Storage Engine By: Martin Grund, Jens Krüger, Hasso Plattner, Alexander Zeier, Philippe Cudre-Mauroux, Samuel Madden, VLDB.
Predicting Replicated Database Scalability Sameh Elnikety, Microsoft Research Steven Dropsho, Google Inc. Emmanuel Cecchet, Univ. of Mass. Willy Zwaenepoel,
Towards High-Availability for IP Telephony using Virtual Machines Devdutt Patnaik, Ashish Bijlani and Vishal K Singh.
Handling Web Hotspots at Dynamic Content Web Sites Using DotSlash Weibin Zhao Henning Schulzrinne Columbia University NYMAN’04.
Handling Web Hotspots at Dynamic Content Web Sites Using DotSlash Weibin Zhao Henning Schulzrinne Columbia University Dagstuhl.
Overview  Strong consistency  Traditional approach  Proposed approach  Implementation  Experiments 2.
Yaksha: A Self-Tuning Controller for Managing the Performance of 3-Tiered Web Sites Abhinav Kamra, Vishal Misra CS Department Columbia University Erich.
DotSlash: Providing Dynamic Scalability to Web Applications Weibin Zhao and Henning Schulzrinne Department of Computer Science, Columbia University More.
CMU SCS Carnegie Mellon Univ. Dept. of Computer Science /615 - DB Applications C. Faloutsos – A. Pavlo How to Scale a Database System.
Towards Autonomic Hosting of Multi-tier Internet Services Swaminathan Sivasubramanian, Guillaume Pierre and Maarten van Steen Vrije Universiteit, Amsterdam,
SIP Routing Proxy Integrated Routing solution. General functionality  The Routing Proxy is based on Session Initial Protocol(SIP) and is acting as a.
Computer Science Cataclysm: Policing Extreme Overloads in Internet Applications Bhuvan Urgaonkar and Prashant Shenoy University of Massachusetts.
Frangipani: A Scalable Distributed File System C. A. Thekkath, T. Mann, and E. K. Lee Systems Research Center Digital Equipment Corporation.
PNUTS: YAHOO!’S HOSTED DATA SERVING PLATFORM FENGLI ZHANG.
Optimizing Cloud Resources for Delivering IPTV Services Through Virtualization.
Self-Adaptive QoS Guarantees and Optimization in Clouds Jim (Zhanwen) Li (Carleton University) Murray Woodside (Carleton University) John Chinneck (Carleton.
AUTONOMOUS RESOURCE PROVISIONING FOR MULTI-SERVICE WEB APPLICATIONS Jiang Dejun,Guillaume Pierre,Chi-Hung Chi WWW '10 Proceedings of the 19th international.
Introduction to HP LoadRunner Getting Familiar with LoadRunner >>>>>>>>>>>>>>>>>>>>>>
USING HADOOP & HBASE TO BUILD CONTENT RELEVANCE & PERSONALIZATION Tools to build your big data application Ameya Kanitkar.
JOnAS developer workshop – /02/2004 status Emmanuel Cecchet
Dynamic Resource Allocation Using Virtual Machines for Cloud Computing Environment.
Paradigms & Benchmarks Ryan McCune CSE Final Presentation 11/3/11 Notre Dame Computer Science 1.
Improving Network I/O Virtualization for Cloud Computing.
AUTHORS: STIJN POLFLIET ET. AL. BY: ALI NIKRAVESH Studying Hardware and Software Trade-Offs for a Real-Life Web 2.0 Workload.
IISWC 2007 Panel Benchmarking in the Web 2.0 Era Prashant Shenoy UMass Amherst.
Autonomic SLA-driven Provisioning for Cloud Applications Nicolas Bonvin, Thanasis Papaioannou, Karl Aberer Presented by Ismail Alan.
Alireza Angabini Advanced DB class Dr. M.Rahgozar Fall 88.
Scaling Dynamic Content Applications through Data Replication - Opportunities for Compiler Optimizations Cristiana Amza UofT.
1 Specification and Implementation of Dynamic Web Site Benchmarks Sameh Elnikety Department of Computer Science Rice University.
Evaluating FERMI features for Data Mining Applications Masters Thesis Presentation Sinduja Muralidharan Advised by: Dr. Gagan Agrawal.
Applications Web et bases de données en grappe Séminaire InTech 3 Février 2005 – Grenoble.
Usenix Annual Conference, Freenix track – June 2004 – 1 : Flexible Database Clustering Middleware Emmanuel Cecchet – INRIA Julie Marguerite.
Architecture for Caching Responses with Multiple Dynamic Dependencies in Multi-Tier Data- Centers over InfiniBand S. Narravula, P. Balaji, K. Vaidyanathan,
THE LITTLE ENGINE(S) THAT COULD: SCALING ONLINE SOCIAL NETWORKS B 圖資三 謝宗昊.
DotSlash: Handling Web Hotspots at Dynamic Content Web Sites Weibin Zhao Henning Schulzrinne Department of Computer Science Columbia.
A dynamic optimization model for power and performance management of virtualized clusters Vinicius Petrucci, Orlando Loques Univ. Federal Fluminense Niteroi,
Database Replication in Tashkent CSEP 545 Transaction Processing Sameh Elnikety.
Fast Crash Recovery in RAMCloud. Motivation The role of DRAM has been increasing – Facebook used 150TB of DRAM For 200TB of disk storage However, there.
Towards Dynamic Green-Sizing for Database Servers Mustafa Korkmaz, Alexey Karyakin, Martin Karsten, Kenneth Salem University of Waterloo.
Database replication policies for dynamic content applications Gokul Soundararajan, Cristiana Amza, Ashvin Goel University of Toronto Presented by Ahmed.
1 Admission Control and Request Scheduling in E-Commerce Web Sites Sameh Elnikety, EPFL Erich Nahum, IBM Watson John Tracey, IBM Watson Willy Zwaenepoel,
GFS. Google r Servers are a mix of commodity machines and machines specifically designed for Google m Not necessarily the fastest m Purchases are based.
Replication (1). Topics r Why Replication? r System Model r Consistency Models – How do we reason about the consistency of the “global state”? m Data-centric.
Highly available database clusters with JDBC
HADOOP DISTRIBUTED FILE SYSTEM HDFS Reliability Based on “The Hadoop Distributed File System” K. Shvachko et al., MSST 2010 Michael Tsitrin 26/05/13.
DotSlash – or how to deal with 15 minutes of fame Weibin Zhao Henning Schulzrinne Columbia University CATT/WICAT Annual Research Review November 14, 2003.
DynamicMR: A Dynamic Slot Allocation Optimization Framework for MapReduce Clusters Nanyang Technological University Shanjiang Tang, Bu-Sung Lee, Bingsheng.
1 Agility in Virtualized Utility Computing Hangwei Qian, Elliot Miller, Wei Zhang Michael Rabinovich, Craig E. Wills {EECS Department, Case Western Reserve.
Dynamo: Amazon’s Highly Available Key-value Store DAAS – Database as a service.
Technical Reading Report Virtual Power: Coordinated Power Management in Virtualized Enterprise Environment Paper by: Ripal Nathuji & Karsten Schwan from.
An Efficient Threading Model to Boost Server Performance Anupam Chanda.
A Comparative Evaluation of Transparent Scaling Techniques for Dynamic Content Servers Presented by Chen Zhang Written by C. Amza, A. L. Cox,
EuroSys Doctoral Workshop 2011 Resource Provisioning of Web Applications in Heterogeneous Cloud Jiang Dejun Supervisor: Guillaume Pierre
Load Rebalancing for Distributed File Systems in Clouds.
E-Store: Fine-Grained Elastic Partitioning for Distributed Transaction Processing Systems Jihui Yang CS525 Advanced Distributed System March 1, 2016.
Configuring SQL Server for a successful SharePoint Server Deployment Haaron Gonzalez Solution Architect & Consultant Microsoft MVP SharePoint Server
Cofax Scalability Document Version Scaling Cofax in General The scalability of Cofax is directly related to the system software, hardware and network.
Abhinav Kamra, Vishal Misra CS Department Columbia University
Lead SQL BankofAmerica Blog: SQLHarry.com
Choosing a MySQL Replication & High Availability Solution
Frequency Governors for Cloud Database OLTP Workloads
Efficient and Transparent Dynamic Content Updates for Mobile Clients
Virtualization Techniques
DotSlash: An Automated Web Hotspot Rescue System
Admission Control and Request Scheduling in E-Commerce Web Sites
Benchmarking Cloud Serving Systems with YCSB
Presentation transcript:

Database Replication Policies for Dynamic Content Applications Gokul Soundararajan, Cristiana Amza, Ashvin Goel University of Toronto EuroSys 2006: Leuven, Belgium April 19, 2006

2 Dynamic Content Web Server

3

4 Today’s Server Farms Data centers can run multiple applications  E.g., IBM/HP Service providers can multiplex resources  E.g., applications have peaks at different times Challenge: database server becomes the bottleneck

5 Motivation Scale the database backend on clusters  Handle more clients  Run multiple applications  Handle failures in the backend Our approach:  Database replication  Dynamic replica allocation Adapt to changing load or failures

6 Database Replication Read-one, write-all Plattner & Alonso, MW 04 Lin et. al, SIGMOD 05 Amza et. al, ICDE 05 Scaling for E-Commerce (TPC-W)

7 Dynamic Replication Assume a cluster hosts 2 applications  App1 (Red) using 2 machines  App2 (Blue) using 2 machines Assume App1 has a load spike

8 Dynamic Replication Choose nr. of replicas to allocate to App1  Say, we adapt by allocating one more replica Then, two options  App2 still uses two replicas (overlap replica sets)  App2 loses one replica (disjoint replica sets)

9 Dynamic Replication Choose nr. of replicas to allocate to App1  Say, we adapt by allocating one more replica Then, two options  App2 still uses two replicas (overlap replica sets)  App2 loses one replica (disjoint replica sets)

10 Dynamic Replication Choose nr. of replicas to allocate to App1  Say, we adapt by allocating one more replica Then, two options  App2 still uses two replicas (overlap replica sets)  App2 loses one replica (disjoint replica sets)

11 Challenges Adding a replica can take time  Bring replica up-to-date  Warm-up memory Can avoid adaptation with fully-overlapped replica sets

12 Challenges However, overlapping applications compete for memory causing interference Can avoid interference with disjoint replica sets

13 Challenges However, overlapping applications compete for memory causing interference Can avoid interference with disjoint replica sets Tradeoff between adaptation delay and interference

14 Insight for Dynamic Content Apps Database reads are much heavier than writes  Reads are multi-table joins  Writes are single row updates Overlapping reads – high interference Overlapping writes – little interference

15 Insight for Dynamic Content Apps Database reads are much heavier than writes  Reads are multi-table joins  Writes are single row updates Overlapping reads – high interference Overlapping writes – little interference Solution: Separate reads and overlap writes

16 Our Solution – Partial Overlap Reads of applications sent to disjoint replica sets  Avoids interference Read-Set  Set of replicas where reads are sent

17 Our Solution – Partial Overlap Writes of apps sent to overlapping replica sets  Reduces replica addition time Write-Set  Set of replicas where writes are sent

18 Optimization For a given application,  Replicas in Write-Set – Fully Up-to-Date  Other Replicas – Periodic Batch Updates

19 When do we adapt? Add when application’s requirements not met  Due to either load spikes or failures Remove when replica not needed Application requirements defined through a Service Level Agreement (SLA)

20 Resource Manager Feedback Loop Global Resource Manager Monitor Analyze Request Add/Remove Execute

21 Resource Manager Feedback Loop Global Resource Manager Monitor Analyze Request Add/Remove Execute When does the feedback loop end?

22 Possible Oscillations Change not seen immediately Replica addition takes time  Bring replica fully up-to- date, warm-up memory May trigger more adds Oscillations cause interference between applications Global Resource Manager Monitor Analyze Request Add/Remove Execute

23 Avoiding Oscillations Delay-Awareness  Use load-balance as heuristic for stabilization after replica addition Removes are conservative  Tentative removes Global Resource Manager Monitor Analyze Request Add/Remove Execute

24 Cluster Architecture

25 Experimental Setup Hardware  AMD Athlon running at 2.1 Ghz  512 MB of RAM  60 GB Hard Drive Software  RedHat Fedora Core 2 Linux  Apache with PHP 4.0  MySQL with InnoDB tables Benchmarks  TPC-W: E-Commerce Retail Store  RUBIS: Online Bidding

26 Outline of Results Defined SLA in terms of query latency bound  Query latency < 600 ms Cluster Size  Up to 8 database replicas  10 web/application servers Experiments  Interference between Workloads  Adapting to Load Changes  Adapting to Faults

27 Disjoint

28 Partial Overlap

29 Full Overlap

30 Interference

31 Adaptation to Load Changes

32 Adapting to Load Changes Three schemes  Disjoint – 4/4  Dynamic allocation using Partial overlap  Full Overlap – 8/8

33 Disjoint TPC-WRUBIS

34 Full Overlap TPC-WRUBIS

35 Partial Overlap TPC-WRUBIS

36 Adaptation to Faults

37 Adaptation to Faults

38 More Results - In the Paper More complex load scenarios  Including overload Effect of delay-awareness  Avoiding oscillations

39 Conclusion Database replication  Handle more clients Dynamic replica allocation  Handle multiple workloads with different peaks  Handle faults

40 Thanks!