Architecture for Caching Responses with Multiple Dynamic Dependencies in Multi-Tier Data- Centers over InfiniBand S. Narravula, P. Balaji, K. Vaidyanathan,

Slides:



Advertisements
Similar presentations
Summary Cache: A Scalable Wide-Area Web Cache Sharing Protocol Li Fan, Pei Cao and Jussara Almeida University of Wisconsin-Madison Andrei Broder Compaq/DEC.
Advertisements

Distributed Processing, Client/Server and Clusters
Scheduling in Web Server Clusters CS 260 LECTURE 3 From: IBM Technical Report.
Efficient Collective Operations using Remote Memory Operations on VIA-Based Clusters Rinku Gupta Dell Computers Dhabaleswar Panda.
Consistency and Replication Chapter 7 Part II Replica Management & Consistency Protocols.
Exploiting Remote Memory Operations to Design Efficient Reconfiguration for Shared Data-Centers over InfiniBand P. Balaji, K. Vaidyanathan, S. Narravula,
04/25/06Pavan Balaji (The Ohio State University) Asynchronous Zero-copy Communication for Synchronous Sockets in the Sockets Direct Protocol over InfiniBand.
Performance Evaluation of RDMA over IP: A Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, S. Narravula, G. Brown, K. Vaidyanathan, P. Balaji,
On the Provision of Prioritization and Soft QoS in Dynamically Reconfigurable Shared Data-Centers over InfiniBand P. Balaji, S. Narravula, K. Vaidyanathan,
Adding scalability to legacy PHP web applications Overview Mario A. Valdez-Ramirez.
Scalable Content-aware Request Distribution in Cluster-based Network Servers Jianbin Wei 10/4/2001.
1 Web Server Performance in a WAN Environment Vincent W. Freeh Computer Science North Carolina State Vsevolod V. Panteleenko Computer Science & Engineering.
Cooperative Caching of Dynamic Content on a Distributed Web Server Vegard Holmedahl, Ben Smith, Tao Yang Speaker: SeungLak Choi, DB Lab., CS Dept.
Technical Architectures
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
Hypertext Transfer Protocol Kyle Roth Mark Hoover.
1 Design and Implementation of A Content-aware Switch using A Network Processor Li Zhao, Yan Luo, Laxmi Bhuyan University of California, Riverside Ravi.
Web Servers How do our requests for resources on the Internet get handled? Can they be located anywhere? Global?
Online Magazine Bryan Ng. Goal of the Project Product Dynamic Content Easy Administration Development Layered Architecture Object Oriented Adaptive to.
Architectural Impact of SSL Processing Jingnan Yao.
Differentiated Multimedia Web Services Using Quality Aware Transcoding S. Chandra, C.Schlatter Ellis and A.Vahdat InfoCom 2000, IEEE Journal on Selected.
Adaptive Content Delivery for Scalable Web Servers Authors: Rahul Pradhan and Mark Claypool Presented by: David Finkel Computer Science Department Worcester.
Nikolay Tomitov Technical Trainer SoftAcad.bg.  What are Amazon Web services (AWS) ?  What’s cool when developing with AWS ?  Architecture of AWS 
Web Caching Schemes For The Internet – cont. By Jia Wang.
Application Layer  We will learn about protocols by examining popular application-level protocols  HTTP  FTP  SMTP / POP3 / IMAP  Focus on client-server.
World Wide Web Caching: Trends and Technology Greg Barish and Katia Obraczka USC Information Science Institute IEEE Communications Magazine, May 2000 Presented.
Distributed Systems: Client/Server Computing
Locality-Aware Request Distribution in Cluster-based Network Servers Presented by: Kevin Boos Authors: Vivek S. Pai, Mohit Aron, et al. Rice University.
Sockets vs. RDMA Interface over 10-Gigabit Networks: An In-depth Analysis of the Memory Traffic Bottleneck Pavan Balaji  Hemal V. Shah ¥ D. K. Panda 
Multiprocessor Cache Coherency
04/26/06D. K. Panda (The Ohio State University) Designing Next Generation Data-Centers with Advanced Communication Protocols and Systems Services P. Balaji,
Web Server Load Balancing/Scheduling Asima Silva Tim Sutherland.
P. Balaji, S. Bhagvat, D. K. Panda, R. Thakur, and W. Gropp
Web Application Architecture: multi-tier (2-tier, 3-tier) & mvc
Distributed Data Stores – Facebook Presented by Ben Gooding University of Arkansas – April 21, 2015.
Department of Computer Science Southern Illinois University Edwardsville Dr. Hiroshi Fujinoki and Kiran Gollamudi {hfujino,
Supporting Strong Cache Coherency for Active Caches in Multi-Tier Data-Centers over InfiniBand S. Narravula, P. Balaji, K. Vaidyanathan, S. Krishnamoorthy,
Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers K. Vaidyanathan, S. Narravula, P. Balaji and D. K. Panda Network Based.
Design and Implement an Efficient Web Application Server Presented by Tai-Lin Han Date: 11/28/2000.
SAINT ‘01 Proactive DNS Caching: Addressing a Performance Bottleneck Edith Cohen AT&T Labs-Research Haim Kaplan Tel-Aviv University.
1 Design and Performance of a Web Server Accelerator Eric Levy-Abegnoli, Arun Iyengar, Junehwa Song, and Daniel Dias INFOCOM ‘99.
Global NetWatch Copyright © 2003 Global NetWatch, Inc. Factors Affecting Web Performance Getting Maximum Performance Out Of Your Web Server.
Sofia, Bulgaria | 9-10 October SQL Server 2005 High Availability for developers Vladimir Tchalkov Crossroad Ltd. Vladimir Tchalkov Crossroad Ltd.
On the Scale and Performance of Cooperative Web Proxy Caching University of Washington Alec Wolman, Geoff Voelker, Nitin Sharma, Neal Cardwell, Anna Karlin,
High Performance User-Level Sockets over Gigabit Ethernet Pavan Balaji Ohio State University Piyush Shivam Ohio State University.
Workload-driven Analysis of File Systems in Shared Multi-Tier Data-Centers over InfiniBand K. Vaidyanathan P. Balaji H. –W. Jin D.K. Panda Network-Based.
Csi315csi315 Client/Server Models. Client/Server Environment LAN or WAN Server Data Berson, Fig 1.4, p.8 clients network.
Automatic Cache Update Control for Scalable Resource Information Service with WS-Management September 23, 2009 Kumiko Tadano, Fumio Machida, Masahiro Kawato,
Scalable Web Server on Heterogeneous Cluster CHEN Ge.
The Performance of Spin Lock Alternatives for Shared-Memory Multiprocessors THOMAS E. ANDERSON Presented by Daesung Park.
A Throttling Layer-7 Web Switch James Furness. Motivation & Goals Specification & Design Design detail Demonstration Conclusion.
Computer Science Lecture 14, page 1 CS677: Distributed OS Last Class: Concurrency Control Concurrency control –Two phase locks –Time stamps Intro to Replication.
Sockets Direct Protocol Over InfiniBand in Clusters: Is it Beneficial? P. Balaji, S. Narravula, K. Vaidyanathan, S. Krishnamoorthy, J. Wu and D. K. Panda.
Plethora: Infrastructure and System Design. Introduction Peer-to-Peer (P2P) networks: –Self-organizing distributed systems –Nodes receive and provide.
An Architectural Evaluation of Java TPC-W Harold “Trey” Cain, Ravi Rajwar, Morris Marden, Mikko Lipasti University of Wisconsin-Madison
09/13/04 CDA 6506 Network Architecture and Client/Server Computing Peer-to-Peer Computing and Content Distribution Networks by Zornitza Genova Prodanoff.
/ Fast Web Content Delivery An Introduction to Related Techniques by Paper Survey B Li, Chien-chang R Sung, Chih-kuei.
System Optimization Networking
E-commerce Architecture Ayşe Başar Bener. Client Server Architecture E-commerce is based on client/ server architecture –Client processes requesting service.
VIRTUAL SERVERS Chapter 7. 2 OVERVIEW Exchange Server 2003 virtual servers Virtual servers in a clustering environment Creating additional virtual servers.
Adaptive Configuration of a Web Caching Hierarchy Pranav A. Desai Jaspal Subhlok Presented by: Pranav A. Desai.
WWW and HTTP King Fahd University of Petroleum & Minerals
Netscape Application Server
Improving searches through community clustering of information
Distributed Shared Memory
Cache Memory Presentation I
Load Balancing Memcached Traffic Using SDN
On the Scale and Performance of Cooperative Web Proxy Caching
Be Fast, Cheap and in Control
Introduction to Databases Transparencies
Presentation transcript:

Architecture for Caching Responses with Multiple Dynamic Dependencies in Multi-Tier Data- Centers over InfiniBand S. Narravula, P. Balaji, K. Vaidyanathan, H.-W. Jin and D. K. Panda The Ohio State University

Presentation Outline Introduction/Motivation Design and Implementation Experimental Results Conclusions

Introduction Fast Internet Growth  Number of Users  Amount of data  Types of services Several uses  E-Commerce, Online Banking, Online Auctions, etc Web Server Scalability  Multi-Tier Data-Centers  Caching – An Important Technique

Presentation Outline Introduction/Motivation  Multi-Tier Data-Centers  Active Caches Design and Implementation Experimental Results Conclusions

A Typical Multi-Tier Data-Center Database Servers Clients Application Servers Web Servers Proxy Nodes Tier 0 Tier 1 Tier 2 Apache PHP WAN SAN

InfiniBand High Performance  Low latency  High Bandwidth Open Industry Standard Provides rich features  RDMA, Remote Atomic operations, etc Targeted for Data-Centers Transport Layers  VAPI  IPoIB

Presentation Outline Introduction/Motivation  Multi-Tier Data-Centers  Active Caches Design and Implementation Experimental Results Conclusions

Caching Can avoid re-fetching of content Beneficial if requests repeat Important for scalability Static content caching  Well studied in the past  Widely used Front-End Tiers Back-End Tiers Number of Requests Decrease

Active Caching Dynamic Data  Stock Quotes, Scores, Personalized Content, etc  Complexity of content Simple caching methods not suited Issues  Consistency  Coherency Proxy Node Cache Back-End Data User Request Update

Cache Coherency Refers to the average staleness of the document served from cache Strong or immediate (Strong Coherency)  Required for certain kinds of data  Cache Disabling  Client Polling

Basic Client Polling * Front-EndBack-End Request Cache Hit Cache Miss Response Version Read * SAN04: Supporting Strong Cache Coherency for Active Caches in Multi-Tier Data-Centers over InfiniBand. Narravula, et. Al.

Multiple Object Dependencies Cache documents contain multiple objects A Many-to-Many mapping  Single Cache document can contain Multiple Objects  Single Object can be a part of multiple Documents Complexity!! Cache Documents Objects

Client Polling Front-EndBack-End Request Cache Hit Cache Miss Response Version Read Single Check Possible Single Lookup counter essential for correct and efficient design

Objective To design an architecture that very efficiently supports strong cache coherency with multiple dynamic dependencies on InfiniBand

Presentation Outline Introduction/Motivation Design and Implementation Experimental Results Conclusions

Basic System Architecture Server Node Mod Server Node Mod Server Node Mod Server Node Mod Cooperation Cache Lookup Counter maintained on the Application Servers Proxy Servers Application Servers

Basic Design Home Node based Client Polling  Cache Documents assigned home nodes Proxy Server Modules  Client polling functionality Application Server Modules  Support “Version Reads” for client polling  Handle updates

Many-to-Many Mappings Mapping of updates to dynamic objects Mapping of dynamic objects with Lookup counters Efficiency  Factor of dependency Updates Objects Lookup counters

Mapping of updates Non-Trivial solution Three possibilities  Database schema, constraints and dependencies are known  Per query dependencies are known  No dependency information known

Mapping Schemes Dependency Lists  Home node based  Complete dependency lists Invalidate All  Single Lookup Counter for a given class of queries  Low application server overheads

Handling Updates Database Server Ack (Atomic) Application Server Application Server Application Server Update Notification VAPI Send Local Search and Coherent Invalidate HTTP Request HTTP Response DB Query (TCP) DB Response

Presentation Outline Introduction/Motivation Design and Implementation Experimental Results Conclusions

Experimental Test-bed Cluster 1: Eight Dual 3.0 GHz Xeon processor nodes with 64-bit 133MHz PCI-X interface, 512KB L2-Cache and 533 MHz Front Side Bus Cluster 2: Eight Dual 2.4 GHz Xeon processor nodes with 64-bit 133MHz PCI-X interface, 512KB L2-Cache and 400 MHz Front Side Bus Mellanox InfiniHost MT23108 Dual Port 4x HCAs MT43132 eight 4x port Switch Mellanox Golden CD 0.5.0

Experimental Outline Basic Data-Center Performance Cache Misses in Active Caching Impact of Cache Size Impact of Varying Dependencies Impact of Load in Backend Servers Traces Used  Traces 1-5 with increasing update rate  Trace 6: Zipf like trace

Basic Data-Center Performance Maintaining Dependency Lists perform significantly well for all traces

Cache Misses in Active Caching Cache misses for Invalidate All increases drastically with increasing update rates

Impact of Cache Size Maintaining Dependency Lists perform significantly well for all traces Possible to cache a select few and still extract performance

Impact of Varying Dependencies Throughput drops significantly with increase in the average number of dependencies per cache file

Impact of Load in Backend Servers Our design can sustain high performance even under high loaded conditions with a factor of improvement close to 22

Conclusions An architecture for supporting Strong Cache Coherence with multiple dynamic dependencies Efficiently handle multiple dynamic dependencies  Supporting RDMA-based Client polling Resilient to load on back-end servers

Web Pointers {narravul, balaji, vaidyana, jinhy, NBC home page

Back-up Slides

Cache Consistency Non-decreasing views of system state Updates seen by all or none User Requests Proxy Nodes Back-End Nodes Update

Performance Receiver side CPU utilization is very low Leveraging the benefits of One sided communication

RDMA based Client Polling * The VAPI module can sustain performance even with heavy load on the back-end servers * SAN04: Supporting Strong Cache Coherency for Active Caches in Multi-Tier Data-Centers over InfiniBand. Narravula, et. Al.

Mechanism Cache Hit:  Back-end Version Check  If version current, use cache  Invalidate data for failed version check  Use of RDMA-Read Cache Miss  Get data to cache  Initialize local versions

Other Implementation Details Requests to read and update are mutually excluded at the back-end module to avoid simultaneous readers and writers accessing the same data. Minimal changes to existing application software