Supporting Strong Cache Coherency for Active Caches in Multi-Tier Data-Centers over InfiniBand S. Narravula, P. Balaji, K. Vaidyanathan, S. Krishnamoorthy,

Slides:



Advertisements
Similar presentations
Indications in green = Live content Indications in white = Edit in master Indications in blue = Locked elements Indications in black = Optional elements.
Advertisements

Consistency and Replication Chapter 7 Part II Replica Management & Consistency Protocols.
Exploiting Remote Memory Operations to Design Efficient Reconfiguration for Shared Data-Centers over InfiniBand P. Balaji, K. Vaidyanathan, S. Narravula,
Performance Evaluation of RDMA over IP: A Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, S. Narravula, G. Brown, K. Vaidyanathan, P. Balaji,
Study of Hurricane and Tornado Operating Systems By Shubhanan Bakre.
On the Provision of Prioritization and Soft QoS in Dynamically Reconfigurable Shared Data-Centers over InfiniBand P. Balaji, S. Narravula, K. Vaidyanathan,
RDS and Oracle 10g RAC Update Paul Tsien, Oracle.
Scalable Content-aware Request Distribution in Cluster-based Network Servers Jianbin Wei 10/4/2001.
Technical Architectures
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
The Effect of Consistency on Cache Response Time John Dilley and HP Laboratories IEEE Network, May-June 2000 Chun-Fu Kung System Laboratory Dept. of Computer.
Improving Proxy Cache Performance: Analysis of Three Replacement Policies Dilley, J.; Arlitt, M. A journal paper of IEEE Internet Computing, Volume: 3.
Web Servers How do our requests for resources on the Internet get handled? Can they be located anywhere? Global?
Locality-Aware Request Distribution in Cluster-based Network Servers 1. Introduction and Motivation --- Why have this idea? 2. Strategies --- How to implement?
Architectural Impact of SSL Processing Jingnan Yao.
Differentiated Multimedia Web Services Using Quality Aware Transcoding S. Chandra, C.Schlatter Ellis and A.Vahdat InfoCom 2000, IEEE Journal on Selected.
Squirrel: A decentralized peer- to-peer web cache Paul Burstein 10/27/2003.
Nikolay Tomitov Technical Trainer SoftAcad.bg.  What are Amazon Web services (AWS) ?  What’s cool when developing with AWS ?  Architecture of AWS 
Web Caching Schemes For The Internet – cont. By Jia Wang.
Application Layer  We will learn about protocols by examining popular application-level protocols  HTTP  FTP  SMTP / POP3 / IMAP  Focus on client-server.
World Wide Web Caching: Trends and Technology Greg Barish and Katia Obraczka USC Information Science Institute IEEE Communications Magazine, May 2000 Presented.
July 16 th, 2005 Software Architecture in Practice RiSE’s Seminars Bass’s at all Book :: Chapters 13 Fred Durão.
Web Cache. Introduction what is web cache?  Introducing proxy servers at certain points in the network that serve in caching Web documents for faster.
Locality-Aware Request Distribution in Cluster-based Network Servers Presented by: Kevin Boos Authors: Vivek S. Pai, Mohit Aron, et al. Rice University.
04/26/06D. K. Panda (The Ohio State University) Designing Next Generation Data-Centers with Advanced Communication Protocols and Systems Services P. Balaji,
Computer Science Cataclysm: Policing Extreme Overloads in Internet Applications Bhuvan Urgaonkar and Prashant Shenoy University of Massachusetts.
Web application architecture
N-Tier Architecture.
1 Web Servers (IIS and Apache) Outline 9.1 Introduction 9.2 HTTP Request Types 9.3 System Architecture 9.4 Client-Side Scripting versus Server-Side Scripting.
Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers K. Vaidyanathan, S. Narravula, P. Balaji and D. K. Panda Network Based.
Design and Implement an Efficient Web Application Server Presented by Tai-Lin Han Date: 11/28/2000.
Global NetWatch Copyright © 2003 Global NetWatch, Inc. Factors Affecting Web Performance Getting Maximum Performance Out Of Your Web Server.
第十四章 J2EE 入门 Introduction What is J2EE ?
Workload-driven Analysis of File Systems in Shared Multi-Tier Data-Centers over InfiniBand K. Vaidyanathan P. Balaji H. –W. Jin D.K. Panda Network-Based.
Building an E-Commerce website Dr. John P. Abraham.
Csi315csi315 Client/Server Models. Client/Server Environment LAN or WAN Server Data Berson, Fig 1.4, p.8 clients network.
Unit – I CLIENT / SERVER ARCHITECTURE. Unit Structure  Evolution of Client/Server Architecture  Client/Server Model  Characteristics of Client/Server.
Scalable Web Server on Heterogeneous Cluster CHEN Ge.
Swapping to Remote Memory over InfiniBand: An Approach using a High Performance Network Block Device Shuang LiangRanjit NoronhaDhabaleswar K. Panda IEEE.
Architecture for Caching Responses with Multiple Dynamic Dependencies in Multi-Tier Data- Centers over InfiniBand S. Narravula, P. Balaji, K. Vaidyanathan,
An Overlay Network Providing Application-Aware Multimedia Services Maarten Wijnants Bart Cornelissen Wim Lamotte Bart De Vleeschauwer.
Kiew-Hong Chua a.k.a Francis Computer Network Presentation 12/5/00.
Adaptive Web Caching CS411 Dynamic Web-Based Systems Flying Pig Fei Teng/Long Zhao/Pallavi Shinde Computer Science Department.
INTRODUCTION TO WEB APPLICATION Chapter 1. In this chapter, you will learn about:  The evolution of the Internet  The beginning of the World Wide Web,
A Method for Transparent Admission Control and Request Scheduling in E-Commerce Web Sites S. Elnikety, E. Nahum, J. Tracey and W. Zwaenpoel Presented By.
C-Hint: An Effective and Reliable Cache Management for RDMA- Accelerated Key-Value Stores Yandong Wang, Xiaoqiao Meng, Li Zhang, Jian Tan Presented by:
Sockets Direct Protocol Over InfiniBand in Clusters: Is it Beneficial? P. Balaji, S. Narravula, K. Vaidyanathan, S. Krishnamoorthy, J. Wu and D. K. Panda.
Proxy Cache Engine Performed by: Artyom Borzin Stas Lapchev Instructor: Hen Broodney In cooperation with Magnifier Ltd. הטכניון - מכון טכנולוגי לישראל.
1 Hidra: History Based Dynamic Resource Allocation For Server Clusters Jayanth Gummaraju 1 and Yoshio Turner 2 1 Stanford University, CA, USA 2 Hewlett-Packard.
Cloud Computing Computer Science Innovations, LLC.
Scalable Data Scale #2 site on the Internet (time on site) >200 billion monthly page views Over 1 million developers in 180 countries.
09/13/04 CDA 6506 Network Architecture and Client/Server Computing Peer-to-Peer Computing and Content Distribution Networks by Zornitza Genova Prodanoff.
Overview on Web Caching COSC 513 Class Presentation Instructor: Prof. M. Anvari Student name: Wei Wei ID:
The Internet Technological Background. Topic Objectives At the end of this topic, you should be able to do the following: Able to define the Internet.
E-commerce Architecture Ayşe Başar Bener. Client Server Architecture E-commerce is based on client/ server architecture –Client processes requesting service.
Varnish Cache and its usage in the real world Ivan Chepurnyi Owner EcomDev BV.
Fault – Tolerant Distributed Multimedia Streaming Web Application By Nirvan Sagar – Srishti Ganjoo – Syed Shahbaaz Safir
Website Deployment Week 12. Software Engineering Practices Consider the generic process framework – Communication – Planning – Modeling – Construction.
CIIT-Human Computer Interaction-CSC456-Fall-2015-Mr
Distributed Cache Technology in Cloud Computing and its Application in the GIS Software Wang Qi Zhu Yitong Peng Cheng
N-Tier Architecture.
Web Development Web Servers.
Principles of Network Applications
Mohammad Malli Chadi Barakat, Walid Dabbous Alcatel meeting
Cache Memory Presentation I
SABRes: Atomic Object Reads for In-Memory Rack-Scale Computing
COS 518: Advanced Computer Systems Lecture 9 Michael Freedman
Be Fast, Cheap and in Control
Introduction to Databases Transparencies
EE 122: Lecture 22 (Overlay Networks)
Presentation transcript:

Supporting Strong Cache Coherency for Active Caches in Multi-Tier Data-Centers over InfiniBand S. Narravula, P. Balaji, K. Vaidyanathan, S. Krishnamoorthy, J. Wu and D. K. Panda The Ohio State University

Presentation Outline Introduction/Motivation Design and Implementation Experimental Results Conclusions

Introduction Fast Internet Growth  Number of Users  Amount of data  Types of services Several uses  E-Commerce, Online Banking, Online Auctions, etc Types of Content  Images, documents, audio clips, video clips, etc - Static Content  Stock Quotes, Online Stores (Amazon), Online Banking, etc. - Dynamic Content (Active

Presentation Outline Introduction/Motivation  Multi-Tier Data-Centers  Active Caches  InfiniBand Design and Implementation Experimental Results Conclusions

Multi-Tier Data-Centers Single Powerful Computers Clusters  Low ‘Cost to Performance’ Ratio  Increasingly Popular Multi-Tier Data-Centers  Scalability – an important issue

A Typical Multi-Tier Data-Center Database Servers Clients Application Servers Web Servers Proxy Nodes Tier 0 Tier 1 Tier 2 Apache PHP WAN

Tiers of a Typical Multi-Tier Data-Center Proxy Nodes  Handle Caching, load balancing, security, etc Web Servers  Handle the HTML content Application Servers  Handle Dynamic Content, Provide Services Database Servers  Handle persistent storage

Data-Center Characteristics Computation Front-End Tiers Back-End Tiers The amount of computation required for processing each request increases as we go to the inner tiers of the Data-Center Caching at the front tiers is an important factor for scalability

Presentation Outline Introduction/Motivation  Introduction  Multi-Tier Data-Centers  Active Caches  InfiniBand Design and Implementation Experimental Results Conclusions

Caching Can avoid re-fetching of content Beneficial if requests repeat Static content caching  Well studied in the past  Widely used Front-End Tiers Back-End Tiers Number of Requests Decrease

Active Caching Dynamic Data  Stock Quotes, Scores, Personalized Content, etc Simple caching methods not suited Issues  Consistency  Coherency Proxy Node Cache Back-End Data User Request Update

Cache Consistency Non-decreasing views of system state Updates seen by all or none User Requests Proxy Nodes Back-End Nodes Update

Cache Coherency Refers to the average staleness of the document served from cache Two models of coherence  Bounded staleness (Weak Coherency)  Strong or immediate (Strong Coherency)

Strong Cache Coherency An absolute necessity for certain kinds of data  Online shopping, Travel ticket availability, Stock Quotes, Online auctions  Example: Online banking Cannot afford to show different values to different concurrent requests

Caching policies ConsistencyCoherency No Caching Client Polling Invalidation *  TTL/Adaptive TTL  *D. Li, P. Cao, and M. Dahlin. WCIP: Web Cache Invalidation Protocol. IETF Internet Draft, November 2000.

Presentation Outline Introduction/Motivation  Introduction  Multi-Tier Data-Centers  Active Caches  InfiniBand Design and Implementation Experimental Results Conclusions

InfiniBand High Performance  Low latency  High Bandwidth Open Industry Standard Provides rich features  RDMA, Remote Atomic operations, etc Targeted for Data-Centers Transport Layers  VAPI  IPoIB  SDP

Performance Low latencies of less than 5us achieved Bandwidth over 840 MB/s * SDP and IPoIB from Voltaire’s Software Stack

Performance Receiver side CPU utilization is very low Leveraging the benefits of One sided communication

Caching policies ConsistencyCoherency No Caching Client Polling Invalidation  TTL/Adaptive TTL 

Objective To design an architecture that very efficiently supports strong cache coherency on InfiniBand

Presentation Outline Introduction/Motivation Design and Implementation Experimental Results Conclusions

Basic Architecture External modules are used  Module communication can use any transport Versioning:  Application servers version dynamic data  Version value of data passed to front end with every request to back-end  Version maintained by front end along with cached value of response

Mechanism Cache Hit:  Back-end Version Check  If version current, use cache  Invalidate data for failed version check Cache Miss  Get data to cache  Initialize local versions

Architecture Front-EndBack-End Request Cache Hit Cache Miss Response

Design Every server has an associated module that uses IPoIB, SDP or VAPI to communicate VAPI:  When a request arrives at proxy, VAPI module is contacted.  Module reads latest version of the data from the back-end using one-sided RDMA Read operation  If versions do not match, cached value is invalidated

VAPI Architecture Front-EndBack-End Request Cache Hit Cache Miss Response RDMA Read

Implementation Socket-based Implementation:  IPoIB and SDP are used  Back-end version check is done using two-sided communication from the module Requests to read and update are mutually excluded at the back-end module to avoid simultaneous readers and writers accessing the same data. Minimal changes to existing software

Presentation Outline Introduction/Motivation Design and Implementation Experimental Results Conclusions

Data-Center: Performance The VAPI module can sustain performance even with heavy load on the back-end servers

Data-Center: Performance The VAPI module responds faster even with heavy load on the back-end servers

Response Time Breakup Worst case Module Overhead less than 10% of the response time Minimal overhead for VAPI based version check even for 200 compute threads

Data-Center: Throughput The drop in the throughput of VAPI in World cup trace is due to the higher penalty for cache misses under increased load VAPI implementation does better for real trace too

Conclusions An architecture for supporting Strong Cache Coherence External module based design  Freedom in choice of transport  Minimal changes to existing software Sockets API inherent limitation  Two-sided communication  High performance Sockets not the solution (SDP) Main benefit  One sided nature of RDMA calls

Web Pointers {narravul, balaji, vaidyana, savitha, wuj, NBC home page