Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers K. Vaidyanathan, S. Narravula, P. Balaji and D. K. Panda Network Based.

Slides:



Advertisements
Similar presentations
Distributed Processing, Client/Server and Clusters
Advertisements

The Big Picture Scientific disciplines have developed a computational branch Models without closed form solutions solved numerically This has lead to.
Daniel Schall, Volker Höfner, Prof. Dr. Theo Härder TU Kaiserslautern.
Exploiting Remote Memory Operations to Design Efficient Reconfiguration for Shared Data-Centers over InfiniBand P. Balaji, K. Vaidyanathan, S. Narravula,
High Performance Cluster Computing Architectures and Systems Hai Jin Internet and Cluster Computing Center.
04/25/06Pavan Balaji (The Ohio State University) Asynchronous Zero-copy Communication for Synchronous Sockets in the Sockets Direct Protocol over InfiniBand.
Performance Characterization of a 10-Gigabit Ethernet TOE W. Feng ¥ P. Balaji α C. Baron £ L. N. Bhuyan £ D. K. Panda α ¥ Advanced Computing Lab, Los Alamos.
Performance Evaluation of RDMA over IP: A Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, S. Narravula, G. Brown, K. Vaidyanathan, P. Balaji,
Netscape Application Server Application Server for Business-Critical Applications Presented By : Khalid Ahmed DS Fall 98.
Institute of Computer Science Foundation for Research and Technology – Hellas Greece Computer Architecture and VLSI Systems Laboratory Exploiting Spatial.
On the Provision of Prioritization and Soft QoS in Dynamically Reconfigurable Shared Data-Centers over InfiniBand P. Balaji, S. Narravula, K. Vaidyanathan,
Energy Conservation in Datacenters through Cluster Memory Management and Barely-Alive Memory Servers Vlasia Anagnostopoulou Susmit.
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy’s National Nuclear.
Technical Architectures
Overview Distributed vs. decentralized Why distributed databases
1 Introduction to Load Balancing: l Definition of Distributed systems. Collection of independent loosely coupled computing resources. l Load Balancing.
RDMA ENABLED WEB SERVER Rajat Sharma. Objective  To implement a Web Server serving HTTP client requests through RDMA replacing the traditional TCP/IP.
World Wide Web Caching: Trends and Technology Greg Barish and Katia Obraczka USC Information Science Institute IEEE Communications Magazine, May 2000 Presented.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
04/26/06D. K. Panda (The Ohio State University) Designing Next Generation Data-Centers with Advanced Communication Protocols and Systems Services P. Balaji,
P. Balaji, S. Bhagvat, D. K. Panda, R. Thakur, and W. Gropp
Web Application Architecture: multi-tier (2-tier, 3-tier) & mvc
Client/Server Software Architectures Yonglei Tao.
Supporting Strong Cache Coherency for Active Caches in Multi-Tier Data-Centers over InfiniBand S. Narravula, P. Balaji, K. Vaidyanathan, S. Krishnamoorthy,
Word Wide Cache Distributed Caching for the Distributed Enterprise.
DNN Performance & Scalability Planning, Evaluating & Improving : Part 2.
Databases and the Internet. Lecture Objectives Databases and the Internet Characteristics and Benefits of Internet Server-Side vs. Client-Side Special.
Database Architectures and the Web Session 5
SEDA: An Architecture for Well-Conditioned, Scalable Internet Services
Towards a Common Communication Infrastructure for Clusters and Grids Darius Buntinas Argonne National Laboratory.
High Performance User-Level Sockets over Gigabit Ethernet Pavan Balaji Ohio State University Piyush Shivam Ohio State University.
DISTRIBUTED COMPUTING
Workload-driven Analysis of File Systems in Shared Multi-Tier Data-Centers over InfiniBand K. Vaidyanathan P. Balaji H. –W. Jin D.K. Panda Network-Based.
Csi315csi315 Client/Server Models. Client/Server Environment LAN or WAN Server Data Berson, Fig 1.4, p.8 clients network.
Scalable Web Server on Heterogeneous Cluster CHEN Ge.
1 A Framework for Data-Intensive Computing with Cloud Bursting Tekin Bicer David ChiuGagan Agrawal Department of Compute Science and Engineering The Ohio.
© Lindsay Bradford1 Scaling Dynamic Web Content Provision Using Elapsed-Time- Based Content Degradation Lindsay Bradford, Stephen Milliner and.
Swapping to Remote Memory over InfiniBand: An Approach using a High Performance Network Block Device Shuang LiangRanjit NoronhaDhabaleswar K. Panda IEEE.
Heavy and lightweight dynamic network services: challenges and experiments for designing intelligent solutions in evolvable next generation networks Laurent.
Architecture for Caching Responses with Multiple Dynamic Dependencies in Multi-Tier Data- Centers over InfiniBand S. Narravula, P. Balaji, K. Vaidyanathan,
Srihari Makineni & Ravi Iyer Communications Technology Lab
N. GSU Slide 1 Chapter 05 Clustered Systems for Massive Parallelism N. Xiong Georgia State University.
Automated Control in Cloud Computing: Challenges and Opportunities Harold C. Lim, Shivnath Babu, Jeffrey S. Chase, and Sujay S. Parekh ACM’s First Workshop.
Plethora: A Wide-Area Read-Write Storage Repository Design Goals, Objectives, and Applications Suresh Jagannathan, Christoph Hoffmann, Ananth Grama Computer.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
1 BRUSSELS - 14 July 2003 Full Security Support in a heterogeneous mobile GRID testbed for wireless extensions to the.
INTRODUCTION TO WEB APPLICATION Chapter 1. In this chapter, you will learn about:  The evolution of the Internet  The beginning of the World Wide Web,
GVis: Grid-enabled Interactive Visualization State Key Laboratory. of CAD&CG Zhejiang University, Hangzhou
1 Supporting Dynamic Migration in Tightly Coupled Grid Applications Liang Chen Qian Zhu Gagan Agrawal Computer Science & Engineering The Ohio State University.
Sockets Direct Protocol Over InfiniBand in Clusters: Is it Beneficial? P. Balaji, S. Narravula, K. Vaidyanathan, S. Krishnamoorthy, J. Wu and D. K. Panda.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
CSI 3125, Preliminaries, page 1 SERVLET. CSI 3125, Preliminaries, page 2 SERVLET A servlet is a server-side software program, written in Java code, that.
High-level Interfaces for Scalable Data Mining Ruoming Jin Gagan Agrawal Department of Computer and Information Sciences Ohio State University.
Mellanox Connectivity Solutions for Scalable HPC Highest Performing, Most Efficient End-to-End Connectivity for Servers and Storage September 2010 Brandon.
Sockets Direct Protocol for Hybrid Network Stacks: A Case Study with iWARP over 10G Ethernet P. Balaji, S. Bhagvat, R. Thakur and D. K. Panda, Mathematics.
1 A Scalable Distributed Data Management System for ATLAS David Cameron CERN CHEP 2006 Mumbai, India.
Tackling I/O Issues 1 David Race 16 March 2010.
Cluster computing. 1.What is cluster computing? 2.Need of cluster computing. 3.Architecture 4.Applications of cluster computing 5.Advantages of cluster.
LIOProf: Exposing Lustre File System Behavior for I/O Middleware
INTRODUCTION TO GRID & CLOUD COMPUTING U. Jhashuva 1 Asst. Professor Dept. of CSE.
MicroGrid Update & A Synthetic Grid Resource Generator Xin Liu, Yang-suk Kee, Andrew Chien Department of Computer Science and Engineering Center for Networked.
Cloud-based movie search web application with transaction service Group 14 Yuanfan Zhang Ji Zhang Zhuomeng Li.
Fault – Tolerant Distributed Multimedia Streaming Web Application By Nirvan Sagar – Srishti Ganjoo – Syed Shahbaaz Safir
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
Netscape Application Server
Introduction to Load Balancing:
Introduction to Databases Transparencies
DotSlash: An Automated Web Hotspot Rescue System
Resource Allocation for Distributed Streaming Applications
L. Glimcher, R. Jin, G. Agrawal Presented by: Leo Glimcher
Presentation transcript:

Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers K. Vaidyanathan, S. Narravula, P. Balaji and D. K. Panda Network Based Computing Laboratory (NBCL) Computer Science and Engineering Ohio State University

Introduction and Motivation Interactive Data-driven Applications –Scientific as well as Enterprise/Commercial Applications Static Datasets: Medical Imaging Modalities Dynamic Datasets: Stock value datasets, E-commerce, Sensors –Need for interacting, synthesizing and visualizing large datasets –Data-centers enable such capabilities Clients initiate queries (over the web) to process specific datasets –Data-centers process data and reply to queries

Typical Multi-Tier Data-center Environment Requests are received from clients over the WAN Proxy nodes perform caching, load balancing, resource monitoring, etc. If not cached, the request is forwarded to the next tiers  Application Server Application server performs the business logic (CGI, Java servlets, etc.) –Retrieves appropriate data from the database to process the requests Proxy Server Web-server (Apache) Application Server (PHP) Database Server (MySQL) WAN Clients Storage More Computation and Communication Requirements

Overview of Research Propose a novel framework for next generation data-centers –Delivering performance and scalability –Providing advanced features such as active caching, fine-grain resource monitoring, dynamic resource adaptation, etc Novel approaches using the advanced features of InfiniBand and other RDMA-enabled Networks –Resilient to the load on the back-end servers –Order of magnitude performance gain for several scenarios –Exploit features like RDMA and remote atomic operations for new primitives and services Three-layer Architecture –Advanced Communication Protocol Support –Data-Center Primitives –Data-Center Services

Proposed Architecture Existing Data-Center Components RDMAAtomicMulticast Sockets Direct Protocol Protocol Offload Packetized Flow-control Global Memory Aggregator Distributed Lock Manager Point To Point Advanced System Services Data-Center Service Primitives Advanced Communication Protocols and Subsystems Network Active Caching Soft Shared State Async. Zero-copy Communication Cooperative Caching Dynamic Reconfiguration Resource Monitoring Dynamic Content Caching Active Resource Adaptation Distributed Data Sharing Substrate Active Caching Cooperative Caching Dynamic Reconfiguration Resource Monitoring Soft Shared State Distributed Lock Manager Distributed Data Sharing Substrate Async. Zero-copy Communication

Publications (So Far) Architecture for Caching Responses with Multiple Dynamic Dependencies in Multi-Tier Data-Centers over InfiniBand, CCGrid 2005 On the Provision of Prioritization and Soft QoS in Dynamically Reconfigurable Shared Data-Centers over InfiniBand, ISPASS 2005 Asynchronous Zero-copy Communication for Synchronous Sockets in the Sockets Direct Protocol (SDP) over InfiniBand, CAC 2006 Designing Efficient Cooperative Caching Schemes for Multi-Tier Data- Centers over RDMA-enabled Networks, CCGrid 2006 Exploiting RDMA operations for Providing Efficient Fine-Grained Resource Monitoring in Cluster-Based Servers, RAIT 2006 DDSS: A Low-Overhead Distributed Data Sharing Substrate for Cluster-Based Data-Centers over Modern Interconnects, HiPC 2006 High Performance Distributed Lock Management Services using Network-based Remote Atomic Operations, CCGrid

Sockets Direct Protocol: Throughput and Overlap Asynchronous Zero-copy Communication for Synchronous Sockets in the Sockets Direct Protocol (SDP) over InfiniBand, P. Balaji, S. Bhagvat, H. –W. Jin and D. K. Panda. Workshop on Communication Architecture for Clusters (CAC); with IPDPS ‘06.

Presentation Layout  Introduction and Motivation  Cooperative Caching Services  Resource Monitoring Services  Conclusions and Ongoing Work

Cooperative Caching Services Aggregate cache benefits – well known!! Performance considerations –Two-sided operation vs. One-sided RDMA operations –Placement of data ( Local Vs. Remote) –Controlling data redundancy –Utilize available remote memory –Load sensitive Protocols Objective –Can we design efficient cooperative caching schemes utilizing the idle resources in the Data-Centers and the RDMA capabilities in networks and eliminate redundancy to optimize available system cache size?

Data-Center Throughput with Cooperative Caching 8-Proxy nodes Our schemes achieve significant performance gain over basic Apache Caching (AC) Designing Efficient Cooperative Caching Schemes for Multi-Tier Data-Centers over RDMA-enabled Networks, S. Narravula, H. -W. Jin, K. Vaidyanathanand D. K. Panda. In International Symposium on Cluster Computing and the Grid (CCGrid), 2006 TPSTPS

Presentation Layout  Introduction and Motivation  Data-center Service Primitives  Cooperative Caching Services  Resource Monitoring Services  Conclusions and Ongoing Work

Resource Monitoring Services Traditional approaches –Coarse-grained in nature –Assume resource usage is consistent throughout the monitoring granularity (in the order of seconds) This assumption is no longer valid –Resource usage is becoming increasingly divergent Fine-grained monitoring is desired but has additional overheads –High overheads, less accurate, slow in response Can we design fine-grained resource monitoring scheme with low overhead and accurate resource usage?

Synchronous Resource Monitoring using RDMA (RDMA-Sync) /proc Kernel Space User Space Kernel Space User Space Front-end Node Memory CPU App Threads Front-end Monitoring Process Kernel Data Structures App Threads Back-end Node RDMA

Impact of Fine-grained Monitoring with Applications Exploiting RDMA operations for Providing Efficient Fine-Grained Resource Monitoring in Cluster- Based Servers, K. Vaidyanathan, H. –W. Jin and D. K. Panda. Workshop on Remote Direct Memory Access (RDMA): Applications, Implementations and Technologies, 2006 Our schemes (RDMA-Sync and e-RDMA-Sync) achieve significant performance gain over existing schemes

Work-in-Progress Data-Center Primitives –Efficient Global Memory Aggregator Mechanisms Advanced Communication Protocol Mechanisms –Efficient Packetized Flow-Control Detailed Data-Center Evaluation with the proposed framework Software release of several data-center components –Have received multiple requests from organizations for such a release including a large financial company

Conclusions Proposed new protocols, primitives and services for next generation data-centers –Use advanced features of InfiniBand and other RDMA-Enabled interconnects –Significant performance gains and scalability for several scenarios Potential for designing next generation scalable and high performance data-center architectures

Future Challenges Challenges –Benefits of all these components and services in an integrated manner for handling Terabytes of data and Multi-thousand users –Redesigning middleware and applications on next generation data-centers Significance to the SMA and PDOS components of the program Discussion Bullet –How to re-architect next generation data-center architectures, software services, middleware and applications with advances in modern networking technologies and capabilities?

Web Pointers Website: Group Homepage: NBCL