Scalability Terminology: Farms, Clones, Partitions, and Packs: RACS and RAPS Bill Devlin, Jim Cray, Bill Laing, George Spix Microsoft Research Dec. 1999.

Slides:



Advertisements
Similar presentations
Chapter 4 Infrastructure as a Service (IaaS)
Advertisements

ITIS 3110 Jason Watson. Replication methods o Primary/Backup o Master/Slave o Multi-master Load-balancing methods o DNS Round-Robin o Reverse Proxy.
CS-550: Distributed File Systems [SiS]1 Resource Management in Distributed Systems: Distributed File Systems.
NETWORK LOAD BALANCING NLB.  Network Load Balancing (NLB) is a Clustering Technology.  Windows Based. (windows server).  To scale performance, Network.
Distributed Systems Fall 2010 Replication Fall 20105DV0203 Outline Group communication Fault-tolerant services –Passive and active replication Highly.
Business Continuity and DR, A Practical Implementation Mich Talebzadeh, Consultant, Deutsche Bank
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
1 Principles of Reliable Distributed Systems Tutorial 12: Frangipani Spring 2009 Alex Shraer.
12/15/00EMTM 5531 EMTM 553: E-commerce Systems Lecture 4: Performance and Scalability Insup Lee Department of Computer and Information Science University.
City University London
Midterm 2: April 28th Material:   Query processing and Optimization, Chapters 12 and 13 (ignore , 12.7, and 13.5)   Transactions, Chapter.
OCT1 Principles From Chapter One of “Distributed Systems Concepts and Design”
OSD Metadata Management
EEC-681/781 Distributed Computing Systems Lecture 3 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
Distributed Systems Fall 2009 Replication Fall 20095DV0203 Outline Group communication Fault-tolerant services –Passive and active replication Highly.
CS6320 – Performance more details L. Grewe 1. System Architecture Client Web Server Tier 2Tier 1Tier 3 Application Server Database Server DMS.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 17 Client-Server Processing, Parallel Database Processing,
ArcGIS for Server Reference Implementations An ArcGIS Server’s architecture tour.
70-293: MCSE Guide to Planning a Microsoft Windows Server 2003 Network, Enhanced Chapter 7: Planning a DNS Strategy.
How to Cluster both Servers and Storage W. Curtis Preston President The Storage Group.
Module 14: Scalability and High Availability. Overview Key high availability features available in Oracle and SQL Server Key scalability features available.
National Manager Database Services
11 SERVER CLUSTERING Chapter 6. Chapter 6: SERVER CLUSTERING2 OVERVIEW  List the types of server clusters.  Determine which type of cluster to use for.
Microsoft Load Balancing and Clustering. Outline Introduction Load balancing Clustering.
Hands-On Microsoft Windows Server 2008 Chapter 8 Managing Windows Server 2008 Network Services.
Frangipani: A Scalable Distributed File System C. A. Thekkath, T. Mann, and E. K. Lee Systems Research Center Digital Equipment Corporation.
1 Oracle 9i AS Availability and Scalability Margaret H. Mei Senior Product Manager, ST.
SANPoint Foundation Suite HA Robert Soderbery Sr. Director, Product Management VERITAS Software Corporation.
Server Load Balancing. Introduction Why is load balancing of servers needed? If there is only one web server responding to all the incoming HTTP requests.
PMIT-6102 Advanced Database Systems
Chapter 10 : Designing a SQL Server 2005 Solution for High Availability MCITP Administrator: Microsoft SQL Server 2005 Database Server Infrastructure Design.
Managing Multi-User Databases AIMS 3710 R. Nakatsu.
1 © 2006 SolidWorks Corp. Confidential. Clustering  SQL can be used in “Cluster Pack” –A pack is a group of servers that operate together and share partitioned.
Module 12: Designing High Availability in Windows Server ® 2008.
Windows 2000 Advanced Server and Clustering Prepared by: Tetsu Nagayama Russ Smith Dale Pena.
1 Scalability Terminology: Farms, Clones, Partitions, and Packs: RACS and RAPS Bill Devlin, Jim Cray, Bill Laing, George Spix Microsoft Research Dec
INSTALLING MICROSOFT EXCHANGE SERVER 2003 CLUSTERS AND FRONT-END AND BACK ‑ END SERVERS Chapter 4.
Microsoft Active Directory(AD) A presentation by Robert, Jasmine, Val and Scott IMT546 December 11, 2004.
Chapter 8 Implementing Disaster Recovery and High Availability Hands-On Virtual Computing.
GigaSpaces Global HTTP Session Sharing October 2013 Massive Web Application Scaling.
Csi315csi315 Client/Server Models. Client/Server Environment LAN or WAN Server Data Berson, Fig 1.4, p.8 clients network.
Module 10: Maintaining High-Availability. Overview Introduction to Availability Increasing Availability Using Failover Clustering Standby Servers and.
7. Replication & HA Objectives –Understand Replication and HA Contents –Standby server –Failover clustering –Virtual server –Cluster –Replication Practicals.
Kjell Orsborn UU - DIS - UDBL DATABASE SYSTEMS - 10p Course No. 2AD235 Spring 2002 A second course on development of database systems Kjell.
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW  Describe the clustering capabilities of Microsoft Windows.
70-293: MCSE Guide to Planning a Microsoft Windows Server 2003 Network, Enhanced Chapter 12: Planning and Implementing Server Availability and Scalability.
Copyright © 2006, GemStone Systems Inc. All Rights Reserved. Increasing computation throughput with Grid Data Caching Jags Ramnarayan Chief Architect GemStone.
CHAPTER 7 CLUSTERING SERVERS. CLUSTERING TYPES There are 2 types of clustering ; Server clusters Network Load Balancing (NLB) The difference between the.
CERN - IT Department CH-1211 Genève 23 Switzerland t High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN,
Features Scalability Manage Services Deliver Features Faster Create Business Value Availability Latency Lifecycle Data Integrity Portability.
1 CEG 2400 Fall 2012 Network Servers. 2 Network Servers Critical Network servers – Contain redundant components Power supplies Fans Memory CPU Hard Drives.
1 Chapter Overview Using Standby Servers Using Failover Clustering.
(re)-Architecting cloud applications on the windows Azure platform CLAEYS Kurt Technology Solution Professional Microsoft EMEA.
VICTORIA UNIVERSITY OF WELLINGTON Te Whare Wananga o te Upoko o te Ika a Maui SWEN 432 Advanced Database Design and Implementation Introduction to Cloud.
Cofax Scalability Document Version Scaling Cofax in General The scalability of Cofax is directly related to the system software, hardware and network.
PERFORMANCE MANAGEMENT IMPROVING PERFORMANCE TECHNIQUES Network management system 1.
SysPlex -What’s the problem Problems are growing faster than uni-processor….1980’s Leads to SMP and loosely coupled Even faster than SMP and loosely coupled.
70-293: MCSE Guide to Planning a Microsoft Windows Server 2003 Network, Enhanced Chapter 12: Planning and Implementing Server Availability and Scalability.
REPLICATION & LOAD BALANCING
Managing Multi-User Databases
Network Load Balancing
Maximum Availability Architecture Enterprise Technology Centre.
A Technical Overview of Microsoft® SQL Server™ 2005 High Availability Beta 2 Matthew Stephen IT Pro Evangelist (SQL Server)
VIDIZMO Deployment Options
Windows Azure 講師: 李智樺, Ruddy Lee
Design Unit 26 Design a small or home office network
Specialized Cloud Mechanisms
Cloud Computing Architecture
Presentation transcript:

Scalability Terminology: Farms, Clones, Partitions, and Packs: RACS and RAPS Bill Devlin, Jim Cray, Bill Laing, George Spix Microsoft Research Dec Based on presentation by Hongwei Zhang, Ohio State U.

2 Outline  Why and how to scale  Ways to organize massive computation  Farm  Geoplex  Ways to scale a farm  Clone (RACS)  Partition (RAPS)

3 Why and how to Scale?  why  Server systems must be able to start small Small-size company (garage-scale) v.s. international company (kingdom-scale)  System should be able to grow as demand grows e.g. eCommerce made system growth more rapid & dynamic ASP also need dynamic growth  how  Scale up - expanding a system by incrementally adding more devices to an existing node – CPUs, discs, NICS, etc. inherently limited  Scale Out – expanding the system by adding more nodes – convenient (computing capacity can be purchased incrementally), no theoretical scalability limit

4 Farm/Geoplex  Farm - the collection of servers, applications and data at a particular site  features: functionally specialized services ( , WWW, directory, database, etc.) administered as a unit (common staff, management policies, facilities, networking)  Geoplex – a replicated (duplicated?) farm at two or more sites  disaster protection  may be active-active: all farms carry some of the load; active-passive: one or more are hot-standbys (waiting for fail-over of corresponding active farms)

5

6

7 Clone  A replica of a server or a service  Allows load balancing  External to the clones  IP sprayer (like Cisco LocalDirectorTM) dispatches (sprays) requests to different nodes in the clone to achieve load- balancing  Internal to the clones  IP sieve like Network Load Balancing in Windows 2000  Every requests arrive at every node in the clone, each node intelligently accepts a part of these requests;  Distributed coordination among nodes

8 RACS  RACS (Reliable Array of Cloned Services) – collection of clones for a particular service  two types  Shared-nothing RACS – each node duplicates all the storage locally  Shared-disk RACS – all the nodes (clones) share a common storage manager. Stateless servers at different nodes access a common backend storage server

9  scalable – good way to add processing power, network bandwidth, and storage bandwidth to a farm;  available  nodes can act as backup for one another: one node fail, other nodes continue to offer service (probably with degraded performance)  Failures could be masked, if node- and application- failure detection mechanisms are integrated with the load-balancing system or with client applications  easy to manage – administrative operations on one service instance at one node could be replicated to all others. RACS advantages

10  Shared-nothing RACS  not a good way to grow storage capacity: updates at one node’s must be applied to all other nodes’ storage  problematic for write-intensive services: all clones must perform all writes (no throughput improvement) and need subtle coordination  Shared-disk RACS  Storage server should be fault-tolerant for availability (only one copy of data)  Still require subtle algorithms to manage updates (such as cache validation, lock managers, transaction logs, etc.) Problems with RACS

11 Partitions and Packs  Partition – service is grown by dividing data among nodes  Only one copy of data in each partition – availability is not improved  Pack – each partition is implemented by a set of servers  Shared disk  Shared nothing Active/active – all members service partition Active/passive – one member services partitions, the others - standby

12  typically, the application middleware partitions the data and workload by object:  Mail servers partition by mailboxes  Sales systems partition by customer accounts or product lines  challenges  When a partition (node) is added, the data should be automatically repartitioned among the nodes to balance the storage and computational load.  The partitioning should automatically adapt as new data is added and as the load changes.  RAPS (Reliable Array of Partitioned Services) – a collection of nodes that support a packed-partitioned service  provides both scalability and availability;  better performance than RACS for write-intensive services. How to partition, RAPS

13 RACS vs. RAPS  Clones and RACS  for or read-mostly applications with low consistency and modest storage requirements (<= 100 GB) Web/file/security/directory servers  Partitions and RAPS  For update-intensive and large database applications (routing requests to specific partitions) /instant messaging/enterprise resource planning(ERP)/record keeping

14 Multi-tier application example  functional separation  front-tier: web and firewall services (read mostly)  middle-tier: file servers (read mostly)  data-tier: SQL (database) servers (update intensive)

15

16 Summary  scalability technique  replicate a service at many nodes  simpler forms of replication  duplicate both programs and data: RACS  for large databases or update-intensive services  data partitioned: RAPS  packs make partitions highly available  against disaster  the entire farm is replicated to form a geoplex