2/1/00 Porcupine: a highly scalable email service Authors: Y. Saito, B. N. Bershad and H. M. Levy This presentation by: Pratik Mukhopadhyay CSE 291 Presentation.

Slides:



Advertisements
Similar presentations
Database Architectures and the Web
Advertisements

Jaringan Informasi Pengantar Sistem Terdistribusi oleh Ir. Risanuri Hidayat, M.Sc.
Innosoft international inc. Ó 1999 Innosoft International, Inc. Using LDAPv3 for Directory-Enabled Applications & Networking Greg Lavender Director of.
Dynamo: Amazon's Highly Available Key-value Store Distributed Storage Systems CS presented by: Hussam Abu-Libdeh.
Cache Coherent Distributed Shared Memory. Motivations Small processor count –SMP machines –Single shared memory with multiple processors interconnected.
Ao-Jan Su and Aleksandar Kuzmanovic Department of EECS Northwestern University Thinning Akamai USENIX/ACM SIGCOMM IMC ’08.
Content  Overview of Computer Networks (Wireless and Wired)  IP Address, MAC Address and Workgroups  LAN Setup and Creating Workgroup  Concept on.
Clusters Robert Grimm New York University Or: How to replace Big Iron with PCs.
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
1 Principles of Reliable Distributed Systems Tutorial 12: Frangipani Spring 2009 Alex Shraer.
Distributed Hardware How are computers interconnected ? –via a bus-based –via a switch How are processors and memories interconnected ? –Private –shared.
G Robert Grimm New York University Porcupine.
OCT1 Principles From Chapter One of “Distributed Systems Concepts and Design”
Overview Distributed vs. decentralized Why distributed databases
EEC-681/781 Distributed Computing Systems Lecture 3 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
1 A Scalable Content- Addressable Network S. Ratnasamy, P. Francis, M. Handley, R. Karp, S. Shenker Proceedings of ACM SIGCOMM ’01 Sections: 3.5 & 3.7.
1 Porcupine: A Highly Available Cluster-based Mail Service Yasushi Saito Brian Bershad Hank Levy University of Washington Department of Computer Science.
Scalable Applications and Real Time Response Ashish Motivala CS 614 April 17 th 2001.
©Silberschatz, Korth and Sudarshan19.1Database System Concepts Lecture-10 Distributed Database System A distributed database system consists of loosely.
Manageability, Availability and Performance in Porcupine: A Highly Scalable, Cluster-based Mail Service.
16: Distributed Systems1 DISTRIBUTED SYSTEM STRUCTURES NETWORK OPERATING SYSTEMS The users are aware of the physical structure of the network. Each site.
Clusters Robert Grimm New York University Or: How to replace Big Iron with PCs.
CS294, YelickApplications, p1 CS Applications of Reliable Distributed Systems
CS 443 Advanced OS Fabián E. Bustamante, Spring 2005 Porcupine: A Highly Available Cluster- based Mail Service Y. Saito, B. Bershad, H. Levy U. Washington.
 Distributed Software Chapter 18 - Distributed Software1.
Hands-On Microsoft Windows Server 2008 Chapter 8 Managing Windows Server 2008 Network Services.
1 The Google File System Reporter: You-Wei Zhang.
Lecturer: Maxim Podlesny Sep CSE 473 File Transfer and Electronic in Internet.
Module 8: Managing Client Configuration and Connectivity.
70-291: MCSE Guide to Managing a Microsoft Windows Server 2003 Network Chapter 3: TCP/IP Architecture.
Oracle10g RAC Service Architecture Overview of Real Application Cluster Ready Services, Nodeapps, and User Defined Services.
CS Storage Systems Lecture 14 Consistency and Availability Tradeoffs.
Scalability Terminology: Farms, Clones, Partitions, and Packs: RACS and RAPS Bill Devlin, Jim Cray, Bill Laing, George Spix Microsoft Research Dec
INSTALLING MICROSOFT EXCHANGE SERVER 2003 CLUSTERS AND FRONT-END AND BACK ‑ END SERVERS Chapter 4.
Manageability, Availability and Performance in Porcupine: A Highly Scalable, Cluster-based Mail Service Yasushi Saito, Brian N Bershad and Henry M.Levy.
Microsoft Active Directory(AD) A presentation by Robert, Jasmine, Val and Scott IMT546 December 11, 2004.
Exchange Exchange Server Role Architecture in Exchange Server 2013 Server roles in Exchange Server 2013: Client Access Server Mailbox Server Client.
MCSE Guide to Microsoft Exchange Server 2003 Administration Chapter Seven Configuring and Managing Exchange Server.
Database Systems: Design, Implementation, and Management Tenth Edition Chapter 12 Distributed Database Management Systems.
Internet Protocol B Bhupendra Ratha, Lecturer School of Library and Information Science Devi Ahilya University, Indore
FailSafe SGI’s High Availability Solution Mayank Vasa MTS, Linux FailSafe Gatekeeper
Advanced Computer Networks Topic 2: Characterization of Distributed Systems.
Multiprossesors Systems.. What are Distributed Databases ? “ A Logically interrelated collection of shared data ( and a description of this data) physically.
Distributed Computing Systems CSCI 4780/6780. Distributed System A distributed system is: A collection of independent computers that appears to its users.
Oracle's Distributed Database Bora Yasa. Definition A Distributed Database is a set of databases stored on multiple computers at different locations and.
Distributed Databases
Distributed database system
INTERNET PROTOCOLS. Microsoft’s Internet Information Server Home Page Figure IT2031 UNIT-3.
© Chinese University, CSE Dept. Distributed Systems / Distributed Systems Topic 1: Characterization of Distributed & Mobile Systems Dr. Michael R.
Distributed Computing Systems CSCI 6900/4900. Review Distributed system –A collection of independent computers that appears to its users as a single coherent.
Oracle Database Architecture By Ayesha Manzer. Automatic Storage Management Spreads database data across all disks Creates and maintains a storage grid.
Distributed Computing Systems CSCI 4780/6780. Scalability ConceptExample Centralized servicesA single server for all users Centralized dataA single on-line.
MCSE Guide to Microsoft Exchange Server 2003 Administration Chapter One Introduction to Exchange Server 2003.
Distributed File System. Outline Basic Concepts Current project Hadoop Distributed File System Future work Reference.
Distributed Databases
3.1 Types of Servers.
Cluster-Based Scalable
REPLICATION & LOAD BALANCING
Application layer tcp/ip
HELLO WORLD!!! Run Project 2: WELCOME Subject: Virtual LAN’s
Data Communications and Computer Networks Chapter 2 CS 3830 Lecture 9
DNS.
3.1 Types of Servers.
An Introduction to Computer Networking
CS 4594 Broadband PNNI Signaling.
Windows Server Administration Fundamentals
Protocol Application TCP/IP Layer Model
Distributed Systems and Concurrency: Distributed Systems
Distributed Databases
Presentation transcript:

2/1/00 Porcupine: a highly scalable service Authors: Y. Saito, B. N. Bershad and H. M. Levy This presentation by: Pratik Mukhopadhyay CSE 291 Presentation on Full citation: Yasushi Saito, Brian N. Bershad, and Henry M. Levy. "Manageability, Availability and Performance in Porcupine: A Highly Scalable Cluster-based Mail Service." Proceedings of the 17th ACM Symposium on Operating Systems Principles, December 1999.

2/1/00 Goals of Porcupine High Performance : must handle billions of messages Good Scalability : must scale to 100’s of nodes, yet have competitive single node performance High Availability : must mask node failures from users Easy System Administration

2/1/00 System Architecture Functionally homogeneous nodes Key processes : membership manager mailbox manager, user profile manager replication manager mail delivery proxy (SMTP) mail retrieval proxy (POP & IMAP)

2/1/00 Terminology Mailbox fragment Mailbox fragment list User profile database User profile soft state User map Cluster membership list

2/1/00 System Management Desired features : Transparent handling of node addition, deletion and temporary node failures Load balancing across nodes automatically in face of changing workloads

2/1/00 Membership services Uses a variant of Three Round Membership Protocol Failure detection methods: remote operation timeout ping neighbor in IP address order periodically broadcast probe packets periodically Is broadcasting a good idea ? Allowing any node to be a coordinator ?

2/1/00 Recovery process User map reconstruction Soft state reconstruction -- A 2 step process : + Find changes + Notify changes Cache soft state information ? Do we reconfigure after every failure ?

2/1/00 Scaling Easy addition of new nodes : just install software and connect to network ( make IP address known to users ) Performance studies show that the system uses the newly available resources

2/1/00 Replication Basic properties : update anywhere eventual consistency total updates no locking ordered by loosely synchronized clocks Relaxed consistency for the user database ?

2/1/00 Load balancing Collecting load information : + side effect of RPC operations + load information packets Limit spread of a users mail for better performance

2/1/00 Conclusions Performance studies show that Porcupine is scalable, highly available and makes good use of resources under all workloads.

2/1/00 Miscellaneous Functional homogeneity -- good or bad ? Will it work for stuff other than ? For a very large system do we want a single geographical presence ? Special support for mailing list mail ?