The Spensa File System. Douglas Santry, Computer Laboratory, University of Cambridge.

Target Environment: “Lots” of physical machines in a machine room. Physical machines are interconnected by a “high” quality network. Machines are cheap and stuffed with “large” ATA disk drives.

What are they doing? Machines are running virtual machines (Xen or VMware). Virtual machines are mobile; that is, they migrate between physical machines. There is very little explicit file sharing between virtual machines. Candidates include corporate data centres, “service” providers, and e-commerce sites.

Challenges: Data availability and reliability. Load balancing and performance tuning. Service differentiation and guarantees. Location transparency: virtual machines and data need to move transparently to one another. ATA disks are cheap – they WILL fail.

Spensa Features: service differentiation, service guarantees, service isolation, automatic load balancing, automatic performance tuning.

Spensa: A Distributed File System. Two components: a client file system and a server. Servers store opaque objects – they have no notion of file systems. The client file system is backed by objects on the servers and offers the traditional file system hierarchy and name space.
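The split between an object-only server and a hierarchy-aware client can be sketched as follows. This is a minimal illustration, not Spensa's actual design: the class names `ObjectServer` and `ClientFileSystem`, the JSON directory encoding, and the random object ids are all assumptions made for the example.

```python
import json
import uuid


class ObjectServer:
    """Hypothetical server: stores opaque byte blobs keyed by object id.

    It never interprets the bytes, so it has no notion of files or paths.
    """

    def __init__(self):
        self._objects = {}

    def put(self, object_id, data: bytes):
        self._objects[object_id] = data

    def get(self, object_id) -> bytes:
        return self._objects[object_id]


class ClientFileSystem:
    """Hypothetical client: builds directories and files out of objects."""

    ROOT_ID = "root"  # assumed well-known id for the root directory object

    def __init__(self, server):
        self.server = server
        self._write_dir(self.ROOT_ID, {})

    def _read_dir(self, object_id):
        # A directory is just an object whose bytes encode name -> object id.
        return json.loads(self.server.get(object_id).decode())

    def _write_dir(self, object_id, entries):
        self.server.put(object_id, json.dumps(entries).encode())

    def _resolve_dir(self, parts, create=False):
        current = self.ROOT_ID
        for name in parts:
            entries = self._read_dir(current)
            if name not in entries:
                if not create:
                    raise FileNotFoundError(name)
                child = uuid.uuid4().hex
                self._write_dir(child, {})
                entries[name] = child
                self._write_dir(current, entries)
            current = entries[name]
        return current

    def write_file(self, path, data: bytes):
        *dirs, filename = path.strip("/").split("/")
        parent = self._resolve_dir(dirs, create=True)
        entries = self._read_dir(parent)
        file_id = entries.get(filename) or uuid.uuid4().hex
        self.server.put(file_id, data)
        entries[filename] = file_id
        self._write_dir(parent, entries)

    def read_file(self, path) -> bytes:
        *dirs, filename = path.strip("/").split("/")
        parent = self._resolve_dir(dirs)
        entries = self._read_dir(parent)
        if filename not in entries:
            raise FileNotFoundError(path)
        return self.server.get(entries[filename])
```

Writing `/home/alice/notes.txt` through the client leaves four blobs on the server (root, `home`, `alice`, and the file), none of which the server can tell apart.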

Figure: An instance of a Spensa (name: foo). Labels: the directory tree /, usr, home, mnt; Machine A, Machine B, Machine C; foo’s bascauda. Spensa operates on objects.

Figure: Bascauda A, Bascauda B, Bascauda C; VMs with Spensa B, Spensa C and Spensa A mounted.

Spensa continued: Every physical machine runs application virtual machines and a Spensa server. Spensa servers run inside dedicated virtual machines, one per physical machine.

Reliability and Availability: Replication. At 50 cents/GB, one can be free with it. The replication factor is specified on a per-Spensa basis.

Reading Replicas: The Spensa client broadcasts a request for data to all copies of it. The first machine to fetch it answers and cancels the fetch on its peers.
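The broadcast-and-cancel read described above might look roughly like this. It is a sketch under stated assumptions: replicas are simulated by threads with a fixed delay, and cancellation is a shared `threading.Event` set by the caller, whereas in the slides the winning server cancels the fetch on its peers. The function names are hypothetical.

```python
import threading
import time
from concurrent.futures import FIRST_COMPLETED, ThreadPoolExecutor, wait


def fetch_from_replica(name, delay_s, data, cancelled):
    """Simulate a disk fetch on one replica; stop early if a peer answered."""
    deadline = time.monotonic() + delay_s
    while time.monotonic() < deadline:
        if cancelled.is_set():
            return None  # fetch abandoned, another replica already answered
        time.sleep(0.005)
    return name, data


def read_from_replicas(replicas, data):
    """Send the request to every replica and return the first answer.

    `replicas` maps a replica name to its simulated fetch time in seconds.
    """
    cancelled = threading.Event()
    with ThreadPoolExecutor(max_workers=len(replicas)) as pool:
        futures = [
            pool.submit(fetch_from_replica, name, delay, data, cancelled)
            for name, delay in replicas.items()
        ]
        done, _pending = wait(futures, return_when=FIRST_COMPLETED)
        cancelled.set()  # tell the slower replicas to stop fetching
        winner = next(f.result() for f in done if f.result() is not None)
    return winner
```

The read latency is that of the fastest replica, at the price of every replica starting the fetch; the cancel keeps the wasted work short.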

Caching: Servers reside in virtual machines alongside all of the other virtual machines, so memory is critical. Servers do not cache client data. Servers cache path-critical metadata to minimize latency (the backing file system’s inodes, bitmaps, etc.).

Service: Service can be specified in terms of time or bandwidth. Time is specified as a percentage. Bandwidth is specified in KB/s. Latency is specified in milliseconds. A server is configured for either time or bandwidth; they are mutually exclusive.
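One way to picture the mutually exclusive time and bandwidth modes is as a small validated record. The sketch below is illustrative only: `ServiceSpec`, `Server`, and `admit` are hypothetical names, treating latency as an optional extra field is an assumption, and so is the rule that time shares on one server may not exceed 100%.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ServiceSpec:
    """Hypothetical service request: exactly one of time or bandwidth."""

    time_percent: Optional[float] = None    # share of server time, in percent
    bandwidth_kb_s: Optional[float] = None  # bandwidth, in KB/s
    latency_ms: Optional[float] = None      # assumed optional latency bound

    def __post_init__(self):
        if (self.time_percent is None) == (self.bandwidth_kb_s is None):
            raise ValueError("give exactly one of time_percent, bandwidth_kb_s")
        if self.time_percent is not None and not 0 < self.time_percent <= 100:
            raise ValueError("time_percent must be in (0, 100]")
        if self.bandwidth_kb_s is not None and self.bandwidth_kb_s <= 0:
            raise ValueError("bandwidth_kb_s must be positive")

    @property
    def mode(self):
        return "time" if self.time_percent is not None else "bandwidth"


class Server:
    """Hypothetical server configured for a single service mode."""

    def __init__(self, mode):
        if mode not in ("time", "bandwidth"):
            raise ValueError("mode must be 'time' or 'bandwidth'")
        self.mode = mode
        self.admitted = []

    def admit(self, spec):
        if spec.mode != self.mode:
            raise ValueError(f"server is configured for {self.mode} only")
        if self.mode == "time":
            # Assumption: time shares on one server cannot exceed 100%.
            used = sum(s.time_percent for s in self.admitted)
            if used + spec.time_percent > 100:
                raise ValueError("time shares would exceed 100%")
        self.admitted.append(spec)
```

A server created as `Server("bandwidth")` accepts `ServiceSpec(bandwidth_kb_s=512)` and rejects `ServiceSpec(time_percent=10)`.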

Service Continued: Enforcement is distributed. There are no centralised or interposed enforcement machines or mechanisms. Bandwidth seems to be more intuitive for humans to specify. Bandwidth offers tighter short-term control.

Load Balancing: There are too many machines (real and virtual) for a human to make provisioning decisions, so Spensa auto-provisions. Load balancing mitigates poor decisions. Virtual diffusion with direct migration.
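The slide does not spell out "virtual diffusion with direct migration". One plausible reading, offered here only as an assumption, is that a textbook diffusion scheme is run on the load figures alone to find target loads, and data is then moved straight from overloaded to underloaded machines instead of hop by hop. The function names, the `alpha` step size, and the example topology are all hypothetical.

```python
def virtual_diffusion(loads, neighbours, alpha=0.25, rounds=50):
    """Run diffusion on the load numbers only; no data moves in this phase.

    Each round, every machine shifts alpha * (peer load - own load) towards
    each neighbour. `neighbours` must be symmetric so total load is conserved,
    and alpha should be below 1 / (max neighbour count) for stability.
    """
    virtual = dict(loads)
    for _ in range(rounds):
        nxt = dict(virtual)
        for node, peers in neighbours.items():
            for peer in peers:
                nxt[node] += alpha * (virtual[peer] - virtual[node])
        virtual = nxt
    return virtual


def direct_migrations(loads, targets, eps=1e-9):
    """Pair machines above target with machines below it, one hop each."""
    surplus = [[n, loads[n] - targets[n]] for n in loads if loads[n] > targets[n]]
    deficit = [[n, targets[n] - loads[n]] for n in loads if loads[n] < targets[n]]
    plan = []
    i = j = 0
    while i < len(surplus) and j < len(deficit):
        amount = min(surplus[i][1], deficit[j][1])
        plan.append((surplus[i][0], deficit[j][0], amount))
        surplus[i][1] -= amount
        deficit[j][1] -= amount
        if surplus[i][1] <= eps:
            i += 1
        if deficit[j][1] <= eps:
            j += 1
    return plan


loads = {"A": 90.0, "B": 20.0, "C": 10.0}
neighbours = {"A": ["B"], "B": ["A", "C"], "C": ["B"]}
targets = virtual_diffusion(loads, neighbours)
plan = direct_migrations(loads, targets)
```

In this example the plan sends load from A directly to C even though the two are not neighbours in the diffusion graph, which is the point of separating the virtual phase from the actual migration.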

Diffusion: Bascaudae need to be decomposed for partial migration. Bascaudae are decomposed in the object name space (the server has no knowledge of the file system’s name space). Traffic is not Poisson – use the real distribution. Servers keep a per-bascauda load and address reference histogram.
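A per-bascauda reference histogram can drive the choice of which slice of the object name space to migrate. The sketch below is an assumption-laden illustration: it supposes integer object ids, fixed-width histogram buckets, and a split by contiguous id range, none of which the slides confirm. `BascaudaStats` and its methods are hypothetical names.

```python
from collections import Counter


class BascaudaStats:
    """Hypothetical per-bascauda load and address-reference histogram."""

    def __init__(self, bucket_size):
        self.bucket_size = bucket_size  # object ids per histogram bucket
        self.histogram = Counter()

    def record(self, object_id, weight=1):
        """Count one reference to an object (ids assumed to be integers)."""
        self.histogram[object_id // self.bucket_size] += weight

    def total_load(self):
        return sum(self.histogram.values())

    def split_point(self, fraction):
        """Return an object-id boundary for a partial migration.

        Ids below the boundary carry at least `fraction` of the observed
        references, using the measured distribution rather than a model.
        Returns None if nothing has been recorded.
        """
        total = self.total_load()
        if total == 0:
            return None
        target = fraction * total
        running = 0
        for bucket in sorted(self.histogram):
            running += self.histogram[bucket]
            if running >= target:
                return (bucket + 1) * self.bucket_size
        return None
```

With skewed traffic the boundary lands early in the id space: if half of all references hit the first bucket, `split_point(0.5)` returns the end of that bucket, so a small id range can shed half the load.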