September 24, 2007The 3 rd CSAIL Student Workshop Byzantine Fault Tolerant Cooperative Caching Raluca Ada Popa, James Cowling, Barbara Liskov Summer UROP.

Slides:



Advertisements
Similar presentations
Two phase commit. Failures in a distributed system Consistency requires agreement among multiple servers –Is transaction X committed? –Have all servers.
Advertisements

P2P data retrieval DHT (Distributed Hash Tables) Partially based on Hellerstein’s presentation at VLDB2004.
Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan MIT and Berkeley presented by Daniel Figueiredo Chord: A Scalable Peer-to-peer.
Pastry Peter Druschel, Rice University Antony Rowstron, Microsoft Research UK Some slides are borrowed from the original presentation by the authors.
Scalable Content-Addressable Network Lintao Liu
PNUTS: Yahoo!’s Hosted Data Serving Platform Brian F. Cooper, Raghu Ramakrishnan, Utkarsh Srivastava, Adam Silberstein, Philip Bohannon, HansArno Jacobsen,
Replication. Topics r Why Replication? r System Model r Consistency Models r One approach to consistency management and dealing with failures.
Serverless Network File Systems. Network File Systems Allow sharing among independent file systems in a transparent manner Mounting a remote directory.
Thomas ZahnCST1 Seminar: Information Management in the Web Query Processing Over Peer- to-Peer Data Sharing Systems (UC Santa Barbara)
Distributed Systems Fall 2010 Replication Fall 20105DV0203 Outline Group communication Fault-tolerant services –Passive and active replication Highly.
Tzu-Han Wu Yi-Chi Chiang Han-Yang Ou
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
Internet Networking Spring 2006 Tutorial 12 Web Caching Protocols ICP, CARP.
Peer-to-peer Multimedia Streaming and Caching Service Jie WEI, Zhen MA May. 29.
Scalable Application Layer Multicast Suman Banerjee Bobby Bhattacharjee Christopher Kommareddy ACM SIGCOMM Computer Communication Review, Proceedings of.
P2P: Advanced Topics Filesystems over DHTs and P2P research Vyas Sekar.
Signature Based Concurrency Control Thomas Schwarz, S.J. JoAnne Holliday Santa Clara University Santa Clara, CA 95053
Distributed Lookup Systems
CSCS: A Concise Implementation of User-Level Distributed Shared Memory Zhi Zhai Feng Shen Computer Science and Engineering University of Notre Dame Dec.
Concurrency Control & Caching Consistency Issues and Survey Dingshan He November 18, 2002.
BASE: Using Abstraction to Improve Fault Tolerance Rodrigo Rodrigues, Miguel Castro, and Barbara Liskov MIT Laboratory for Computer Science and Microsoft.
Wide-area cooperative storage with CFS
Peer-to-peer Multimedia Streaming and Caching Service by Won J. Jeon and Klara Nahrstedt University of Illinois at Urbana-Champaign, Urbana, USA.
On-Demand Media Streaming Over the Internet Mohamed M. Hefeeda, Bharat K. Bhargava Presented by Sam Distributed Computing Systems, FTDCS Proceedings.
Distributed Databases
Fault Tolerance via the State Machine Replication Approach Favian Contreras.
Distributed Transactions March 15, Transactions What is a Distributed Transaction?  A transaction that involves more than one server  Network.
Failure Resilience in the Peer-to-Peer-System OceanStore Speaker: Corinna Richter.
Project Presentation Students: Yan Michalevsky Asaf Cidon Supervisors: Alexander Shraer Assoc. Prof. Idit Keidar.
1 The Design of a Robust Peer-to-Peer System Rodrigo Rodrigues, Barbara Liskov, Liuba Shrira Presented by Yi Chen Some slides are borrowed from the authors’
Molecular Transactions G. Ramalingam Kapil Vaswani Rigorous Software Engineering, MSRI.
Chord: A Scalable Peer-to-peer Lookup Protocol for Internet Applications Xiaozhou Li COS 461: Computer Networks (precept 04/06/12) Princeton University.
Chapter 19 Recovery and Fault Tolerance Copyright © 2008.
HQ Replication: Efficient Quorum Agreement for Reliable Distributed Systems James Cowling 1, Daniel Myers 1, Barbara Liskov 1 Rodrigo Rodrigues 2, Liuba.
CS 5204 (FALL 2005)1 Leases: An Efficient Fault Tolerant Mechanism for Distributed File Cache Consistency Gray and Cheriton By Farid Merchant Date: 9/21/05.
Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan MIT and Berkeley presented by Daniel Figueiredo Chord: A Scalable Peer-to-peer.
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
Practical Byzantine Fault Tolerance
From Viewstamped Replication to BFT Barbara Liskov MIT CSAIL November 2007.
1 ZYZZYVA: SPECULATIVE BYZANTINE FAULT TOLERANCE R.Kotla, L. Alvisi, M. Dahlin, A. Clement and E. Wong U. T. Austin Best Paper Award at SOSP 2007.
Serverless Network File Systems Overview by Joseph Thompson.
Low Cost Commit Protocols for Mobile Computing Environments Marc Perron & Baochun Bai.
Chapter 6.5 Distributed File Systems Summary Junfei Wen Fall 2013.
Kaleidoscope – Adding Colors to Kademlia Gil Einziger, Roy Friedman, Eyal Kibbar Computer Science, Technion 1.
Architecture Models. Readings r Coulouris, Dollimore and Kindberg Distributed Systems: Concepts and Design Edn. 3 m Note: All figures from this book.
Carnegie Mellon Increasing Intrusion Tolerance Via Scalable Redundancy Mike Reiter Natassa Ailamaki Greg Ganger Priya Narasimhan Chuck Cranor.
PROP: A Scalable and Reliable P2P Assisted Proxy Streaming System Computer Science Department College of William and Mary Lei Guo, Songqing Chen, and Xiaodong.
1 Secure Peer-to-Peer File Sharing Frans Kaashoek, David Karger, Robert Morris, Ion Stoica, Hari Balakrishnan MIT Laboratory.
Caching Consistency and Concurrency Control Contact: Dingshan He
MQTT QoS2 Considerations Konstantin Dotchkoff. Challenges associated with implementing QoS 2 in large scale distributed systems Replication of QoS 2 messages.
Distributed Systems CS Consistency and Replication – Part I Lecture 10, September 30, 2013 Mohammad Hammoud.
Plethora: Infrastructure and System Design. Introduction Peer-to-Peer (P2P) networks: –Self-organizing distributed systems –Nodes receive and provide.
Chapter 7: Consistency & Replication IV - REPLICATION MANAGEMENT By Jyothsna Natarajan Instructor: Prof. Yanqing Zhang Course: Advanced Operating Systems.
Systems Research Barbara Liskov October Replication Goal: provide reliability and availability by storing information at several nodes.
Introduction to Distributed Databases Yiwei Wu. Introduction A distributed database is a database in which portions of the database are stored on multiple.
Antidio Viguria Ann Krueger A Nonblocking Quorum Consensus Protocol for Replicated Data Divyakant Agrawal and Arthur J. Bernstein Paper Presentation: Dependable.
Distributed File Systems Questions answered in this lecture: Why are distributed file systems useful? What is difficult about distributed file systems?
Tzu-Han Wu Yi-Chi Chiang Han-Yang Ou
1 Secure Peer-to-Peer File Sharing Frans Kaashoek, David Karger, Robert Morris, Ion Stoica, Hari Balakrishnan MIT Laboratory.
Magdalena Balazinska, Hari Balakrishnan, and David Karger
Plethora: Infrastructure and System Design
EECS 498 Introduction to Distributed Systems Fall 2017
Providing Secure Storage on the Internet
Outline Announcements Fault Tolerance.
Consistency and Replication
From Viewstamped Replication to BFT
Lazy Type Changes in Object-oriented Databases
Distributed Systems CS
DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S
Presentation transcript:

September 24, 2007The 3 rd CSAIL Student Workshop Byzantine Fault Tolerant Cooperative Caching Raluca Ada Popa, James Cowling, Barbara Liskov Summer UROP CSAIL, MIT

September 24, 2007 The 3 rd CSAIL Student Workshop 2 Outline Introduction Motivation System Description Discussion Conclusions

September 24, 2007 The 3 rd CSAIL Student Workshop 3 Introduction Cooperative caching: multiple peers coordinate their caches Efficiency Scalability Byzantine client: any client that does not behave properly Our contribution is to add Byzantine fault tolerance to cooperative caching

September 24, 2007 The 3 rd CSAIL Student Workshop 4 Motivation Cooperative caching is useless with Byzantine clients  Clients can provide false data Our system is useful for a distributed safe storage system

September 24, 2007 The 3 rd CSAIL Student Workshop 5 System Setting Many clients request/modify pages  Some are Byzantine A storage server  Runs BFT Clients commit transactions at the server Clients implement cooperative caching Need efficient Byzantine fault tolerant cooperative caching on the peer side

September 24, 2007 The 3 rd CSAIL Student Workshop 6 System Overview Optimistic approach  Few Byzantine clients in the common case  Reputation system for efficiency Exploit locality  Organize peers in locality regions

September 24, 2007 The 3 rd CSAIL Student Workshop 7 Optimistic Approach Assumes speculatively that clients are non- Byzantine Upon commit, the server checks if data used was up-to-date Rolls back if not Check is based on hashes Reputation system reduces the number of aborts

September 24, 2007 The 3 rd CSAIL Student Workshop 8 Locality Split peers in locality regions based on their coordinates (eg. Vivaldi) Look up a page in own locality region first Rationale 1. Common case: hot pages are in locality region 2. Fast fetch of data Locality Region Clients

September 24, 2007 The 3 rd CSAIL Student Workshop 9 Page Lookup Consistent hashing function in each locality region: Peer with metadata for Page ID knows which peers store the page Provides replication Page ID Peer with metadata for page ID

September 24, 2007 The 3 rd CSAIL Student Workshop 10 Maintaining Metadata Each client informs the appropriate metadata peer when fetching from the server, updating, or removing a page Metadata peers communicate Server sends periodic invalidation streams (e.g. peer membership)

September 24, 2007 The 3 rd CSAIL Student Workshop 11 Page Fetch Walk-through Server ?Page 5? Look in local cache Ask the metadata node in the locality region Ask the server hash

September 24, 2007 The 3 rd CSAIL Student Workshop 12 Reputation System Each client maintains reputation scores  Initial debit  Debit increased/decreased depending on behavior Is based on information from the server upon commit Fetch from highest reputation peer Clients sign messages

September 24, 2007 The 3 rd CSAIL Student Workshop 13 Discussion Byzantine resilience  Reputation system  Last check performed at server Efficiency  Three short network delays Scalability  Server is offloaded

September 24, 2007 The 3 rd CSAIL Student Workshop 14 Project Status Prototype is designed and implemented Need to evaluate  Efficiency of the reputation system  Average fetch time

September 24, 2007 The 3 rd CSAIL Student Workshop 15 Conclusions First Byzantine fault tolerant cooperative caching system Efficient and scalable We hope it will show good evaluation results

September 24, 2007 The 3 rd CSAIL Student Workshop 16 Thank You Questions?