Brahim Ayari, Abdelmajid Khelil, Neeraj Suri and Eugen Bleim

Slides:

Advertisements

Similar presentations

Two phase commit. Failures in a distributed system Consistency requires agreement among multiple servers –Is transaction X committed? –Have all servers.

Advertisements

Message Queues COMP3017 Advanced Databases Dr Nicholas Gibbins –

(c) Oded Shmueli Distributed Recovery, Lecture 7 (BHG, Chap.7)

1 ICS 214B: Transaction Processing and Distributed Data Management Lecture 12: Three-Phase Commits (3PC) Professor Chen Li.

CIS 720 Concurrency Control. Timestamp-based concurrency control Assign a timestamp ts(T) to each transaction T. Each data item x has two timestamps:

Computer Science Lecture 18, page 1 CS677: Distributed OS Last Class: Fault Tolerance Basic concepts and failure models Failure masking using redundancy.

ICS 421 Spring 2010 Distributed Transactions Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 3/16/20101Lipyeow.

Termination and Recovery MESSAGES RECEIVED SITE 1SITE 2SITE 3SITE 4 initial state committablenon Round 1(1)CNNN-NNNN Round 2FAILED(1)-CNNN--NNN Round 3FAILED.

Database Replication techniques: a Three Parameter Classification Authors : Database Replication techniques: a Three Parameter Classification Authors :

Computer Science Lecture 17, page 1 CS677: Distributed OS Last Class: Fault Tolerance Basic concepts and failure models Failure masking using redundancy.

Transaction Processing in Mobile Distributed Databases Sherida Jacob CSC 536 5/2/2005.

Distributed DBMSPage © 1998 M. Tamer Özsu & Patrick Valduriez Outline Introduction Background Distributed DBMS Architecture Distributed Database.

1 ICS 214B: Transaction Processing and Distributed Data Management Replication Techniques.

CS 425 / ECE 428 Distributed Systems Fall 2014 Indranil Gupta (Indy) Lecture 18: Replication Control All slides © IG.

1 ICS 214B: Transaction Processing and Distributed Data Management Distributed Database Systems.

Distributed Systems Fall 2009 Distributed transactions.

Distributed Databases

Commit Protocols. CS5204 – Operating Systems2 Fault Tolerance Causes of failure: process failure machine failure network failure Goals : transparent:

CS162 Section Lecture 10 Slides based from Lecture and

Distributed Transactions March 15, Transactions What is a Distributed Transaction?  A transaction that involves more than one server  Network.

Distributed Transactions Chapter 13

CSE 486/586 CSE 486/586 Distributed Systems Concurrency Control Steve Ko Computer Sciences and Engineering University at Buffalo.

PAVANI REDDY KATHURI TRANSACTION COMMUNICATION. OUTLINE 0 P ART I : I NTRODUCTION 0 P ART II : C URRENT R ESEARCH 0 P ART III : F UTURE P OTENTIAL 0 R.

ISADS'03 Message Logging and Recovery in Wireless CORBA Using Access Bridge Michael R. Lyu The Chinese Univ. of Hong Kong

Fault Tolerance CSCI 4780/6780. Distributed Commit Commit – Making an operation permanent Transactions in databases One phase commit does not work !!!

University of Tampere, CS Department Distributed Commit.

Low Cost Commit Protocols for Mobile Computing Environments Marc Perron & Baochun Bai.

Commit Algorithms Hamid Al-Hamadi CS 5204 November 17, 2009.

Lampson and Lomet’s Paper: A New Presumed Commit Optimization for Two Phase Commit Doug Cha COEN 317 – SCU Spring 05.

IM NTU Distributed Information Systems 2004 Distributed Transactions -- 1 Distributed Transactions Yih-Kuen Tsay Dept. of Information Management National.

Revisiting failure detectors Some of you asked questions about implementing consensus using S - how does it differ from reaching consensus using P. Here.

Introduction to Distributed Databases Yiwei Wu. Introduction A distributed database is a database in which portions of the database are stored on multiple.

Distributed DBMSPage © 1998 M. Tamer Özsu & Patrick Valduriez Outline Introduction Background Distributed DBMS Architecture Distributed Database.

CMS Advanced Database and Client-Server Applications Distributed Databases slides by Martin Beer and Paul Crowther Connolly and Begg Chapter 22.

Chapter 8 – Fault Tolerance Section 8.5 Distributed Commit Heta Desai Dr. Yanqing Zhang Csc Advanced Operating Systems October 14 th, 2015.

More on Fault Tolerance

Recovery in Distributed Systems:

Ad-hoc Networks.

Two Phase Commit CSE593 Transaction Processing Philip A. Bernstein

Outline Introduction Background Distributed DBMS Architecture

Database System Implementation CSE 507

Two phase commit.

Operating System Reliability

Operating System Reliability

Commit Protocols CS60002: Distributed Systems

Outline Introduction Background Distributed DBMS Architecture

Outline Announcements Fault Tolerance.

Operating System Reliability

Operating System Reliability

CSE 486/586 Distributed Systems Concurrency Control --- 3

Assignment 5 - Solution Problem 1

Distributed Transactions

Lecture 21: Replication Control

DISTRIBUTED DATABASES

Causal Consistency and Two-Phase Commit

Operating System Reliability

Distributed Databases Recovery

Chapter 19: Distributed Transaction Recovery

UNIVERSITAS GUNADARMA

Transactions in Distributed Systems

Lecture 21: Replication Control

Abstractions for Fault Tolerance

CIS 720 Concurrency Control.

CSE 486/586 Distributed Systems Concurrency Control --- 3

Last Class: Fault Tolerance

Operating System Reliability

Transaction Communication

Operating System Reliability

Brahim Ayari, Abdelmajid Khelil and Neeraj Suri

Presentation transcript:

Implementation and Evaluation of Delay-Aware and Fault-Tolerant Mobile Transactions Brahim Ayari, Abdelmajid Khelil, Neeraj Suri and Eugen Bleim Department of Computer Science TU Darmstadt, Germany {brahim, khelil, suri}@informatik.tu-darmstadt.de jonnybleim@gmx.de

Database Use in Healthcare Mobile Healthcare Life assist systems Patient data management Body area network Mobile Database Mobile participants Strict data consistency Atomic commit is a fundamental operation Patient records Wired Network

Objectives Ensure strict atomicity Minimize the blocking time of resources in the wired network Minimize transaction abort rate (heterogeneity and fault-tolerance) Reduce (wireless) message complexity System Model Related Work FT-PPTC Approach Implementation Measurements Summary

System and Fault Model Heterogeneity Perturbations Wireless networks Mobile nodes Slowest process determines resource blocking time Perturbations Disconnections Node/Comm failures Increases resource blocking time or abort rate Wired Network WLAN GPRS UMTS

Related Work Unilateral Commit for Mobile (UCM) [PDCS’00] Supports mobile participants Strict assumptions (two-phase locking for all participants) Mobile Two Phase Commit (M-2PC) [ADC’05] Mobile participants are assumed to be connected from initializing the transaction until finishing the execution of their fragments Transaction Commit On Timeout (TCOT) [TOC’02] Uses timeouts to reduce the number of messages exchanged Provides only semantic atomicity and not strict atomicity Supports only one mobile participant Entail high resource blocking time Handle (small) subset of failures

Fault-Tolerant Pre-Phase Transaction Commit (FT-PPTC) Main idea: ►Decouple commit of mobile participants from fixed participants ►Initial mobile commit, delegate problem to fixed part of network;  Reduction of resource blocking time on fixed participants  Re-use established protocols (commit, recovery, fault-tolerance) CO distributes fragments first for mobile participants and waits for their execution results (Pre-Phase) CO waits for a mobile participant for the timeouts that the participant initializes and updates (execution and shipping timeouts)  CO waits for the slowest mobile participant and proceeds with the following phase iff the pre-phase succeeds Handles network and node heterogeneity Each mobile participant is represented in the fixed network by a proxy called mobile host agent (MH-Agent)  Handles failures: Crash, disconnection, wireless message loss …

Send corresponding execution fragment The FT-PPTC Protocol Initiator Coordinator MH-Agent Mobile Participant Fixed Participant Begin Send transaction & own timeouts Forward fragment Send corresponding execution fragment Send timeouts Forward timeouts Hetero- geneity ~ Pre-Commit Phase ~ Send updates Send YES Vote Send updates Fixed Participant Execution fragment Ack 2PC Prepare Core 2PC Phase Vote Decision Decision Decision Ack Ack Ack End Release resources Release resources Release resources

FT-PPTC: Failure Resilience Scenarios Initiator Coordinator MH-Agent Mobile Participant Begin Send transaction and timeouts Forward fragment Send corresponding execution fragment Send timeouts Transient Disconnect. Forward timeouts ~ Pre-Commit Phase ~ Extend timeouts Forward extended timeouts Send updates ~ Permanent failures Send updates Send YES Vote Fixed Participant Execution fragment Ack 2PC Prepare Core 2PC Phase Vote Decision Decision Decision Ack Ack Ack End Release resources Release resources Release resources

Implementation J2ME: Apache Derby = IBM Cloudscape Database Personal Profile IBM J9 Java Virtual Machine Apache Derby = IBM Cloudscape Database Mobile Device: PDA with Windows Mobile

“fastest” {link + participant} “slowest” {link + participant} Timeout Selection Higher #Transactions Timeout selection lower than “fastest” {link + participant} Optimal timeout selection Timeout selection higher than “slowest” {link + participant}

Transaction Throughput Tolerable decrease Reasons for throughput decrease: Communication bottlenecks Processing/storage resources bottlenecks (Transaction blocking time on most involved resources is a key metric)

Blocking Time Transaction blocking time on servers is constant (independent from the number of started transactions)

Conclusions and Future Work Fault-tolerant & delay-aware atomic commit protocol for mobile transactions Decouples commit of mobile participants from fixed participants  Minimizes blocking time of resources on fixed part of the network  Provides resilience to both communication and node failures Future work Consider ad-hoc communication between the mobile devices

Questions ? Brahim Ayari, Abdelmajid Khelil, Neeraj Suri and Eugen Bleim Department of Computer Science TU Darmstadt, Germany {brahim, khelil, suri}@informatik.tu-darmstadt.de, jonnybleim@gmx.de