Enabling Technology for Fault Tolerance Ricardo Jiménez-Peris Marta Patiño-Martínez Technical University of Madrid (Universidad Politécnica de Madrid,

Slides:

Advertisements

Similar presentations

Sensor Web, Grid Computing and Geospatial Web Services for Real Time Decision Support Sensor Web, Grid Computing and Geospatial.

Advertisements

Distributed Systems 1 Topics  What is a Distributed System?  Why Distributed Systems?  Examples of Distributed Systems  Distributed System Requirements.

Reliability on Web Services Presented by Pat Chan 17/10/2005.

Distributed Systems Brief Overview CNT Mobile & Pervasive Computing Dr. Sumi Helal University of Florida.

The road to reliable, autonomous distributed systems

Dynamic Service Composition with QoS Assurance Feb , 2009 Jing Dong UTD Farokh Bastani UTD I-Ling Yen UTD.

Objektorienteret Middleware Presentation 2: Distributed Systems – A brush up, and relations to Middleware, Heterogeneity & Transparency.

Replicating Basic Components Bettina Kemme McGill University, Montreal, Canada.

Evaluation of an internet protocol security based virtual private network solution Thesis written by Arto Laukka at TeliaSonera Finland Oyj SupervisorProfessor.

Business Continuity and DR, A Practical Implementation Mich Talebzadeh, Consultant, Deutsche Bank

“Turn you Smart phone into Business phone “

E-business Infrastructure

Algorithm for Virtually Synchronous Group Communication Idit Keidar, Roger Khazan MIT Lab for Computer Science Theory of Distributed Systems Group.

Software Engineering and Middleware: a Roadmap by Wolfgang Emmerich Ebru Dincel Sahitya Gupta.

A DAPT IST Initial Work on Transactional Composite Web Services and Visual Composition tool Ricardo Jiménez-Peris, Marta Patiño-Martínez Alberto.

Dynamic Hypercube Topology Stefan Schmid URAW 2005 Upper Rhine Algorithms Workshop University of Tübingen, Germany.

OCT1 Principles From Chapter One of “Distributed Systems Concepts and Design”

Transactional Services Ricardo Jiménez-Peris Marta Patiño-Martínez Technical University of Madrid 1 st Adapt Workshop 23 rd -24 th September 2002 Madrid,

Data Sharing in OSD Environment Dingshan He September 30, 2002.

Introspective Replica Management Yan Chen, Hakim Weatherspoon, and Dennis Geels Our project developed and evaluated a replica management algorithm suitable.

Page 1 Copyright © Alexander Allister Shvartsman CSE 6510 (461) Fall 2010 Selected Notes on Fault-Tolerance (12) Alexander A. Shvartsman Computer.

A DAPT IST Middle-R: A Middleware for Dynamically Adaptive Database Replication R. Jiménez-Peris, M. Patiño-Martínez, Jesús Milán Distributed.

A Distributed Web Information System Platform for High Responsiveness and Fault Tolerance Jordi Bataller, Hendrik Decker, Luis Irún, Francesc Muñoz Instituto.

QoS-enabled middleware by Saltanat Mashirova. Distributed applications Distributed applications have distinctly different characteristics than conventional.

PHASE 3: SYSTEMS DESIGN Chapter 8 System Architecture.

©Ian Sommerville 2006Software Engineering, 8th edition. Chapter 12 Slide 1 Distributed Systems Architectures.

FMEA-technique of Web Services Analysis and Dependability Ensuring Anatoliy Gorbenko Vyacheslav Kharchenko Olga Tarasyuk National Aerospace University.

Objective 1.2 Cloud Computing, Internet of Services and Advanced Software Engineering Arian Zwegers European Commission Information Society and Media Directorate.

INSTALLING MICROSOFT EXCHANGE SERVER 2003 CLUSTERS AND FRONT-END AND BACK ‑ END SERVERS Chapter 4.

Priority Research Direction (use one slide for each) Key challenges -Fault understanding (RAS), modeling, prediction -Fault isolation/confinement + local.

TRƯỜNG ĐẠI HỌC CÔNG NGHỆ Bộ môn Mạng và Truyền Thông Máy Tính.

Service Architecture of Grid Faults Diagnosis Expert System Based on Web Service Wang Mingzan, Zhang ziye Northeastern University, Shenyang, China.

NEST 1 NEST System Working Group Meeting #1 Jack Stankovic University of Virginia September 2001 Boeing Huntington Beach, CA.

Distributed Systems: Concepts and Design Chapter 1 Pages

ARMADA Middleware and Communication Services T. ABDELZAHER, M. BJORKLUND, S. DAWSON, W.-C. FENG, F. JAHANIAN, S. JOHNSON, P. MARRON, A. MEHRA, T. MITTON,

1 Introduction to Middleware. 2 Outline What is middleware? Purpose and origin Why use it? What Middleware does? Technical details Middleware services.

©Ian Sommerville 2006MSc module: Advanced Software Engineering Slide 1 Service dependability.

ARTEMIS JU Grant Agreement number ARTEMIS JU Grant Agreement number Sept 25-27, 2013 Riga Safety Certification of Software-intensive.

Service Oriented Architectures Presentation By: Clifton Sweeney November 3 rd 2008.

Fault Tolerance David Powell LAAS-CNRS, Toulouse.

Sunday, October 15, 2000 JINI Pattern Language Workshop ACM OOPSLA 2000 Minneapolis, MN, USA Fault Tolerant CORBA Extensions for JINI Pattern Language.

SCALABLE EVOLUTION OF HIGHLY AVAILABLE SYSTEMS BY ABHISHEK ASOKAN 8/6/2004.

Secure Systems Research Group - FAU 1 Active Replication Pattern Ingrid Buckley Dept. of Computer Science and Engineering Florida Atlantic University Boca.

Investigating Survivability Strategies for Ultra-Large Scale (ULS) Systems Vanderbilt University Nashville, Tennessee Institute for Software Integrated.

1 ACTIVE FAULT TOLERANT SYSTEM for OPEN DISTRIBUTED COMPUTING (Autonomic and Trusted Computing 2006) Giray Kömürcü.

Yuhui Chen; Romanovsky, A.; IT Professional Volume 10, Issue 3, May-June 2008 Page(s): Digital Object Identifier /MITP Improving.

Applying Database Replication to Multi-player Online Games Yi Lin Bettina Kemme Marta Patiño-Martínez Ricardo Jiménez-Peris Oct 30, 2006.

Semantic based P2P System for local e-Government Fernando Ortiz-Rodriguez 1, Raúl Palma de León 2 and Boris Villazón-Terrazas 2 1 1Universidad Tamaulipeca.

1 BRUSSELS - 14 July 2003 Full Security Support in a heterogeneous mobile GRID testbed for wireless extensions to the.

Distributed Information Systems. Motivation ● To understand the problems that Web services try to solve it is helpful to understand how distributed information.

CprE 458/558: Real-Time Systems

FT-ERF Fault-Tolerance in an Event Rule Framework for Distributed Systems Hillary Caituiro-Monge, Graduate Student. Advisor: Javier Arroyo-Figueroa, Ph.D.

GLOBE DISTRIBUTED SHARED OBJECT. INTRODUCTION  Globe stands for GLobal Object Based Environment.  Globe is different from CORBA and DCOM that it supports.

OS2- Sem1-83; R. Jalili Introduction Chapter 1. OS2- Sem1-83; R. Jalili Definition of a Distributed System (1) A distributed system is: A collection of.

Distributed Systems: Principles and Paradigms By Andrew S. Tanenbaum and Maarten van Steen.

Definition of a Distributed System (1) A distributed system is: A collection of independent computers that appears to its users as a single coherent system.

Slide title In CAPITALS 50 pt Slide subtitle 32 pt Robust Reconfigurable Erlang Component System ErlCOM Gabor Batori, Zoltan Theisz, Domonkos Asztalos.

Architecture View Models A model is a complete, simplified description of a system from a particular perspective or viewpoint. There is no single view.

Resilience through Dynamic Reconfigurations in Agent Systems Ilya Lopatkin Newcastle University, School of Computing Science.

Section 2.1 Distributed System Design Goals Alex De Ruiter

EJB Replication Graham, Iman, Santosh, Mark Newcastle University.

Slide 3.1 David Chaffey, E-Business & E-Commerce Management, 5 th Edition, © Marketing Insights Limited 2012 Chapter 3 Managing digital business infrastructure.

Langley Research Center An Architectural Concept for Intrusion Tolerance in Air Traffic Networks Jeffrey Maddalon Paul Miner {jeffrey.m.maddalon,

NTT - MIT Research Collaboration — Bi-Annual Report, July 1—December 31, 1999 MIT : Cooperative Computing in Dynamic Environments Nancy Lynch, Idit.

Reaching for k Nines Miroslaw Malek Humboldt University Berlin, Germany

Ricardo Jimenez-Peris Universidad Politecnica de Madrid

Definition of Distributed System

Mobile Computing.

Presentation Title September 22, 2019

Presentation transcript:

Enabling Technology for Fault Tolerance Ricardo Jiménez-Peris Marta Patiño-Martínez Technical University of Madrid (Universidad Politécnica de Madrid, UPM)

Enabling Technology for FT  Dependability is not as widespread as it should due to:  Common restrictions and assumptions that are not acceptable in realistic applications.  The high performance penalties of existing dependable solutions.  Lack of support or adequate interfaces in current middleware for providing FT.

Removing Restrictions for Applying FT  One of the common restriction to achieve FT has been the restriction of single-threaded servers.  Existing solutions for replicating multithreaded servers either restrict the potential concurrency or require an amount of inter-replica communication proportional the degree of synchronization.  An open-issue to be addressed is how to achieve FT of multithreaded servers, typical of current middleware platforms, with the above mentioned restrictions.  Another common restriction is the one of performing recovery and reconfiguration offline.  In order to provide high-available solutions is necessary to develop replication techniques in which recovery and reconfiguration can take place online.

Improving Performance of Fault-Tolerant Solutions  Existing approaches to fault-tolerance of stateful applications are either non-scalable or they sacrifice data consistency.  An important issue to be addressed is how to achieve scalable replication of stateful application whilst still guaranteeing full data consistency.  Another shortcoming usually associated to FT is that the latency of the resulting systems is too poor due to the cost of the underlying agreement protocols.  This is especially true when extending FT to WANs.  It should be addressed how this latency can be improved.

Middleware Support for FT e-Business  The ADAPT project addresses to some extent the FT support at middleware level.  The ADAPT partners are: Technical University of Madrid, Hewlett- Packard, Newcastle, Bologna, ETH Zurich, McGill, Trieste.  The project title is “Middleware Technologies for Adaptive and Composable Components”  The project deals with:  Adaptable web services.  Fault-tolerant and dynamically adaptable middleware (more specifically, J2EE-based application servers).  Workflow-like web service composition.  Predictable QoS of service compositions.  Service diversity for adaptable compositions.