Team Members Lora zalmover Roni Brodsky Academic Advisor Professional Advisors Dr. Natalya Vanetik Prof. Shlomi Dolev Dr. Guy Tel-Zur.

Slides:



Advertisements
Similar presentations
CHANGING THE WAY IT WORKS Cloud Computing 4/6/2015 Presented by S.Ganesh ( )
Advertisements

Principles of Engineering System Design Dr T Asokan
COURSE: COMPUTER PLATFORMS
Master/Slave Architecture Pattern Source: Pattern-Oriented Software Architecture, Vol. 1, Buschmann, et al.
Abstract HyFS: A Highly Available Distributed File System Jianqiang Luo, Mochan Shrestha, Lihao Xu Department of Computer Science, Wayne State University.
Cloud Computing Brandon Hixon Jonathan Moore. Cloud Computing Brandon Hixon What is Cloud Computing? How does it work? Jonathan Moore What are the key.
Availability in Globally Distributed Storage Systems
An Approach to Secure Cloud Computing Architectures By Y. Serge Joseph FAU security Group February 24th, 2011.
15.1 © 2004 Pearson Education, Inc. Exam Managing and Maintaining a Microsoft® Windows® Server 2003 Environment Lesson 15: Configuring a Windows.
Objektorienteret Middleware Presentation 2: Distributed Systems – A brush up, and relations to Middleware, Heterogeneity & Transparency.
Making Services Fault Tolerant
City University London
Academic Advisor: Prof. Ronen Brafman Team Members: Ran Isenberg Mirit Markovich Noa Aharon Alon Furman.
Introduction to Distributed Systems CS412: Programming Distributed Applications Computer Science Southern Illinois University CS412: Programming Distributed.
Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin CHAPTER FIVE INFRASTRUCTURES: SUSTAINABLE TECHNOLOGIES CHAPTER.
1 Making Services Fault Tolerant Pat Chan, Michael R. Lyu Department of Computer Science and Engineering The Chinese University of Hong Kong Miroslaw Malek.
Implementing Disaster Protection
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 4.
Undergraduate Poster Presentation Match 31, 2015 Department of CSE, BUET, Dhaka, Bangladesh Wireless Sensor Network Integretion With Cloud Computing H.M.A.
Saving Your Business from a Data Loss Randy Clark.
11 SERVER CLUSTERING Chapter 6. Chapter 6: SERVER CLUSTERING2 OVERVIEW  List the types of server clusters.  Determine which type of cluster to use for.
ATIF MEHMOOD MALIK KASHIF SIDDIQUE Improving dependability of Cloud Computing with Fault Tolerance and High Availability.
CHAPTER FIVE Enterprise Architectures. Enterprise Architecture (Introduction) An enterprise-wide plan for managing and implementing corporate data assets.
Cloud MapReduce : a MapReduce Implementation on top of a Cloud Operating System Speaker : 童耀民 MA1G Authors: Huan Liu, Dan Orban Accenture.
Lecture 13 Fault Tolerance Networked vs. Distributed Operating Systems.
Guide to Linux Installation and Administration, 2e 1 Chapter 9 Preparing for Emergencies.
Self stabilizing Linux Kernel Mechanism Doron Mishali, Alex Plits Supervisors: Prof. Shlomi Dolev Dr. Reuven Yagel.
Cloud Computing 1. Outline  Introduction  Evolution  Cloud architecture  Map reduce operation  Platform 2.
Module 9: Configuring Storage
TRƯỜNG ĐẠI HỌC CÔNG NGHỆ Bộ môn Mạng và Truyền Thông Máy Tính.
CS525: Special Topics in DBs Large-Scale Data Management Hadoop/MapReduce Computing Paradigm Spring 2013 WPI, Mohamed Eltabakh 1.
IMPROUVEMENT OF COMPUTER NETWORKS SECURITY BY USING FAULT TOLERANT CLUSTERS Prof. S ERB AUREL Ph. D. Prof. PATRICIU VICTOR-VALERIU Ph. D. Military Technical.
Hadoop/MapReduce Computing Paradigm 1 Shirish Agale.
Presented by: Sanketh Beerabbi University of Central Florida COP Cloud Computing.
Chapter © 2006 The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/ Irwin Chapter 7 IT INFRASTRUCTURES Business-Driven Technologies 7.
COMS E Cloud Computing and Data Center Networking Sambit Sahu
Introduction. Readings r Coulouris, Dollimore and Kindberg Distributed Systems: Concepts and Design Edn. 3 m Note: All figures from this book.
Advanced Computer Networks Topic 2: Characterization of Distributed Systems.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 20 Slide 1 Critical systems development 3.
McLean HIGHER COMPUTER NETWORKING Lesson 15 (a) Disaster Avoidance Description of disaster avoidance: use of anti-virus software use of fault tolerance.
 Apache Airavata Architecture Overview Shameera Rathnayaka Graduate Assistant Science Gateways Group Indiana University 07/27/2015.
Chapter 2 Introduction to Systems Architecture. Chapter goals Discuss the development of automated computing Describe the general capabilities of a computer.
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW  Describe the clustering capabilities of Microsoft Windows.
HADOOP DISTRIBUTED FILE SYSTEM HDFS Reliability Based on “The Hadoop Distributed File System” K. Shvachko et al., MSST 2010 Michael Tsitrin 26/05/13.
CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.
Distributed File Systems 11.2Process SaiRaj Bharath Yalamanchili.
CHAPTER 7 CLUSTERING SERVERS. CLUSTERING TYPES There are 2 types of clustering ; Server clusters Network Load Balancing (NLB) The difference between the.
Data Communications and Networks Chapter 9 – Distributed Systems ICT-BVF8.1- Data Communications and Network Trainer: Dr. Abbes Sebihi.
Hadoop/MapReduce Computing Paradigm 1 CS525: Special Topics in DBs Large-Scale Data Management Presented By Kelly Technologies
CLOUD COMPUTING WHAT IS CLOUD COMPUTING?  Cloud Computing, also known as ‘on-demand computing’, is a kind of Internet-based computing,
Submitted to :- Neeraj Raheja Submitted by :- Ghelib A. Shuaib (Asst. Professor) Roll No : Class :- M.Tech(CSE) 2 nd Year.
© 2010 VMware Inc. All rights reserved Why Virtualize? Beng-Hong Lim, VMware, Inc.
OSIsoft High Availability PI Replication Colin Breck, PI Server Team Dave Oda, PI SDK Team.
Week#3 Software Quality Engineering.
Enterprise Architectures
Chapter 6: Securing the Cloud
TECHNOLOGY GUIDE THREE
Cloud computing-The Future Technologies
Unit OS10: Fault Tolerance
Fault Tolerance In Operating System
TECHNOLOGY GUIDE THREE
Replication Middleware for Cloud Based Storage Service
Anne Pratoomtong ECE734, Spring2002
Distributed Systems Bina Ramamurthy 11/12/2018 From the CDK text.
Unit I Flash Cards Start.
Brandon Hixon Jonathan Moore
Slides for Chapter 1 Characterization of Distributed Systems
TECHNOLOGY GUIDE THREE
Presentation transcript:

Team Members Lora zalmover Roni Brodsky Academic Advisor Professional Advisors Dr. Natalya Vanetik Prof. Shlomi Dolev Dr. Guy Tel-Zur

 Introduction  Solution  Motivation  System Requirements  System Architecture  User Interface

Cloud computing is where data, software and applications, or computer processing power are accessed from a cloud of online resources. Cloud computing today There are commercial tools that provide Cloud computing services, like EC2 of Amazons. Also existing an open-source tools like Eucalyptus which enables users run programs on their own resources and provides a detailed view of what would otherwise be the black box of cloud-computing services.

Motivation for cloud computing  enables individual users to access their data or applications from any device.  allows the organizations to reduce their capital costs by purchasing hardware and software as a utility service. Rather than purchasing servers, software, data center space or network equipment, clients instead buy those resources as a fully outsourced service. Fault tolerance Fault-tolerance is the property that enables a system (often computer-based) to continue operating properly in the event of the failure of (or one or more faults within) some of its components.

In our project we established a cluster of cloud computing. We used an open-source program for managing cloud computing called "Eucalyptus ". On top of the cloud we developed an algorithm for fault tolerance, which purpose is to identify errors during the execution of a program, and handle them with Fault-tolerance by replication. Replication means providing multiple identical instances of the same system or subsystem, directing tasks or requests to all of them in parallel, and choosing the correct result on the basis of a quorum. After identifying the failure our system tries to recover several times.

In order to demonstrate the motivation of our system, we will present an example. Suppose you have a statistical algorithm that determines whether a given number is prime. Since the result is statistical, there is some probability for getting wrong answer. Our system significantly increases the chances of getting the right answer. Several replicas will run the same algorithm simultaneously, and the answer will be determined by the majority.

Main functional requirements:  Running a program without support of fault tolerance on cloud computing cluster  Running a program with support of fault tolerance on cloud computing cluster (detection and recovery)  Statistical information

Server 1Server 2Server 3... Server i Cloud Client Server FTDR Application Client