DEGISCO Desktop Grids For International Scientific Collaboration Details on Roadmap (technical, legal, human aspects) Budapest, 14.07.2011 Robert Lovas,

Slides:



Advertisements
Similar presentations
1 IDGF-SP International Desktop Grid Federation - Support Project Overview London, UK, 26/9/2012 Robert Lovas, MTA SZTAKI - Project coordinator IDGF-SP.
Advertisements

P-GRADE and WS-PGRADE portals supporting desktop grids and clouds Peter Kacsuk MTA SZTAKI
BOINC The Year in Review David P. Anderson Space Sciences Laboratory U.C. Berkeley 22 Oct 2009.
Volunteer Computing and Hubs David P. Anderson Space Sciences Lab University of California, Berkeley HUBbub September 26, 2013.
Module 13: Network Load Balancing Fundamentals. Server Availability and Scalability Overview Windows Network Load Balancing Configuring Windows Network.
Version 4.0. Objectives Describe how networks impact our daily lives. Describe the role of data networking in the human network. Identify the key components.
Chapter 6 Operating System Support. This chapter describes how middleware is supported by the operating system facilities at the nodes of a distributed.
Hadoop Hardware Infrastructure considerations ©2013 OpalSoft Big Data.
Introduction to information technology Dr.Fatimah Ali Al-Rowibah.
Grid Computing at The Hartford Condor Week 2008 Robert Nordlund
Introduction. Readings r Coulouris, Dollimore and Kindberg Distributed Systems: Concepts and Design Edn. 3 m Note: All figures from this book.
BOINC.
BOINC: Progress and Plans David P. Anderson Space Sciences Lab University of California, Berkeley BOINC:FAST August 2013.
Internet2 AdvCollab Apps 1 Access Grid Vision To create virtual spaces where distributed people can work together. Challenges:
11 Introduction to EDGI Peter Kacsuk, MTA SZTAKI Start date: Duration: 27 months EDGI.
BOINC: An Open Platform for Public-Resource Computing David P. Anderson Space Sciences Laboratory U.C. Berkeley.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Public Distributed Computing with BOINC.
1 IDGF International Desktop Grid Federation ASSESSING THE PERFORMANCE OF DESKTOP GRID APPLICATIONS A. Afanasiev, N. Khrapov, and M. Posypkin DEGISCO is.
Background Computer System Architectures Computer System Software.
Introduction of Desktop Grids MTA SZTAKI, 04/17/2012 Akos Balasko EDGI is supported by the FP7 Capacities Programme under contract nr RI DEGISCO.
INTRODUCTION TO GRID & CLOUD COMPUTING U. Jhashuva 1 Asst. Professor Dept. of CSE.
A Complete Guide to Select the Best VPS Hosting Providers.
SCI-BUS project Pre-kick-off meeting University of Westminster Centre for Parallel Computing Tamas Kiss, Stephen Winter, Gabor.
Centre for Parallel Computing Tamas Kiss Centre for Parallel Computing A Distributed Rendering Service Tamas Kiss Centre for Parallel Computing Research.
CernVM and Volunteer Computing Ivan D Reid Brunel University London Laurence Field CERN.
Volunteer Computing and BOINC Dr. David P. Anderson University of California, Berkeley Dec 3, 2010.
Frontiers of Volunteer Computing David Anderson Space Sciences Lab UC Berkeley 30 Dec
The Future of Volunteer Computing David P. Anderson U.C. Berkeley Space Sciences Lab UH CS Dept. March 22, 2007.
Emulating Volunteer Computing Scheduling Policies Dr. David P. Anderson University of California, Berkeley May 20, 2011.
Volunteer Computing: Involving the World in Science David P. Anderson U.C. Berkeley Space Sciences Lab February 16, 2007.
The Limits of Volunteer Computing Dr. David P. Anderson University of California, Berkeley March 20, 2011.
Volunteer Computing and Large-Scale Simulation David P. Anderson U.C. Berkeley Space Sciences Lab February 3, 2007.
Using volunteered resources for data-intensive computing and storage David Anderson Space Sciences Lab UC Berkeley 10 April 2012.
Volunteer Computing with BOINC: a Tutorial David P. Anderson Space Sciences Laboratory University of California – Berkeley May 16, 2006.
Linux Overview Date: 18/09/ Outline What is Operating System Operating System Services Types of Operating System Parts of Operating System Introduction.
Introduction to Computers - Hardware
An Overview of Volunteer Computing
With Office 365, Collaborative Solution by Qorus Streamlines Document Assembly and Enhances Productivity for Any Business-Critical Documents OFFICE 365.
Volunteer Computing and BOINC
Software Requirements
Unit 3 Virtualization.
Chapter 6: Securing the Cloud
Amazon Web Services (aws)
Understanding The Cloud
Modularity Most useful abstractions an OS wants to offer can’t be directly realized by hardware Modularity is one technique the OS uses to provide better.
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING
Chapter 1: Introduction
BEST CLOUD COMPUTING PLATFORM Skype : mukesh.k.bansal.
Volunteer Computing: SETI and Beyond David P
Volunteer Computing for Science Gateways
The EDGI (European Desktop Grid Initiative) infrastructure and its usage for the European Grid user communities József Kovács (MTA SZTAKI)
2. OPERATING SYSTEM 2.1 Operating System Function
SuperComputing 2003 “The Great Academia / Industry Grid Debate” ?
Designing a Runtime System for Volunteer Computing David P
Job Scheduling in a Grid Computing Environment
Chapter 2: System Structures
Grid Computing.
Introduction to client/server architecture
TYPES OFF OPERATING SYSTEM
Introduction to Cloud Computing
Chapter 1: Introduction
EECS 498 Introduction to Distributed Systems Fall 2017
Chapter 2: System Structures
Chapter 2: The Linux System Part 1
Grid Computing Done by: Shamsa Amur Al-Matani.
Lecture Topics: 11/1 General Operating System Concepts Processes
Windows Virtual PC / Hyper-V
LO2 – Understand Computer Software
Introduction To Distributed Systems
Chapter 1: Introduction
Presentation transcript:

DEGISCO Desktop Grids For International Scientific Collaboration Details on Roadmap (technical, legal, human aspects) Budapest, Robert Lovas, MTA SZTAKI DEGISCO is supported by the FP7 Capacities Programme under contract nr RI

DEGISCO WP4 14/07/ Presentation outline 1.Effectiveness (costs) 2.Human factors 3.Technology (trust and security) 4.Legal

DEGISCO WP4 14/07/ Cost effectiveness  For scientists, volunteer computing is cheaper than other paradigms.  A medium-scale project (10,000 computers, 100 TeraFLOPS) can be run using a single server computer and one or two staff ($ $ USD).  An equivalent CPU cluster costs at least an order of magnitude more.  Cloud computing is even more expensive. – However, studies suggest that cloud computing is cost-effective for hosting volunteer computing project servers.

DEGISCO WP4 14/07/ Human factors  In volunteer computing these factors are particularly crucial and complex. Several surveys have been done: Why do people volunteer?  Support scientific goals: such as curing diseases, finding extraterrestrial life, or predicting climate change.  Community: some volunteers enjoy participating in the online communities and social networks.  Competition: some volunteers are interested in the performance of computer systems, and they use volunteer computing to quantify and publicize the performance of their computers.  There are attempts to commercialize volunteer computing by paying participants, directly or via a lottery, and reselling the computing power.  Local (enterprise) Desktop Grids have commercial success (Platform, Parabon) at several companies (e.g. Novartis).

DEGISCO WP4 14/07/ Human factors (cont)  To attract and retain volunteers, a project must perform a variety of human functions.  Web content describing its research goals, methods, and credentials.  Periodic updates on its scientific progress.  It must manage the moderation of its web site’s message boards to ensure that they remain positive and useful.  Publicize itself by media. Volunteers must trust projects, but projects cannot trust volunteers.

DEGISCO WP4 14/07/ Technical challenges  Heterogeneity. The volunteer computer population is extremely diverse in terms of hardware (processor type and speed, RAM, disk space), software (operating system and version) and networking (bandwidth, proxies, firewalls). – BOINC provides scheduling mechanisms that assign jobs to the hosts that can best handle them. However, projects still generally need to compile applications for several platforms (Windows 32 and 64 bit, Mac OS X, Linux 32 and 64 bit, various GPU platforms). This difficulty may soon be reduced by running applications in virtual machines.  Sporadic availability and churn. Volunteer computers are not dedicated. – The time intervals when a computer is on, and when BOINC is allowed to compute, are sporadic and generally unpredictable. BOINC tracks these factors and uses them in estimating job completion times. In addition, computers are constantly joining and leaving the pool of a given project. BOINC must address the fact that computers with many jobs in progress may disappear forever.

DEGISCO WP4 14/07/ Technical challenges (cont)  Result validation. Because volunteer computers are anonymous and untrusted, BOINC cannot assume that job results are correct, or that the claimed credit is accurate. – One general way of dealing with this is replication: that is, send a copy of each job to multiple computers; compare the results; accept the result if the replicas agree; otherwise issue additional replicas.  Scalability. Large volunteer projects can involve a million hosts and millions of jobs processed per day. This is beyond the capabilities of grid and cluster systems. – BOINC addresses this using an efficient server architecture that can be distributed across multiple machines. The server is based on a relational database, so BOINC leverages advances in scalability and availability of database systems. The communication architecture uses exponential backoff after failures, so that the rate of client requests remains bounded even when a server comes up after a long outage.

DEGISCO WP4 14/07/ Technical challenges (cont)  Security 1: What if hackers break into a project server and use it to distribute malware to the attached computers? – BOINC prevents this by requiring that executables be digitally signed using a secure, offline signing computer.  Security 2: What if hackers create a fraudulent project that poses as academic research while in fact stealing volunteers’ private data? – This is partly addressed by account-based sandboxing: applications are run under an unprivileged user account and typically have no access to files other than their own input and outputs. In the future, stronger sandboxing may be possible using virtual machine technology.

DEGISCO WP4 14/07/ Technology comparison matrix

DEGISCO WP4 14/07/ Local Desktop Grid: University of Westminster (London, UK) as a best practice New Cavendish Street576 nodes 2.Marylebone Campus559 nodes 3.Regent Street395 nodes 4.Wells Street31 nodes 5.Little Tichfield Street66 nodes 6.Harrow Campus254 nodes Total:1881 nodes Lifecycle of a node: 1.PCs basically used by students/staff 2.If unused, switch to Desktop Grid mode 3.No more work from DG server -> shutdown (green solution)

DEGISCO WP4 14/07/ Legal issues resource ownersdata owners DG service providers application provider data provider (storage)end-user resource owners (through granting access to service provider) grant access to his/her own resources by join a project (through granting access to service provider) data owners (through validation process of EADM) grant access in production grant access during developmentstore data in storage allow the exploitation of data DG service providers take no liability but validate applications with EADM take no liability but apply best practices/technologie s host the application grant access during development and in production provide access with SLA application providers (through validation process of EADM) accept the data owner policy collaborate in testing and validation grant access to stored data use Grid-friendly SW license (or self-made software) data provider (storage) (through validation process of EADM) ensure privacy and confidentiality ensure secure access from Grid ensure secure access from apps grant access to stored data end-users (through accepted AUP) accept the data owner policy accept AUP from the grid service provider accept SW licensing policies (or self-made software) accept policy of data provider Liabilities (security, policies) Rights (services, access)

DEGISCO WP4 14/07/ Acknowledgement The presentation is partly based on David P. Anderson (University of Berkeley): Volunteer Computing – The ultimate cloud (Crossroads, Spring 2010)

Globe adapted from wikipedia/commons/f/fa/ Globe.svg