Frontiers of Volunteer Computing
David Anderson
Space Sciences Lab, UC Berkeley
30 Dec. 2011
High-Throughput Computing (HTC)
● Thousands or millions of separate jobs
● What matters is the rate of job completion
  – not the turnaround time of individual jobs
● Can use commodity computers
  – don't need supercomputers
Scientific use of HTC
● Physical simulation
  – particle physics
  – atomic/molecular (bio, nano)
  – Earth climate system
● Compute-intensive data analysis
  – LHC (particle physics)
  – LIGO (gravitational waves)
  – genomics
● Bio-inspired optimization
  – genetic algorithms, flocking, ant colony, etc.
Measures of computing throughput
● Floating-point operations (FLOP)
  – benchmarks: Linpack, Whetstone
● GigaFLOPS (10^9/sec): 1 PC
● TeraFLOPS (10^12/sec): 1 GPU
● PetaFLOPS (10^15/sec): supercomputer
● ExaFLOPS (10^18/sec): the future
Approaches to HPC
● Cluster computing
  – commodity or rack-mount PCs in one room
● Grid computing
  – sharing of clusters among organizations
● Cloud computing
  – rent cluster nodes, e.g. Amazon EC2
● Volunteer computing
  – PC owners donate use of their resources
Computing capacity
● Cluster: 1,000 nodes = ~10 TeraFLOPS
● Grid: the largest is ~100,000 nodes
● Cloud: Amazon ~100,000 nodes; ~1 PetaFLOPS
● Volunteer (actual):
  – 700,000 PCs, 100,000 with GPUs; 12 PetaFLOPS
● Volunteer (current potential):
  – 1.5 billion PCs: 100 ExaFLOPS
  – 5 billion mobile devices
Cost (for 10 TeraFLOPS for one year)
● Cluster: $1.5M
● Amazon EC2 (5,000 instances): $4M
● Volunteer: ~$0.1M
Energy
All computing uses energy, but:
● In cold climates, volunteer computing replaces conventional heating
● GPUs are 10X more energy-efficient than CPUs
● Mobile-device CPUs are 10X more energy-efficient than desktop CPUs
Volunteer computing with BOINC
[Diagram: volunteers attach their computers to BOINC projects such as Climateprediction.net (CPDN) and World Community Grid (WCG)]
How to volunteer
Choose projects
Configure
Graphical interfaces
Community
Creating a BOINC project
● Install BOINC server software on a Linux box
● Compile apps for Windows/Mac/Linux
● Attract volunteers
  – develop a web site
  – generate publicity
  – communicate with volunteers
Some projects
● IBM World Community Grid
● Climateprediction.net
● [additional projects shown as logos]
Fundamental problems of volunteer computing
● Heterogeneity
  – need to compile apps for Windows and Mac
  – portability is hard even on Linux
● Security
  – currently: account-based sandboxing
  – not enough for untrusted apps
Virtual machine technology can solve both problems.
Virtual machines
[Diagrams: an application running directly on an operating system; the same application running in a guest operating system on top of a host operating system; a concrete example with a Debian Linux 2.6 guest running on a Windows 7 host]
VirtualBox: a VM system
● Open source (owned by Oracle)
● Rich feature set
● Low runtime overhead
● Easy to install
Process structure
[Diagram: the BOINC client talks to vboxwrapper via shared-memory message passing; vboxwrapper controls the VirtualBox daemon through its command-line tool; the application in the VM instance exchanges data with the host through file-based communication]
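To make the control path concrete, here is a minimal Python sketch (not the real vboxwrapper, which is a C++ BOINC application) of how a wrapper process can drive a VirtualBox VM through the VBoxManage command-line tool; the VM name and polling interval are illustrative assumptions.

```python
# Minimal sketch (not the real vboxwrapper) of controlling a VirtualBox VM
# through the VBoxManage command-line tool. Assumes a VM named "boinc_app_vm"
# has already been registered with VirtualBox, and that the guest shuts the
# VM down when the application finishes.
import subprocess
import time

VM_NAME = "boinc_app_vm"   # hypothetical VM name

def vbox(*args):
    """Run a VBoxManage command and return its standard output."""
    return subprocess.run(["VBoxManage", *args],
                          capture_output=True, text=True, check=True).stdout

def vm_is_running():
    """Check the VM state via 'VBoxManage showvminfo'."""
    info = vbox("showvminfo", VM_NAME, "--machinereadable")
    return 'VMState="running"' in info

# Start the VM without a display window, as a wrapper would on a volunteer host.
vbox("startvm", VM_NAME, "--type", "headless")

# Poll until the guest powers off; the real wrapper also reports progress to
# the BOINC client over shared memory and exchanges input/output files with
# the guest (file-based communication).
while vm_is_running():
    time.sleep(10)
```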
VM advantages
● Applications run in the developer's favorite environment (OS, libraries)
  – no need for multiple versions
● A VM is a strong “sandbox”
  – an application running in a VM can't access the host OS
  – can run untrusted applications
Volunteer storage
● A modern PC has a ~1 TB disk
● 1M PCs × 100 GB = 100 Petabytes
● Storing that much on Amazon: ~$120 million/year
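As a quick sanity check of the capacity arithmetic (decimal units assumed):

\[ 10^{6}\ \text{PCs} \times 100\,\text{GB} = 10^{6} \times 10^{11}\,\text{B} = 10^{17}\,\text{B} = 100\,\text{PB} \]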
BOINC storage architecture
[Diagram: a common BOINC file-management infrastructure supports several storage applications: dataset storage, data archival, data-stream buffering, and locality scheduling]
Data archival
● Goals:
  – store large files for long periods
  – arbitrarily high reliability
Recovery in volunteer storage
[Diagram sequence: the server uploads a file's data to clients and can then delete its own copy; when a client fails (X), the data is reassembled on the server from the remaining clients and downloaded to a new client, restoring redundancy]
Volunteer storage issues
● High churn rate of hosts
  – ~90-day mean lifetime
● High latency of file transfers
  – hours or days
● Modeling volunteer storage systems
  – overlapping failure and recovery
  – server storage and bandwidth may be bottlenecks
Replication
[Diagram: a file divided into N pieces, each replicated on M hosts]
● Advantages:
  – fast recovery (1 upload, 1 download)
  – increase N to reduce server storage needs
● But:
  – high space overhead
  – reliability decreases exponentially with N
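A rough reliability estimate under this interpretation (N pieces, M replicas per piece, each replica independently surviving a given period with probability s; the symbol s is illustrative, not from the slide):

\[ P_{\text{file survives}} = \bigl(1 - (1-s)^{M}\bigr)^{N} \]

A piece is lost only if all M of its replicas fail, but the file is lost if any of the N pieces is lost, so survival probability falls geometrically (exponentially) with N, while a larger M helps only that one piece.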
Coding
Divide the file into N blocks and generate K additional “checksum” blocks; the file can be recovered from any N of the N+K blocks.
● Advantages:
  – high reliability with low space overhead
● But:
  – recovering a block requires reassembling the entire file (network and space overhead)
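Under the same illustrative assumption of independent per-block survival probability s, an (N, K) code survives as long as at least N of the N+K blocks remain:

\[ P_{\text{file survives}} = \sum_{i=N}^{N+K} \binom{N+K}{i}\, s^{\,i} (1-s)^{N+K-i} \]

which is far higher than replication for comparable space overhead, at the cost of having to gather N blocks to rebuild anything.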
Multi-level coding
● Divide the file, encode each piece separately
● Use encoding for the top-level chunks as well
● Can extend to more than 2 levels
[Diagram: two-level encoding, with (N, K) coding applied at each level]
Hybrid coding/replication
● Use multi-level coding, but replicate each bottom-level block 2 or 3X
● Most failures will be recovered via replication
● The idea: get both the fast recovery of replication and the high reliability of coding
Distributed storage simulator
● Inputs:
  – host arrival rate, lifetime distribution, upload/download speeds, free disk space
  – parameters of the files to be stored
● Policies that can be simulated:
  – M-level coding, N and K coding values, R-fold replication
● Outputs:
  – statistics of server disk space usage, network bandwidth, “vulnerability” level
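The BOINC simulator itself is not shown here; as an illustration of the kind of question it answers, the following Python sketch estimates by Monte Carlo the probability that a file stored as N chunks with R replicas each survives a year, using the ~90-day mean host lifetime and a roughly one-day transfer latency from the previous slides. All other parameter values and names are assumptions.

```python
# Monte-Carlo sketch (not the actual BOINC simulator): estimate the chance
# that a file stored as n_chunks chunks, each with `replicas` replicas,
# survives `period` days given exponentially distributed host lifetimes and
# a fixed recovery latency for re-creating a lost replica.
import random

def simulate_once(n_chunks=40, replicas=2, mean_lifetime=90.0,
                  recovery_latency=1.0, period=365.0):
    """Return True if every chunk keeps at least one live replica for `period` days."""
    for _ in range(n_chunks):
        # Future failure times of this chunk's currently live replicas.
        deaths = sorted(random.expovariate(1.0 / mean_lifetime)
                        for _ in range(replicas))
        while True:
            t = deaths.pop(0)          # earliest replica failure
            if t > period:
                break                  # chunk survives the whole period
            if not deaths:
                return False           # last replica died: chunk (and file) lost
            # Re-create the lost replica after the recovery latency.
            # (Simplification: the replacement counts as live while in transfer.)
            deaths.append(t + recovery_latency +
                          random.expovariate(1.0 / mean_lifetime))
            deaths.sort()
    return True

runs = 10_000
survived = sum(simulate_once() for _ in range(runs))
print(f"estimated one-year survival probability: {survived / runs:.4f}")
```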
Multi-user projects
● Needed:
  – remote job submission mechanism
  – quota system
  – scheduling support for batches
[Diagram: scientists submit batches of jobs through a science portal to a BOINC server run by sysadmins]
Quota system
● Each user has a quota
● Batch prioritization goals:
  – enforce quotas over the long term
  – give priority to short batches
  – don't starve long batches
Batch prioritization
● Each user U has a “logical start time” LST(U)
● Prioritize batches by increasing logical end time
[Diagram: example timeline showing batches B1–B4 and the logical start times LST(U1) and LST(U2) of two users]
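The slide does not spell out the full algorithm; the sketch below shows one plausible interpretation in Python: each user has a resource share, a submitted batch gets a logical end time of LST(U) plus its estimated runtime divided by that share, LST(U) advances to that value, and batches are dispatched in order of increasing logical end time. Class and parameter names are invented for illustration.

```python
# Sketch of logical-start-time (LST) batch prioritization: an assumed
# interpretation of the slide, not the exact BOINC algorithm. Short batches
# get early logical end times and run first; heavy users accumulate large
# logical start times, enforcing quotas over the long term without starving
# long batches.
import heapq
import time
from dataclasses import dataclass, field

@dataclass(order=True)
class Batch:
    logical_end_time: float
    name: str = field(compare=False)
    user: str = field(compare=False)

class BatchScheduler:
    def __init__(self, shares):
        self.shares = shares                          # user -> resource share
        self.lst = {u: time.time() for u in shares}   # logical start times
        self.queue = []                               # min-heap by logical end time

    def submit(self, user, name, est_runtime):
        """Assign the batch a logical interval and queue it."""
        start = max(time.time(), self.lst[user])
        end = start + est_runtime / self.shares[user]
        self.lst[user] = end                          # charge the user's quota
        heapq.heappush(self.queue, Batch(end, name, user))

    def next_batch(self):
        """Dispatch the batch with the earliest logical end time."""
        return heapq.heappop(self.queue) if self.queue else None

# Example: a short batch from one user jumps ahead of a much larger batch
# from another, but neither user is starved indefinitely.
sched = BatchScheduler(shares={"alice": 1.0, "bob": 1.0})
sched.submit("bob", "B1", est_runtime=100_000)
sched.submit("alice", "B2", est_runtime=1_000)
print(sched.next_batch().name)   # B2 (earlier logical end time)
```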
Thank you!