BOINC.

Slides:



Advertisements
Similar presentations
Distributed Processing, Client/Server and Clusters
Advertisements

BOINC: A System for Public-Resource Computing and Storage David P. Anderson University of California, Berkeley.
BOINC Berkeley Open Infrastructure for Network Computing An open-source middleware system for volunteer and grid computing (much of the images and text.
Dinker Batra CLUSTERING Categories of Clusters. Dinker Batra Introduction A computer cluster is a group of linked computers, working together closely.
Chapter 21 Successfully Implementing The Information System
Distributed Processing, Client/Server, and Clusters
A Service Platform for On-Line Games DebanJan Saha, Dambit Sahu, Anees Shaikh (IBM TJ Watson Research Center, NY) Presented by Gary Huang March 17, 2004.
Rheeve: A Plug-n-Play Peer- to-Peer Computing Platform Wang-kee Poon and Jiannong Cao Department of Computing, The Hong Kong Polytechnic University ICDCSW.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Volunteer Computing.
High-Performance Task Distribution for Volunteer Computing Rom Walton
A. Frank 1 Internet Resources Discovery (IRD) Peer-to-Peer (P2P) Technology (1) Thanks to Carmit Valit and Olga Gamayunov.
Volunteer Computing David P. Anderson Space Sciences Lab U.C. Berkeley May 2, 2007.
Lecture 6 - Other Distributed Systems CSE 490h – Introduction to Distributed Computing, Spring 2007 Except as otherwise noted, the content of this presentation.
Grid Computing Exposing the myths of desktop scavenging grids John Easton – IBM Grid computing
Client/Server Architecture
TOPIC 1 – SERVER SIDE APPLICATIONS IFS 234 – SERVER SIDE APPLICATION DEVELOPMENT.
Grid Toolkits Globus, Condor, BOINC, Xgrid Young Suk Moon.
Research on cloud computing application in the peer-to-peer based video-on-demand systems Speaker : 吳靖緯 MA0G rd International Workshop.
Parallel Computing The Bad News –Hardware is not getting faster fast enough –Too many architectures –Existing architectures are too specific –Programs.
Web Based Applications
Public-resource computing for CEPC Simulation Wenxiao Kan Computing Center/Institute of High Physics Energy Chinese Academic of Science CEPC2014 Scientific.
1 NETE4631 Managing the Cloud and Capacity Planning Lecture Notes #8.
A Distributed Computing System Based on BOINC September - CHEP 2004 Pedro Andrade António Amorim Jaime Villate.
November , 2009SERVICE COMPUTATION 2009 Analysis of Energy Efficiency in Clouds H. AbdelSalamK. Maly R. MukkamalaM. Zubair Department.
Volunteer Computing with BOINC David P. Anderson Space Sciences Laboratory University of California, Berkeley.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Designing Middleware for Volunteer Computing.
Exa-Scale Volunteer Computing David P. Anderson Space Sciences Laboratory U.C. Berkeley.
David Cameron Claire Adam Bourdarios Andrej Filipcic Eric Lancon Wenjing Wu ATLAS Computing Jamboree, 3 December 2014 Volunteer Computing.
David Cameron Riccardo Bianchi Claire Adam Bourdarios Andrej Filipcic Eric Lançon Efrat Tal Hod Wenjing Wu on behalf of the ATLAS Collaboration CHEP 15,
07:44:46Service Oriented Cyberinfrastructure Lab, Introduction to BOINC By: Andrew J Younge
Supporting Molecular Simulation-based Bio/Nano Research on Computational GRIDs Karpjoo Jeong Konkuk Suntae.
Server Performance, Scaling, Reliability and Configuration Norman White.
GAAIN Virtual Appliances: Virtual Machine Technology for Scientific Data Analysis Arihant Patawari USC Stevens Neuroimaging and Informatics Institute July.
BOINC: Progress and Plans David P. Anderson Space Sciences Lab University of California, Berkeley BOINC:FAST August 2013.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Designing Middleware for Volunteer Computing.
Mobile Agents For Mobile Computing Department Of Computer Science – Dartmouth College Robert Gray David Kotz Saurab Nog Daniela Rus George Cybenko.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Public and Grid Computing.
TEMPLATE DESIGN © BOINC: Middleware for Volunteer Computing David P. Anderson Space Sciences Laboratory University of.
Virtualization and Databases Ashraf Aboulnaga University of Waterloo.
June 30 - July 2, 2009AIMS 2009 Towards Energy Efficient Change Management in A Cloud Computing Environment: A Pro-Active Approach H. AbdelSalamK. Maly.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
Chapter 14 Advanced Architectural Styles. Objectives Describe the characteristics of a distributed system Explain how middleware supports distributed.
1 Volunteer Computing at CERN past, present and future Ben Segal / CERN (describing the work of many people at CERN and elsewhere ) White Area lecture.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Public Distributed Computing with BOINC.
Vignesh Ravindran Sankarbala Manoharan. Infrastructure As A Service (IAAS) is a model that is used to deliver a platform virtualization environment with.
BOINC: An Open Platform for Public-Resource Computing David P. Anderson Space Sciences Laboratory U.C. Berkeley.
The History of Clustering. What is computer clustering? Computer clustings is when a group of computers are linked together operating as one, sharing.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Public Distributed Computing with BOINC.
Next Generation of Apache Hadoop MapReduce Owen
CLIENT SERVER COMPUTING. We have 2 types of n/w architectures – client server and peer to peer. In P2P, each system has equal capabilities and responsibilities.
Fermilab Scientific Computing Division Fermi National Accelerator Laboratory, Batavia, Illinois, USA. Off-the-Shelf Hardware and Software DAQ Performance.
CernVM and Volunteer Computing Ivan D Reid Brunel University London Laurence Field CERN.
The Future of Volunteer Computing David P. Anderson U.C. Berkeley Space Sciences Lab UH CS Dept. March 22, 2007.
Volunteer Computing: Involving the World in Science David P. Anderson U.C. Berkeley Space Sciences Lab February 16, 2007.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Supercomputing with Personal Computers.
Successfully Implementing The Information System Systems Analysis and Design Kendall and Kendall Fifth Edition.
Volunteer Computing and Large-Scale Simulation David P. Anderson U.C. Berkeley Space Sciences Lab February 3, 2007.
Volunteer Computing with BOINC: a Tutorial David P. Anderson Space Sciences Laboratory University of California – Berkeley May 16, 2006.
An Overview of Volunteer Computing
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING
Volunteer Computing: Planting the Flag David P
Volunteer Computing: SETI and Beyond David P
Designing a Runtime System for Volunteer Computing David P
Job Scheduling in a Grid Computing Environment
David Cameron ATLAS Site Jamboree, 20 Jan 2017
Meng Cao, Xiangqing Sun, Ziyue Chen May 28th, 2014
Grid Means Business OGF-20, Manchester, May 2007
Chapter 17: Database System Architectures
Database System Architectures
Exploring Multi-Core on
Presentation transcript:

BOINC

What is BOINC? “Berkeley Open Infrastructure for Network Computing” Platform for Internet-wide distributed applications Volunteer computing infrastructure Relies on many far-flung users volunteering spare CPU power

Some Facts 1,000,000+ active nodes 521 TFLOPS of computing power 20 active projects (SETI@Home, Folding@Home, Malaria Control…) and several more in development (Current as of March 2007)

Comparison to MapReduce Both are frameworks on which “useful” systems can be built Does not prescribe particular programming style Much more heterogeneous architecture Does not have a formal aggregation step Designed for much longer-running systems (months/years vs. minutes/hours)

Volunteer computing != Grid computing Resource owners anonymous, unaccountable identified, accountable yes – software stack requirements OK Managed systems? no – need plug & play software Clients behind firewall? yes – pull model no – push model ISP bill? yes no ... nor is it “peer-to-peer computing”

System Features Homogenous redundancy Work unit “trickling” Locality scheduling Distribution based on host parameters Recognition metrics to reward volunteers Open source

Architecture Central server runs LAMP (Linux, Apache, MySQL, PHP/Perl) architecture for web + database End-users run client application with modules for actual computation BitTorrent used to distribute data elements efficiently

Architecture

Job Life-Cycle

Replicated Computations

Client software Available as regular application, background “service”, or screensaver Can be administered locally or LAN- administered via RPC Can be configured to use only “low priority” cycles

Client/Task Interaction Client software runs on variety of operating systems, each with different IPC Uses shared memory message passing to transmit information from “manager” to actual tasks and vice versa

Background utility compatibility Background utilities disk defrag disk indexing virus scanning web pre-fetch disk backup Most run only when computer is idle volunteer computing ==> they never run Background manager intelligent decision about when to run various activities

Why Participate? Sense of accomplishment, community involvement, or scientific duty Stress testing machines/networks Potential for fame (if your computer “finds” an alien planet, you can name it!) “Bragging rights” for computing more units “BOINC Credits”

Credit & Cobblestones Work done is rewarded with “cobblestones” 100 cobblestones = 1 day of CPU time for a computer with performance equaling 1,000 double-precision floating-point MIPS (Whetstone) & 1,000 integer VAX MIPS (Dhrystone) Computers are benchmarked by the BOINC system and receive credit appropriate to their machine

Anti-Cheating Measures Work units are computed redundantly by several different machines, and results are compared by the central server for consistency Credit is awarded after the internal server validates the returned work units Work units must be returned before a deadline

The hard non-technical problems How to increase the number of volunteers? currently 1 in 1000 PC owners How to increase the number of projects? currently stuck at about 50 How to get volunteers to diversify?

How to attract and retain volunteers? Active hosts: Retention reminder emails frequent science updates Recruitment Viral “email a friend”, referral reward Organizational World Community Grid: “partner” program Media coverage need more discoveries Bundling

Why aren’t there more projects? Lack of PR among scientists IT antipathy Creating a BOINC project is expensive: Research Needed Science App development Experiment design Paper writing Software/IT Port/debug apps workflow tools server admin Communications Web site development message board admin public relations

Conclusions Versatile infrastructure SETI tasks take a few hours Climate simulation tasks take months Network monitoring tasks are not CPU-bound at all! Scales extremely well to internet-wide applications Provides another flexible middleware layer to base distributed applications on Volunteer computing comes with add’l considerations (rewards, cheating)