BOINC: A System for Public-Resource Computing and Storage David P. Anderson University of California, Berkeley.

Slides:



Advertisements
Similar presentations
Distributed Data Processing
Advertisements

Database System Concepts and Architecture
BOINC Berkeley Open Infrastructure for Network Computing An open-source middleware system for volunteer and grid computing (much of the images and text.
High Performance Computing Course Notes Grid Computing.
A P2P Distributed Computing Platform: (Wed) C.Y. ‘Connor’ Park KISTI Supercomputing Center.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Volunteer Computing.
Peer-to-Peer Networking By: Peter Diggs Ken Arrant.
Software Frameworks for Acquisition and Control European PhD – 2009 Horácio Fernandes.
BOINC The Year in Review David P. Anderson Space Sciences Laboratory U.C. Berkeley 22 Oct 2009.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
©Silberschatz, Korth and Sudarshan18.1Database System Concepts Centralized Systems Run on a single computer system and do not interact with other computer.
Cambodia-India Entrepreneurship Development Centre - : :.... :-:-
Distributed Computer Architecture Benjamin Jordan, Kevin Cone, Jason Bradley.
1 Introduction to Web Development. Web Basics The Web consists of computers on the Internet connected to each other in a specific way Used in all levels.
Scientific Computing on Smartphones David P. Anderson Space Sciences Lab University of California, Berkeley April 17, 2014.
Grid Toolkits Globus, Condor, BOINC, Xgrid Young Suk Moon.
STRATEGIES INVOLVED IN REMOTE COMPUTATION
Public-resource computing for CEPC Simulation Wenxiao Kan Computing Center/Institute of High Physics Energy Chinese Academic of Science CEPC2014 Scientific.
Chapter 16 The World Wide Web Chapter Goals Compare and contrast the Internet and the World Wide Web Describe general Web processing Describe several.
1 port BOSS on Wenjing Wu (IHEP-CC)
GrIDS -- A Graph Based Intrusion Detection System For Large Networks Paper by S. Staniford-Chen et. al.
A Guided Tour of BOINC David P. Anderson Space Sciences Lab University of California, Berkeley TACC November 8, 2013.
A Distributed Computing System Based on BOINC September - CHEP 2004 Pedro Andrade António Amorim Jaime Villate.
◦ What is an Operating System? What is an Operating System? ◦ Operating System Objectives Operating System Objectives ◦ Services Provided by the Operating.
Volunteer Computing with BOINC David P. Anderson Space Sciences Laboratory University of California, Berkeley.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Designing Middleware for Volunteer Computing.
Exa-Scale Volunteer Computing David P. Anderson Space Sciences Laboratory U.C. Berkeley.
Introduction to the BOINC software David P. Anderson Space Sciences Laboratory University of California, Berkeley.
Wenjing Wu Computer Center, Institute of High Energy Physics Chinese Academy of Sciences, Beijing BOINC workshop 2013.
07:44:46Service Oriented Cyberinfrastructure Lab, Introduction to BOINC By: Andrew J Younge
Lessons Learned from David P. Anderson Director, Spaces Sciences Laboratory U.C. Berkeley April 2, 2002.
BOINC.
Volunteer Computing with GPUs David P. Anderson Space Sciences Laboratory U.C. Berkeley.
and Citizen Cyber-Science David P. Anderson Space Sciences Laboratory U.C. Berkeley.
BOINC: Progress and Plans David P. Anderson Space Sciences Lab University of California, Berkeley BOINC:FAST August 2013.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Designing Middleware for Volunteer Computing.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Public and Grid Computing.
TEMPLATE DESIGN © BOINC: Middleware for Volunteer Computing David P. Anderson Space Sciences Laboratory University of.
Dr Jukka Klem CHEP06 1 Public Resource Computing at CERN – Philippe Defert, Markku Degerholm, Francois Grey, Jukka Klem, Juan Antonio.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Public Distributed Computing with BOINC.
BOINC: An Open Platform for Public-Resource Computing David P. Anderson Space Sciences Laboratory U.C. Berkeley.
3/12/2013Computer Engg, IIT(BHU)1 PARALLEL COMPUTERS- 1.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Public Distributed Computing with BOINC.
The Globus Toolkit The Globus project was started by Ian Foster and Carl Kesselman from Argonne National Labs and USC respectively. The Globus toolkit.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Introduction. Internet Worldwide collection of computers and computer networks that link people to businesses, governmental agencies, educational institutions,
System Software Laboratory Databases and the Grid by Paul Watson University of Newcastle Grid Computing: Making the Global Infrastructure a Reality June.
CernVM and Volunteer Computing Ivan D Reid Brunel University London Laurence Field CERN.
Volunteer Computing and BOINC Dr. David P. Anderson University of California, Berkeley Dec 3, 2010.
The Future of Volunteer Computing David P. Anderson U.C. Berkeley Space Sciences Lab UH CS Dept. March 22, 2007.
Emulating Volunteer Computing Scheduling Policies Dr. David P. Anderson University of California, Berkeley May 20, 2011.
Volunteer Computing: Involving the World in Science David P. Anderson U.C. Berkeley Space Sciences Lab February 16, 2007.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Supercomputing with Personal Computers.
Volunteer Computing and Large-Scale Simulation David P. Anderson U.C. Berkeley Space Sciences Lab February 3, 2007.
Technology for Citizen Cyberscience Dr. David P. Anderson University of California, Berkeley May 2011.
Volunteer Computing David P. Anderson U.C. Berkeley Space Sciences Lab Nov. 15, 2006.
Volunteer Computing with BOINC: a Tutorial David P. Anderson Space Sciences Laboratory University of California – Berkeley May 16, 2006.
Volunteer Computing David P. Anderson U.C. Berkeley Space Sciences Lab January 30, 2007.
Volunteer Computing and BOINC
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING
University of California, Berkeley
Volunteer computing PC owners donate idle cycles to science projects
Volunteer Computing: Planting the Flag David P
Volunteer Computing: SETI and Beyond David P
Volunteer Computing for Science Gateways
Designing a Runtime System for Volunteer Computing David P
Grid Computing.
University of Technology
The software infrastructure of II
Basic Grid Projects – Condor (Part I)
Presentation transcript:

BOINC: A System for Public-Resource Computing and Storage David P. Anderson University of California, Berkeley

Paper overview  Defines public-resource computing  Introduces the BOINC platform  Lists public-resource computing projects using the BOINC platform  Discusses BOINC implementation

Public-resource computing  AKA global computing or P2P computing  Combines the resources of personal computers and game consoles belonging to the general public to perform scientific computations  Started with  Great Internet Mersenne Prime Search (GIMPS) (1996)  Distributed.net (1997)

Contrast with grid computing  Grid computing involves "organizationally- owned resources"  Centrally managed by IT professionals  Powered on most of the time  Connected by high-speed links  Malicious behavior handled by organization  None of that is true for public-resource computing.

Specific requirements of public-computing systems  Redundant computing  Against result falsification/fabrication  Cheat-resistant accounting  Support for user-configurable application graphics

BOINC  Berkeley Open Infrastructure for Network Computing  Developed at UCB Space Science Laboratory by the group  started in 1999 and still runs today

Goals of BOINC (I)  Reduce the barriers of entry to public- resource computing:  A project can be run from a single computer running standard open-source software  Share resources among autonomous projects:  Each PC owner can join multiple projects  Results in better resource utilization

Goals of BOINC (II)  Support diverse applications:  Offer various data distribution mechanisms  Support various programming languages  …  Reward participants:  Mostly by giving them credits  System must be cheating-resistant  Also by offering nice graphics  Great screensavers!

Projects using BOINC (I)  search for intelligent extra-terrestrial life  protein behavior  protein folding, misfolding, aggregation and related diseases  Climateprediction.net: long term-climate prediction

Projects using BOINC (II)  long term-climate prediction  CERN projects: were to use in-house PCs  gravitational waves  UCB/Intel study of Internet resources

BOINC overview (I)  Projects:  Have a single master URL  Can involve one or more applications  They may change over time  Server complex of a BOINC project  Centered around a relational database containing most project data  Scheduling server daemons handles RPC from by clients  Data server daemons manage uploads

BOINC overview (II)  BOINC offers tools for  Creating, starting, stopping and querying projects  Adding new applications, new platforms, …  Creating workunits  Monitoring server performance  Conceived to be used by scientists, not IT professionals

BOINC overview (III)  Participants join by registering with web site of project and downloading the BOINC client  Client can run as  Screensaver (with fancy graphics)  Window service (running in the background)  Application (displaying results in tabular form)

Describing computations and data  Applications:  Can have different versions for different participant platforms  Consist of one or more files  Workunits:  Represent inputs to a computation  Include parameters specifying computational step requirements  Results (obvious)

More details  Files associated with  Application versions  Workunits  Results are immutable  Files have numerous BOINC-specific attributes

Client/scheduler interactions  When a client interacts with a scheduling server, it  Reports completed work  Gets a collection of workunits

Redundant computing (I)  BOING supports redundant computing  Projects can specify each workunit should be executed N times  If M  N executions agree on a particular result, it becomes canonical

Redundant computing (II)  Issues:  Must prevent cheaters to create quorums of fabricated results  Each user can provide at most one result for each workunit  Must distinguish between erroneous results and mere numerical variations  Homogeneous redundancy: Same workunit only sent to clients with identical platforms

Failure and backoff  Must prevent server overload after a failure  Everyone wants to reconnect  All client/server communication uses exponential backoff after a failure  Preents avalanches of requests

Participant preferences  User general preferences let users specify numerous parameters  Including transfer limits (for participants having monthly transfer limits) and CPU duty cycles (for participants having overclocked CPUs)

Credit and accounting  To reward participants

User community features  To motivate participant emulation

Platform diversity issues  Maintains multiple versions of application executables  Participants may also request to recompile applications before running them  Anonymous platform mechanism  Useful for  Non standard platforms  Paranoiac participants

Graphics and screensavers  Client software appears monolithic to BOINC participants but comprises four components:  Core client: does the work  Client GUI: provides a user-interface showing computation progress and letting participants quit and join projects  API: report CPU usage and fraction done, handle requests to provide graphics, …  Screensaver

Local scheduling  Decides which projects to run  Goals include  Maximizing resource usage  Meeting result deadlines;  Respecting resource share allocation among projects of each participant  Ensuring some ”variety” among projects  Participants want to see progress in all the projects they have joined

Conclusions  It works  More work still needs to be done

If you want to hear more  David Anderson speaks about and BOINC x6A x6A