Volunteer Computing: Planting the Flag David P

Slides:



Advertisements
Similar presentations
BOINC: A System for Public-Resource Computing and Storage David P. Anderson University of California, Berkeley.
Advertisements

BOINC Berkeley Open Infrastructure for Network Computing An open-source middleware system for volunteer and grid computing (much of the images and text.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Volunteer Computing.
BOINC The Year in Review David P. Anderson Space Sciences Laboratory U.C. Berkeley 22 Oct 2009.
Volunteer Computing David P. Anderson Space Sciences Lab U.C. Berkeley May 2, 2007.
Scientific Computing on Smartphones David P. Anderson Space Sciences Lab University of California, Berkeley April 17, 2014.
Volunteer Computing and Hubs David P. Anderson Space Sciences Lab University of California, Berkeley HUBbub September 26, 2013.
Public-resource computing for CEPC Simulation Wenxiao Kan Computing Center/Institute of High Physics Energy Chinese Academic of Science CEPC2014 Scientific.
Achievements and Opportunities in Volunteer Computing David P. Anderson Space Sciences Lab U.C. Berkeley 18 April 2008.
Volunteer Computing with BOINC David P. Anderson Space Sciences Laboratory University of California, Berkeley.
Scientific Computing in the Consumer Digital Infrastructure David P. Anderson Space Sciences Lab University of California, Berkeley The Austin Forum November.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Designing Middleware for Volunteer Computing.
Exa-Scale Volunteer Computing David P. Anderson Space Sciences Laboratory U.C. Berkeley.
Lessons Learned from David P. Anderson Director, Spaces Sciences Laboratory U.C. Berkeley April 2, 2002.
BOINC.
David P. Anderson Space Sciences Lab U.C. Berkeley Exa-Scale Volunteer Computing.
Volunteer Computing with GPUs David P. Anderson Space Sciences Laboratory U.C. Berkeley.
and Citizen Cyber-Science David P. Anderson Space Sciences Laboratory U.C. Berkeley.
BOINC: Progress and Plans David P. Anderson Space Sciences Lab University of California, Berkeley BOINC:FAST August 2013.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Designing Middleware for Volunteer Computing.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Public and Grid Computing.
TEMPLATE DESIGN © BOINC: Middleware for Volunteer Computing David P. Anderson Space Sciences Laboratory University of.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Public Distributed Computing with BOINC.
BOINC: An Open Platform for Public-Resource Computing David P. Anderson Space Sciences Laboratory U.C. Berkeley.
Celebrating Diversity in Volunteer Computing David P. Anderson Space Sciences Lab U.C. Berkeley Sept. 1, 2008.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Public Distributed Computing with BOINC.
CernVM and Volunteer Computing Ivan D Reid Brunel University London Laurence Field CERN.
A Tour of Citizen Cyber-Science David P. Anderson Space Sciences Laboratory U.C. Berkeley.
Exa-Scale Volunteer Computing David P. Anderson Space Sciences Laboratory U.C. Berkeley.
Volunteer Computing Involving the World in Science David P. Anderson Space Sciences Lab U.C. Berkeley 13 December 2007.
Volunteer Computing David P. Anderson Space Sciences Lab U.C. Berkeley 14 Sept 2007.
Volunteer Computing: SETI and Beyond David P. Anderson University of California, Berkeley 7 June 2007.
Volunteer Computing and BOINC Dr. David P. Anderson University of California, Berkeley Dec 3, 2010.
Frontiers of Volunteer Computing David Anderson Space Sciences Lab UC Berkeley 30 Dec
The Future of Volunteer Computing David P. Anderson U.C. Berkeley Space Sciences Lab UH CS Dept. March 22, 2007.
Volunteer Computing in Biology David P. Anderson Space Sciences Lab U.C. Berkeley 10 Sept 2007.
Emulating Volunteer Computing Scheduling Policies Dr. David P. Anderson University of California, Berkeley May 20, 2011.
David P. Anderson Space Sciences Laboratory University of California – Berkeley A Million Years of Computing.
Volunteer Computing: Involving the World in Science David P. Anderson U.C. Berkeley Space Sciences Lab February 16, 2007.
Volunteer Computing: the Ultimate Cloud Dr. David P. Anderson University of California, Berkeley Oct 19, 2010.
A Brief History of (CPU) Time -or- Ten Years of Multitude David P. Anderson Spaces Sciences Lab University of California, Berkeley 2 Sept 2010.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Supercomputing with Personal Computers.
The Limits of Volunteer Computing Dr. David P. Anderson University of California, Berkeley March 20, 2011.
All the computers in the world (~1 billion) BOINC: high-level goal Computational science biology, medicine Earth sciences, physics, astronomy, math, A.I.,...
Volunteer Computing Involving the World in Science David P. Anderson Space Sciences Lab U.C. Berkeley 13 December 2007.
David P. Anderson UC Berkeley Gilles Fedak INRIA The Computational and Storage Potential of Volunteer Computing.
Volunteer Computing and Large-Scale Simulation David P. Anderson U.C. Berkeley Space Sciences Lab February 3, 2007.
Local Scheduling for Volunteer Computing David P. Anderson U.C. Berkeley Space Sciences Lab John McLeod VII Sybase March 30, 2007.
Using volunteered resources for data-intensive computing and storage David Anderson Space Sciences Lab UC Berkeley 10 April 2012.
Technology for Citizen Cyberscience Dr. David P. Anderson University of California, Berkeley May 2011.
Volunteer Computing David P. Anderson U.C. Berkeley Space Sciences Lab Nov. 15, 2006.
Volunteer Computing David P. Anderson U.C. Berkeley Space Sciences Lab January 30, 2007.
An Overview of Volunteer Computing
A Brief History of BOINC
Volunteer Computing and BOINC
The Future of Volunteer Computing
University of California, Berkeley
Building a Global Brain David P. Anderson U. C
Volunteer computing PC owners donate idle cycles to science projects
Volunteer Computing: SETI and Beyond David P
Volunteer Computing for Science Gateways
Designing a Runtime System for Volunteer Computing David P
Exa-Scale Volunteer Computing
Job Scheduling in a Grid Computing Environment
David P. Anderson Space Sciences Lab UC Berkeley LASER
The Global Status of Citizen Cyberscience
Grid Computing Colton Lewis.
University of California, Berkeley
Ivan Reid (Brunel University London/CMS)
Exploring Multi-Core on
Presentation transcript:

Volunteer Computing: Planting the Flag David P Volunteer Computing: Planting the Flag David P. Anderson Space Sciences Lab U.C. Berkeley March 30, 2007

Outline What is volunteer computing? History and current status Some research directions Non-technical problems

What is Volunteer computing? Volunteers Projects Internet Helps science Involves public in science

Early history GIMPS (1996) George Woltman, Scott Kurowski distributed.net (1997) Adam Beberg, Jim Lawson, ... SETI@home (1999) me, Eric Korpela Folding@Home (2000) Vijay Pande

Academic projects, 1996-99 Bayanihan L. Sarmenta, MIT fault-tolerance in presence of hackers “volunteer computing” Popcorn Noam Nisan, Hebrew Univ. Market-based approach Superweb Charlotte Jet The problem: Java

Business attempts Entropia (1998) Popular Power United Devices (2000) Parabon Data Synapse The problem: no customers

BOINC (2002) Middleware for volunteer computing Open-source (LGPL) Application-driven Goals lots of independent projects support for diverse applications client participation in multiple projects

Volunteer computing != Grid computing Resource owners anonymous, unaccountable identified, accountable Managed systems? yes – software stack requirements OK no – need plug & play software Clients behind firewall? yes – pull model no – push model ISP bill? yes no ... nor is it “peer-to-peer computing”

Some BOINC-based projects Climateprediction.net Oxford; climate change study Einstein@home LIGO; gravitational wave astronomy Rosetta@home U. Washington; protein study SETI@home U.C. Berkeley; SETI LHC@home CERN; accelerator simulation Africa@home STI, U. of Geneva; malaria epidemiology IBM World Community Grid several biomedical applications ...and about 30 others

Computing power Folding@home: 650 TeraFLOPS 200 from PCs; 50 from GPUs; 400 from PS3! BOINC-based projects:

A sampling of research problems Data-intensive computing Low-latency computing Background utility compatibility Credit mechanism Efficient validation Game consoles and graphics chips Simulation

Data-intensive computing – client limits Q = network transfer per GFLOPS/hr SETI@home: Q = 0.1 MB but wider range is OK:

Server-side limits Internet $ $ Client Server

Using free networks Internet2 commodity Internet Server $ Server $ $ Client

Using more free networks Server $ commodity Internet Internet2 Server $ $ Server $ LAN Client Client Client

Low-latency computing job submission deadline 2 min 4 min time VC usually minimizes connection frequency What if you want to do 10,000 1-minute jobs in 6 minutes of wall time?

Background utility compatibility Background utilities disk defrag disk indexing virus scanning web pre-fetch disk backup Most run only when computer is idle volunteer computing ==> they never run A) ignore zero-priority CPU activity B) Background manager intelligent decision about when to run various activities

Credit mechanism Credit is important Currently based on FLOPS Should reflect, e.g. RAM size? Goal: maximize utility to projects Project-specific pricing schedules? Credit for other resources disk usage network transfers

Efficient validation Job output results claimed credit Replicated computing do job N times, require quorum of M Dealing with numerical discrepancies homogeneous replication eliminate discrepancies no replication How to validate provably and efficiently?

Game consoles and graphics chips NVIDIA, ATI, Cell 10X CPU and gaining? Folding@home: ATI version Sony PS3 version BOINC and Einstein@home on PS3 How to make this available to other projects?

Simulating volunteer computing Ad-hoc development of scheduling policies slow, noisy jeopardizes running projects Simulation-based R&D client simulator client scheduling policies Project simulator server scheduling policies Global simulator study data-intensive, low-latency, etc.

The hard non-technical problems How to increase the number of volunteers? currently 1 in 1000 PC owners How to increase the number of projects? currently stuck at about 50 How to get volunteers to diversify?

How to attract and retain volunteers? Active hosts: Retention reminder emails frequent science updates Recruitment Viral “email a friend”, referral reward Organizational World Community Grid: “partner” program Media coverage need more discoveries Bundling

Why aren’t there more projects? Lack of PR among scientists IT antipathy Creating a BOINC project is expensive: Research group Science App development Experiment design Paper writing Software/IT Port/debug apps workflow tools server admin Communications Web site development message board admin public relations

Meta-projects Virtual Campus Supercomputing Center Deployment and publicity: PC labs, staff/faculty desktops students alumni public Research groups Berkeley@home Existing UCB staff Science App development Experiment design Paper writing Software/IT Port/debug apps workflow tools server admin Communications Web site development message board admin public relations IBM World Community Grid

Encouraging change Cross-project credit system encourage competition in total credit, not per- project Account Managers Make it easier to discover/attach/detach projects GridRepublic, BAM! Science Stock Market? encourage participation in new high-potential projects Scientific Mutual Funds? e.g. American Cancer Society BOINC “portfolio”

Conclusion Volunteer computing: a new paradigm distinct research problems, software requirements big accomplishments, potential time to plant the flag! Social impacts Contact me about: Using BOINC Research based on BOINC Organizational use of BOINC davea@ssl.berkeley.edu