Volunteer Computing Involving the World in Science David P. Anderson Space Sciences Lab U.C. Berkeley 13 December 2007.

Slides:



Advertisements
Similar presentations
BOINC: A System for Public-Resource Computing and Storage David P. Anderson University of California, Berkeley.
Advertisements

BOINC Berkeley Open Infrastructure for Network Computing An open-source middleware system for volunteer and grid computing (much of the images and text.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Volunteer Computing.
High-Performance Task Distribution for Volunteer Computing Rom Walton
BOINC The Year in Review David P. Anderson Space Sciences Laboratory U.C. Berkeley 22 Oct 2009.
Volunteer Computing David P. Anderson Space Sciences Lab U.C. Berkeley May 2, 2007.
Scientific Computing on Smartphones David P. Anderson Space Sciences Lab University of California, Berkeley April 17, 2014.
Volunteer Computing and Hubs David P. Anderson Space Sciences Lab University of California, Berkeley HUBbub September 26, 2013.
Public-resource computing for CEPC Simulation Wenxiao Kan Computing Center/Institute of High Physics Energy Chinese Academic of Science CEPC2014 Scientific.
Achievements and Opportunities in Volunteer Computing David P. Anderson Space Sciences Lab U.C. Berkeley 18 April 2008.
A Guided Tour of BOINC David P. Anderson Space Sciences Lab University of California, Berkeley TACC November 8, 2013.
Volunteer Computing David P. Anderson Space Sciences Lab U.C. Berkeley May 7, 2008.
Volunteer Computing with BOINC David P. Anderson Space Sciences Laboratory University of California, Berkeley.
Scientific Computing in the Consumer Digital Infrastructure David P. Anderson Space Sciences Lab University of California, Berkeley The Austin Forum November.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Designing Middleware for Volunteer Computing.
Exa-Scale Volunteer Computing David P. Anderson Space Sciences Laboratory U.C. Berkeley.
BOINC.
Volunteer Computing with BOINC Dr. David P. Anderson University of California, Berkeley SC10 Nov. 14, 2010.
David P. Anderson Space Sciences Lab U.C. Berkeley Exa-Scale Volunteer Computing.
Volunteer Computing with GPUs David P. Anderson Space Sciences Laboratory U.C. Berkeley.
and Citizen Cyber-Science David P. Anderson Space Sciences Laboratory U.C. Berkeley.
BOINC: Progress and Plans David P. Anderson Space Sciences Lab University of California, Berkeley BOINC:FAST August 2013.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Designing Middleware for Volunteer Computing.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Public and Grid Computing.
TEMPLATE DESIGN © BOINC: Middleware for Volunteer Computing David P. Anderson Space Sciences Laboratory University of.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Public Distributed Computing with BOINC.
BOINC: An Open Platform for Public-Resource Computing David P. Anderson Space Sciences Laboratory U.C. Berkeley.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Public Distributed Computing with BOINC.
A Tour of Citizen Cyber-Science David P. Anderson Space Sciences Laboratory U.C. Berkeley.
Exa-Scale Volunteer Computing David P. Anderson Space Sciences Laboratory U.C. Berkeley.
Volunteer Computing Involving the World in Science David P. Anderson Space Sciences Lab U.C. Berkeley 13 December 2007.
Volunteer Computing David P. Anderson Space Sciences Lab U.C. Berkeley 14 Sept 2007.
Volunteer Computing: SETI and Beyond David P. Anderson University of California, Berkeley 7 June 2007.
Volunteer Computing and BOINC Dr. David P. Anderson University of California, Berkeley Dec 3, 2010.
Frontiers of Volunteer Computing David Anderson Space Sciences Lab UC Berkeley 30 Dec
The Future of Volunteer Computing David P. Anderson U.C. Berkeley Space Sciences Lab UH CS Dept. March 22, 2007.
Volunteer Computing in the Next Decade David Anderson Space Sciences Lab University of California, Berkeley 4 May 2012.
Volunteer Computing in Biology David P. Anderson Space Sciences Lab U.C. Berkeley 10 Sept 2007.
Emulating Volunteer Computing Scheduling Policies Dr. David P. Anderson University of California, Berkeley May 20, 2011.
David P. Anderson Space Sciences Laboratory University of California – Berkeley A Million Years of Computing.
Volunteer Computing: Involving the World in Science David P. Anderson U.C. Berkeley Space Sciences Lab February 16, 2007.
Volunteer Computing: the Ultimate Cloud Dr. David P. Anderson University of California, Berkeley Oct 19, 2010.
A Brief History of (CPU) Time -or- Ten Years of Multitude David P. Anderson Spaces Sciences Lab University of California, Berkeley 2 Sept 2010.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Supercomputing with Personal Computers.
The Limits of Volunteer Computing Dr. David P. Anderson University of California, Berkeley March 20, 2011.
All the computers in the world (~1 billion) BOINC: high-level goal Computational science biology, medicine Earth sciences, physics, astronomy, math, A.I.,...
Volunteer Computing and Large-Scale Simulation David P. Anderson U.C. Berkeley Space Sciences Lab February 3, 2007.
Using volunteered resources for data-intensive computing and storage David Anderson Space Sciences Lab UC Berkeley 10 April 2012.
Technology for Citizen Cyberscience Dr. David P. Anderson University of California, Berkeley May 2011.
Volunteer Computing David P. Anderson U.C. Berkeley Space Sciences Lab Nov. 15, 2006.
Volunteer Computing with BOINC: a Tutorial David P. Anderson Space Sciences Laboratory University of California – Berkeley May 16, 2006.
Frontiers of Volunteer Computing David Anderson Space Sciences Lab UC Berkeley 28 Nov
Volunteer Computing David P. Anderson U.C. Berkeley Space Sciences Lab January 30, 2007.
An Overview of Volunteer Computing
A Brief History of BOINC
Volunteer Computing and BOINC
University of California, Berkeley
Building a Global Brain David P. Anderson U. C
Volunteer computing PC owners donate idle cycles to science projects
Volunteer Computing: Planting the Flag David P
Volunteer Computing: SETI and Beyond David P
Volunteer Computing for Science Gateways
Designing a Runtime System for Volunteer Computing David P
Exa-Scale Volunteer Computing
David P. Anderson Space Sciences Lab UC Berkeley LASER
The Global Status of Citizen Cyberscience
The software infrastructure of II
University of California, Berkeley
Ivan Reid (Brunel University London/CMS)
Exploring Multi-Core on
Presentation transcript:

Volunteer Computing Involving the World in Science David P. Anderson Space Sciences Lab U.C. Berkeley 13 December 2007

Outline Science needs more computing power What is volunteer computing? How BOINC works Projects using BOINC Future directions

Simulation of physical systems Biolog y Climate study Cosmology

Data analysis Physic s Astronom y

Genetic algorithms and other new computational paradigms

Parallel computing Suppose you need 100 years of computing  1 CPU : 100 years  1,000 CPUs: 36 days  1,000,000 CPUs: 1 hour Types of parallelism  CPUs on one chip (multi-core)  CPUs in one box (supercomputers)  CPUs in one room (cluster computing)  CPUs owned by allied organizations (Grid computing)  Any CPU, anywhere (volunteer computing)

Where’s the computing power? Goals of volunteer computing  give science access to maximal computer power  allocate resources based on merit, not money owned by individuals (~1 billion) owned by companies (~100M) owned by government (~50M)

A brief history of volunteer computing Projects Platforms distributed.net, GIMPS Popular Power Entropia United Devices, Parabon BOINC Climateprediction.net Einstein, IBM World Community Grid

The BOINC volunteer/project model Accounts PC Attachments Resource shares 40 % 60 % Volunteers Project s IBM WCG Climateprediction.ne t

The volunteer computing game Internet Projects Volunteers Do more science Involve public in science

Participation and computing power 500K active participants, 700K computers ~40 projects Computing power: about 2 PetaFLOPS  That’s about 10X an IBM Blue Gene L ($300M)

Cost per TeraFLOPS-year Cluster (6.8 TeraFLOPS)  power and A/C: $750K  network hardware: $175K  computing hardware (780 nodes): $1000K  storage (300 TB RAID-6): $250K  power: $140K/year  sysadmin: $150K/year  total: $124K/year Amazon EC2: $1.75M/year Average BOINC project: $2K/year

Volunteer computing ≠ Grid computing Resource owners Managed systems? Clients behind firewall? anonymous, unaccountable; need to check results no – need plug & play software yes – pull model yes – software stack requirements OK no – push model identified, accountable ISP bill? ye s nono... nor is it “peer-to-peer computing”

The BOINC project Location: UC Berkeley Space Sciences Lab Personnel  director: David Anderson  other employees: 1.5 programmers  lots of volunteers Funding  supported by NSF since 2002  current grant runs through Aug 2010

What the BOINC project does We develop software for volunteer computing We enable on-line communities What we don’t do: branding, hosting, authorizing, endorsing, controlling

BOINC software Distributed under LGPL license Server side  uses Linux, Apache, MySQL, PHP  Job distribution: C++, 20K lines  Web features: PHP, 30K lines Client side  uses WxWidgets, OpenGL  Client: C++, 30K lines  GUI: C++, 45K lines

BOINC server software High performance, scalability (10M jobs/day) Recovery from client errors and malfeasance MySQL DB (accounts, jobs, etc.) scheduler web site features file upload/ download executables, input files, output files assimilator DB purge file deleter transitioner validator work generator Clients and volunteers

Creating a BOINC project Set up server  On a Linux box (some work)  Use the BOINC VMware virtual server  Use the BOINC VM for Amazon EC2 (easy but $$)  Apply to IBM World Community Grid easy but restrictive Port application Develop web site Lots of testing and debugging Public relations and customer support

Volunteer’s view 1-click install All platforms Invisible, autonomic Highly configurable (optional)

BOINC client structure core client application BOINC library GUI screensaver local TCP schedulers, data servers Runtime system user preferences, control

Communication: “Pull” model client scheduler I can run Win32 and Win MB RAM 20GB free disk 2.5 GFLOPS CPU (description of current work) Here are three jobs. Job 1 has application files A,B,C, input files C,D,E and output file F...

The BOINC community Projects Volunteer programmers Alpha testers Online Skype-based help Translators (web, client) Documentation (Wiki) Teams

Some BOINC projects Climateprediction.net  Oxford University  Global climate modeling  LIGO scientific collaboration  gravitational wave detection  U.C. Berkeley  Radio search for E.T.I. and black hole evaporation Leiden Classical  Leiden University  Surface chemistry using classical dynamics

More projects  CERN  simulator of LHC, collisions  Univ. of Muenster  Quantum chemistry  Bielefeld Univ.  Sutdy nanoscale magnetism  Leiden Univ.  Number theory

Biomed-related BOINC projects  University of Washington  Rosetta: Protein folding, docking, and design Tanpaku  Tokyo Univ. of Science  Protein structure prediction using Brownian dynamics MalariaControl  The Swiss Tropical Institute  Epidemiological simulation

More projects  Scripps Institute  CHARMM, protein structure prediction SIMAP  Tech. Univ. of Munich  Protein similarity matrix  Technion  Genetic linkage analysis using Bayesian networks

More projects (IBM WCG) Dengue fever drug discovery  U. of Texas, U. of Chicago  Autodock Human Proteome Folding  New York University  Rosetta  Scripps Institute  Autodock

Future work How to get more volunteers?  media  bundling  social networks How to get more projects? How to use future hardware?  multicore CPUs  GPUs  video game consoles (e.g., PS3/Cell)  set-top boxes  mobile devices

Campus-level “meta-project” Applications  6 pilot apps: climate, fluid dynamics, nanotechnology, genetics, Volunteers  1,000 instructional PCs  5,000 faculty/staff  30,000 students  400,000 alumni  general public NSF proposal submitted

Citizen Cyber-Science Distributed thinking  Clickworkers, GalaxyZoo  protein-folding game New software initiatives: Bolt and Bossa

Conclusion Volunteer computing: a new paradigm  Distinct research problems, software requirements  Computing power More Cheaper Democratic allocation  Social impact Contact me about:  Using BOINC  Research based on BOINC