Public-resource computing for CEPC Simulation Wenxiao Kan Computing Center/Institute of High Physics Energy Chinese Academic of Science CEPC2014 Scientific program, September 12, 2014, Shanghai
Outline What is public-resource computing The BOINC midware A list public-resource computing projects using the BOINC platform Our work with BOINC CEPC Simulation implementation 2
Public-resource computing AKA global computing or P2P computing Combines the resources of personal computers and game consoles belonging to the general public to perform scientific computations Started with Great Internet Mersenne Prime Search (GIMPS) (1996) Distributed.net (1997) Contrast with grid computing: Grid computing involves "organizationally-owned resources" Centrally managed by IT professionals Powered on most of the time Connected by high-speed links Malicious behavior handled by organization Of course None of that is true for public-resource computing. 3
BOINC Berkeley Open Infrastructure for Network Computing Developed at UCB Space Science Laboratory by the group Reduce the barriers of entry to public-resource computing: A project can be run from a single computer running standard open-source software Share resources among autonomous projects: Each PC owner can join multiple projects Results in better resource utilization Support diverse applications: Offer various data distribution mechanisms Support various programming languages … Reward participants: Mostly by giving them credits System must be cheating-resistant Also by offering nice graphics Great screensavers! 4
Projects using BOINC search for intelligent extra-terrestrial life(hosts: , users: total cpu hours: ~ ) Search for gravitational signals emitted by pulsars(hosts: , users: total cpu hours:~ ) Climateprediction.net: long term-climate prediction(hosts: users: total cpu hours:~ ) World Community Grid: Runs multiple sub-projects that focus on humanitarian research including HIV/AIDS, cancer, dengue fever, malaria, etc(hosts: ,users:466627,total cpu hours:~ ) 5
Projects using BOINC Classic – SixTrack, 2003 hosts: & users: total cpu hours:~ b. Test4Theory virtualization technology c. - B-physics for LHCb experiment d. – atlas simulation 6
Our work with BOINC(I) First official Volunteer Project in mainland China Applications: TreeThreader, Lammps, etc address: 7
Our work with BOINC(II) ATLAS simulation A lot of CPUs needed for simulation Key point: virtualization Total jobs statistic in Aug.2014: ~24000 jobs completed. 8
9
Our work with BOINC(III) Desktop Grid: integrate Dirac and boinc, provide public resource for grid user. Test work completed and promotion in progress Application: BESIII simulation 10
Discusses CEPC Simulation implementation Feasibility: a. Simulation jobs : low I/O jobs b. virtualization technology: Heterogeneous Resources c. Huge potential resources: 1) Enthusiastic&strong volunteers from 2) Technologic collaboration with BOINC develop group &other project groups d. development experiences with Large physical experiments: & BESDIRAC project 11
Web protral/api Discusses CEPC Simulation implementation Work flow on boinc side boinc sever Image repository PC g PCs PCs PC Physicist User Physicist User Generater Running workuni t BOINC CLient Core Shared directory VM image output 12
Discusses CEPC Simulation implementation Basic architecture: ongoing integration Dirac and Boinc for CEPC simulation 13
Summary & Outlook Skilled technology for CEPC simulation experiment by using volunteer computing Completed some test simulation jobs on boinc(C/S) side and got good results, more test jobs are needed, and some more details should be discussed. Reduce the size of the image, current software all loaded in the image Database access not available for public, load database in the image. On going integration of DIRAC and boinc for CEPC simulation. Powerful resources are provided by volunteers from (host: cpu cores: ) &other boinc projects. 14
Thanks! Q & A Contacts: 15