A Distributed Computing System Based on BOINC September - CHEP 2004 Pedro Andrade António Amorim Jaime Villate.

Slides:



Advertisements
Similar presentations
Chapter 20 Oracle Secure Backup.
Advertisements

Operating Systems Manage system resources –CPU scheduling –Process management –Memory management –Input/Output device management –Storage device management.
The GridBee Web Computing Framework Attila Szarvas BME IK 2012.
BOINC: A System for Public-Resource Computing and Storage David P. Anderson University of California, Berkeley.
Cloud Computing Brandon Hixon Jonathan Moore. Cloud Computing Brandon Hixon What is Cloud Computing? How does it work? Jonathan Moore What are the key.
Bookshelf.EXE - BX A dynamic version of Bookshelf –Automatic submission of algorithm implementations, data and benchmarks into database Distributed computing.
LUNARC, Lund UniversityLSCS 2002 Transparent access to finite element applications using grid and web technology J. Lindemann P.A. Wernberg and G. Sandberg.
Rheeve: A Plug-n-Play Peer- to-Peer Computing Platform Wang-kee Poon and Jiannong Cao Department of Computing, The Hong Kong Polytechnic University ICDCSW.
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
© 2009 Research In Motion Limited Methods of application development for mobile devices.
The new The new MONARC Simulation Framework Iosif Legrand  California Institute of Technology.
Workload Management Massimo Sgaravatto INFN Padova.
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
Ch 4. The Evolution of Analytic Scalability
Research on cloud computing application in the peer-to-peer based video-on-demand systems Speaker : 吳靖緯 MA0G rd International Workshop.
Public-resource computing for CEPC Simulation Wenxiao Kan Computing Center/Institute of High Physics Energy Chinese Academic of Science CEPC2014 Scientific.
P EER - TO -P EER N ETWORKS Michael Fine 1. W HAT ARE P EER -T O -P EER N ETWORKS ? Napster Social networking spawned from this concept. Emerged in the.
Ajmer Singh PGT(IP) Software Concepts. Ajmer Singh PGT(IP) Operating System It is a program which acts as an interface between a user and hardware.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
1 port BOSS on Wenjing Wu (IHEP-CC)
HTCondor and BOINC. › Berkeley Open Infrastructure for Network Computing › Grew out of began in 2002 › Middleware system for volunteer computing.
Workload Management WP Status and next steps Massimo Sgaravatto INFN Padova.
Dynamic Firewalls and Service Deployment Models for Grid Environments Gian Luca Volpato, Christian Grimm RRZN – Leibniz Universität Hannover Cracow Grid.
Robert Fourer, Jun Ma, Kipp Martin Copyright 2006 An Enterprise Computational System Built on the Optimization Services (OS) Framework and Standards Jun.
MapReduce: Hadoop Implementation. Outline MapReduce overview Applications of MapReduce Hadoop overview.
f ACT s  Data intensive applications with Petabytes of data  Web pages billion web pages x 20KB = 400+ terabytes  One computer can read
03/27/2003CHEP20031 Remote Operation of a Monte Carlo Production Farm Using Globus Dirk Hufnagel, Teela Pulliam, Thomas Allmendinger, Klaus Honscheid (Ohio.
EGrid Software Packages Overview. EGrid Introduction Egrid Introduction : A description of the main software packages EGrid Inside : A detailed description.
Integrating HPC into the ATLAS Distributed Computing environment Doug Benjamin Duke University.
COMP3019 Coursework: Introduction to GridSAM Steve Crouch School of Electronics and Computer Science.
Unit – I CLIENT / SERVER ARCHITECTURE. Unit Structure  Evolution of Client/Server Architecture  Client/Server Model  Characteristics of Client/Server.
Module 5 A system where in its parts perform a unified job of receiving inputs, processes the information and transforms the information into a new kind.
Wenjing Wu Andrej Filipčič David Cameron Eric Lancon Claire Adam Bourdarios & others.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
Copyright © cs-tutorial.com. Overview Introduction Architecture Implementation Evaluation.
Distributed Programming Framework Alexandre David B2-206.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
The EDGeS project receives Community research funding 1 Porting Applications to the EDGeS Infrastructure A comparison of the available methods, APIs, and.
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW  Describe the clustering capabilities of Microsoft Windows.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Public and Grid Computing.
Abstract A Structured Approach for Modular Design: A Plug and Play Middleware for Sensory Modules, Actuation Platforms, Task Descriptions and Implementations.
Timeshared Parallel Machines Need resource management Need resource management Shrink and expand individual jobs to available sets of processors Shrink.
Auger & XtremWeb: Monte Carlo computation on A Global Computing platform O. Lodygensky, G. Fedak, V. Neri, A.Cordier, F. Cappello Laboratoire de l’Accelerateur.
Basic Computer Terms Introduction to Computer Applications Mrs. Sorrell.
Tier3 monitoring. Initial issues. Danila Oleynik. Artem Petrosyan. JINR.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Public Distributed Computing with BOINC.
MICROSOFT TESTS /291/293 Fairfax County Adult Education Courses 1477/1478/1479.
BOINC: An Open Platform for Public-Resource Computing David P. Anderson Space Sciences Laboratory U.C. Berkeley.
Pavel Nevski DDM Workshop BNL, September 27, 2006 JOB DEFINITION as a part of Production.
PROGRESS: GEW'2003 Using Resources of Multiple Grids with the Grid Service Provider Michał Kosiedowski.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Public Distributed Computing with BOINC.
Distributed Physics Analysis Past, Present, and Future Kaushik De University of Texas at Arlington (ATLAS & D0 Collaborations) ICHEP’06, Moscow July 29,
Future of Distributed Production in US Facilities Kaushik De Univ. of Texas at Arlington US ATLAS Distributed Facility Workshop, Santa Cruz November 13,
10 March Andrey Grid Tools Working Prototype of Distributed Computing Infrastructure for Physics Analysis SUNY.
Mobile Analyzer A Distributed Computing Platform Juho Karppinen Helsinki Institute of Physics Technology Program May 23th, 2002 Mobile.
SHIWA Simulation Platform (SSP) Gabor Terstyanszky, University of Westminster EGI Community Forum Munnich March 2012 SHIWA is supported by the FP7.
COMP7330/7336 Advanced Parallel and Distributed Computing MapReduce - Introduction Dr. Xiao Qin Auburn University
SCI-BUS project Pre-kick-off meeting University of Westminster Centre for Parallel Computing Tamas Kiss, Stephen Winter, Gabor.
VIRTUAL NETWORK COMPUTING SUBMITTED BY:- Ankur Yadav Ashish Solanki Charu Swaroop Harsha Jain.
A Web Based Job Submission System for a Physics Computing Cluster David Jones IOP Particle Physics 2004 Birmingham 1.
Fermilab Scientific Computing Division Fermi National Accelerator Laboratory, Batavia, Illinois, USA. Off-the-Shelf Hardware and Software DAQ Performance.
Workload Management Workpackage
Chapter 1 – Introduction to Computers, the Internet, and the Web
New developments for deploying
MIK 2.1 DBNS - introduction to WS-PGRADE, 2013
The software infrastructure of II
AN ENVIRONMENT FOR REMOTE CONTROL
CIS16 Application Development – Programming with Visual Basic
Implementation of a small-scale desktop grid computing infrastructure in a commercial domain    
Overview of Computer system
Presentation transcript:

A Distributed Computing System Based on BOINC September - CHEP 2004 Pedro Andrade António Amorim Jaime Villate

October 30th, Overview Introduction BOINC ATLAS Use Case Tests and Results Conclusions

October 30th, Introduction Project participants: –Faculdade de Ciências da Universidade de Lisboa –Faculdade de Engenharia da Universidade do Porto From Grid-Brick system presented at CHEP2003 Goals: –Create a distributed computing system –Explore commodity CPU’s and disks and keep them together –Use public computing –Evaluate its use for dedicated HEP clusters.

October 30th, Overview Introduction BOINC –Description –Features –Behavior –Related Work ATLAS Use Case Tests and Results Conclusions

October 30th, Description Stands for Berkeley Open Infrastructure for Network Computing Generic software platform for distributed computing Developed by the team Based on public computing Key concepts –Project –Application –Workunit (Job) –Result

October 30th, Features Generic platform: supports many applications / projects Projects can be run simultaneously Common language applications can run as BOINC applications Fault-tolerance Monitored through a Web interface Implements security mechanisms

October 30th, Behavior Initial communication Work request –Hardware characteristics Server decides Workunit download –Application –Input files Results Upload Client makes requests, Server is passive

October 30th, Related Work Project-specific solutions: –Distributed.net Commercial solutions XtremWeb JXGrid

October 30th, Overview Introduction BOINC –Background –Additional Features –Behavior ATLAS Use Case Tests and Results Conclusions

October 30th, Background Grid-Brick project: Presented at CHEP2003 Goal was merge storage units with computing farms. Conclusions: –No central resource manager –Plug and play clients –Increase robustness –Fault-tolerant system

October 30th, Additional Features Avoid data movement User specific applications Environments –Scripts –Libraries Environments patches “get input” apps Job dependencies

October 30th, Behavior Initial communication Work request –Hardware characteristics –Available input files Server decides: –Input file exists: ok –No input file: wait, run "get input" app Workunit download: –Application –Environment / Patches Results Upload

October 30th, Overview Introduction BOINC ATLAS Use Case Tests and Results Conclusions

October 30th, ATLAS Use Case How can physicists use to run ATLAS jobs. The actors of this use case can be: –Physicist doing personal job submission –Real production Let us suppose we have: –Several ATLAS jobs to run –We know what files each job will produce and consume and how to generate or get these files. –We have computers connected to the Internet

October 30th, ATLAS Use Case Execution Steps: –Select or submit ATLAS application –Work submission: environment files (job options files, scripts, etc) environment patch input file template "get input" application result (output file) template As a result the user gets the aggregation of the produced output files as a unique output file.

October 30th, Overview Introduction BOINC ATLAS Use Case Tests and Results Conclusions

October 30th, Tests Based on the defined ATLAS Use Case Typical ATLAS jobs sequence using Muon events: –Generation: e events (1x) –Simulation: e/10 events (10x) –Digitization: e/10 events (10x) –Reconstruction: e/10 events (10x) Two groups of tests were defined: e = 100, e = For each group, 4 tests were made: –One simple client –Two BOINC client –Four BOINC client –Eight BOINC client

October 30th, Results - Execution Times Group A: 100 events Group B: 1000 events

October 30th, Results - Data Movement 1000 events in 8 machines: Seqx: events x00-x99

October 30th, Overview Introduction BOINC ATLAS Use Case Tests and Results Conclusions

October 30th, Conclusions Several BOINC projects are currently running successfully worldwide From tests: –Execution of user applications => more flexibility –Environments and patches => easier work submission –Heavier computation => better results –Low data movement => better results can be brought to physicists daily tasks with not much effort