David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL June 23, 2003 GAE workshop Caltech.

Slides:



Advertisements
Similar presentations
BARNALI CHAKRABARTY. What is an Operating System ?
Advertisements

GridRPC Sources / Credits: IRISA/IFSIC IRISA/INRIA Thierry Priol et. al papers.
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL March 25, 2003 CHEP 2003 Data Analysis Environment and Visualization.
The new The new MONARC Simulation Framework Iosif Legrand  California Institute of Technology.
Resource Management Reading: “A Resource Management Architecture for Metacomputing Systems”
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
Workload Management WP Status and next steps Massimo Sgaravatto INFN Padova.
Alexandre A. P. Suaide VI DOSAR workshop, São Paulo, 2005 STAR grid activities and São Paulo experience.
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL July 15, 2003 LCG Analysis RTAG CERN.
David Adams ATLAS ATLAS Distributed Analysis David Adams BNL March 18, 2004 ATLAS Software Workshop Grid session.
K. Harrison CERN, 20th April 2004 AJDL interface and LCG submission - Overview of AJDL - Using AJDL from Python - LCG submission.
03/27/2003CHEP20031 Remote Operation of a Monte Carlo Production Farm Using Globus Dirk Hufnagel, Teela Pulliam, Thomas Allmendinger, Klaus Honscheid (Ohio.
Grid Resource Allocation and Management (GRAM) Execution management Execution management –Deployment, scheduling and monitoring Community Scheduler Framework.
David Adams ATLAS AJDL: Analysis Job Description Language David Adams BNL December 15, 2003 PPDG Collaboration Meeting LBL.
1 st December 2003 JIM for CDF 1 JIM and SAMGrid for CDF Mòrag Burgon-Lyon University of Glasgow.
ATLAS DIAL: Distributed Interactive Analysis of Large Datasets David Adams – BNL September 16, 2005 DOSAR meeting.
David Adams ATLAS DIAL status David Adams BNL July 16, 2003 ATLAS GRID meeting CERN.
David Adams ATLAS ATLAS Distributed Analysis Plans David Adams BNL December 2, 2003 ATLAS software workshop CERN.
Grid Workload Management Massimo Sgaravatto INFN Padova.
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
Event Data History David Adams BNL Atlas Software Week December 2001.
Datasets on the GRID David Adams PPDG All Hands Meeting Catalogs and Datasets session June 11, 2003 BNL.
1 CS 502: Computing Methods for Digital Libraries Lecture 19 Interoperability Z39.50.
David Adams ATLAS ADA, ARDA and PPDG David Adams BNL June 28, 2004 PPDG Collaboration Meeting Williams Bay, Wisconsin.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
Bi-Hadoop: Extending Hadoop To Improve Support For Binary-Input Applications Xiao Yu and Bo Hong School of Electrical and Computer Engineering Georgia.
INFSO-RI Enabling Grids for E-sciencE ATLAS Distributed Analysis A. Zalite / PNPI.
David Adams ATLAS Architecture for ATLAS Distributed Analysis David Adams BNL March 25, 2004 ATLAS Distributed Analysis Meeting.
David Adams ATLAS DIAL status David Adams BNL November 21, 2002 ATLAS software meeting GRID session.
June 24-25, 2008 Regional Grid Training, University of Belgrade, Serbia Introduction to gLite gLite Basic Services Antun Balaž SCL, Institute of Physics.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
Metadata Mòrag Burgon-Lyon University of Glasgow.
David Adams ATLAS DIAL/ADA JDL and catalogs David Adams BNL December 4, 2003 ATLAS software workshop Production session CERN.
David Adams ATLAS ADA: ATLAS Distributed Analysis David Adams BNL June 7, 2004 BNL Technology Meeting.
GDB Meeting - 10 June 2003 ATLAS Offline Software David R. Quarrie Lawrence Berkeley National Laboratory
David Adams ATLAS Virtual Data in ATLAS David Adams BNL May 5, 2002 US ATLAS core/grid software meeting.
David Adams ATLAS ATLAS Distributed Analysis David Adams BNL September 30, 2004 CHEP2004 Track 5: Distributed Computing Systems and Experiences.
D. Adams, D. Liko, K...Harrison, C. L. Tan ATLAS ATLAS Distributed Analysis: Current roadmap David Adams – DIAL/PPDG/BNL Dietrich Liko – ARDA/EGEE/CERN.
David Adams ATLAS DIAL: Distributed Interactive Analysis of Large datasets David Adams BNL August 5, 2002 BNL OMEGA talk.
AliEn AliEn at OSC The ALICE distributed computing environment by Bjørn S. Nilsen The Ohio State University.
Integration of the ATLAS Tag Database with Data Management and Analysis Components Caitriana Nicholson University of Glasgow 3 rd September 2007 CHEP,
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL November 17, 2003 SC2003 Phoenix.
David Adams ATLAS ATLAS distributed data management David Adams BNL February 22, 2005 Database working group ATLAS software workshop.
Korea Workshop May GAE CMS Analysis (Example) Michael Thomas (on behalf of the GAE group)
Tool Integration with Data and Computation Grid “Grid Wizard 2”
David Adams ATLAS ATLAS Distributed Analysis: Overview David Adams BNL December 8, 2004 Distributed Analysis working group ATLAS software workshop.
LSF Universus By Robert Stober Systems Engineer Platform Computing, Inc.
David Adams ATLAS ATLAS-ARDA strategy and priorities David Adams BNL October 21, 2004 ARDA Workshop.
ATLAS-specific functionality in Ganga - Requirements for distributed analysis - ATLAS considerations - DIAL submission from Ganga - Graphical interfaces.
- GMA Athena (24mar03 - CHEP La Jolla, CA) GMA Instrumentation of the Athena Framework using NetLogger Dan Gunter, Wim Lavrijsen,
David Adams ATLAS Datasets for the Grid and for ATLAS David Adams BNL September 24, 2003 ATLAS Software Workshop Database Session CERN.
Chapter – 8 Software Tools.
INFSO-RI Enabling Grids for E-sciencE Using of GANGA interface for Athena applications A. Zalite / PNPI.
1 A Scalable Distributed Data Management System for ATLAS David Cameron CERN CHEP 2006 Mumbai, India.
David Adams ATLAS ATLAS Distributed Analysis (ADA) David Adams BNL December 5, 2003 ATLAS software workshop CERN.
STAR Scheduler Gabriele Carcassi STAR Collaboration.
D.Spiga, L.Servoli, L.Faina INFN & University of Perugia CRAB WorkFlow : CRAB: CMS Remote Analysis Builder A CMS specific tool written in python and developed.
David Adams ATLAS ATLAS Distributed Analysis and proposal for ATLAS-LHCb system David Adams BNL March 22, 2004 ATLAS-LHCb-GANGA Meeting.
ATLAS Distributed Analysis DISTRIBUTED ANALYSIS JOBS WITH THE ATLAS PRODUCTION SYSTEM S. González D. Liko
Planning Session. ATLAS(-CMS) End-to-End Demo Kaushik De is the Demo Czar Need to put team together Atlfast production jobs –Atlfast may be unstable over.
David Adams ATLAS AJDL: Abstract Job Description Language David Adams BNL June 29, 2004 PPDG Collaboration Meeting Williams Bay.
David Adams ATLAS ADA: ATLAS Distributed Analysis David Adams BNL December 15, 2003 PPDG Collaboration Meeting LBL.
Simulation Production System Science Advisory Committee Meeting UW-Madison March 1 st -2 nd 2007 Juan Carlos Díaz Vélez.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
Joe Foster 1 Two questions about datasets: –How do you find datasets with the processes, cuts, conditions you need for your analysis? –How do.
ATLAS Distributed Computing Tutorial Tags: What, Why, When, Where and How? Mike Kenyon University of Glasgow.
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL May 19, 2003 BNL Technology Meeting.
David Adams ATLAS Hybrid Event Store Integration with Athena/StoreGate David Adams BNL March 5, 2002 ATLAS Software Week Event Data Model and Detector.
Wide Area Workload Management Work Package DATAGRID project
Presentation transcript:

David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL June 23, 2003 GAE workshop Caltech

David Adams ATLAS June 23, 2003 DIAL GAE workshop2 Contents Goals of DIAL What is DIAL? Design Application Task Result Scheduler Dataset properties Exchange format Status Development plans Interactivity issues

David Adams ATLAS June 23, 2003 DIAL GAE workshop3 Goals of DIAL 1. Demonstrate the feasibility of interactive analysis of large datasets How much data can we analyze interactively? 2. Set requirements for GRID services Datasets, schedulers, jobs, results, resource discovery, authentication, allocation, Provide ATLAS with a useful analysis tool For current and upcoming data challenges Real world deliverable Another experiment would show generality

David Adams ATLAS June 23, 2003 DIAL GAE workshop4 What is DIAL? Distributed Data and processing Interactive Iterative processing with prompt response – (seconds rather than hours) Analysis of Fill histograms, select events, … Large datasets Any event data (not just ntuples or tag)

David Adams ATLAS June 23, 2003 DIAL GAE workshop5 What is DIAL? (cont) DIAL provides a connection between Interactive analysis framework –Fitting, presentation graphics, … –E.g. ROOT and Data processing application –Natural to the data of interest –E.g. athena for ATLAS DIAL distributes processing Among sites, farms, nodes To provide user with desired response time Look to other projects to provide most infrastructure

David Adams ATLAS June 23, 2003 DIAL GAE workshop6

David Adams ATLAS June 23, 2003 DIAL GAE workshop7 Design DIAL has the following major components Dataset describing the data of interest Application defined by experiment/site Task is user extension to the application Result (generated by applying application and task to an input dataset) Scheduler directs the processing

David Adams ATLAS June 23, 2003 DIAL GAE workshop8 User Analysis Job 1 Job 2 ApplicationTask Dataset 1 Scheduler 1. Create or locate 2. select3. Create or select 4. select 8. run(app,tsk,ds1) 5. submit(app,tsk,ds) 8. run(app,tsk,ds2) 6. split Dataset Dataset 2 7. create e.g. ROOT e.g. athena Result 9. fill 10. gather Result 9. fill ResultCode

David Adams ATLAS June 23, 2003 DIAL GAE workshop9 Application User specifies application with Name and version –E.g. dial_cbnt 0.3 Scheduler Maps this specification to an executable E.g. ChildScheduler uses mapping files at each site or node

David Adams ATLAS June 23, 2003 DIAL GAE workshop10 Task Task allows users to extend application Application Defines the task syntax Provides means to build and install task –E.g. compile and dynamic load Examples: Empty histograms plus code to fill them Sequence of algorithms

David Adams ATLAS June 23, 2003 DIAL GAE workshop11 Result Result is filled during processing Examples –Histogram –Event list –New dataset –Combination of the above Returned to the user Should be small Logical file identifiers rather than the data in those files

David Adams ATLAS June 23, 2003 DIAL GAE workshop12 Schedulers A DIAL scheduler provides means to Submit a job Terminate a job Monitor a job –Status –Events processed –Partial results Verify availability of an application Install and verify the presence of a task for a given application

David Adams ATLAS June 23, 2003 DIAL GAE workshop13 Schedulers (cont) Schedulers form a hierarchy Corresponding to that of compute nodes –Grid, site, farm, node Each scheduler splits job into sub-jobs and distributes these over lower-level schedulers Lowest level ChildScheduler starts processes to carry out the sub-jobs Scheduler concatenates results for its sub-jobs User may enter the hierarchy at any level Client-server communication

David Adams ATLAS June 23, 2003 DIAL GAE workshop14

David Adams ATLAS June 23, 2003 DIAL GAE workshop15 Dataset properties Datasets have the following properties 0. Identity 1. Content 2. Location (of the data) 3. Mapping (content to location) 4. Provenance (of the dataset) 5. History (of the production) 6. Labels (describing dataset)

David Adams ATLAS June 23, 2003 DIAL GAE workshop16 Dataset properties (cont) All the above are “metadata” Dataset also has “data” Details: 0. Identity –Index or name to distinguish datasets 1. Content –Tells what kind of data is carried by the dataset –E.g. for HEP event data >List of event identifiers >Content for each event (raw, tracks, jets, …)

David Adams ATLAS June 23, 2003 DIAL GAE workshop17 Dataset properties (cont) 2. Location –Most interesting example is a collection of logical files –Could also be >physical files >DB >data store such as POOL 3. Mapping –For each piece of content, where (in the location) is the associated data –For collection of logical files, which file and where in the file

David Adams ATLAS June 23, 2003 DIAL GAE workshop18 Dataset properties (cont) 4. Provenance –Parent datasets –Transformation >Applied to parent dataset to obtain this dataset –Sufficient to construct dataset (virtual data) 5. History –Production history beyond provenance –Division into jobs, which compute nodes, time stamps, …

David Adams ATLAS June 23, 2003 DIAL GAE workshop19 Dataset properties (cont) 6. Labels –Additional data characterizing the dataset –For example >Motivation for dataset E.g. starting point for Higgs searches >Results of dataset evaluation E.g. approved for basis of publications >Other global properties E.g. integrated luminosity for an event dataset

David Adams ATLAS June 23, 2003 DIAL GAE workshop20 Dataset operations Operations that datasets should support Content selection Splitting Merging Content selection User can create a new dataset by selecting content from an existing dataset For an event dataset, there are two categories –Event selection >Apply an algorithm which flags which events to accept –Event content selection >e.g. keep tracks, drop jets

David Adams ATLAS June 23, 2003 DIAL GAE workshop21 Dataset operations (cont) Splitting Distributed processing requires that the input dataset be split into sub-datasets for processing Natural to split along file boundaries For event datasets, the split is along event boundaries Merging Combine multiple datasets to form a new dataset For event datasets, there are again two dimensions –Same event content and data for different events >E.g. data from two different run periods –Same events and different content >E.g. raw data and the corresponding reconstructed data >Or reconstructed data and relevant conditions data

David Adams ATLAS June 23, 2003 DIAL GAE workshop22 Dataset operations (cont) Event dataset operations

David Adams ATLAS June 23, 2003 DIAL GAE workshop23 Dataset implementation The dataset properties span many realms Different properties will be stored in different ways Properties used for selection naturally reside in relational DB tables –Content, provenance, metadata Properties required for end applications on worker nodes might expect properties to reside in files –Content, location mapping Some property data will likely be replicated –both RDB and files Identification of the different properties is a first step in implementing datasets

David Adams ATLAS June 23, 2003 DIAL GAE workshop24 Dataset implementation (cont) Interface for schedulers Central problem is how to split dataset Account for –Data location –Compute cycle locations –Matching these Interactive analysis (fast response) is a special challenge How can different dataset providers share a scheduler? Options: –Common dataset interface which provides the required information >Logical file mapping or >Composite structure (sub-datasets) –Each provider also provides a service to do the splitting

David Adams ATLAS June 23, 2003 DIAL GAE workshop25 Exchange format DIAL components are exchanged Between –User and scheduler –Scheduler and scheduler –Scheduler and application executable Components have an XML representation Exchange mechanism can be –C++ objects –SOAP –Files Mechanism defined by scheduler

David Adams ATLAS June 23, 2003 DIAL GAE workshop26 Status All DIAL components in place Release 0.30 made last week –Includes demo to demonstrate look and feel But scheduler is very simple –Only local ChildScheduler is implemented –Grid, site, farm and client-server schedulers not yet implemented More details in CHEP paper at

David Adams ATLAS June 23, 2003 DIAL GAE workshop27 Status (cont) Dataset implemented as a separate system Implementations: –ATLAS combined ntuple hbook file >CbntDataset >in release 0.3 –ATLAS AthenaRoot file >Holds Monte Carlo generator information >In 0.2; not in 0.3 (easy to add) –Athena-Pool files >when they are available

David Adams ATLAS June 23, 2003 DIAL GAE workshop28 Status (cont) DIAL and dataset classes imported to ROOT ROOT can be used as interactive user interface –All DIAL and dataset classes and methods available at command prompt –DIAL and dataset libraries must be loaded Import done with ACLiC Need to add result for TH1 and any other classes of interest

David Adams ATLAS June 23, 2003 DIAL GAE workshop29 Status (cont) Applications Test program dialproc counts events Wrapper around PAW dial_cbnt –Processes CbntDataset –Task provides >HBOOK file with empty histograms >Fortran subroutine to fill histograms

David Adams ATLAS June 23, 2003 DIAL GAE workshop30 Status (cont) Interface for logical files was added recently Keep results small –Carry logical file instead of data in the file Includes abstract interface for a file catalog –Implemented: >Local directory >AFS –Planned: >Magda (under development) >RLS (add flavors as required)

David Adams ATLAS June 23, 2003 DIAL GAE workshop31 Development plans Farm Scheduler Distribute processing over a single farm –Condor, LSF, ssh, STAR, …? Provides useful tool for distributed processing Remote access to scheduler Job submission from anywhere Web service Add policy to scheduler interface Response time How to split dataset for distributed processing

David Adams ATLAS June 23, 2003 DIAL GAE workshop32 Development plans (cont) Grid schedulers Distribute data and processing over multiple sites Interact with dataset, file and replica catalogs Authentication, authorization, resource location and allocation, … ATLAS POOL dataset After ATLAS incorporates POOL ATLAS athena as application

David Adams ATLAS June 23, 2003 DIAL GAE workshop33 Interactivity issues Important aspect is latency Interactive system provides means for user to specify maximum acceptable response time All actions must take place within this time –Locate data and resources –Splitting and matchmaking –Job submission –Gathering of results Longer latency for first pass over a dataset –Record state for later passes –Still must be able to adjust to changing conditions

David Adams ATLAS June 23, 2003 DIAL GAE workshop34 Interactivity issues (cont) Interactive and batch must share resources Share implies more available resources for both Interactive use varies significantly –Time of day –Time to the next conference –Discovery of interesting events Interactive request must be able to preempt long- running batch jobs –But allocation determined by sites, experiments, …