Grand Challenge in MDC2 D. Olson, LBNL 31 Jan 1999 STAR Collaboration Meeting



31 Jan 1999, GC-MDC2: D. Olson, STAR Mtg

People

Key developers
  – Henrik Nordberg (NERSC), query estimator
  – Alex Sim (NERSC), query monitor
  – Luis Bernardo (NERSC), cache manager
  – Jeff Porter (BNL-STAR), query object
  – Dave Malon (ANL), order-optimized iterator & gcaResources API
  – Dave Zimmerman (LBL-STAR), tagDB
  – Stephen Johnson (BNL/SB-PHENIX)
  – Jie Yang (UCLA, LBL), testing

Diagram labels: Query Interface; Storage Manager; Event Data (Objectivity); Sample User Code; GC system components; Expt. code & data

Outline

Review of GC in MDC1
MDC2 Features
Status of Software
Scenario of Usage
Post-MDC2 (ROOT-GC integration?)
Summary

Multiple simultaneous queries

Diagram labels: Analysis codes Query1, Query2, Query3, each with User Code and a GCA Interface (QO, ooi); Storage Manager; Event Data (Objectivity); GC system components; Expt. code & data

Legend

Axes: File ID, Time
Symbols:
  – Start of query
  – Start of pftp request
  – File staged to HPSS disk
  – File staged to local disk
  – Event ID's retrieved by iterator
  – File released by queries
  – File purged from disk
  – End of query
Symbol color identifies query.

3 queries

3 queries with some shared files, with a time delay between each query; then the same 3 queries are repeated simultaneously. The cache was large enough to hold all files, so the second time all queries run at processing speed rather than I/O speed.

Plot labels: q1, q2, q3; q1,2,3

Detailed description of test results on GC web site under Results.
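The caching effect in this test can be illustrated with a small sketch. It is not the GC cache manager: the `FileCache` class, the file names, and the assignment of files to queries are all hypothetical, chosen only to show that a cache large enough to hold every file turns the repeated queries into pure cache hits.

```python
class FileCache:
    """Toy disk cache that counts files staged from tape (misses) and hits."""

    def __init__(self, capacity):
        self.capacity = capacity   # maximum number of files held on disk
        self.files = []            # cached files, least recently used first
        self.staged = 0            # files that had to be staged (cache misses)
        self.hits = 0              # files that were already on disk

    def request(self, name):
        if name in self.files:
            self.hits += 1
            self.files.remove(name)      # will be re-appended as most recent
        else:
            self.staged += 1
            if len(self.files) >= self.capacity:
                self.files.pop(0)        # purge the least recently used file
        self.files.append(name)


# Hypothetical queries with some shared files.
queries = {
    "q1": ["f1", "f2", "f3"],
    "q2": ["f2", "f3", "f4"],
    "q3": ["f3", "f4", "f5"],
}

cache = FileCache(capacity=5)            # large enough to hold all 5 files

# First pass: the queries run one after another.
for files in queries.values():
    for name in files:
        cache.request(name)
first_pass_staged = cache.staged

# Second pass: the same queries again; nothing needs staging.
for files in queries.values():
    for name in files:
        cache.request(name)
second_pass_staged = cache.staged - first_pass_staged

print(first_pass_staged, second_pass_staged)   # prints "5 0"
```

With a smaller `capacity` the second pass would start staging files again, which is the I/O-bound regime the slide contrasts with.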

GC software features for MDC2

Multiple event components
  – introduce “file bundles” - a set of events and their corresponding files
Files in multiple directories (use Objy catalog)
Time estimate for query
parallel query execution (multiple iterators, same token)
persistent state storage manager (can restart) (QE at least)
query object can save event collection
Linux analysis codes (Objectivity permitting) (PHENIX?)
More logging
See more details at

Multiple named components

Diagram labels (components): RAW, Evnt Hdr, tracks, hits, Another component, YAC (yet another component)
Diagram labels (formats): DAQ fmt, Objy, ROOT, XDF

    SELECT (component_name, component_name, ...)
    FROM dataset_name
    WHERE (predicate_conditions_on_properties)

Example:

    SELECT tracks, hits
    FROM Run17
    WHERE glb_trk_tot>0 & glb_trk_tot<10 & n_vert_total<3

Also for event collections.
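To make the query form concrete, here is a minimal Python sketch that splits such a query into its parts and applies the WHERE cuts to per-event tag values. It is not the GC query object: the functions `parse_query` and `passes`, the restriction to `&`-joined numeric comparisons, and the sample tag values are assumptions made for illustration.

```python
import operator
import re

# Comparison operators supported by this sketch (an assumed subset).
OPS = {"<=": operator.le, ">=": operator.ge, "<": operator.lt, ">": operator.gt}


def parse_query(text):
    """Return (component names, dataset name, list of cuts) for a query string."""
    m = re.fullmatch(
        r"\s*SELECT\s+(.+?)\s+FROM\s+(\S+)\s+WHERE\s+(.+?)\s*", text, re.S
    )
    if m is None:
        raise ValueError("expected SELECT ... FROM ... WHERE ...")
    components = [c.strip() for c in m.group(1).strip("()").split(",")]
    cuts = []
    for term in m.group(3).split("&"):
        t = re.fullmatch(r"\s*(\w+)\s*(<=|>=|<|>)\s*([-+]?\d+(?:\.\d+)?)\s*", term)
        if t is None:
            raise ValueError(f"unsupported condition: {term!r}")
        cuts.append((t.group(1), OPS[t.group(2)], float(t.group(3))))
    return components, m.group(2), cuts


def passes(tag_values, cuts):
    """True if one event's tag values satisfy every cut."""
    return all(op(tag_values[name], value) for name, op, value in cuts)


query = (
    "SELECT tracks, hits FROM Run17 "
    "WHERE glb_trk_tot>0 & glb_trk_tot<10 & n_vert_total<3"
)
components, dataset, cuts = parse_query(query)

# Hypothetical tag values: event ID -> properties.
tags = {
    1: {"glb_trk_tot": 5, "n_vert_total": 1},
    2: {"glb_trk_tot": 12, "n_vert_total": 1},
    3: {"glb_trk_tot": 3, "n_vert_total": 4},
    4: {"glb_trk_tot": 9, "n_vert_total": 2},
}
selected = [evt for evt, values in tags.items() if passes(values, cuts)]
print(components, dataset, selected)   # ['tracks', 'hits'] Run17 [1, 4]
```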

File Bundles

Diagram labels: File 1, File 2, File 3, File 4, File 5, File 6; Evt#; Evt Component 1, Evt Component 2, EC 3; (Evts in query, evts not in query)

Bundle:
  {F-1,4,6; E-1,2,3,4}
  {F-2,4,6; E-5,6,7,8,9}
  {F-2,5,6; E-10}
  {F-3,5,6; E-12,14,16,17,18}

File bundles are delivered to the iterator in the analysis program.
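One way to read the bundle list is as a grouping of the selected events by the set of files that holds their components. The Python sketch below reproduces the four bundles from the slide under that reading. The function `make_bundles` is hypothetical, the assignment of files 1-3, 4-5, and 6 to components 1, 2, and 3 is inferred from the bundle list, and the placement of the unselected events 11, 13, and 15 is an assumption.

```python
from collections import defaultdict

# Event ID -> (file of component 1, file of component 2, file of component 3).
event_files = {}
for evts, files in [
    ((1, 2, 3, 4), (1, 4, 6)),
    ((5, 6, 7, 8, 9), (2, 4, 6)),
    ((10,), (2, 5, 6)),
    # Placement of events 11, 13, 15 (not in the query) is assumed.
    ((11, 12, 13, 14, 15, 16, 17, 18), (3, 5, 6)),
]:
    for evt in evts:
        event_files[evt] = files

# Events selected by the query, as listed on the slide.
selected = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 14, 16, 17, 18]


def make_bundles(event_files, selected):
    """Group selected events by the tuple of files holding their components."""
    groups = defaultdict(list)
    for evt in sorted(selected):
        groups[event_files[evt]].append(evt)
    return list(groups.items())


bundles = make_bundles(event_files, selected)
for files, evts in bundles:
    print(f"{{F-{','.join(map(str, files))}; E-{','.join(map(str, evts))}}}")
```

Each tuple in `bundles` is the unit an iterator would receive: all files in it must be on disk before any of its events can be processed.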

Real-time Logging/Monitoring

Example: Netlogger and Viewer from NERSC Data Intensive Computing Group (Tierney et al.)

Status of s/w for MDC2

Query Estimator - Ready
Query Monitor - Done, Testing
Cache Manager - Ready
Logging - bug fixing
Query Object - few mods required
Order Optimized Iterator - few mods required
ROOT-QO interface - proof of principle done, needs work
ROOT TagDB interface to GC Index - Done

What needs to be done

Interface with STAR event collections
Interface with StAnalysisChain
  – OID --> build transient event
Define content of STAR Tag Database in ROOT

Usage Scenario

Diagram labels: GC Index, with the steps
  – Read Tag, make Index
  – Query Index, make collection (loose cuts)
  – Analyze Tags, refine collection (tight cuts)
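The three steps of the scenario can be sketched in Python as follows. This is an illustration only: it assumes a binned bitmap index (the slides mention a bit-mapped index by name but give no details), and the functions `make_index`, `query_index`, and `refine`, the bin width, and the tag values are all hypothetical.

```python
# Hypothetical tag records: event ID -> tag values.
tags = {
    0: {"glb_trk_tot": 2},
    1: {"glb_trk_tot": 7},
    2: {"glb_trk_tot": 14},
    3: {"glb_trk_tot": 9},
    4: {"glb_trk_tot": 23},
}

BIN_WIDTH = 10  # assumed bin size of the index


def make_index(tags, attr):
    """Read tags, make index: one bitmap (a Python int) per value bin."""
    index = {}
    for evt, record in tags.items():
        b = record[attr] // BIN_WIDTH
        index[b] = index.get(b, 0) | (1 << evt)   # set the bit of this event
    return index


def query_index(index, lo, hi):
    """Loose cut: OR the bitmaps of every bin that overlaps [lo, hi)."""
    bits = 0
    for b in range(lo // BIN_WIDTH, (hi - 1) // BIN_WIDTH + 1):
        bits |= index.get(b, 0)
    return [evt for evt in range(bits.bit_length()) if (bits >> evt) & 1]


def refine(tags, collection, attr, lo, hi):
    """Tight cut: re-check the exact tag values of the candidate events."""
    return [evt for evt in collection if lo <= tags[evt][attr] < hi]


index = make_index(tags, "glb_trk_tot")
loose = query_index(index, 5, 15)                    # candidate collection
tight = refine(tags, loose, "glb_trk_tot", 5, 10)    # refined collection
print(loose, tight)   # [0, 1, 2, 3] [1, 3]
```

The index lookup touches only bitmaps and may return extra candidates (events 0 and 2 here); the tag pass then removes them without reading any event data.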

TreeViewer of Tag Database in ROOT

Dave Zimmerman, Jan 31.
We will look at interfacing this to Query Estimator, GC Index.

Post-MDC2

Work on scalability issues after MDC2
  – MDC2 does not stress scalability
Tighter integration with ROOT
  – no Objy?
Discussions w/ Rene Brun Jan , 1999 at LBL

Topics w.r.t ROOT-GC

PROOF
TTree
  – TBranch
TChain
TEventList
QO
OOI
Event ID’s
TagDB
Bit-mapped index

QO/OOI - TChain

Iterating over the OID list retrieved from the storage manager for a file bundle is analogous to adding a file to a TChain and iterating over a TEventList for the selected events in that ROOT file.

It would be possible to replace OID as event identifier with ROOT file + event sequence number in file in a non-Objectivity implementation.
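The analogy can be shown with a small Python sketch in which an event is identified by (file name, sequence number in file) instead of an OID. It does not use the ROOT API: the "chain" is a plain list of file names, the "event list" is a list of sequence numbers per file, and the function `iterate_selected` and the file names are hypothetical.

```python
from typing import Dict, Iterator, List, Tuple

# Event identifier in a non-Objectivity scheme (assumed form):
# (ROOT file name, event sequence number within that file).
EventId = Tuple[str, int]


def iterate_selected(
    chain: List[str], event_lists: Dict[str, List[int]]
) -> Iterator[EventId]:
    """Yield selected events file by file, in the order files were chained."""
    for filename in chain:                           # files added to the chain
        for seq in event_lists.get(filename, []):    # selected events per file
            yield (filename, seq)


# Hypothetical file names and selections.
chain = ["run17_a.root", "run17_b.root"]
event_lists = {"run17_a.root": [0, 3], "run17_b.root": [2]}

events = list(iterate_selected(chain, event_lists))
print(events)
# [('run17_a.root', 0), ('run17_a.root', 3), ('run17_b.root', 2)]
```

Processing all selected events of one file before moving to the next mirrors how a file bundle is consumed once its files are on disk.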

What’s To Do w.r.t. ROOT

ROOT
  – mods to Chain
  – mods to EventList
R.B. very interested in bit-mapped index
GC
  – try OOI w/ Objy OID and get ROOT component w/ filename+evt seq.#
  – Consider non-Objy implementation (replace OID’s)

Summary

Basic MDC2 features ready.
Need more work on the interface w/ STAR.
  – Any candidates? (LBL post-doc open)
Need to clarify Objy/ROOT roles soon after MDC2.
Plan on close integration w/ ROOT.