STAR COMPUTING: STAR Analysis Operations and Issues. Torre Wenaus, BNL. STAR PWG Videoconference, BNL, August 13, 1999.

Presentation transcript:

STAR COMPUTING
STAR Analysis Operations and Issues
Torre Wenaus, BNL
STAR PWG Videoconference, BNL, August 13, 1999

STAR COMPUTING, Torre Wenaus, BNL, PWG meeting 8/99

Basis of planning

- CAS processing with DST input is based on managed production at the PWG level using the Grand Challenge Architecture
- Later-stage processing on uDSTs (standardized at the PWG level) and 'nanoDSTs' (defined by individuals or small groups) occurs under the control of individual physicists and small groups
- Mix of LSF-based batch and interactive
    - and later PROOF-like analysis?
    - on both Linux and Sun, but with far greater emphasis on Linux
- For I/O-intensive processing, local Linux disks (14GB usable) and Suns available
    - Usage of local disks and availability of data to be managed through the file catalog

Production Database

Components of the (for want of a better term) production database:

- File catalogs
    - Simulation data catalog
        - populated with all simulation-derived data, from fzd to dst, in HPSS and on disk
    - Real data catalog
        - populated with all real raw data in HPSS; dst data and disk-resident data not yet in DB (disk raw data shown in web interface)
- Run log and online log
    - fully populated and interfaced to online run log entry
- Event tag databases
    - database of DAQ-level event tags exists. Populated by offline scanner; needs to be interfaced to buffer box
- Machine catalog (not yet existing)
- Disk volume catalog (not yet existing)

Needed to implement GC in STAR

GC client gcaClient interfaced to StIOMaker in root4star

- User provides query, or
- User provides list of run/events, and components needed
- StIOMaker 'file open' is submission of query/list to GC
- StIOMaker 'event read' in event loop is
    - delivery of 'this event is ready' from GC event iterator; files containing needed components are available on disk in locations returned by GC
    - perform file open/data load for the needed components of the event
    - proceed to the loop over makers as usual
- Will look to user like just another input type handled by StIOMaker

GC interface to STAR file catalog

- Catalog itself is in place
- Currently, event boundaries of files are the same for all components
    - Need to answer 'what file is each event component in for this event' in the general case
    - File catalog must include event ranges for files
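The requirement that the file catalog carry event ranges can be illustrated with a small sketch. This is not the STAR catalog schema or the gcaClient interface: FileEntry, FileCatalog, locate() and the file paths are hypothetical names chosen for illustration only. It shows how per-file event ranges let a lookup answer 'what file is each event component in for this event' even when different components have different file boundaries.

```cpp
#include <string>
#include <vector>

// Hypothetical catalog row (not the STAR schema): one file holding one
// component for a contiguous, inclusive range of events in one run.
struct FileEntry {
    int run;
    std::string component;   // e.g. "dst" or "hits"
    int firstEvent;          // inclusive
    int lastEvent;           // inclusive
    std::string path;
};

// Hypothetical in-memory catalog; a real one would be a database query.
class FileCatalog {
public:
    void add(const FileEntry& entry) { entries_.push_back(entry); }

    // Return the path of the file that holds `component` for (run, event),
    // or an empty string if no catalogued file covers that event.
    std::string locate(int run, int event, const std::string& component) const {
        for (const FileEntry& e : entries_) {
            if (e.run == run && e.component == component &&
                event >= e.firstEvent && event <= e.lastEvent) {
                return e.path;
            }
        }
        return std::string();
    }

private:
    std::vector<FileEntry> entries_;
};
```

With entries whose dst and hits files split a run at different event numbers, locate() returns a different file pairing for different events, which is the general case the slide refers to.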

Grand Challenge queries

Queries based on physics tag selections:

    SELECT (component1, component2, …) FROM dataset_name WHERE (predicate_conditions_on_properties)

Example:

    SELECT dst, hits FROM Run WHERE glb_trk_tot>0 & glb_trk_tot<10

- Event components: fzd, raw, dst-xdf, dst-root, hits, StrangeTag, FlowTag, StrangeMuDst, …
- Mapping from run/event/component to file via the database
- GC index assembles tags + component file locations for each event
- Tag-based query match yields the files requiring retrieval to serve up that event
- Event-list-based queries allow using the GCA for general-purpose coordinated HPSS retrieval

Event-list-based retrieval:

    SELECT dst, hits
    Run Event 1
    Run Event 24
    Run Event 3
    ...
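As a rough illustration of how a tag-based query match yields the set of files to retrieve, here is a minimal sketch. It is not the GC index or its query engine: TagRecord, ComponentKey and filesForQuery() are hypothetical names, the index is a plain in-memory map, and the WHERE clause is passed in as a C++ predicate rather than parsed from query text.

```cpp
#include <functional>
#include <map>
#include <set>
#include <string>
#include <tuple>
#include <vector>

// Hypothetical per-event tag record; only one tag field is modelled.
struct TagRecord {
    int run;
    int event;
    int glb_trk_tot;
};

// Hypothetical index key: (run, event, component) maps to a file path.
using ComponentKey = std::tuple<int, int, std::string>;

// For every event whose tags satisfy `predicate`, look up the file holding
// each requested component and return the distinct set of files to retrieve.
std::set<std::string> filesForQuery(
    const std::vector<TagRecord>& tags,
    const std::map<ComponentKey, std::string>& index,
    const std::vector<std::string>& components,
    const std::function<bool(const TagRecord&)>& predicate)
{
    std::set<std::string> files;
    for (const TagRecord& t : tags) {
        if (!predicate(t)) continue;              // event fails the WHERE clause
        for (const std::string& c : components) {
            auto it = index.find(ComponentKey(t.run, t.event, c));
            if (it != index.end()) files.insert(it->second);
        }
    }
    return files;
}
```

The example query would correspond to calling filesForQuery() with components {"dst", "hits"} and a predicate that tests glb_trk_tot > 0 && glb_trk_tot < 10. An event-list query is the same lookup with the predicate replaced by membership in the supplied run/event list.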

Physics tag generation

In doEvents.C based chain (or tacked onto end of bfc)

- include StXxxTagMakers in chain
- each tag maker provides a tag() method to return tag pointer
- gather tags for output in a following maker
- Spit them out to a Tag.root or XxxTag.root file(s)
- Catalog them along with other event components in DB

David Zimmerman is working on writing them out; then

- Add them to catalog available to users and use them in GC index building
- Add their generation/output to standard production
- Multi-file, directory-scanning modes of doEvents.C can be used to process sets of tag files, or they can be concatenated

    StFlowTagMaker* flowMaker = (StFlowTagMaker *) GetMaker("FlowTag");
    FlowTag_st* aTag = flowMaker->tag();
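The pattern of tag makers exposing a tag() method, with a following maker gathering the tags, can be sketched outside ROOT as follows. None of these classes are the STAR ones: Maker, Chain, FlowTag, FlowTagMaker and TagCollector are simplified stand-ins, and the tag contents are placeholders. The sketch only assumes what the slide states, namely that makers run in chain order, that a maker can be found by name, and that tag() returns a pointer to the current event's tag.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Stand-in for the maker base class; not the real StMaker interface.
class Maker {
public:
    explicit Maker(const std::string& name) : name_(name) {}
    virtual ~Maker() {}
    const std::string& name() const { return name_; }
    virtual void make(int eventNumber) = 0;   // called once per event
private:
    std::string name_;
};

// Runs makers in the order they were added and finds them by name.
class Chain {
public:
    void add(Maker* m) { makers_.push_back(m); }
    Maker* getMaker(const std::string& name) const {
        for (std::size_t i = 0; i < makers_.size(); ++i)
            if (makers_[i]->name() == name) return makers_[i];
        return nullptr;
    }
    void make(int eventNumber) {
        for (std::size_t i = 0; i < makers_.size(); ++i)
            makers_[i]->make(eventNumber);
    }
private:
    std::vector<Maker*> makers_;   // not owned by the chain
};

struct FlowTag { int event; int nTracks; };   // hypothetical tag contents

class FlowTagMaker : public Maker {
public:
    FlowTagMaker() : Maker("FlowTag"), tag_() {}
    void make(int eventNumber) override {
        tag_.event = eventNumber;
        tag_.nTracks = 10 * eventNumber;   // placeholder for a real computation
    }
    FlowTag* tag() { return &tag_; }       // pointer to the current event's tag
private:
    FlowTag tag_;
};

// Follows the tag makers in the chain and copies out their tags per event.
class TagCollector : public Maker {
public:
    explicit TagCollector(const Chain& chain)
        : Maker("TagCollector"), chain_(chain) {}
    void make(int) override {
        FlowTagMaker* flow =
            static_cast<FlowTagMaker*>(chain_.getMaker("FlowTag"));
        if (flow) collected_.push_back(*flow->tag());
    }
    const std::vector<FlowTag>& collected() const { return collected_; }
private:
    const Chain& chain_;
    std::vector<FlowTag> collected_;
};
```

The collector must be added to the chain after the tag makers so that each tag is filled before it is copied. In a real chain the collected tags would be written to a tag file rather than kept in memory.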