Federated Hierarchical Filter Grids: STTR-funded project with Indiana, Caltech and Deep Web Technologies. A Grid infrastructure for Data Analysis.

Presentation transcript:

Federated Hierarchical Filter Grids
- STTR-funded project with Indiana, Caltech and Deep Web Technologies
- A Grid infrastructure for Data Analysis
- Integrates with the LHC Tiered Computing Model
- Directly supports general Scientific Analysis
- In the HEP case, the Gridlet is instantiated as a Rootlet

The FHFG Architecture
- Composed of Information Service Gridlets managed by general Grid system services, with a portlet-based portal user interface

[Figure: a Filter Grid. Sensor Services (SS), Filter Services (FS) and Other Services (OS), together with a Database, MetaData (MD) and a Portal, exchange SOAP Messages and connect to other Grids and services. Processing chain: Raw Data → Data → Information → Knowledge → Wisdom → Decisions.]

Filter Grids

Three Features:
- Information services present data through traditional interfaces
- Filters accept data between these interfaces, transform it and re-present it
- Streaming connections between all services:
  - High performance
  - Archiving
  - Security
  - Fault tolerance
  - Narada Brokering

Filter Grids are built from Information resources wrapped as Web Services and Basic Filters that either transform or aggregate Information. Information Services and Filters support identical Service Interfaces.

[Figure: service interfaces]
- IS = Information Service, wrapping an Information Resource. Interfaces: Request/Select, Status, MultiResolution Get.
- BFS = Basic Filter Service, wrapping a Filter Resource. Interfaces: Request/Select, Status, MultiResolution Get, MultiResolution Put, Issue Queries.
- Filters either transform or aggregate Information.
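The idea that Information Services and Filters expose identical interfaces, so that filters can be stacked freely, can be illustrated with a minimal Python sketch. This is not the project's API: the class names (`InformationService`, `ListResource`, `TransformFilter`, `AggregateFilter`), the method names `status` and `get`, and the interpretation of "multi-resolution" as keeping every n-th record are all assumptions made for illustration.

```python
from abc import ABC, abstractmethod
from typing import Callable, Dict, List

Record = Dict[str, float]
Selector = Callable[[Record], bool]


class InformationService(ABC):
    """Hypothetical common interface shared by resources and filters."""

    @abstractmethod
    def status(self) -> str:
        """Report the state of the service."""

    @abstractmethod
    def get(self, select: Selector, resolution: int = 1) -> List[Record]:
        """Return selected records; resolution=n keeps every n-th one (assumed meaning)."""


class ListResource(InformationService):
    """An information resource wrapped as a service (here: an in-memory list)."""

    def __init__(self, records: List[Record]) -> None:
        self._records = list(records)

    def status(self) -> str:
        return f"ready ({len(self._records)} records)"

    def get(self, select: Selector, resolution: int = 1) -> List[Record]:
        chosen = [r for r in self._records if select(r)]
        return chosen[::resolution]


class TransformFilter(InformationService):
    """A basic filter that transforms each record from one upstream service."""

    def __init__(self, upstream: InformationService,
                 fn: Callable[[Record], Record]) -> None:
        self._upstream = upstream
        self._fn = fn

    def status(self) -> str:
        return f"transform over [{self._upstream.status()}]"

    def get(self, select: Selector, resolution: int = 1) -> List[Record]:
        # Selection is delegated upstream, then the transform is applied.
        return [self._fn(r) for r in self._upstream.get(select, resolution)]


class AggregateFilter(InformationService):
    """A basic filter that aggregates the output of several upstream services."""

    def __init__(self, upstreams: List[InformationService]) -> None:
        self._upstreams = list(upstreams)

    def status(self) -> str:
        return "aggregate over " + ", ".join(u.status() for u in self._upstreams)

    def get(self, select: Selector, resolution: int = 1) -> List[Record]:
        merged: List[Record] = []
        for upstream in self._upstreams:
            merged.extend(upstream.get(select, resolution))
        return merged


# Usage: because every class implements the same interface, filters compose.
a = ListResource([{"e": 1}, {"e": 5}, {"e": 9}])
b = ListResource([{"e": 7}])
doubled = TransformFilter(a, lambda r: {"e": r["e"] * 2})
merged = AggregateFilter([doubled, b])
result = merged.get(lambda r: r["e"] > 2)
```

Because a filter is itself an `InformationService`, a client cannot tell whether it is talking to a raw resource or to a chain of filters, which is the property the slide attributes to Filter Grids.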

HEP Event Analysis using Filter Grids

Analysis tool of choice is Root. Typical analysis activity is:
- Loading many files containing event data
- Passing each event through a selection filter
- Subjecting each selected event to a set of algorithms
- Creating summary information in the form of histograms/tables/files

Analysis starts with small event samples and is then applied to much larger samples, which are frequently located remotely in the Grid.

Our HEP implementation is a Filter Grid consisting of Clarens-hosted “Rootlets”. Each Rootlet is a full instance of the Root application, but limited in scope:
- The user’s Root loads a Clarens plug-in
- The Clarens interface to the Dataset Location Service allows a list of remote datasets to be generated
- The client contacts each remote Grid node, connects to the Clarens server there, and instantiates a Rootlet
- The user’s analysis selection code is passed over the network to the Rootlet
- The list of event data files is passed to the Rootlet
- The Rootlet executes and terminates
- The output histograms/tables/files are then made available via the Clarens server, and fetched, aggregated and processed as required
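The workflow above (ship the selection code to each remote node, run it over the local files, then fetch and aggregate the histograms) can be mimicked in a few lines of plain Python. This sketch does not use Root or Clarens and does not reflect their APIs: `run_rootlet`, `analyse`, the node names and the event fields `mass` and `pt` are hypothetical stand-ins, and the "network" is just a function call.

```python
from collections import Counter
from typing import Callable, Dict, List

Event = Dict[str, float]
EventFile = List[Event]  # one "file" is modelled as a list of events


def run_rootlet(files: List[EventFile],
                select: Callable[[Event], bool],
                observable: Callable[[Event], float],
                bin_width: float) -> Counter:
    """Stand-in for one Rootlet: loop over local files, select, fill a histogram."""
    hist: Counter = Counter()
    for events in files:
        for event in events:
            if select(event):
                # Histogram bin index of the observable for this event.
                hist[int(observable(event) // bin_width)] += 1
    return hist


def analyse(nodes: Dict[str, List[EventFile]],
            select: Callable[[Event], bool],
            observable: Callable[[Event], float],
            bin_width: float) -> Counter:
    """Stand-in for the client: run one Rootlet per node and sum the histograms."""
    total: Counter = Counter()
    for node_files in nodes.values():
        total.update(run_rootlet(node_files, select, observable, bin_width))
    return total


# Hypothetical datasets held at two remote nodes.
nodes = {
    "tier2_a": [
        [{"mass": 120.5, "pt": 30.0}, {"mass": 91.0, "pt": 10.0}],
        [{"mass": 125.2, "pt": 45.0}],
    ],
    "tier2_b": [
        [{"mass": 124.9, "pt": 50.0}],
    ],
}

histogram = analyse(
    nodes,
    select=lambda ev: ev["pt"] > 20.0,   # the user's selection code
    observable=lambda ev: ev["mass"],    # the quantity being histogrammed
    bin_width=5.0,
)
```

Only the small per-node histograms travel back to the client, while the event files stay where they are; that is the reason for moving the analysis code to the data instead of the reverse.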

Rootlets: Root embedded in a Clarens server

1. Physicist at Tier3 using Root on GBytes of ntuples.
2. Loads Clarens Root plugin. Connects to Clarens at Tier2. Sends analysis code (.C/.h files).
3. Clarens creates Rootlet, passes it .C/.h files.
4. Rootlet runs analysis code on TBytes of ntuples, creating high-statistics output data.
5. Root at Tier3 receives and plots data.

[Figure: Tier3 (Root with Clarens plugin, GBytes of nTuples) connected over XML/RPC to Tier2 (Root embedded in a Clarens server, ~10 TBytes of nTuples); Analysis.C and Analysis.h are sent from Tier3 to Tier2. Numbered markers in the figure correspond to steps 1 to 5.]

Higgs diphoton Analysis using Rootlets