The Replica Location Service In wide area computing systems, it is often desirable to create copies (replicas) of data objects. Replication can be used.

Slides:



Advertisements
Similar presentations
Giggle: A Framework for Constructing Scalable Replica Location Services Ann Chervenak, Ewa Deelman, Ian Foster, Leanne Guy, Wolfgang Hoschekk, Adriana.
Advertisements

Case Study 1: Data Replication for LIGO Scott Koranda Ann Chervenak.
RLS and DRS Roadmap Items Ann Chervenak Robert Schuler USC Information Sciences Institute.
Globus DataGrid Overview Bill Allcock, ANL GridPP Meeting 30 June 2003.
1 WP2: Data Management Paul Millar eScience All Hands Meeting September
EU DataGrid TestBed 2 Component Review Paul Millar (University of Glasgow) (slides based on a presentation by Erwin Laure)
WP2 and GridPP UK Simulation W. H. Bell University of Glasgow EDG – WP2.
Data Management Expert Panel - WP2. WP2 Overview.
Data Management Expert Panel. RLS Globus-EDG Replica Location Service u Joint Design in the form of the Giggle architecture u Reference Implementation.
Database Architectures and the Web
High Performance Computing Course Notes Grid Computing.
Overview of the technology that comprises Attendance Enterprise.
Technical Architectures
GGF Toronto Spitfire A Relational DB Service for the Grid Peter Z. Kunszt European DataGrid Data Management CERN Database Group.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 1: Introduction to Windows Server 2003.
Lesson 1: Configuring Network Load Balancing
Nikolay Tomitov Technical Trainer SoftAcad.bg.  What are Amazon Web services (AWS) ?  What’s cool when developing with AWS ?  Architecture of AWS 
1 Exploring Data Reliability Tradeoffs in Replicated Storage Systems NetSysLab The University of British Columbia Abdullah Gharaibeh Matei Ripeanu.
Chapter 2 Client Server Architecture
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment, Enhanced Chapter 1: Introduction to Windows Server 2003.
Distributed Data Stores – Facebook Presented by Ben Gooding University of Arkansas – April 21, 2015.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
Word Wide Cache Distributed Caching for the Distributed Enterprise.
The Data Replication Service Ann Chervenak Robert Schuler USC Information Sciences Institute.
Ali Kaplan Advisor: Prof. Geoffrey C. Fox 2/02/20091.
Technology Overview. Agenda What’s New and Better in Windows Server 2003? Why Upgrade to Windows Server 2003 ?  From Windows NT 4.0  From Windows 2000.
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
Globus Data Replication Services Ann Chervenak, Robert Schuler USC Information Sciences Institute.
A Metadata Catalog Service for Data Intensive Applications Presented by Chin-Yi Tsai.
DataGrid Middleware: Enabling Big Science on Big Data One of the most demanding and important challenges that we face as we attempt to construct the distributed.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
Secure, Collaborative, Web Service enabled and Bittorrent Inspired High-speed Scientific Data Transfer Framework.
Moving Large Amounts of Data Rob Schuler University of Southern California.
Database Systems: Design, Implementation, and Management Tenth Edition Chapter 12 Distributed Database Management Systems.
ILDG Middleware Status Chip Watson ILDG-6 Workshop May 12, 2005.
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
MobileMAN Internal meetingHelsinki, June 8 th 2004 NETikos activity in MobileMAN project Veronica Vanni NETikos S.p.A.
Intro – Part 2 Introduction to Database Management: Ch 1 & 2.
Part Four: The LSC DataGrid Part Four: LSC DataGrid A: Data Replication B: What is the LSC DataGrid? C: The LSCDataFind tool.
Globus Replica Management Bill Allcock, ANL PPDG Meeting at SLAC 20 Sep 2000.
CPT Demo May Build on SC03 Demo and extend it. Phase 1: Doing Root Analysis and add BOSS, Rendezvous, and Pool RLS catalog to analysis workflow.
The Replica Location Service The Globus Project™ And The DataGrid Project Copyright (c) 2002 University of Chicago and The University of Southern California.
Transaction-based Grid Data Replication Using OGSA-DAI Presented by Yin Chen February 2007.
Data Management and Transfer in High-Performance Computational Grid Environments B. Allcock, J. Bester, J. Bresnahan, A. L. Chervenak, I. Foster, C. Kesselman,
Replica Consistency in a Data Grid1 IX International Workshop on Advanced Computing and Analysis Techniques in Physics Research December 1-5, 2003 High.
Cole David Ronnie Julio. Introduction Globus is A community of users and developers who collaborate on the use and development of open source software,
Data Management GridPP and EDG Gavin McCance University of Glasgow May 9, 2002
Replica Management Kelly Clynes. Agenda Grid Computing Globus Toolkit What is Replica Management Replica Management in Globus Replica Management Catalog.
Rights Management in Globus Data Services Ann Chervenak, ISI/USC Bill Allcock, ANL/UC.
70-294: MCSE Guide to Microsoft Windows Server 2003 Active Directory, Enhanced Chapter 6: Active Directory Physical Design.
Status of tests in the LCG 3D database testbed Eva Dafonte Pérez LCG Database Deployment and Persistency Workshop.
1 Data Management for Internet Backplane Protocol by Tang Ming Assoc/Prof. Francis Lee School of Computer Engineering, Nanyang Technological University,
Seminar On Rain Technology
PERFORMANCE MANAGEMENT IMPROVING PERFORMANCE TECHNIQUES Network management system 1.
Fault – Tolerant Distributed Multimedia Streaming Web Application By Nirvan Sagar – Srishti Ganjoo – Syed Shahbaaz Safir
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
Cluster-Based Scalable
DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S
Objectives Differentiate between the different editions of Windows Server 2003 Explain Windows Server 2003 network models and server roles Identify concepts.
A Replica Location Service
Overview of SDN Controller Design
Key Terms Windows 2008 Network Infrastructure Confiuguration Lesson 6
Chapter 2 Database Environment Pearson Education © 2009.
Distributed P2P File System
The Globus Toolkit™: Information Services
Introduction to Databases Transparencies
Database Environment Transparencies
CLUSTER COMPUTING.
Grid Coordination by Using the Grid Coordination Protocol
Presentation transcript:

The Replica Location Service In wide area computing systems, it is often desirable to create copies (replicas) of data objects. Replication can be used to reduce access latency, improve data locality, and increase robustness, scalability and performance for distributed applications. The Replica Location Service (RLS) is a distributed registry that maintains information about the physical locations of copies and allows discovery of replicas. The RLS was designed and implemented in a collaboration between the Globus project and the DataGrid project. The distributed RLS is intended eventually to replace the centralized Globus replica catalog, providing higher performance, reliability and scalability. The RLS architecture includes five components: Consistent state maintained in Local Replica Catalogs (LRCs): Local catalogs maintain mappings between logical names for data items and target names (either physical or logical). Collective state with relaxed consistency maintained in Replica Location Indices (RLIs): An RLI aggregates state information contained in one or more LRCs. A variety of index structures may be created by varying the number of RLIs, the amount of redundancy and the parititioning of LRC updates among RLIs. Soft state maintenance of RLI state: LRCs send summaries of their state to RLIs using soft state update protocols. Information in RLIs times out and must be periodically refreshed. Optional compression of soft state updates: An RLS may include compression to reduce network requirements and RLI storage overheads. The RLS implements optional Bloom filter compression. Membership service: A membership service keeps track of the LRCs and RLIs that make up the distributed registry and their soft state update patterns. The current implementation maintains a static configuration for RLS. The RLS implementation consists of a multi-threaded front-end server that implements GSI authentication and a back-end server consisting of a mySQL relational database. There are two client APIs for the RLS written in C and Java. Features of the implementation include two types of soft state updates, the ability to associate user-defined attributes with logical or target names, and the ability to partition soft state updates among RLI index nodes using pattern matching of logical names. For more information on RLS: management/replica-location-service/index.html

The SC2002 RLS Demo RLS Sponsors and Testbed Participants The RLS demonstration testbed for SC2002 consists of over 25 servers on three continents. The testbed contains local replica catalogs (LRCs) and replica location index nodes (RLIs) of two types: those that exchange soft state updates containing bloom filter summaries of LRC content and those that exchange lists of logical names registered in LRCs. As shown in the topology picture below, some portions of the RLS distributed registry are highly connected, while in other portions, LRCs send state summaries to relatively few RLIs. This shows the flexibility of the RLS design, which allows tradeoffs between the amount of soft state update information that is exchanged and the level of redundancy and load balancing provided by the distributed registry. The testbed configuration limits the number and size (by using bloom filters) of soft state updates that are exchanged among continents.