Download presentation
Presentation is loading. Please wait.
Published byAileen Wood Modified over 9 years ago
1
H IGH E NERGY A CCELERATOR R ESEARCH O RGANIZATION KEKKEK High Availability iRODS System (HAIRS) Yutaka Kawai, KEK Adil Hasan, ULiv December 2nd, 20091Interoperability of Digital Repositories @ London, UK -- Yutaka Kawai, KEK
2
Outline ▸Introduction ▸iRODS HA system with Director ▸Large File Transfer ▸Speed Performance ▸Future works (apply to RNS application) ▸Summary December 2nd, 2009Interoperability of Digital Repositories @ London, UK -- Yutaka Kawai, KEK2
3
Introduction ▸Replication enables high availability (HA) system for catalog service ▹Replicate by back-end, i.e. iRODS ▹Replicate by front-end; ▪i.e. AMGA (ARDA [1] Metadata Grid Application) ▫Metadata Catalogue of EGEE’s gLite 3.1 Middleware ▫Back-end : Oracle, PostgreSQL, MySQL, SQLite ▫http://amga.web.cern.ch/amga/ ▸The current iRODS HA is implemented by replicating ICAT DB with PgPool tool [2] ▹A problem when iRODS server fails ▹Solve the problem by using Director December 2nd, 2009Interoperability of Digital Repositories @ London, UK -- Yutaka Kawai, KEK3
4
The Current iRODS HA ▸ICAT DB replication by Pgpool December 2nd, 2009Interoperability of Digital Repositories @ London, UK -- Yutaka Kawai, KEK4 ICAT Pgpool iRODS ServerPostgreSQL iRODS Client Change the server info in.irodEnv A B
5
Problem of the current HA ▸Even if the iRODS server fails, clients still continue to access the same server without noticing the failure. December 2nd, 2009Interoperability of Digital Repositories @ London, UK -- Yutaka Kawai, KEK5 ICAT Pgpool iRODS Server PostgreSQL iRODS Client ? Need to change server info in.irodEnv A B
6
Solution by using Director ▸Place a Director between Client and Server ▹Monitor the iRODS server statuses ▹Load balance to the iRODS servers December 2nd, 2009Interoperability of Digital Repositories @ London, UK -- Yutaka Kawai, KEK6 ICAT Pgpool iRODS Server PostgreSQL iRODS Client Director A B
7
How to Implement Director? ▸UltraMonkey [3] ▹Linux based director ▹Low cost but not so high speed ▹Need some steps to setup ▸Hardware Director ▹High cost and high speed ▹Easy to setup (?) ▹Cisco, HP, etc. December 2nd, 2009Interoperability of Digital Repositories @ London, UK -- Yutaka Kawai, KEK7
8
UltraMonkey ▸UltraMonkey consists of 3 components ▹Linux Virtual Server (LVS) : Load balancing ▹ldirectord : Monitoring real servers ▹Linux-HA (LHA) : Monitoring directors ▸LVS and ldirectord are used here ▹LVS : Provide Virtual IP for load balance ▹ldirectord : Monitoring iRODS service ▹LHA : Future use for director redundancy December 2nd, 2009Interoperability of Digital Repositories @ London, UK -- Yutaka Kawai, KEK8
9
Virtual IP for load balance December 2nd, 2009Interoperability of Digital Repositories @ London, UK -- Yutaka Kawai, KEK9 iRODS Real Severs 192.168.2.0/24.102.101.240 Linux Director 192.168.1.0/24 VIP 192.168.1.200.240 iRODS Client.100 iRODS Client can specify only this VIP in.irodsEnv Gateway of Real Servers is Director
10
Monitoring iRODS service ▸ldirector monitors iRODS real servers ▹Polling server status via iRODS control port December 2nd, 2009Interoperability of Digital Repositories @ London, UK -- Yutaka Kawai, KEK10 RODS_VERSION 182 0 -4000 rods2.1 d 0 0 Director iRODS Server Port# 1247 req: any string ack: iRODS MsgHeader
11
Outline ▸Introduction ▸iRODS HA system with Director ▸Large File Transfer ▸Speed Performance ▸Future works (apply to RNS application) ▸Summary December 2nd, 2009Interoperability of Digital Repositories @ London, UK -- Yutaka Kawai, KEK11
12
Large File Transfer ▸iRODS uses parallel ports to transfer a large file. ▹Smaller than 32MB file is transferred through iRODS control port #1247. ▸iRODS catalog server directs a server to open parallel ports to transfer a large file ▹iRODS clients can directly connect with the server through the parallel ports. December 2nd, 2009Interoperability of Digital Repositories @ London, UK -- Yutaka Kawai, KEK12
13
Process of Large File Transfer ▸Steps to transfer a large file in iRODS December 2nd, 2009Interoperability of Digital Repositories @ London, UK -- Yutaka Kawai, KEK13 ICAT iRODS ClientiRODS Server w/o ICAT Physical Data Physical Data Start service for Parallel I/O (4) PostgreSQL (1) (3) iput a large file Find physical location to store (2) A C File Transfer via Parallel I/O
14
Large File Transfer w/ Director ▸Need to confirm whether Director interferes in transferring a large file or not ▸The physical storage should be located out of the local network of iRODS real servers ▹Director handles only iRODS catalog server IP ▹Director cannot manage all of the parallel ports December 2nd, 2009Interoperability of Digital Repositories @ London, UK -- Yutaka Kawai, KEK14
15
Process using Director ▸Works as same as normal case ▹Only one additional step between (1) and (2) December 2nd, 2009Interoperability of Digital Repositories @ London, UK -- Yutaka Kawai, KEK15 ICAT Pgpool iRODS Server PostgreSQL iRODS Client Director iRODS Server w/o ICAT Physical Data Physical Data (1) (3) (4) A B C (1)’ (2)
16
Outline ▸Introduction ▸iRODS HA system with Director ▸Large File Transfer ▸Speed Performance ▸Future works (apply to RNS application) ▸Summary December 2nd, 2009Interoperability of Digital Repositories @ London, UK -- Yutaka Kawai, KEK16
17
Speed Performance ▸Test Program ▹concurrent-test in iRODS package ▹iput, imeta, iget, imv ▹1000 entries ▹Servers are VMs (Xen) on same physical machine ▪Client is located on the different machine ▸No Director ▹552.2sec = 0.552 sec/entry ▸Use Director ▹618.4 sec = 0.618 sec/entry ▹About 10% slower December 2nd, 2009Interoperability of Digital Repositories @ London, UK -- Yutaka Kawai, KEK17 This result is reasonable to consider tradeoff between speed and availability
18
Speed Performance (cont’d) ▸Use Director and Load balance to 2 iRODS severs ▹697.8sec= 0.698 sec/entry ▹The concurrent-test is not suitable under such a Load balanced system. ▹Need a program using multi- clients/threading. December 2nd, 2009Interoperability of Digital Repositories @ London, UK -- Yutaka Kawai, KEK18
19
Outline ▸Introduction ▸iRODS HA system with Director ▸Large File Transfer ▸Speed Performance ▸Future works (apply to RNS application) ▸Summary December 2nd, 2009Interoperability of Digital Repositories @ London, UK -- Yutaka Kawai, KEK19
20
What is RNS ? ▸RNS : Resource Namespace Service ▹RNS offers a simple standard way of mapping names to endpoints within a grid or distributed network [4] ▹The latest version is available here; https://forge.gridforum.org/sf/go/doc8272 https://forge.gridforum.org/sf/go/doc8272 ▸Java based RNS application is being developed by Osaka University and Tsukuba University ▹This application is similar to iRODS ▹The other kind of RNS application is Grid Shell of Genesis II by The Virginia Center for Grid Research (VCGR) [5]. December 2nd, 2009Interoperability of Digital Repositories @ London, UK -- Yutaka Kawai, KEK20
21
Apply to RNS application?? ▸Derby can do replication? ▹http://wiki.apache.org/db-derby/ReplicationWriteup ▹No load-sharing in the above example December 2nd, 2009Interoperability of Digital Repositories @ London, UK -- Yutaka Kawai, KEK21 DB Replication RNS ServerDerby RNS Client Director ?
22
Issues in RNS application ▸Several issues to be solved ▹Derby is not enough to work replication as same as using PostgreSQL w/Pgpool ▹Need some developments to replace Derby by PostgreSQL ▹The catalog implementation in the current RNS application has specific IP addresses December 2nd, 2009Interoperability of Digital Repositories @ London, UK -- Yutaka Kawai, KEK22
23
Opinions in this study ▸Network limitation ▹Director works as NAT. Difficult to place iRODS catalog servers in different subnets. ▹But the problem depends on NAT technology. We hope some NAT vender can implement extensions. ▸Speed Performance ▹The “concurrent-test” consumes overhead. The result 10% slow is in one of the worst cases. We may see less than 10% in actual uses. ▸PostgreSQL only? ▹How about other DB services? They have the same tools as PgPool? ▹Back-end replication is enough? Front-end replication should be considered for iRODS? December 2nd, 2009Interoperability of Digital Repositories @ London, UK -- Yutaka Kawai, KEK23
24
Summary ▸iRODS HA system ▹The current approach using only PgPool ▹The new approach using Director ▹The new one can solve the current problem ▸Large File Transfer ▹iRODS large file transfer works well when using Director ▸Speed Performance ▹Director results in the speed performance of concurrent- test getting slower 10% ▸Future works ▹Apply this solution to other catalog services December 2nd, 2009Interoperability of Digital Repositories @ London, UK -- Yutaka Kawai, KEK24
25
References ▸[1] : ARDA is A Realization of Distributed Analysis for LHC, http://lcg.web.cern.ch/LCG/activities/arda/arda.html http://lcg.web.cern.ch/LCG/activities/arda/arda.html ▸[2] : iRODS High Avaliability, https://www.irods.org/index.php/iRODS_High_Avaliability https://www.irods.org/index.php/iRODS_High_Avaliability ▸[3] : Ultra Monkey project, http://www.ultramonkey.org/http://www.ultramonkey.org/ ▸[4] : citation from abstract of “Resource Namespace Service Specification”, https://forge.gridforum.org/sf/go/doc8272https://forge.gridforum.org/sf/go/doc8272 ▸[5] : http://www.cs.virginia.edu/~vcgr/wiki/index.php/Understa nding_Your_Genesis_II_Distribution#RNS_Namespace http://www.cs.virginia.edu/~vcgr/wiki/index.php/Understa nding_Your_Genesis_II_Distribution#RNS_Namespace December 2nd, 2009Interoperability of Digital Repositories @ London, UK -- Yutaka Kawai, KEK25
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.