Tiziana FerrariNetwork metrics usage for optimization of the Grid1 DataGrid Project Work Package 7 Written by Tiziana Ferrari Presented by Richard Hughes-Jones.

Slides:



Advertisements
Similar presentations
Ch. 12 Routing in Switched Networks
Advertisements

Tiziana FerrariNetwork Metrics Requirements and Usage for WP11 Tiziana Ferrari
Network Resource Broker for IPTV in Cloud Computing Lei Liang, Dan He University of Surrey, UK OGF 27, G2C Workshop 15 Oct 2009 Banff,
Current methods for negotiating firewalls for the Condor ® system Bruce Beckles (University of Cambridge Computing Service) Se-Chang Son (University of.
Data Management Expert Panel - WP2. WP2 Overview.
1 IK1500 Communication Systems IK1330 Lecture 3: Networking Anders Västberg
IoP HEPP 2004 Birmingham, 7/4/04 David Cameron, University of Glasgow 1 Simulation of Replica Optimisation Strategies for Data.
GridFlow: Workflow Management for Grid Computing Kavita Shinde.
A Grid Resource Broker Supporting Advance Reservations and Benchmark- Based Resource Selection Erik Elmroth and Johan Tordsson Reporter : S.Y.Chen.
GridPP meeting Feb 03 R. Hughes-Jones Manchester WP7 Networking Richard Hughes-Jones.
Towards an agent integrated speculative scheduling service L á szl ó Csaba L ő rincz, Attila Ulbert, Tam á s Kozsik, Zolt á n Horv á th ELTE, Department.
Dynamic routing – QoS routing Load sensitive routing QoS routing.
Passive traffic measurement Capturing actual Internet packets in order to measure: –Packet sizes –Traffic volumes –Application utilisation –Resource utilisation.
Lect3..ppt - 09/12/04 CIS 4100 Systems Performance and Evaluation Lecture 3 by Zornitza Genova Prodanoff.
Distributed Quality-of-Service Routing of Best Constrained Shortest Paths. Abdelhamid MELLOUK, Said HOCEINI, Farid BAGUENINE, Mustapha CHEURFA Computers.
Adaptive flow control via Interest Aggregation in CCN by Dojun Byun, Byoung-joon, Myeong-Wuk Jang Samsung Electronics, Advanced Institute of Technology.
Distributed Asynchronous Bellman-Ford Algorithm
ALICE data access WLCG data WG revival 4 October 2013.
Grid Data Management A network of computers forming prototype grids currently operate across Britain and the rest of the world, working on the data challenges.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE II - Network Service Level Agreement (SLA) Establishment EGEE’07 Mary Grammatikou.
Optimal Scheduling of File Transfers with Divisible Sizes on Multiple Disjoint Paths Mugurel Ionut Andreica Polytechnic University of Bucharest Computer.
A Dynamic Data Grid Replication Strategy to Minimize the Data Missed Ming Lei, Susan Vrbsky, Xiaoyan Hong University of Alabama.
CHEP'07 September D0 data reprocessing on OSG Authors Andrew Baranovski (Fermilab) for B. Abbot, M. Diesburg, G. Garzoglio, T. Kurca, P. Mhashilkar.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
Your university or experiment logo here Caitriana Nicholson University of Glasgow Dynamic Data Replication in LCG 2008.
November SC06 Tampa F.Fanzago CRAB a user-friendly tool for CMS distributed analysis Federica Fanzago INFN-PADOVA for CRAB team.
Congestion Control in CSMA-Based Networks with Inconsistent Channel State V. Gambiroza and E. Knightly Rice Networks Group
Author - Title- Date - n° 1 Partner Logo EU DataGrid, Work Package 5 The Storage Element.
Bi-Hadoop: Extending Hadoop To Improve Support For Binary-Input Applications Xiao Yu and Bo Hong School of Electrical and Computer Engineering Georgia.
BALANCED DATA LAYOUT IN HADOOP CPS 216 Kyungmin (Jason) Lee Ke (Jessie) Xu Weiping Zhang.
OPERETTA: An Optimal Energy Efficient Bandwidth Aggregation System Karim Habak†, Khaled A. Harras‡, and Moustafa Youssef† †Egypt-Japan University of Sc.
1 ACTIVE FAULT TOLERANT SYSTEM for OPEN DISTRIBUTED COMPUTING (Autonomic and Trusted Computing 2006) Giray Kömürcü.
An End-to-End Adaptation Protocol for Layered Video Multicast Using Optimal Rate Allocation Jiangchuan Liu, Member, IEEE, Bo Li, Senior Member, IEEE, and.
Caitriana Nicholson, CHEP 2006, Mumbai Caitriana Nicholson University of Glasgow Grid Data Management: Simulations of LCG 2008.
DataGRID Testbed Enlargement EDG Retreat Chavannes, august 2002 Fabio HERNANDEZ
Run-time Adaptive on-chip Communication Scheme 林孟諭 Dept. of Electrical Engineering National Cheng Kung University Tainan, Taiwan, R.O.C.
Supporting Load Balancing for Distributed Data-Intensive Applications Leonid Glimcher, Vignesh Ravi, and Gagan Agrawal Department of ComputerScience and.
Dzmitry Kliazovich University of Luxembourg, Luxembourg
EGEE is a project funded by the European Union under contract IST WS-Based Advance Reservation and Co-allocation Architecture Proposal T.Ferrari,
Static Process Scheduling
Efficient Resource Allocation for Wireless Multicast De-Nian Yang, Member, IEEE Ming-Syan Chen, Fellow, IEEE IEEE Transactions on Mobile Computing, April.
GridNets 2006 – October 1 st Grid Resource Management by means of Ant Colony Optimization Gustavo Sousa Pavani and Helio Waldman Optical Networking Laboratory.
INFSO-RI Enabling Grids for E-sciencE Grid Services for Resource Reservation and Allocation Tiziana Ferrari Istituto Nazionale di.
Data Consolidation: A Task Scheduling and Data Migration Technique for Grid Networks Author: P. Kokkinos, K. Christodoulopoulos, A. Kretsis, and E. Varvarigos.
CERN IT Department CH-1211 Genève 23 Switzerland t CERN IT Monitoring and Data Analytics Pedro Andrade (IT-GT) Openlab Workshop on Data Analytics.
Data Management The European DataGrid Project Team
Internet Measurement and Analysis Vinay Ribeiro Shriram Sarvotham Rolf Riedi Richard Baraniuk Rice University.
THE GLUE DOMAIN DEPLOYMENT The middleware layer supporting the domain-based INFN Grid network monitoring activity is powered by GlueDomains [2]. The GlueDomains.
CMS: T1 Disk/Tape separation Nicolò Magini, CERN IT/SDC Oliver Gutsche, FNAL November 11 th 2013.
D.Spiga, L.Servoli, L.Faina INFN & University of Perugia CRAB WorkFlow : CRAB: CMS Remote Analysis Builder A CMS specific tool written in python and developed.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Mario Reale – GARR NetJobs: Network Monitoring Using Grid Jobs.
Simulation of O2 offline processing – 02/2015 Faculty of Electrical Engineering, Mechanical Engineering and Naval Architecture Eugen Mudnić.
BDTS and Its Evaluation on IGTMD link C. Chen, S. Soudan, M. Pasin, B. Chen, D. Divakaran, P. Primet CC-IN2P3, LIP ENS-Lyon
VIEWS b.ppt-1 Managing Intelligent Decision Support Networks in Biosurveillance PHIN 2008, Session G1, August 27, 2008 Mohammad Hashemian, MS, Zaruhi.
PanDA Configurator and Network Aware Brokerage Fernando Barreiro Megino, Kaushik De, Tadashi Maeno 14 March 2015, US ATLAS Distributed Facilities Meeting,
TIFR, Mumbai, India, Feb 13-17, GridView - A Grid Monitoring and Visualization Tool Rajesh Kalmady, Digamber Sonvane, Kislay Bhatt, Phool Chand,
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
Delay-based Congestion Control for Multipath TCP Yu Cao, Mingwei Xu, Xiaoming Fu Tsinghua University University of Goettingen.
Storage discovery in AliEn
ScotGRID is the Scottish prototype Tier 2 Centre for LHCb and ATLAS computing resources. It uses a novel distributed architecture and cutting-edge technology,
Network Layer COMPUTER NETWORKS Networking Standards (Network LAYER)
WP 9.4 Use Case (ESA-ESRIN, IPSL, KNMI)
Network Services Use Cases GHPN Research Group
Constraint-Based Routing
David S. L. Wei Joint Work with Alex Chia-Chun Hsu and C.-C. Jay Kuo
Mohammad Malli Chadi Barakat, Walid Dabbous Alcatel meeting
ECE 544 Protocol Design Project 2016
Storage elements discovery
Supporting Fault-Tolerance in Streaming Grid Applications
ECE 544 Protocol Design Project: Description and Timeline
Presentation transcript:

Tiziana FerrariNetwork metrics usage for optimization of the Grid1 DataGrid Project Work Package 7 Written by Tiziana Ferrari Presented by Richard Hughes-Jones

Tiziana FerrariNetwork metrics usage for optimization of the Grid2 Why Grid Optimization? performance of Grid middleware and applications can be enhanced through the use of information about the status and behaviour of network connections Network-based Optimiser –the Grid building block that provids an estimate of the transmission quality between two or more Grid nodes –can be used by any Grid service and aplication –relies on raw network performance metrics regularly monitored by the network monitor system and available in the Grid information Service

Tiziana FerrariNetwork metrics usage for optimization of the Grid3 The network-based Grid optimization service

Tiziana FerrariNetwork metrics usage for optimization of the Grid4 Application scenarios of the Network-based Grid Optimizer (NGO) 1.Resource Brokerage : –given a set of candidate Computing Elements (CE) that are equivalently good for the execution of a given job, and a list of candidate Storage Elements (SE), where a given input file requested by the job is stored, the NGO can identify the most suitable CE and SE according to a combined set of network criteria: packet loss minimization Round Trip Time minimization throughput maximization

Tiziana FerrariNetwork metrics usage for optimization of the Grid5 Application scenarios of the Network-based Grid Optimizer (NGO) - cont 2.Data management: Estimation of the time needed to transfer a file of known size from a source SE to a destination SE Identification of the most appropriate SE from which a given file replica can be retrieved given a set of input files or file fragments located in different SEs, identification of an individual SE in the set where such files or file fragments can be all transferred to, in the most efficient way

Tiziana FerrariNetwork metrics usage for optimization of the Grid6 Application scenarios of the Network-based Grid Optimizer (NGO) - cont 3.Adaptive remote file access –If the file/files needed by a given application are only known at run time, CEs and SEs cannot be selected by the Resource Broker statically prior to execution. In this case the NGO allows the application to optimise the access to remote files at run time. The optimisation is based on the dynamic selection of the most appropriate Storage Element set that the application is using as the file access pattern changes.

Tiziana FerrariNetwork metrics usage for optimization of the Grid7 Network Cost Functions The Network- based Grid Optimizer is based on a set of Network Cost Functions: –The network cost function is a compound network performance metric that provides a high-level view of connectivity between two or more Grid nodes –The network cost function is based on raw performance metrics (RTT, packet loss probability, available bandwidth, link utilization etc) –The NGO applies different network cost functions according to the application scenario

Tiziana FerrariNetwork metrics usage for optimization of the Grid8 Cost functions: examples Example 1: File_size (bits) * Throughput (SourceSE, DestinationSE) Example 2: Closeness C i,j (pl i,j,r i,j,th i,j ) Given: r i,j : Round Trip Time between CE i and SE j R max : max s,t  r s,t  for all CE s and SE t in a given set pl i,j : packet loss probability on the network path connecting CE i and SE j ’ th i,j : throughput of data transfer between CE i and SE j TH max : max s,t  th s,t  –C i,j (pl i,j,r i,j,th i,j ) =1 if CE i is local to SE j –C i,j (pl i,j,r i,j,th i,j ) =0 if SE unreachable or if pl i,j > P where P is a configurable threshold –C i,j (pl i,j,r i,j,th i,j ) = (th i,j / TH max) *  ( r i,j / R rmax ) where 0 <  < 1 otherwise

Tiziana FerrariNetwork metrics usage for optimization of the Grid9 Closeness C i,j (pl i,j,r i,j,th i,j ) Fig 1: alpha = 1/2Fig 2: alpha = 1/10

Tiziana FerrariNetwork metrics usage for optimization of the Grid10 Implementation Two APIs currently implemented in the DataGrid project for the following two application cases: –Resource Brokerage: selection of the best CE and SE for each input file needed by a given job –File transfer between a source SE and a destination SE

Tiziana FerrariNetwork metrics usage for optimization of the Grid11 Conclusions and future work Use of high-level metrics for estimation of connectivity between two or more nodes has proved to be very important in the DataGrid project for the optimization of Grid middleware and applications based on network transactions Future work: –Study of new optimization scenarios –Analysis of new cost functions –analysis of Grid Information Service performance when used for the solution of complex optimization problems