Uni Innsbruck Informatik - 1 Grid InterNetworking and QoS for the Grid Michael Welzl DPS NSG Team

Slides:



Advertisements
Similar presentations
Network Resource Broker for IPTV in Cloud Computing Lei Liang, Dan He University of Surrey, UK OGF 27, G2C Workshop 15 Oct 2009 Banff,
Advertisements

EC-GIN Europe-China Grid InterNetworking. Enriched with customised network mechanisms Original Internet technology Overview of Project Traditional Internet.
Improving TCP Performance over Mobile Ad Hoc Networks by Exploiting Cross- Layer Information Awareness Xin Yu Department Of Computer Science New York University,
Congestion Control An Overview -Jyothi Guntaka. Congestion  What is congestion ?  The aggregate demand for network resources exceeds the available capacity.
Distributed components
Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 1 ECSE-6600: Internet Protocols Informal Quiz #11 Shivkumar Kalyanaraman: GOOGLE: “Shiv RPI”
Rice Networks Group Aleksandar Kuzmanovic & Edward W. Knightly TCP-LP: A Distributed Algorithm for Low Priority Data Transfer.
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
Resource Management – a Solution for Providing QoS over IP Tudor Dumitraş, Frances Jen-Fung Ning and Humayun Latif.
Computer Science Department 1 Load Balancing and Grid Computing David Finkel Computer Science Department Worcester Polytechnic Institute.
Internet Research Needs a Critical Perspective Towards Models –Sally Floyd –IMA Workshop, January 2004.
A Real-Time Video Multicast Architecture for Assured Forwarding Services Ashraf Matrawy, Ioannis Lambadaris IEEE TRANSACTIONS ON MULTIMEDIA, AUGUST 2005.
Grids and Grid Technologies for Wide-Area Distributed Computing Mark Baker, Rajkumar Buyya and Domenico Laforenza.
3/2/2001Hanoch Levy, CS, TAU1 What Quality of Service is About Hanoch Levy Feb 2003.
The Future of the Internet Jennifer Rexford ’91 Computer Science Department Princeton University
Client/Server Architecture
A Web Services Based Streaming Gateway for Heterogeneous A/V Collaboration Hasan Bulut Computer Science Department Indiana University.
The Structure of Networks with emphasis on information and social networks T-214-SINE Summer 2011 Chapter 8 Ýmir Vigfússon.
Introducing EC-GIN: Europe-China Grid InterNetworking Europe-China Grid InterNetworking European Sixth Framework STREP FP IST Michael Welzl,
Communications Recap Duncan Smeed. Introduction 1-2 Chapter 1: Introduction Our goal: get “feel” and terminology more depth, detail later in course.
Item 2005 L A Rønningen. Reservation Model Pessimistic or Optimistic Approach 1-N Senders and 1-M Receivers Sender-oriented or Receiver-oriented Immediate.
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED.
1.1 What is the Internet What is the Internet? The Internet is a shared media (coaxial cable, copper wire, fiber optics, and radio spectrum) communication.
A Lightweight Platform for Integration of Resource Limited Devices into Pervasive Grids Stavros Isaiadis and Vladimir Getov University of Westminster
Networks – Network Architecture Network architecture is specification of design principles (including data formats and procedures) for creating a network.
1 High-Level Carrier Requirements for Cross Layer Optimization Dave McDysan Verizon.
Digital Multimedia, 2nd edition Nigel Chapman & Jenny Chapman Chapter 17 This presentation © 2004, MacAvon Media Productions Multimedia and Networks.
Uni Innsbruck Informatik - 1 Grid InterNetworking Michael Welzl DPS NSG Team
Uni Innsbruck Informatik - 1 Network Support for Grid Computing (NSG) Michael Welzl DPS NSG Team
U Innsbruck Informatik - 1 CADPC/PTP in a nutshell Michael Welzl
Wolfgang EffelsbergUniversity of Mannheim1 Differentiated Services for the Internet Wolfgang Effelsberg University of Mannheim September 2001.
Beyond Best-Effort Service Advanced Multimedia University of Palestine University of Palestine Eng. Wisam Zaqoot Eng. Wisam Zaqoot November 2010 November.
TOMA: A Viable Solution for Large- Scale Multicast Service Support Li Lao, Jun-Hong Cui, and Mario Gerla UCLA and University of Connecticut Networking.
1 4/23/2007 Introduction to Grid computing Sunil Avutu Graduate Student Dept.of Computer Science.
Uni Innsbruck Informatik - 1 We Don‘t Need No Control Plane Michael Welzl, Kashif Munir Michael Welzl DPS NSG Team
Uni Innsbruck Informatik - 1 Open Issues and New Challenges for End-to-End Transport E2E RG Meeting - July 28/29, MIT, Cambridge MA Michael Welzl
CSC 600 Internetworking with TCP/IP Unit 7: IPv6 (ch. 33) Dr. Cheer-Sun Yang Spring 2001.
1 BRUSSELS - 14 July 2003 Full Security Support in a heterogeneous mobile GRID testbed for wireless extensions to the.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
A Utility-based Approach to Scheduling Multimedia Streams in P2P Systems Fang Chen Computer Science Dept. University of California, Riverside
Multimedia and Networks. Protocols (rules) Rules governing the exchange of data over networks Conceptually organized into stacked layers – Application-oriented.
Networks and Distributed Systems Mark Stanovich Operating Systems COP 4610.
CEOS Working Group on Information Systems and Services - 1 Data Services Task Team Discussions on GRID and GRIDftp Stuart Doescher, USGS WGISS-15 May 2003.
Uni Innsbruck Informatik - 1 Network Support for Grid Computing... a new research direction! Michael Welzl DPS NSG Team
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
Uni Innsbruck Informatik - 1 Network Support for Grid Computing Recent Research Work and Plans at the University of Innsbruck Michael Welzl
7. Grid Computing Systems and Resource Management
AMQP, Message Broker Babu Ram Dawadi. overview Why MOM architecture? Messaging broker like RabbitMQ in brief RabbitMQ AMQP – What is it ?
Reading TCP/IP Protocol. Training target: Read the following reading materials and use the reading skills mentioned in the passages above. You may also.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Networks and Distributed Systems Sarah Diesburg Operating Systems COP 4610.
TCP continued. Discussion – TCP Throughput TCP will most likely generate the saw tooth type of traffic. – A rough estimate is that the congestion window.
Challenges in the Next Generation Internet Xin Yuan Department of Computer Science Florida State University
P2P Search COP P2P Search Techniques Centralized P2P systems  e.g. Napster, Decentralized & unstructured P2P systems  e.g. Gnutella.
Chapter 6 outline r 6.1 Multimedia Networking Applications r 6.2 Streaming stored audio and video m RTSP r 6.3 Real-time, Interactive Multimedia: Internet.
Uni Innsbruck Informatik th IETF, PMTUD WG: Path MTU Discovery Using Options draft-welzl-pmtud-options-01.txt Michael Welzl
Tightly Coupled Congestion Control in WebRTC Michael Welzl Upperside WebRTC conference Paris
IHP Im Technologiepark Frankfurt (Oder) Germany IHP Im Technologiepark Frankfurt (Oder) Germany ©
Introducing EC-GIN: Europe-China Grid InterNetworking Sven Hessler Institute of Computer Science, NSG team University of Innsbruck, Austria
U Innsbruck Informatik - 1 Specification of a Network Adaptation Layer for the Grid GGF7 presentation Michael Welzl University.
Mobile IP THE 12 TH MEETING. Mobile IP  Incorporation of mobile users in the network.  Cellular system (e.g., GSM) started with mobility in mind. 
AMSA TO 4 Advanced Technology for Sensor Clouds 09 May 2012 Anabas Inc. Indiana University.
Accelerating Peer-to-Peer Networks for Video Streaming
PROTEAN: A Scalable Architecture for Active Networks
Muhammad Murtaza Yousaf, Michael Welzl
TCP-LP: A Distributed Algorithm for Low Priority Data Transfer
ExaO: Software Defined Data Distribution for Exascale Sciences
IT351: Mobile & Wireless Computing
EE 122: Lecture 22 (Overlay Networks)
Presentation transcript:

Uni Innsbruck Informatik - 1 Grid InterNetworking and QoS for the Grid Michael Welzl DPS NSG Team Institute of Computer Science University of Innsbruck Q2S Colloquium Q2S/NTNU, Trondheim 4 September, 2006

Uni Innsbruck Informatik - 2 Grid InterNetworking

Uni Innsbruck Informatik - 3 What is the Grid? Metaphor: power grid –just plug in, don‘t care where (processing) power comes from, don‘t care how it reaches you Common definition: The real and specific problem that underlies the Grid concept is coordinated resource sharing and problem solving in dynamic, multi institutional virtual organizations [Ian Foster, Carl Kesselman and Steven Tuecke, “The Anatomy of the Grid – Enabling Scalable Virtual Organizations”, International Journal on Supercomputer Applications, 2001] Common terms: Virtual Team - members of one or several Virtual Organizations who use a Grid Most of the time... –the real and specific goal is High Performance Computing –virtual organizations and virtual teams are well defined (as opposed to the usage scenario) –i.e. not an „open“ system, security is a big issue

Uni Innsbruck Informatik - 4 Size Scope Grid history: parallel processing at a growing scale –Parallel CPU architectures –Multiprocessor machines –Clusters –(“Massively Distributed“) computers on the Internet Traditional goal: processing power –Grid people = parallel people; thus, goal has not changed much Broader definition (“resource sharing“) -reasonable - e.g., computers also have harddisks :-) –New research areas / buzzwords: Wireless Grid, DataGrid, Pervasive Grid, [this space reserved for your favorite research area] Grid –sometimes perhaps a little too broad, e.g., “P2P Working Group“ is now part of the Global Grid Forum Reasonable to focus on this.

Uni Innsbruck Informatik - 5 Legacy codes Components Web Services Grid Workflows based on activities MPI HPF OMP HPF OMP MPI Java Legacy Codes Descriptor Generation Component Interaction Optimization, Adaptation Service Description Discovery, Selection Deployment, Invocation Dynamic Instantiation Service Orchestration Quality of Service Grid Workflow Applications Components are built, Web (Grid) Services are defined, Activities are specified Activities (which may communicate with each other) should automatically be distributed by a scheduler

Uni Innsbruck Informatik - 6 UIBK-DPS development: ASKALON A Grid Application Development and Computing Environment XML

Uni Innsbruck Informatik - 7 Grid requirements Efficiency + ease of use –Programmer should not worry (too much) about the Grid Underlying system has to deal with –Error management –Authentification, Authorization and Accounting (AAA) –Efficient Scheduling / Load Balancing –Resource finding and brokerage –Naming –Resource access and monitoring No problem: we do it all - in Middleware de facto standard: “Globus Toolkit“ –installation of GT3 in our high performance system: 1 1/2 hours or so... –yes, it truly does it all :) 1000s of addons - GridFTP, MDS, NWS, GRAM,.. –this is just the basis - e.g., ASKALON is layered on top of Globus

Uni Innsbruck Informatik - 8 Grid-network peculiarities Special behavior –Predictable traffic pattern - this is totally new to the Internet! –Web: users create traffic –FTP download: starts... ends –Streaming video: either CBR or depends on content! (head movement,..) Could be exploited by congestion control mechanisms –Distinction: Bulk data transfer (e.g. GridFTP) vs. control messages (e.g. SOAP) –File transfers are often “pushed“ and not “pulled“ –Distributed System which is active for a while overlay based network enhancements possible –Multicast –P2P paradigm: “do work for others for the sake of enhancing the whole system (in your own interest)“ can be applied - e.g. act as a PEP,... sophisticated network measurements possible –can exploit longevity and distributed infrastructure Special requirements –file transfer delay predictions note: useless without knowing about shared bottlenecks –QoS, but for file transfers only (“advance reservation“)

Uni Innsbruck Informatik - 9 Enriched with customised network mechanisms Original Internet technology Traditional Internet applications (web browser, ftp,..) Real-time multimedia applications (VoIP, video conference,..) Today‘s Grid applications Driving a racing car on a public road Applications with special network properties and requirements Bringing the Grid to its full potential ! EC-GIN EC-GIN enabled Grid applications Research gap: Grid-specific network enhancements

Uni Innsbruck Informatik - 10 What is EC-GIN? European project: Europe-China Grid InterNetworking –STREP in European IST FP6 Call 6 –2.2 MEuro, 11 partners (7 Europe + 4 China) –Networkers developing mechanisms for Grids

Uni Innsbruck Informatik - 11 Research Challenges Research Challenges: –How to model Grid traffic? Much is known about web traffic (e.g. self-similarity) - but the Grid is different! –How to simulate a Grid-network? Necessary for checking various environment conditions May require traffic model (above) Currently, Grid-Sim / Net-Sim are two separate worlds (different goals, assumptions, tools, people) –How to specify network requirements? Explicit or implicit, guaranteed or “elastic“, various possible levels of granularity –How to align network and Grid economics? Combined usage based pricing for various resources including the network –What P2P methods are suitable for the Grid? What is the right means for storing short-lived performance data?

Uni Innsbruck Informatik - 12 Some issues: application interface... How to specify properties and requirements –Should be simple and flexible - use QoS specification languages? –Should applications be aware of this?  Trade-off between service granularity and transparency! Traditional methodOur approach GIN API

Uni Innsbruck Informatik and peer awareness Grid end system Data flow (b) NSG PEP

Uni Innsbruck Informatik - 14 Problem: How Grid folks see the Internet Abstraction - simply use what is available –still: performance = main goal Wrong. Quote from a paper review: “In fact, any solution that requires changing the TCP/IP protocol stack is practically unapplicable to real-world scenarios, (..).“ How to change this view –Create awareness - e.g. GGF GHPN-RG published documents such as “net issues with grids“, “overview of transport protocols“ –Develop solutions and publish them! (EC-GIN, GridNets) Just like Web Service community Absolutely not like Web Service community ! Conflict! Existing transport system (TCP/IP + Routing +..) works well QoS makes things better, the Grid needs it! –we now have a chance for that, thanks to IPv6

Uni Innsbruck Informatik - 15 A time-to-market issue Research begins (Real-life) coding begins Real-life tests begin Thesis writing Result: thesis + running code; tests in collaboration with different research areas Result: thesis + simulation code; perhaps early real-life prototype (if students did well) Research begins (Simulation) coding begins Thesis writing Typical Grid project Typical Network project Ideal

Uni Innsbruck Informatik - 16 Machine-only communication Trend in networks: from support of Human-Human Communication – , chat via Human-Machine Communication –web surfing, file downloads (P2P systems), streaming media to Machine-machine Communication –Growing number of commercial web service based applications –New “hype“ technologies: Sensor nets, Autonomic Computing vision Semantic Web (Services): first big step for supporting machine-only communication at a high level So far, no steps at a lower level –This would be like RTP, RTCP, SIP, DCCP,... for multimedia apps: not absolutely necessary, but advantageous

Uni Innsbruck Informatik - 17 The long-term value of Grid-net research A subset of Grid-net developments will be useful for other machine-only communication systems! Grid Sensor nets Web service applications Future work Key for achieving this: change viewpoint from “what can we do for the Grid“ to “what can the Grid do for us“ (or from “what does the Grid need“ to “what does the Grid mean to us“)

Uni Innsbruck Informatik - 18 QoS for the Grid A Grid InterNetworking exampe

Uni Innsbruck Informatik - 19 QoS: the state-of-the-art :-( Papers from SIGCOMM‘03 RIPQOS Workshop: “Why do we care, what have we learned?“ QoS`s Downfall: At the bottom, or not at all! Jon Crowcroft, Steven Hand, Richard Mortier,Timothy Roscoe, Andrew Warfield Failure to Thrive: QoS and the Culture of Operational Networking Gregory Bell Beyond Technology: The Missing Pieces for QoS Success Carlos Macian, Lars Burgstahler, Wolfgang Payer, Sascha Junghans, Christian Hauser, Juergen Jaehnert Deployment Experience with Differentiated Services Bruce Davie Quality of Service and Denial of Service Stanislav Shalunov, Benjamin Teitelbaum Networked games --- a QoS-sensitive application for QoS-insensitive users? Tristan Henderson, Saleem Bhatti What QoS Research Hasn`t Understood About Risk Ben Teitelbaum, Stanislav Shalunov Internet Service Differentiation using Transport Options:the case for policy-aware congestion control Panos Gevros

Uni Innsbruck Informatik - 20 Key reasons for QoS failure Required participation of end users and all intermediate ISPs –“normal“ Internet users want Internet-wide QoS, or no QoS at all –In a Grid, a “virtual team“ wants QoS between its nodes –Members of the team share the same ISPs - flow of $$$ is possible Technical inability to provision individual (per-flow) QoS –“normal“ Internet users unlimited number of flows come and go at any time heterogeneous traffic mix –Grid users number of members in a “virtual team“ may be limited clear distinction between bulk data transfer and SOAP messages appearance of flows mostly controlled by machines, not humans  QoS can work for the Grid ! –still, often practical problems (involvement of many ISPs for global Grid,..)

Uni Innsbruck Informatik - 21 Proposed architecture Goal: efficient per-flow QoS without signaling to routers –ultimate dream (very long-term goal): without any router involvement! (99% instead of 100% reliable guarantees) Idea: use traditional coarse-grain QoS (DiffServ) to differentiate between –long-lived bulk data transfer with advance reservation (EF) and –everything else (= SOAP etc. over TCP) (best effort) Allows us to assume isolated traffic; planned to drop this requirement later Because data transfers are long lived, apply admission control –Flows signal to resource broker (RB) when joining or leaving the network Mandate usage of one particular congestion control mechanism for all flows in the EF aggregate –Enables efficient resource usage because flows are elastic

Uni Innsbruck Informatik - 22 Key ingredients of our QoS soup Link capacities must be known, paths should be stable (capacity information should be updated upon routing change) Shared bottlenecks must be known Bottlenecks must be fairly shared by congestion control mechanism irrespective of RTT (max-main fairness required, i.e. all flows must increase their rates until they reach their limit) No signaling to routers = no way to enforce proper behavior  there must be no cheaters –User incentive: fair behavior among cooperating nodes among which Grid application is distributed –Unfair behavior between Grid apps 1 and 2 in same Grid neglected (usually acceptable, as used by same Virtual Organization)

Uni Innsbruck Informatik - 23 Link capacities must be known Can be attained with measurements Working on permanently active, (mostly) passive measurement system for the Grid that detects capacity with packet pair –send two packets p1 and p2 in a row; high probability that p2 is enqueued exactly behind p1 at bottleneck –at receiver: calculate bottleneck bandwidth via time between p1 and p2 –e.g. TCP: “Delayed ACK“ receiver automatically sends packet pairs  passive TCP receiver monitoring is quite good! –exploit longevity - minimize error by listening for a long time

Uni Innsbruck Informatik - 24 Shared bottlenecks must be known Simple basis: distributed traceroute tool –enhancement: traceroute terminates early upon detection of known hop Handle “black holes“ in traceroute –generate test messages from A, B to C - identify signature from B in A‘s traffic –method has worked in the past: “controlled flooding“ for DDoS detection

Uni Innsbruck Informatik - 25 Congestion Control mechanism must be max-min fair Was once said to be impossible without per-flow state in routers –not true; XCP and some others –but these explicit require router support... Main problem: dependence on RTT –three good indications that this can be removed without router support 1.CADPC/PTP (my Ph.D. thesis)... max-min fairness based on router feedback, but only capacity and available bandwidth (could also be obtain by measuring) 2.Result in old paper on phase effects by Sally Floyd 3.TCP Libra Problem: efficiency - no max-min fair “high-speed“ CC mechanism without router support –now: plan to change existing one based on knowledge from above examples increase/decrease factors are f(RTT)

Uni Innsbruck Informatik - 26 Per-flow QoS without signaling to routers continuous measurements; update to BB upon path change Synchronization of distributed (P2P based) database; link capacities known to all brokers 1. may I join?2. yes Synchronization of distributed (P2P based) database; all flows known to all brokers 3. I quit4. ok Traditional method: signaling to edge routers (e.g. with COPS) at this point! Synchronization of distributed (P2P based) database; all flows known to all brokers

Uni Innsbruck Informatik - 27 Efficiency via elasticity QoS guarantees in Grid: „File will be transferred within X seconds“  enables flexible resource usage

Uni Innsbruck Informatik - 28 Efficiency via elasticity /2 Flow 1 stopped, flows 2-4 automatically increase their rates –leading to earlier termination times E2‘-E4‘; known to (calculated by) BB

Uni Innsbruck Informatik - 29 Efficiency via elasticity /3 Flow 5 asks BB for admission –BB knows about current rates and promised E2-E4, grants access

Uni Innsbruck Informatik - 30 Efficiency via elasticity /4 Flow 2 terminates in time –Flows 3-5 will also terminate in time Additional flow admitted and earlier termination times than promised!

Uni Innsbruck Informatik - 31 Elasticity without Congestion Control? Significant amount of additional signaling necessary As Flow-1 stops, Flows 2-4 could increase their rates Without congestion control, signal “increase your rates“ to flows 2-4 required! As flow 5 is admitted, signal “reduce your rates“ to flows 2-4 required!

Uni Innsbruck Informatik - 32 Additional considerations How to assign different rates to different flows? –max-min fairness: if a sender “acts“ like two, it obtains twice the rate –consider rate consisting of slots (e.g. 1 kbit/s = 1 slot) –flows can consist of several slots –let congestion control mechanism operate on slots Possibility: admit new flows when things look even worse –in previous example, flows terminated earlier despite the entry of flow 5  unnecessary, would be possible to reduce their rates a bit (main goal of architecture: say „yes“ to requests and fulfil them) –what if only one of the flows would terminate earlier than promised? –introduce unfairness: remove slots from a flow which previously increased its rate if deadline can still be kept (calculated in BB) –disadvantage: more signaling again

Uni Innsbruck Informatik - 33 Difficult & distant future work Drop requirement of traffic isolation via DiffServ –constantly obtain and update conservative estimate of available bandwidth using packet pair (works without saturating link) –ensure that limit is never exceeded; “condition red“ otherwise! –Some open questions... does this require the CC mechanism to be TCP-friendly? condition red: reduce slots, or let flows be aggressive for a short time? How to handle routing changes –will be noticed, but can reduce capacity  break QoS guarantee –condition red; can happen in worst case, but to be avoided at all cost –mitigation methods very conservative estimate of available bandwidth; leave headroom tell senders to reroute via intermediate end systems Bottom line: lots of complicated issues, but possible to solve them

Uni Innsbruck Informatik - 34 Thank you! Questions?