A Decentralized Relational Information Service for Large Scale Distributed Computing Thesis Proposal April 2 nd, 2004 Dong Lu Committee Peter A. Dinda.

Presentation transcript:

A Decentralized Relational Information Service for Large Scale Distributed Computing Thesis Proposal April 2 nd, 2004 Dong Lu Committee Peter A. Dinda (Chair) Fabian E. Bustamante Yan Chen Ian Foster (UC and ANL)

2 What is an information service? An information service stores information about the resources and services in a distributed computing environment and answers queries about them. A GIS is an information service for Grid computing. MDS2 is an example of a GIS.

3 What is RGIS? RGIS: Relational Grid Information Service. RGIS is a decentralized relational information service that is being built on top of a distributed and replicated relational data model.

4 Why RGIS? 1. RGIS can answer complex compositional queries: relational algebra (SQL); joins, which are difficult in a hierarchical model (directory service). 2. Other reasons: indexes separate from the data model; schema evolution; transactional insert/update/delete; consistency.

5 Example Queries and Updates “Find me four Xeon machines with a total of 8 GB of RAM within 5 seconds” “Inform all my friends that the machine dualsword now has 2 GB of RAM within 500 seconds”

6 Thesis Statement A centralized relational information server, such as our current RGIS system, can’t scale with the distributed computing environment. How can we build a scalable distributed relational information service with query and update constraints? We have addressed query constraints by developing query techniques on individual servers to trade off the query time with the size of the result set. We have developed infrastructure for RGIS to support replication through update push. I propose to address update constraints, namely bounds on replica staleness. This will be built on the basis of predictive techniques for statistical quality of service (QoS) for single and parallel end-to-end TCP transfers.

7 Related work MDS2: an OpenLDAP-based information service, part of the Globus Toolkit. R-GMA: another relational data model for GIS, focused on the dynamic properties of resources. MatchMaker: classified advertisements (classads), part of the Condor system. Redline: a language that enables the definition of Constraint Satisfaction Problems (CSPs) and then applies heuristics to solve these NP-hard problems. Etc.

8 Outline Motivation Challenges System Architecture GridG for query evaluation Query techniques on a single server Providing statistical QoS to data transfers on the Internet Update Consistency Constraint Schedule

9 Motivation to build a distributed information service 1. A centralized server can’t scale with the size of a distributed system and its number of users: CPU, memory, and disk can easily become performance bottlenecks. 2. Even if we host the service on a high-performance cluster, the outgoing bandwidth can easily become a performance bottleneck.

10 Thesis Challenges 1. Complex queries sometimes take a long time to finish. We have proposed and implemented scoped, approximate, and nondeterministic query techniques to address this challenge, and we have evaluated them using GridG. (This challenge has been addressed.) 2. How to maintain proper consistency among the replicated databases? I am proposing to maintain soft real-time bounded weak consistency among the servers.

11 Thesis Challenges 3. How to bound the weak consistency by real time? I am proposing to monitor the overlay links to provide soft QoS for data transfers, and then to send updates to the other replicas so that the consistency can be time bounded. 4. How to provide soft statistical QoS for data transfers on the Internet (TCP)? I am proposing to develop a novel TCP throughput benchmarking technique and then to build statistical QoS on the basis of prediction.

12 Outline Motivation Challenges System Architecture GridG for query evaluation Query techniques on a single server Providing statistical QoS to data transfers on the Internet Update Consistency Constraint Schedule

13 RGIS Model of a Grid Annotated network topology graph. Annotation examples –Hosts: memory, disk, OS, NICs, etc. –Router/Switch: backplane bandwidth, ports –Link: latency and bandwidth. Virtualization, Futures, Leases –Virtual machines. [Figure: layered model (Software, Network, Data link, Physical) with the entities module, endpoint, maclink, macswitch, iplink, router, host, connectorswitch, and connectorlink] [SC03-1]

14 RGIS Architecture [SC03-1] Developers: Lu, Dinda, Weinrich, Lange

15 RGIS Design (Intersite) [Figure: sites A, B, and C, each with an RGIS server that pushes updates to its friend sites] The site RGIS server pushes local updates to friend sites. The site RGIS server consolidates updates from its own site and from friend sites. The site RGIS server answers all queries originating from its site.
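The three roles of a site server described on this slide can be sketched in a few lines. This is only an illustration, not the RGIS implementation: the class `SiteServer`, its methods, the per-key version number used to consolidate updates, and the key name `dualsword.mem_gb` are all assumptions made for the example.

```python
from dataclasses import dataclass, field


@dataclass
class SiteServer:
    """Hypothetical stand-in for one site's RGIS server."""
    name: str
    friends: list = field(default_factory=list)  # other SiteServer objects
    store: dict = field(default_factory=dict)    # key -> (version, value)

    def apply(self, key, version, value):
        # Consolidate: keep an update only if it is newer than what we hold.
        current = self.store.get(key)
        if current is None or version > current[0]:
            self.store[key] = (version, value)

    def local_update(self, key, version, value):
        # Apply locally, then push the same update to every friend site.
        self.apply(key, version, value)
        for friend in self.friends:
            friend.apply(key, version, value)

    def query(self, key):
        # Queries are answered from the local replica only.
        entry = self.store.get(key)
        return None if entry is None else entry[1]


a, b, c = SiteServer("A"), SiteServer("B"), SiteServer("C")
a.friends = [b, c]
a.local_update("dualsword.mem_gb", version=1, value=2)
```

After the push, `b.query("dualsword.mem_gb")` is answered locally at site B without contacting site A. The sketch pushes synchronously and ignores transfer time, which is exactly the part the proposed work aims to bound.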

16 Outline Motivation Challenges System Architecture GridG for query evaluation Query techniques on a single server Providing statistical QoS to data transfers on the Internet Update Consistency Constraint Control Schedule

17 GridG: A Synthetic Grid Generator Why GridG? –Evaluation of RGIS query performance, distributed systems simulation, etc. Output: network topology annotated with the hardware and software on each node and link –Layer 3 network: hosts, routers, links –Hosts: memory, architecture, number of CPUs, disk, operating system, vendor, clock rate –Routers: switching capacity –Links: bandwidth and latency [SC03-2, SIGMETRICS PER]

18 Related work: current graph generators Random: Waxman, etc. Hierarchical: Tiers, Transit-Stub, etc. Degree-based: Inet, Brite, etc. 1. GridG is the first topology generator that has a clear three-level hierarchy and also follows the power laws of Internet topology. 2. GridG is the first generator that can annotate the hosts, routers, and links with reasonable properties.

19 Quick review of the power laws of Internet topology [Table: power law and expression, with rows for the rank exponent, outdegree exponent, eigen exponent, and hop-plot exponent]
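For reference, the four power laws named on this slide are commonly written as below in the Internet topology literature. The notation is an assumption, since the slide's own expressions are not part of the transcript: $d_v$ is the outdegree of node $v$, $r_v$ its rank when nodes are sorted by decreasing outdegree, $f_d$ the number of nodes with outdegree $d$, $\lambda_i$ the $i$-th largest eigenvalue of the adjacency matrix, $P(h)$ the number of node pairs within $h$ hops, and $\delta$ the graph diameter.

```latex
\begin{align*}
\text{Rank exponent } \mathcal{R}: \quad & d_v \propto r_v^{\mathcal{R}} \\
\text{Outdegree exponent } \mathcal{O}: \quad & f_d \propto d^{\mathcal{O}} \\
\text{Eigen exponent } \mathcal{E}: \quad & \lambda_i \propto i^{\mathcal{E}} \\
\text{Hop-plot exponent } \mathcal{H}: \quad & P(h) \propto h^{\mathcal{H}}, \quad h \ll \delta
\end{align*}
```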

20 GridG Example Router (switching capacity) Host (arch, numcpu, clock rate, os vendor, mem, disk) Link (bw, latency)

21 Requirements for GridG Realistic topologies –Connected –Hierarchical structure –Power laws of Internet topology Realistic annotations –Distributions of attributes –Correlations of attributes: intra-host, inter-host

22 GridG architecture A sequence of transformations on a text-based representation of an annotated graph.

23 Topology generation (published in ACM SIGMETRICS Performance Evaluation Review) GridG follows the power laws and has a clear three-level hierarchical structure. We propose the following as the relationships among the Internet topology power laws: [Figure: new rank law, outdegree power law, eigenvalue law]

24 Assumed Dependence Tree Graph annotation (complete GridG paper published at SC’03). The dependence tree is transformed into conditional probabilities in the implementation of GridG.
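A minimal sketch of what annotating a host from a dependence tree can look like: each attribute is drawn from a distribution conditioned on its parent in the tree. The tree shape (architecture, then number of CPUs, then memory), the attribute values, and every probability below are placeholders invented for the example, not the values GridG uses.

```python
import random

# Hypothetical dependence tree: architecture -> number of CPUs -> memory.
# All probabilities below are made-up placeholders, not GridG's values.
P_ARCH = {"x86": 0.8, "sparc": 0.2}
P_NUMCPU_GIVEN_ARCH = {
    "x86": {1: 0.6, 2: 0.3, 4: 0.1},
    "sparc": {1: 0.3, 2: 0.4, 4: 0.3},
}
P_MEM_MB_GIVEN_NUMCPU = {
    1: {512: 0.7, 1024: 0.3},
    2: {1024: 0.6, 2048: 0.4},
    4: {2048: 0.5, 4096: 0.5},
}


def draw(distribution, rng):
    """Sample one key from a {value: probability} table."""
    values = list(distribution)
    weights = list(distribution.values())
    return rng.choices(values, weights=weights, k=1)[0]


def annotate_host(rng):
    """Walk the tree from the root down, conditioning each draw on its parent."""
    arch = draw(P_ARCH, rng)
    numcpu = draw(P_NUMCPU_GIVEN_ARCH[arch], rng)
    mem_mb = draw(P_MEM_MB_GIVEN_NUMCPU[numcpu], rng)
    return {"arch": arch, "numcpu": numcpu, "mem_mb": mem_mb}


rng = random.Random(0)
hosts = [annotate_host(rng) for _ in range(1000)]
```

Because memory is drawn conditionally on the CPU count, the generated hosts show the intra-host correlation that independent per-attribute sampling would lose.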

25 GridG V1.0 release

26 Outline Motivation Challenges System Architecture GridG for query evaluation Query techniques on a single server Providing statistical QoS to data transfers on the Internet Update Consistency Constraint Schedule

27 SQL Example of Cluster Finder Query
SELECT [scoped-approx] h1.distip, h2.distip
FROM hosts h1, hosts h2, iplinks l1, iplinks l2, routers r
WHERE h1.mem_mb + h2.mem_mb >= 1024
  AND h1.os = 'linux' AND h2.os = 'linux'
  AND ((l1.src = r.distip AND l2.src = r.distip AND l1.dest = h1.distip AND l2.dest = h2.distip)
    OR (l1.dest = r.distip AND l2.dest = r.distip AND l1.src = h1.distip AND l2.src = h2.distip))
  AND h1.distip <> h2.distip
  AND l1.bw_mbs >= 100 AND l2.bw_mbs >= 100
[SCOPED BY r.distip=X]
WITHIN 100 seconds;

28 Scoped and Approximate query techniques (published at the 4th International Workshop on Grid Computing) Scoped query: all the joins are limited to a neighborhood in the network, exploiting the network topology captured in the RGIS system. Approximate query: the number of joins is reduced by replacing them with constraints on individual objects, and the simplified query is run against the entire network.
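The scoped technique can be illustrated on an in-memory table. This is a sketch under stated assumptions rather than the RGIS query engine: the table `HOSTS`, the function `cluster_pairs`, and the choice of "hosts behind one router" as the neighborhood are all hypothetical, and each host pair is returned once rather than in both orders.

```python
from itertools import combinations

# Hypothetical miniature of the hosts relation; each host hangs off one router.
HOSTS = [
    {"distip": "h1", "os": "linux", "mem_mb": 512, "router": "r1"},
    {"distip": "h2", "os": "linux", "mem_mb": 512, "router": "r1"},
    {"distip": "h3", "os": "linux", "mem_mb": 256, "router": "r1"},
    {"distip": "h4", "os": "linux", "mem_mb": 2048, "router": "r2"},
    {"distip": "h5", "os": "linux", "mem_mb": 1024, "router": "r2"},
    {"distip": "h6", "os": "windows", "mem_mb": 4096, "router": "r2"},
]


def cluster_pairs(hosts, min_total_mb=1024, scope_router=None):
    """Pairs of Linux hosts behind the same router with enough combined memory.

    With scope_router set, the join only considers hosts in that neighborhood.
    """
    if scope_router is not None:
        hosts = [h for h in hosts if h["router"] == scope_router]
    return [
        (a["distip"], b["distip"])
        for a, b in combinations(hosts, 2)
        if a["router"] == b["router"]
        and a["os"] == b["os"] == "linux"
        and a["mem_mb"] + b["mem_mb"] >= min_total_mb
    ]


all_pairs = cluster_pairs(HOSTS)
scoped_pairs = cluster_pairs(HOSTS, scope_router="r1")
```

The scoped call examines 3 candidate pairs instead of 15 and returns a subset of the full answer, which is the trade between query time and result-set size that the slide describes.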

29 Nondeterministic query technique (published at SC’03) Nondeterministic query: a random subset of the network objects is chosen to conduct joins. Another mechanism to trade off query time against the size of the query result set. All three techniques, namely nondeterministic, scoped, and approximate queries, can be time bounded.
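A rough sketch of the nondeterministic idea, assuming a simple two-host memory query: the join runs over a random sample of hosts, so shrinking the sample shrinks both the work and the result set. The function `nondeterministic_pairs`, the synthetic host table, and the sample size of 20 are illustrative assumptions.

```python
import random
from itertools import combinations


def nondeterministic_pairs(hosts, sample_size, min_total_mb, rng):
    """Join only a random subset of hosts; a smaller sample means less work."""
    subset = rng.sample(hosts, min(sample_size, len(hosts)))
    return [
        (a["distip"], b["distip"])
        for a, b in combinations(subset, 2)
        if a["mem_mb"] + b["mem_mb"] >= min_total_mb
    ]


# Hypothetical host table: 200 hosts with 256, 512, or 1024 MB of memory.
rng = random.Random(1)
hosts = [
    {"distip": f"h{i}", "mem_mb": rng.choice([256, 512, 1024])}
    for i in range(200)
]

full = nondeterministic_pairs(hosts, len(hosts), 1024, rng)
sampled = nondeterministic_pairs(hosts, 20, 1024, rng)
```

With 20 sampled hosts the join inspects at most 190 pairs instead of 19,900, and every pair it returns would also appear in the full answer.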

30 Summary of query techniques All results Scoped results Nondeterministic results Approximate results

31 Outline Motivation Challenges System Architecture GridG for query evaluation Query techniques on a single server Providing statistical QoS to data transfers on the Internet Update Consistency Constraint Schedule Proposed work Finished work

32 Update Problem “Inform my friends that the machine dualsword now has 2 GB of RAM within 500 seconds” Update Push How do I make the whole push operation run within the time bound given dynamic network conditions? Proposed work

33 Why do we need statistical QoS for data transfers on the Internet? To bound the data propagation time among the RGIS servers, we need soft deadlines for data transfers –But reservations are typically unavailable –Adapt to the changing network: parallel TCP, overlay multicast –Inform the user when a request is impossible. Statistical QoS is a soft guarantee: to meet the deadline with a specified high probability. It is prediction based.

34 Related work Available bandwidth estimation: packet pair, cprobe, pathchar/pchar, nettimer, pathload, NCS, pathrate, spruce, remos, etc. TCP benchmarking: NWS, etc. can provide real-time TCP throughput prediction. However, available bandwidth differs significantly from the TCP throughput that applications can achieve, and recent research by Sudharshan et al. showed that simple TCP benchmarking can’t predict large file transfers well.

35 Related work Resource ReserVation Protocol: RSVP needs cooperation from routers. However, the routers on an end-to-end path belong to different ISPs, so it is hard to use in practice. Network reservation based QoS: GARA is one example. Service Level Agreement (SLA): it is hard to make SLAs for end-to-end paths.

36 Prediction-based statistical QoS for data transfers on the Internet Main idea: predict TCP data transfer time with a confidence interval. Challenges: –Simple TCP benchmarking techniques fail to predict TCP throughput for large file transfers. –The Internet is dynamically changing. How can we capture the dynamics on the end-to-end path? Proposed work

37 Observations File Size and TCP Throughput are strongly correlated Steady State Throughput

38 Probe Pair: a new TCP benchmarking technique Proposed work Why does simple TCP benchmarking fail?

39 Experimental methodology Purpose: to study the correlation between TCP throughput and flow size, and to evaluate the proposed TCP benchmarking mechanism. Testbed: 40 PlanetLab nodes in North America, Europe, Asia, and Australia. Random pairing repeated 3 times, 60 distinct paths in total. 2,430,000 TCP transfers. TCP flow size: 100 KB, 200 KB, 400 KB, 600 KB, 800 KB, 1 MB, 2 MB, 4 MB, 10 MB (up to 1 GB in other experiments)

40 Verification of Probe Pair (CDF of prediction error)

41 Internet stability Routing stability (fundamental): –Paxson’s work shows that Internet paths are heavily dominated by a single route. Spatial locality and temporal locality of end-to-end TCP throughput: –Balakrishnan et al. showed that nearby Internet hosts often have almost identical distributions of observed throughput to a remote web server –Balakrishnan et al. also showed that end-to-end TCP throughput is stationary on the scale of tens of minutes, and that a lognormal distribution can be used to model the end-to-end TCP throughput

42 Capturing transient Internet stability Given the strong correlation between TCP flow size and throughput, what could be the proper model for End-to-End steady state TCP throughput? –lognormal is a good model for aggregated TCP throughput on a given path, namely, throughput with different TCP flow sizes –What is the proper model for steady state TCP throughput distribution?

43 Capturing transient Internet stability We define a Statistical Stable Region (SSR) as the length of a period of time during which the ratio between the maximum and minimum estimated steady-state TCP throughput is less than a constant factor. Through an extensive Internet measurement study, we found that a normal distribution can be used to model TCP throughput within each SSR.
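One way to make the SSR definition concrete is to cut a series of throughput estimates greedily into runs whose max/min ratio stays below the constant factor, then fit a normal distribution to each run. This is a sketch under assumptions: the greedy left-to-right cut, the factor of 1.5, and the sample values are all invented for illustration, and the function returns the runs themselves rather than their durations.

```python
from statistics import NormalDist


def split_into_ssrs(throughputs, factor):
    """Greedily cut a series into runs whose max/min ratio stays below factor.

    Assumes all throughput values are positive.
    """
    regions = []
    current = []
    for x in throughputs:
        candidate = current + [x]
        if current and max(candidate) / min(candidate) >= factor:
            # Adding x would break the ratio bound: close the current region.
            regions.append(current)
            current = [x]
        else:
            current = candidate
    if current:
        regions.append(current)
    return regions


# Hypothetical steady-state throughput estimates in Mbit/s.
samples = [10.0, 11.0, 10.5, 9.8, 20.0, 21.0, 19.5, 10.2, 10.1]
regions = split_into_ssrs(samples, factor=1.5)

# Fit a normal model per region (needs at least two samples per region).
models = [NormalDist.from_samples(r) for r in regions if len(r) >= 2]
```

On this series the cut falls where the throughput jumps from about 10 to about 20 Mbit/s and back, giving three regions, each with its own mean and standard deviation.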

44 Capturing transient Internet stability

45 Capturing transient Internet stability

46 dualPats: predicting TCP throughput with small probe pairs We build a “dynamic sampling rate adjustment algorithm” to capture the end-to-end TCP throughput dynamics and thereby minimize the probing overhead in dualPats. Proposed work

47 Parallel TCP throughput prediction Parallel TCP is widely used in distributed computing; GridFTP is one example. How can we predict parallel TCP throughput without being intrusive to the network? Proposed work

48 Prediction Example

49 Outline Motivation Challenges System Architecture GridG for query evaluation Query techniques on a single server Providing statistical QoS to data transfers on the Internet Update Consistency Constraint Schedule

50 Related work Strong consistency: a correctness criterion for traditional replicated transactional databases. Weak consistency: examples of distributed systems that greatly favor performance over consistency include Coda, Bayou, etc. There is no bound on inconsistency in such systems. TACT is a distributed system with adjustable consistency bounds among the replicas, but the TACT system focused on logical time bounds.

51 Consistency Constraints Strong consistency is hard, if possible at all, for distributed systems. Weak consistency with a time bound is required for RGIS: any local update will be propagated to all friendly RGIS servers within time T. Proposed work

52 Proposed approach Monitoring the overlay links to predict the data transfer time Finish data propagation within time T with high probability Use application level multicast to enhance efficiency Evaluation: Synthetic updates will be used for evaluation. One possible way is to use the GIS benchmark proposed at Indiana University. Proposed work
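The phrase "within time T with high probability" can be turned into a simple admission check once a throughput model is available. The sketch below assumes the normal model for throughput within an SSR mentioned earlier in the deck and ignores the correlation between flow size and throughput. The function names, the 95% confidence level, and all numbers are hypothetical.

```python
from statistics import NormalDist


def probability_within_deadline(size_mbit, deadline_s, mean_mbps, stdev_mbps):
    """P(transfer finishes by the deadline) when throughput ~ Normal(mean, stdev).

    The transfer meets the deadline when throughput >= size / deadline.
    Units: size in megabits, throughput in megabits per second.
    """
    required = size_mbit / deadline_s
    return 1.0 - NormalDist(mean_mbps, stdev_mbps).cdf(required)


def can_promise(size_mbit, deadline_s, mean_mbps, stdev_mbps, confidence=0.95):
    """Accept the request only if the deadline is met with the wanted probability."""
    p = probability_within_deadline(size_mbit, deadline_s, mean_mbps, stdev_mbps)
    return p >= confidence


# Hypothetical numbers: an 800-megabit update over a link predicted at
# 10 Mbit/s on average with a standard deviation of 2 Mbit/s.
p_loose = probability_within_deadline(800, 500, 10.0, 2.0)  # needs 1.6 Mbit/s
p_tight = probability_within_deadline(800, 80, 10.0, 2.0)   # needs 10 Mbit/s
```

A 500-second bound is met almost surely under these numbers, while an 80-second bound needs exactly the mean throughput and so succeeds only half the time. In that second case the server would tell the user the request cannot be promised.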

53 Alternative approach For better scalability, history based prediction can be combined with the overlay monitoring for the prediction. Proposed work

54 Outline Motivation Challenges System Architecture GridG for query evaluation Query techniques on a single server Providing statistical QoS to data transfers on the Internet Update Consistency Constraint Schedule

55 Proposed schedule Statistical QoS for data transfer on the Internet: Proposed completion date: April, 2004 Consistency constraints: Proposed completion date: October, 2004 Integrate RGIS system and evaluation: Proposed completion date: January, 2005 Finish writing dissertation: Proposed completion date: May, 2005

56 Acknowledgement Jack Lange, Yi Qiao, Jason Skicewicz, Andrew Weinrich.

57 Thesis Statement A centralized relational information server, such as our current RGIS system, can’t scale with the distributed computing environment. How can we build a scalable distributed relational information service with query and update constraints? We have addressed query constraints by developing query techniques on individual servers to trade off the query time with the size of the result set. We have developed infrastructure for RGIS to support replication through update push. I propose to address update constraints, namely bounds on replica staleness. This will be built on the basis of predictive techniques for statistical quality of service (QoS) for single and parallel end-to-end TCP transfers.