Download presentation
Presentation is loading. Please wait.
1
Rutgers PANIC Laboratory The State University of New Jersey Self-Managing Federated Services Francisco Matias Cuenca-Acuna and Thu D. Nguyen Department of Computer Science Rutgers University
2
2 Thu D. NguyenPANIC Lab, Rutgers University Federated Computing Rising Internet connectivity is driving a new model of federated computing −Computing systems that span multiple organizations −Sharing of computing resources, data, and services Federated computing appearing at every level −Peer-to-peer: e.g., Netnews, Gnutella, KaZaA −Business-to-business: e.g., federated web services −Scientific computing: e.g., grids (http://www.gpds.org/), PlanetLab Federated applications are hard to build and manage because they must execute in environments that are −Inherently decentralized −Widely distributed −Widely heterogeneous −Highly volatile
3
3 Thu D. NguyenPANIC Lab, Rutgers University PlanetP Infrastructure support for federated computing −Data Management −Collaborative organization of shared data −Content search, rank, and retrieval −Automatic replication for predictable availability (SRDS 2003) −Application management −Automatic (self-) management of #components and their placements −Provide a common application container PlanetP approach −Strongly consistent protocols scale poorly in wide-area environments [Gray et al. 1996, Gupta et al. 2002] −Build on probabilistic and weakly consistent protocols −Weakly consistent shared information −Autonomous actions −Randomized algorithms −Let’s deal with system sizes of up to several thousands first
4
4 Thu D. NguyenPANIC Lab, Rutgers University Application Management: Goal Reduce the deployment and management of an application to −Defining an application “fitness” model −Specify QoS targets such as service availability, max throughput Self-management of number of components and their placements to achieve QoS targets
5
5 Thu D. NguyenPANIC Lab, Rutgers University Application Management App Network App
6
6 Thu D. NguyenPANIC Lab, Rutgers University P2P Information Sharing Infrastructure (PlanetP Data Store) App Management Agent App Publish: A replica running here Current app load Our Approach: Infrastructure App Monitoring Agent Publish: Node description Current node load Availability
7
7 Thu D. NguyenPANIC Lab, Rutgers University Our Approach: Algorithm Periodically, each management agent autonomously searches for a “more fit” configuration −Configuration: #components and their placements −Genetic algorithm (directed random search) If find a configuration that is better than the current configuration by more than a threshold value, deploy it However, wait for a random delay time before deploying new configuration to avoid collisions Publish new configuration after it has been successfully deployed
8
8 Thu D. NguyenPANIC Lab, Rutgers University Example Application Model Fitness is defined as a function f(capacity, #replicas) Expected capacity of a configuration is computed using node availability and CPU idleness Function designed to achieve −Sufficient capacity to meet current load −Minimize number of replicas to minimize replication overheads −Spare capacity to gracefully handle load spikes Two QoS parameters −Availability −Maximum load
9
9 Thu D. NguyenPANIC Lab, Rutgers University Random Delay Choose T to achieve a target probability of collision Case study: −200 nodes −Gossiping interval = 1 sec −Deployment time small V = 8secs P collision = 0.1 0 T Deployment time Probabilistic bound on inconsistency in shared data V
10
10 Thu D. NguyenPANIC Lab, Rutgers University Evaluation: Environments Study responsiveness and stability in three environments −Unloaded cluster −44 PCs interconnected with 100 Mb/s & 1 Gb/s Ethernet −22 2GHz P4 PCs rated at 4000 bogomips −22 2.8GHz Pentium Xeon with hyper-threading PCs rated at ~11200 bogomips −Same cluster loaded by other researchers (large simulation jobs) −PlanetLab −Federated system for distributed systems and networking research −Widely heterogeneous x86 PC nodes: 800 – 5600 bogomips
11
11 Thu D. NguyenPANIC Lab, Rutgers University Evaluation: UDDI Service PlanetP Manager Agent Web server Java Runtime jUDDI HSQLDB/R Site 1 PlanetP community Base gossiping interval = 1sec Site 4 Site 3 Site 2 Tomcat Runtime Mon. Agent
12
12 Thu D. NguyenPANIC Lab, Rutgers University Evaluation: UDDI Service Populated UDDI service with data from Xmethods −Web site listing publicly available web services −400 services 3MB registry database Clients issue random findBusiness queries Fitness function for UDDI service set to: −Max desired CPU resources: 60,000 bogomips −~20 P4 PCs −Max of 4 to 20 replicas
13
13 Thu D. NguyenPANIC Lab, Rutgers University Cluster: No. Replicas vs. Load
14
14 Thu D. NguyenPANIC Lab, Rutgers University Cluster: Acquired Capacity vs. Load
15
15 Thu D. NguyenPANIC Lab, Rutgers University Cluster: Throughput vs. Load
16
16 Thu D. NguyenPANIC Lab, Rutgers University PlanetLab: No. Replicas vs. Load
17
17 Thu D. NguyenPANIC Lab, Rutgers University PlanetLab: Acquired Capacity vs. Load
18
18 Thu D. NguyenPANIC Lab, Rutgers University PlanetLab: Throughput vs. Load
19
19 Thu D. NguyenPANIC Lab, Rutgers University Summary and Future Work Described decentralized management framework for federated applications −Autonomous actions based on weakly consistent shared data for robustness and scalability −Probabilistic serialization to avoid collisions Responsive and stable when managing an example UDDI service in 3 test-beds More sophisticated application models? −Predictive model for resources −Account for cost of changing application configurations −Applications with multiple types of components
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.