Department of Computer Science Northwestern University

Slides:



Advertisements
Similar presentations
INTRODUCTION TO SIMULATION WITH OMNET++ José Daniel García Sánchez ARCOS Group – University Carlos III of Madrid.
Advertisements

ARCHITECTURES FOR ARTIFICIAL INTELLIGENCE SYSTEMS
1 Virtual Machine Resource Monitoring and Networking of Virtual Machines Ananth I. Sundararaj Department of Computer Science Northwestern University July.
Towards Virtual Networks for Virtual Machine Grid Computing Ananth I. Sundararaj Peter A. Dinda Prescience Lab Department of Computer Science Northwestern.
Automatic Run-time Adaptation in Virtual Execution Environments Ananth I. Sundararaj Advisor: Peter A. Dinda Prescience Lab Department of Computer Science.
Increasing Application Performance In Virtual Environments Through Run-time Inference and Adaptation Ananth I. Sundararaj Ashish Gupta Peter A. Dinda Prescience.
Increasing Application Performance In Virtual Environments Through Run-time Inference and Adaptation Ananth I. Sundararaj Ashish Gupta Peter A. Dinda Prescience.
Ashish Gupta, Marcia Zangrilli, Ananth I. Sundararaj, Peter A. Dinda, Bruce B. Lowekamp EECS, Northwestern University Computer Science, College of William.
Network Traffic Measurement and Modeling CSCI 780, Fall 2005.
Dynamic Topology Adaptation of Virtual Networks of Virtual Machines Ananth I. Sundararaj Ashish Gupta Peter A. Dinda Prescience Lab Department of Computer.
Copyright © 2005 Department of Computer Science CPSC 641 Winter Network Traffic Measurement A focus of networking research for 20+ years Collect.
Hardness of Approximation and Greedy Algorithms for the Adaptation Problem in Virtual Environments Ananth I. Sundararaj, Manan Sanghi, John R. Lange and.
Managing Agent Platforms with the Simple Network Management Protocol Brian Remick Thesis Defense June 26, 2015.
eGovernance Under guidance of Dr. P.V. Kamesam IBM Research Lab New Delhi Ashish Gupta 3 rd Year B.Tech, Computer Science and Engg. IIT Delhi.
An Optimization Problem in Adaptive Virtual Environments Ananth I. Sundararaj Manan Sanghi Jack R. Lange Peter A. Dinda Prescience Lab Department of Computer.
Inferring the Topology and Traffic Load of Parallel Programs in a VM environment Ashish Gupta Resource Virtualization Winter Quarter Project.
Free Network Measurement for Adaptive Virtualized Distributed Computing Ashish Gupta, Marcia Zangrilli, Ananth Sundararaj, Anne Huang, Peter A. Dinda,
Dynamic Topology Adaptation of Virtual Networks of Virtual Machines Ananth I. Sundararaj Ashish Gupta Peter A. Dinda Prescience Lab Department of Computer.
Inferring the Topology and Traffic Load of Parallel Programs in a VM environment Ashish Gupta Peter Dinda Department of Computer Science Northwestern University.
Microsoft Virtual Academy Module 4 Creating and Configuring Virtual Machine Networks.
Performance and Power Efficient On-Chip Communication Using Adaptive Virtual Point-to-Point Connections M. Modarressi, H. Sarbazi-Azad, and A. Tavakkol.
A Framework for Collective Personalized Communication Laxmikant V. Kale, Sameer Kumar, Krishnan Varadarajan.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Self Adaptivity in Grid Computing Reporter : Po - Jen Lo Sathish S. Vadhiyar and Jack J. Dongarra.
SCAN: a Scalable, Adaptive, Secure and Network-aware Content Distribution Network Yan Chen CS Department Northwestern University.
A brief overview about Distributed Systems Group A4 Chris Sun Bryan Maden Min Fang.
Grid Data Management A network of computers forming prototype grids currently operate across Britain and the rest of the world, working on the data challenges.
The Center for Autonomic Computing is supported by the National Science Foundation under Grant No NSF CAC Seminannual Meeting, October 5 & 6,
A Metadata Based Approach For Supporting Subsetting Queries Over Parallel HDF5 Datasets Vignesh Santhanagopalan Graduate Student Department Of CSE.
QoS Support in High-Speed, Wormhole Routing Networks Mario Gerla, B. Kannan, Bruce Kwan, Prasasth Palanti,Simon Walton.
Master Thesis Defense Jan Fiedler 04/17/98
UAB Dynamic Tuning of Master/Worker Applications Anna Morajko, Paola Caymes Scutari, Tomàs Margalef, Eduardo Cesar, Joan Sorribes and Emilio Luque Universitat.
Workshop BigSim Large Parallel Machine Simulation Presented by Eric Bohm PPL Charm Workshop 2004.
Virtual Private Grid (VPG) : A Command Shell for Utilizing Remote Machines Efficiently Kenji Kaneda, Kenjiro Taura, Akinori Yonezawa Department of Computer.
Abstract A Structured Approach for Modular Design: A Plug and Play Middleware for Sensory Modules, Actuation Platforms, Task Descriptions and Implementations.
The LSAM Proxy Cache - a Multicast Distributed Virtual Cache Joe Touch USC / Information Sciences Institute 元智大學 資訊工程研究所 系統實驗室 陳桂慧
Troubleshooting Mesh Networks Lili Qiu Joint Work with Victor Bahl, Ananth Rao, Lidong Zhou Microsoft Research Mesh Networking Summit 2004.
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
By Nitin Bahadur Gokul Nadathur Department of Computer Sciences University of Wisconsin-Madison Spring 2000.
1 Internet Traffic Measurement and Modeling Carey Williamson Department of Computer Science University of Calgary.
LACSI 2002, slide 1 Performance Prediction for Simple CPU and Network Sharing Shreenivasa Venkataramaiah Jaspal Subhlok University of Houston LACSI Symposium.
The Application of Data Mining in Telecommunication by Wang Lina February 2003.
Interaction and Animation on Geolocalization Based Network Topology by Engin Arslan.
OpenDaylight Based Machine Learning for Networks
Xin Li, Chen Qian University of Kentucky
Kai Li, Allen D. Malony, Sameer Shende, Robert Bell
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING
Distributed Cache Technology in Cloud Computing and its Application in the GIS Software Wang Qi Zhu Yitong Peng Cheng
Introduction to Load Balancing:
Martin Casado, Nate Foster, and Arjun Guha CACM, October 2014
California Institute of Technology
Design and Manufacturing in a Distributed Computer Environment
Abstract Major Cloud computing companies have started to integrate frameworks for parallel data processing in their product portfolio, making it easy for.
Standards and Patterns for Dynamic Resource Management
Performance Evaluation of Adaptive MPI
Aled Edwards, Anna Fischer, Antonio Lain HP Labs
Collaborative Offloading for Distributed Mobile-Cloud Apps
به نام خدا Big Data and a New Look at Communication Networks Babak Khalaj Sharif University of Technology Department of Electrical Engineering.
Module 5 - Switches CCNA 3 version 3.0.
Software Defined Networking (SDN)
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Ananth I. Sundararaj Ashish Gupta Peter A. Dinda Prescience Lab
Utility-Function based Resource Allocation for Adaptable Applications in Dynamic, Distributed Real-Time Systems Presenter: David Fleeman {
GATES: A Grid-Based Middleware for Processing Distributed Data Streams
An Optimization Problem in Adaptive Virtual Environments
Self-Managed Systems: an Architectural Challenge
Authors: Jinliang Fan and Mostafa H. Ammar
Research Issues in Middleware (Bhaskar)
Towards Predictable Datacenter Networks
L. Glimcher, R. Jin, G. Agrawal Presented by: Leo Glimcher
Presentation transcript:

Department of Computer Science Northwestern University Inferring the Topology and Traffic Load of Parallel Programs in a VM environment Ashish Gupta Peter Dinda Department of Computer Science Northwestern University Introduce the project.

Overview Motivation behind parallel programs in a VM environment Goal: To infer the communication behavior Offline implementation Evaluating with parallel benchmarks Online Monitoring in a VM environment Conclusions More specific

Virtuoso: A VM based abstraction for a Grid environment

Add: we have all this wonderful toys, environment where we can run parallel applications.. Lot of complexity. Resources are out there , but increbvidly complicated : Virtuso: provide a clean abtraction on top of all this

A distributed computing environment based on Virtual Machines Motivation A distributed computing environment based on Virtual Machines Raw machines connected to user’s network Our Focus: Middleware support to hide the Grid complexity Virtuoso project. Efficient execution of parallel applications means better performance, cost and resource utilization for both the user and the service providers

Can ssh into the machine They can use the group of machines as if they would use a local cluster…. The middleware is repsonsible for making it look like a cluster

A distributed computing environment based on Virtual Machines Motivation A distributed computing environment based on Virtual Machines Raw machines connected to user’s network Our Focus: Middleware support to hide the Grid complexity Our goal here: Efficient execution of Parallel applications in such an environment Virtuoso project. Efficient execution of parallel applications means better performance, cost and resource utilization for both the user and the service providers Add: our goal

Intelligent Placement and of parallel applications Application Behavior Intelligent Placement and virtual networking of parallel applications By combining the knowledge of these two aspects, we are in a position to make decisions. Our decision making capability depends on two factors: VM Encapsulation provide flexibility in placement and migration of parallel applications on the network Virtual Networks : can create custom routing and topologies on the underlying physical network The goal of this project focus on the upper right hand part: understanding parallel application communication behavior So what does the behavior look like: Parallel Applications communicate and compute. Communicate according to certain topologies. Also bandwidth VM Encapsulation Virtual Networks With VNET

Abstraction: A set of VMs on same Layer 2 network Virtual Ethernet LAN VNET Abstraction: A set of VMs on same Layer 2 network Virtual Ethernet LAN

An online topology inference framework for a VM environment Goal of this project ? Application Topology Low Level Traffic Monitoring An online topology inference framework for a VM environment Add: Application topology

Design an offline framework Evaluate with parallel benchmarks Approach Design an offline framework Evaluate with parallel benchmarks What is offline ? To test hypothesis Using tcpdump If successful, design an online framework for VMs

An offline topology inference framework Goal: A test-bed for traffic monitoring and evaluating topology inference methods

The offline method Synced Parallel Traffic Monitoring Traffic Filtering and Matrix Generation Wrote a set of perl scripts for each step to automate the whole process. Manually can be painful. Use tcpdump to capture traffic for each host Then we can do the require filtering to remove any non-application projects Do flow aggregation to generate a traffic matrix for the hosts Add: the steps are simple, explain the filterig and algorithm here. Matrix Analysis and Topology Characterization

The offline method Synced Parallel Traffic Monitoring Traffic Filtering and Matrix Generation Then after we get the traffic matrix, we need to have algorithms to infer the topology from the traffic matrix, leaving out the noise For this there are many ways, like pruning, pattern detection etc. Currently a simple linear scaling method is used to infer the topology. Explain the technique Matrix Analysis and Topology Characterization

The offline method Synced Parallel Traffic Monitoring Traffic Filtering and Matrix Generation A simple visualization module was added which can visualize the inferred topology. Using SAMBA from Georgia Tech. Example Matrix Analysis and Topology Characterization

The offline method Synced Parallel Traffic Monitoring Traffic Filtering and Matrix Generation A simple visualization module was added which can visualize the inferred topology. Using SAMBA from Georgia Tech. Example Matrix Analysis and Topology Characterization

The offline method PVMPOV Inference Synced Parallel Traffic Monitoring Traffic Filtering and Matrix Generation A simple visualization module was added which can visualize the inferred topology. Using SAMBA from Georgia Tech. Example Matrix Analysis and Topology Characterization PVMPOV Inference

Infer.pl Synced Parallel Traffic Monitoring Traffic Filtering and Matrix Generation Explain the visualization generated. Finally a one step process: Doitall [parallel program name] and it went through all the steps and displayed the infered topology Add: pvmpov running Matrix Analysis and Topology Characterization Infer.pl

Parallel Benchmarks Evaluation Goal: To test the practicality of low level traffic based inference

Parallel Benchmarks used Synthetic benchmarks: Patterns N-dimensional mesh-neighbor N-dimensional toroid-neighbor N-dimensional hypercubes Tree reduction All-to-All Scheduling mechanism to generate deadlock free and efficient schemes The patterns application was custom - written Used PVM: Parallel virtual Machine A parallel application development library Speak: Our implementation of these communication patterns if efficient and deadlock free 1 2 3

Application benchmarks NAS PVM benchmarks Popular benchmarks for parallel computing 5 benchmarks PVM-POV : Distributed Ray Tracing Many others possible… The inference not PVM specific Applicable to all communication . e.g. MPI, even non-parallel apps Computational and communication characteristics in computational aerodynamics

Patterns application 3-D Toroid 3-D Hypercube 2-D Mesh Reduction Tree Add: identify the topologies Reduction Tree All-to-All

PVM NAS benchmarks Parallel Integer Sort

Traffic Matrix for PVM IS benchmark This traffic matrix shows how placement of host1 is crucial on the network. High bandwidth connectivity should be there Also the virtual network should facilitate the communication between other hosts and host1 Also the decisions need to be dynamic since the properties of the physical network may change Speak: which host talks to which host……

Traffic Matrix for PVM IS benchmark This traffic matrix shows how placement of host1 is crucial on the network. High bandwidth connectivity should be there Also the virtual network should facilitate the communication between other hosts and host1 Also the decisions need to be dynamic since the properties of the physical network may change Speak: host1 is communication is intensive: we can see it sends a lot from tehc column, and receives a lot as from the row Placement of host1 is crucial on the network

An Online Topology Inference Framework: VTTIF Goal: To automatically detect, monitor and report the global traffic matrix for a set of VMs running on a overlay network

VNET Overall Design Abstraction: A set of VMs on same Layer 2 network Virtual Ethernet LAN The framework can aggregate all traffic for all periods of time. Need to detect only interesting communication behavior so that appropriate action can be taken. Add: what is vnet ?

A VNET virtual layer A Virtual LAN over wide area VNET Layer Physical Layer

Extend VNET to include the required features The Challenge here Overall Design VNET Abstraction: A set of VMs on same Layer 2 network Extend VNET to include the required features Monitoring at Ethernet packet level The Challenge here Lacks manual control Detecting interesting parallel program communication ? The framework can aggregate all traffic for all periods of time. Need to detect only interesting communication behavior so that appropriate action can be taken. Add: what is vnet ?

Detecting interesting phenomenon Reactive Mechanisms Proactive Mechanisms Like a Burglar Alarm Video Surveillance Certain address properties Based on Traffic rate Etc. Provide support for queries by external agent Rate based monitoring Non-uniform discrete event sampling What is the Traffic Matrix for the last n seconds ? Explain both the mechanisms with examples

VM Network Scheduling Agent Physical Host VNET daemon VNET overlay network Traffic Analyzer Rate based Change detection Traffic Matrix Query Agent To other VNET daemons Add: text contrast is bad, text is small VM Network Scheduling Agent

Traffic Matrix Aggregation Each VNET daemon keeps track of local traffic matrix Need to aggregate this information for a global view When the rate falls, the local daemons push the traffic matrix (When do you push the traffic matrix ?) Operation is associative: reduction trees for scalability Scalability issues Performance is ok in latency and bandwidth tests. The proxy daemon

Used 4 Virtual Machines over VNET NAS IS benchmark Evaluation Used 4 Virtual Machines over VNET NAS IS benchmark We run the parallel program and the Framework automatically detects the communication and outputs the aggregated traffic matrix when the communication ends The traffic matrix was much larger than that from the physical hosts. Reasons: packets dropped in tcpdump in the offline case.

Possible to infer the topology low level traffic monitoring Conclusions Possible to infer the topology with low level traffic monitoring A Traffic Inference Framework for Virtual Machines Ready to move on to future steps: Adaptation for Performance Add: current and future work…. Add: the numbers on adaptation Add: questions/advertizing…

Capabilities for dynamic adaptation into VNET Current Work Capabilities for dynamic adaptation into VNET Spatial Inference  Network Adaptation for Improved Performance Prelim Results: Improved performance upto 40% in execution time Looking into benefits of Dynamic Adaptation

plab.cs.northwestern.edu http://virtuoso.cs.northwestern.edu For more information http://virtuoso.cs.northwestern.edu VNET is available for download PLAB web site: plab.cs.northwestern.edu