Algorithms for Selecting Multiple Mirror Sites for Parallel Download Yu Cai 12 / 2003 UCCS.

Slides:



Advertisements
Similar presentations
The strength of routing Schemes. Main issues Eliminating the buzz: Are there real differences between forwarding schemes: OSPF vs. MPLS? Can we quantify.
Advertisements

Scheduling in Distributed Systems Gurmeet Singh CS 599 Lecture.
CS6800 Advanced Theory of Computation
A Topological Interpretation for Mass Transit Network Connectivity July 8, 2006 Chulmin Jun, Seungjae Lee, Hyeyoung Kim & Seungil Lee The University of.
Maximum Battery Life Routing to Support Ubiquitous Mobile Computing in Wireless Ad Hoc Networks By C. K. Toh.
1 Efficient and Robust Streaming Provisioning in VPNs Z. Morley Mao David Johnson Oliver Spatscheck Kobus van der Merwe Jia Wang.
Tuesday, May 14 Genetic Algorithms Handouts: Lecture Notes Question: when should there be an additional review session?
Optimizing genetic algorithm strategies for evolving networks Matthew Berryman.
Finite State Machine State Assignment for Area and Power Minimization Aiman H. El-Maleh, Sadiq M. Sait and Faisal N. Khan Department of Computer Engineering.
COMP305. Part II. Genetic Algorithms. Genetic Algorithms.
A new crossover technique in Genetic Programming Janet Clegg Intelligent Systems Group Electronics Department.
Basic Data Mining Techniques Chapter Decision Trees.
Genetic Algorithms Can Be Used To Obtain Good Linear Congruential Generators Presented by Ben Sproat.
1 Caching/storage problems and solutions in wireless sensor network Bin Tang CSE 658 Seminar on Wireless and Mobile Networking.
Iterative Improvement Algorithms
Intelligent Agents What is the basic framework we use to construct intelligent programs?
COMP305. Part II. Genetic Algorithms. Genetic Algorithms.
Genetic Algorithms Nehaya Tayseer 1.Introduction What is a Genetic algorithm? A search technique used in computer science to find approximate solutions.
Genetic Algorithm What is a genetic algorithm? “Genetic Algorithms are defined as global optimization procedures that use an analogy of genetic evolution.
Heuristic Algorithms for Multiconstrained Quality-of-Service Routing Xin Yuan, Member, IEEE IEEE/ACM TRANSACTIONS ON NETWORKING, VOL. 10, VO. 2, APRIL.
Chapter 6: Transform and Conquer Genetic Algorithms The Design and Analysis of Algorithms.
The Shortest Path Problem
Coordinative Behavior in Evolutionary Multi-agent System by Genetic Algorithm Chuan-Kang Ting – Page: 1 International Graduate School of Dynamic Intelligent.
1 Reasons for parallelization Can we make GA faster? One of the most promising choices is to use parallel implementations. The reasons for parallelization.
Distributed Quality-of-Service Routing of Best Constrained Shortest Paths. Abdelhamid MELLOUK, Said HOCEINI, Farid BAGUENINE, Mustapha CHEURFA Computers.
Topology Design for Service Overlay Networks with Bandwidth Guarantees Sibelius Vieira* Jorg Liebeherr** *Department of Computer Science Catholic University.
A Budget Constrained Scheduling of Workflow Applications on Utility Grids using Genetic Algorithms Jia Yu and Rajkumar Buyya Grid Computing and Distributed.
Evolving a Sigma-Pi Network as a Network Simulator by Justin Basilico.
Genetic Algorithm for Multicast in WDM Networks Der-Rong Din.
Network Aware Resource Allocation in Distributed Clouds.
Slides are based on Negnevitsky, Pearson Education, Lecture 12 Hybrid intelligent systems: Evolutionary neural networks and fuzzy evolutionary systems.
Integrating Neural Network and Genetic Algorithm to Solve Function Approximation Combined with Optimization Problem Term presentation for CSC7333 Machine.
Cristian Urs and Ben Riveira. Introduction The article we chose focuses on improving the performance of Genetic Algorithms by: Use of predictive models.
Parallel Access For Mirror Sites in the Internet Yu Cai.
1 On the Placement of Web Server Replicas Lili Qiu, Microsoft Research Venkata N. Padmanabhan, Microsoft Research Geoffrey M. Voelker, UCSD IEEE INFOCOM’2001,
Investigation of the Effect of Neutrality on the Evolution of Digital Circuits. Eoin O’Grady Final year Electronic and Computer Engineering Project.
Heterogeneous Network Topology Generators Amer Zaheer 1.
Zorica Stanimirović Faculty of Mathematics, University of Belgrade
Boltzmann Machine (BM) (§6.4) Hopfield model + hidden nodes + simulated annealing BM Architecture –a set of visible nodes: nodes can be accessed from outside.
Researchers: Preet Bola Mike Earnest Kevin Varela-O’Hara Han Zou Advisor: Walter Rusin Data Storage Networks.
1 Oblivious Routing in Wireless networks Costas Busch Rensselaer Polytechnic Institute Joint work with: Malik Magdon-Ismail and Jing Xi.
ASC2003 (July 15,2003)1 Uniformly Distributed Sampling: An Exact Algorithm for GA’s Initial Population in A Tree Graph H. S.
Introduction to Evolutionary Algorithms Session 4 Jim Smith University of the West of England, UK May/June 2012.
1 On the Placement of Web Server Replicas Lili Qiu, Microsoft Research Venkata N. Padmanabhan, Microsoft Research Geoffrey M. Voelker, UCSD IEEE INFOCOM’2001,
Chapter 4.1 Beyond “Classic” Search. What were the pieces necessary for “classic” search.
Resource Mapping and Scheduling for Heterogeneous Network Processor Systems Liang Yang, Tushar Gohad, Pavel Ghosh, Devesh Sinha, Arunabha Sen and Andrea.
2005MEE Software Engineering Lecture 11 – Optimisation Techniques.
Game Theory, Social Interactions and Artificial Intelligence Supervisor: Philip Sterne Supervisee: John Richter.
What the senior design students have been doing By Chris Klumph and Kody Willman.
Genetic Algorithms What is a GA Terms and definitions Basic algorithm.
Biologically inspired algorithms BY: Andy Garrett YE Ziyu.
Routing and Scheduling in Multistage Networks using Genetic Algorithms Advisor: Dr. Yi Pan Chunyan Ji 3/26/01.
Improving Support Vector Machine through Parameter Optimized Rujiang Bai, Junhua Liao Shandong University of Technology Library Zibo , China { brj,
Graph Data Management Lab, School of Computer Science Personalized Privacy Protection in Social Networks (VLDB2011)
D Nagesh Kumar, IIScOptimization Methods: M8L5 1 Advanced Topics in Optimization Evolutionary Algorithms for Optimization and Search.
A Fast Genetic Algorithm Based Static Heuristic For Scheduling Independent Tasks on Heterogeneous Systems Gaurav Menghani Department of Computer Engineering,
On the Placement of Web Server Replicas Yu Cai. Paper On the Placement of Web Server Replicas Lili Qiu, Venkata N. Padmanabhan, Geoffrey M. Voelker Infocom.
Network Partition –Finding modules of the network. Graph Clustering –Partition graphs according to the connectivity. –Nodes within a cluster is highly.
Genetic Algorithms. Underlying Concept  Charles Darwin outlined the principle of natural selection.  Natural Selection is the process by which evolution.
Genetic algorithms for task scheduling problem J. Parallel Distrib. Comput. (2010) Fatma A. Omara, Mona M. Arafa 2016/3/111 Shang-Chi Wu.
An Evolutionary Algorithm for Neural Network Learning using Direct Encoding Paul Batchis Department of Computer Science Rutgers University.
Genetic Algorithm(GA)
Genetic Algorithm. Outline Motivation Genetic algorithms An illustrative example Hypothesis space search.
Hirophysics.com The Genetic Algorithm vs. Simulated Annealing Charles Barnes PHY 327.
 Negnevitsky, Pearson Education, Lecture 12 Hybrid intelligent systems: Evolutionary neural networks and fuzzy evolutionary systems n Introduction.
Traffic Simulator Calibration
Resource Allocation in Heterogeneous Computing Systems
Totally Disjoint Multipath Routing in Multihop Wireless Networks Sonia Waharte and Raoef Boutaba Presented by: Anthony Calce.
Boltzmann Machine (BM) (§6.4)
Md. Tanveer Anwar University of Arkansas
Presentation transcript:

Algorithms for Selecting Multiple Mirror Sites for Parallel Download Yu Cai 12 / 2003 UCCS

Introduction By utilizing the HTTP 1.1 byte range header, we can retrieve a specific range of data from a mirror server site. It provide possibility to retrieve documents from multiple mirror sites in parallel to increase the downloading speed. But the mirror sites selection is still a problem. Different server selection may result in different performance.

Diagram of Parallel Download

Project Goal In this project, we develop algorithms to choose the best multiple mirror sites for parallel download. We implement brutal force algorithms as well as genetic algorithms. We test the algorithms in the simulated network as well as the real-world network.

Related Work on Algorithms Mirror server and cache server selection problem has been studied recent years. Formal approach: abstract network model; use graph theory. Common assumptions when getting network model: a) network topology is known, b) the cost associated with each path is known, c) single and static network connections.

Related Work on Algorithms Algorithms include: (selecting M replicas among N potential sites) NP-hard problem. Need to develop heuristic algorithms, or by loosing the optimal constrains to simplify the problem to make it solvable in P- time. tree-basedgreedyrandomhot spot O(N 3 M 2 )O(N 2 M)O(NM)N 2 + min (NlogN, NM)

Problems to be studied 1) What is the possible maximum download speed for a given network topology? We refer to it as “global max speed”. 2) How many mirror sites to need to be chosen to achieve the global max speed, and which are them? 3) If we only want to choose a certain number of mirror sites, say 5 sites, what is the maximum download speed we can get, and which 5 sites to choose? We refer to the speed as “n sites max speed”. 4) There might be multiple selection results to achieve the max speed. what is the criteria to pick “the best from the best”? 5) What is the complexity of the algorithm?

Network Graph Model G=(V, E), –V: the set of nodes –E: the set of edges/paths The maximum download speed at node r using mirror server set S, mds(r,S)= The maximum download speed by selecting k mirror servers from set S, k-pds(c,S)=max{mds(c, S’)|S’ S, S’ has k nodes}

An Example mds(S1,S)=30, mds(S2,S)=25 mds(R2,S)=min(mds(S1,S),5)+min(mds(S2,S),8) =min(30,5)+min(25,8)=5+8=13, similarly, mds(R3, S)=16, mds(R1,S)=min(mds(R2,S),40)+min(mds(R3,S),3 0)=min(13,40)+min(16,30)=13+16=29, mds(c, S)=mds(R1, S)=29, 3-pds(c,S)=max{mds(c,{S1,S2,S3}),mds(c,{S1, S2,S4}),mds(c,{S1,S3,S4}),mds(c,{S2,S3,S4})} =max{20,22,21,24}=24, the subset of mirror servers to use 3-pdsSubset(c,S)={S 2,S 3,S 4 }. Similarly, 2-pds(c,S)=17 and 2-pdsSubset(c, S)={S 2,S 4 }.

Algorithm Implementation Brutal Force Algorithm: implements previous formulas. –for mds –for k-pds Genetic Algorithm: –fix-length genetic algorithm –variable-length genetic algorithm.

Genetic Algorithm 1) Assign the sequential server number, node number and path number to denote each server, node and path. Assign the initial bandwidth and server speed. 2) Initialize the first generation of chromosomes with random length by filling server number in chromosome. 3) Crossover and mutation at certain probability. Make sure no duplicated server in chromosome, and the length of chromosome is less than the given number. Several different crossover and mutation methods have been combined together for better performance. 4) Fitness function. For a given chromosome S’, use the max download speed mds(c, S’) as fitness function. 5) Run certain generations, and output the result.

Crossover in Genetic Algorithm

Testing Results We tested the algorithms on simulated network as well as real-world network. GT-ITM (Georgia Tech Internetwork Topology Models), is used to generate network topologies with varying sizes for simulation.

Parallel Download Algorithm Performance

Future Work Extend work to proxy server based multipath connections. Investigate more algorithms and related works. Do more simulation and performance test. Develop non-heuristic algorithms