Efficient Incremental Optimal Chain Partition of Distributed Program Traces Selma Ikiz Vijay K. Garg Parallel and Distributed Systems Laboratory

Outline

Introduction and motivation; problem definition; two previous algorithms; offline vs. incremental use, with experimental results; a new incremental algorithm; experimental results; concluding remarks.

This is the general outline of today's talk. I will start by giving some background information and our motivation. I will then briefly go over the problem definition and the two previously known algorithms. Next, I'll describe how these algorithms can be used in both an offline and an incremental manner, and show some experimental results. Finally, I'll introduce a new incremental algorithm and show its experimental results.

Software testing and debugging

Verification: commercial software has a large number of components, so a formal proof of correctness is not feasible; predicate detection (runtime verification) complements simulation and formal methods. Debugging: programs have a large number of states, so we rely on abstraction, i.e. grouping of states or processes.

Today's commercial software has a large number of components; hence, in general, it is not feasible to give a formal proof of correctness, and extensive simulation is still the most common verification method in industry. Predicate detection on program traces (also called runtime verification) is a way to verify the correctness of these computations. Also, due to the large number of states, programs are hard to debug; one way of easing this complexity is grouping states or processes.

Distributed computation as a partially ordered set

Partial-order models. A poset (X, P): X is a set and P is a reflexive, antisymmetric, and transitive binary relation on X. Lamport 1978: the "happened-before" relation, e.g. f1 → e3 ⇒ c(f1) < c(e3). Fidge 1991 and Mattern 1989: vector clocks.

[Figure: a three-process space-time diagram. P1 executes e1, e2, e3 with vector clocks (1,0,0), (2,2,0), (3,2,0); P2 executes f1, f2, f3 with (0,1,0), (0,2,0), (0,3,0); P3 executes g1, g2 with (0,2,1), (3,2,2).]
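As a minimal sketch (assuming, as with Fidge/Mattern clocks, that timestamps are equal-length tuples and e happened before f iff V(e) ≤ V(f) componentwise with V(e) ≠ V(f)), the happened-before test is a componentwise comparison; the event names and clocks below are taken from the diagram above.

    # Minimal sketch of the Fidge/Mattern vector-clock order.
    def happened_before(ve, vf):
        return all(a <= b for a, b in zip(ve, vf)) and ve != vf

    def concurrent(ve, vf):
        return not happened_before(ve, vf) and not happened_before(vf, ve)

    # From the diagram: f1 -> e3, while e1 and f1 are concurrent.
    assert happened_before((0, 1, 0), (3, 2, 0))
    assert concurrent((1, 0, 0), (0, 1, 0))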

Optimal chain partition of a poset

Width: the size of the largest antichain (a subset of the poset in which every distinct pair is mutually incomparable). A poset cannot be partitioned into k chains if k < width(P) [R. P. Dilworth]. Applications: debugging (visualization), testing and analyzing bounded sum predicates (x1 + x2 + x3 < k), mutual exclusion violation.

Let's define what we mean by an optimal chain partition of a poset. The width of a poset is the size of its largest antichain, a subset in which every distinct pair is incomparable. By Dilworth's theorem, a poset can be partitioned into k chains if and only if k is greater than or equal to the width; in short, the size of an optimal chain partition equals the width. What use is this to us? We can visualize the distributed computation for debugging purposes, and we can use it in predicate detection. Next I'll go over an example to demonstrate the relation between the mutual exclusion problem and optimal chain partition.
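To make "width" concrete, here is a brute-force sketch (exponential, fine only for tiny examples, and not the paper's method), reusing the concurrent test from the earlier sketch:

    from itertools import combinations

    def is_antichain(events):
        # Every distinct pair must be mutually incomparable.
        return all(concurrent(u, v) for u, v in combinations(events, 2))

    def width(events):
        # Size of the largest antichain; singletons are always antichains.
        return max(len(s) for r in range(1, len(events) + 1)
                   for s in combinations(events, r) if is_antichain(s))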

Mutual exclusion violation example

[Figure: the previous computation, one chain per process, with the critical events (shown in green) highlighted.]

Let's look at the previous computation and assume that the green events have executed the critical section. We can trivially partition this poset as one chain per process. Examining the timestamps, it is easy to see that the third chain can be attached to the end of the first chain, which gives a reduction to two chains. To show that this partition cannot be reduced any further, we need to find an antichain of size two; here two critical events are concurrent, so we have detected a mutual exclusion violation.
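A hedged sketch of this detection step, reusing the helpers above (the sample timestamps are hypothetical stand-ins for the green events):

    def mutex_violation(critical_events):
        # A violation is an antichain of size two among critical events.
        for i, u in enumerate(critical_events):
            for v in critical_events[i + 1:]:
                if concurrent(u, v):
                    return u, v
        return None

    print(mutex_violation([(2, 2, 0), (0, 3, 0)]))   # -> ((2, 2, 0), (0, 3, 0))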

Problem definition & previous algorithms

Given a chain partition C = {C1, . . . , CN} of P into N disjoint chains, rearrange these chains into a chain partition with the fewest number of chains. Previous algorithms answer the question: given k chains, is it possible to partition the poset into k-1 chains? Bogart and Magagnosc (BM); Tomlinson and Garg (TG).

We define the chain reduction problem as follows: given a chain partition of P into N disjoint chains, rearrange the elements into an optimal chain partition of the poset. The previous algorithms, by Bogart and Magagnosc and by Tomlinson and Garg, reduce the chain partition by one chain if possible and otherwise return an antichain. From here on I'll refer to them as BM and TG, and I'll walk through each using an example.

Bogart & Magagnosc

A sequence of elements a0, b0, a1, b1, ..., as, bs is a reducing sequence if: a0 is the least element of some chain; bi is the immediate predecessor of ai+1 in some chain; all bi are distinct; for all i, ai > bi in the partial order; and bs is the greatest element of its chain.

[Figure: an example chain partition C1-C4 over the timestamps (1,0,0)-(3,0,0), (0,1,0)-(0,3,8), (0,0,1)-(0,0,6), (3,4,0), (3,5,0).]

BM keeps an adjacency list for each event. It starts with an element and tries to find a reducing sequence, which enables the reduction. For example, starting with the first tail, we find that it is less than (3,4,0), that (0,3,0) is less than (0,3,7), and that (0,0,3) is less than (0,0,4); by rearranging the chains along this sequence, we reduce the chain partition by one.
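The following is a rough sketch of the reducing-sequence search implied by the definition above, assuming each chain is a list of timestamps sorted from least to greatest and reusing happened_before as the order; it illustrates the definition rather than reproducing the authors' adjacency-list implementation.

    from collections import deque

    def find_reducing_sequence(chains):
        # Search for a0, b0, a1, b1, ..., as, bs as defined above.
        for chain in chains:
            a0 = chain[0]                      # least element of some chain
            queue = deque([(a0, [a0])])
            used = set()                       # keeps the b_i distinct
            while queue:
                a, seq = queue.popleft()
                for cj, other in enumerate(chains):
                    for pos, b in enumerate(other):
                        if (cj, pos) in used or not happened_before(b, a):
                            continue           # need a_i > b_i
                        used.add((cj, pos))
                        if pos == len(other) - 1:
                            return seq + [b]   # b_s is greatest in its chain
                        # b_i's immediate successor becomes a_{i+1}
                        queue.append((other[pos + 1], seq + [b, other[pos + 1]]))
        return None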

Tomlinson and Garg

[Figure: an animation merging the input chains headed by (1,0,0), (0,1,0), (0,0,1), and (0,0,4) into output chains through a spanning tree of queues.]

TG solves the same problem by merging k chains into k-1 chains. It introduces a spanning tree of queues between the input chains and the output chains, and at each step it compares only the chain heads to decide which event is placed into the output chains. For example, in the first step it finds that (0,0,1) is less than (0,0,4) and places (0,0,1) into the queue between those two input chains. The process continues until it finds an empty input chain, in which case it appends the remaining elements to the output chains, or until all the heads are incomparable to each other, in which case the heads form an antichain and no reduction is possible. In this example, the first input chain empties first, so the rest is appended to the output chains.

Offline vs. incremental

Offline: call BM or TG repeatedly until there is no further reduction in the number of chains. Incremental (naïve): place the new element into a new chain, then call BM or TG.

We can use these algorithms in both an offline and an incremental manner. In the offline version, we start with the trivial partition and call the algorithm over and over until it returns an antichain. In the naïve incremental version, we create a new chain containing only the new element, call the algorithm, and use its output as the new chain partition; see the sketch below. To operate on incremental posets we need one assumption, described next.
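In code, the two modes look roughly like this (a sketch: reduce_once stands for either BM or TG and is assumed to return a smaller partition, or None when the current partition is already optimal):

    def offline_partition(chains, reduce_once):
        # Start from the given partition; reduce until reduction fails.
        while True:
            smaller = reduce_once(chains)
            if smaller is None:            # an antichain was returned
                return chains
            chains = smaller

    def naive_incremental(chains, new_event, reduce_once):
        # Place the new element into a chain of its own, then reduce.
        return offline_partition(chains + [[new_event]], reduce_once)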

Linear extension hypothesis [Bouchitté & Rampon]

Let f be the new event; we assume that all events that happened before f have already arrived and been processed.

We assume that events arrive according to the linear extension hypothesis of Bouchitté and Rampon, which says that the order of event arrivals is a linearization of the poset. For example, if f is the new event, then all events that happened before f have already arrived and been processed. We implemented these algorithms and measured their performance.
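As a quick sanity check (a sketch, reusing happened_before), an arrival order satisfies the hypothesis exactly when no event happens before one that arrived earlier:

    def is_linear_extension(arrivals):
        # Every predecessor of an event must appear earlier in the order.
        return not any(happened_before(e, f)
                       for i, f in enumerate(arrivals)
                       for e in arrivals[i + 1:])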

Memory usage

We restricted the memory usage to 512 MB. In terms of memory usage, TG is better than BM, which is expected, since BM creates an adjacency list for every event.

Running times

In running time, the naïve incremental algorithms are far behind the offline versions, which is also expected, since they compare the same events over and over. Of the two offline algorithms, TG performed better than BM; I believe this is due to BM's setup cost, since at every step the adjacency lists are modified. Now I move on to the new incremental algorithm.

Idea behind Chain Partitioner

A new element either has no effect on the maximum antichain, updates the maximum antichain (by replacing an event), or expands the maximum antichain. Given the maximum antichain, it is therefore sufficient to compare the new element with the maximum antichain.

Here is the idea behind the new algorithm, which we call Chain Partitioner (CP for short). Assume we have seen the events in the blue shaded area so far, and the red dashed line marks the maximum antichain of that subposet. A new element can affect the chain partition in only the three ways above, so we never need to start from the beginning: comparing the new element against the maximum antichain suffices to maintain the partition. A sketch of this insertion step follows.
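A hedged sketch of that insertion step, assuming the work space holds the chains at or above the current maximum antichain, and with merge standing for the modified TG merge described next:

    def cp_insert(work, new_event, merge):
        # Cases 1 and 2: the new event extends some work-space chain
        # (possibly replacing an element of the maximum antichain).
        for chain in work:
            if happened_before(chain[-1], new_event):
                chain.append(new_event)
                return work
        # Case 3: incomparable to every tail; it may expand the maximum
        # antichain, so re-merge the work space.
        work.append([new_event])
        return merge(work)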

CP example

[Figure: a computation over the events timestamped (1,0,0), (0,1,0), (1,2,0), (2,0,0), (1,2,1), (2,3,0), (3,0,0), (1,2,3), (2,4,1), (4,3,0).]

To describe the CP algorithm, I'll go over an example, and we'll see the effect of these three cases. Assume the computation above is given and that we have observed the linearization (1,0,0), (0,1,0), (1,2,0), (2,0,0), (1,2,1), (2,3,0), (3,0,0), (1,2,3), (2,4,1), (4,3,0), i.e., the events arrive in this order.

[Figure: an animation of CP and merge processing the linearization above; the frames show the history (H) and work space (W) chains, and merge's input chains, output chains (O), and queues (Q), with k = 3.]

CP maintains two sets of chains, the history and the work space. It places the first element as a new chain in the work space, adds the second as a second chain, and finds a tail to append the third element to. It cannot find a chain to append (2,0,0) to, so it creates a new chain for it and makes a merge call. We also modified merge so that it keeps history chains: it moves elements to the history whenever it finds an antichain of size k or k-1 (for this example, k is 3). Merge takes CP's work chains as its input chains and creates the output chains, the history chains, and the tree structure.

In the first step, merge finds that (1,0,0) is less than (2,0,0), and with that an antichain of size 2; hence it places (1,0,0) in the history queues and updates the spanning tree. It continues until there is an empty chain, then returns the result to CP, which updates its work space with merge's output queues and appends the history chains to its history space.

Continuing with new events, CP finds places for the fifth and sixth events in the existing chains, but cannot place (3,0,0), so it makes another merge call. In the third round of that call, merge finds that all the heads are incomparable to each other, so it stops and returns the result to CP, which updates its work space with the input chains. From there, CP places the rest of the elements into the existing chains without any further merge call. In conclusion, for these 10 events we made 2 merge calls, while the naïve algorithm makes 10; and by pruning the work space we avoid the redundant comparisons. We implemented this algorithm and compared it with TG, since TG had the best performance among the previous algorithms.

Running times

Looking at the simulation results, we find that pruning the work space has a big impact on the incremental algorithm. To better understand this behavior, we expanded our test cases.

Test suites

Seven new test suites were created. Each test suite contains 18 different test cases; the initial partitions vary from 10 to 450 chains, and the sizes from 100 to 70,000 events. We used a fixed vector-clock size of 10. Reducing factor = (N - w)/N, where N is the size of the initial partition and w is the width of the poset; for example, an initial partition of N = 450 chains over a poset of width w = 180 has reducing factor (450 - 180)/450 = 0.6. Posets are randomly created according to a given width and size, and the test suites differ in their reducing factor.

We created seven new test suites, each containing 18 test cases with different initial partitions and sizes. The reducing factor is, in essence, the fraction by which the initial partition size can be reduced. We created the posets randomly for a given width and size, partitioned each for a given reducing factor, and fixed the vector-clock size at 10.

Reducing factor effect

Here we show only 3 of the 18 test cases. Unfortunately, there is no distinct cut-off point that holds for all cases, but TG computes at best 8 times faster than CP (when the reducing factor is 0), while CP computes at best 30 times faster than TG (when the reducing factor is 0.6). The first test case has 5,000 elements, and we observed that the incremental algorithm pulls ahead once the reducing factor exceeds 0.2, while for the others the crossover is at 0.4. To better characterize the results, we calculated the average time spent per event.

Average run time per event

On average, CP performs better when the reducing factor is higher than 0.2.

Comparison with related work

Concluding remarks

Partitioning a distributed computation under the linear extension hypothesis: pruning the work space (at no significant extra cost) improves the performance of the incremental algorithm. Main limitation: for bounded sum predicates such as x1 + x2 + x3 < k, CP is efficient only for small k. Future work: a decentralized algorithm; integrating with computation slicing.

We presented a new incremental algorithm for finding an optimal chain partition of distributed program traces, and showed that exploiting the structure of the poset improves the algorithm's performance. The main limitation of CP concerns bounded sum predicates: the variables have to be positive integers, and if they are not binary, CP works efficiently only for small k. For future work, it would be nice to find a decentralized algorithm and to integrate CP with the computation slicing technique.

Questions ?

Trace model: total order vs. partial order

Total order: an interleaving of the events in a trace. Relevant tools: Temporal Rover [Drusinsky 00], Java-MaC [Kim, Kannan, Lee, Sokolsky, and Viswanathan 01], jPaX [Havelund and Rosu 01]. Partial order: Lamport's happened-before model, e.g. jMPaX [Sen, Rosu, and Agha 03]. Total order: + low computational complexity. Partial order: + suitable for concurrent and distributed programs; + encodes an exponential number of total orders ⇒ captures bugs that may not be found with a total order.