ON THE EFFICIENCY OF THE HAMMING C-CENTERSTRING PROBLEMS Amihood Amir Liam Roditty Jessica Ficler Oren Sar Shalom.

Slides:



Advertisements
Similar presentations
Triangle partition problem Jian Li Sep,2005.  Proposed by Redstar in Algorithm board in Fudan BBS.  Motivated by some network design strategy.
Advertisements

Minimum Clique Partition Problem with Constrained Weight for Interval Graphs Jianping Li Department of Mathematics Yunnan University Jointed by M.X. Chen.
C&O 355 Lecture 23 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A A A A A A A A.
Approximation Algorithms
GRAPH BALANCING. Scheduling on Unrelated Machines J1 J2 J3 J4 J5 M1 M2 M3.
Greedy Algorithms Amihood Amir Bar-Ilan University.
Lecture 24 Coping with NPC and Unsolvable problems. When a problem is unsolvable, that's generally very bad news: it means there is no general algorithm.
Private Approximation of Search Problems Amos Beimel Paz Carmi Kobbi Nissim Enav Weinreb Ben Gurion University Research partially Supported by the Frankel.
1 NP-Complete Problems. 2 We discuss some hard problems:  how hard? (computational complexity)  what makes them hard?  any solutions? Definitions 
Approximation Algorithms Chapter 5: k-center. Overview n Main issue: Parametric pruning –Technique for approximation algorithms n 2-approx. algorithm.
Complexity 16-1 Complexity Andrei Bulatov Non-Approximability.
CSC5160 Topics in Algorithms Tutorial 2 Introduction to NP-Complete Problems Feb Jerry Le
Computability and Complexity 23-1 Computability and Complexity Andrei Bulatov Search and Optimization.
Complexity 15-1 Complexity Andrei Bulatov Hierarchy Theorem.
Introduction to Approximation Algorithms Lecture 12: Mar 1.
1 Optimization problems such as MAXSAT, MIN NODE COVER, MAX INDEPENDENT SET, MAX CLIQUE, MIN SET COVER, TSP, KNAPSACK, BINPACKING do not have a polynomial.
Vertex Cover, Dominating set, Clique, Independent set
1 Vertex Cover Problem Given a graph G=(V, E), find V' ⊆ V such that for each edge (u, v) ∈ E at least one of u and v belongs to V’ and |V’| is minimized.
The Theory of NP-Completeness
Computability and Complexity 24-1 Computability and Complexity Andrei Bulatov Approximation.
Job Scheduling Lecture 19: March 19. Job Scheduling: Unrelated Multiple Machines There are n jobs, each job has: a processing time p(i,j) (the time to.
1 Separator Theorems for Planar Graphs Presented by Shira Zucker.
On the Task Assignment Problem : Two New Efficient Heuristic Algorithms.
Linear Programming and Parameterized Algorithms. Linear Programming n real-valued variables, x 1, x 2, …, x n. Linear objective function. Linear (in)equality.
The Theory of NP-Completeness 1. Nondeterministic algorithms A nondeterminstic algorithm consists of phase 1: guessing phase 2: checking If the checking.
Approximation schemes Bin packing problem. Bin Packing problem Given n items with sizes a 1,…,a n  (0,1]. Find a packing in unit-sized bins that minimizes.
1 Introduction to Approximation Algorithms. 2 NP-completeness Do your best then.
Advanced Algorithm Design and Analysis (Lecture 13) SW5 fall 2004 Simonas Šaltenis E1-215b
Closest String with Wildcards ( CSW ) Parameterized Complexity Analysis for the Closest String with Wildcards ( CSW ) Problem Danny Hermelin Liat Rozenberg.
TECH Computer Science NP-Complete Problems Problems  Abstract Problems  Decision Problem, Optimal value, Optimal solution  Encodings  //Data Structure.
Approximation Algorithms
Week 10Complexity of Algorithms1 Hard Computational Problems Some computational problems are hard Despite a numerous attempts we do not know any efficient.
Complexity 25-1 Complexity Andrei Bulatov Counting Problems.
Princeton University COS 423 Theory of Algorithms Spring 2001 Kevin Wayne Approximation Algorithms These lecture slides are adapted from CLRS.
1 Approximate Algorithms (chap. 35) Motivation: –Many problems are NP-complete, so unlikely find efficient algorithms –Three ways to get around: If input.
CSCI 3160 Design and Analysis of Algorithms Chengyu Lin.
Unit 9: Coping with NP-Completeness
Approximation Algorithms for Graph Homomorphism Problems Chaitanya Swamy University of Waterloo Joint work with Michael Langberg and Yuval Rabani Open.
Non-Approximability Results. Summary -Gap technique -Examples: MINIMUM GRAPH COLORING, MINIMUM TSP, MINIMUM BIN PACKING -The PCP theorem -Application:
Minicourse on parameterized algorithms and complexity Part 4: Linear programming Dániel Marx (slides by Daniel Lokshtanov) Jagiellonian University in Kraków.
Flipping letters to minimize the support of a string Giuseppe Lancia, Franca Rinaldi, Romeo Rizzi University of Udine.
Strings Basic data type in computational biology A string is an ordered succession of characters or symbols from a finite set called an alphabet Sequence.
1 Approximation algorithms Algorithms and Networks 2015/2016 Hans L. Bodlaender Johan M. M. van Rooij TexPoint fonts used in EMF. Read the TexPoint manual.
On the Hardness of Optimal Vertex Relabeling and Restricted Vertex Relabeling Amihood Amir Benny Porat.
NP Completeness Piyush Kumar. Today Reductions Proving Lower Bounds revisited Decision and Optimization Problems SAT and 3-SAT P Vs NP Dealing with NP-Complete.
The Theory of NP-Completeness 1. Nondeterministic algorithms A nondeterminstic algorithm consists of phase 1: guessing phase 2: checking If the checking.
C&O 355 Lecture 19 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A A A A A A A A.
TU/e Algorithms (2IL15) – Lecture 12 1 Linear Programming.
Kernel Bounds for Path and Cycle Problems Bart M. P. Jansen Joint work with Hans L. Bodlaender & Stefan Kratsch September 8 th 2011, Saarbrucken.
Instructor: Shengyu Zhang 1. Optimization Very often we need to solve an optimization problem.  Maximize the utility/payoff/gain/…  Minimize the cost/penalty/loss/…
Approximation algorithms
The Theory of NP-Completeness
Mathematical Foundations of AI
Optimization problems such as
Approximate Algorithms (chap. 35)
Approximation algorithms
Vertex Cover, Dominating set, Clique, Independent set
Computability and Complexity
ICS 353: Design and Analysis of Algorithms
Constrained Bipartite Vertex Cover: The Easy Kernel is Essentially Tight Bart M. P. Jansen June 4th, WORKER 2015, Nordfjordeid, Norway.
Computational Complexity
Introduction to Algorithms Second Edition by
Chapter 34: NP-Completeness
On the k-Closest Substring and k-Consensus Pattern Problems
Problem Solving 4.
The Theory of NP-Completeness
More NP-Complete Problems
Instructor: Aaron Roth
Lecture 24 Vertex Cover and Hamiltonian Cycle
Presentation transcript:

ON THE EFFICIENCY OF THE HAMMING C-CENTERSTRING PROBLEMS Amihood Amir Liam Roditty Jessica Ficler Oren Sar Shalom

Motivation – the Conference Location Problem

Consensus String Problem Output: Find a point whose maximum Distance from all points is smallest Input: points in space.

Hamming Distance

Consensus String Problem (1-HRC)

History: Frances and Litman [1997]: Problem is NP -complete even for binary alphabets Therefore: 3 directions. 1.Solution for small k. 2.Fixed parameter tractability. 3.Approximation algorithms.

History: Solution for small k: Gramm, Niedermeier, and Rossmanith [2001] (3) Boucher, Brown, and Durocher [2008] (4 binary) A., Landau, Na, Park, Park, and Sim [2009] (3, radius & dist. sum optimization) A., Paryenty, and Roditty [2012] (5 binary, l 2 for all k: l k )

History: Fixed Parameter Tractability for all Parameters: Fixed l : Ben-Dor, Lancia, Perone, and Ravi [1997] Fixed k: Gramm, Niedermeier, and Rossmanith [2003] Fixed d: Sojanovic, Berman, Gumucio, Hardison, and Miller [1997] Lanctot, Li, Ma, Wang, and Zhang [1999] Sze, Lu, and Chen [2004]

History: Approximations: PTAS: Li, Ma, and Wang [2002] – not practical. Rounded LP: Ben-Dor, Lancia, Perone, and Ravi [1997] large number of variables: |Σ| l Chimani, Woste, and Bocker [2011]: can be reduced to: |Σ|( l -1) A., Paryenty, and Roditty [2011]: |T(S)| |Σ| (T(S)= set of column types)

Another Motivation – Clustering. The C-CenterStrings problem Input: 1.Points in space 2.Number c 3.Objective function f. Output: Divide the points to c sets such that for the c consensus strings c 1,c 2,…,c c, f(c 1,c 2,…,c c ) is maximum/minimum.

Three Types of Objective functions: Let HRC (Hamming Radius Clustering) be the consensus string problem defined before. 1. c-HRC: partition into c sets, each of which has center with radius d. 2. c-HRLC: partition into c sets, each of which has center with radius d, but center is part of input set. 3. c-HRSC: partition into c sets, each of which has a center and the sum of the radii does not exceed d.

The Hamming radius c-clustering problem (c-HRC) Example: For the following strings and d=1, we show it belongs to 2-HRC.

The Hamming radius local c-clustering problem (c-HRLC) Example: For the following strings and d=2, we show it belongs to 2- HRLC. Does it belong to 2-HRLC when d=1 ?

The Hamming radius c-clustering sum problem (c-HRSC) Example: For the following strings and d=2, we show it belongs to 2-HRC.

In this Paper: We consider: 1. Parametetrized Complexity, and 2. Approximations Small k is not too meaningful in the context of clustering.

C-CenterString Parameterized Complexity c Fixed k Fixed d Fixed (d=1) d/l and c Fixed l Fixed (l=2) HRC NPC polynomial time NPC polynomial time? HRLC polynomial time ? NPC HRSC NPC polynomial time? ?

Theorem: HRC,HRLC and HRSC can be solved in polynomial time for fixed k. If k≤c then input strings can be assigned to c centers where d=0. Otherwise c<k. There are c k <k k options for partitioning k strings to c sets. - For each set, find the consensus center in polynomial time. - The partition that gives the best result is the optimal solution.

C-CenterString Parameterized Complexity c Fixed k Fixed d Fixed (d=1) d/l and c Fixed l Fixed (l=2) HRC NPC polynomial time NPC polynomial time? HRLC polynomial time ? NPC HRSC NPC polynomial time? ?

Theorem: HRC is NP complete even if the radius is fixed to d = 1. d = 1 and the alphabet is binary By reduction from Vertex Cover For Triangle-Free Graphs Our input: G - Triangle-Free Graph t – size of vertex-cover set

The construction: The c parameter is t. The distance parameter d is Encode edges as bit strings of length |V|. Set the bits of the vertices on the two sides of the edge.

000????

C-CenterString Parameterized Complexity c Fixed k Fixed d Fixed (d=1) d/l and c Fixed l Fixed (l=2) HRC NPC polynomial time NPC polynomial time? HRLC polynomial time ? NPC HRSC NPC polynomial time? ?

Theorem: HRLC is NP complete even if the length is fixed to l =2 We prove by reduction from Minimum Maximal Matching for Bipartite graphs Our input: G – Bipartite Graph t – size of the minimal set that is maximal matching Maximal Matching Minimum Maximal Matching

The construction: The c parameter is t. The distance parameter d is

3254

Move strings [6,2] and [5,2] if there are centers begins in 5 or Change the center to one of the remaining strings We keep going until there are no two centers with common symbol !

Approximation Algorithms 1. A linear-time 4-Approximation for the 2-HRSC problem. 2. A polynomial time 3-Approximation for the 2-HRSC problem. 3. Special case PTAS – by computing the clusters and doing 1-HRC approximation on each cluster.

>2d Lemma

Proof center

If we had a representative from each cluster we can associate the rest of the strings to the appropriate group Now use a known approximation algorithm of 1-HRC, for finding the consensus strings of each cluster >2d

>4d Lemma Cluster c-center

Proof ≤d

Polynomial time approximation algorithm for 2-HRSC problem

Future work 1.We presented a heuristic algorithm that did very well in practice – what is its approximation ratio? 2. There are some gaps in the parameterized complexity table: a. What happens in the HRLC/HRSC cases for fixed d? b. What happens in the HRC/HRSC cases for fixed l? 3. Is there a PTAS for c-HRC? 4. Can we approximate c-HRC using LP? SDP?