Approximating the Distance to Properties in Bounded-Degree and Sparse Graphs Sharon Marko, Weizmann Institute Dana Ron, Tel Aviv University.

Slides:



Advertisements
Similar presentations
Lower Bounds for Additive Spanners, Emulators, and More David P. Woodruff MIT and Tsinghua University To appear in FOCS, 2006.
Advertisements

Hardness of testing 3- colorability in bounded degree graphs Andrej Bogdanov Kenji Obata Luca Trevisan.
Finding Cycles and Trees in Sublinear Time Oded Goldreich Weizmann Institute of Science Joint work with Artur Czumaj, Dana Ron, C. Seshadhri, Asaf Shapira,
Deterministic vs. Non-Deterministic Graph Property Testing Asaf Shapira Tel-Aviv University Joint work with Lior Gishboliner.
Approximation, Chance and Networks Lecture Notes BISS 2005, Bertinoro March Alessandro Panconesi University La Sapienza of Rome.
1 The Monte Carlo method. 2 (0,0) (1,1) (-1,-1) (-1,1) (1,-1) 1 Z= 1 If  X 2 +Y 2  1 0 o/w (X,Y) is a point chosen uniformly at random in a 2  2 square.
Approximating Average Parameters of Graphs Oded Goldreich, Weizmann Institute Dana Ron, Tel Aviv University.
A Randomized Linear-Time Algorithm to Find Minimum Spanning Trees David R. Karger David R. Karger Philip N. Klein Philip N. Klein Robert E. Tarjan.
1. Given a predetermined property and a graph we want to distinguish between the 2 cases: 1)The graph has the property 2) The graph is “far” from having.
Christian Sohler | Every Property of Hyperfinite Graphs is Testable Ilan Newman and Christian Sohler.
Artur Czumaj Dept of Computer Science & DIMAP University of Warwick Testing Expansion in Bounded Degree Graphs Joint work with Christian Sohler.
Oded Goldreich Shafi Goldwasser Dana Ron February 13, 1998 Max-Cut Property Testing by Ori Rosen.
Proximity Oblivious Testing Oded Goldreich Weizmann Institute of Science Joint work with Dana Ron.
Mycielski’s Construction Mycielski’s Construction: From a simple graph G, Mycielski’s Construction produces a simple graph G’ containing G. Beginning with.
Testing the Diameter of Graphs Michal Parnas Dana Ron.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 8 May 4, 2005
Some Techniques in Property Testing Dana Ron Tel Aviv University.
Approximation Algorithm: Iterative Rounding Lecture 15: March 9.
Proximity Oblivious Testing Oded Goldreich Weizmann Institute of Science Joint work with Dana Ron.
Definition Hamiltonian graph: A graph with a spanning cycle (also called a Hamiltonian cycle). Hamiltonian graph Hamiltonian cycle.
Testing of Clustering Noga Alon, Seannie Dar Michal Parnas, Dana Ron.
Sublinear Algorithms for Approximating Graph Parameters Dana Ron Tel-Aviv University.
Michael Bender - SUNY Stony Brook Dana Ron - Tel Aviv University Testing Acyclicity of Directed Graphs in Sublinear Time.
Testing Metric Properties Michal Parnas and Dana Ron.
On Proximity Oblivious Testing Oded Goldreich - Weizmann Institute of Science Dana Ron – Tel Aviv University.
CSE 421 Algorithms Richard Anderson Lecture 4. What does it mean for an algorithm to be efficient?
On Testing Convexity and Submodularity Michal Parnas Dana Ron Ronitt Rubinfeld.
1 On the Benefits of Adaptivity in Property Testing of Dense Graphs Joint work with Mira Gonen Dana Ron Tel-Aviv University.
1 Algorithmic Aspects in Property Testing of Dense Graphs Oded Goldreich – Weizmann Institute Dana Ron - Tel-Aviv University.
1 On the Benefits of Adaptivity in Property Testing of Dense Graphs Joint works with Mira Gonen and Oded Goldreich Dana Ron Tel-Aviv University.
Job Scheduling Lecture 19: March 19. Job Scheduling: Unrelated Multiple Machines There are n jobs, each job has: a processing time p(i,j) (the time to.
Lower Bounds for Property Testing Luca Trevisan U C Berkeley.
Christian Sohler 1 University of Dortmund Testing Expansion in Bounded Degree Graphs Christian Sohler University of Dortmund (joint work with Artur Czumaj,
Sublinear Algorithms for Approximating Graph Parameters Dana Ron Tel-Aviv University.
Finding a maximum independent set in a sparse random graph Uriel Feige and Eran Ofek.
Finding Cycles and Trees in Sublinear Time Oded Goldreich Weizmann Institute of Science Joint work with Artur Czumaj, Dana Ron, C. Seshadhri, Asaf Shapira,
Introduction to Graph Theory
Approximating the MST Weight in Sublinear Time Bernard Chazelle (Princeton) Ronitt Rubinfeld (NEC) Luca Trevisan (U.C. Berkeley)
GRAPHS CSE, POSTECH. Chapter 16 covers the following topics Graph terminology: vertex, edge, adjacent, incident, degree, cycle, path, connected component,
Approximating the Minimum Degree Spanning Tree to within One from the Optimal Degree R 陳建霖 R 宋彥朋 B 楊鈞羽 R 郭慶徵 R
Trees and Distance. 2.1 Basic properties Acyclic : a graph with no cycle Forest : acyclic graph Tree : connected acyclic graph Leaf : a vertex of degree.
Edge-disjoint induced subgraphs with given minimum degree Raphael Yuster 2012.
Expanders via Random Spanning Trees R 許榮財 R 黃佳婷 R 黃怡嘉.
Mycielski’s Construction Mycielski’s Construction: From a simple graph G, Mycielski’s Construction produces a simple graph G’ containing G. Beginning with.
Lower Bounds for Property Testing Luca Trevisan U.C. Berkeley.
Testing the independence number of hypergraphs
On Centralized Sublinear Algorithms and Some Relations to Distributed Computing Dana Ron Tel-Aviv University ADGA, October 2015.
Property Testing: Sublinear-Time Approximate Decisions Oded Goldreich Weizmann Institute of Science Talk at CTW, July 2013.
Artur Czumaj DIMAP DIMAP (Centre for Discrete Maths and it Applications) Computer Science & Department of Computer Science University of Warwick Testing.
Trees Thm 2.1. (Cayley 1889) There are nn-2 different labeled trees
Complexity and Efficient Algorithms Group / Department of Computer Science Testing the Cluster Structure of Graphs Christian Sohler joint work with Artur.
CSE 421 Algorithms Richard Anderson Winter 2009 Lecture 5.
Complexity and Efficient Algorithms Group / Department of Computer Science Testing the Cluster Structure of Graphs Christian Sohler joint work with Artur.
Theory of Computational Complexity Probability and Computing Chapter Hikaru Inada Iwama and Ito lab M1.
On Local Partition Oracles and their Applications
On Sample Based Testers
Lower Bounds for Property Testing
On Testing Dynamic Environments
Approximating the MST Weight in Sublinear Time
Finding Cycles and Trees in Sublinear Time
Minimum Spanning Tree 8/7/2018 4:26 AM
From dense to sparse and back again: On testing graph properties (and some properties of Oded)
COMP 6/4030 ALGORITHMS Prim’s Theorem 10/26/2000.
Planarity Testing.
CIS 700: “algorithms for Big Data”
CSCI B609: “Foundations of Data Science”
On the effect of randomness on planted 3-coloring models
Introduction Wireless Ad-Hoc Network
Locality In Distributed Graph Algorithms
Presentation transcript:

Approximating the Distance to Properties in Bounded-Degree and Sparse Graphs Sharon Marko, Weizmann Institute Dana Ron, Tel Aviv University

Distance Approximation Distance approximation is an extension of property testing. Property testing: distinguish between objects (e.g., graphs) that have property P and objects that are far from having property P. Distance approximation: estimate distance of object from having property P. In both cases, algorithm is allowed a small failure probability, and task is performed by querying object, where query complexity should be sublinear in size of object.

Distance Approximation - Background First explicitly studied in [Parnas, R, Rubinfeld] together with related notion of tolerant property testing (distinguish between objects that are  1 -close and  2 -far from property). Problems studied for these extensions:  Monotonicity of functions [PRR], [Ailon, Chazelle, Comandur, Liue].  Clustering of points [PRR].  Tolerant vs. intolerant (standard) testing [Fischer, Fortnow].  Local tolerant testing of codes [Guruswami, Rudra].  Graph properties in dense graphs model [Fischer, Newman].

Distance Approximation in Dense Graphs Theorem [Fischer, Newman] : Every property that has testing algorithm in dense-graphs model whose complexity is function only of distance parameter , has distance approximation algorithm A with additive error  in model, whose complexity is function only of . That is,  P (G)-    A   P (G)+  Dense Graphs Model [Goldreich Goldwasser R]: (graph represented by n x n adjacency matrix) Queries: Is (u,v)  E ? (probe into matrix) Distance: Fraction of (n 2 ) entries in matrix that should be modified to get property 1 u v Complexity may have large dependence on 1/  (e.g., tower) but no dependence on size of graph. dist of G from P output of A [Alon, Shapira, Sudakov] give algorithm and direct analysis of additive approximation for all monotone properties.

Distance Approximation in Sparse Graphs 1 2 … d 1 n Bounded-Degree Graphs Model [Goldreich R]: (graph represented by n incidence lists of size d) Queries: Who is i’th neighbor of v? Distance: Fraction of (nd) entries in lists that should be modified to get property Suitable: (Almost)-regular sparse graphs (in particular, constant-degree graphs) Sparse Graphs Model [Parnas R]: (graph is represented by n incidence lists of varying size) Queries: Who is i’th neighbor of v? Distance: Fraction of (m) edges in graph that should be modified to get property. Suitable: General Sparse Graphs 1 n

Distance Approximation in Sparse Graphs Cont’ Definition: Algorithm A is an - distance approximation algorithm (  ≥ 1) for property P, if for every graph G and any given 0<  <1, it outputs w.h.p an estimate  A s.t.  P (G) -    A    P (G)+  where  P (G) is the distance of graph G from property P. If  = 1 then algorithm is distance approximation algorithm Note: Cannot get only multiplicative error in general in sublinear time. Must allow additive error (or dependence on 1/  P (G) )

Our Results Definition: Algorithm A is an -distance approximation algorithm for property P, if outputs w.h.p an estimate  A s.t.  P (G) -    A    P (G)+   = 1 : distance approximation algorithm poly(k/(  d avg )) 1sparsek-Edge- Connectivity d O(log(d/)) 3bounded- degree Triangle- Freeness * O(1/(  d avg ) 4 ) 1sparseEulerian O(1/  3 ) 1bounded- degree Cycle- Freeness Let d avg = m/n (n: num of vertices, m: num of edges) Property Graph Model  Complexity * Extends to subgraph-freeness

Some Notes on Our Results  Complexity of all algs but tri(sub)-free are poly in complexity of testing algs of [Goldreich R].  All algs but tri(sub)-free have only additive error. Cannot obtain such result for tri-free in poly-time/sublin-queries.  Tri(sub)-free and cycle-free algs are in bounded-degree and not (general) sparse-graphs model. Have  (n 1/2 ) lower bound for them in latter model.  Case of k=1 for connectivity was addressed in [Chazelle, Rubinfeld, Trevisan] as central part of min-span-tree weight approx alg.  Can adapt tri-free alg to get sublinear approx for min-VC size, improving on [Parnas R]. poly(k/( d avg )) 1sparsek-Edge-Conn d O(log(d/)) 3bounded - degree Triangle- Freeness * O(1/( d avg ) 4 ) 1sparseEulerian O(1/ 3 ) 1bounded - degree Cycle- Freeness

Triangle (Subgraph) Freeness Testing algorithm is a simple brute-force algorithm. Its adaptation gives a multiplicative factor of d (degree) error. Hence need different approach. Def2. Two triangles are neighbors if they share an edge. For a triangle , degree of triangle: deg(  ) = number of neighboring triangles Our goal: Estimate C M (G)/(dn) (in sublinear time) Def1. A triangle-cover of graph G is a set of edges whose removal leaves G triangle free. Let C M (G) denote min-size of triangle cover.

Min. Triangle-Cover Approx. Alg 1 (not sub-lin) 1)Let T be set of all triangles in G, TC= initial tri-cover. 2)For i=1 to r =  (log(d/  )) : (a) Select each triangle   T with prob 1/(cdeg(  )). (b) Unselect every two neighboring triangles that were selected. (c) Add all edges of selected triangles to TC. (d) Remove from T all selected triangles and their neighbors and update degrees. 3)Add to TC an edge from every remaining triangle in T. 4)Output TC.

Min. Triangle-Cover Approx. Alg (not sub-lin) Theorem: TC is a triangle cover s.t. w.h.p |TC|  3 C M (G) + (  /2)m Proof:  A triangle-cover by construction.  During the loop: chosen triangles are edge-disjoint thus at most 3 C M (G) edges added to TC in loop.  After the loop? Lemma: Exp[ n i | n i-1 ]  (1- 1/c’) n i-1 Corollary: After r =  (log(d/)) iterations, w.h.p. |T r |  (  /2)m. Since add to TC one edge from each triangle in T r, Theorem follows. Let T i be triangles left in T after i’th iteration, n i = |T i |.

Sublinear Min. Triangle-Cover Approx. Alg 1)Uniformly select s=  (1/ 2 ) vertices. 2)For each vertex v j selected construct (by BFS) subgraph induced by r-neighborhood of v j ( r =  (log(d/  )) ) 3)Run non-sublinear algorithm on union of subgraphs, and let e j be number of edges selected that are incident to v j. 4)Let Ĉ = (n/2s)  j e j and output (1/dn) Ĉ Build on approach from [PR] (for min-VC sublinear alg). Algorithm (non sub-lin) can be viewed as distributed algorithm with r =  (log(d/)) rounds. (Indeed similar to Luby’s O(log n) rounds distributed algorithm for maximal independent set.) Implies that decision on whether or not to include edge in cover depends only on r-neighborhood of edge.

Sublinear Min. Triangle-Cover Approx. Alg Theorem: Algorithm is a 3-distance-approximation algorithm for triangle freeness. Its complexity is d O(log(d/ )). Proof combines error bound of non-sublinear algorithm with sampling error. 1) Uniformly select s=(1/ 2 ) vertices. 2 ) For each vertex v j selected construct (by BFS) subgraph induced by r-neighborhood of v j ( r = (log(d/)) ) 3) Run non-sublinear algorithm on union of subgraphs, and let e j be number of edges selected that are incident to v j. 4) Let Ĉ = (n/2s)  j e j and output (1/dn)Ĉ

k-Connectivity Step 1. Let C(G) = num of connected component in G. Then  1C (G) = (C(G)-1)/m. Step 2. Let n v be num of vertices in connected component of vertex v. Then  vV (1/n v ) = C(G). Step 3. Let t = 4/(  d avg ) and V’  V be vertices that belong to connected components of size at most t. Then  vV’ (1/n v ) ≥ C(G) -  n. Algorithm: 1. Uniformly select  (1/(  d avg ) 2 ) vertices. For each v in sample S finds n v or discovers that v not in V’ (by BFS). 2. Output k=1

k-Connectivity, k >1 First attempt: Build directly on testing algorithm of [GR] – gives factor k multiplicative error (in addition to additive error). The source of multiplicative error: k-connectivity structure used in [GR] (cactus structure [Dinitz] ). Instead: Use different k-connectivity structure of extreme-sets tree/partition [Naor,Gusfield,Martel] + adapt & extend ideas from [GR]. Recall: A graph G is k-edge-connected if there are k edge-disjoint paths between every pair of vertices  all cuts (X,V\X) of size at least k X V\X

k-Connectivity, k >1, cont’ Def1: The degree of a set X, d(X) = num of edges with one end-point in X (size of cut (X,V\X)) Def2: A set X is j-extreme if d(X)=j and  YX, d(Y)>j Def3: The extreme-sets tree of G: - a leaf for every vertex v (a d(v)-extreme set), - root is V (a 0-extreme set), - if j-extreme set Y is node in tree then parent X is minimal extreme set X  Y X Y X ` V ` ` ` V1V1 V2V2 V3V n 3 {1,2,3} ``

k-Connectivity, k >1, cont’ [NGM] showed: Can use extreme-sets tree to define partition of V into extreme sets ES(G)={X 1,X 2,…,X q } s.t.  kC (G) =  i  (X i )/m where  is some (easily computable) demand function. X1X1 X2X2 X3X3 X5X5 X4X4 X6X6 V

k-Connectivity, k >1, cont’ [ NGM] showed: Extreme-sets tree defines partition of V into extreme sets ES(G)={X 1,X 2,…,X q } s.t.  kC (G) =  i (X i )/m Note2: Let X(v) be (unique) set X i in partition ES (t) (G) s.t. vX i then Note1: Can refine ES(G) and get partition ES (t) (G) s.t. (1) |X i |  t for every X i in ES (t) (G) and (2) |  kC (G) -  i  (X i )/m |   /2 for t=4k/(  d avg )

k-Connectivity, k >1, cont’ Let X(v) be (unique) set X i in partition ES (t) (G) s.t. vX i then Algorithm: 1. Uniformly select  ((k/  d avg ) 2 ) vertices. For each v in sample S find (w.h.p.) X(v) and computes  (X(v)) 2. Output Step 1 in Algorithm : Find extreme set of T size at most t that contains X(v) (“random search process” similar to [GR]), construct “extreme-sets sub-tree” of T, which determines X(v) and  (X(v)).

Summary and Open Problems  Give distance approximation algorithms for all properties studied in [GR] (testing of bounded-degree graphs). With exception of triangle(subgraph)-freeness, complexity is polynomial in that of testing algorithms, and have only additive error.  Can complexity of tri-free algorithm be improved? Can we decrease constant multiplicative factor in approximation?  Sublinear distance approximation for bipartiteness in bounded-degree/sparse graphs?  Is there any general relation between testing and distance approximation in bounded-degree/sparse graphs (as there is in dense graphs)?

Our Results Definition: Algorithm A is an -distance approximation algorithm for property P, if outputs w.h.p an estimate  A s.t.  P (G) -    A    P (G)+   = 1 : distance approximation algorithm Let d avg = m/n (n: num of vertices, m: num of edges)  k-Edge-Connectivity in sparse model: dist-approx, complexity poly(k/(  d avg ))  Triangle-Freeness in bounded-degree model: 3-dist-approx, complexity d O(log(d/)) (extends to subgraph-freeness)  Eulerian in sparse model: dist-approx, complexity O(1/(  d avg ) 4 )  Cycle-Freeness in bounded-degree model: dist-approx, complexity O(1/  3 )

Some Notes on Our Results k-Edge-Connectivity: sparse, dist-approx, poly(k/( d avg )) Triangle-Freeness: bounded-degree, 3-dist-approx, d O(log(d/)) Eulerian: sparse, dist-approx, O(1/( d avg ) 4 ) Cycle-Freeness: bounded-degree, dist-approx, O(1/ 3 )  Complexity of all algs but tri(sub)-free are poly in complexity of testing algs of [Goldreich R].  All algs but tri(sub)-free have only additive error. Cannot obtain such result in poly-time / sublinear queries.  Tri(sub)-free and cycle-free algs are in bounded-degree and not (general) sparse-graphs model. Have  (n 1/2 ) lower bound for them in latter model.  Case of k=1 for connectivity was addressed in [Chazelle, Rubinfeld, Trevisan] as central part of min-span-tree weight approx alg.  Can adapt tri-free alg to get sublinear approx for min-VC size, improving on [Parnas R].