Speaker: Chuang-Chieh Lin National Chung Cheng University Three Theorems Regarding Testing Graph Properties: The first theorem: there exists a monotone graph property which is hard to test Oded Goldreich and Luca Trevisan Random Structures and Algorithms, Vol. 23, 2003, pp. 2357. Speaker: Chuang-Chieh Lin National Chung Cheng University 2018/9/17
Computation Theory Lab, CSIE, CCU, Taiwan Outline Graph properties Testers for graph properties Monotone graph properties The first theorem Tools used for the first theorem Idea of the lower bound construction References 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan For any natural number n, we define [n] = {1, …, n}. We consider finite, undirected, labeled graphs without parallel edges. WLOG, all N-vertex graphs have [N] as their vertex set, and their edges are unordered pairs over [N]. 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan Graph property A graph property is a predicate defined over graphs that is preserved under graph isomorphism. That is, if G has property and G is isomorphic to G then G has property . 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Graph property (contd.) We say that G = ([N], E) is -close to having property if there exists a graph G = ([N], E) having property such that the symmetric difference between E and E is at most We say that a graph G is -far from having property if it is NOT -close to having property . ² ¡ N 2 ¢ . AB = (AB)(BA) 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Symmetric difference between two graphs The symmetric difference between G1 and G2 is 3. 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Testers for graph properties Testers are oracle machines that are given as input a pair (N, ). N is a size parameter > 0 is a distance parameter, oracle access to the adjacency matrix of an N-vertex graph. Oracle: It can be visualized as a Turing machine with a black box, called an oracle, which is able to decide certain decision problems in a single step. 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Testers for graph properties (contd.) An oracle machine T is called a tester for property if for every graph G = ([N], E) and every , the following two conditions hold: 1 . I f G h a s p r o e t y Q , n P [ T ( N ; ² ) = ] ¸ 2 3 i - m v g · 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Testers for graph properties (contd.) A typical query (u, v) to oracle G is answer 1 iff the edge (u, v) is in the graph G. The tester T (for ) is said to be one-sided error if it always accepts graphs having the property . 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Testers for graph properties (contd.) The query complexity of a tester T is a function q: N [0,1] N such that q(N, ) is an upper bound on the number of queries made by T on input (N, ) and oracle access to the adjacency predicate of any N-vertex graph. The query complexity of a property is the minimum query complexity of testers for . 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Monotone graph properties In this paper, the monotone graph property is defined as follows. A graph property is called monotone if adding any edge to any graph that has property results in a graph that has property . Connectivity, Clique,, … are monotone. Triangle-free, Chordality, bipartiteness, … are NOT monotone. 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan By saying that a graph property is in NP, we mean that the problem of deciding whether a given graph has property is in NP. 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan The first theorem There exists a monotone graph property in NP for which every tester requires (N2) queries (even when invoked with constant distance parameter). 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Tools used for the first theorem We use efficient constructions of small-bias probability spaces. An -biased sample space over {0, 1}n, is a multiset S such that, for every nonempty set I [n], if s = s1 …sn is selected uniformly in S, then ¯ P r [ © i 2 I s = 1 ] ¡ · ² . 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan -biased sample space Such sample spaces can be constructed in time poly(n/); specifically, |S| = (n/)2 suffices. 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan a 0,1 string s of length 5: 1 Let Ai be an event such that s has i 1’s. Assume that |s| = 5. We have the probability that s has odd 1’s is P r [ A 1 3 5 ] = µ ¶ ( 2 ) 4 + ¢ : 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan a 0,1 string s of length 6: 1 Assume that |s| = 6. The probability that s has odd 1’s is still P r [ A 1 3 5 ] = µ 6 ¶ ( 2 ) + ¢ : 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan Actually, we can easily show by induction that the probability of having odd 1’s in a 0,1-string is ½, so is the case for even 1’s. 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan Fact 1 L e t ² ¸ , 2 [ n ] a d S b - i s m p l c o v r f ; 1 g . T h y u I ® = : ¯ P ( 8 ) ¡ · 很像”不公平”的”樂透” 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan Fact 2 可用 Chebyshev’s inequality 證明 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Idea of the lower-bound construction 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan Constructing a 0.12-t-biased sample space over -bit long strings, where t = N2/200. S u c h s a m p l e o f i z O ( ¡ N 2 ¢ = : 1 t ) + d x . ¡ N 2 ¢ B Omit from the sample space any sample that has less than on third of one-entries. C For each sample s in the residual space, we define a graph Gs = ([N], Es) by letting (i, j) Es iff (i, j)th bit of s equals 1. 關鍵在A的 t 值,使得 bias 很小,讓 testers 花\Omega(N^2)才能 test We call these 22t+o(t) graphs the basic graphs. 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan (1,2) (1,3) (1,4) (1,5) (2,3) (2,4) (2,5) (3,4) (3,5) (4,5) ¡ 5 2 ¢ b i t s 1 1 2 3 4 5 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan For each sample s in the residual space, we define a graph Gs = ([N], Es) by letting (i, j) Es iff (i, j)th bit of s equals 1. We call these 22t+o(t) graphs the basic graphs. D For every s in the sample space (equivalently, every basic graph Gs) and every permutation over [N], we consider the secondary graph Gs, = ([N], Es,) that is defined so that ((u), (v)) Es, iff (u, v) Es. By construction, the set of secondary graphs are closed under graph isomorphism, and so this collection does constitute a graph property. 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan D For every s in the sample space (equivalently, every basic graph Gs) and every permutation over [N], we consider the secondary graph Gs, = ([N], Es,) that is defined so that ((u), (v)) Es, iff (u, v) Es. By construction, the set of secondary graphs are closed under graph isomorphism, and so this collection does constitute a graph property. E For every secondary graph G= ([N], E), and every E E, we introduce the final graph G= ([N], E) 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan Omit from the sample space any sample that has less than on third of one-entries. At this point, the reason for the modification in the initial sample space may be clear. If, for example, the sample space had contained the all-zero string, then the set of final graphs would have contained all graphs, and testing membership in it would have been trivial. 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan A simple observation Question: How many different simple graphs of N vertices? Another similar question: How many Boolean functions with n variables? 2 ( N ) = n ¡ 1 2 n 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan : final graphs : a random graph 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan The aim of the proof Our aim is to show that, although a random graph is far from the set of final graphs, no algorithm that make o(N2) queries can distinguish a random graph from a graph selected among the final graphs (with some distribution that is not necessarily uniform). Since a tester for the set of final graphs must accept any final graph (with high probability), we conclude that such tester must make (N2) queries. 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
The aim of the proof (contd.) Specifically, throughout the rest of the analysis, we refer to testers of N-vertex graphs that should : accept with probability at least 2/3 every graph that has the property, and reject with probability at least 2/3 every graph that is 0.1-far from having the property. Thus, we omit (fix) the distance parameter (to be 0.1). 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan Claim 1 The probability that a random graph is 0.1-close to some final graph is at most 0.1. 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan Claim 2 Let M be a probabilistic oracle machine that makes at most t queries. Let RN denote a random graph, and BN denote a graph uniformly selected among the basic graphs. Then, ¯ P r [ M R N ( ) = 1 ] ¡ B < : 2 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan The first theorem There exists a monotone graph property in NP such that the query complexity of , denoted by q, satisfies q (N, 0.1) = (N2). 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan Proof of Theorem 1 Consider the graph property, denoted by , corresponding to the set of final graphs denoted above. Suppose that M is a tester for this property and that M makes less than N2/200 queries (invoked with distance parameter 0.1) Then by Claim 2, we have (*) ¯ P r [ M R N ( ) = 1 ] ¡ B < : 2 . 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Proof of Theorem 1 (contd.) Since each graph in the support of BN (i.e., each basic graph) has property , the tester must accept such graph with probability at least 2/3. The tester may accept with probability at most 1/3 each graph that is 0.1-far from having property is at least 0.9 (denote it by y). By Claim 1, we have (**) P r [ M B N ( ) = 1 ] ¸ 2 3 > : 6 . P r [ M R N ( ) = 1 ] · y ¢ 3 + ¡ : 9 4 (***) 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Proof of Theorem 1 (contd.) (**) (***) contradicts to (*). Hence the theorem follows. 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan Recall to Claim 1: The probability that a random graph is 0.1-close to some final graph is at most 0.1. 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan Proof of Claim 1 The key observation is that the set of final graphs is very sparse. Each basic graph gives rise to at most N! secondary graphs. Each secondary graph gives rise to at most final graphs. 2 = 3 ( N ) < 2 N = 3 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan Proof of Claim 1 (contd.) Thus, the number of final graphs is at most where the above holds for all sufficiently large N. My idea: For each final graph G, we have at most graphs which are 0.1-close to G. 2 t + o ( ) ¢ N ! = 3 < : 7 2 : 1 ( N ) 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan Proof of Claim 1 (contd.) Thus for all sufficiently large N, we can show that less than 10% of the graphs are 0.1-close to some final graph. 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan Recall to Claim 2: Let M be a probabilistic oracle machine that makes at most t queries. Let RN denote a random graph, and BN denote a graph uniformly selected among the basic graphs. Then, ¯ P r [ M R N ( ) = 1 ] ¡ B < : 2 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan Proof of Claim 2 We identify -bit long strings with N-vertex graphs. They are obtained as in the first stage of the construction. Let GN denote a graph uniformly selected among all graphs in the sample space without discarding from the space those samples having less than on third of one-entries (i.e., less than edges) Thus, BN is obtained from GN by conditioning that GN has at least edges. ¡ N 2 ¢ 1 3 ¡ N 2 ¢ 1 3 ¡ N 2 ¢ 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan Proof of Claim 2 (contd.) By Fact 2: The probability that an element in it has less than one third of one-entries is very small (e.g., tends to 0 when N ) 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan Proof of Claim 2 (contd.) It follows that (i) ¯ P r [ M G N ( ) = 1 ] ¡ B < : . 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan Proof of Claim 2 (contd.) The sample space underlying the construction of GN has bias at most 0.1 2t. It follows by Fact 1 that, for every t distinct unordered pairs (u1, v1), ... , (ut , vt) [N][N] and every 12 …t {0.1}t, the probability that for every i query (ui,vi) to GN is answered by i with probability within 2t 0.1 2t. 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan Proof of Claim 2 (contd.) Observe that the internal coins of M together with the oracle answers to M determine oracle queries of M. For any fixed sequent of coins for M, any fixed sequence of t answers occurs with probability within 2t 0.1 2t under GN (rather than with probability 2t under RN). Thus, for any fixed sequence of coins for M, the observed deviation of the t answers of GN from the t answers of RN is at most 0.1. The internal coins of M here are fair. 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Computation Theory Lab, CSIE, CCU, Taiwan Proof of Claim 2 (contd.) Thus we have Combing the result of (i): the claim follows (by triangular inequality). (ii) ¯ P r [ M G N ( ) = 1 ] ¡ R < : . (i) ¯ P r [ M G N ( ) = 1 ] ¡ B < : . 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan
Thank you.
Computation Theory Lab, CSIE, CCU, Taiwan References [AS05] N. Alon and A. Shapira: Every monotone graph property is testable. STOC’05. [AGHP92] N. Alon, O. Goldreich, J. Hastad, and R. Peralta: Simple constructions of almost k-wise independent random variables. J. Random Struct. Alg., 3(3) (1992), pp. 289304. [AK02] N. Alon and M. Krivelevich. Testing k-colorability. SIAM J. Discrete Math., 15(2) (2002), pp. 211227. [GGR98] O. Goldreich, S. Goldwasser, and D. Ron: Property testing and its connection to learning and approximation. J. ACM, 45(4) (1998), pp. 653750. [GT03] O. Goldreich and L. Trevisan: Three theorems regarding testing graph properties. Random Strut. Alg., 23 (2003), pp. 2357. [NN93] J. Naor and M. Naor: Small-bias probability spaces: Efficient constructions and applications. SIAM J. Comput., 22 (1993), pp. 838856. 2018/9/17 Computation Theory Lab, CSIE, CCU, Taiwan