Presentation is loading. Please wait.

Presentation is loading. Please wait.

Fourier Analysis, Projections, Influence, Junta, Etc…

Similar presentations


Presentation on theme: "Fourier Analysis, Projections, Influence, Junta, Etc…"— Presentation transcript:

1 Fourier Analysis, Projections, Influence, Junta, Etc…

2 © S.Safra Boolean Functions and Juntas Def: A Boolean function

3 © S.Safra f f * * -1* 1* 11* 11-1* -1-1* -11* -11-1* -111* -1-1-1* -1-11* 111* 1-1* 1-1-1* 1-11* Functions as an Inner-Product Vector-Space f f 2n2n 2n2n * * -1* 1* 11* 11-1* -1-1* -11* -11-1* -111* -1-1-1* -1-11* 111* 1-1* 1-1-1* 1-11*

4 © S.Safra Functions’ Vector-Space A functions f is a vector A functions f is a vector Addition: ‘f+g’(x) = f(x) + g(x) Addition: ‘f+g’(x) = f(x) + g(x) Multiplication by scalar ‘c  f’(x) = c  f(x) Multiplication by scalar ‘c  f’(x) = c  f(x)

5 © S.Safra Variables` Influence The influence of an index i  [n] on a Boolean function f:{1,-1} n  {1,-1} is The influence of an index i  [n] on a Boolean function f:{1,-1} n  {1,-1} is

6 © S.Safra Norms Def: Expectation Norm Def: Expectation Norm Def: Sum Norm Def: Sum Norm

7 © S.Safra Inner-Product A functions f is a vector A functions f is a vector Inner product (normalized) Inner product (normalized)

8 © S.Safra Simple Observations Claims: For any function f whose range is {-1, 0, 1}

9 © S.Safra Monomials What would be the monomials over x  P[n]? What would be the monomials over x  P[n]? All powers except 0 and 1 disappear! All powers except 0 and 1 disappear! Hence, one for each character S  [n] Hence, one for each character S  [n] These are all the multiplicative functions These are all the multiplicative functions

10 © S.Safra Fourier-Walsh Transform Consider all characters Consider all characters Given any function let the Fourier-Walsh coefficients of f be Given any function let the Fourier-Walsh coefficients of f be thus f can described as thus f can described as

11 © S.Safra Fourier Transform: Norm Norm: (Sum) Thm [Parseval]: Hence, for a Boolean f

12 © S.Safra Variables` Influence The influence of an index i  [n] on a Boolean function f:{1,-1} n  {1,-1} is The influence of an index i  [n] on a Boolean function f:{1,-1} n  {1,-1} is Which can be expressed in terms of the Fourier coefficients of f Claim: Which can be expressed in terms of the Fourier coefficients of f Claim:

13 © S.Safra Restriction and Average Def: Let I  [n], x  P([n]\I), the restriction function is Def: the average function is Note: [n] I x y I x y y y y y

14 © S.Safra In Fourier Expansion Prop: Prop: And since the expectation of a function is its coefficient on the empty character: And since the expectation of a function is its coefficient on the empty character: Corollary: Corollary:

15 © S.Safra Expectation and Variance Claim: Claim: Hence, for any f Hence, for any f

16 © S.Safra Average Sensitivity Def: the sensitivity of x w.r.t. f is Def: the sensitivity of x w.r.t. f is Def: the average-sensitivity of f is Def: the average-sensitivity of f is

17 © S.Safra When as(f)=1 Def: a balanced function f is s.t. Def: a balanced function f is s.t. Thm: a balanced, Boolean f s.t. as(f)=1 is a dictatorship Thm: a balanced, Boolean f s.t. as(f)=1 is a dictatorship Proof: observe that and since as(f)=1 it must be that however ||f|| 2 =1 hence Proof: observe that and since as(f)=1 it must be that however ||f|| 2 =1 hence

18 © S.Safra Linear, Boolean Functions Proof(cont): Proof(cont): pick any x; f(x)  {-1, 1} pick any x; f(x)  {-1, 1} Pick {i} with non-zero coefficient Pick {i} with non-zero coefficient Observe that f(x  {i})  {-1, 1} however differ from f(x) Observe that f(x  {i})  {-1, 1} however differ from f(x) Conclusion: Conclusion:

19 © S.Safra Codes and Boolean Functions Def: an m-bit code is a subset of the set of all the m- binary string C  {-1,1} m The distance of a code C, which is the minimum, over all pairs of legal-words (in C), of the Hamming distance between the two words A Boolean function over n binary variables, is a 2 n -bit string Hence, a set of Boolean functions can be considered as a 2 n -bits code

20 © S.Safra Hadamard Code In the Hadamard code the set of legal-words consists of all multiplicative (linear if over {0,1}) functions C={  S | S  [n]} namely all characters

21 © S.Safra Hadamard Test Given a Boolean f, choose random x and y; check that f(x)f(y)=f(xy) Prop(completeness): a legal Hadamard word (a character) always passes this test

22 22 Hadamard Test – Soundness Prop(soundness): Proof:

23 © S.Safra Long-Code In the long-code the set of legal-words consists of all monotone dictatorships This is the most extensive binary code, as its bits represent all possible binary values over n elements

24 © S.Safra Long-Code Encoding an element e  [n] : Encoding an element e  [n] : E e legally-encodes an element e if E e = f e E e legally-encodes an element e if E e = f e F F F F T T T T T T

25 © S.Safra Testing Long-code Def(a long-code list-test): given a code-word f, probe it in a constant number of entries, and accept almost always if f is a monotone dictatorship accept almost always if f is a monotone dictatorship reject w.h.p if f does not have a sizeable fraction of its Fourier weight concentrated on a small set of variables, that is, if  a semi-Junta J  [n] s.t. reject w.h.p if f does not have a sizeable fraction of its Fourier weight concentrated on a small set of variables, that is, if  a semi-Junta J  [n] s.t. Note: a long-code list-test, distinguishes between the case f is a dictatorship, to the case f is far from a junta.

26 © S.Safra Motivation – Testing Long-code The long-code list-test are essential tools in proving hardness results. The long-code list-test are essential tools in proving hardness results. Hence finding simple sufficient-conditions for a function to be a junta is important. Hence finding simple sufficient-conditions for a function to be a junta is important.

27 © S.Safra Perturbation Def: denote by   the distribution over all subsets of [n], which assigns probability to a subset x as follows: independently, for each i  [n], let i  x with probability 1-  i  x with probability 1-  i  x with probability  i  x with probability 

28 © S.Safra Long-Code Test Given a Boolean f, choose random x and y, and choose z   ; check that f(x)f(y)=f(xyz) Prop(completeness): a legal long- code word (a dictatorship) passes this test w.p. 1- 

29 29 Long-code Test – Soundness Prop(soundness): Proof:

30 © S.Safra Variation Def: the variation of f (extension of influence ) Prop: the following are equivalent definitions to the variation of f:

31 © S.Safra Proof Recall Therefore

32 © S.Safra Proof – Cont. Recall Recall Therefore (by Parseval): Therefore (by Parseval):

33 © S.Safra High vs Low Frequencies Def: The section of a function f above k is and the low-frequency portion is

34 © S.Safra Junta Test Def: A Junta test is as follows: A distribution over l queries For each l-tuple, a local-test that either accepts or rejects:T[x 1, …, x l ]: {1, -1} l  {T,F} s.t. for a j-junta f whereas for any f which is not ( , j)-Junta

35 © S.Safra Fourier Representation of influence Proof: consider the I-average function on P[I] which in Fourier representation is and

36 © S.Safra Fourier Representation of influence Proof: consider the influence function which in Fourier representation is and

37 © S.Safra Subsets` Influence Def: The Variation of a subset I  [n] on a Boolean function f is and the low-frequency influence

38 © S.Safra Independence-Test The I-independence-test on a Boolean function f is, for Lemma:

39 © S.Safra

40 Junta Test The junta-size test JT on a Boolean function f is The junta-size test JT on a Boolean function f is Randomly partition [n] to I 1,.., I r Randomly partition [n] to I 1,.., I r Run the independence-test t times on each I h Run the independence-test t times on each I h Accept if ≤j of the I h fail their independence-tests Accept if ≤j of the I h fail their independence-tests For r>>j 2 and t>>j 2 / 

41 © S.Safra Completeness Lemma: for a j-junta f Proof: only those sets which contain an index of the Junta would fail the independence-test

42 © S.Safra Soundness Lemma: Proof: Assume the premise. Fix  <<1/t and let

43 © S.Safra |J| ≤ j Prop: r >> j implies |J| ≤ j Proof: otherwise, J spreads among I h w.h.p. J spreads among I h w.h.p. and for any I h s.t. I h  J ≠  it must be that Variation I h (f) >  and for any I h s.t. I h  J ≠  it must be that Variation I h (f) > 

44 © S.Safra High Frequencies Contribute Little Prop: k >> r log r implies Proof: a character S of size larger than k spreads w.h.p. over all parts I h, hence contributes to the influence of all parts. If such characters were heavy (>  /4), then surely there would be more than j parts I h that fail the t independence-tests

45 © S.Safra Almost all Weight is on J Lemma: Proof: assume by way of contradiction otherwise since for a random partition w.h.p. (Chernoff bound) for every h however, since for any I the influence of every I h would be ≥  /100rk

46 © S.Safra Find the Close Junta Now, since consider the (non Boolean) which, if rounded outside J is Boolean and not more than  far from f

47 © S.Safra Consider the q-biased product distribution  q : Def: The probability of a subset F and for a family of subsets  Consider the q-biased product distribution  q : Def: The probability of a subset F and for a family of subsets  Product, Biased Distribution

48 © S.Safra Beckner/Nelson/Bonami Inequality Def: let T  be the following operator on any f, Prop: Proof:

49 © S.Safra Beckner/Nelson/Bonami Inequality Def: let T  be the following operator on any f, Thm: for any p≥r and  ≤((r-1)/(p-1)) ½

50 © S.Safra Beckner/Nelson/Bonami Corollary Corollary 1: for any real f and 2≥r≥1 Corollary 2: for real f and r>2

51 © S.Safra Average Sensitivity The sum of variables’ influence is referred to as the average sensitivity Which can be expressed by the Fourier coefficients as

52 © S.Safra Freidgut Theorem Thm: any Boolean f is an [ , j]-junta for Proof: 1. Specify the junta J 2. Show the complement of J has little influence

53 © S.Safra Specify the Junta Set k=  (as(f)/  ), and  =2 -  (k) Let We’ll prove: and let hence, J is a [ ,j]-junta, and |J|=2 O(k)

54 © S.Safra High Frequencies Contribute Little Prop: Proof: a character S of size larger than k contributes at least k times the square of its coefficient to the average sensitivity. If such characters were heavy (>  /4), as(f) would have been large

55 © S.Safra Altogether Lemma: Proof:

56 © S.Safra Altogether

57 Biased  q - Influence The  q -influence of an index i  [n] on a boolean function f:P[n]  {1,-1} is The  q -influence of an index i  [n] on a boolean function f:P[n]  {1,-1} is

58 © S.Safra Biased Walsh Product The usual Fourier basis  is not orthogonal with respect to the biased inner-product, The usual Fourier basis  is not orthogonal with respect to the biased inner-product, Hence, we use the Biased Walsh Product: Hence, we use the Biased Walsh Product:

59 © S.Safra Thm [Margulis-Russo]: For monotone f Hence Lemma: For monotone f  > 0,  q  [p, p+  ] s.t. as q (f)  1/  Proof: Otherwise  p+  (f) > 1

60 © S.Safra Proof [Margulis-Russo]:

61 © S.Safra Influential People and Issues The theory of the influence of variables on Boolean functions [BL, KKL] and related issues, has been introduced to tackle social choice problems, furthermore has motivated a magnificent sequence of works, related to Economics [K], percolation [BKS], Hardness of approximation [DS] Revolving around the Fourier/Walsh analysis of Boolean functions… The theory of the influence of variables on Boolean functions [BL, KKL] and related issues, has been introduced to tackle social choice problems, furthermore has motivated a magnificent sequence of works, related to Economics [K], percolation [BKS], Hardness of approximation [DS] Revolving around the Fourier/Walsh analysis of Boolean functions… And the real important question: And the real important question:

62 © S.Safra Where to go for Dinner? Who has suggestions: Each cast their vote in an (electronic) envelope, and have the system decided, not necessarily according to majority… It turns out someone –in the Florida wing- has the power to flip some votes Power influence

63 © S.Safra Voting Systems n agents, each voting either “for” (T) or “against” (F) – a Boolean function over n variables f is the outcome n agents, each voting either “for” (T) or “against” (F) – a Boolean function over n variables f is the outcome The values of the agents (variables) may each, independently, flip with probability The values of the agents (variables) may each, independently, flip with probability It turns out: one cannot design an f that would be robust to such noise -that is, would, on average, change value w.p. < O(1) - unless taking into account only very few of the votes It turns out: one cannot design an f that would be robust to such noise -that is, would, on average, change value w.p. < O(1) - unless taking into account only very few of the votes

64 © S.Safra [n] x I I z Noise-Sensitivity How often does the value of f changes when the input is perturbed? x I I z

65 © S.Safra Def( ,p,x [n] ): Let 0< <1, and x  P([n]). Then y~ ,p,x, if y = (x\I)  z where Def( ,p,x [n] ): Let 0< <1, and x  P([n]). Then y~ ,p,x, if y = (x\I)  z where I~  [n] is a noise subset, and I~  [n] is a noise subset, and z~  p I is a replacement. z~  p I is a replacement. Def( -noise-sensitivity): let 0< <1, then [ When p=½ equivalent to flipping each coordinate in x independently w.p. /2.] [n] x I I z Noise-Sensitivity

66 © S.Safra Noise-Sensitivity – Cont. Advantage: very efficiently testable (using only two queries) by a perturbation-test. Advantage: very efficiently testable (using only two queries) by a perturbation-test. Def (perturbation-test): choose x~  p, and y~ ,p,x, check whether f(x)=f(y) The success is proportional to the noise- sensitivity of f. Def (perturbation-test): choose x~  p, and y~ ,p,x, check whether f(x)=f(y) The success is proportional to the noise- sensitivity of f. Prop: the -noise-sensitivity is given by Prop: the -noise-sensitivity is given by

67 © S.Safra Relation between Parameters Prop: small ns  small high-freq weight Proof: therefore: if ns is small, then Hence the high frequencies must have small weights (as). Prop: small as  small high-freq weight Proof:

68 © S.Safra Main Result Theorem:  constant  >0 s.t. any Boolean function f:P([n])  {-1,1} satisfying is an [ ,j]-junta for j=O(  -2 k 3  2k ). Corollary: fix a p-biased distribution  p over P([n]) Let >0 be any parameter. Set k=log 1- (1/2) Then  constant  >0 s.t. any Boolean function f:P([n])  {-1,1} satisfying is an [ ,j]-junta for j=O(  -2 k 3  2k )

69 © S.Safra First Attempt: Following Freidgut’s Proof Thm: any Boolean function f is an [ ,j]-junta for Proof: 1. Specify the junta where, let k=O(as(f)/  ) and fix  =2 -O(k) 2. Show the complement of J has small variation P([n]) J

70 © S.Safra If k were 1 Easy case (!?!): If we’d have a bound on the non- linear weight, we should be done. The linear part is a set of independent characters (the singletons) In order for those to hit close to 1 or -1 most of the time, they must avoid the Law of Large Numbers, namely be almost entirely placed on one singleton [by Chernoff like bound] Thm[FKN, ext.]: Assume f is close to linear, then f is close to shallow (  a constant function or a dictatorship)

71 © S.Safra How to Deal with Dependency between Characters Recall Recall (theorem’s premise) (theorem’s premise) Idea: Let Partition [n]\J into I 1,…,I r, for r >> k Partition [n]\J into I 1,…,I r, for r >> k w.h.p f I [x] is close to linear (low freq characters intersect I expectedly by  1 element, while high-frequency weight is low). w.h.p f I [x] is close to linear (low freq characters intersect I expectedly by  1 element, while high-frequency weight is low). P([n]) J I1I1 I2I2 IrIr I

72 © S.Safra Shallow Function Def: a function f is linear, if only singletons have non-zero weight Def: a function f is linear, if only singletons have non-zero weight Def: a function f is shallow, if f is either a constant or a dictatorship. Def: a function f is shallow, if f is either a constant or a dictatorship. Claim: Boolean linear functions are shallow. Claim: Boolean linear functions are shallow. 0123kn0123kn weight Character size

73 © S.Safra Boolean Linear  Shallow Claim: Boolean linear functions are shallow. Claim: Boolean linear functions are shallow. Proof: let f be Boolean linear function, we next show: Proof: let f be Boolean linear function, we next show: 1.  {i o } s.t. (i.e. ) 2. And conclude, that eitheror i.e. f is shallow

74 © S.Safra Claim 1 Claim 1: let f be Boolean linear function, then  {i o } s.t. Claim 1: let f be Boolean linear function, then  {i o } s.t. Proof: w.l.o.g assume Proof: w.l.o.g assume for any z  {3,…,n} consider x 00 =z, x 10 =z  {1}, x 01 =z  {2}, x 11 =z  {1,2} for any z  {3,…,n} consider x 00 =z, x 10 =z  {1}, x 01 =z  {2}, x 11 =z  {1,2} then. then. Next value must be far from {-1,1} Next value must be far from {-1,1} A contradiction! (boolean function) A contradiction! (boolean function) Therefore Therefore 1 ?

75 © S.Safra Claim 1 Claim 1: let f be Boolean linear function, then  {i o } s.t. Claim 1: let f be Boolean linear function, then  {i o } s.t. Proof: w.l.o.g assume Proof: w.l.o.g assume for any z  {3,…,n} consider x 00 =z, x 10 =z  {1}, x 01 =z  {2}, x 11 =z  {1,2} for any z  {3,…,n} consider x 00 =z, x 10 =z  {1}, x 01 =z  {2}, x 11 =z  {1,2} then. then. But this is impossible as f(x 00 ),f(x 10 ),f(x 01 ), f(x 11 )  {-1,1}, hence their distances cannot all be >0 ! But this is impossible as f(x 00 ),f(x 10 ),f(x 01 ), f(x 11 )  {-1,1}, hence their distances cannot all be >0 ! Therefore. Therefore. 1 ?

76 © S.Safra Claim 2 Claim 2: let f be Boolean function, s.t. Then eitheror Claim 2: let f be Boolean function, s.t. Then eitheror Proof: consider f(  ) and f(i 0 ): Proof: consider f(  ) and f(i 0 ): Then Then but f is Boolean, hence but f is Boolean, hence therefore therefore 1 0

77 © S.Safra Linearity and Dictatorship Prop: Let f be a balanced linear Boolean function then f is a dictatorship. Proof: f(  ),f(i 0 )  {-1,1}, hence Prop: Let f be a balanced Boolean function s.t. as(f)=1, then f is a dictatorship. Proof:, but f is balanced, (i.e. ), therefore f is also linear.

78 © S.Safra Proving FKN: almost-linear  close to shallow Theorem: Let f:P([n])   be linear, Theorem: Let f:P([n])   be linear, Let Let let i 0 be the index s.t. is maximal let i 0 be the index s.t. is maximalthen Note: f is linear, hence w.l.o.g., assume i 0 =1, then all we need to show is: We show that in the following claim and lemma. Note: f is linear, hence w.l.o.g., assume i 0 =1, then all we need to show is: We show that in the following claim and lemma.

79 © S.Safra Corollary Corollary: Let f be linear, and then  a shallow boolean function g s.t. Corollary: Let f be linear, and then  a shallow boolean function g s.t. Proof: let, let g be the boolean function closest to l. Then, this is true, as Proof: let, let g be the boolean function closest to l. Then, this is true, as is small (by theorem), is small (by theorem), and additionallyis small, since and additionallyis small, since

80 © S.Safra Claim 1 Claim 1: Let f be linear. w.l.o.g., assume then  global constant c=min{p,1-p} s.t. Claim 1: Let f be linear. w.l.o.g., assume then  global constant c=min{p,1-p} s.t. {} {1} {2} {i}{n} {1,2} {1,3}{n-1,n}S{1,..,n} weight Characters Each of weight no more than c 

81 © S.Safra Proof of Claim1 Proof: assume Proof: assume for any z  {3,…,n}, consider x 00 =z, x 10 =z  {1}, x 01 =z  {2}, x 11 =z  {1,2} for any z  {3,…,n}, consider x 00 =z, x 10 =z  {1}, x 01 =z  {2}, x 11 =z  {1,2} then then Next value must be far from {-1,1} ! Next value must be far from {-1,1} ! A contradiction! (to ) A contradiction! (to ) 1 ?

82 © S.Safra Where to go for Dinner? Who has suggestions: Each cast their vote in an (electronic) envelope, and have the system decided, not necessarily according to majority… It turns out someone –in the Florida wing- has the power to flip some votes Power influence Of course they’ll have to discuss it over dinner….


Download ppt "Fourier Analysis, Projections, Influence, Junta, Etc…"

Similar presentations


Ads by Google