1 Zero-sum square matrices Paul Balister Yair Caro Cecil Rousseau Raphael Yuster.

Slides:



Advertisements
Similar presentations
5.4 Basis And Dimension.
Advertisements

5.1 Real Vector Spaces.
1 Decomposing Hypergraphs with Hypertrees Raphael Yuster University of Haifa - Oranim.
Applied Informatics Štefan BEREŽNÝ
On the Density of a Graph and its Blowup Raphael Yuster Joint work with Asaf Shapira.
Transformations We want to be able to make changes to the image larger/smaller rotate move This can be efficiently achieved through mathematical operations.
Information and Coding Theory
Bounds on Code Length Theorem: Let l ∗ 1, l ∗ 2,..., l ∗ m be optimal codeword lengths for a source distribution p and a D-ary alphabet, and let L ∗ be.
Basic properties of the integers
The Structure of Polyhedra Gabriel Indik March 2006 CAS 746 – Advanced Topics in Combinatorial Optimization.
Math 3121 Abstract Algebra I
Ramseyan Theorems for Numbers. page 2Ramseyan Theorems for Numbers Contents 1.Sum-Free Sets 2.Zero-Sum Sets 3.Szemerédi’s Cube Lemma.
The number of edge-disjoint transitive triples in a tournament.
Chapter 5 Orthogonality
Totally Unimodular Matrices Lecture 11: Feb 23 Simplex Algorithm Elliposid Algorithm.
Online Graph Avoidance Games in Random Graphs Reto Spöhel Diploma Thesis Supervisors: Martin Marciniszyn, Angelika Steger.
Job Scheduling Lecture 19: March 19. Job Scheduling: Unrelated Multiple Machines There are n jobs, each job has: a processing time p(i,j) (the time to.
Linear Equations in Linear Algebra
Arithmetic Operations on Matrices. 1. Definition of Matrix 2. Column, Row and Square Matrix 3. Addition and Subtraction of Matrices 4. Multiplying Row.
Separate multivariate observations
Induction and recursion
Linear Algebra With Applications by Otto Bretscher. Page The Determinant of any diagonal nxn matrix is the product of its diagonal entries. True.
Set theory Sets: Powerful tool in computer science to solve real world problems. A set is a collection of distinct objects called elements. Traditionally,
Definitions from the Text, sections 1.1 – 1.4
C&O 355 Mathematical Programming Fall 2010 Lecture 17 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A.
The Hat Game 11/19/04 James Fiedler. References Hendrik W. Lenstra, Jr. and Gadiel Seroussi, On Hats and Other Covers, preprint, 2002,
 Row and Reduced Row Echelon  Elementary Matrices.
Systems of Linear Equation and Matrices
Diophantine Approximation and Basis Reduction
Linear Algebra Chapter 4 Vector Spaces.
Great Theoretical Ideas in Computer Science.
Multivariate Statistics Matrix Algebra II W. M. van der Veld University of Amsterdam.
Information and Coding Theory Linear Block Codes. Basic definitions and some examples. Juris Viksna, 2015.
2.4 Sequences and Summations
Simplex method (algebraic interpretation)
WEEK 8 SYSTEMS OF EQUATIONS DETERMINANTS AND CRAMER’S RULE.
Edge-disjoint induced subgraphs with given minimum degree Raphael Yuster 2012.
Basic Concepts in Number Theory Background for Random Number Generation 1.For any pair of integers n and m, m  0, there exists a unique pair of integers.
Matrices CHAPTER 8.1 ~ 8.8. Ch _2 Contents  8.1 Matrix Algebra 8.1 Matrix Algebra  8.2 Systems of Linear Algebra Equations 8.2 Systems of Linear.
Set, Combinatorics, Probability & Number Theory Mathematical Structures for Computer Science Chapter 3 Copyright © 2006 W.H. Freeman & Co.MSCS Slides Set,
The Integers. The Division Algorithms A high-school question: Compute 58/17. We can write 58 as 58 = 3 (17) + 7 This forms illustrates the answer: “3.
Copyright © Cengage Learning. All rights reserved. CHAPTER 7 FUNCTIONS.
Chapter 9: Geometric Selection Theorems 11/01/2013
Great Theoretical Ideas in Computer Science.
1 Combinatorial Algorithms Local Search. A local search algorithm starts with an arbitrary feasible solution to the problem, and then check if some small,
1 The number of orientations having no fixed tournament Noga Alon Raphael Yuster.
Addressing and hashing Question: For a network of computers it is desirable to be able to send a message from A to B without B knowing that a message is.
1 Rainbow Decompositions Raphael Yuster University of Haifa Proc. Amer. Math. Soc. (2008), to appear.
Section 2.3 Properties of Solution Sets
Maximum density of copies of a graph in the n-cube John Goldwasser Ryan Hansen West Virginia University.
Information and Coding Theory Cyclic codes Juris Viksna, 2015.
Word : Let F be a field then the expression of the form a 1, a 2, …, a n where a i  F  i is called a word of length n over the field F. We denote the.
1 Decomposition into bipartite graphs with minimum degree 1. Raphael Yuster.
CS 103 Discrete Structures Lecture 13 Induction and Recursion (1)
1 Quasi-randomness is determined by the distribution of copies of a graph in equicardinal large sets Raphael Yuster University of Haifa.
Chapter 13 Backtracking Introduction The 3-coloring problem
1 Covering Non-uniform Hypergraphs Endre Boros Yair Caro Zoltán Füredi Raphael Yuster.
Copyright © Cengage Learning. All rights reserved. CHAPTER 8 RELATIONS.
Department of Statistics University of Rajshahi, Bangladesh
C&O 355 Lecture 19 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A A A A A A A A.
Theory of Computational Complexity Probability and Computing Ryosuke Sasanuma Iwama and Ito lab M1.
Sorting by placement and Shift Sergi Elizalde Peter Winkler By 資工四 B 周于荃.
Theory of Computational Complexity M1 Takao Inoshita Iwama & Ito Lab Graduate School of Informatics, Kyoto University.
Infinite sets We say that a set A is infinite if a proper subset B exists of A such that there is a bijection It is easy to see that no set with a finite.
Theory of Computational Complexity Probability and Computing Chapter Hikaru Inada Iwama and Ito lab M1.
Chapter 3 The Real Numbers.
Polyhedron Here, we derive a representation of polyhedron and see the properties of the generators. We also see how to identify the generators. The results.
Polyhedron Here, we derive a representation of polyhedron and see the properties of the generators. We also see how to identify the generators. The results.
Systems of distinct representations
Integer and fractional packing of graph families
Presentation transcript:

1 Zero-sum square matrices Paul Balister Yair Caro Cecil Rousseau Raphael Yuster

2 Definitions Let A be a matrix over the integers, and let p be a positive integer. A submatrix B of A is called zero-sum mod p if the sum of each row and each column of B is a multiple of p. Let M(p,k) denote the least integer m for which every square matrix of order at least m over the integers, has a square submatrix of order k which is zero-sum mod p. As usual in zero- sum problems, we assume that k is a multiple of p, otherwise M(p,k) may be infinite. If k is a multiple of p, then M(p,k) is finite by a standard Ramsey-type argument: Consider the elements of our matrices as taken from Z p + : we have a coloring of an m*m grid with p colors, so, for sufficiently large m, there is a k*k sub-grid which is monochromatic. This translates to a (very specialized) zero-sum mod p submatrix of order k

3 Graph-theoretic interpretation (p=2) A zero-one matrix A=A m,n can be interpreted as the adjacency matrix of a bipartite graph G=(X  Y, E), where X=x 1,…,x m and Y=y 1,…,y n. In this interpretation, A(i,j)=1 if and only if x i is adjacent to y j. Hence, M(2,k) is the least integer m that guarantees that every bipartite graph with equal vertex classes having cardinality m, has an induced subgraph with equal vertex classes having cardinality k, such that all degrees are even. By the Ramsey-type argument mentioned above, it is obvious that M(2,k) ≤ B(k) where B(k) is the Bipartite Ramsey Number. The best upper bound and lower bound for B(k) are currently: √2e -1 k2 k/2 < B(k) ≤ 2 k (k-1)+1 In particular, we get M(2,k) ≤ 2 k(k-1) +1.

4 Main result Unlike the exponential lower bound for B(k), we can show that M(2,k) is linear in k. We can also show that M(3,k) is linear in k for infinitely many values of k. Theorem: –limsup M(2,k)/k ≤ 4. –liminf M(3,k)/k ≤ 20. In particular, for every positive integer s, M(3,3 s ) ≤ 20 * 3 s (1+o(1)) Conjecture: For every positive integer p, there exists a constant c p such that limsup M(p,k)/k ≤ c p.

5 The theorem implies the conjecture for p=2 with c 2 ≤ 4. It is not difficult to construct examples showing M(2,k) ≥2k+1. Thus, we cannot have c 2 < 2. The construction (example for k=4):

6 It is extremely difficult to compute M(2,k) (moreover M(p,k) for p >2) even for relatively small values of k. It is an easy exercise to show that M(2,2)=5. A naive approach for computing M(2,4) with a computer program which generates all possible 0-1 matrices and testing them would fail. By using, once again, the fact that M(2,k) ≥ 2k+1, we know that M(2,4) ≥ 9. Even generating all 0-1 matrices of order 9 is not feasible as there are 2 81 such matrices (and even then, maybe the correct value is larger than 9). We used a sophisticated computer search which enables us to narrow down the checks considerably. With the aid of this search we were able to show: Proposition: –M(2,4)=10. A binary matrix of order 9 that demonstrates M(2,4) > 9 is the following (it is unique up to complement and row/column interchanges):

7 J0IIJ00IJJ0IIJ00IJ M(2,6) is already an open problem. For general p, it is not difficult to show that M(p,k)/k grows exponentially with p. For fixed p the lower bound is linear in k M(p,k) ≥ (k/5)e p/2e. For p > 3, nothing but the trivial exponential upper bound that follows from bipartite Ramsey numbers with p colors is known. Thus, the conjecture is far from being proved except for p=2 (and p=3 with the liminf version).

8 Proof of the main result First tool: A theorem of Enomoto, Frankl, Ito and Nomura. Recall that a linear binary code of length t contains the distance k if it contains as a codeword a vector with k ones and t-k zeroes. Their theorem: Let k be even, and let A be a binary matrix with k-1 rows and 2k columns. Then, A is a parity-check matrix of a linear code which contains the distance k. Example: k=4 and some arbitrary 3*8 matrix A The theorem simply says that there exists a binary vector x of length 8 with 4 positions equaling 1 such that Ax=0 (in Z 2 ).

9 Second tool: A theorem of Olson Let G be a finite Abelian group. The Davenport Constant of G, denoted D(G), is the least positive integer t such that in any sequence of t elements of G there is a (nonempty) subsequence whose sum is zero. Let p be a prime, and let G be a p-group. Then, Olson proved the following:

10 An even vector is a binary vector whose number of nonzero coordinates is even. More generally, a p-divisible vector is a vector from (Z p ) k whose sum of coordinates is divisible by p. We use the latter two tools to prove the following lemmas: The p=2 lemma: Let k be an even positive integer. Then, every sequence of 2k even vectors of length k each, contains a subsequence of exactly k vectors whose sum is the zero vector. (If k is a power of 2 we can improve 2k to 2k-1). Proof: Consider the matrix A whose columns are the elements of the sequence of 2k even vectors of length k. Ignoring the last row of A we have a parity check matrix of a linear code which contains the distance k. This means we can choose k columns whose sum is zero. Adding back the last coordinate to these columns still gives a zero sum, since the columns of A are even vectors.

11 The p=3 lemma: Let k be a power of 3. Every sequence of 5k-2 vectors of length k each that are 3-divisible, contains a subsequence of exactly k vectors whose sum is the zero vector (in (Z 3 ) k ). Proof: Let a 1,…,a 5k-2 be a sequence of 3-divisible vectors where a i  (Z 3 ) k. Let b i be the same as a i, except that the last coordinate of b i is always one. Note that b i need not be 3- divisible anymore. We consider the b i as elements of G=(Z 3 ) k-1 ×Z 3k. Since k is a power of 3, G is a p-group for p=3. According to Olson’s Theorem, D(G)=1+2(k-1)+(3k-1)=5k-2. Hence, there is a nonempty subsequence of b 1,…,b 5k-2 whose sum is zero. Since the last coordinate of b i is the element 1 of Z 3k, such a subsequence must contain exactly 3k elements. Thus, w.l.o.g., b 1 +…+b 3k =0. Now let us consider these 3k vectors as elements of G’=(Z 3 ) k-1 ×Z k. Again, G’ is a p-group for p=3, so D(G’)=1+2(k-1)+(k-1)=3k-2. Hence, there is a

12 nonempty subsequence of b 1,…,b 3k whose sum is zero. Since the last coordinate of b i is the element 1 of Z k, such a subsequence must contain either exactly k or exactly 2k elements. In the latter case, the sum of the remaining k vectors not in the subsequence is also zero. We proved there is always a subsequence of k elements whose sum is zero in G’. Now, since the a i are 3-divisible vectors we immediately get that after setting the last coordinate back to its original value the corresponding subsequence of the a i is also zero.

13 Further definitions Let x be a vector. A k-subvector of x is taken by selecting k coordinates of x. Clearly, if x  (Z p ) t then every k-subvector is a member of (Z p ) k, and x has exactly t!/(k!(t-k)!) distinct k- subvectors. Let e k (x) denote the number of p-divisible k-subvectors of x. We shall always assume that k is divisible by p. If x=(1,…1) then trivially e k (x)= t!/(k!(t-k)!), but, in general, e k (x) may be a lot smaller. For example, if x=(1,1,1,0,0)  (Z 2 ) 5 then e 2 (x)=4<10. Now, put

14 The counting lemma: –Let k be even, and let t satisfy Then, M(2,k) ≤ t. –Let k be a power of 2, and let t satisfy Then, M(2,k) ≤ t. –Let k be a power of 3, and let t satisfy Then, M(3,k) ≤ t. Proof: We prove the first part. Let A be a square 0-1 matrix of order t. We must show that A has a square submatrix A’ of order k, such that all rows and all columns of A’ are even vectors. The condition in the lemma implies that there exists a subset of k columns of A, such that the set of t line vectors they induce contains more than 2k-1 even vectors. Thus, there is also a subset of 2k rows such that in the induced 2k * k submatrix each row has zero sum. By the p=2 lemma there is a subset of k rows (out of the 2k rows) such that each column is also zero sum. The proofs of the other cases are identical using the p=2 lemma (Olson version) and the p=3 lemma.

15 It is not difficult to compute explicit small values of f(p,k,t). Clearly, if x and y are two binary vectors having the same length, and the same number of zeroes then e k (x)=e k (y). More generally, if r denotes the number of zeroes in a binary vector of length t then, Similarly, for the case p=3, if r denotes the number of zeroes and s denotes the number of ones in a 3-ary vector of length t then, Example: f(2,4,13)=343. The minimum is obtained when r=9 (or r=4). We can use the second part of the counting lemma and see that 343 > (6/13)*13!/(4!9!). Thus, we immediately get M(2,4) ≤ 13.

16 Computing lower bounds for f(p,k,t) in the general case is a more complicated task. e.g.: what would one guess is the minimum possible fraction of 3-divisible k-subvectors of a 20k-vector? The next two lemmas give a bound for every p ≥ 2, in case k=  (t) and t→∞. Before we state the lemmas, we need to define a particular constant: For a sequence of integers a 1,…,a m let w p (a 1,…,a m ) be the complex number Example: let z=e 2  i/3 w 3 (4,8)=w 3 (1,2)=0.25(4+(1+z)(1+z 2 )+(1+z 2 )(1+z))=1.5 This number is always real (in fact, rational). Define z p = inf w p (a 1,…,a m ) where the infimum is taken over all finite sequences a 1,…,a m of all lengths m ≥ 0.

17 If all a i =1 and  ≠1 then |(1+  a j )/2| ≤ cos(  /p)=  <1, and so w p ≤ 1+(p-1)  m. Letting m→∞ we see that z p ≤ 1. Also note that, trivially, z 2 =1. The z p lemma: z p =p2 1-p. A combinatorial proof: The sum of  j taken over all roots is p if p | j and 0 otherwise. Hence w p (a 1,…,a m ) is just p2 -m times the number of subsets X of {1,…,m} such that ∑ i  x a i = 0 mod p. If m=p-1 and all a i =1 then the only such subset X is . Hence z p ≤ p2 1-p and the result will follow if we can show the number of subsets X is always at least 2 m+1-p. We prove this by induction on m. Let S m be the set of residue classes mod p that can be written in the form ∑ i  x a i for some X  {1,…,m} and assume that for each element of S m there are at least 2 s m such subsets X. Now S m =S m-1  (S m-1 +a m ). We consider two cases:

18 1.S m ≠ S m-1. Then |S m |>|S m-1 |. 2.S m =S m-1. Then S m-1 +a m =S m-1 and so if x  S m then both x and x-a m lie in S m-1. Thus x can be written as a sum of the a i 's in at least 2 s m- 1 ways which do not involve a m and 2 s m- 1 ways that do involve a m. Thus s m ≥ s m In all cases |S m |+s m ≥ |S m-1 |+s m Now S 0 ={0} and s 0 =0, so by induction |S m |+s m ≥ m+1. Since |S m | ≤ p we have s m ≥ m+1-p. Thus 0  S m can be written as a sum of a i mod p in at least 2 m+1-p ways.

19 The f(p,k,t) lower bound lemma: Let k ≤ t/2. Then for any fixed prime p: Proof: Let T={1,…,t} and let x=(x 1,…,x t )  (Z p ) t be a fixed vector for which e k (x)=f(p,k,t). Assume p | k and k ≤ t/2. We wish to estimate the number f(p,k,t) of sets K  T, |K|=k with ∑ i  K x i = 0 mod p. Fix the k element subset K={1,…,k} of T and instead count the number p k of permutations   S t such that ∑ i=1..k x  (i) =0 mod p. It is clear that p k =f(p,k,t) k!(t-k)! Define Then  (1)=1, |  (  )| ≤ 1, p k /t!=p -1 ∑   (  ). Let  be a partition of T into k pairs S i ={a i,b i }, i=1,…,k, and a remaining set S 0 of t-2k numbers. We also regard  as the set of permutations  such that {  (i),  (i+k)}=S i for all i=1,…,k.

20 For each  there are exactly 2 k (t-2k)! choices for   . where and the expectation is over a uniform random choice of  (there are exactly t!/(2 k (t-2k)!) choices for  ). Assume now  ≠1. If a ≠ b mod p then | (  a +  b )/2| ≤ cos(  /p)=  <1. Thus |  ( ,  )| ≤  n  where n  is the number of i=1,…,k such that x a i ≠ x b i mod p. Hence |  (  )| ≤ E  [  n  ]. Let r s be the number of coordinates with x i = s mod p. Then E  [n  ]=  =k ∑ i<j (2r i r j )/(t(t-1)). It is not hard to show that Var  [n  ]=O(  ): Indeed, note that n  = ∑ j=1..k n  (j) where n  (j) are indicator random variables that are equal to 1 in case x a j ≠ x b j mod p. Since Cov[n  (j 1 ), n  (j 2 )] = O(  2 /(k 2 t)) and since  ≤ k < t we have: Var  [n  ] ≤  + k 2 O (  2 /(k 2 t)) = O (  ).

21 By Tchebychev's Inequality, if  →∞ then n  is almost surely large as well and  (  )=o(1). In this case Now assume  is bounded and k →∞. Let r=t- max r i be the number of coordinates of x not equal to the most common value. Then  ≥ 2kr(t/p)/t 2, so r=O(t/k) and kr 2 /t 2 =o k (1). By adding a constant vector to x and using p | k we can assume the most common coefficient in x is 0. Let m  be the number of S i with both x a i and x b i non-zero. Then, trivially, Pr[m  >0] ≤ E  [m  ]=O(kr 2 /t 2 )=o(1). If m  =0 then  ( ,  ) is a product of n  terms of the form (1+  a i )/2. Hence, the sum over all roots of unity of  ( ,  ) is at least z p. It follows that

22 Completing the proof of the main theorem: Let  >0 and let 0<  <  /(1+  ). Let k be an even integer sufficiently large such that Such a k exists by the lower bound lemma. Now, Since 0.5(1-  )>(2k-1)/(4k(1+  )) we get by the counting lemma that M(2,k) ≤ 4k(1+  ). Similarly, by using the second part of the counting lemma we get that for k sufficiently large which is a power of 3, and so M(3,k) ≤ 20k(1+  ).