1 Distributed (Local) Monotonicity Reconstruction
Michael Saks, Rutgers University
C. Seshadhri, Princeton University (now IBM Almaden)

2 Overview
• Introduce a new class of algorithmic problems: Distributed Property Reconstruction (extending the frameworks of program self-correction, robust property testing, and locally decodable codes)
• A solution for the property Monotonicity

3 Data Sets
Data set = function f : Γ → V
• Γ = finite index set
• V = value set
In this talk: Γ = [n]^d = {1,…,n}^d and V = nonnegative integers, so f is a d-dimensional array of nonnegative integers.

4 Distance between two data sets
For f,g with common domain Γ:
dist(f,g) = fraction of the domain where f(x) ≠ g(x)

5 Properties of data sets
Focus of this talk:
• Monotone: nondecreasing along every line (order preserving)
• When d = 1, monotone = sorted

6 Some Algorithmic Problems for P
Given data set f (as input):
Recognition: Does f satisfy P?
Robust Testing: Define ε(f) = min{ dist(f,g) : g satisfies P }. For some 0 ≤ ε₁ < ε₂ < 1, output either
• ε(f) > ε₁ : f is far from P, or
• ε(f) < ε₂ : f is close to P.
(If ε₁ < ε(f) ≤ ε₂, either output is acceptable.)
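[Aside, not from the slides: for monotonicity with d = 1, ε(f) can be computed exactly offline, since the minimum number of entries that must change equals n minus the length of the longest nondecreasing subsequence. A minimal Python sketch:]

    import bisect

    def epsilon(f):
        # distance to monotonicity: 1 - LNDS(f)/n, where LNDS is the
        # longest nondecreasing subsequence (patience-sorting method)
        tails = []  # tails[k] = smallest possible last value of a
                    # nondecreasing subsequence of length k+1
        for v in f:
            i = bisect.bisect_right(tails, v)
            if i == len(tails):
                tails.append(v)
            else:
                tails[i] = v
        return 1 - len(tails) / len(f)

    print(epsilon([1, 2, 5, 3, 4]))  # 0.2: changing the single entry 5 suffices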

7 Property Reconstruction
Setting: We are given f and expect it to satisfy P (e.g., we run algorithms on f that assume P), but f may not satisfy P.
Reconstruction problem for P: Given data set f, produce data set g that
• satisfies P
• is close to f: dist(f,g) is not much bigger than ε(f)

8 What does it mean to produce g?
Offline computation:
Input: function table for f
Output: function table for g

9 Distributed monotonicity reconstruction
Want an algorithm A that, on input x, computes g(x), where A
• may query f(y) for any y
• has access to a short random string s and is otherwise deterministic.

10 Distributed Property Reconstruction
Goal: WHP (with probability close to 1, over choices of the random string s):
• g has property P
• dist(g,f) = O(ε(f))
• each A(x) runs quickly; in particular, it reads f(y) for only a small number of y.

11 Distributed Property Reconstruction
Precursors:
• Online Data Reconstruction Model (Ailon-Chazelle-Liu-Seshadhri) [ACCL]
• Locally Decodable Codes and Program Self-correction (Blum-Luby-Rubinfeld; Rubinfeld-Sudan; etc.)
• Graph Coloring (Goldreich-Goldwasser-Ron)
• Monotonicity Testing (Dodis-Goldreich-Lehman-Raskhodnikova-Ron-Samorodnitsky; Goldreich-Goldwasser-Lehman-Ron-Samorodnitsky; Fischer; Fischer-Lehman-Newman-Raskhodnikova-Rubinfeld-Samorodnitsky; Ergun-Kannan-Kumar-Rubinfeld-Vishwanathan; etc.)
• Tolerant Property Testing (Parnas-Ron-Rubinfeld)

12 Example: Local Decoding of Codes
• Data set f = boolean string of length n
• Property = being a codeword of a given error-correcting code C
• Reconstruction = decoding to a close codeword
• Distributed reconstruction = local decoding

13 Key issue: making answers consistent
For an error-correcting code, we can assume:
• the input f decodes to a unique g
• the set of positions that need to be corrected is determined by f.
For a general property:
• many different g (even exponentially many) that are close to f may have the property
• we want to ensure that A produces one of them.

14 An example
Monotonicity with input array: 1,…,100, 111,…,120, 101,…,110, 121,…,200
(Overwriting either one of the two swapped blocks of 10 entries yields a monotone array at distance 10/200 from the input, so there are two distinct, equally close monotone corrections.)

15 Monotonicity Reconstruction: d = 1
f is a linear array of length n.
First attempt at distributed reconstruction: A(x) looks at f(x) and f(x-1).
• If f(x) ≥ f(x-1), then g(x) = f(x)
• Otherwise we have a non-monotonicity: g(x) = max{ f(x), f(x-1) }

16 Monotonicity Reconstruction: d = 1
Second attempt: Set g(x) = max{ f(1), f(2),…, f(x) }
g is monotone, but
• A(x) requires time Ω(x)
• dist(g,f) may be much larger than ε(f)
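[A small Python illustration, not from the slides, of why the second attempt fails the distance requirement: one bad entry at the front forces g to differ from f almost everywhere.]

    def g_prefix_max(f):
        # second attempt: g(x) = max(f(1), ..., f(x))
        g, m = [], float("-inf")
        for v in f:
            m = max(m, v)
            g.append(m)
        return g

    f = [5, 1, 2, 3, 4]
    print(g_prefix_max(f))  # [5, 5, 5, 5, 5]: dist(f,g) = 0.8,
                            # while epsilon(f) = 0.2 (change only the 5)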

17 Our results (for general d)
A distributed monotonicity reconstruction algorithm for general dimension d such that:
• Time to compute g(x) is (log n)^O(d)
• dist(f,g) = C₁(d) · ε(f)
• The shared random string s has size (d log n)^O(1)
(Builds on prior results on monotonicity testing and online monotonicity reconstruction.)

18 Which array values should be changed?
A subset S of Γ is f-monotone if f restricted to S is monotone.
For each x in Γ, A(x) must:
• decide whether g(x) = f(x)
• if not, determine g(x).
Preserved = { x : g(x) = f(x) }
Corrected = { x : g(x) ≠ f(x) }
In particular, Preserved must be f-monotone.

19 Identifying Preserved
The partition (Preserved, Corrected) must satisfy:
• Preserved is f-monotone
• |Corrected| / |Γ| = O(ε(f))
This suggests a preliminary algorithmic problem:

20 Classification problem
Classify each y in Γ as Green or Red so that
• Green is f-monotone
• Red has size O(ε(f)|Γ|).
Need a subroutine Classify(y).

21 A sufficient condition for f-monotonicity
A pair (x,y) in Γ × Γ is a violation if x < y and f(x) > f(y).
To guarantee that Green is f-monotone, Red should hit all violations: for every violation (x,y), at least one of x, y is Red.

22 Classify: 1-dimensional case
d = 1: Γ = {1,…,n}, f is a linear array.
For x in Γ and subinterval J of Γ:
violations(x,J) = |{ y in J : the pair x,y is a violation }|

23 Constructing a large f-monotone set
The set Bad: x is in Bad if for some interval J containing x, violations(x,J) ≥ |J|/2.
Lemma.
1) Good = Γ - Bad is f-monotone
2) |Bad| ≤ 4 ε(f) |Γ|
Proof of 1): If (x,y) is a violation, then every z in [x,y] forms a violation either with x or with y, so one of x, y has violations(·,[x,y]) ≥ |[x,y]|/2 and is Bad for the interval [x,y].
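[A brute-force Python sketch of these definitions; the algorithm never computes Bad exactly, but this pins the definitions down. Here violations(x,J) counts partners of x in J on either side.]

    def violations(f, x, J):
        # y in J is a violation partner of x if the pair appears out of order
        return sum(1 for y in J
                   if (y > x and f[x] > f[y]) or (y < x and f[y] > f[x]))

    def bad_set(f):
        # x is Bad if some interval J containing x has violations(x,J) >= |J|/2
        n = len(f)
        return {x for x in range(n)
                if any(violations(f, x, range(lo, hi + 1)) >= (hi - lo + 1) / 2
                       for lo in range(x + 1) for hi in range(x, n))}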

24 Lemma. Good = Γ \ Bad is f-monotone, and |Bad| ≤ 4 ε(f) |Γ|.
So we'd like to take: Green = Good, Red = Bad.

25 How do we compute Good?
To test whether y is in Good: for each interval J containing y, check violations(y,J) < |J|/2.
Difficulties:
• There are Ω(n) intervals J containing y
• For each J, computing violations(y,J) takes time Ω(|J|).

26 Speeding up the computation
• Estimate violations(y,J) by random sampling; a sample of size polylog(n) is sufficient. violations*(y,J) denotes the estimate.
• Compute violations*(y,J) only for a carefully chosen set of test intervals.
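[A sketch of the sampling estimate in Python; the slides promise only that polylog(n) samples suffice, so the sample size below is an arbitrary stand-in.]

    import random

    def violations_est(f, x, J, samples=200):
        # Monte Carlo estimate of violations(x,J): sample y uniformly
        # from J, count out-of-order pairs with x, and rescale
        J = list(J)
        hits = 0
        for _ in range(samples):
            y = random.choice(J)
            if (y > x and f[x] > f[y]) or (y < x and f[y] > f[x]):
                hits += 1
        return hits * len(J) / samples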

27 Set of test intervals
Want a set T of intervals of [n] with:
• Each x is in few (O(log n)) intervals of T
• If S is any subset of T, then S has a subfamily C such that each x in ∪S belongs to 1 or 2 sets of C
• For any interval I, there is a J in T containing I of length at most 4|I|.

28 The Test Set T
Assume n = |Γ| = 2^k.
k layers of intervals: layer j consists of 2^(k-j+1) - 1 intervals of size 2^j.
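[The stated count 2^(k-j+1) - 1 matches intervals of length 2^j placed at every multiple of 2^(j-1), i.e. half-overlapping dyadic intervals; a Python sketch under that reading:]

    def test_intervals(k):
        # T over Gamma = {0, ..., 2**k - 1}: layer j has intervals of
        # length 2**j starting at every multiple of 2**(j-1), giving
        # 2**(k-j+1) - 1 intervals per layer and at most 2 per layer
        # through any fixed point
        n = 1 << k
        T = []
        for j in range(1, k + 1):
            length, step = 1 << j, 1 << (j - 1)
            T.extend((s, s + length - 1)
                     for s in range(0, n - length + 1, step))
        return T

[With this T, any interval I fits inside some test interval of at most 4 times its length: take the first layer whose length is at least 2|I|; the half-length spacing then guarantees some interval of that layer contains I.]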

29 View T as a DAG
• Every point x belongs to 2 log n intervals
• All intervals in a layer have the same size
• For any subcollection S of T, the subset of S-maximal intervals covers each point of the union exactly once or twice
• For x < y there is an interval J in T of size at most 4|[x,y]| containing [x,y].

30 Subroutine classify
To classify y:
If violations*(y,J) < 0.1|J| for every J in T containing y, then y is Green; else y is Red.
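[Putting the pieces together, a sketch of Classify built from the test_intervals and violations_est sketches above:]

    def classify(f, y, T, samples=200):
        # y is Red iff some test interval containing y looks at least
        # 10% violated with respect to y (the 0.1|J| threshold above)
        for lo, hi in T:
            if lo <= y <= hi:
                J = range(lo, hi + 1)
                if violations_est(f, y, J, samples) >= 0.1 * len(J):
                    return "Red"
        return "Green"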

31 Where are we?
We have a subroutine Classify that, on input x,
• outputs Green or Red
• runs in time polylog(n).
WHP:
• Green is f-monotone
• |Red| ≤ 20 ε(f) |Γ|

32 Defining g(x) for Red x
The natural way to define g(x):
Green(x) = { y : y ≤ x and y is Green }
m(x) = max{ Green(x) }
g(x) = max{ f(y) : y in Green(x) } = f(m(x))
In particular, this gives g(x) = f(x) for Green x.
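[The natural backwards search for m(x), as a Python sketch; the next slide observes that it can take linear time when x is preceded by a long Red run.]

    def m_naive(x, is_green):
        # largest Green point <= x by linear backwards search;
        # is_green(y) would be the Classify subroutine in practice
        y = x
        while y >= 0 and not is_green(y):
            y -= 1
        return y  # -1 if there is no Green point at or below x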

33 Computing m(x)
Can search backwards from x to find the first Green point; this is inefficient if x is preceded by a long Red stretch.
Is there a more efficient way to find m(x)? No, unless the Red and Green sets have non-trivial structure that can be exploited… and there seems to be no such structure.


35 Approximating m(x)?
Pick a random Sample(x) of points less than x:
• density inversely proportional to distance from x
• size polylog(n).
Green*(x) = { y : y in Sample(x), y is Green }
m*(x) = max{ y in Green*(x) }
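[One way to realize Sample(x), sketched in Python: a constant number of uniform picks from each dyadic band below x gives density roughly inversely proportional to distance and polylog size. The per-band count reps is an assumption, not from the slides.]

    import random

    def sample_below(x, reps=3):
        # a point at distance about t from x lies in a band of width
        # about t, so it is picked with probability roughly reps/t;
        # |Sample(x)| = O(reps * log x)
        pts = set()
        j = 0
        while (1 << j) < x + 1:
            lo = max(0, x - (1 << (j + 1)) + 1)
            hi = x - (1 << j)
            if hi >= 0:
                pts.update(random.randint(lo, hi) for _ in range(reps))
            j += 1
        return sorted(pts)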

36 Is m*(x) good enough?
Suppose y is Green and m*(x) ≤ y ≤ x. Since y is Green, g(y) = f(y), and g(x) = f(m*(x)) < f(y) = g(y): g is not monotone.

37 Is m*(x) good enough?
To ensure monotonicity we need: x < y implies m*(x) ≤ m*(y).
This requires relaxing the requirement that m*(z) = z for all Green z.

38 Thinning out Green*(x)
Plan: eliminate certain unsafe points from Green*(x).
Roughly, y is unsafe for x if for some z > x, some interval beginning with y and containing x has a high density of Reds. (There is then a non-trivial chance that Sample(z) has no Green points ≥ y.)

39 Thinning out Green*(x)
Green*(x) = { y : y in Sample(x), y is Green }
m*(x) = max{ y in Green*(x) }
Green^(x) = { y : y in Green*(x), y is safe for x }
m^(x) = max{ y in Green^(x) }
(Hiding: the efficient implementation of Green^(x).)

40 Redefining Green^(x)
WHP:
• if x ≤ y, then m^(x) ≤ m^(y)
• |{ x : m^(x) ≠ x }| = O(ε(f) |Γ|).

41 Summary of 1-dimensional case
• Classify points as Green and Red: few Red points; f restricted to Green is monotone.
• For each x, choose Sample(x): size polylog(n), all points less than x, density inversely proportional to distance from x.
• Green^(x) = points of Sample(x) that are Green and safe for x; m^(x) = max of Green^(x).
• Output g(x) = f(m^(x)).
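[An end-to-end 1-d sketch in Python combining the pieces above. It uses m* rather than m^: the safe-point filtering of slides 38-40, whose efficient implementation the slides deliberately hide, is omitted, so this illustrates the data flow but not the whp monotonicity guarantee.]

    def reconstruct_point(f, x, T, samples=200):
        # g(x) = f(largest sampled Green point <= x); for Green x this
        # returns f(x) itself
        if classify(f, x, T, samples) == "Green":
            return f[x]
        greens = [y for y in sample_below(x)
                  if classify(f, y, T, samples) == "Green"]
        return f[max(greens)] if greens else f[x]  # fallback: keep f(x)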

42 Dimension greater than 1
For x < y, want g(x) ≤ g(y).

43 Red/Green Classification
Extend the Red/Green classification to higher dimensions:
• f restricted to Green is monotone
• Red is small.
Straightforward (mostly) extension of the 1-dimensional case.

44 Given Red/Green classification
In the one-dimensional case:
Green^(x) = sampled Green points safe for x
g(x) = f(max{ y : y in Green^(x) })

45 The Green points below x
• The set of Green maxima below x could be very large
• Sparse random sampling will only roughly capture the frontier
• Finding an appropriate definition of unsafe points is much harder than in the one-dimensional case.

46 Further work
The g produced by our algorithm differs from f on at most C(d) ε(f) |Γ| points.
• Our C(d) is exp(d²).
• What should C(d) be? (Guess: C(d) = exp(d).)
Distributed reconstruction for other interesting properties?
(Reconstructing expanders: Kale-Peres-Seshadhri, FOCS 2008.)