Sketching and Embedding are Equivalent for Norms Alexandr Andoni (Simons Inst. / Columbia) Robert Krauthgamer (Weizmann Inst.) Ilya Razenshteyn (MIT, now.

Slides:



Advertisements
Similar presentations
1+eps-Approximate Sparse Recovery Eric Price MIT David Woodruff IBM Almaden.
Advertisements

The Data Stream Space Complexity of Cascaded Norms T.S. Jayram David Woodruff IBM Almaden.
Optimal Space Lower Bounds for All Frequency Moments David Woodruff MIT
Numerical Linear Algebra in the Streaming Model Ken Clarkson - IBM David Woodruff - IBM.
The Average Case Complexity of Counting Distinct Elements David Woodruff IBM Almaden.
Optimal Bounds for Johnson- Lindenstrauss Transforms and Streaming Problems with Sub- Constant Error T.S. Jayram David Woodruff IBM Almaden.
Numerical Linear Algebra in the Streaming Model
Xiaoming Sun Tsinghua University David Woodruff MIT
Tight Lower Bounds for the Distinct Elements Problem David Woodruff MIT Joint work with Piotr Indyk.
Property Testing of Data Dimensionality Robert Krauthgamer ICSI and UC Berkeley Joint work with Ori Sasson (Hebrew U.)
A Nonlinear Approach to Dimension Reduction Robert Krauthgamer Weizmann Institute of Science Joint work with Lee-Ad Gottlieb TexPoint fonts used in EMF.
1 Approximating Edit Distance in Near-Linear Time Alexandr Andoni (MIT) Joint work with Krzysztof Onak (MIT)
Algorithmic High-Dimensional Geometry 1 Alex Andoni (Microsoft Research SVC)
Spectral Approaches to Nearest Neighbor Search arXiv: Robert Krauthgamer (Weizmann Institute) Joint with: Amirali Abdullah, Alexandr Andoni, Ravi.
Overcoming the L 1 Non- Embeddability Barrier Robert Krauthgamer (Weizmann Institute) Joint work with Alexandr Andoni and Piotr Indyk (MIT)
Metric embeddings, graph expansion, and high-dimensional convex geometry James R. Lee Institute for Advanced Study.
Geometric embeddings and graph expansion James R. Lee Institute for Advanced Study (Princeton) University of Washington (Seattle)
Metric Embeddings As Computational Primitives Robert Krauthgamer Weizmann Institute of Science [Based on joint work with Alex Andoni]
A Nonlinear Approach to Dimension Reduction Lee-Ad Gottlieb Weizmann Institute of Science Joint work with Robert Krauthgamer TexPoint fonts used in EMF.
Data Structures and Functional Programming Algorithms for Big Data Ramin Zabih Cornell University Fall 2012.
Navigating Nets: Simple algorithms for proximity search Robert Krauthgamer (IBM Almaden) Joint work with James R. Lee (UC Berkeley)
Sketching and Embedding are Equivalent for Norms Alexandr Andoni (Simons Institute) Robert Krauthgamer (Weizmann Institute) Ilya Razenshteyn (CSAIL MIT)
1 Lecture 18 Syntactic Web Clustering CS
Avraham Ben-Aroya (Tel Aviv University) Oded Regev (Tel Aviv University) Ronald de Wolf (CWI, Amsterdam) A Hypercontractive Inequality for Matrix-Valued.
1 Sampling Lower Bounds via Information Theory Ziv Bar-Yossef IBM Almaden.
1 Edit Distance and Large Data Sets Ziv Bar-Yossef Robert Krauthgamer Ravi Kumar T.S. Jayram IBM Almaden Technion.
Optimal Data-Dependent Hashing for Approximate Near Neighbors
Embedding and Sketching Alexandr Andoni (MSR). Definition by example  Problem: Compute the diameter of a set S, of size n, living in d-dimensional ℓ.
Embedding and Sketching Non-normed spaces Alexandr Andoni (MSR)
Algorithms on negatively curved spaces James R. Lee University of Washington Robert Krauthgamer IBM Research (Almaden) TexPoint fonts used in EMF. Read.
On Embedding Edit Distance into L_11 On Embedding Edit Distance into L 1 Robert Krauthgamer (Weizmann Institute and IBM Almaden)‏ Based on joint work (i)
Information Complexity Lower Bounds for Data Streams David Woodruff IBM Almaden.
Streaming Algorithms Piotr Indyk MIT. Data Streams A data stream is a sequence of data that is too large to be stored in available memory Examples: –Network.
Information Theory for Data Streams David P. Woodruff IBM Almaden.
13 th Nov Geometry of Graphs and It’s Applications Suijt P Gujar. Topics in Approximation Algorithms Instructor : T Kavitha.
Sublinear Algorithms via Precision Sampling Alexandr Andoni (Microsoft Research) joint work with: Robert Krauthgamer (Weizmann Inst.) Krzysztof Onak (CMU)
Similarity Searching in High Dimensions via Hashing Paper by: Aristides Gionis, Poitr Indyk, Rajeev Motwani.
1 Embedding and Similarity Search for Point Sets under Translation Minkyoung Cho and David M. Mount University of Maryland SoCG 2008.
Geometric Problems in High Dimensions: Sketching Piotr Indyk.
Embedding and Sketching Sketching for streaming Alexandr Andoni (MSR)
Massive Data Sets and Information Theory Ziv Bar-Yossef Department of Electrical Engineering Technion.
Data Stream Algorithms Lower Bounds Graham Cormode
Lower Bounds for Embedding Edit Distance into Normed Spaces A. Andoni, M. Deza, A. Gupta, P. Indyk, S. Raskhodnikova.
Optimal Data-Dependent Hashing for Nearest Neighbor Search Alex Andoni (Columbia University) Joint work with: Ilya Razenshteyn.
Sketching and Embedding are Equivalent for Norms Alexandr Andoni (Columbia) Robert Krauthgamer (Weizmann Inst) Ilya Razenshteyn (MIT) 1.
Summer School on Hashing’14 Dimension Reduction Alex Andoni (Microsoft Research)
Sketching complexity of graph cuts Alexandr Andoni joint work with: Robi Krauthgamer, David Woodruff.
Random Access Codes and a Hypercontractive Inequality for
Approximate Near Neighbors for General Symmetric Norms
New Characterizations in Turnstile Streams with Applications
Dimension reduction for finite trees in L1
Fast Dimension Reduction MMDS 2008
Sublinear Algorithmic Tools 3
Sublinear Algorithmic Tools 2
Spectral Approaches to Nearest Neighbor Search [FOCS 2014]
Lecture 10: Sketching S3: Nearest Neighbor Search
Sketching and Embedding are Equivalent for Norms
Turnstile Streaming Algorithms Might as Well Be Linear Sketches
Near(est) Neighbor in High Dimensions
Data-Dependent Hashing for Nearest Neighbor Search
Lecture 16: Earth-Mover Distance
Linear sketching with parities
Near-Optimal (Euclidean) Metric Compression
Yair Bartal Lee-Ad Gottlieb Hebrew U. Ariel University
Overcoming the L1 Non-Embeddability Barrier
Streaming Symmetric Norms via Measure Concentration
Dimension versus Distortion a.k.a. Euclidean Dimension Reduction
Lecture 15: Least Square Regression Metric Embeddings
Approximating Edit Distance in Near-Linear Time
On Solving Linear Systems in Sublinear Time
Presentation transcript:

Sketching and Embedding are Equivalent for Norms Alexandr Andoni (Simons Inst. / Columbia) Robert Krauthgamer (Weizmann Inst.) Ilya Razenshteyn (MIT, now at IBM Almaden) 1

Sketching n d When is sketching possible? 2 Compress a massive object to a small sketch Rich theories: high-dimensional vectors, matrices, graphs Similarity search, compressed sensing, numerical linear algebra Dimension reduction (Johnson, Lindenstrauss 1984): random projection on a low-dimensional subspace preserves distances

Motivation: similarity search Model dis-similarity as a metric Sketching may speed-up computation and allow indexing Interesting metrics: Euclidean ℓ 2 : d(x, y) = (∑ i |x i – y i | 2 ) 1/2 Manhattan, Hamming ℓ 1 : d(x, y) = ∑ i |x i – y i | ℓ p distances d(x, y) = (∑ i |x i – y i | p ) 1/p for p ≥ 1 Edit distance, Earth Mover’s Distance etc. This talk: sketching metrics 3

Sketching metrics Alice and Bob each hold a point from a metric space (say x and y) Both send s-bit sketches to Charlie For r > 0 and D > 1 distinguish d(x, y) ≤ r d(x, y) ≥ Dr Shared randomness, allow 1% probability of error Trade-off between s and D Various variants: general protocols etc sketch(x)sketch(y) d(x, y) ≤ r or d(x, y) ≥ Dr? 0110…1 4 AliceBob Charlie x y

The main question Which metrics can we sketch efficiently? (Kanpur 2006)

Near Neighbor Search via sketches Near Neighbor Search (NNS): Given n-point dataset P A query q within r from some data point Return any data point within Dr from q Sketches of size s imply NNS with space n O(s) and a 1-probe query Proof idea: amplify probability of error to 1/n by increasing the size to O(s log n); sketch of q determines the answer For many metrics: the only approach 6

The main question Which metrics can we sketch efficiently? (Kanpur 2006)

Sketching ℓ p norms (Indyk 2000): can sketch ℓ p for 0 < p ≤ 2 via random projections using p-stable distributions For D = 1 + ε one gets s = O(1 / ε 2 ) Tight by (Woodruff 2004) For p > 2 sketching ℓ p is somewhat hard (Bar-Yossef, Jayram, Kumar, Sivakumar 2002), (Indyk, Woodruff 2005) To achieve D = O(1) one needs sketch size to be s = Θ~(d 1-2/p ) 8

The main question (quantitative) Which metrics can we sketch with constant sketch size and approximation?

X Y Beyond ℓ p norms A map f: X → Y is an embedding with distortion C, if for a, b from X: d X (a, b) / C ≤ d Y (f(a), f(b)) ≤ d X (a, b) Reductions for geometric problems 10 a b f(a) f(b) f f Sketches of size s and approximation D for Y Sketches of size s and approximation CD for X

Metrics with good sketches Summary: a metric X admits sketches with s, D = O(1), if: X = ℓ p for p ≤ 2 X embeds into ℓ p for p ≤ 2 with distortion O(1) Are there any other metrics with efficient sketches? We don’t know! Some new techniques waiting to be discovered? Above are the only “tractable” spaces? 11

A normed space: R d equipped with a metric (think ℓ p or matrix norms) The main result If a normed space X admits sketches of size s and approximation D, then for every ε > 0 the space X embeds into ℓ 1 – ε with distortion O(sD / ε) Embedding into ℓ p, p ≤ 2 Efficient sketches (Kushilevitz, Ostrovsky, Rabani 1998) (Indyk 2000) For norms 12

Application: lower bounds for sketches Convert non-embeddability into lower bounds for sketches in a black box way 13 No embeddings with distortion O(1) into ℓ 1 – ε No sketches * of size and approximation O(1) * in fact, any communication protocols

Example 1: the Earth Mover’s Distance For x: R [Δ]×[Δ] → R with zero average, ‖x‖ EMD is the cost of the best transportation of the positive part of x to the negative part Initial motivation for this work! (Kanpur 2006) Upper bounds: (Andoni, Do Ba, Indyk, Woodruff 2009), (Charikar 2002), (Indyk, Thaper 2003), (Naor, Schechtman 2005) Lower bound also holds for the minimum-cost matching metric on subsets No embedding into ℓ 1 – ε with distortion O(1) (Naor, Schechtman 2005) No sketches with D = O(1) and s = O(1) 14

Example 2: the Trace Norm For an n × n matrix A define the Trace Norm (the Nuclear Norm) ‖A‖ to be the sum of the singular values Previously: lower bounds only for linear sketches (Li, Nguyen, Woodruff 2014) Any embedding into ℓ 1 requires distortion Ω(n 1/2 ) (Pisier 1978) Any sketch must satisfy sD = Ω(n 1/2 / log n) 15

The sketch of the proof 16 Good sketches for X Absence of certain Poincaré-type inequalities on X [Andoni-Jayram-P ă traşcu 2010], Direct sum for Information Complexity Weak embedding of X into ℓ 2 Convex duality + compactness Uniform embedding of X into ℓ 2 [Johnson-Randrianarivony 2006], Lipschitz extension Linear embedding of X into ℓ 1-ε [Aharoni-Maurey-Mityagin 1985], Fourier analysis Good sketches for ℓ ∞ (X) Crucially use that X is a norm ‖(x 1, x 2, …, x k )‖ = max i ‖x i ‖ ‖ x 1 – x 2 ‖ ≤ 1 → ‖ f(x 1 ) – f(x 2 ) ‖ ≤ 1 ‖ x 1 – x 2 ‖ ≥ sD → ‖ f(x 1 ) – f(x 2 ) ‖ ≥ 10 [Andoni-Krauthgamer 2007]: almost works, but gives a tiny gap instead of 1 vs 10

Open problems Extend to as general class of metrics as possible (Edit Distance?) Can one strengthen our theorem to “sketches with O(1) size and approx. imply embedding into ℓ 1 with distortion O(1)”? Equivalent to an old open problem from Functional Analysis [Kwapien 1969] Keep in mind negative-type metrics that do not embed into ℓ 1 [Khot-Vishnoi 2005] [Cheeger-Kleiner-Naor 2009] Spaces that require s = Ω(d) for D = O(1) besides ℓ ∞ ? Linear sketches with f(s) measurements and g(D) approximation? 17 Any questions?