On the Impossibility of Dimension Reduction for Doubling Subsets of L p Yair Bartal Lee-Ad Gottlieb Ofer Neiman.

Slides:



Advertisements
Similar presentations
Lower Bounds for Additive Spanners, Emulators, and More David P. Woodruff MIT and Tsinghua University To appear in FOCS, 2006.
Advertisements

Embedding Metric Spaces in Their Intrinsic Dimension Ittai Abraham, Yair Bartal*, Ofer Neiman The Hebrew University * also Caltech.
Embedding the Ulam metric into ℓ 1 (Ενκρεβάτωση του μετρικού χώρου Ulam στον ℓ 1 ) Για το μάθημα “Advanced Data Structures” Αντώνης Αχιλλέως.
Overcoming the L 1 Non- Embeddability Barrier Robert Krauthgamer (Weizmann Institute) Joint work with Alexandr Andoni and Piotr Indyk (MIT)
Metric Embeddings with Relaxed Guarantees Hubert Chan Joint work with Kedar Dhamdhere, Anupam Gupta, Jon Kleinberg, Aleksandrs Slivkins.
Metric Embedding with Relaxed Guarantees Ofer Neiman Ittai Abraham Yair Bartal.
Metric embeddings, graph expansion, and high-dimensional convex geometry James R. Lee Institute for Advanced Study.
Embedding Metrics into Ultrametrics and Graphs into Spanning Trees with Constant Average Distortion Ittai Abraham, Yair Bartal, Ofer Neiman The Hebrew.
A Nonlinear Approach to Dimension Reduction Lee-Ad Gottlieb Weizmann Institute of Science Joint work with Robert Krauthgamer TexPoint fonts used in EMF.
Navigating Nets: Simple algorithms for proximity search Robert Krauthgamer (IBM Almaden) Joint work with James R. Lee (UC Berkeley)
Complexity 15-1 Complexity Andrei Bulatov Hierarchy Theorem.
Convergent and Correct Message Passing Algorithms Nicholas Ruozzi and Sekhar Tatikonda Yale University TexPoint fonts used in EMF. Read the TexPoint manual.
Approximate Range Searching in the Absolute Error Model Guilherme D. da Fonseca CAPES BEX Advisor: David M. Mount.
Proximity algorithms for nearly-doubling spaces Lee-Ad Gottlieb Robert Krauthgamer Weizmann Institute TexPoint fonts used in EMF. Read the TexPoint manual.
1 University of Freiburg Computer Networks and Telematics Prof. Christian Schindelhauer Wireless Sensor Networks 19th Lecture Christian Schindelhauer.
Advances in Metric Embedding Theory Ofer Neiman Ittai Abraham Yair Bartal Hebrew University.
Approximate Nearest Neighbors and the Fast Johnson-Lindenstrauss Transform Nir Ailon, Bernard Chazelle (Princeton University)
Avraham Ben-Aroya (Tel Aviv University) Oded Regev (Tel Aviv University) Ronald de Wolf (CWI, Amsterdam) A Hypercontractive Inequality for Matrix-Valued.
Lower Bounds on the Distortion of Embedding Finite Metric Spaces in Graphs Y. Rabinovich R. Raz DCG 19 (1998) Iris Reinbacher COMP 670P
Visual Recognition Tutorial
Doubling Dimension in Real-World Graphs Melitta Lorraine Geistdoerfer Andersen.
Embedding and Sketching Alexandr Andoni (MSR). Definition by example  Problem: Compute the diameter of a set S, of size n, living in d-dimensional ℓ.
Random Projections of Signal Manifolds Michael Wakin and Richard Baraniuk Random Projections for Manifold Learning Chinmay Hegde, Michael Wakin and Richard.
Distance scales, embeddings, and efficient relaxations of the cut cone James R. Lee University of California, Berkeley.
Efficient Regression in Metric Spaces via Approximate Lipschitz Extension Lee-Ad GottliebAriel University Aryeh KontorovichBen-Gurion University Robert.
Subdivision of Edge In a graph G, subdivision of an edge uv is the operation of replacing uv with a path u,w,v through a new vertex w.
Entropy-based Bounds on Dimension Reduction in L 1 TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A AAAA A Oded Regev.
The X-Tree An Index Structure for High Dimensional Data Stefan Berchtold, Daniel A Keim, Hans Peter Kriegel Institute of Computer Science Munich, Germany.
Fast, precise and dynamic distance queries Yair BartalHebrew U. Lee-Ad GottliebWeizmann → Hebrew U. Liam RodittyBar Ilan Tsvi KopelowitzBar Ilan → Weizmann.
Payoffs in Location Games Shuchi Chawla 1/22/2003 joint work with Amitabh Sinha, Uday Rajan & R. Ravi.
1 The number of orientations having no fixed tournament Noga Alon Raphael Yuster.
13 th Nov Geometry of Graphs and It’s Applications Suijt P Gujar. Topics in Approximation Algorithms Instructor : T Kavitha.
An optimal dynamic spanner for points residing in doubling metric spaces Lee-Ad Gottlieb NYU Weizmann Liam Roditty Weizmann.
1 Building The Ultimate Consistent Reader. 2 Introduction We’ve already built a consistent reader (cube-Vs.-point)... Except it had variables ranging.
Similarity Searching in High Dimensions via Hashing Paper by: Aristides Gionis, Poitr Indyk, Rajeev Motwani.
Spectral Sequencing Based on Graph Distance Rong Liu, Hao Zhang, Oliver van Kaick {lrong, haoz, cs.sfu.ca {lrong, haoz, cs.sfu.ca.
Geometric Problems in High Dimensions: Sketching Piotr Indyk.
Embeddings, flow, and cuts: an introduction University of Washington James R. Lee.
Approximate Nearest Neighbors: Towards Removing the Curse of Dimensionality Piotr Indyk, Rajeev Motwani The 30 th annual ACM symposium on theory of computing.
Doubling Dimension: a short survey Anupam Gupta Carnegie Mellon University Barriers in Computational Complexity II, CCI, Princeton.
A light metric spanner Lee-Ad Gottlieb. Graph spanners A spanner for graph G is a subgraph H ◦ H contains vertices, subset of edges of G Some qualities.
Lower Bounds for Embedding Edit Distance into Normed Spaces A. Andoni, M. Deza, A. Gupta, P. Indyk, S. Raskhodnikova.
Two Finger Caging of Concave Polygon Peam Pipattanasomporn Advisor: Attawith Sudsang.
Advances in Metric Embedding Theory Yair Bartal Hebrew University &Caltech UCLA IPAM 07.
TU/e Algorithms (2IL15) – Lecture 12 1 Linear Programming.
Oct 23, 2005FOCS Metric Embeddings with Relaxed Guarantees Alex Slivkins Cornell University Joint work with Ittai Abraham, Yair Bartal, Hubert Chan,
TU/e Algorithms (2IL15) – Lecture 12 1 Linear Programming.
1 HEINZ NIXDORF INSTITUTE University of Paderborn Algorithms and Complexity Christian Schindelhauer Algorithms for Radio Networks Winter Term 2005/2006.
Clustering – Definition and Basic Algorithms Seminar on Geometric Approximation Algorithms, spring 11/12.
TU/e Algorithms (2IL15) – Lecture 11 1 Approximation Algorithms.
Prof. Paolo Ferragina, Algoritmi per "Information Retrieval"
Packing to fewer dimensions
Dimension reduction for finite trees in L1
Generalized Sparsest Cut and Embeddings of Negative-Type Metrics
Advances in Metric Embedding Theory
Ultra-low-dimensional embeddings of doubling metrics
Greedy Routing with Bounded Stretch
Dimension reduction techniques for lp (1<p<2), with applications
Lecture 10: Sketching S3: Nearest Neighbor Search
Additive Combinatorics and its Applications in Theoretical CS
Lecture 16: Earth-Mover Distance
Constrained Bipartite Vertex Cover: The Easy Kernel is Essentially Tight Bart M. P. Jansen June 4th, WORKER 2015, Nordfjordeid, Norway.
Y. Kotidis, S. Muthukrishnan,
Yair Bartal Lee-Ad Gottlieb Hebrew U. Ariel University
Prof. Paolo Ferragina, Algoritmi per "Information Retrieval"
Dimension versus Distortion a.k.a. Euclidean Dimension Reduction
Lecture 15: Least Square Regression Metric Embeddings
The Intrinsic Dimension of Metric Spaces
Clustering.
Routing in Networks with Low Doubling Dimension
Presentation transcript:

On the Impossibility of Dimension Reduction for Doubling Subsets of L p Yair Bartal Lee-Ad Gottlieb Ofer Neiman

Embedding and Distortion

JL Lemma Lemma: Any n points in L 2 can be embedded into L 2 k, k=O((log n)/ε 2 ) with 1+ε distortion Extremely useful for many applications: – Machine learning – Compressive sensing – Nearest Neighbor search – Many others… Limitations: specific to L 2, dimension depends on n – There are lower bounds for dimension reduction in L 1, L ∞

Lower bounds on Dimension Reduction

Doubling Dimension Doubling constant: The minimal λ so that every ball of radius 2r can be covered by λ balls of radius r Doubling dimension: log 2 λ A measure for dimensionality of a metric space Generalizes the dimension for normed space: L p k has doubling dimension Θ(k) The volume argument holds only for metrics with high doubling dimension

Overcoming the Lower Bounds? One could hope for an analogous version of the JL- Lemma for doubling subsets Question: Does every set of points in L 2 of constant doubling dimension, embeds to constant dimensional space with constant distortion? More ambitiously: Any subset of L 2 with doubling constant λ, can be embedded into L 2 k, k=O((log λ)/ε 2 ) with 1+ε distortion

Our Result

Implications

The Laakso Graph A recursive graph, G i+1 is obtained from G i by replacing every edge with a copy of G 1 A series-parallel graph Has doubling constant 6 G0G0 G1G1 G2G2

Simple Case: p=∞

v Consider a single iteration The pair a,b is an edge of the previous iteration Let f j be the j-th coordinate There is a natural embedding that does not increase stretch... But then u,v may be distorted ab s t u f j (a) f j (b)

Simple Case: p=∞ For simplicity (and w.l.o.g) assume – f j (s)=(f j (b)-f j (a))/4 – f j (t)=3(f j (b)-f j (a))/4 – f j (v)=(f j (b)-f j (a))/2 Let Δ j (u) be the difference between f j (u) and f j (v) The distortion D requirement imposes that for some j, Δ j (u)>1/D (normalizing so that d(u,v)=1) v ab s t u f j (a) f j (b) Δ j (u)

Simple Case: p=∞ v ab s t f j (a) f j (b) Δ j (u) u v ab s t f h (a) f h (b) -Δ h (u) u

Simple Case: p=∞ v ab s t f j (a) f j (b) Δ j (u) u

Simple Case: p=∞ u s a b t v

Going Beyond Infinity For p<∞, we cannot use the Laakso graph – Requires high distortion to embed it into L p Instead, we build an instance in L p, inspired by the Laakso graph The new points u,v will use a new dimension Parameter ε determines the (scaled) u,v distance as u v t b ε

Going Beyond Infinity Problem: the u,s distance is now larger than 1, roughly 1+ε p Causes a loss of ≈ ε p in the stretch of each level Since u,v are at distance ε, the increase to the stretch is now only (ε/D) 2 When p>2, there is a choice of ε for which the increase overcomes the loss as u v t b ε

Conclusion We show a strong lower bound against dimension reduction for doubling subsets of L p, for any p>2 Can our techniques be extended to 1<p<2 ? – The u,s distance when p<2 is quite large, ≈ 1+(p-1)ε 2, so a different approach is required General doubling metrics embed to L p with distortion O(log 1/p n) (for p≥2) – Can this distortion bound be obtained in constant dimension?