Embedding Metric Spaces in Their Intrinsic Dimension. Ittai Abraham, Yair Bartal (also Caltech), Ofer Neiman. The Hebrew University.



Embedding Metric Spaces
- Metric spaces (X, d_X) and (Y, d_Y).
- An embedding is a function f : X → Y.
- The distortion is the minimal α such that d_X(x,y) ≤ d_Y(f(x),f(y)) ≤ α·d_X(x,y) for all x, y in X.
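The definition above can be checked numerically on a small example. The sketch below is only an illustration: the function name `distortion` and the toy spaces (a 4-cycle mapped to the corners of a unit square) are assumptions, not taken from the talk. It computes the product of the worst expansion and the worst contraction, which equals the minimal α once the map is rescaled so that it never shrinks distances. It assumes f is injective.

```python
import itertools
import math

def distortion(points, d_X, d_Y, f):
    """Distortion of f: worst expansion times worst contraction over all pairs.

    Rescaling f so that it never shrinks distances leaves this product
    unchanged, so it matches the minimal alpha in the definition.
    Assumes f is injective (no pair is mapped to distance zero).
    """
    expansion = 0.0    # max of d_Y(f(x), f(y)) / d_X(x, y)
    contraction = 0.0  # max of d_X(x, y) / d_Y(f(x), f(y))
    for x, y in itertools.combinations(points, 2):
        dx, dy = d_X(x, y), d_Y(f(x), f(y))
        expansion = max(expansion, dy / dx)
        contraction = max(contraction, dx / dy)
    return expansion * contraction

# Toy example (hypothetical): the 4-cycle with its shortest-path metric,
# mapped to the corners of the unit square in the Euclidean plane.
cycle = [0, 1, 2, 3]
corners = {0: (0, 0), 1: (1, 0), 2: (1, 1), 3: (0, 1)}

def d_cycle(a, b):
    k = abs(a - b)
    return min(k, 4 - k)

def d_plane(p, q):
    return math.dist(p, q)

alpha = distortion(cycle, d_cycle, d_plane, corners.get)
```

In this example only the two diagonal pairs are distorted: their cycle distance is 2 while their Euclidean distance is √2, so `alpha` comes out as √2.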

Intrinsic Dimension
- Doubling constant: the minimal λ such that any ball of radius r > 0 can be covered by λ balls of radius r/2.
- Doubling dimension: dim(X) = log_2 λ.
- The problem: the relation between metric dimension and intrinsic dimension.
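For a finite point set, the doubling constant can be bounded from above by brute force. The following is a minimal sketch under stated assumptions: balls are closed, cover centers are restricted to points of the ball, and the cover is built greedily, so the result is an upper bound on λ rather than its exact value. The function names are hypothetical.

```python
def ball(points, d, center, r):
    """Closed ball B(center, r) inside the finite point set."""
    return [p for p in points if d(center, p) <= r]

def greedy_half_cover(points, d, center, r):
    """Number of radius-r/2 balls a greedy pass uses to cover B(center, r)."""
    uncovered = ball(points, d, center, r)
    used = 0
    while uncovered:
        c = uncovered[0]  # the next uncovered point becomes a center
        uncovered = [p for p in uncovered if d(c, p) > r / 2]
        used += 1
    return used

def doubling_constant_upper_bound(points, d):
    """Largest greedy cover size over all centers and all realized radii.

    Only radii equal to some pairwise distance are tried: between two such
    radii the ball stays the same while the covering balls only grow.
    Requires at least two points.
    """
    radii = {d(x, y) for x in points for y in points if x != y}
    return max(greedy_half_cover(points, d, x, r)
               for x in points for r in radii)
```

On a uniform metric (all distances equal to 1) with n points the bound is n, since a ball of radius 1 contains every point while a ball of radius 1/2 contains only its center.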

Previous Results
- Given a λ-doubling finite metric space (X, d) and 0 < γ < 1, its snowflake version (X, d^γ) can be embedded into L_p with distortion and dimension depending only on λ [Assouad 83].
- Conjecture (Assouad): this holds for γ = 1.
- Disproved by Semmes.
- A lower bound of Ω(√log n) on the distortion for L_2, with a matching upper bound [GKL 03].
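The snowflake operation replaces every distance d(x,y) by d(x,y)^γ. A short sketch, with hypothetical helper names, shows that the result is still a metric on a toy point set for γ = 1/2, while raising distances to a power above 1 can break the triangle inequality.

```python
import itertools

def snowflake(d, gamma):
    """Return the distance function d^gamma (a snowflake when 0 < gamma < 1)."""
    return lambda x, y: d(x, y) ** gamma

def is_metric(points, d, tol=1e-12):
    """Check the triangle inequality on every ordered triple of points."""
    return all(d(x, z) <= d(x, y) + d(y, z) + tol
               for x, y, z in itertools.permutations(points, 3))

# Toy example (hypothetical): four points on the real line.
line = [0.0, 1.0, 2.0, 5.0]
d_line = lambda x, y: abs(x - y)
d_half = snowflake(d_line, 0.5)
```

Here `is_metric(line, d_half)` holds, whereas squaring the distances fails already on the triple 0, 1, 2, because 4 > 1 + 1.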

Rephrasing the Question
- Is there a low-distortion embedding for a finite metric space in its intrinsic dimension?
- Main result: Yes.

Main Results
Any finite metric space (X, d) embeds into L_p:
- With distortion O(log^{1+θ} n) and dimension O(dim(X)/θ), for any θ > 0.
- With constant average distortion and dimension O(dim(X)·log(dim(X))).

Additional Result
Any finite metric space (X, d) embeds into L_p:
- With distortion … and dimension …, for all D ≤ (log n)/dim(X).
- In particular, Õ(log^{2/3} n) distortion and dimension into L_2.
- Matches the best known distortion result [KLMN 03] for D = (log n)/dim(X), with dimension O(log n · log(dim(X))).

Distance Oracles
- A compact data structure that approximately answers distance queries.
- For general n-point metrics:
  - [TZ 01] O(k) stretch with O(k·n^{1/k}) bits per label.
- For a finite λ-doubling metric:
  - O(1) average stretch with Õ(log λ) bits per label.
  - O(k) stretch with Õ(λ^{1/k}) bits per label.
  - Follows from a variation on the snowflake embedding (Assouad).

First Result
Thm: For any finite λ-doubling metric space (X, d) on n points and any 0 < θ < 1, there exists an embedding of (X, d) into L_p with distortion O(log^{1+θ} n) and dimension O((log λ)/θ).

Probabilistic Partitions
- P = {S_1, S_2, …, S_t} is a partition of X if the S_i are pairwise disjoint and their union is X.
- P(x) is the cluster containing x.
- P is Δ-bounded if diam(S_i) ≤ Δ for all i.
- A probabilistic partition P is a distribution over a set of partitions.
- A Δ-bounded P is η-padded if for all x ∈ X: Pr[B(x, ηΔ) ⊆ P(x)] ≥ ½.
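To make the notion of a Δ-bounded probabilistic partition concrete, here is a minimal sketch of a well-known random partitioning scheme in the style of [CKR01]: a random radius and a random ordering of the points. It is a stand-in for illustration and is not the uniformly padded local partition constructed in the talk. The function names are hypothetical, and the padding event is taken to be "B(x, ηΔ) lies inside the cluster of x".

```python
import random

def ckr_partition(points, d, delta, rng):
    """One sample of a delta-bounded partition in the CKR style.

    A radius R is drawn uniformly from [delta/4, delta/2] and the points are
    shuffled; each point joins the first shuffled center within distance R.
    Two points of a cluster are both within R of the center, so the cluster
    diameter is at most 2R <= delta.
    """
    radius = rng.uniform(delta / 4, delta / 2)
    order = list(points)
    rng.shuffle(order)
    cluster_of = {}
    for x in points:
        # x itself is at distance 0, so a center is always found.
        cluster_of[x] = next(c for c in order if d(c, x) <= radius)
    return cluster_of

def padding_frequency(points, d, delta, eta, x, trials, rng):
    """Empirical frequency of the event that B(x, eta*delta) stays in P(x)."""
    hits = 0
    for _ in range(trials):
        cluster_of = ckr_partition(points, d, delta, rng)
        near = [y for y in points if d(x, y) <= eta * delta]
        hits += all(cluster_of[y] == cluster_of[x] for y in near)
    return hits / trials
```

Calling `padding_frequency` with increasing values of `eta` gives a rough picture of how quickly the padding probability of a point decays on a given data set.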

η-padded Partitions
- The parameter η determines the quality of the embedding.
- [Bartal 96]: η = Ω(1/log n) for any metric space.
- [CKR01+FRT03]: improved partitions with η(x) = 1/log(ρ(x,Δ)), where ρ(x,r) denotes the local growth rate of x at radius r.
- [GKL 03]: η = Ω(1/log λ) for λ-doubling metrics.
- [KLMN 03]: used to embed general + doubling metrics into L_p: distortion O((log λ)^{1−1/p}·(log n)^{1/p}), dimension O(log² n).

Uniform Local Padding Lemma
- A local padding: the padding probability for x is independent of the partition outside B(x, Δ).
- A uniform padding: the padding parameter η(x) is equal for all points in the same cluster.
- There exists a Δ-bounded probabilistic partition with local uniform padding parameter η(x):
  - η(x) > Ω(1/log λ)
  - η(x) > Ω(1/log(ρ(x,Δ)))
[Figure: points v_1, v_2, v_3 in clusters C_1, C_2, with padding parameters η(v_1) and η(v_3).]

Plan:
- A simpler result of:
  - Distortion O(log n).
  - Dimension O(loglog n · log λ).
- Obtaining the lower dimension of O(log λ).
- Brief overview of:
  - Constant average distortion.
  - Distortion-dimension tradeoff.

Embedding into One Dimension
- For each scale i ∈ Z, create a uniformly padded local probabilistic 8^i-bounded partition P_i.
- For each cluster S, choose σ_i(S) ~ Ber(½) i.i.d.
- f_i(x) = σ_i(P_i(x)) · min{η_i^{-1}(x) · d(x, X\P_i(x)), 8^i}
- Deterministic upper bound: |f(x) − f(y)| ≤ O(log n · d(x,y)), where f sums the f_i over the scales.
[Figure: a point x inside its cluster P_i(x), with the distance d(x, X\P_i(x)) to the outside of the cluster.]
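The single-scale coordinate f_i can be evaluated directly from its formula once a partition, the cluster bits σ_i and the padding parameter are given. The sketch below does that on a toy input. The partition, the bits and the constant η are made-up values for illustration, and the function name is hypothetical.

```python
import math

def coordinate_at_scale(x, points, d, cluster_of, sigma, eta, delta):
    """f_i(x) = sigma(P(x)) * min(d(x, X \\ P(x)) / eta(x), delta)."""
    outside = [y for y in points if cluster_of[y] != cluster_of[x]]
    # If the cluster is the whole space, the distance to its complement
    # is treated as infinite, so the minimum is delta.
    dist_out = min((d(x, y) for y in outside), default=math.inf)
    return sigma[cluster_of[x]] * min(dist_out / eta(x), delta)

# Toy input (all values are assumptions made for this example).
points = [0, 1, 2, 4]
d = lambda a, b: abs(a - b)
cluster_of = {0: "A", 1: "A", 2: "A", 4: "B"}  # an 8-bounded partition
sigma = {"A": 1, "B": 0}                       # one draw of the Ber(1/2) bits
eta = lambda x: 0.5                            # constant padding parameter
values = {x: coordinate_at_scale(x, points, d, cluster_of, sigma, eta, 8)
          for x in points}
```

Inside cluster A the values are 8, 6 and 4 for the points 0, 1 and 2: neighbouring points differ by 2 = d(x,y)/η, which is the per-scale Lipschitz behaviour that the upper bound relies on. The point 4 gets value 0 because its cluster drew σ = 0.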

Lower Bound - Overview
- Create an r_i-net for all integers i.
- Define a success event for a pair (u,v) in the r_i-net with d(u,v) ≈ 8^i: having contribution > 8^i/4 for many coordinates.
- In every coordinate, there is a constant probability of having contribution for a net pair (u,v).
- Use the Lovász Local Lemma.
- Show the lower bound for other pairs.
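An r-net of a finite metric space can be built greedily. The sketch below uses the common convention that net points are pairwise at least r apart and every point is within distance less than r of the net. The talk may use a slightly different convention, and the function names are hypothetical.

```python
def greedy_net(points, d, r):
    """Greedy r-net: centers are pairwise at least r apart, and every
    point lies within distance less than r of some center."""
    net = []
    for p in points:
        if all(d(p, c) >= r for c in net):
            net.append(p)
    return net

def nearest_net_point(x, net, d):
    """The net point closest to x, used to pass from a pair (x, y)
    to a nearby net pair (u, v)."""
    return min(net, key=lambda c: d(x, c))
```

For the points 0, 1, …, 9 on a line and r = 3, the greedy pass in this order selects 0, 3, 6 and 9.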

Lower Bound – Other Pairs?
- Let x, y be some pair with d(x,y) ≈ 8^i, and let u, v be the nearest points in the r_i-net to x, y.
- Suppose that |f(u) − f(v)| > 8^i/4.
- We want to choose the net such that |f(u) − f(x)| < 8^i/16, so choose r_i = 8^i/(16·log n).
- Using the upper bound: |f(u) − f(x)| ≤ log n · d(u,x) ≤ 8^i/16.
- |f(x) − f(y)| ≥ |f(u) − f(v)| − |f(u) − f(x)| − |f(v) − f(y)| ≥ 8^i/4 − 2·8^i/16 = 8^i/8.
[Figure: x and y with their nearest net points u and v, each at distance at most 8^i/(16 log n).]

- Consider an r_i-net pair (u,v). We can assume that 8^i ≈ d(u,v)/4.
- It must be that P_i(u) ≠ P_i(v).
- With probability ½: d(u, X\P_i(u)) ≥ η_i·8^i.
- With probability ¼: σ_i(P_i(u)) = 1 and σ_i(P_i(v)) = 0.
- Lower bound: when both events occur, f_i(u) = 8^i and f_i(v) = 0.

Lower Bound – Net Pairs
- d(u,v) ≈ 8^i. Consider R, a quantity determined by the higher scales.
- If R < 8^i/2: with prob. 1/8, f_i(u) − f_i(v) ≥ 8^i.
- If R ≥ 8^i/2: with prob. 1/4, f_i(u) = f_i(v) = 0.
- In any case, the pair gets a contribution with constant probability.
- Lower scales do not matter.
- The good event for a pair in scale i depends on higher scales, but has constant probability given any outcome for them. It is oblivious to lower scales.
[Figure: u and v, with radius η_i(u)·8^i marked around u.]

Local Lemma
Lemma (Lovász): Let A_1, …, A_n be bad events. Let G = (V,E) be a directed graph with vertices corresponding to events and out-degree at most d. Let c : V → N be a rating function of events such that if (A_i, A_j) ∈ E then c(A_i) ≥ c(A_j). If
  Pr[A_i | ∧_{j∈Q} ¬A_j] ≤ p for all Q ⊆ {j : (A_i, A_j) ∉ E and c(A_i) ≥ c(A_j)}
and
  e·p·(d+1) ≤ 1,
then
  Pr[∧_i ¬A_i] > 0.
Rating = radius of scale.

Lower Bound – Net Pairs
- A success event E(u,v) for a net pair u, v: there is contribution from at least 1/16 of the coordinates.
- Locality of the partition: the net pair depends only on nearby points, at distance < 8^i.
- With doubling constant λ and r_i ≈ 8^i/log n, there are at most λ^{loglog n} such points, so d = λ^{loglog n}.
- Taking D = O(log λ · loglog n) coordinates will give roughly e^{−D} = λ^{−loglog n} failure probability.
- By the local lemma, there exists an embedding such that E(u,v) holds for all net pairs.
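The counting on this slide can be turned into a back-of-the-envelope calculation. The sketch below assumes the standard symmetric Lovász Local Lemma condition e·p·(d+1) ≤ 1, takes the failure probability to be exactly e^{−D} (the slide says "roughly"), and uses base-2 logarithms. All three are simplifying assumptions, and the function names are hypothetical.

```python
import math

def lll_degree(lam, n):
    """Dependency bound d = lam ** loglog n, with logarithms taken base 2."""
    return lam ** math.log2(math.log2(n))

def coordinates_for_lll(lam, n):
    """Smallest integer D with e * exp(-D) * (d + 1) <= 1,
    which is equivalent to D >= 1 + ln(d + 1)."""
    d = lll_degree(lam, n)
    return math.ceil(1 + math.log(d + 1))
```

For λ = 4 and n = 2^16 this gives d = 4^4 = 256 and D = 7, which illustrates the O(log λ · loglog n) growth of the dimension in the simpler result.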

Obtaining Lower Dimension
- To use the LLL, the probability of failing in more than 15/16 of the coordinates must be < λ^{−loglog n}.
- Instead of taking more coordinates, increase the success probability in each coordinate.
- If the probability of obtaining contribution in each coordinate is > 1 − 1/log n, it is enough to take O(log λ) coordinates.
- Similarly, if the failure probability in each coordinate is < log^{−θ} n, it is enough to take O((log λ)/θ) coordinates.
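The effect of raising the per-coordinate success probability can be seen with an exact binomial tail. The sketch below assumes the coordinates fail independently with the same probability q, which is a simplification of the setting in the talk. The function names are hypothetical.

```python
from math import comb

def prob_too_many_failures(D, q):
    """P[more than 15/16 of D independent coordinates fail],
    each coordinate failing with probability q."""
    total = 0.0
    for successes in range(D + 1):
        if 16 * successes < D:  # fewer than D/16 successes
            failures = D - successes
            total += comb(D, successes) * (1 - q) ** successes * q ** failures
    return total

def min_coordinates(q, target):
    """Smallest D whose failure probability is at most target."""
    D = 1
    while prob_too_many_failures(D, q) > target:
        D += 1
    return D
```

With a target failure probability of 10^-3, a per-coordinate failure probability of 1/2 needs 10 coordinates, while a failure probability of 1/100 needs only 2. This is the trade that the slide exploits.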

Using Several Scales
- Create nets only every θ·loglog n scales.
- A pair (x,y) in scale i (i.e. d(x,y) ≈ 8^i) will find a close net pair in the nearest smaller scale i′.
- 8^i < log^θ n · 8^{i′}, so we lose a factor of log^θ n in the distortion.
- Consider scales i − θ·loglog n, …, i.
[Figure: scale axis marking i − θ·loglog n, i, and i + θ·loglog n.]

Using Several Scales
- Take u, v in the net with d(u,v) ≈ 8^i.
- A success in one of these scales will give contribution > 8^{i − θ·loglog n} = 8^i / log^θ n.
- The success for u, v in each scale is:
  - Unaffected by events in higher scales.
  - Independent of events far away in the same scale.
  - Oblivious to events in lower scales.
- Probability that all scales failed < (7/8)^{θ·loglog n}.
- Take only D = O((log λ)/θ) coordinates.
- Lose a factor of log^θ n in the distortion.
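The bound on the probability that all scales fail can be rewritten as an inverse power of log n, which is the form needed to get away with O((log λ)/θ) coordinates. The derivation below leaves the base b of the inner logarithm as a parameter, since the transcript does not state it. The symbol b is an assumption introduced here.

```latex
\left(\tfrac{7}{8}\right)^{\theta \log_b \log n}
  = b^{-\theta \log_b(8/7)\,\log_b \log n}
  = (\log n)^{-\theta \log_b(8/7)},
\qquad
\log_2\tfrac{8}{7} \approx 0.19,
\quad
\log_8\tfrac{8}{7} \approx 0.064 .
```

In either case the failure probability per coordinate is log^{−cθ} n for a constant c > 0, so the constant is absorbed into the O((log λ)/θ) bound on the number of coordinates.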

Constant Average Distortion
- Scaling distortion: for every 0 < ε < 1, at most an ε-fraction of the pairs have distortion > polylog(1/ε).
- Upper bound of log(1/ε), by standard techniques.
- Lower bound:
  - Define a net for any scale i > 0 and ε = exp{−8^j}.
  - Every pair (x,y) needs a contribution that depends on:
    - d(x,y).
    - The ε-value of x, y.
  - Sieve the nets to avoid dependencies between different scales and different values of ε.
  - Show that if a net pair succeeded, the points near it will also succeed.
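Average distortion and scaling distortion are both statistics of the per-pair distortions. The sketch below uses one common convention, assumed here rather than quoted from the talk: rescale the map to be non-contractive, take the expansion of each pair as its distortion, and then either average over pairs or discard the worst ε-fraction. The function names and the toy map are hypothetical.

```python
import itertools

def pair_distortions(points, d_X, d_Y, f):
    """Per-pair distortion of f after rescaling it to be non-contractive."""
    pairs = list(itertools.combinations(points, 2))
    ratios = [d_Y(f(x), f(y)) / d_X(x, y) for x, y in pairs]
    smallest = min(ratios)  # rescale so the smallest ratio becomes 1
    return [r / smallest for r in ratios]

def average_distortion(dists):
    """Mean of the per-pair distortions."""
    return sum(dists) / len(dists)

def distortion_outside_fraction(dists, eps):
    """Largest distortion left after discarding the worst eps-fraction of pairs."""
    ranked = sorted(dists)
    keep = len(ranked) - int(eps * len(ranked))
    return ranked[max(keep, 1) - 1]

# Toy example (hypothetical): four points on a line, the last one pushed away.
pts = [0, 1, 2, 3]
image = {0: 0, 1: 1, 2: 2, 3: 6}
dist = lambda a, b: abs(a - b)
dists = pair_distortions(pts, dist, dist, image.get)
```

In this example the worst-case distortion is 4, but discarding the single worst pair out of six already brings it down to 2.5, and the average over all pairs is below 2.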

Constant Average Distortion
- Lower bound, cont.:
  - The local lemma graph depends on ε, so use the general case of the local lemma.
  - For a net pair (u,v) in scale 8^i, consider scales 8^{i − loglog(1/ε)}, …, 8^{i − loglog(1/ε)/2}.
  - Requires dimension O(log λ · loglog λ).
  - The net depends on λ.

Distortion-Dimension Tradeoff
- Distortion: …
- Dimension: …
- D ≤ (log n)/log λ.
- Instead of assigning all scales to a single coordinate:
  - For each point x:
    - Divide the scales into D bunches of coordinates; in each, create a hierarchical partition.
- The upper bound needs the x, y scales to be in the same coordinates.

Conclusion
- Main result: embedding metrics into their intrinsic dimension.
- Open problem: the best distortion in dimension O(log λ).
- Dimension reduction in L_2: for a doubling subset of L_2, is there an embedding into L_2 with O(1) distortion and dimension O(dim(X))?
- For p > 2 there is a doubling metric space requiring dimension at least Ω(log n) for embedding into L_p with distortion O(log^{1/p} n).