Testing Surface Area Ryan O’Donnell Carnegie Mellon & Boğaziçi University joint work with Pravesh Kothari (UT Austin), Amir Nayyeri (Oregon), Chenggang.

Slides:



Advertisements
Similar presentations
The Equivalence of Sampling and Searching Scott Aaronson MIT.
Advertisements

Routing Complexity of Faulty Networks Omer Angel Itai Benjamini Eran Ofek Udi Wieder The Weizmann Institute of Science.
Optimal Bounds for Johnson- Lindenstrauss Transforms and Streaming Problems with Sub- Constant Error T.S. Jayram David Woodruff IBM Almaden.
Subspace Embeddings for the L1 norm with Applications Christian Sohler David Woodruff TU Dortmund IBM Almaden.
When Is Nearest Neighbors Indexable? Uri Shaft (Oracle Corp.) Raghu Ramakrishnan (UW-Madison)
Long cycles, short cycles, min-degree subgraphs, and feedback arc sets in Eulerian digraphs Raphael Yuster joint work with Asaf Shapira Eilat 2012.
Ryan ODonnell Carnegie Mellon University Karl Wimmer CMU & Duquesne Amir Shpilka Technion Rocco Servedio Columbia Parikshit Gopalan UW & Microsoft SVC.
Ryan Donnell Carnegie Mellon University O. 1. Describe some TCS results requiring variants of the Central Limit Theorem. Talk Outline 2. Show a flexible.
05/11/2005 Carnegie Mellon School of Computer Science Aladdin Lamps 05 Combinatorial and algebraic tools for multigrid Yiannis Koutis Computer Science.
The Capacity of Wireless Networks
1 Motion and Manipulation Configuration Space. Outline Motion Planning Configuration Space and Free Space Free Space Structure and Complexity.
Shortest Vector In A Lattice is NP-Hard to approximate
Section 1.4.  “open interval”- an interval where the function values at the endpoints are undefined (a, b)  “closed interval”- an interval where the.
One time-travelling bit is as good as logarithmically many
Inapproximability of MAX-CUT Khot,Kindler,Mossel and O ’ Donnell Moshe Ben Nehemia June 05.
Yang Cai Oct 01, An overview of today’s class Myerson’s Auction Recap Challenge of Multi-Dimensional Settings Unit-Demand Pricing.
Deterministic vs. Non-Deterministic Graph Property Testing Asaf Shapira Tel-Aviv University Joint work with Lior Gishboliner.
Gillat Kol joint work with Ran Raz Locally Testable Codes Analogues to the Unique Games Conjecture Do Not Exist.
High Cycle Fatigue (HCF) analysis; cont.
Monte Carlo Methods and Statistical Physics
Stokes Theorem. Recall Green’s Theorem for calculating line integrals Suppose C is a piecewise smooth closed curve that is the boundary of an open region.
Kevin Matulef MIT Ryan O’Donnell CMU Ronitt Rubinfeld MIT Rocco Servedio Columbia.
1 12. Principles of Parameter Estimation The purpose of this lecture is to illustrate the usefulness of the various concepts introduced and studied in.
Machine Learning Week 2 Lecture 1.
A general agnostic active learning algorithm
1 Heat flow and a faster Algorithm to Compute the Surface Area of a Convex Body Hariharan Narayanan, University of Chicago Joint work with Mikhail Belkin,
Computing the Fréchet Distance Between Folded Polygons
Dictator tests and Hardness of approximating Max-Cut-Gain Ryan O’Donnell Carnegie Mellon (includes joint work with Subhash Khot of Georgia Tech)
A Randomized Polynomial-Time Simplex Algorithm for Linear Programming Daniel A. Spielman, Yale Joint work with Jonathan Kelner, M.I.T.
Mean Shift A Robust Approach to Feature Space Analysis Kalyan Sunkavalli 04/29/2008 ES251R.
16 MULTIPLE INTEGRALS.
On Testing Convexity and Submodularity Michal Parnas Dana Ron Ronitt Rubinfeld.
1 On the Benefits of Adaptivity in Property Testing of Dense Graphs Joint work with Mira Gonen Dana Ron Tel-Aviv University.
Learning and testing k-modal distributions Rocco A. Servedio Columbia University Joint work (in progress) with Ilias Diakonikolas UC Berkeley Costis Daskalakis.
Random Variable and Probability Distribution
Introduction to Monte Carlo Methods D.J.C. Mackay.
How complex ~ Developed by Andrew Derer ~ MathScience Innovation Center, Richmond, VA.
Multiple Integrals 12. Double Integrals over General Regions 12.3.
Copyright © Cengage Learning. All rights reserved Double Integrals over General Regions.
DOUBLE INTEGRALS OVER GENERAL REGIONS
1 As we have seen in section 4 conditional probability density functions are useful to update the information about an event based on the knowledge about.
A Conjecture for the Packing Density of 2413 Walter Stromquist Swarthmore College (joint work with Cathleen Battiste Presutti, Ohio University at Lancaster)
Bell Ringer x – 1.8 = x = x =  Solve. 1. x + 3 = x = 4 x =
PROBABILITY AND STATISTICS FOR ENGINEERING Hossein Sameti Department of Computer Engineering Sharif University of Technology Principles of Parameter Estimation.
Distribute and Combine Like Terms Applications. What is the area of the following shape? 5 2x-3 1.
SECTION 12.5 TRIPLE INTEGRALS.
Unique Games Approximation Amit Weinstein Complexity Seminar, Fall 2006 Based on: “Near Optimal Algorithms for Unique Games" by M. Charikar, K. Makarychev,
Cube A cube[1] is a three-dimensional solid object bounded by six square faces, facets or sides, with three meeting at each vertex[1]three-dimensionalsquarefacetsvertex.
Area/Sigma Notation Objective: To define area for plane regions with curvilinear boundaries. To use Sigma Notation to find areas.
Continuous Random Variables Lecture 24 Section Tue, Oct 18, 2005.
Perimeter & Area. Today’s Objectives:  Learn what it means to find perimeter and area.  Practice finding or estimating the perimeter and area.
CS Statistical Machine learning Lecture 7 Yuan (Alan) Qi Purdue CS Sept Acknowledgement: Sargur Srihari’s slides.
Math 3121 Abstract Algebra I
Eigenvalues and Eigenvectors
Arcs, Sectors & Segments
Week 3 2. Review of inner product spaces
Stochastic Streams: Sample Complexity vs. Space Complexity
New Characterizations in Turnstile Streams with Applications
Dana Ron Tel Aviv University
Review of Shapes OK, so we are going to try and name the following shapes: No calling out, I want you to write down the name of the shapes We will take.
Surface Area and Volume
Area and Volume Area is the amount of space contained in a two-dimensional figure Volume is the amount of space in a three-dimensional figure.
CSCI B609: “Foundations of Data Science”
Hariharan Narayanan, University of Chicago Joint work with
Dimension versus Distortion a.k.a. Euclidean Dimension Reduction
11. Conditional Density Functions and Conditional Expected Values
11. Conditional Density Functions and Conditional Expected Values
1 cm 1 cm 1 cm.

Presentation transcript:

Testing Surface Area Ryan O’Donnell Carnegie Mellon & Boğaziçi University joint work with Pravesh Kothari (UT Austin), Amir Nayyeri (Oregon), Chenggang Wu (Tsinghua)

In 2 dimensions… “surface area” is called “perimeter” BTW: This is one shape, that happens to be disconnected

In 1 dimension… “surface area” equals “# of endpoints” BTW: This is one shape, that happens to be disconnected = “2 × # of intervals”

Our Theorem Given S,, and query access to F ⊂ [0,1] n, there’s an O(1/)-query (nonadaptive) algorithm s.t.: Says YES whp if perim(F) ≤ S; Says NO whp if F is -far from all G with perim(G) ≤ 1.28 S. vol(F∆G) >

Our Theorem Given S,, and query access to F ⊂ [0,1] n, there’s an O(1/)-query (nonadaptive) algorithm s.t.: Says YES whp if perim(F) ≤ S; Says NO whp if F is -far from all G with perim(G) ≤ 1.28 S. No Curse Of Dimensionality! No assumptions about F!

Our Theorem Given S,, and query access to F ⊂ [0,1] n, there’s an O(1/) -query (nonadaptive) algorithm s.t.: Says YES whp if perim(F) ≤ S; Says NO whp if F is -far from all G with perim(G) ≤ (κ n +δ ) S. 1 δ 2.5

Prior work whodim.queriesapprox factor κ [KR98] [BBBY12] [KNOW14] [Nee14] 1O(1/)1/ 1 O(1/ 4 ) 1 n O(1/) < 1.28 ∀ n any 1+δ if n=1 nO(1/) any 1+δ 1 O(1/ 3.5 ) 1

Prior work whodim.queriesapprox factor κ [KR98] [BBBY12] [KNOW14] [Nee14] 1O(1/)1/ 1 O(1/ 4 ) 1 n O(1/) < 1.28 ∀ n any 1+δ if n=1 nO(1/) any 1+δ 1 O(1/ 3.5 ) 1 Remark: We obtained same results in Gaussian space. So did Neeman.

Property Testing framework is necessary Theorem [BNN06]: If F ⊂ [0,1] n promised to be convex, can estimate perim(F) to factor 1+δ whp using poly(n/δ) queries. No “-far” stuff. We don’t assume convexity, curvature bounds, connectedness — nothing.

Property Testing framework is necessary

Soundness theorem challenge: Cut string, smooth side, fill in holes.

Algorithm: Buffon’s Needle Crofton Formula. Let F ⊂ [0,1] n Pick x ~ ℝ n / ℤ n uniformly. Pick y ~ B λ (x). Line segment xy called “the needle”. Then… ℝn / ℤn.ℝn / ℤn. x y E[ #( xy∩ ∂ F ) ] = c n · λ · perim(F) F

Algorithm: Buffon’s Needle Crofton Formula. Let F ⊂ [0,1] n Pick x ~ ℝ n / ℤ n uniformly. Pick y ~ B λ (x). Line segment xy called “the needle”. Then… ℝn / ℤn.ℝn / ℤn. x y E[ #( xy∩ ∂ F ) ] = c n · λ · perim(F) explicit dimension- dependent constant, Θ(n –1/2 ) F

x y E[ #( xy∩ ∂ F ) ] = c n · λ · perim(F) F E[ 1 {x ∈ F, y ∉ F, or vice versa} ] ≤ Pr[ 1 F (x)≠1 F (y) ] = NS F (λ) := The “Noise Sensitivity” of F:

Algorithm and Completeness Recall: NS F (λ) = Pr [ 1 F (x) ≠ 1 F (y) ] x ~ ℝ n / ℤ n y ~ B λ (x) ’ ≤ c n · λ · perim(F) 0. Given S,, set λ such that =.01 · c n · λ · S. 1. Empirically estimate NS F (λ). 2. Say YES iff ≤ (1 +δ ) · c n · λ · S. Query complexity, Completeness: ✔

Soundness? Recall: NS F (λ) = Pr [ 1 F (x) ≠ 1 F (y) ] x ~ ℝ n / ℤ n y ~ B λ (x) ’ ≤ c n · λ · perim(F) 0. Given S,, set λ such that =.01 · c n · λ · S. 1. Empirically estimate NS F (λ). 2. Say YES iff ≤ (1 +δ ) · c n · λ · S. Query complexity, Completeness: ✔

Recall: NS F (λ) = Pr [ 1 F (x) ≠ 1 F (y) ] x ~ ℝ n / ℤ n y ~ B λ (x) ’ ≤ c n · λ · perim(F) Soundness? Q: If NS F (λ) ≤ c n · λ · S, is perim(F) ≾ S? A: Not necessarily. (F may “wiggle at a scale ≪ λ”.) Q: I.e., is perim(F) ≾ (c n λ) –1 · NS F (λ) always? Q: Is F at least close to some G with perim(G) ≾ (c n λ) –1 · NS F (λ) ? YES!

Recall: NS F (λ) = Pr [ 1 F (x) ≠ 1 F (y) ] x ~ ℝ n / ℤ n y ~ B λ (x) ’ ≤ c n · λ · perim(F) Soundness? Our Theorem: For every F ⊂ ℝ n / ℤ n and every λ, F is O(NS F (λ))-close to a set G with perim(G) ≤ C n λ –1 · NS F (λ). (Here C n /c n =: κ n ∈ [1, 4/π] for all n.)

Our Theorem: For every F ⊂ ℝ n / ℤ n and every λ, F is O(NS F (λ))-close to a set G with perim(G) ≤ C n λ –1 · NS F (λ). Given F, how do you “find” G?

Finding G from F F

1. Define g : ℝ n / ℤ n → [0,1] by y~B λ (x) g(x) = Pr [ y ∈ F ]. F

Finding G from F 1. Define g : ℝ n / ℤ n → [0,1] by y~B λ (x) g(x) = Pr [ y ∈ F ]. 2. Choose θ ∈ [0,1] from the triangular distribution: 01 2 pdf: φ θ 3. G := {x : g(x) > θ}.

1-Slide Sketch of Analysis G being O(NS F (λ))-close to F (whp) is easy. Theorem: E[ perim(G) ?

1-Slide Sketch of Analysis G being O(NS F (λ))-close to F (whp) is easy. Theorem: E[ perim(G) ] ?

1-Slide Sketch of Analysis G being O(NS F (λ))-close to F (whp) is easy. Theorem: E[ perim(G) ] = E[ φ θ (g(x)) · ‖∇ g(x) ‖ ] x~ℝn/ℤnx~ℝn/ℤn (“Coarea Formula”) Theorem: E[ perim(G) ] ≤ Lip(g) · E[ φ θ (g(x)) ] Theorem: E[ perim(G) ] ≤ Lip(g) · 4 NS F (λ) Theorem: E[ perim(G) ] ≤ O(n –1/2 ) λ –1 · 4 NS F (λ) Theorem: E[ perim(G) ] = C n λ –1 · NS F (λ)

[Neeman 14]’s version Picks needles of Gaussian length, rather than uniform on a ball. Uses a more clever pdf φ θ.

Thanks!