Oracle Separation of BQP and PH

Slides:



Advertisements
Similar presentations
Quantum Lower Bounds You probably Havent Seen Before (which doesnt imply that you dont know OF them) Scott Aaronson, UC Berkeley 9/24/2002.
Advertisements

Quantum Versus Classical Proofs and Advice Scott Aaronson Waterloo MIT Greg Kuperberg UC Davis | x {0,1} n ?
Quantum Software Copy-Protection Scott Aaronson (MIT) |
Limitations of Quantum Advice and One-Way Communication Scott Aaronson UC Berkeley IAS Useful?
How to Solve Longstanding Open Problems In Quantum Computing Using Only Fourier Analysis Scott Aaronson (MIT) For those who hate quantum: The open problems.
Scott Aaronson BQP und PH A tale of two strong-willed complexity classes… A 16-year-old quest to find an oracle that separates them… A solution at lastbut.
Oracles Are Subtle But Not Malicious Scott Aaronson University of Waterloo.
The Equivalence of Sampling and Searching Scott Aaronson MIT.
Scott Aaronson (MIT) BQP and PH A tale of two strong-willed complexity classes… A 16-year-old quest to find an oracle that separates them… A solution at.
Solving Hard Problems With Light Scott Aaronson (Assoc. Prof., EECS) Joint work with Alex Arkhipov vs.
On the Complexity of Parallel Hardness Amplification for One-Way Functions Chi-Jen Lu Academia Sinica, Taiwan.
Pseudorandomness from Shrinkage David Zuckerman University of Texas at Austin Joint with Russell Impagliazzo and Raghu Meka.
Linear-Degree Extractors and the Inapproximability of Max Clique and Chromatic Number David Zuckerman University of Texas at Austin.
Complexity Theory Lecture 6
Average-case Complexity Luca Trevisan UC Berkeley.
How to Fool People to Work on Circuit Lower Bounds Ran Raz Weizmann Institute & Microsoft Research.
Approximate List- Decoding and Hardness Amplification Valentine Kabanets (SFU) joint work with Russell Impagliazzo and Ragesh Jaiswal (UCSD)
Foundations of Cryptography Lecture 10 Lecturer: Moni Naor.
Gillat Kol joint work with Ran Raz Competing Provers Protocols for Circuit Evaluation.
Scott Aaronson (MIT) Forrelation A problem admitting enormous quantum speedup, which I and others have studied under various names over the years, which.
Linear Systems With Composite Moduli Arkadev Chattopadhyay (University of Toronto) Joint with: Avi Wigderson TexPoint fonts used in EMF. Read the TexPoint.
Hardness amplification proofs require majority Ronen Shaltiel University of Haifa Joint work with Emanuele Viola Columbia University June 2008.
Two Sides of the Coin Problem Gil Cohen Joint with: Anat Ganor and Ran Raz.
Using Nondeterminism to Amplify Hardness Emanuele Viola Joint work with: Alex Healy and Salil Vadhan Harvard University.
Arithmetic Hardness vs. Randomness Valentine Kabanets SFU.
Hardness amplification proofs require majority Emanuele Viola Columbia University Work done at Harvard, IAS, and Columbia Joint work with Ronen Shaltiel.
1 Recap (I) n -qubit quantum state: 2 n -dimensional unit vector Unitary op: 2 n  2 n linear operation U such that U † U = I (where U † denotes the conjugate.
Scott Aaronson (MIT) Andris Ambainis (U. of Latvia) Forrelation: A Problem that Optimally Separates Quantum from Classical Computing H H H H H H f |0 
Correlation testing for affine invariant properties on Shachar Lovett Institute for Advanced Study Joint with Hamed Hatami (McGill)
One Complexity Theorist’s View of Quantum Computing Lance Fortnow NEC Research Institute.
On Constructing Parallel Pseudorandom Generators from One-Way Functions Emanuele Viola Harvard University June 2005.
Quantum Computing MAS 725 Hartmut Klauck NTU
On approximate majority and probabilistic time Emanuele Viola Institute for advanced study Work done during Ph.D. at Harvard University June 2007.
Polynomials Emanuele Viola Columbia University work partially done at IAS and Harvard University December 2007.
Quantum algorithms vs. polynomials and the maximum quantum-classical gap in the query model.
1 Introduction to Quantum Information Processing CS 467 / CS 667 Phys 467 / Phys 767 C&O 481 / C&O 681 Richard Cleve DC 3524 Course.
1 Introduction to Quantum Information Processing CS 467 / CS 667 Phys 667 / Phys 767 C&O 481 / C&O 681 Richard Cleve DC 653 Lecture.
Forrelation: A Problem that Optimally Separates Quantum from Classical Computing.
CS151 Complexity Theory Lecture 16 May 20, The outer verifier Theorem: NP  PCP[log n, polylog n] Proof (first steps): –define: Polynomial Constraint.
Pseudorandom Bits for Constant-Depth Circuits with Few Arbitrary Symmetric Gates Emanuele Viola Harvard University June 2005.
List Decoding Using the XOR Lemma Luca Trevisan U.C. Berkeley.
Hardness amplification proofs require majority Emanuele Viola Columbia University Work also done at Harvard and IAS Joint work with Ronen Shaltiel University.
1 Introduction to Quantum Information Processing QIC 710 / CS 667 / PH 767 / CO 681 / AM 871 Richard Cleve DC 2117 Lectures
Pseudorandomness: New Results and Applications Emanuele Viola IAS April 2007.
Richard Cleve DC 2117 Introduction to Quantum Information Processing CS 467 / CS 667 Phys 667 / Phys 767 C&O 481 / C&O 681 Lecture.
Information Complexity Lower Bounds
Derandomization & Cryptography
Randomness and Computation
New Characterizations in Turnstile Streams with Applications
Dana Ron Tel Aviv University
Circuit Lower Bounds A combinatorial approach to P vs NP
Umans Complexity Theory Lectures
From dense to sparse and back again: On testing graph properties (and some properties of Oded)
Pseudorandomness when the odds are against you
Background: Lattices and the Learning-with-Errors problem
Tight Fourier Tails for AC0 Circuits
Pseudo-derandomizing learning and approximation
Linear sketching with parities
Linear sketching over
Linear sketching with parities
Classical Algorithms from Quantum and Arthur-Merlin Communication Protocols Lijie Chen MIT Ruosong Wang CMU.
Indistinguishability by adaptive procedures with advice, and lower bounds on hardness amplification proofs Aryeh Grinberg, U. Haifa Ronen.
CS21 Decidability and Tractability
Emanuele Viola Harvard University June 2005
Switching Lemmas and Proof Complexity
Oracle Separation of BQP and PH
On Derandomizing Algorithms that Err Extremely Rarely
Stronger Connections Between Circuit Analysis and Circuit Lower Bounds, via PCPs of Proximity Lijie Chen Ryan Williams.
Emanuele Viola Harvard University October 2005
Pseudorandomness: New Results and Applications
Presentation transcript:

Oracle Separation of BQP and PH Avishay Tal (Stanford University) joint with Ran Raz (Princeton University)  © Kevin Hong for Quanta Magazine

The Landscape of Complexity Classes PSPACE PH NP coNP BPP P BQP?

Where does BQP fit in the landscape? BQP: Bounded Error Quantum Polynomial Time We know: BPP ⊆ BQP ⊆ PSPACE Oracle Separations: ∃oracle 𝐴: 𝐍 𝐏 𝑨 ⊈𝐁𝐐 𝐏 𝑨 [BBBV’97] ∃oracle 𝐴: 𝐁𝐐 𝐏 𝑨 ⊈𝐁𝐏 𝐏 𝑨 [BV’93] ∃oracle 𝐴: 𝐁𝐐 𝐏 𝑨 ⊈𝐌 𝐀 𝑨 [Watrous’00] Could it be possible that BQP ⊆ PH ? Could it be possible that BQP ⊆ AM ?

Our Main Result: BQP vs. PH Recall: a language 𝐿 in PH iff there exists a constant 𝑘, and a poly-time computable relation 𝑅 s.t. 𝑥∈𝐿 ⟺ ∃𝑦 1 ∀ 𝑦 2 ∃ 𝑦 3 … 𝑄 𝑘 𝑦 𝑘 :𝑅 𝑥, 𝑦 1 , …, 𝑦 𝑘 𝑦 1 + 𝑦 2 + … + 𝑦 𝑘 ≤poly 𝑥 Our Main Result: ∃ oracle 𝐴: 𝐁𝐐 𝐏 𝑨 ⊈ 𝐏𝐇 𝑨 *In fact: BQP cannot be captured by 𝑘 alternations, as long as 𝑦 1 + … + 𝑦 𝑘 ≤exp⁡(|𝑥|/5𝑘)

The Black-Box/Query Model 𝑖=1 𝑁 𝛼 𝑖 | 𝑖 𝑖=1 𝑁 𝛼 𝑖 𝑥 𝑖 | 𝑖 𝑥∈ ±1 𝑁 𝑖 𝑥 𝑖 Complexity measure: number of queries to the black box. Deterministic Query Complexity = Decision Tree Complexity Quantum Query Complexity = Queries are made in superposition PH analog = AC0 circuits Known reductions: Black-box separations imply oracle separations

The Pseudorandomness Setting f Def’n: a distribution 𝐷 is pseudorandom against a class of functions 𝒞 if ∀𝑓∈𝒞: 𝐄 𝑥~𝐷 𝑓 𝑥 ≈ 𝐄 𝑥∼𝑈 [𝑓 𝑥 ]

The Pseudorandomness Setting [Aaronson’10, Fefferman-Shaltiel-Umans-Viola’12] Can you find a distribution which is pseudorandom for AC0 but not pseudorandom for poly-log-time quantum algorithms? f  an oracle separation between BQP from PH

F Let 𝐷 be a distribution over −1,1 2𝑁 . We say that an algorithm 𝐴 distinguishes between 𝐷 and 𝑈 with advantage 𝛼 if 𝛼= 𝐄 𝑥~𝐷 𝐴 𝑥 − 𝐄 𝑥∼𝑈 𝐴 𝑥 . Main Result: We present a distribution 𝐷 such that: ∃a log(N) time quantum algorithm distinguishing between 𝐷 and 𝑈 with advantage Ω 1/ log 𝑁 Any quasipoly(N)-size constant-depth circuit distinguishes between 𝐷 and 𝑈 with advantage 𝑂 1 𝑁 Standard techniques  amplify advantage of quantum algorithm to be 0.99 or even 1-1/poly(N).

The Separating Distribution D (Based on Aaronson’s Forrelation distribution) Let 𝑁 be a power of 2. Let 𝜖=1/𝑂 log 𝑁 . Let 𝐺 to be a multi-variate gaussian (MVG) distribution on ℝ 2𝑁 with zero-means and covariance matrix 𝜖⋅ 𝐼 𝑁 𝐻 𝐻 𝐼 𝑁 where 𝐻 is the 𝑁×𝑁 Hadamard matrix with 𝐻 𝑖,𝑗 = 1 𝑁 1 𝑁 ⋅ −1 <𝑖,𝑗> Sample 𝑧~𝐺, truncate each 𝑧 𝑖 to be within [−1,1] Sampling 𝑧′~𝐷: Alternative view of G: Sample x1,…, xN ~ N(0,eps) independently Take y_1,…, yN to be H*x^t. Output (x1, …, xN, y1,…, yN) Based on Aaronson Forrelation distribution. Aaronson took w = sgn(z).

Quantum Algorithm Distinguishing D [Aaronson’10, Aaronson-Ambainis’15]: 1-query O(log N)-time quantum algorithm 𝑄 s.t. Pr 𝑄 accepts input 𝑥,𝑦 = 1+Φ 𝑥,𝑦 2 where Φ 𝑥,𝑦 = 1 𝑁 3/2 ⋅ 𝑖=1 𝑁 𝑗=1 𝑁 −1 <𝑖,𝑗> ⋅ 𝑥 𝑖 ⋅ 𝑦 𝑗 Intuition to the difference between Q and PH: The distribution D has very small pairwise correlations -- at most 1/sqrt{N}. Q can somehow accumulate all these N^2 tiny pairwise correlations in the input. On the other hand, we will show that AC0 cannot. 𝐄 𝑥,𝑦 ~𝑈 Φ 𝑥,𝑦 =0 𝐄 𝑥,𝑦 ~𝐷 Φ 𝑥,𝑦 ≈𝜖=Ω 1 log 𝑁

D is Pseudorandom for AC0 We are left to prove: 𝐷 is pseudorandom for AC0. Main Ingredients & Techniques: Fourier Analysis AC0 circuits are well-approximated by sparse low-degree polynomials. Fractional PRG approach of [CHHL18]. Sum of independent Gaussians is a Gaussian.

Bounded Depth Circuits A C 0 [𝑠,𝑑]: 𝑠 gates (size of the circuit) depth 𝑑 alternating gates We focus on A C 0 𝑁 polylog 𝑁 ,𝑂 1

What do we know about AC0? [Ajtai’83, Furst-Saxe-Sipser’84, Yao’85,Håstad ’86]: Parity not in A C 0 𝑁 polylog 𝑁 ,𝑂 1 . Parity requires exp 𝑁 1/(𝑑−1) size for depth 𝑑.  ∃ oracle 𝐴: 𝐏𝐒𝐏𝐀𝐂 𝐄 𝑨 ⊈ 𝐏𝐇 𝑨 Fourier-analytical proof technique: AC0 circuits can be well-approximated (in ℓ 2 ) by low-degree polynomials (over ℝ). [Håstad ’86,LMN’93] Parity cannot. Potential problem with the approach: 𝑂 log 𝑁 time quantum algorithms (BQLogTime) are also well-approximated by low-degree polys.

The Difference between BQLogTime and AC0 Both BQLogtime & AC0 are approximated by low-degree polynomials, but these polynomials are different! BQLogtime can have dense low-degree polynomials, e.g. Φ 𝑥,𝑦 = 1 𝑁 3/2 ⋅ 𝑖=1 𝑁 𝑗=1 𝑁 −1 <𝑖,𝑗> ⋅ 𝑥 𝑖 ⋅ 𝑦 𝑗 [T’14]: AC0 has sparse low-degree approximations ∀𝑘: 𝑆⊆ 𝑛 , 𝑆 =𝑘 𝑓 𝑆 ≤ polylog 𝑁 𝑘

Fourier Analytical Approach – First Attempt The Fourier expansion of 𝑓: −1,1 2𝑁 →{−1,1}: 𝑓 𝑥 = 𝑆⊆ 2𝑁 𝑓 𝑆 ⋅ 𝑖∈𝑆 𝑥 𝑖 Goal: 𝐄 𝑧 ′ ∼𝐷 𝑓 𝑧′ − 𝐄 𝑥∼𝑈 𝑓 𝑥 = 𝑂 1 𝑁 Recall: Sampling 𝑧′~𝐷: Sample 𝑧~𝐺, truncate each 𝑧 𝑖 to be within [−1,1] For 𝑖=1, …, 2𝑁, sample independently 𝑧 𝑖 ′ ∈ −1,1 with 𝐄 𝑧 𝑖 ′ = 𝑧 𝑖 Using multilinearity of 𝑓 and that whp trunc 𝑧 =𝑧: 𝐄 𝑧 ′ ∼𝐷 𝑓 𝑧′ = 𝐄 𝑧∼𝐺 𝑓 trunc(𝑧) ≈ 𝐄 𝑧∼𝐺 𝑓 𝑧  Suffices to show 𝐄 𝑧∼𝐺 𝑓 𝑧 − 𝐄 𝑥∼𝑈 𝑓 𝑥 = 𝑂 1 𝑁

Fourier Analytical Approach – First Attempt 𝐄 𝑧∼𝐺 𝑓 𝑧 − 𝐄 𝑥∼𝑈 𝑓 𝑥 = 𝑆⊆ 2𝑁 𝑓 𝑆 ⋅ 𝐄 𝑧∼𝐺 𝑖∈𝑆 𝑧 𝑖 − 𝐄 𝑥∼𝑈 𝑖∈𝑆 𝑥 𝑖 = 𝑆⊆ 2𝑁 , 𝑆 ≥1 𝑓 𝑆 ⋅ 𝐄 𝑧∼𝐺 𝑖∈𝑆 𝑧 𝑖 = ℓ=1 𝑁 𝑆 =2ℓ 𝑓 𝑆 ⋅ 𝐄 𝑧∼𝐺 𝑖∈𝑆 𝑧 𝑖 ≤ ℓ=1 𝑁 𝑆 =2ℓ 𝑓 𝑆 ⋅ 𝜖 ℓ ⋅ ℓ! 𝑁 ℓ ≤ ℓ=1 𝑁 polylog 𝑁 2ℓ ⋅ 𝜖 ℓ ⋅ ℓ! 𝑁 ℓ Contribution of first O ( 𝑁 ) terms: 𝜖⋅polylog(𝑁)/ 𝑁 Contribution of larger terms? One way to solve it is to take eps (think of it as noise) to be smaller than 1/polylog(N) but this would hurt the advantage of the quantum algorithm  this gives a separation between quantum algorithms with more resources then PH which is not too interesting (even parity on polylog bits can give such a separation) Our main insight (presented in the next slides) is that this is not needed and indeed the high degree terms do not matter too much

Main Technical Lemma Suppose 𝑍~𝐺 is a zero-mean MVG on ℝ 2𝑁 with ∀𝑖: 𝐯𝐚𝐫 𝑍 𝑖 ≤1/𝑂 log 𝑁 𝛿 ∀𝑖,𝑗: 𝐜𝐨𝐯 𝑍 𝑖 , 𝑍 𝑗 ≤𝛿 Then, for any quasi-poly size constant depth AC0 circuit 𝑓, 𝐄 𝑧∼𝐺 𝑓 𝑧 − 𝐄 𝑥∼𝑈 𝑓 𝑥 ≤𝛿⋅polylog 𝑁  whp 𝑍 ∈ −1,1 2𝑁 Which properties of AC0 circuits are used in the proof? The bound 𝑆 =2 𝑓 𝑆 ≤polylog(𝑁) Closure under restrictions. Credit to Boaz Barak and Jarosław Błasiok blog post for improved notation/presentation that we adapt: https://windowsontheory.org/2018/06/17/on-the-raz-tal-oracle-separation-of-bqp-and-ph/ 𝐺 fools any class of functions with these two properties

Viewing 𝑍~𝐺 as a result of a random walk A Thought Experiment: Instead of sampling 𝑍~𝐺 at once, we sample 𝑡 vectors 𝑍 (1) ,…, 𝑍 𝑡 ~𝐺 independently, and take 𝑍= 1 √𝑡 ⋅ 𝑍 (1) +… + 𝑍 (𝑡) Based on the work of [Chattopadhyay, Hatami, Hosseini, Lovett’18] Picture from http://en.wikipedia.org/wiki/Random_walk

Viewing 𝑍~𝐺 as a result of a random walk Sample 𝑡 vectors 𝑍 (1) ,…, 𝑍 𝑡 ~𝐺 Define 𝒕+𝟏 hybrids: 𝐻 0 = 0 For 𝑖=1,…,𝑡 𝐻 𝑖 = 1 √𝑡 ⋅ 𝑍 (1) +… + 𝑍 (𝑖) Observe: 𝐻 𝑡 ~ 𝐺. Taking 𝑡→∞ yields a Brownian motion. We take 𝑡 =poly 𝑁 . Claim: for 𝑖=0, …, 𝑡−1, 𝐄 𝑓 𝐻 𝑖+1 −𝐄 𝑓 𝐻 𝑖 ≤ 𝛿 𝑡 ⋅polylog(𝑁). 𝐻 𝑡 𝐻 𝑖+1 𝐻 𝑖 𝐻 1 𝐻 0

Claim - Base Case Base Case: 𝐄 𝑓 𝐻 1 −𝐄 𝑓 𝐻 0 =𝐄 𝑓 1 𝑡 ⋅ 𝑍 1 −𝑓 0 𝐄 𝑓 𝐻 1 −𝐄 𝑓 𝐻 0 =𝐄 𝑓 1 𝑡 ⋅ 𝑍 1 −𝑓 0 = ℓ=1 𝑁 𝑆 =2ℓ 𝑓 𝑆 ⋅ 𝐄 𝑧∼𝐺 1 √𝑡 2ℓ ⋅ 𝑖∈𝑆 𝑧 𝑖 ≤ ℓ=1 𝑁 𝑆 =2ℓ 𝑓 𝑆 ⋅ 𝛿 ℓ ⋅𝑂 ℓ ℓ 𝑡 ℓ ≤ 𝛿 𝑡 ⋅polylog 𝑁 + 𝑜 𝛿 𝑡

Reducing the General Case to the Base Case Lemma [CHHL’18]: for all 𝑧 0 ∈ −1/2,1/2 2𝑁 𝑔 𝑧 =𝑓 𝑧+ 𝑧 0 −𝑓 𝑧 0 can be written as 𝐄 𝜌 𝑓 𝜌 2⋅𝑧 − 𝑓 𝜌 0 where 𝑓 𝜌 is a random restriction of 𝑓 (whose marginals depend on 𝑧 0 ). Conditioned on 𝐻 𝑖 ∈ −1/2,1/2 2𝑁 (happens whp): 𝐄 𝑓 𝐻 𝑖+1 −𝐄 𝑓 𝐻 𝑖 ≤ 𝐄 𝑓 𝐻 𝑖 + 1 √𝑡 𝑍 (𝑖+1) − 𝑓 𝐻 𝑖 ≤ 𝐄 𝑓 𝜌 2 √𝑡 ⋅ 𝑍 (𝑖+1) − 𝑓 𝜌 0 ≤ 4𝛿 𝑡 ⋅polylog(𝑁)

Recap: Proof by Picture [CHHL’18]: i-th step ≈ first step, using closure under restrictions. i-th step First Step: Simple Fourier Analysis Only second level matters. first step

Recap Defined a distribution 𝐷 based on MVG 𝐺. 𝐷 is not pseudorandom for log(N)-time quantum algorithms. [Aaronson’09, Aaronson-Ambainis’15] 𝐷 is pseudorandom for AC0 (our contribution) 𝐄 𝑧∼𝐺 𝑓 𝑧 − 𝐄 𝑥∼𝑈 𝑓 𝑥 ≤𝛿⋅polylog 𝑁 : Thought experiment: Viewing 𝑍~𝐺 as a result of a random walk with 𝑡 tiny steps. AC0 circuits are well-approximated by sparse low-degree polynomials [T’14]  first step has advantage 𝛿 𝑡 ⋅polylog 𝑁 [Chattopadhyay, Hatami, Hosseini, Lovett ’18]:  𝑖-th step has advantage 𝛿 𝑡 ⋅polylog 𝑁

Open Problems & New results Follow-ups: [Aaronson, Fortnow]: an oracle 𝐴 s.t. 𝐁𝐐 𝐏 𝑨 ⊈ 𝐏 𝑨 =𝐍 𝐏 𝑨 [Fortnow]: under our oracle PH is infinite. Open Problems: Does the original suggestion of [Aaronson’09] (without 1/log 𝑁 noise) work? [Aaronson]: Find an oracle 𝐴 s.t. 𝐍 𝐏 𝑨 ⊆𝐁𝐐 𝐏 𝑨 𝐏𝐇 𝑨 ⊈𝐁𝐐 𝐏 𝑨 [Fortnow]: Does 𝐍 𝐏 𝐁𝐐𝐏 ⊈𝐁𝐐 𝐏 𝐍𝐏 ?

Open Problems 2: Pseudorandomness Separate BQLogTime and AC0 ⊕ . Suffices to show for all 𝑓 in AC0 ⊕ : 𝑆 =2 𝑓 𝑆 ≤ 𝑁 polylog(𝑁) Conjecture [CHLT’18]: for all 𝑓 in AC0 ⊕ 𝑆 =2 𝑓 𝑆 ≤polylog(𝑁) Claim [CHLT’18]: Conjecture implies a PRG for AC0 ⊕ with polylog(𝑁) seed length.

Thank You!  © Kevin Hong for Quanta Magazine