Igor Carboni Oliveira University of Oxford

Slides:



Advertisements
Similar presentations
Unconditional Weak derandomization of weak algorithms Explicit versions of Yao s lemma Ronen Shaltiel, University of Haifa :
Advertisements

Complexity Theory Lecture 6
Are lower bounds hard to prove? Michal Koucký Institute of Mathematics, Prague.
Approximate List- Decoding and Hardness Amplification Valentine Kabanets (SFU) joint work with Russell Impagliazzo and Ragesh Jaiswal (UCSD)
Talk for Topics course. Pseudo-Random Generators pseudo-random bits PRG seed Use a short “ seed ” of very few truly random bits to generate a long string.
Massive Online Teaching to Bounded Learners Brendan Juba (Harvard) Ryan Williams (Stanford)
Lecture 16: Relativization Umans Complexity Theory Lecturess.
Learning Juntas Elchanan Mossel UC Berkeley Ryan O’Donnell MIT Rocco Servedio Harvard.
Linear Systems With Composite Moduli Arkadev Chattopadhyay (University of Toronto) Joint with: Avi Wigderson TexPoint fonts used in EMF. Read the TexPoint.
Circuit Complexity and Derandomization Tokyo Institute of Technology Akinori Kawachi.
Time vs Randomness a GITCS presentation February 13, 2012.
Non-Uniform ACC Circuit Lower Bounds Ryan Williams IBM Almaden TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A A.
Some Thoughts regarding Unconditional Derandomization Oded Goldreich Weizmann Institute of Science RANDOM 2010.
Derandomization: New Results and Applications Emanuele Viola Harvard University March 2006.
Eric Allender Rutgers University Chipping Away at P vs NP: How Far Are We from Proving Circuit Size Lower Bounds? Joint work with Michal Koucky ʹ Czech.
Arithmetic Hardness vs. Randomness Valentine Kabanets SFU.
CS151 Complexity Theory Lecture 7 April 20, 2015.
–Def: A language L is in BPP c,s ( 0  s(n)  c(n)  1,  n  N) if there exists a probabilistic poly-time TM M s.t. : 1.  w  L, Pr[M accepts w]  c(|w|),
Eric Allender Rutgers University Circuit Complexity, Kolmogorov Complexity, and Prospects for Lower Bounds DCFS 2008.
Eric Allender Rutgers University Cracks in the Defenses: Scouting Out Approaches on Circuit Lower Bounds CSR 2008.
In a World of BPP=P Oded Goldreich Weizmann Institute of Science.
CSCI 4325 / 6339 Theory of Computation Zhixiang Chen.
Eric Allender Rutgers University Dual VP Classes Joint work with Anna Gál (U. Texas) and Ian Mertz (Rutgers) MFCS, Milan, August 27, 2015.
On Constructing Parallel Pseudorandom Generators from One-Way Functions Emanuele Viola Harvard University June 2005.
Pseudorandom Generators and Typically-Correct Derandomization Jeff Kinne, Dieter van Melkebeek University of Wisconsin-Madison Ronen Shaltiel University.
Umans Complexity Theory Lectures Lecture 1a: Problems and Languages.
Eric Allender Rutgers University Graph Automorphism & Circuit Size Joint work with Joshua A. Grochow and Cristopher Moore (SFI) Simons Workshop, September.
My Favorite Ten Complexity Theorems of the Past Decade II Lance Fortnow University of Chicago.
Umans Complexity Theory Lectures Lecture 17: Natural Proofs.
CS151 Complexity Theory Lecture 16 May 20, The outer verifier Theorem: NP  PCP[log n, polylog n] Proof (first steps): –define: Polynomial Constraint.
Pseudorandom Bits for Constant-Depth Circuits with Few Arbitrary Symmetric Gates Emanuele Viola Harvard University June 2005.
Pseudo-random generators Talk for Amnon ’ s seminar.
Comparing Notions of Full Derandomization Lance Fortnow NEC Research Institute With thanks to Dieter van Melkebeek.
Eric Allender Rutgers University Curiouser and Curiouser: The Link between Incompressibility and Complexity CiE Special Session, June 19, 2012.
Technion Haifa Research Labs Israel Institute of Technology Underapproximation for Model-Checking Based on Random Cryptographic Constructions Arie Matsliah.
Pseudorandomness: New Results and Applications Emanuele Viola IAS April 2007.
Umans Complexity Theory Lectures Lecture 9b: Pseudo-Random Generators (PRGs) for BPP: - Hardness vs. randomness - Nisan-Wigderson (NW) Pseudo- Random Generator.
Complexity Theory and Explicit Constructions of Ramsey Graphs Rahul Santhanam University of Edinburgh.
Random projections and depth hierarchy theorems
P & NP.
From Classical Proof Theory to P vs. NP
Information Complexity Lower Bounds
Derandomization & Cryptography
Hans Bodlaender, Marek Cygan and Stefan Kratsch
Negation-Limited Formulas
Algorithms vs. Circuit Lower Bounds
Circuit Lower Bounds A combinatorial approach to P vs NP
Pseudodeterministic Constructions in Subexponential Time
Constructing hard functions from learning algorithms
Pseudorandomness when the odds are against you
Hardness Magnification
Tight Fourier Tails for AC0 Circuits
Analysis and design of algorithm
Finding Large Set Covers Faster via the Representation Method
Local Error-Detection and Error-correction
An average-case lower bound against ACC0
Nikhil Bansal, Shashwat Garg, Jesper Nederlof, Nikhil Vyas
Umans Complexity Theory Lectures
Pseudo-derandomizing learning and approximation
How to Delegate Computations: The Power of No-Signaling Proofs
Exponential Time Paradigms Through the Polynomial Time Lens
Chapter 11 Limitations of Algorithm Power
Indistinguishability by adaptive procedures with advice, and lower bounds on hardness amplification proofs Aryeh Grinberg, U. Haifa Ronen.
Emanuele Viola Harvard University June 2005
Impossibility of SNARGs
CS151 Complexity Theory Lecture 5 April 16, 2019.
On Derandomizing Algorithms that Err Extremely Rarely
Recent Structure Lemmas for Depth-Two Threshold Circuits
Stronger Connections Between Circuit Analysis and Circuit Lower Bounds, via PCPs of Proximity Lijie Chen Ryan Williams.
Emanuele Viola Harvard University October 2005
Presentation transcript:

Igor Carboni Oliveira University of Oxford Conspiracies between Learning Algorithms, Lower Bounds, and Pseudorandomness Igor Carboni Oliveira University of Oxford Joint work with Rahul Santhanam (Oxford)

Context Minor algorithmic improvements imply lower bounds (Williams, 2010). NEXP not contained in ACC0 (Williams, 2011), and extensions.

This Work Analogue of Williams’ celebrated lower bound program in Learning Theory. Combining and extending existing connections. Further applications of the “Pseudorandom Method”: Hardness of MCSP, Karp-Lipton Theorems for BPEXP. etc.

Lower bounds from learning

Learning Model (Randomized, MQs, Uniform Dist.) A Boolean circuit class C is fixed. from C[s(n)] is selected. Randomized Algorithm Learner must output w.h.p a hypothesis such that:

Some learning algorithms Combinatorial lower bounds Lower bounds are unknown, or obtained via diagonalization [Jac97] DNFs can be learned in polynomial time. Harmonic-Sieve/Boosting [LMN93] AC0 circuits learnable in quasi-polynomial time. Fourier Concentration [CIKK16] AC0[p] learnable in quasi-polynomial time. Pseudorandomness/Natural Property

Can we learn AC0 circuits with Mod 6 gates in sub-exponential time? As far as I know, open even for: AND o OR o MAJ circuits, MOD2 o AND o THR circuits. Definition. Non-trivial learning algorithm: ▶ Runs in randomized time ▶ For every function f in C:

Non-trivial learning implies lower bounds Let BPE = BPTIME[2O(n)]. Theorem. Let C be any subclass of Boolean circuits closed under restrictions. Example: C = (depth-6)-ACC0, AND o OR o THR, etc. If for each k>1, C[nk] admits a non-trivial learning algorithm, then for each k > 1, BPE is not contained in C[nk].

LBs from Proofs, Derandomization, Learning Non-trivial SAT/Proof System Derandomization Deterministic Exact Learning Randomized Learning Assumption Proofs checked in deterministic time Algorithm runs in Learner runs in randomized time Consequence LBs for NEXP LBs for EXP LBs for BPEXP Reference [Wil10] [Wil10], [SW13] [KKO13] [This Work]

Remarks on lower bounds from Learning ▶ Learning approach won’t directly work for classes containing PRFs. ▶ Conceivable that one can design non-trivial learning algorithms for a class C under the assumption that BPEXP is contained in P/poly. ▶ Learning connection applies to virtually any circuit class of interest, and there is no depth blow-up. It can lead to new lower bounds for restricted classes such as THR o THR and ACC0.

Previous work on learning vs. lower bounds ▶ Systematic investigation initiated about 10 years ago: [FK06] Lower bounds for BPEXP from polynomial time learnability. [HH11] Lower bounds for EXP from deterministic exact learning. [KKO13] Optimal lower bounds for EXP from deterministic exact learning. [Vol14] Lower bounds for BPP/1 from polynomial time learnability. [Vol’15] Further results for learning arithmetic circuits.

A Challenge in Getting Lower Bounds from Randomized Learning Williams’ lower bounds from non-trivial SAT algorithms: a non-trivial algorithm can be used to violate a tight hierarchy theorem for NTIME. Challenge in Randomized Learning: lack of strong hierarchy theorems for BPTIME. The approach has to be indirect, and we must do something different ...

Speedup Phenomenon in Learning Theory Speedup Lemma. Let be any class of Boolean circuits containing AC0[2]. Suppose that for each the class admits a non-trivial learning algorithm. Then for each and , the class is strongly learnable in time

SAT Algorithms vs. Learning Algorithms

Main Techniques: “Speedup Lemma” Non-trivial Learner + NW-Generator + Hardness Amplification Faster Learner! 1. Given oracle access to in C[poly], implicitly construct a “pseudorandom” ensemble of functions in C[poly] on nδ bits. (using NW-generator + Hardness Amplification [CIKK16]) Intuition: Non-trivial learner can distinguish this ensemble from random functions. This can be done in time 2. This distinguisher (i.e. the non-trivial learner) and the reconstruction procedures of NW-generator and Hardness Amplification can be used to strongly learn in time

Main Techniques: “LBs from Learning” 1. Starting from non-trivial learner, apply the Speedup Lemma to obtain a sub-exponential time learner. Speedup 2. Adapting the techniques from [KKO13], randomized sub-exponential time learnability of C[poly] implies BPE lower bounds against C[nk]. 3. Using an additional win-win argument, this holds under minimal assumptions on C, and with no blow-up in the reduction.

Combining and extending existing connections

▶ Further motivation for the following question: Nontrivial SAT / ACC0 lower bounds [Wil11] NEXP EXP ZPEXP REXP MAEXP P/poly lower bounds [BFT98] Well-known connection to PRGs/ derandomization of BPP [IW97] BPEXP Non-trivial learning [This work] [OS17] Connections to pseudo-deterministic algorithms. ▶ Further motivation for the following question: Which algorithmic upper bounds imply lower bounds for ZPEXP and REXP, respectively?

One-sided error: Lower bounds for REXP We combine the satisfiability and learning connections to lower bounds to show: [ Informal ] If a circuit class C admits both non-trivial SAT and non-trivial Learning then REXP is not contained in C. Corollary. [ACC0 lower bounds from non-trivial learning] If for every depth d>1 and modulo m>1 there is such that has non-trivial learning algorithms, then Indicates that combining the two frameworks might have further benefits.

Zero-error: Lower bounds for ZPEXP [IKW02], [Wil13] Connections between natural properties without density condition, Satisfiability Algorithms, and NEXP lower bounds. [CIKK16] Connections between BPP-natural properties and Learning Algorithms. We give a new connection between P-natural properties and ZPEXP lower bounds. Let be a circuit class closed under restrictions. Theorem. [ZPEXP lower bounds from natural properties] If for some there are P-natural properties against then

Further Applications of our Techniques

A rich web of techniques and connections Learning speedup Karp-Lipton collapses “Pseudorandom Method” Hardness of MCSP LBs from Learning Use of (conditional) PRGs and related tools, often in contexts where (pseudo)randomness is not intrinsic.

Karp-Lipton Collapses Connection between uniform class and non-uniform circuit class: [KL80] If then Assumption Consequence Major Application EXP in P/poly EXP = MA [BFT98] MAEXP not in P/poly [BFT98] NEXP in P/poly NEXP = EXP [IKW02] SAT / LB Connection [Wil10] Randomized Exponential Classes such as BPEXP ?

Karp-Lipton for randomized classes Theorem 1. If then The advice is needed for technical reasons. But it can be eliminated in some cases: Theorem 2. If then ▶ Check paper for Karp-Lipton collapses for ZPEXP, and related results.

! Hardness of MCSP Open. Prove that MCSP is not in AC0[2] Minimum Circuit Size Problem: Given 1s and a Boolean function represented as an N-bit string, Is it computed by a circuit of size at most s? Recent work on MCSP and its variants: [KC00], [ABK+06], [AHM+08], [KS08], [AD14], [HP15], [AHK15], [MW15], [HP15], [AGM15], [HW16]. ! Open. Prove that MCSP is not in AC0[2] [ABK+06] MCSP is not in AC0.

Theorem. If MCSP is in TC0 then NC1 collapses to TC0. Our result We prove the first hardness result for MCSP for a standard complexity class beyond AC0: Theorem. If MCSP is in TC0 then NC1 collapses to TC0. The argument describes a non-uniform TC0 reduction from NC1 to MCSP via pseudorandomness.

Additional applications of our techniques ▶ Equivalences between truth-table compression [CKK+14] and randomized learning models in the sub-exponential time regime. (For instance, equivalence queries can be eliminated in sub-exp time randomized learning of expressive concept classes.) ▶ A dichotomy between Learnability and Pseudorandomness in the non-uniform exponential-security setting: “A circuit class is either learnable or contains pseudorandom functions, but not both.” In other words, learnability is the only obstruction to pseudorandomness. (Morally, ACC0 is either learnable in sub-exp time or contains exp-secure PRFs.)

Problems and Directions Is there a speedup phenomenon for complex classes (say AC0[p] and above) for learning under the uniform distribution with random examples? Can we establish new lower bounds for modest circuit classes by designing non-trivial learning algorithms?

Towards lower bounds against NC? Non-trivial learning implies lower bounds: First example of lower bound connection from non-trivial randomized algorithms. Problem. Establish a connection between non-trivial randomized SAT algorithms and lower bounds. (First step in a program to obtain unconditional lower bounds against NC.)

Thank you