Lower bounds for approximate membership dynamic data structures

Lower bounds for approximate membership dynamic data structures
Shachar Lovett IAS Ely Porat Bar-Ilan University Synergies in lower bounds, June 2011

Information theoretic lower bounds
Information theory is a powerful tool to prove lower bounds, e.g. in data structures Study size of data structure (unlimited access) Static d.s.: pure information theory Dynamic d.s.: communication game

Talk overview Approximate set membership problem
Bloom filters (simple near-optimal solution) Lower bounds – static case New dynamic lower bounds

Approximate set membership
Large universe U Represent subset S  U Query: is x  S? Data structure representing S approximately: If x  S: answer YES always If x  S: answer NO with high probability Why approximately? To save space U ~S S

Applications Storage (or communication) is costly, but a small false positive error can be tolerated Original applications (70’s): dictionaries, databases – Bloom filters Nowadays: mainly network applications

Bloom filters S={x1,x2,…,xn} Hash function h:U  {1,…,m}
Bit array of length m

h(x1)=4 1 Bit array of length m

Bloom filters S={x1,x2,…,xn} Query: y  S? Hash function h:U  {1,…,m}
Bit array of length m

Bloom filters S={x1,x2,…,xn} Query: y  S? Hash function h:U  {1,…,m}
h(y)=3 1 Bit array of length m

Bloom filters S={x1,x2,…,xn} Query: y  S? NO Hash function
h:U  {1,…,m} Query: y  S? NO h(y)=3 1 Bit array of length m

Bloom filters: analysis
hash S={x1,x2,…,xn} Query: y  S? If y  S: returns YES always If y  S: returns NO with probability Error ½: Error : (repetition) 1 Bit array of length m

Known bounds Upper bounds (e.g. algorithms) Lower bounds:
Bloom filter: Improvements: [Porat-Matthias’03, Arbitman-Naor-Segev’10] Lower bounds: information theoretic: Can be matched by static data structures [Charles-Chellapilla’08,Dietzfelbinger-Pagh’08,Porat’08] This work: dynamic d.s.

Static lower bounds Static settings: insert + query
Yao’s min-max principle: prove lower bound for deterministic data structure, randomized inputs Insert: x1,…,xn m bits Query: y

Static lower bounds Deterministic data structure: compression
Insert: x1,…,xn m bits Query: y Static lower bounds Deterministic data structure: compression maps all sets to a small family of sets Input: random set Accept set: Properties: Small memory: No false negatives: Few false positives: Optimal setting:

Static lower bounds Set S, Represented by Goal: show #A(S) large U
Insert: x1,…,xn m bits Query: y Static lower bounds U A(S) S Set S, Represented by Goal: show #A(S) large

Static lower bounds Properties: Assume that General case: convexity
Insert: x1,…,xn m bits Query: y Static lower bounds Properties: Assume that If then General case: convexity

Dynamic lower bounds Basic dynamic settings: two inserts + query
Break inputs to k, n-k chunks m bits m bits Insert: x1,…,xk Insert: xk+1,…,xn Query: y

Dynamic lower bounds Accepting sets: Properties:
Insert: x1,…,xk m bits Insert: xk+1,…,xn Query: y Dynamic lower bounds Accepting sets: Properties: General approach: analyze size of accepting sets Sets A(x1,…,xk) can’t be too small (covering) Sets A(A(x1,…,xk),xk+1,…,xn) can’t be too large (error) These yield the trivial lower bound again… 

Dynamic lower bounds Method of typical inputs On a typical input:
Insert: x1,…,xk m bits Insert: xk+1,…,xn Query: y Dynamic lower bounds Method of typical inputs On a typical input: A(x1,…,xk) not too small A(A(x1,…,xk),xk+1,…,xn) not too large Inputs uncorrelated with data structure: Yields an improved lower bound  (note: “typical” can be 1% of inputs)

Dynamic lower bounds Functional inequality:
Insert: x1,…,xk m bits Insert: xk+1,…,xn Query: y Dynamic lower bounds Functional inequality: Free parameter: k – how to break input Optimal choice: Extension: break input into more parts Doesn’t seem to help much

Summary THANK YOU! Approximate membership problem
Static algorithms match static information theoretic lower bound: This work: new dynamic information theoretic lower bound THANK YOU!

Lower bounds for approximate membership dynamic data structures

Similar presentations

Presentation on theme: "Lower bounds for approximate membership dynamic data structures"— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Lower bounds for approximate membership dynamic data structures

Similar presentations

Presentation on theme: "Lower bounds for approximate membership dynamic data structures"— Presentation transcript:

Similar presentations

About project

Feedback