Download presentation
Presentation is loading. Please wait.
Published byEllen Langeland Modified over 6 years ago
1
Persistent Bloom Filter: Membership Testing for the Entire History
Yanqing Peng1 Jinwei Guo2 Feifei Li1 Weining Qian2 Aoying Zhou2 1 University of Utah East China Normal University Problem Definition PBF-2 Has IP address ever visited my web server between 9:30am and 9:40am? Maintain a binary tree logically, merge nodes in the same level in practice. Number of BFs = Number of Levels In PBF-1: operation on element x at the i-th node of level j In PBF-2: operation on (x, i) at the j-th node Temporal Membership Testing Given a temporal set π= { π π , π π , π π , π π , β¦}, an element π, and a time range [π , π] Test if exists π‘β[π ,π] such that π, π βπ High Level Idea Binary decomposition Reduce #accesses to log(query length) Ex: Binary decomposition of [3, 8] Bit Allocation π
π :Number of elements in the i-th BF π π :Number of accesses to the i-th BF π π :Number of bits allocated for the i-th BF π π :A parameter needs to be solved π : Total number of bits (budget) PBF-1 Split time range into epochs, learn parameters for current epoch from previous epochs. A binary tree of Bloom Filters Each node maintains a BF of elements appears in time [π ,π] Insert x at time t: Insert x into all nodes the path from root to the leaf [π‘,π‘] Query x in time [π ,π] : Query x to all nodes in the decomposition of [π ,π] O(T) #BFs. T: Time upper bound Evaluation About 50x space saving than original dataset Baseline: Standard BF, pair elements with timestamps Slightly slower insertion, magnitudes better query efficiency compared to baseline False positive rate: PBF1-offline > PBF2-offline = PBF2-online > PBF1-online >>> baseline
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.