The Range Mode ProblemCell Probe Lower Bounds for the Range Mode ProblemThe Range k-frequency Problem Preprocess an array A of n elements into a space.

Slides:



Advertisements
Similar presentations
Tight Bounds for Distributed Functional Monitoring David Woodruff IBM Almaden Qin Zhang Aarhus University MADALGO Based on a paper in STOC, 2012.
Advertisements

Optimal Space Lower Bounds for All Frequency Moments David Woodruff MIT
The Average Case Complexity of Counting Distinct Elements David Woodruff IBM Almaden.
Optimal Bounds for Johnson- Lindenstrauss Transforms and Streaming Problems with Sub- Constant Error T.S. Jayram David Woodruff IBM Almaden.
Lower Bounds for Additive Spanners, Emulators, and More David P. Woodruff MIT and Tsinghua University To appear in FOCS, 2006.
Xiaoming Sun Tsinghua University David Woodruff MIT
Truthful Mechanisms for Combinatorial Auctions with Subadditive Bidders Speaker: Shahar Dobzinski Based on joint works with Noam Nisan & Michael Schapira.
Shortest Vector In A Lattice is NP-Hard to approximate
Circuit and Communication Complexity. Karchmer – Wigderson Games Given The communication game G f : Alice getss.t. f(x)=1 Bob getss.t. f(y)=0 Goal: Find.
Lecture 19. Reduction: More Undecidable problems
STATISTICS.
INTERVAL TREE & SEGMENTATION TREE
Fast Algorithms For Hierarchical Range Histogram Constructions
I/O-Algorithms Lars Arge Fall 2014 September 25, 2014.
Why Simple Hash Functions Work : Exploiting the Entropy in a Data Stream Michael Mitzenmacher Salil Vadhan.
Lower bound: Decision tree and adversary argument
1 Cell Probe Complexity - An Invitation Peter Bro Miltersen University of Aarhus.
Bichromatic Separating Circles. Problem Given two sets of points, R and B ∈ R ² where | R | = n and | B | = m: – Find the smallest circle containing all.
Zoo-Keeper’s Problem An O(nlogn) algorithm for the zoo-keeper’s problem Sergei Bespamyatnikh Computational Geometry 24 (2003), pp th CGC Workshop.
Constant-Time LCA Retrieval
I/O-Algorithms Lars Arge Aarhus University February 27, 2007.
1 Introduction to Computability Theory Lecture12: Reductions Prof. Amos Israeli.
Limitations of VCG-Based Mechanisms Shahar Dobzinski Joint work with Noam Nisan.
Computing Diameter in the Streaming and Sliding-Window Models J. Feigenbaum, S. Kannan, J. Zhang.
Approximate Range Searching in the Absolute Error Model Guilherme D. da Fonseca CAPES BEX Advisor: David M. Mount.
CSC 2300 Data Structures & Algorithms March 27, 2007 Chapter 7. Sorting.
Why Simple Hash Functions Work : Exploiting the Entropy in a Data Stream Michael Mitzenmacher Salil Vadhan.
I/O-Algorithms Lars Arge University of Aarhus March 1, 2005.
I/O-Algorithms Lars Arge Spring 2009 March 3, 2009.
I/O-Algorithms Lars Arge Aarhus University March 5, 2008.
1 On approximating the number of relevant variables in a function Dana Ron & Gilad Tsur Tel-Aviv University.
On Testing Convexity and Submodularity Michal Parnas Dana Ron Ronitt Rubinfeld.
Time Series Data Analysis - II
Orthogonal Range Searching I Range Trees. Range Searching S = set of geometric objects Q = query object Report/Count objects in S that intersect Q Query.
Presentation by: H. Sarper
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Problems and MotivationsOur ResultsTechnical Contributions Membership: Maintain a set S in the universe U with |S| ≤ n. Given an x in U, answer whether.
Equality Function Computation (How to make simple things complicated) Nitin Vaidya University of Illinois at Urbana-Champaign Joint work with Guanfeng.
Evaluation of software engineering. Software engineering research : Research in SE aims to achieve two main goals: 1) To increase the knowledge about.
MCS 312: NP Completeness and Approximation algorithms Instructor Neelima Gupta
Mehdi Mohammadi March Western Michigan University Department of Computer Science CS Advanced Data Structure.
Section 2.4. Section Summary Sequences. Examples: Geometric Progression, Arithmetic Progression Recurrence Relations Example: Fibonacci Sequence Summations.
Dynamic Covering for Recommendation Systems Ioannis Antonellis Anish Das Sarma Shaddin Dughmi.
MotivationFundamental ProblemsProblems on Graphs Parallel processors are becoming common place. Each core of a multi-core processor consists of a CPU and.
Open Problem: Dynamic Planar Nearest Neighbors CSCE 620 Problem 63 from the Open Problems Project
Bin Yao (Slides made available by Feifei Li) R-tree: Indexing Structure for Data in Multi- dimensional Space.
PHYS 773: Quantum Mechanics February 6th, 2012
06/12/2015Applied Algorithmics - week41 Non-periodicity and witnesses  Periodicity - continued If string w=w[0..n-1] has periodicity p if w[i]=w[i+p],
Data Stream Algorithms Lower Bounds Graham Cormode
Discrete Random Variables. Introduction In previous lectures we established a foundation of the probability theory; we applied the probability theory.
Inequalities for Stochastic Linear Programming Problems By Albert Madansky Presented by Kevin Byrnes.
Lower bounds on data stream computations Seminar in Communication Complexity By Michael Umansky Instructor: Ronitt Rubinfeld.
Chapter 13 Backtracking Introduction The 3-coloring problem
MotivationLocating the k largest subsequences: Main ideasResults Problem definitions Problem instance ( k=5 ) Bibliography
ProblemResultsConcurrent Range Reporting Input: n points in d dimensional space. Support reporting the t points in a query box. The best previous result:
ProblemData StructuresLower Bound Preprocess a set of N 3-dimensional points into an I/O-efficient data structure, such that all points inside an axis.
Internal Memory Pointer MachineRandom Access MachineStatic Setting Data resides in records (nodes) that can be accessed via pointers (links). The priority.
Common Intersection of Half-Planes in R 2 2 PROBLEM (Common Intersection of half- planes in R 2 ) Given n half-planes H 1, H 2,..., H n in R 2 compute.
The Message Passing Communication Model David Woodruff IBM Almaden.
The Implicit ModelA Moveable DictionaryA Working Set Dictionary The implicit model is the RAM model but with some restrictions: you are only allowed to.
Theory of Computational Complexity M1 Takao Inoshita Iwama & Ito Lab Graduate School of Informatics, Kyoto University.
Lower bounds for Unconditionally Secure MPC Ivan Damgård Jesper Buus Nielsen Antigoni Polychroniadou Aarhus University.
Modeling with Recurrence Relations
Ariel Rosenfeld Bar-Ilan Uni.
HIERARCHY THEOREMS Hu Rui Prof. Takahashi laboratory
Orthogonal Range Searching and Kd-Trees
Nikhil Bansal, Shashwat Garg, Jesper Nederlof, Nikhil Vyas
Classical Algorithms from Quantum and Arthur-Merlin Communication Protocols Lijie Chen MIT Ruosong Wang CMU.
Kinetic Collision Detection for Convex Fat Objects
Complexity Theory: Foundations
Presentation transcript:

The Range Mode ProblemCell Probe Lower Bounds for the Range Mode ProblemThe Range k-frequency Problem Preprocess an array A of n elements into a space efficient data structure supporting the following queries. Mode(i, j) Find a most occurring element in A[i..j]. Approximate mode(i,j) Find an element occurring at least 1/(1+ε) times as often as a mode in A[i..j]. Using communication complexity techniques, we prove a query lower bound for the range mode problem of Ω(log n / log (Sw/n)) time for any data structure using S cells of space. This means that any data structure using O(n log O(1) n) space will require a query time of Ω(log n / log log n), and that for constant time queries we require n 1+Ω(1) words of space. The proof uses techniques introduced by Miltersen, and later refinements [5]. Here Alice simulates the query algorithm and Bob holds the data structure, whenever Alice queries a cell, Bob responds with the contents. Preprocess an array A of n elements into a space efficient data structure supporting the following queries. Range k-frequency(i, j) Find an element in A[i..j] occurring exactly k times. We prove that for fixed k>1, this problem is equivalent to the 2D orthogonal rectangle stabbing problem. And for k=1 it is no harder than 4-sided 3D orthogonal range emptiness. The following is an example of the reduction from 2D orthogonal rectangle stabbing to range 2-frequency. The rectangles are projected on the the x- and y- axis. We visit the end points of the line segments in increasing order, adding starting points once and ending points twice. Finally the two strings X and Y are concatenated to form the array A. Below is an example of the reduction from range 2-frequency to 2D orthogonal rectangle stabbing. Notice that “a” has frequency two when 1 ≤ i ≤ 2 and 4 ≤ j ≤ 7. This gives rise to the bottommost red rectangle. In a similar way all other ranges where a or b has frequency two give rise to the remaining rectangles. Thus if point (i,j) stabs a rectangle, then there is an element with frequency two in A[i..j]. Applications The mode is a general statistical measure. The range mode applies among other things to describe discrete events. For instance if we create a list over the winners of football matches, a query would correspond to asking between these two matches, who won the most matches. (1+ε)-approximate Range Mode We constructed a very simple data structure using a few bit tricks that answers 3- approximate range mode queries in constant time using linear space. The best previous approximation achieving these bounds was a 4-approximation by Bose, Kranakis, Morin and Tang [3]. Building upon any of these data structures, we can, given an arbitrary ε, construct a data structure that answers (1+ε)-approximate range mode queries in O(log ε -1 ) time using O(n/ε) space. This construction is similar to [3]. Selected Previous Results  Previously no non-trivial lower bound has been shown for neither the range mode nor the approximate range mode problem.  Grabowski and Petersen [2] have presented a data structure supporting range mode queries in constant time and using O(n 2 log log n / log 2 n) words of space.  Petersen [4] has presented a data structure supporting range mode queries in O(n ε ) time and using O(n 2-2ε ) words of space.  Bose, Kranakis, Morin and Tang [3] have presented a data structure supporting (1+ε)-approximate range mode problem queries in O(log log n + log ε -1 ) time using linear space. References [1] Greve, Jørgensen, Larsen, Truelsen. Cell probe lower bounds and approximations for range mode. ICALP [2] Grabowski, Petersen. Range mode and range median queries in constant time and sub-quadratic space. Inf. Process. Lett [3] Bose, Kranakis, Morin, Tang. Approximate range mode and range median queries. STACS [4] Petersen. Improved bounds for range mode and range median queries. CCTTPCS [5] Pătraşcu, Thorup. Higher lower bounds for near-neighbor and further rich problems. FOCS Cell Probe Lower Bounds and Approximations for Range Mode MADALGO – Center for Massive Data Algorithmics, a Center of the Danish National Research Foundation Jakob Truelsen Aarhus University Mark Greve Aarhus University A ij mode(2, 11) = c abaecdcdcbbaxt Pătraşcu showed a lower bound for lopsided set disjointness. We show how to solve such instances using a range mode data structure. Thus obtaining the claimed query time/space tradeoff.