Internal Memory Pointer MachineRandom Access MachineStatic Setting Data resides in records (nodes) that can be accessed via pointers (links). The priority.

Slides:



Advertisements
Similar presentations
1 Computational Geometry Chapter Range queries How do you efficiently find points that are inside of a rectangle? –Orthogonal range query ([x 1,
Advertisements

External Memory Hashing. Model of Computation Data stored on disk(s) Minimum transfer unit: a page = b bytes or B records (or block) N records -> N/B.
Indexing Mobile Objects on the plane Revisited Computer Engineering and Informatics Department, Polytechnic School, University of Patras The authors would.
Augmenting Data Structures Advanced Algorithms & Data Structures Lecture Theme 07 – Part I Prof. Dr. Th. Ottmann Summer Semester 2006.
I/O-Algorithms Lars Arge Fall 2014 September 25, 2014.
Lars Arge 1/43 Big Terrain Data Analysis Algorithms in the Field Workshop SoCG June 19, 2012 Lars Arge.
Konstantinos Tsakalidis 1 Dynamic Data Structures: Orthogonal Range Queries and Update Efficiency Konstantinos Tsakalidis PhD Defense 23 September 2011.
Hierarchy-conscious Data Structures for String Analysis Carlo Fantozzi PhD Student (XVI ciclo) Bioinformatics Course - June 25, 2002.
I/O-Algorithms Lars Arge January 31, Lars Arge I/O-algorithms 2 Random Access Machine Model Standard theoretical model of computation: –Infinite.
I/O-Algorithms Lars Arge Aarhus University February 27, 2007.
Dynamic Planar Range Maxima Queries (presented at ICALP 2011) Gerth Stølting Brodal Aarhus University Kostas Tsakalidis University of Primorska, October.
Dynamic Planar Range Maxima Queries (presented at ICALP 2011) Gerth Stølting Brodal Aarhus University Kostas Tsakalidis LIAFA, Université Paris Diderot,
Computer Science Day 2008 Algorithms and Data Structures for Faulty Memory Gerth Stølting Brodal Department of Computer Science, University of Aarhus,
I/O-Algorithms Lars Arge Spring 2009 February 2, 2009.
Cache-Oblivious B-Trees
I/O-Algorithms Lars Arge Spring 2009 January 27, 2009.
I/O-Algorithms Lars Arge Spring 2007 January 30, 2007.
I/O-Algorithms Lars Arge Aarhus University February 7, 2005.
I/O-Algorithms Lars Arge Spring 2009 April 28, 2009.
I/O-Algorithms Lars Arge University of Aarhus March 1, 2005.
I/O-Algorithms Lars Arge Spring 2009 March 3, 2009.
I/O-Algorithms Lars Arge Aarhus University February 6, 2007.
I/O-Algorithms Lars Arge Spring 2006 February 2, 2006.
Lars Arge1, Mark de Berg2, Herman Haverkort3 and Ke Yi1
I/O-Algorithms Lars Arge Aarhus University March 5, 2008.
Fully Persistent B-Trees 23 rd Annual ACM-SIAM Symposium on Discrete Algorithms, Kyoto, Japan, January 18, 2012 Gerth Stølting Brodal Konstantinos Tsakalidis.
I/O-Efficient Structures for Orthogonal Range Max and Stabbing Max Queries Second Year Project Presentation Ke Yi Advisor: Lars Arge Committee: Pankaj.
I/O-Algorithms Lars Arge Aarhus University March 9, 2006.
I/O-Algorithms Lars Arge Aarhus University February 14, 2008.
Comparison Based Dictionaries: Fault Tolerance versus I/O Efficiency Gerth Stølting Brodal Allan Grønlund Jørgensen Thomas Mølhave University of Aarhus.
I/O-Algorithms Lars Arge University of Aarhus March 7, 2005.
Cache Oblivious Search Trees via Binary Trees of Small Height
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
Lars Arge 1/12 Lars Arge. 2/12  Pervasive use of computers and sensors  Increased ability to acquire/store/process data → Massive data collected everywhere.
I/O-Algorithms Lars Arge Spring 2008 January 31, 2008.
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
AALG, lecture 11, © Simonas Šaltenis, Range Searching in 2D Main goals of the lecture: to understand and to be able to analyze the kd-trees and.
Multidimensional Data Many applications of databases are ``geographic'' = 2­dimensional data. Others involve large numbers of dimensions. Example: data.
Memory Management Last Update: July 31, 2014 Memory Management1.
1 Geometric Intersection Determining if there are intersections between graphical objects Finding all intersecting pairs Brute Force Algorithm Plane Sweep.
I/O-Algorithms Lars Arge Fall 2014 August 28, 2014.
Problems and MotivationsOur ResultsTechnical Contributions Membership: Maintain a set S in the universe U with |S| ≤ n. Given an x in U, answer whether.
Space Efficient Data Structures for Dynamic Orthogonal Range Counting Meng He and J. Ian Munro University of Waterloo.
B-trees and kd-trees Piotr Indyk (slides partially by Lars Arge from Duke U)
1 Chapter 17 Disk Storage, Basic File Structures, and Hashing Chapter 18 Index Structures for Files.
An Optimal Cache-Oblivious Priority Queue and Its Applications in Graph Algorithms By Arge, Bender, Demaine, Holland-Minkley, Munro Presented by Adam Sheffer.
Mehdi Mohammadi March Western Michigan University Department of Computer Science CS Advanced Data Structure.
MotivationFundamental ProblemsProblems on Graphs Parallel processors are becoming common place. Each core of a multi-core processor consists of a CPU and.
Compressed Prefix Sums O’Neil Delpratt Naila Rahman Rajeev Raman.
Bin Yao (Slides made available by Feifei Li) R-tree: Indexing Structure for Data in Multi- dimensional Space.
Chapter 15 A External Methods. © 2004 Pearson Addison-Wesley. All rights reserved 15 A-2 A Look At External Storage External storage –Exists beyond the.
Equivalence Between Priority Queues and Sorting in External Memory
CMPS 3130/6130 Computational Geometry Spring 2015
Problem Definition I/O-efficient Rectangular Segment Search Gautam K. Das and Bradford G. Nickerson Faculty of Computer science, University of New Brunswick,
IntroductionReduction to Arrangements of Hyperplanes We keep the canonical triangulation (bottom, left corner triangulation) of each convex cell of the.
ProblemResultsConcurrent Range Reporting Input: n points in d dimensional space. Support reporting the t points in a query box. The best previous result:
1 Cache-Oblivious Query Processing Bingsheng He, Qiong Luo {saven, Department of Computer Science & Engineering Hong Kong University of.
ProblemData StructuresLower Bound Preprocess a set of N 3-dimensional points into an I/O-efficient data structure, such that all points inside an axis.
X1x1 x2x2 top-k y 3-sided x1x1 x2x2 External Memory Three-Sided Range Reporting and Top-k Queries with Sublogarithmic Updates Gerth Stølting Brodal Aarhus.
arxiv.org/abs/ y 3-sided x1 x2 x1 x2 top-k
Memory Management 5/11/2018 9:49 PM
Multidimensional Access Structures
B-Trees 7/5/2018 4:26 AM Presentation for use with the textbook Data Structures and Algorithms in Java, 6th edition, by M. T. Goodrich, R. Tamassia, and.
External Memory Hashing
STACS arxiv.org/abs/ y 3-sided x1 x2 x1 x2 top-k
External Memory Hashing
8th Workshop on Massive Data Algorithms, August 23, 2016
Presentation transcript:

Internal Memory Pointer MachineRandom Access MachineStatic Setting Data resides in records (nodes) that can be accessed via pointers (links). The priority search tree achieves. [McCreight, SIAM J.Comp.’85] Data is stored in an infinite array of cells, each containing a word of bits. A CPU can access any cell and perform arithmetic or bitwise operations. The fusion tree achieves or where Q and U are expected bounds [Willard, SODA’92]. [Mortensen, SODA’03] reduces U achieving,. Probabilistic Distributions and Our Results In the pointer machine is possible, when no updates are allowed [Mehlhorn, A. Tsakalidis et al., IPL’87]. In the RAM, when the x-coordinates are integers in the range [N] (rank space), is possible. When both x- and y-coordinates are integers one can achieve [Brodal et al., FOCS’00]. Q and U can be significantly reduced if we assume that the coordinates are being drawn from an unknown, continuous μ-random distribution. Two examples are the Zipfian distribution, which is commonly used in practice, and the Smooth distribution, which is a generalization of many known distributions. We attain when both x- and y-coordinates are μ-random, and when furthermore x is zipfian. All reduced complexities are expected with high probability [K.Tsakalidis et al., ICDT’10]. The latter bounds hold also when x is smooth. U becomes expected amortized [K.Tsakalidis, Brodal et al., ISAAC’09]. Accordingly, in the I/O model, we attain when both x and y are μ-random. When x is zipfian and y is smooth, then can be attained. All reduced complexities are expected with high probability. [K.Tsakalidis et al., ICDT’10] can be achieved by only assuming that x is smooth. Here, U is expected amortized. [K.Tsakalidis, Brodal et al., ISAAC’09] External Memory I/O ModelCache-oblivious Model The aforementioned models are insufficient to capture the complexity of algorithms running on modern computers. In fact, modern architectures consist of a memory hierarchy, where the bottleneck lies on the transfer of data between consecutive levels, rather than on computation itself. The Input/Output Model (I/O model) studies two consecutive levels of the memory hierarchy. Data resides in an external memory that consists of blocks of size B. An I/O operation transfers a block to the internal memory of size M, where computation is performed for free. [Aggarwal, Vitter, C.ACM’88] The external priority search tree is the most efficient data structure for this model and attains [Arge et al., PODS’99]. The cache-oblivious model differs from the I/O model only in the fact that the algorithm does not “know” the values of B and M. Thus, it suffices just to study two consecutive levels of the hierarchy, in order to deduce results that hold for all the levels. There exists a static data structure that attains. In a companion paper, the same authors showed that space is needed, in order to answer queries in the above query time. [Afshani, Zeh et al., SoCG’09] Orthogonal Planar 3-sided Range Reporting MADALGO – Center for Massive Data Algorithmics, a Center of the Danish National Research Foundation Konstantinos Tsakalidis Aarhus University CPU Main Memory CPU Main Memory Disk This is a basic problem in computational geometry and finds applications in spatial databases, computer graphics, geographic information systems, and more areas. y1y1 x1x1 x2x2 Smooth Zipfian Registers L1 Cache Main Memory Disk CPU L2 Cache Store points (x,y) in the 2-dimensional plane, such as to report all points that lie within any query region of the form, namely a three-sided rectangle with one side unbounded. The dynamic setting allows for insertions and deletions of points (updates). Efficiency (Complexity) is measured in terms of (S: used space, Q: query time, U: update time).