Efficient Algorithmic Techniques for Several Multidimensional Geometric Data Management and Analysis Problems Mugurel Ionut Andreica Politehnica University.

Slides:



Advertisements
Similar presentations
Introduction to Algorithms
Advertisements

Lecture 15. Graph Algorithms
Trees for spatial indexing
Computational Geometry
Augmenting Data Structures Advanced Algorithms & Data Structures Lecture Theme 07 – Part I Prof. Dr. Th. Ottmann Summer Semester 2006.
I/O-Algorithms Lars Arge Fall 2014 September 25, 2014.
Convex Hull obstacle start end Convex Hull Convex Hull
Visibility Graphs May Shmuel Wimer Bar-Ilan Univ., Eng. Faculty Technion, EE Faculty.
1 Closest Points A famous algorithmic problem... Given a set of points in the plane (cities in the U.S., transistors on a circuit board, computers on a.
Minimum Dissatisfaction Personnel Scheduling Mugurel Ionut Andreica Politehnica University of Bucharest Computer Science Department Romulus Andreica, Angela.
Common approach 1. Define space: assign random ID (160-bit) to each node and key 2. Define a metric topology in this space,  that is, the space of keys.
K-structure, Separating Chain, Gap Tree, and Layered DAG Presented by Dave Tahmoush.
I/O-Algorithms Lars Arge Aarhus University February 27, 2007.
Lecture 12 : Special Case of Hidden-Line-Elimination Computational Geometry Prof. Dr. Th. Ottmann 1 Special Cases of the Hidden Line Elimination Problem.
Accessing Spatial Data
Special Cases of the Hidden Line Elimination Problem Computational Geometry, WS 2007/08 Lecture 16 Prof. Dr. Thomas Ottmann Algorithmen & Datenstrukturen,
I/O-Algorithms Lars Arge University of Aarhus March 1, 2005.
I/O-Algorithms Lars Arge Spring 2009 March 3, 2009.
1 Geometric Solutions for the IP-Lookup and Packet Classification Problem (Lecture 12: The IP-LookUp & Packet Classification Problem, Part II) Advanced.
Geometric Data Structures Computational Geometry, WS 2007/08 Lecture 13 Prof. Dr. Thomas Ottmann Algorithmen & Datenstrukturen, Institut für Informatik.
I/O-Algorithms Lars Arge Aarhus University March 5, 2008.
Hidden-Line Elimination Computational Geometry, WS 2006/07 Lecture 14 Prof. Dr. Thomas Ottmann Algorithmen & Datenstrukturen, Institut für Informatik Fakultät.
Aggregating Information in Peer-to-Peer Systems for Improved Join and Leave Distributed Computing Group Keno Albrecht Ruedi Arnold Michael Gähwiler Roger.
Lecture 11 : More Geometric Data Structures Computational Geometry Prof. Dr. Th. Ottmann 1 Geometric Data Structures 1.Rectangle Intersection 2.Segment.
ICDE A Peer-to-peer Framework for Caching Range Queries Ozgur D. Sahin Abhishek Gupta Divyakant Agrawal Amr El Abbadi Department of Computer Science.
Fast Subsequence Matching in Time-Series Databases Christos Faloutsos M. Ranganathan Yannis Manolopoulos Department of Computer Science and ISR University.
Orthogonal Range Searching I Range Trees. Range Searching S = set of geometric objects Q = query object Report/Count objects in S that intersect Q Query.
Roger ZimmermannCOMPSAC 2004, September 30 Spatial Data Query Support in Peer-to-Peer Systems Roger Zimmermann, Wei-Shinn Ku, and Haojun Wang Computer.
1 Geometric Intersection Determining if there are intersections between graphical objects Finding all intersecting pairs Brute Force Algorithm Plane Sweep.
Improving Min/Max Aggregation over Spatial Objects Donghui Zhang, Vassilis J. Tsotras University of California, Riverside ACM GIS’01.
Chapter Tow Search Trees BY HUSSEIN SALIM QASIM WESAM HRBI FADHEEL CS 6310 ADVANCE DATA STRUCTURE AND ALGORITHM DR. ELISE DE DONCKER 1.
Decision Optimization Techniques for Efficient Delivery of Multimedia Streams Mugurel Ionut Andreica, Nicolae Tapus Politehnica University of Bucharest,
Spatial Data Management Chapter 28. Types of Spatial Data Point Data –Points in a multidimensional space E.g., Raster data such as satellite imagery,
Reliability Analysis of Tree Networks Applied to Balanced Content Replication Mugurel Ionut Andreica, Nicolae Tapus Polytechnic University of Bucharest.
Offline Algorithmic Techniques for Several Content Delivery Problems in Some Restricted Types of Distributed Systems Mugurel Ionut Andreica, Nicolae Tapus.
Optimal Scheduling of File Transfers with Divisible Sizes on Multiple Disjoint Paths Mugurel Ionut Andreica Polytechnic University of Bucharest Computer.
UNC Chapel Hill M. C. Lin Orthogonal Range Searching Reading: Chapter 5 of the Textbook Driving Applications –Querying a Database Related Application –Crystal.
External Memory Algorithms for Geometric Problems Piotr Indyk (slides partially by Lars Arge and Jeff Vitter)
14/13/15 CMPS 3130/6130 Computational Geometry Spring 2015 Windowing Carola Wenk CMPS 3130/6130 Computational Geometry.
How to reform a terrain into a pyramid Takeshi Tokuyama (Tohoku U) Joint work with Jinhee Chun (Tohoku U) Naoki Katoh (Kyoto U) Danny Chen (U. Notre Dame)
Maximum Reliability K-Hop Multicast Strategy in Tree Networks Mugurel Ionut Andreica, Nicolae Tapus Polytechnic University of Bucharest Computer Science.
Content Addressable Network CAN. The CAN is essentially a distributed Internet-scale hash table that maps file names to their location in the network.
A Scalable Content-Addressable Network (CAN) Seminar “Peer-to-peer Information Systems” Speaker Vladimir Eske Advisor Dr. Ralf Schenkel November 2003.
Mehdi Mohammadi March Western Michigan University Department of Computer Science CS Advanced Data Structure.
Computing the Volume of the Union of Cubes in R 3 Pankaj K. Agarwal Haim Kaplan Micha Sharir.
Efficient Data Structures for Online QoS-Constrained Data Transfer Scheduling Mugurel Ionut Andreica, Nicolae Tapus Politehnica University of Bucharest.
Lars Arge Presented by Or Ozery. I/O Model Previously defined: N = # of elements in input M = # of elements that fit into memory B = # of elements per.
UNC Chapel Hill M. C. Lin Computing Voronoi Diagram For each site p i, compute the common inter- section of the half-planes h(p i, p j ) for i  j, using.
1 Complex Spatio-Temporal Pattern Queries Cahide Sen University of Minnesota.
CSE554Contouring IISlide 1 CSE 554 Lecture 5: Contouring (faster) Fall 2015.
CSE554Contouring IISlide 1 CSE 554 Lecture 3: Contouring II Fall 2011.
CSE554Contouring IISlide 1 CSE 554 Lecture 5: Contouring (faster) Fall 2013.
Lecture 15 Computational Geometry Geometry sweeping Geometric preliminaries Some basics geometry algorithms.
1 Bottleneck Routing Games on Grids Costas Busch Rajgopal Kannan Alfred Samman Department of Computer Science Louisiana State University.
1. For minimum vertex cover problem in the following graph give
UNC Chapel Hill M. C. Lin Geometric Data Structures Reading: Chapter 10 of the Textbook Driving Applications –Windowing Queries Related Application –Query.
Spatial Data Management
CSE 554 Lecture 5: Contouring (faster)
Representing Sets (2.3.3) Huffman Encoding Trees (2.3.4)
Spatial Indexing.
Query Processing in Databases Dr. M. Gavrilova
CMPS 3130/6130 Computational Geometry Spring 2017
KD Tree A binary search tree where every node is a
Orthogonal Range Searching and Kd-Trees
Reporting (1-D) Given a set of points S on the line, preprocess them to build structure that allows efficient queries of the from: Given an interval I=[x1,x2]
Dynamic Data Structures for Simplicial Thickness Queries
Database Design and Programming
Processing an Offline Insertion-Query Sequence with Applications
Segment Tree and Its Usage for geometric Computations
Donghui Zhang, Tian Xia Northeastern University
Presentation transcript:

Efficient Algorithmic Techniques for Several Multidimensional Geometric Data Management and Analysis Problems Mugurel Ionut Andreica Politehnica University of Bucharest Computer Science Department

Efficient Algorithmic Techniques for Several Multidimensional Geometric Data Management and Analysis Problems 2 Summary Motivation Largest Empty Circle Largest Hyper-Rectangle with Fixed Aspect Ratio Circle Containment Hyper-Rectangle Containment Maximum Weight Subsequence Accepted by a NFA Conclusions

Efficient Algorithmic Techniques for Several Multidimensional Geometric Data Management and Analysis Problems 3 Motivation Extracting information from multidimensional data – applications in: –Business planning & integration –Statistics –Scientific research –Distributed systems Distributed, peer-to-peer message routing system –Peer identifiers: mapped into a metric space –Analysis of the distribution of identifiers in the metric space (empty zones, covering,...)

Efficient Algorithmic Techniques for Several Multidimensional Geometric Data Management and Analysis Problems 4 Largest Empty Circle (1/2) n points (x i,y i ) located inside a circle C find the largest circle C’, fully contained inside C, which contains none of the given points binary search the radius R of C’ => feasibility test –shrink the radius of C by R –inflate every point into a circle of radius R –decide if the union of the n circles covers the shrinked circle C => we need an algorithm for deciding if the union of n circles covers another circle

Efficient Algorithmic Techniques for Several Multidimensional Geometric Data Management and Analysis Problems 5 Largest Empty Circle (2/2) Compute a set S of all the x-coordinates of the intersection points of the circles, plus {x i, x i -R, x i +R} Maintain only the distinct x-coordinates in S (O(n 2 ) values) + sort them: xs 1 <xs 2 <...<xs M for each pair of consecutive coordinates (xs i, xs i+1 ) –xmid=(xs i +xs i+1 )/2 –each circle j intersects the vertical line x=xmid on an interval Ij –compute the union of the intervals (for each interval in the union, store the circle ids of its two endpoints) –compute the interval Ic (intersection of the shrinked circle C with x=xmid) –if union does not cover Ic => can increase R (in the binary search) –if all unions cover Ic => R is too large O(n 3 ·log(n)) (or O(n 2 ·(n+log(n)))

Efficient Algorithmic Techniques for Several Multidimensional Geometric Data Management and Analysis Problems 6 Largest Hyper-Rectangle with Fixed Aspect Ratio n d-dimensional points inside a d-dimensional hyper-rectangle R find the largest volume empty hyper-rectangle with fixed aspect ratio –length j =f j ·length 1 (j=1,...,d) (f 1 =1) binary search length 1 => feasibility test inflate every point to a hyper-rectangle with length 1 and fixed aspect ratio + shrink R by length i on dimension i (similar to the circle case) verify if the union of the hyper-rectangles covers R –clip every hyper-rectangle on the boundaries R –compute the volume V union of the union of the clipped hyper-rectangles – O(n d-1 ·log(n)) time –If V union =Volume(R) => R is covered => decrease length 1 ; otherwise, increase length 1

Efficient Algorithmic Techniques for Several Multidimensional Geometric Data Management and Analysis Problems 7 Circle Containment n circles (x i, y i, r i ) every two circles A and B: completely disjoint or A is included in B (or B in A) decide if each circle contains (is contained in) another circle consider left and right half-circles sweep a vertical line from left to right (considering only right half-circles) ; afterwards, do the same from right to left (with left half-circles only) maintain a balanced tree T of disjoint intervals (the intersections of the half-circles with the sweep line) when we reach the left half of a half-circle => we traverse all the intervals in T intersecting the vertical segment I=[y i -r i,y i +r i ] ; we adjust their length (it might be outdated) and check for inclusion –remove the intervals included in I (or I, if it is included in some interval from T) –insert (back) into T the adjusted, non-included intervals overall: O(n·log(n))

Efficient Algorithmic Techniques for Several Multidimensional Geometric Data Management and Analysis Problems 8 Hyper-Rectangle Containment n d-dimensional hyper-rectangles decide, for each hyper-rectangles, if it includes (is included in) another hyper-rectangle sweep line on the d-th coordinate associate a (2·d-2)-dimensional point p i to each rectangle i maintain a range tree R at left side of dimension d of rectangle i: insert p i into R with weight = the right side on dimension d-1 at right side of dimension d of rectangle i: –remove p i from R –range max query the range tree => find the hyper-rectangle Q with the largest (d-1)th dimension and s.t. the other dims are appropriate –we test if Q includes R

Efficient Algorithmic Techniques for Several Multidimensional Geometric Data Management and Analysis Problems 9 Maximum Weight Subsequence Accepted by a NFA sequence of n d-dimensional points with weights a non-deterministic finite automaton NFA –each directed edge (q(1),q(2)): d intervals [a(1),b(1)],..., [a(d),b(d)] –pass from q(1) to q(2) only if the difference between the coordinates of the next point (and the current one) on dim j is within [a(j),b(j)] find a non-contiguous subsequence of points of maximum total weight s.t. NFA reaches a final state W max (i,j)=the maximum weight of a subsequence ending at point i and reaching the state q(j) (dynamic programming) O(n·(m+|E|)·log d (n)) –m = #of states of NFA –|E|=# of (directed) edges of NFA –maintain a range tree for each state of the NFA very general model special cases: longest increasing subsequence, longest alternating (antimonotonic) subsequence

Efficient Algorithmic Techniques for Several Multidimensional Geometric Data Management and Analysis Problems 10 Conclusions Geometric data analysis problems –Largest Empty Circle –Largest Hyper-Rectangle with Fixed Aspect Ratio –Circle Containment –Hyper-Rectangle Containment –Maximum Weight Subsequence Accepted by a NFA Applications to many domains –my motivation: analysis of data from distributed systems with node identifiers mapped into a metric space

Efficient Algorithmic Techniques for Several Multidimensional Geometric Data Management and Analysis Problems 11 Thank You !