Download presentation
Presentation is loading. Please wait.
1
Query Evaluation Techniques for Cluster Database Systems Andrey V. Lepikhov, Leonid B. Sokolinsky South Ural State University Russia 22 September 2010 1 ADBIS 2010
2
Outline 22 September 2010 ADBIS 20102 Motivation Problem Statement Background Partial mirroring method Results Future work
3
Motivation 22 September 2010 3 Top500 Cluster: 84.8% MPP: 14.8% Others: 0.4% ADBIS 2010
4
Problem Statement 22 September 2010 4 Not expensive parallel hardware needs not expensive parallel database management system Today we have no such chip parallel database management system ADBIS 2010
5
Background 22 September 2010 5ADBIS 2010
6
Exchange operator 22 September 2010 ADBIS 20106 p: port ψ : distributing function
7
Parallel plan for query Q = R S 22 September 2010 ADBIS 20107
8
Query processing in cluster system 22 September 2010 8ADBIS 2010
9
The problem 22 September 2010 ADBIS 20109 Load balancing
10
Partial mirroring method Fragmentation strategy Replication strategy 10
11
Fragmentation strategy 22 September 2010 ADBIS 201011 FRAGMENTATION Source relation P0P0 P1P1 PnPn Relation is divided into fragments distributed among cluster nodes Each fragment is divided into sequence of segments with an equal length Segment is the minimal unit of replication...
12
Replication strategy 22 September 2010 ADBIS 201012 Disk D i Disk D j ρ j = 50% RiRi
13
Load balancing method 22 September 2010 ADBIS 201013 D1D1 D2D2 t1t1 D1D1 D2D2 t2t2 D1D1 D2D2 - scheduled to process D1D1 D2D2 t4t4 - processed t3t3 P1P1 P2P2 P1P1 P2P2 P1P1 P1P1 P2P2 P2P2 - not used
14
Parallel agent with two input streams 22 September 2010 ADBIS 201014
15
Load balancing algorithm 22 September 2010 ADBIS 201015
16
Parameters of experiments 22 September 2010 ADBIS 201016
17
Speedup versus replication factor 22 September 2010 ADBIS 201017
18
Speedup versus skew factor θ 22 September 2010 ADBIS 201018 0.68 corresponds to the ”80-20” rule (80 percents of tuples of the relation will be stored in 20 percents of fragments) 0 corresponds to the uniform distribution
19
Future Work 22 September 2010 ADBIS 201019 To incorporate the proposed technique of parallel query execution into open source PostgreSQL DBMS. To extend this approach on GRID DBMS for clusters with multicor processors.
20
22 September 2010 ADBIS 201020 Thank you
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.