Query Evaluation Techniques for Cluster Database Systems Andrey V. Lepikhov, Leonid B. Sokolinsky South Ural State University Russia 22 September ADBIS 2010
Outline 22 September 2010 ADBIS Motivation Problem Statement Background Partial mirroring method Results Future work
Motivation 22 September Top500 Cluster: 84.8% MPP: 14.8% Others: 0.4% ADBIS 2010
Problem Statement 22 September Not expensive parallel hardware needs not expensive parallel database management system Today we have no such chip parallel database management system ADBIS 2010
Background 22 September ADBIS 2010
Exchange operator 22 September 2010 ADBIS p: port ψ : distributing function
Parallel plan for query Q = R S 22 September 2010 ADBIS 20107
Query processing in cluster system 22 September ADBIS 2010
The problem 22 September 2010 ADBIS Load balancing
Partial mirroring method Fragmentation strategy Replication strategy 10
Fragmentation strategy 22 September 2010 ADBIS FRAGMENTATION Source relation P0P0 P1P1 PnPn Relation is divided into fragments distributed among cluster nodes Each fragment is divided into sequence of segments with an equal length Segment is the minimal unit of replication...
Replication strategy 22 September 2010 ADBIS Disk D i Disk D j ρ j = 50% RiRi
Load balancing method 22 September 2010 ADBIS D1D1 D2D2 t1t1 D1D1 D2D2 t2t2 D1D1 D2D2 - scheduled to process D1D1 D2D2 t4t4 - processed t3t3 P1P1 P2P2 P1P1 P2P2 P1P1 P1P1 P2P2 P2P2 - not used
Parallel agent with two input streams 22 September 2010 ADBIS
Load balancing algorithm 22 September 2010 ADBIS
Parameters of experiments 22 September 2010 ADBIS
Speedup versus replication factor 22 September 2010 ADBIS
Speedup versus skew factor θ 22 September 2010 ADBIS corresponds to the ”80-20” rule (80 percents of tuples of the relation will be stored in 20 percents of fragments) 0 corresponds to the uniform distribution
Future Work 22 September 2010 ADBIS To incorporate the proposed technique of parallel query execution into open source PostgreSQL DBMS. To extend this approach on GRID DBMS for clusters with multicor processors.
22 September 2010 ADBIS Thank you