M.Nedim Alpdemir, Anastasios Gounaris¹, Arijit Mukherjee², Desmond Fitzgerald, Norman W. Paton¹, Paul Watson², Rizos Sakellariou¹, Alvaro A.A. Fernandes¹, Jim Smith 2 1 University of Manchester, UK 2 University of Newcastle, UK Acknowledgement: Much of the content in many of the slides has been authored by co-workers, especially Nedim Alpdemir and Jim Smith. Errors are mine, of course. Experience on Performance Evaluation with OGSA- DQP
21 Sep 2005AHM Outline of talk The OGSA-DQP system Impact of infrastructure layers Profiling
21 Sep 2005AHM OGSA-DQP system Unified view of and access to remote DBMSs
21 Sep 2005AHM Selecting Resources in OGSA-DQP Unified schema machines
21 Sep 2005AHM Evaluating Queries in OGSA-DQP query plan
21 Sep 2005AHM High level architecture
21 Sep 2005AHM Brief tour: an illustration varchar 12 varchar 12 varchar 12 id goterm goterm.OID string goterm.id string goterm.type string goterm.name goterms goterms LIKE...
21 Sep 2005AHM Where we are The OGSA-DQP system Impact of infrastructure layers Profiling Work based on publicly available releases OGSA-DQP 2.0, OGSA-DAI 4.0, GT 3.2. !! Some of the figures presented do not describe the behaviour of the system any more and refer to pre-optimisation stages. Complementarily, see OGSA-DAI papers in AHM
21 Sep 2005AHM Data Sources Protein_goterm ( B ~ 404KB) Protein_interaction ( B ~ 227KB)
21 Sep 2005AHM Representation in WebRowSet
21 Sep 2005AHM (1) Access Techniques
21 Sep 2005AHM Configurations JDBC local/remote GDS local/remote sync/async OGSA-DQP GQES co-located with GDS GQES calls GDS asynchronously
21 Sep 2005AHM Scan-1 (protein_goterm)
21 Sep 2005AHM Scan-2 (protein_interaction)
21 Sep 2005AHM Join (both tables)
21 Sep 2005AHM Remote Access by Block Block access benefits local cases too (e.g. DQP access to protein_goterms 156 -> 28s)
21 Sep 2005AHM (2) Breaking Down Costs ( ) 15.1 ( ) Total Cost: 28.6 secs
21 Sep 2005AHM (3) Parallelizing Operation Call
21 Sep 2005AHM Configuration Two parameters: a number of copies of WS are available; a number of spare machines are available – compiler plants an op-call on each.
21 Sep 2005AHM Measurements
21 Sep 2005AHM Lessons Increase granularity of inter-service communication, like access to GDS. Reduce delivery cost – coalesce root evaluator within GDQS – in progress. Parallelizing expensive operations can be beneficial – ongoing work. Translations performed in transfers and XML WebRowSet processing can be expensive – ongoing work.
21 Sep 2005AHM Current Work New Release Supports WS-RF/ WS-I Builds on top of OGSA-DAI 7.0 (due in September) Includes optimisations Investigating adaptive and fault-tolerant mechanisms
21 Sep 2005AHM Contact OGSA-DAI and OGSA-DQP software Project site, mentioning adaptivity and fault-tolerance