Some TPC-H queries on Teradata and PostgreSQL

Slides:



Advertisements
Similar presentations
Extreme Performance with Oracle Data Warehousing
Advertisements

OLAP Tuning. Outline OLAP 101 – Data warehouse architecture – ROLAP, MOLAP and HOLAP Data Cube – Star Schema and operations – The CUBE operator – Tuning.
Data Warehouse Tuning. 7 - Datawarehouse2 Datawarehouse Tuning Aggregate (strategic) targeting: –Aggregates flow up from a wide selection of data, and.
Big Data Working with Terabytes in SQL Server Andrew Novick
Nondeterministic Queries in a Relational Grid Information Service Peter A. Dinda Dong Lu Prescience Lab Department of Computer Science Northwestern University.
Performance and Scalability. Optimizing PerformanceScaling UpScaling Out.
Faster Than Alter – Less Downtime Chris Schneider.
Shimin Chen Big Data Reading Group.  Energy efficiency of: ◦ Single-machine instance of DBMS ◦ Standard server-grade hardware components ◦ A wide spectrum.
Meanwhile RAM cost continues to drop Moore’s Law on total CPU processing power holds but in parallel processing… CPU clock rate stalled… Because.
High-Performance Task Distribution for Volunteer Computing Rom Walton
Virtual techdays INDIA │ 9-11 February 2011 SQL 2008 Query Tuning Praveen Srivatsa │ Principal SME – StudyDesk91 │ Director, AsthraSoft Consulting │ Microsoft.
Tuning Relational Systems I. Schema design  Trade-offs among normalization, denormalization, clustering, aggregate materialization, vertical partitioning,
Adapted from a talk by: Sapna Jain & R. Gokilavani Some slides taken from Jingren Zhou's talk on Scope : isg.ics.uci.edu/slides/MicrosoftSCOPE.pptx.
PNUTS: YAHOO!’S HOSTED DATA SERVING PLATFORM FENGLI ZHANG.
Analyzing the Energy Efficiency of a Database Server Hanskamal Patel SE 521.
Jingren Zhou, Per-Ake Larson, Ronnie Chaiken ICDE 2010 Talk by S. Sudarshan, IIT Bombay Some slides from original talk by Zhou et al. 1.
Performance and Scalability. Performance and Scalability Challenges Optimizing PerformanceScaling UpScaling Out.
Module 18 Monitoring SQL Server 2008 R2. Module Overview Monitoring Activity Capturing and Managing Performance Data Analyzing Collected Performance Data.
Report : Zhen Ming Wu 2008 IEEE 9th Grid Computing Conference.
Ingres Plus X100 Equals Ingres Vectorwise. Agenda  Why?  Introduction to Vectorwise  Groundwork  Vectorwise and OPF  Vectorwise and QEF.
Index tuning Performance Tuning.
A Paradigm Shift in Database Optimization: From Indices to Aggregates Presented to: The Data Warehousing & Data Mining mini-track – AMCIS 2002 as Research-in-Progress.
Zois Vasileios Α. Μ :4183 University of Patras Department of Computer Engineering & Informatics Diploma Thesis.
Physical Database Design & Performance. Optimizing for Query Performance For DBs with high retrieval traffic as compared to maintenance traffic, optimizing.
© Dennis Shasha, Philippe Bonnet – 2013 Communicating with the Outside.
Business Intelligence Appliance Powerful pay as you grow BI solutions with Engineered Systems.
© Dennis Shasha, Alberto Lerner, Philippe Bonnet 2004 DBMS Performance Monitoring.
Achieving Scalability, Performance and Availability on Linux with Oracle 9iR2-RAC Grant McAlister Senior Database Engineer Amazon.com Paper
GUIDED BY DR. A. J. AGRAWAL Search Engine By Chetan R. Rathod.
02/09/2010 Industrial Project Course (234313) Virtualization-aware database engine Final Presentation Industrial Project Course (234313) Virtualization-aware.
SQL Server 2000 Sys Admin Jeremiah Curtis Engineering Services
Exploiting Asynchronous IO using the Asynchronous Iterator Model Suresh Iyengar * S. Sudarshan Santosh Kumar # Raja Agrawal & IIT Bombay Current affiliations:
Building a Distributed Full-Text Index for the Web by Sergey Melnik, Sriram Raghavan, Beverly Yang and Hector Garcia-Molina from Stanford University Presented.
SQL/Lesson 7/Slide 1 of 32 Implementing Indexes Objectives In this lesson, you will learn to: * Create a clustered index * Create a nonclustered index.
Copyright 2007, Information Builders. Slide 1 Machine Sizing and Scalability Mark Nesson, Vashti Ragoonath June 2008.
Relational Operator Evaluation. Overview Application Programmer (e.g., business analyst, Data architect) Sophisticated Application Programmer (e.g.,
SQL Server 2005 XML Datatype David Wilson Ohio North SQL Server Special Interest Group July 12, 2007.
Your Data Any Place, Any Time Performance and Scalability.
Last Updated : 27 th April 2004 Center of Excellence Data Warehousing Group Teradata Performance Optimization.
IMS 4212: Database Implementation 1 Dr. Lawrence West, Management Dept., University of Central Florida Physical Database Implementation—Topics.
SQL Query Analyzer. Graphical tool that allows you to:  Create queries and other SQL scripts and execute them against SQL Server databases. (Query window)
Lock Tuning. Overview Data definition language (DDL) statements are considered harmful DDL is the language used to access and manipulate catalog or metadata.
Troubleshooting Dennis Shasha and Philippe Bonnet, 2013.
SQL Server 2016 – New Features Tilahun Endihnew March 12, 2016.
Presentation Title Goes Here …presentation subtitle. SCRIPTS FOR RETRIEVING DATASETS.
ETL Validator Deployment Options
Planning a Migration.
Understanding and Improving Server Performance
Wander Join: Online Aggregation via Random Walks
Value of Serializability
SQL Server 2000 and Access 2000 limits
Database Performance Tuning &
Methodology – Physical Database Design for Relational Databases
WORKFLOW PETRI NETS USED IN MODELING OF PARALLEL ARCHITECTURES
Arranging the Join Order: the Wong-Youssefi algorithm (INGRES)
Database Performance Tuning and Query Optimization
IDISK Cluster 8 disks, 8 CPUs, DRAM /shelf
Introduction to Query Optimization
Optimizing Queries Using Materialized Views
Database.
External Sorting The slides for this text are organized into chapters. This lecture covers Chapter 11. Chapter 1: Introduction to Database Systems Chapter.
Troubleshooting Techniques(*)
Index Tuning Additional knowledge.
Query Processing CSD305 Advanced Databases.
Semantic Query Optimization
Chapter 11 Database Performance Tuning and Query Optimization
Big Data Analytics: Exploring Graphs with Optimized SQL Queries
Query Optimization Highlights
Performance And Scalability In Oracle9i And SQL Server 2000
Query Processing.
Presentation transcript:

Some TPC-H queries on Teradata and PostgreSQL Project Partners: Amreek Singh (02329025) Chetan Vaity (02329901)

Motivation Usage of real Database Systems Gain some experience in database tuning Work with Teradata machine in SIT Test Setup Twin processors with 2GB RAM, proprietary parallel storage system Windows 2000 Advanced Server Teradata v4.1.2 Twin Xeon processors with 2GB RAM, RAID 5 Linux Kernel version 2.4.18-10smp PostgreSQL v7.2.1

TPC-H Schema A typical manufacturing concern database Part (200K rows) Order (1500K rows) Partsupp (800K rows) Lineitem (6000K rows) Supplier (10K rows) Customer (150K rows) Nation (24 rows) Region (5 rows) A typical manufacturing concern database Approximately 1GB of data

TPC-H Query 2 Teradata Query Plan Region Nation SELECT s_acctbal, s_name, n_name, p_partkey, p_mfgr, s_address, s_phone, s_comment FROM part, supplier, partsupp, nation, region WHERE p_partkey = ps_partkey AND s_suppkey = ps_suppkey AND p_size = 15 AND p_type like '%BRASS' AND s_nationkey = n_nationkey AND n_regionkey = r_regionkey AND r_name = 'EUROPE' AND ps_supplycost = ( min(ps_supplycost) partsupp, supplier, nation, region ) ORDER BY s_acctbal desc, n_name, s_name, p_partkey; Region Nation Region (r_name=‘EUROPE’) Part Part (p_size=15) (p_type=‘%BRASS’) Partsupp Supplier Teradata Query Plan

Analysis of query execution plans of both systems Added indexes (B-Tree indexes on all) Rewrote the query using “explicit join” clause Reduced query time from 40 minutes to 2 seconds Region Nation Region (r_name=‘EUROPE’) Part Part (p_size=15) (p_type=‘%BRASS’) Partsupp Supplier PostgreSQL Query Plan

Query execution times Teradata t1 t2 t3 PostgreSQL Initial 31 sec 25 sec 26 sec Q2 11 sec 10 sec 1 sec Q3 2 min 15sec 2 min 11 sec 1 min 16 sec After adding secondary index on n_nationkey on supplier table After Collect statistics PostgreSQL Initial After “ANALYZE” After adding indexes Q6 33 sec 32 sec 30 sec Q2 40 m 19 sec 35 min 4 sec 2 sec Q3 11 min 43 sec 11 min 32 sec 1 min 11 sec

Conclusion: Bibliography Query plans are very useful in database tuning Parallel architecture under full DBMS control performs Bibliography http://www.tpc.org PostgreSQL Documentation Teradata Documentation Database Tuning, Dennis Shasha