Download presentation
Presentation is loading. Please wait.
Published byPearl Brooks Modified over 9 years ago
1
School of Software SUN YAT-SEN UNIVERSITY Mar, 27, 2011
2
The Procedure of Installing SQL Server 2005 Microsoft SQL Server 2005 Express Microsoft SQL Server Management Studio Express Introduction of TPC-H and Generate lineitem.tbl Import Lineitem.tbl into SQL Server Experiment about the Efficiency between Croup By and Group By With Cube
3
Configuration demands
4
Install procedure for SQL server 2005 Express
8
This situation only for installing VS2005 already
9
Install procedure for SQL server 2005 Express
11
Connect to SQL Server
13
The interface of SQL Server
15
The TPC Benchmark™H (TPC-H) is a decision support benchmark. The components of the TPC-H database are defined to consist of eight separate and individual tables.
17
Get the tpch_2_14_0 The DBGEN program can be downloaded at the following URL: http://www.tpc.org/tpch/spec/tpch_2_14_0.zip The schema of LINEITEM can be found at page 12 in the tpch2.14.0.doc, which can be downloaded at the following URL: http://www.tpc.org/tpch/spec/tpch2.14.0_cb.doc
18
Create lineitem.tbl (Linux)
19
Create a new query
20
Create database dbTPC
21
Use graphical interfaces
23
Create the table use SQL use dbTPC create table lineitem ( orderkey int, partkey int, suppkey int, linenumber int, quantity int, extendedprice decimal, discount decimal, tax decimal, returnflag nchar(1), linestatus nchar(1), shipdate datetime, commitdate datetime, receiptdate datetime, shipinstruct nchar(25), shipmode nchar(10), comment varchar(44) )
24
Create the table use interface
25
Step 1 Import file into SQL Server Using Bulk Insert. BULK INSERT Tablename FROM 'D: \lineitem.tbl' WITH ( FIELDTERMINATOR = '|', ROWTERMINATOR = '|\r' )
26
When GROUP BY and Aggregate Functions are used together, the practical meaning is significant. The Aggregate Functions generate a value for each group when used together with GROUP BY, other than for the whole table.
27
Example : Display the how many lineitems are at each returning status. SQL: SELECT returnflag, COUNT(*) FROM lineitem GROUP BY returnflag
28
Example : Display the quantity of lineitems which come from the same order and at the same returning status. order and they. SQL: SELECT returnflag, orderkey, SUM(quantity) FROM lineitem GROUP BY returnflag, orderkey
29
The CUBE operator generates a result set that is a multidimensional cube. A multidimensional cube is an expansion of fact data, The expansion is based on columns that the user wants to analyze The cube is a result set that contains all the possible combinations of the dimensions.
30
SELECT Item, Color, SUM(Quantity) AS QtySum FROM Inventory GROUP BY Item, Color WITH CUBE
31
SELECT Item, Color, SUM(Quantity) AS QtySum FROM Inventory GROUP BY Item, Color WITH CUBE
32
These four rows report the the original sum, in another words this time we get four groups with their sum value. SELECT Item, Color, SUM(Quantity) AS QtySum FROM Inventory GROUP BY Item,Color
33
These two rows report the subtotals for the Item dimension. both have null in the Color dimension to show that aggregate date came from rows having any value for the Color dimension. SELECT Item, SUM(Quantity) AS QtySum FROM Inventory GROUP BY Item
34
These two rows report the subtotals for the Color dimension. both have null in the Item dimension to show that aggregate date came from rows having any value for the item dimension. SELECT Color, SUM(Quantity) AS QtySum FROM Inventory GROUP BY Color
35
This row reports the grand total for the cube. All values of both dimensions are summarized in the row. SELECT SUM(Quantity) AS QtySum FROM Inventory Then we can extend this situation to n dimensions. 2 n different combinations of the dimensions should be considered.
36
Analysis the column orderkey, partkey, suppkey, linenumber of Table LineItem using WITH CUBE.
37
Using 16 GROUP BY clauses simulate the result set of GROUP BY WITH CUBE.
38
GROUP BYmillisecond No grouping(1)16 Group with 4 column(1)orderkey, partkey, suppkey, linenumber31 Group with 3 column(4)orderkey, partkey, suppkey31 orderkey, partkey,linenumber16 orderkey, suppkey, linenumber31 partkey, suppkey, linenumber16 Group with 2 column(6)orderkey, partkey16 orderkey, linenumber15 orderkey, suppkey16 suppkey, linenumber15 partkey, suppkey15 partkey, linenumber15 Group with 1 column(4)orderkey16 partkey16 suppkey31 linenumber16 Total302 GROUP BY WITH CUBEorderkey,partkey,suppkey,linenumber140
39
A.Use the DBGEN program of the TPC-H Benchmark to generate all the eight tables of the TPC-H schema, with the Scale Factor set to 1. B. Create a database with eight tables including possible constrains(You can refer to tpch2.14.0.doc), and then import the generated data. Submit all the nine queries and the time cost for importing data.
40
THANK YOU!
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.