School of Software SUN YAT-SEN UNIVERSITY Mar, 27, 2011.


1 School of Software SUN YAT-SEN UNIVERSITY Mar, 27, 2011

2 The Procedure of Installing SQL Server 2005 (Microsoft SQL Server 2005 Express; Microsoft SQL Server Management Studio Express); Introduction to TPC-H and Generating lineitem.tbl; Importing lineitem.tbl into SQL Server; Experiment on the Efficiency of GROUP BY versus GROUP BY WITH CUBE

3 Configuration requirements

4 Installation procedure for SQL Server 2005 Express


8 This situation applies only if VS2005 is already installed

9 Installation procedure for SQL Server 2005 Express


11 Connect to SQL Server


13 The interface of SQL Server


15 The TPC Benchmark™H (TPC-H) is a decision support benchmark. The components of the TPC-H database are defined to consist of eight separate and individual tables.


17 Get tpch_2_14_0. The DBGEN program can be downloaded from the following URL: http://www.tpc.org/tpch/spec/tpch_2_14_0.zip The schema of LINEITEM can be found on page 12 of tpch2.14.0.doc, which can be downloaded from the following URL: http://www.tpc.org/tpch/spec/tpch2.14.0_cb.doc

18 Create lineitem.tbl (Linux)

19 Create a new query

20 Create database dbTPC
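In a query window, the database can be created with a single statement. The following is a minimal sketch that assumes the default file locations; the existence check is an addition for re-runnability and does not come from the slides.

```sql
-- Create dbTPC only if it does not exist yet, then switch to it.
IF DB_ID('dbTPC') IS NULL
    CREATE DATABASE dbTPC;
GO

USE dbTPC;
GO
```

GO is the batch separator understood by Management Studio and sqlcmd; it is not a T-SQL statement itself.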

21 Use the graphical interface


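23 Create the table using SQL:
use dbTPC
-- decimal(15,2) keeps the two decimal places of prices, discounts, and taxes;
-- a bare "decimal" defaults to decimal(18,0) and would drop the fractional part.
create table lineitem (
    orderkey int,
    partkey int,
    suppkey int,
    linenumber int,
    quantity int,
    extendedprice decimal(15,2),
    discount decimal(15,2),
    tax decimal(15,2),
    returnflag nchar(1),
    linestatus nchar(1),
    shipdate datetime,
    commitdate datetime,
    receiptdate datetime,
    shipinstruct nchar(25),
    shipmode nchar(10),
    comment varchar(44)
)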

24 Create the table using the graphical interface

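25 Step 1: Import the file into SQL Server using BULK INSERT. Replace Tablename with the target table (here, lineitem). Each row written by DBGEN ends with a trailing '|', so the row terminator includes it; adjust the line-ending part to match the actual file.
BULK INSERT Tablename FROM 'D:\lineitem.tbl' WITH ( FIELDTERMINATOR = '|', ROWTERMINATOR = '|\r' )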
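After the import it is worth checking that the rows arrived as expected. The queries below are a small sketch of such a check, assuming the data was loaded into the lineitem table of dbTPC; they are not part of the original slides. At scale factor 1, DBGEN produces 6,001,215 lineitem rows, so the count should match that number if that scale factor was used.

```sql
USE dbTPC;

-- Number of imported rows.
SELECT COUNT(*) AS row_count
FROM lineitem;

-- Spot-check a few rows, including the decimal and date columns.
SELECT TOP 5 orderkey, linenumber, quantity, extendedprice, discount, tax, shipdate
FROM lineitem
ORDER BY orderkey, linenumber;
```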

26 GROUP BY is most useful in combination with aggregate functions. When used together with GROUP BY, an aggregate function produces one value for each group rather than a single value for the whole table.
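The difference between the two cases can be seen on a handful of rows. The sketch below uses a table variable with made-up sample rows (the values are hypothetical; only the column names follow the lineitem table).

```sql
-- Hypothetical sample rows; not taken from lineitem.tbl.
DECLARE @sample TABLE (orderkey int, returnflag nchar(1), quantity int);

INSERT INTO @sample (orderkey, returnflag, quantity)
SELECT 1, N'N', 17 UNION ALL
SELECT 1, N'N', 36 UNION ALL
SELECT 2, N'R', 8  UNION ALL
SELECT 3, N'R', 45 UNION ALL
SELECT 3, N'A', 49;

-- Without GROUP BY: one value for the whole table (155).
SELECT SUM(quantity) AS total_quantity
FROM @sample;

-- With GROUP BY: one value per returnflag group (A = 49, N = 53, R = 53).
SELECT returnflag, SUM(quantity) AS total_quantity
FROM @sample
GROUP BY returnflag;
```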

27 Example: Display how many line items there are for each return status. SQL: SELECT returnflag, COUNT(*) FROM lineitem GROUP BY returnflag

28 Example: Display the total quantity of the line items that belong to the same order and have the same return status. SQL: SELECT returnflag, orderkey, SUM(quantity) FROM lineitem GROUP BY returnflag, orderkey

29 The CUBE operator generates a result set that is a multidimensional cube. A multidimensional cube is an expansion of fact data; the expansion is based on the columns that the user wants to analyze. The cube is a result set that contains all the possible combinations of the dimensions.

30 SELECT Item, Color, SUM(Quantity) AS QtySum FROM Inventory GROUP BY Item, Color WITH CUBE
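To try this query without an existing Inventory table, the following self-contained sketch uses a table variable. The four rows are hypothetical sample data (two items in two colors), chosen so that the result contains four detail rows, two subtotals per dimension, and one grand total.

```sql
-- Hypothetical sample data: two items in two colors.
DECLARE @Inventory TABLE (Item varchar(10), Color varchar(10), Quantity int);

INSERT INTO @Inventory (Item, Color, Quantity)
SELECT 'Table', 'Blue', 124 UNION ALL
SELECT 'Table', 'Red',  223 UNION ALL
SELECT 'Chair', 'Blue', 101 UNION ALL
SELECT 'Chair', 'Red',  210;

SELECT Item, Color, SUM(Quantity) AS QtySum
FROM @Inventory
GROUP BY Item, Color WITH CUBE;

-- Rows returned (the order is not guaranteed without ORDER BY):
--   Chair  Blue  101
--   Chair  Red   210
--   Table  Blue  124
--   Table  Red   223
--   Chair  NULL  311   (subtotal for Item = Chair)
--   Table  NULL  347   (subtotal for Item = Table)
--   NULL   Blue  225   (subtotal for Color = Blue)
--   NULL   Red   433   (subtotal for Color = Red)
--   NULL   NULL  658   (grand total)
```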


32 These four rows report the original sums; in other words, this time we get four groups, each with its own sum. SELECT Item, Color, SUM(Quantity) AS QtySum FROM Inventory GROUP BY Item, Color

33 These two rows report the subtotals for the Item dimension. Both have NULL in the Color dimension to show that the aggregate data came from rows having any value for the Color dimension. SELECT Item, SUM(Quantity) AS QtySum FROM Inventory GROUP BY Item

34 These two rows report the subtotals for the Color dimension. Both have NULL in the Item dimension to show that the aggregate data came from rows having any value for the Item dimension. SELECT Color, SUM(Quantity) AS QtySum FROM Inventory GROUP BY Color

35 This row reports the grand total for the cube. All values of both dimensions are summarized in this row. SELECT SUM(Quantity) AS QtySum FROM Inventory. We can extend this to n dimensions: 2^n different combinations of the dimensions must be considered.
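For two dimensions, the 2^2 = 4 separate GROUP BY queries can be combined with UNION ALL to reproduce the cube. The sketch below is self-contained and uses a table variable with hypothetical sample data; it illustrates the equivalence and is not the queries used in the experiment.

```sql
-- Hypothetical sample data: two items in two colors.
DECLARE @Inventory TABLE (Item varchar(10), Color varchar(10), Quantity int);

INSERT INTO @Inventory (Item, Color, Quantity)
SELECT 'Table', 'Blue', 124 UNION ALL
SELECT 'Table', 'Red',  223 UNION ALL
SELECT 'Chair', 'Blue', 101 UNION ALL
SELECT 'Chair', 'Red',  210;

-- Both dimensions: the four detail groups.
SELECT Item, Color, SUM(Quantity) AS QtySum
FROM @Inventory
GROUP BY Item, Color
UNION ALL
-- Item only: Color is NULL in the subtotal rows.
SELECT Item, NULL, SUM(Quantity)
FROM @Inventory
GROUP BY Item
UNION ALL
-- Color only: Item is NULL in the subtotal rows.
SELECT NULL, Color, SUM(Quantity)
FROM @Inventory
GROUP BY Color
UNION ALL
-- No grouping: the grand total.
SELECT NULL, NULL, SUM(Quantity)
FROM @Inventory;
```

The combined result has the same nine rows as GROUP BY Item, Color WITH CUBE on the same data, although the row order may differ.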

36 Analyze the columns orderkey, partkey, suppkey, and linenumber of table lineitem using WITH CUBE.
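A query of the following shape performs this analysis. The slide does not show which aggregate was computed, so COUNT(*) here is an assumption for illustration.

```sql
USE dbTPC;

-- COUNT(*) is an assumed aggregate; the original query is not shown in the slides.
SELECT orderkey, partkey, suppkey, linenumber, COUNT(*) AS row_count
FROM lineitem
GROUP BY orderkey, partkey, suppkey, linenumber WITH CUBE;
```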

37 Use 16 GROUP BY queries to simulate the result set of GROUP BY WITH CUBE.
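The slides do not say how the individual queries were timed. One possible approach, shown here as a sketch, records GETDATE() before each query and reports the difference in milliseconds. The variable names, the COUNT(*) aggregate, and the wrapping of each grouping in an outer COUNT(*) (to avoid sending every group to the client) are assumptions made for this example.

```sql
USE dbTPC;

DECLARE @start datetime;
DECLARE @groups int;

-- One of the 16 groupings: orderkey and partkey.
SET @start = GETDATE();
SELECT @groups = COUNT(*)
FROM (SELECT orderkey, partkey, COUNT(*) AS row_count
      FROM lineitem
      GROUP BY orderkey, partkey) AS g;
SELECT 'orderkey, partkey' AS grouping_columns,
       @groups AS group_count,
       DATEDIFF(ms, @start, GETDATE()) AS elapsed_ms;

-- The "no grouping" case: a single aggregate over the whole table.
SET @start = GETDATE();
SELECT @groups = COUNT(*)
FROM lineitem;
SELECT 'no grouping' AS grouping_columns,
       @groups AS row_count,
       DATEDIFF(ms, @start, GETDATE()) AS elapsed_ms;
```

The remaining 14 groupings follow the same pattern with a different column list in the inner GROUP BY.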

38
GROUP BY                    Columns                                   Milliseconds
No grouping (1)             (none)                                    16
Group with 4 columns (1)    orderkey, partkey, suppkey, linenumber    31
Group with 3 columns (4)    orderkey, partkey, suppkey                31
                            orderkey, partkey, linenumber             16
                            orderkey, suppkey, linenumber             31
                            partkey, suppkey, linenumber              16
Group with 2 columns (6)    orderkey, partkey                         16
                            orderkey, linenumber                      15
                            orderkey, suppkey                         16
                            suppkey, linenumber                       15
                            partkey, suppkey                          15
                            partkey, linenumber                       15
Group with 1 column (4)     orderkey                                  16
                            partkey                                   16
                            suppkey                                   31
                            linenumber                                16
Total                                                                 302
GROUP BY WITH CUBE          orderkey, partkey, suppkey, linenumber    140
Note: the 16 individual times listed above add up to 312 ms, while the reported total is 302 ms.

39 A. Use the DBGEN program of the TPC-H benchmark to generate all eight tables of the TPC-H schema, with the scale factor set to 1. B. Create a database with the eight tables, including possible constraints (you can refer to tpch2.14.0.doc), and then import the generated data. Submit all nine queries and the time cost of importing the data.
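As a starting point for part B, the sketch below creates two of the eight tables, REGION and NATION, with primary-key and foreign-key constraints. The column names drop the TPC-H prefixes (R_, N_) to match the lineitem table used earlier, and the constraint names are made up for this example; check the column types against tpch2.14.0.doc before relying on them.

```sql
USE dbTPC;

-- REGION: referenced by NATION, so it is created first.
CREATE TABLE region (
    regionkey int NOT NULL,
    name      nchar(25) NOT NULL,
    comment   varchar(152),
    CONSTRAINT pk_region PRIMARY KEY (regionkey)
);

-- NATION: each nation belongs to exactly one region.
CREATE TABLE nation (
    nationkey int NOT NULL,
    name      nchar(25) NOT NULL,
    regionkey int NOT NULL,
    comment   varchar(152),
    CONSTRAINT pk_nation PRIMARY KEY (nationkey),
    CONSTRAINT fk_nation_region FOREIGN KEY (regionkey)
        REFERENCES region (regionkey)
);
```

With the foreign key in place, region.tbl has to be imported before nation.tbl; the same ordering applies to the other tables that reference each other.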

40 THANK YOU!

