Scalable Data Warehouse & Data Marts ReportsAnalysis SQL Server DBMS SQL Server Integration Services Custom OLTP Increase usage & trust Consolidate & reduce cost
7 Method for designing a cost-effective, balanced system for Data Warehouse workloads Reference hardware configurations developed in conjunction with hardware partners using this method Best practices for data layout, loading and management
Software: SQL Server 2008 Enterprise Windows Server 2008 Hardware: Tight specifications for servers, storage and networking ‘Per core’ building block Configuration guidelines: Physical table structures Indexes Compression SQL Server settings Windows Server settings Loading
Data Path Data Warehouse Analysis Services Cubes PerformancePoint SAN, Storage Array Reporting Services Web Analytic Tools Integration Services ETL SharePoint Services Microsoft Office SharePoint Data Staging, Bulk Loading Subject Area Data Marts Supporting SystemsBI Data Storage SystemsPresentation Layer Systems Reference Architecture Scope (dashed) Excel Services Presentation Data
Designing High Performance I/O
CPU PCI Bus I/O Controller / HBA Cabling Array Cache Spindle Windows SQL Serv. DB ComponentBalance For… CPUMaximum consumption rate of cached data for targeted query mix Controller (Service Processor) Bandwidth needed to feed CPU cores (based on targeted query mix) HBAAggregate bandwidth array controllers will deliver SwitchAligned with HBA bandwidth and optimized for sequential IO DisksAggregate bandwidth of array controllers / Database capacity
Disk Subsystem Server NIC Memory Network SQL File Layout HBA
Key takeaway: If tuning for sequential, be careful about I/O weaving
Stripe SizeTransfer SizeRandom SizeSequential 128KB64KB1.2GB/sec2.1GB/sec 128KB512KB1.5GB/sec2.1GB/sec 256KB64KB1.0GB/sec2.1GB/sec 256KB512KB2.0GB/sec2.1GB/sec
Cache Fiber Channel Ports Controllers/Processors Switch HBA Switch PCI Bus Best Practice: Make sure you have the tools to monitor the entire path to the drives. Understand utilization of individual componets Fabric Array
SELECTL_RETURNFLAG, L_LINESTATUS, SUM(L_QUANTITY) AS SUM_QTY, SUM(L_EXTENDEDPRICE) AS SUM_BASE_PRICE, SUM(L_EXTENDEDPRICE*(1-L_DISCOUNT)) AS SUM_DISC_PRICE, SUM(L_EXTENDEDPRICE*(1-L_DISCOUNT)*(1+L_TAX)) AS SUM_CHARGE, AVG(L_QUANTITY) AS AVG_QTY, AVG(L_EXTENDEDPRICE) AS AVG_PRICE, AVG(L_DISCOUNT) AS AVG_DISC, COUNT(*) AS COUNT_ORDER FROMLINEITEM GROUP BYL_RETURNFLAG, L_LINESTATUS ORDER BYL_RETURNFLAG, L_LINESTATUS
File 1 File 2 File 1 File 2 File 1 Extent File 1 Table 1 Table 2 Table 1 1:32 1:38 1:37 1:40 1:39 1:33 1:31 1:35 1:34 1:36
1:32 1:31 1:35 1:34 1:33 1:36 1:38 1:37 1:40 1:39 1:32 1:31 1:35 1:34 1:33 Key Order of Index
ARY01D1v01ARY01D1v01 ARY01D2v02ARY01D2v02 ARY02D1v03ARY02D1v03 ARY02D2v04ARY02D2v04 ARY03D1v05ARY03D1v05 ARY03D2v06ARY03D2v06 ARY04D1v07ARY04D1v07 ARY04D2v08ARY04D2v08 ARY05v09ARY05v09 DB1-1.ndfDB1-1.ndf DB1-7.ndfDB1-7.ndf DB1-5.ndfDB1-5.ndfDB1-3.ndfDB1-3.ndf DB1-2.ndfDB1-2.ndfDB1-4.ndfDB1-4.ndfDB1-6.ndfDB1-6.ndf DB1-8.ndfDB1-8.ndf DB1.ldfDB1.ldf FT Storage Enclosure
ARY01D1v01ARY01D1v01 ARY01D2v02ARY01D2v02 ARY02D1v03ARY02D1v03 ARY02D2v04ARY02D2v04 ARY03D1v05ARY03D1v05 ARY03D2v06ARY03D2v06 ARY04D1v07ARY04D1v07 ARY04D2v08ARY04D2v08 ARY05v09ARY05v09 DB1-1.ndfDB1-1.ndf DB1-7.ndfDB1-7.ndf DB1-5.ndfDB1-5.ndfDB1-3.ndfDB1-3.ndf DB1-2.ndfDB1-2.ndfDB1-4.ndfDB1-4.ndfDB1-6.ndfDB1-6.ndfDB1-8.ndfDB1-8.ndf DB1.ldfDB1.ldf 4MB4MB 4MB4MB 4MB4MB 4MB4MB 4MB4MB4MB4MB 4MB4MB 4MB4MB
Extent Extent Extent 65… …Extent 128 Extent 129
Extent Extent Extent Extent Extent
Extent 1… …Extent 64 Extent 65… …Extent 128 Extent 129
ARY01D1v01ARY01D1v01 ARY01D2v02ARY01D2v02 ARY02D1v03ARY02D1v03 ARY02D2v04ARY02D2v04 ARY03D1v05ARY03D1v05 ARY03D2v06ARY03D2v06 ARY04D1v07ARY04D1v07 ARY04D2v08ARY04D2v08 DB1-1.ndfDB1-1.ndf DB1-7.ndfDB1-7.ndf DB1-5.ndfDB1-5.ndfDB1-3.ndfDB1-3.ndf DB1-2.ndfDB1-2.ndfDB1-4.ndfDB1-4.ndfDB1-6.ndfDB1-6.ndf DB1-8.ndfDB1-8.ndf 4MB4MB 4MB4MB 4MB4MB 4MB4MB 4MB4MB4MB4MB 4MB4MB 4MB4MB