SSIS – Deep Dive Praveen Srivatsa Director, Asthrasoft Consulting Microsoft Regional Director | MVP.

Slides:



Advertisements
Similar presentations
© 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered.
Advertisements

© 2012 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or.
Feature: Identity Management - Login © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or.
© 2012 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or.
© 2012 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or.
© 2010 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered.
© 2012 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or.
Feature: Reprint Outstanding Transactions Report © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product.
Data Management Conference ETL In SQL Server 2008 Allan Mitchell London September 29th.
Feature: Purchase Requisitions - Requester © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names.
Dos and don’ts of Columnstore indexes The basis of xVelocity in-memory technology What’s it all about The compression methods (RLE / Dictionary encoding)
MIX 09 4/15/ :14 PM © 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered.
Feature: Payroll and HR Enhancements © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or.
SQL Server Integration Services Deep Dive Warren Stevens-Baytopp BI Architect - GijimaAst
Windows 7 Training Microsoft Confidential. Windows ® 7 Compatibility Version Checking.
Feature: Purchase Order Prepayments II © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are.
© 2012 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or.
Feature: OLE Notes Migration Utility
Feature: Web Client Keyboard Shortcuts © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are.
Feature: SmartList Usability Enhancements © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names.
Session 1.
Built by Developers for Developers…. © 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names.
© 2012 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or.
Feature: Assign an Item to Multiple Sites © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names.
Business Solutions Using Microsoft ® Office SharePoint ® Server ROADSHOW.
© 2012 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or.
Feature: Print Remaining Documents © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or.
Connect with life Connect with life
NEXT: Overview – Sharing skills & code.
demo Receive Inventory Export Parse and Normalize.
A Windows Azure application runs multiple instances of each role A Windows Azure application behaves correctly when.
© 2012 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or.
Feature: Document Attachment –Replace OLE Notes © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product.
Feature: Suggested Item Enhancements – Sales Script and Additional Information © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows.
Feature: Customer Combiner and Modifier © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are.
Feature: Employee Self Service Timecard Entry © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names.
SQL Server SQL Azure Visual Studio“Quadrant” SQL Server Modeling Services Entity Framework ADO.NET“M”/EDM Data Services …
© 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or.
demo Instance AInstance B Read “7” Write “8”

customer.
demo © 2008 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names.
demo Demo.
Feature: Void Historical/Open Transaction Updates © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product.
demo QueryForeign KeyInstance /sm:body()/x:Order/x:Delivery/y:TrackingId1Z
Feature: Suggested Item Enhancements – Analysis and Assignment © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and.
Windows Azure SQL Data Sync Name Title Microsoft Corporation.
projekt202 © 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are.
Introduction to Microsoft Dynamics GP 2013 Module 2:Basic Elements.
© 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks.
© 2008 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or.

IoCompleteRequest (Irp);... p = NULL; …f(p);
John Tran Business Program Manager, The Suddath Companies
SQL Server Integration Services
Возможности Excel 2010, о которых следует знать
Entity Based Staging SQL Server 2012 Tyler Graham
Business Intelligence for Project Server/Online
ISC440: Web Programming 2 Server-side Scripting PHP 3
Performance Tuning SSIS
Title of Presentation 12/2/2018 3:48 PM
From DTS to SSIS, Redesign or Upgrade
Feature: Document Attachment - Flow from Master Records
8/04/2019 9:13 PM © 2006 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered.
Виктор Хаджийски Катедра “Металургия на желязото и металолеене”
5/8/2019 3:20 AM bQuery-Tool 3.0 A new and elegant way to create queries and ad-hoc reports on your Baan/Infor ERP LN data. This Baan session is a query.
Шитманов Дархан Қаражанұлы Тарих пәнінің
Title of Presentation 5/24/2019 1:26 PM
Presentation transcript:

SSIS – Deep Dive Praveen Srivatsa Director, Asthrasoft Consulting Microsoft Regional Director | MVP

Merge data from heterogeneous data stores: Text files Mainframes Spreadsheets Multiple RDBMS Refresh data in data warehouses and data marts Cleanse data before loading to remove errors High-speed load of data into online transaction processing (OLTP) and online analytical processing (OLAP) databases Build BI into a data transformation process without the need for redundant staging environments Lots more Integration Services Why ETL Matters

 Data sources can be diverse, including custom or scripted adapters  Transformation components shape and modify data in many ways.  Data is routed by rules or error conditions for cleansing and conforming.  Flows can be as complex as your business rules, but highly concurrent.  And finally data can be loaded in parallel to many varied destinations. SSIS Overview

Data Sources –Excel Common problem - not all data coming through correctly By Default Excel will determine the column types based on a “Majority Type” rule. Overcome this by forcing a type in the Data connector

Data Sources Continued Data Sources –Verifying Connectivity / Availability ETL Tasks run through some of the steps and then fail on connectivity issues. Why Would you want to check for this? Use scripting task.

Scripting

Data Sources Continued Data Sources –OLE DB Provider Selecting Table or View dropdown as a source. So what is the problem with this? Replace with what –Select * from [TABLENAME] – not much better or is it? –Select [field list] from [TABLENAME] – resource usage

If a table is selected –SSIS issues an OPENROWSET If a SQL statement is used –SSIS issues sp_executesql. OLE DB Provider

Sourcing Data Common Requirement –Get all Data from one table that does not exist in another Get all rows from a staging table where the business key is not in the dimension table Conventional T-SQL Using SSIS

INSERT INTO DIM_DATE SELECT s.* FROM STG_DATE s LEFT OUTER JOIN DIM_DATE d ON s.DateID = d.DateID WHERE d.DateID IS NULL INSERT INTO DIM_DATE SELECT s.* FROM STG_DATE s WHERE DateID NOT IN (SELECT DISTINCT DateID FROM DIM_DATE d) Sourcing Data Conventional T-SQL

Sourcing Data Using SSIS Merge Join –Same as first T-SQL Statement –Requires a Sort and Conditional Split Lookup –Using the SSIS functionality. –Less Coding –Uses the error output as the valid records. Speed Comparisons

Lookups Exact Matching Want data that matches a specific field. –Normal usage of Lookup Range Comparisons Want data that falls between 2 values –The Caching SQL Statement –Mapping of Parameters

High-end 32 bit performance The test included parsing a text file and passing through 7 transformations The graph shows throughput of 17Gb/Hr Parallelism is one of the keys to performance on 32-bit.

What does 64-bit enable? Increased memory capacity enables very high performance components –But some Integration Services components benefit more than others High performance Integration Services components enable new warehouse architectures –Especially load balancing between the integration process and the warehouse server

Components in the data flow Some components work with data row by row –Calculating new columns –Converting data –Character conversions –Look-up joins to reference tables These benefit more from parallelism than memory

Components in the data flow Some components need to work with the entire data set –Aggregation –Sorting –Fuzzy (best match) Lookups and Deduplication These benefit from increased memory –64 bit enables potentially huge data sets to be worked on in memory

An example 64-bit benefit: Lookups Lookups involve matching incoming values to a reference database 3 types of lookups: –Cached – the reference set is cached in memory –Partial cache – the cache is built up as lookups are found –No cache – every lookup needs a roundtrip to the reference database 32-bit cache size is memory-constrained 64-bit cache size could cache the largest reference sets

Enabling new architectures … In this traditional scenario, the integration process simply conforms data and loads the database server The database performs aggregations, sorting and other operations … … but has to contend with competing demands for resources from user queries This solution does not scale to very large volumes of data and multiple, complex aggregations Traditional warehouse loading

Enabling new architectures … Here, SQL Server Integration Services conforms the data as before … … but also aggregates and sorts, and loads the database This frees-up the database server for user queries With 64-bit this solution scales well to very large volumes of data and multiple, complex aggregations Even with 32 bit, this architecture can be scaled-out to use a separate box for the integration process Warehouse loading with SQL Server Integration Services

Customer benefits of SSIS Performance –Data flows process large volumes of data efficiently - even through complex operations Facility –Many prebuilt adapters and transformations reduce hand coding –Extensible object model enables specialized custom or scripted components –Highly productive visual environment speeds development and debugging “Smarts” –Data cleansing features enable difficult data to be handled during loading –Data mining brings intelligent handling of data for imputation of incomplete data, conditional processing of potential problems, or smart escalation of issues such as fraud detection

Some Performance Tuning Tips Only Select the Columns that you need Use a SQL Server Destination instead of an OLE DB Destination If using an OLE DB Destination – use the table or view with fast load data access mode. Use standardized naming conventions Where possible, filter your data in the Source Adapter rather than using a Conditional Split transform component LOOKUP components will generally work quicker than MERGE JOIN components where the 2 can be used for the same task Use caching in your LOOKUP components where possible. It makes them quicker. Just watch that you are not grabbing too many resources. Use Sequence containers to organize package structure into logical units of work.

© 2011 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.