Informatica PowerCenter Performance Tuning Tips

Slides:



Advertisements
Similar presentations
Adam Jorgensen Pragmatic Works Performance Optimization in SQL Server Analysis Services 2008.
Advertisements

4 Oracle Data Integrator First Project – Simple Transformations: One source, one target 3-1.
Physical Database Design CIT alternate keys - named constraints - indexes.
Physical Database Monitoring and Tuning the Operational System.
Transaction log grows unexpectedly
NovaBACKUP 10 xSP Technical Training By: Nathan Fouarge
Intro Informatica Productivity Pack Save Time and Money while Increasing the Quality of Your PowerCenter Deployment Louis Hausle.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 10 Database Performance Tuning and Query Optimization.
CSC271 Database Systems Lecture # 30.
Workflow Manager and General Tuning Tips. Topics to discuss… Working with Workflows Working with Tasks General Tuning Tips.
Physical Database Design & Performance. Optimizing for Query Performance For DBs with high retrieval traffic as compared to maintenance traffic, optimizing.
Chapter 16 Methodology – Physical Database Design for Relational Databases.
Chapter 6 1 © Prentice Hall, 2002 The Physical Design Stage of SDLC (figures 2.4, 2.5 revisited) Project Identification and Selection Project Initiation.
DBMS Implementation Chapter 6.4 V3.0 Napier University Dr Gordon Russell.
10/10/2012ISC239 Isabelle Bichindaritz1 Physical Database Design.
Methodology – Physical Database Design for Relational Databases.
- Joiner Transformation. Introduction ►Transformations help to transform the source data according to the requirements of target system and it ensures.
Physical Database Design Purpose- translate the logical description of data into the technical specifications for storing and retrieving data Goal - create.
Accelerating PHP Applications Ilia Alshanetsky O’Reilly Open Source Convention August 3rd, 2005.
SSIS – Deep Dive Praveen Srivatsa Director, Asthrasoft Consulting Microsoft Regional Director | MVP.
for all Hyperion video tutorial/Training/Certification/Material Essbase Optimization Techniques by Amit.
INCREMENTAL AGGREGATION After you create a session that includes an Aggregator transformation, you can enable the session option, Incremental Aggregation.
Best Practices in Loading Large Datasets Asanka Padmakumara (BSc,MCTS) SQL Server Sri Lanka User Group Meeting Oct 2013.
Aggregator Stage : Definition : Aggregator classifies data rows from a single input link into groups and calculates totals or other aggregate functions.
Diving into Query Execution Plans ED POLLACK AUTOTASK CORPORATION DATABASE OPTIMIZATION ENGINEER.
Emdeon Office Batch Management Services This document provides detailed information on Batch Import Services and other Batch features.
SQL IMPLEMENTATION & ADMINISTRATION Indexing & Views.
Welcome POS Synchronize Concept 08 Sept 2015.
Introduction to Informatica PowerCenter
Indexes By Adrienne Watt.
CHP - 9 File Structures.
Record Storage, File Organization, and Indexes
Large-scale file systems and Map-Reduce
Physical Database Design
Lecture 16: Data Storage Wednesday, November 6, 2006.
Hitting the SQL Server “Go Faster” Button
Tree-Structured Indexes
Advanced QlikView Performance Tuning Techniques
Physical Database Design and Performance
External Sorting Chapter 13
CSE-291 (Cloud Computing) Fall 2016
Database Performance Tuning &
Methodology – Physical Database Design for Relational Databases
Methodology – Monitoring and Tuning the Operational System
Physical Database Design for Relational Databases Step 3 – Step 8
Presented by: Warren Sifre
Database Performance Tuning and Query Optimization
Lecture 11: DMBS Internals
CHAPTER 5: PHYSICAL DATABASE DESIGN AND PERFORMANCE
Introduction to Execution Plans
Power BI Performance …Tips and Techniques.
Hitting the SQL Server “Go Faster” Button
國立臺北科技大學 課程:資料庫系統 fall Chapter 18
Lecture#12: External Sorting (R&G, Ch13)
Sidharth Mishra Dr. T.Y. Lin CS 257 Section 1 MH 222 SJSU - Fall 2016
Steve Hood SimpleSQLServer.com
External Sorting Chapter 13
Selected Topics: External Sorting, Join Algorithms, …
Index Use Cases.
The Physical Design Stage of SDLC (figures 2.4, 2.5 revisited)
Methodology – Monitoring and Tuning the Operational System
Chapter 11 Database Performance Tuning and Query Optimization
Diving into Query Execution Plans
5/7/2019 Map Reduce Map reduce.
Introduction to Execution Plans
Indexes and Performance
Testing & Security Dr. X.
Introduction to Execution Plans
External Sorting Chapter 13
Lecture 20: Representing Data Elements
Presentation transcript:

Informatica PowerCenter Performance Tuning Tips Pittsburgh Informatica User Group March 22nd 2016 Gregory Reynolds reynolds.gregory@gmail.com

Topics General Tips Source Qualifiers Lookups Sorters / Aggregators / Joiners Targets Other Options

General Tips Remove “Dead Weight” Set Proper Attributes on All Ports Filter unneeded records in the source qualifier Turn off or remove unused ports Filter and aggregate as soon as possible Trim strings immediately Set Proper Attributes on All Ports Improper data types require behind the scenes conversions or cause loss of data Incorrect data sizes require conversions and may truncate data unintentionally Watch Your Logs Don’t ignore warnings and non-critical errors. It requires a significant amount of resources to write to a log file Choose terse level logging in production to save space and processing

Source Qualifier Balance Work Effort Local Files Use Built-In Options Consider the time it takes the source to do work instead PowerCenter is faster at many things / DBMS is faster at others Local Files When reading from files it is almost always faster to ship the file to the local server before reading it then it is to try and read it with a remote connection Large data files likely would be better off staged before trying to do any complex transformations Use Built-In Options A sort is usually faster here than anywhere in the mapping Same thing for distinct Choose Deterministic Output if possible

Lookup Unconnected Lookups Cache Type Size Matters Filter The Data Using an unconnected lookup multiple times saves all of the cost associated with building the cache except the first time. Cache Type Static cache is the default for a reason. Best for a large amount of lookup records Disabling the cache sends every lookup to the database as an individual request. Works well for a very small amount of lookup requests versus a large table. Persistent cache allows you to reuse a cache across multiple sessions. Warning: it may require a lot of drive space Dynamic cache keeps records in the lookup up to date automatically as new data is sent to the target Size Matters Too small a cache allotment will force large lookups to go to disk Too large a cache will waste available memory that other transformations could use Filter The Data Source data can be filtered without using an override More records filtered mean less time to retrieve and smaller more efficient caches

Sorter, Aggregator, and Joiner Maximum Cost No data will pass through until every record has been read and sorted Large cache usage Session will fail if cache grows larger than capacity Waste Not Want Not Filter out as many unneeded records as is possible before hand Sorted Input Whenever possible use sorted input to save the majority of processing and cache cost from joiners as well as aggregators. Data will flow in incremental chunks if the input is sorted versus waiting until the end, releasing memory and cache requirements as it does so. Simplified Group Bys Grouping by a simple port may save an aggregator having to complete all the grouping before making any final decisions. Smaller Master Set Designate the master data in a sorter as the side that is expected to have fewer rows. Two large data flows will not join as quickly

Targets Rejected Rows Update Else Insert Commit Not… Bulk Loading Rejecting rows from the target database will slow down normal operations by potentially several seconds per record! Look out for unexpected NULL values, bad dates, and invalid keys. Update Else Insert With this option selected, every insert is approximately twice as slow as an update Other options include using a lookup to determine the appropriate action before hand Commit Not… Setting too low a threshold for database commits will require transactions to wait more often while the commit is completing. Setting it too high will cause the database to use an increasing amount of temp space and may cause it to slow down transaction speeds Bulk Loading Bulk loading is a fast “cheat” to be able to push large amount of data into some types of databases at high speed but there can’t any indices on the target table

Other Tuning Options System Competition Workflow Parallelization Session Partitioning Push-down Optimization Data Partitions Buffer Size Indices and Primary Keys System Competition Redesign slow mappings to find a better way