Informatica & ETL Testing

Informatica & ETL Testing Rahul Parashar Confidential © 2008 Wipro Ltd

A case study on testing business requirements at the Informatica workflow level by manipulating files in a Unix environment.

Preface In projects where Informatica (or any other tool) is used to perform data migration, the migration is driven by certain business rules. This case study shows how to validate some of these rules by manipulating the files on the Unix server, which lets testers validate the business requirements at the Informatica workflow level. When the Informatica server is installed on Unix, performing such validations becomes especially important.

Validations ETL projects involve certain standard validations: a record count check, a duplicate data check, and a data check in the target file. We usually have to write SQL queries to test these validations in every project, which can be a tedious job. Because these operations are generic to all ETL projects, we can save time and make testing at the Informatica workflow level more effective by performing certain steps at the Unix level, as discussed in this presentation.
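As a rough illustration of the record count and duplicate data checks named above, here is a minimal Python sketch. It assumes that every non-empty line of the file is one record; the function names and the sample rows are illustrative and do not come from the original presentation.

```python
from collections import Counter


def record_count(lines):
    """Count the non-empty lines, treating each one as a record."""
    return sum(1 for line in lines if line.strip())


def duplicate_records(lines):
    """Return the records that occur more than once, in first-seen order."""
    counts = Counter(line.strip() for line in lines if line.strip())
    return [record for record, seen in counts.items() if seen > 1]


# Illustrative data only.
sample = ["101,alice", "102,bob", "101,alice", "103,carol"]
print(record_count(sample))       # 4
print(duplicate_records(sample))  # ['101,alice']
```

The same two checks are often written as SQL COUNT and GROUP BY queries against the target table; the file-level version simply lets the comparison be made before the workflow runs.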

Sample Business Requirements These are some sample business requirements that we normally come across in data migration projects:
1. The source file should have a two-line header.
2. The source file should have the count of records in the trailer.
3. The source file should not have more than 50% of the records available in the file for the previous business date (max count).
4. The file taken as input should have a minimum of 10 records (min count).
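The four requirements above can be expressed as one small check, sketched here in Python. Several details are assumptions made only for illustration: header lines are taken to start with a hypothetical "HDR" prefix, the trailer is taken to be the last line and to hold nothing but the record count, and the max count rule is read literally (the record count may not exceed 50% of the previous business date's count). The real rules live in the Informatica mapping and may differ.

```python
def validate_source_file(lines, previous_count, header_prefix="HDR",
                         header_lines=2, min_count=10, max_ratio=0.5):
    """Return the names of the failed checks; an empty list means the file passes."""
    rows = [line.strip() for line in lines if line.strip()]
    failures = []

    # Requirement 1: exactly two header lines, both at the top of the file.
    # The "HDR" prefix is a hypothetical convention, not from the source.
    headers = [row for row in rows if row.startswith(header_prefix)]
    if len(headers) != header_lines or rows[:header_lines] != headers:
        failures.append("header")

    # Assumption: the last non-header line is the trailer, the rest are records.
    body = [row for row in rows if not row.startswith(header_prefix)]
    trailer = body[-1] if body else ""
    records = body[:-1]

    # Requirement 2: the trailer holds the record count.
    if not trailer.isdecimal() or int(trailer) != len(records):
        failures.append("trailer")

    # Requirement 3 (literal reading): at most 50% of the previous day's count.
    if len(records) > max_ratio * previous_count:
        failures.append("max count")

    # Requirement 4: at least 10 records.
    if len(records) < min_count:
        failures.append("min count")

    return failures


# Illustrative file: two header lines, ten records, trailer "10".
good_file = (["HDR,customer_feed", "HDR,2008-01-31"]
             + [f"{i},row" for i in range(10)]
             + ["10"])
print(validate_source_file(good_file, previous_count=40))      # []
print(validate_source_file(good_file[1:], previous_count=40))  # ['header']
```

Returning every failed check, rather than stopping at the first one, makes it easier to compare the result with the step that fails in the workflow.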

Validation Procedures There are two ways of testing the above requirements:
A. FTP the file to a local drive, make the necessary changes, upload the file again, and run the Informatica mapping.
B. Using the VI editor, open and edit the file on the Unix server itself.

FTP Method Go to the source location of the file using the Run command.

Get the file onto the local drive H:\. Before you do so, make sure that you have all the permissions needed to manipulate the file; otherwise a 0-byte file will be transferred to your local drive.

Once the file is on the H:\ drive, open it and:
1. Remove the header of the file.
2. Save the file.
3. FTP the file to your local directory on the FTP server (using the ‘put’ command).
4. Copy the same file into the Informatica source files location.
5. Run the Informatica workflow.

The trailer check and the record count checks can be validated in the same way, by modifying the file and uploading it to the source location using the FTP commands.

Using VI Editor VI is a file editor used in Unix to manipulate files. To manipulate the data in a file we must have all the access rights to it. First copy the file to your local directory on the Unix server using the command cp sample.csv /home/local/ . Once the file is available in the local directory, check which permissions you have on it. If you don't have all the access rights, change the permissions to RWX using the chmod command. Then open the file in the VI editor using the command: vi filename
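The copy-then-check-permissions step can also be scripted. The following Python sketch mirrors the cp and chmod commands described above; the helper name make_working_copy is hypothetical, and the demo runs in a temporary directory rather than in /home/local/ so that no real file is touched.

```python
import os
import shutil
import stat
import tempfile


def make_working_copy(source_path, work_dir):
    """Copy the file into work_dir and grant the owner read/write/execute."""
    os.makedirs(work_dir, exist_ok=True)
    copy_path = shutil.copy(source_path, work_dir)        # like: cp sample.csv work_dir/
    current_mode = stat.S_IMODE(os.stat(copy_path).st_mode)
    os.chmod(copy_path, current_mode | stat.S_IRWXU)      # like: chmod u+rwx
    return copy_path


# Demo in a throwaway directory; the file content is illustrative.
base = tempfile.mkdtemp()
source = os.path.join(base, "sample.csv")
with open(source, "w") as handle:
    handle.write("header line\n")

working_copy = make_working_copy(source, os.path.join(base, "local"))
print(working_copy)
print(os.access(working_copy, os.R_OK | os.W_OK))
```

Only the owner's permission bits are widened here; whether group or other users also need access depends on the account under which the Informatica server reads the file.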

Remove the header with the ‘dd’ command in the VI editor (each ‘dd’ deletes the current line), save the changes, put the file back into the Informatica source location, and run the workflow. The workflow should fail at the step where header validation is done. Similarly, we can validate the trailer requirement by removing the trailer, and the other two business requirements (max count and min count) by changing the number of records in the file, for example by copying data rows.
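When the same edits have to be repeated for every test cycle, they can be scripted instead of typed in the editor. The sketch below shows the three manipulations described above as small Python helpers that work on a list of lines. It assumes a two-line header and a single trailer line at the end of the file; all names and sample rows are illustrative.

```python
def remove_header(lines, header_lines=2):
    """Drop the leading header lines (the scripted equivalent of 'dd' on each)."""
    return lines[header_lines:]


def remove_trailer(lines):
    """Drop the last line, assumed to be the trailer."""
    return lines[:-1]


def inflate_records(lines, copies, header_lines=2):
    """Repeat the data records `copies` times, leaving header and trailer as they are."""
    header = lines[:header_lines]
    records = lines[header_lines:-1]
    trailer = lines[-1:]
    return header + records * copies + trailer


# Illustrative file: two header lines, two records, trailer "2".
original = ["HDR,customer_feed", "HDR,2008-01-31", "101,alice", "102,bob", "2"]
print(remove_header(original))
print(remove_trailer(original))
print(len(inflate_records(original, copies=3)))
```

Note that inflate_records leaves the trailer count unchanged, so the inflated file would also trip a trailer check. If only the max count rule is under test, the trailer would need to be updated to match the new record count.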

Validation Results For the first business requirement, that the file has a two-line header, the following validations can be performed:
1. Keep the two-line header and run the workflow; it should run successfully.
2. Change the header of the file by inserting some junk values; the workflow should fail.
3. Remove both header lines and run the workflow; it should fail at the step where header validation is done.
4. Remove one of the header lines and run the workflow; the workflow should fail.
These are a few of the validations that can be performed to make sure the business rules are implemented properly in the mapping.

Validation Results Similarly, for the second requirement, where we have to check the trailer record, we can perform the following validations:
1. Remove the trailer and run the workflow; the workflow should fail at the step where trailer validation is performed.
2. Change the trailer value to some junk value (other than the total number of records); the mapping should fail.
3. Keep the correct trailer record; the mapping should pass the trailer check step.
4. Insert some alphanumeric values in the trailer record; the mapping should fail.
The other business requirements can be validated in the same way, by changing the file using either the VI editor or the FTP commands.
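The trailer cases listed above can be kept together as a small scenario table, so that each file variant is paired with the outcome the workflow is expected to produce. This is only a sketch: it assumes the trailer is the last line and holds nothing but an integer count, and the scenario names are made up for the example.

```python
def trailer_scenarios(lines):
    """Map a scenario name to (modified lines, expected workflow outcome)."""
    body, trailer = lines[:-1], lines[-1]
    wrong_count = str(int(trailer) + 1)  # any value other than the real count
    return {
        "correct trailer": (body + [trailer], "pass"),
        "trailer removed": (body, "fail"),
        "wrong count": (body + [wrong_count], "fail"),
        "alphanumeric trailer": (body + [trailer + "abc"], "fail"),
    }


# Illustrative file: two header lines, two records, trailer "2".
original = ["HDR,customer_feed", "HDR,2008-01-31", "101,alice", "102,bob", "2"]
for name, (variant, expected) in trailer_scenarios(original).items():
    print(f"{name}: last line {variant[-1]!r}, expected {expected}")
```

Each variant would still be written out and placed in the Informatica source location by hand or by script; the table only records what to expect in the Workflow Monitor.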

Sample Mapping This is the sample mapping in Informatica for header, trailer and count validation. When header validation fails, the corresponding step in the mapping should fail.

Informatica steps For the trailer, max count and min count validations, we have the following mappings.

Informatica View If the workflow succeeds, the following is displayed in the Workflow Monitor.

Informatica Workflow Monitor View If the header check fails, the following is displayed in the Workflow Monitor.

Important points
1. We can verify which step in the Informatica workflow has failed by checking it in the Workflow Monitor.
2. All business rules for the files to be migrated using Informatica (or any other ETL tool) can be verified by following this approach.
3. This approach allows the tester to validate the business requirements at the workflow level.
4. These validations are specific to the files that are to be migrated using the ETL tool, and are performed before the migration process.

Points to remember There are certain points that must be kept in mind while performing operations on a file in Unix:
1. Always keep a backup of the file on your local drive.
2. Always make sure that the file you are working on is the one taken as the input for the workflow being tested.
3. Before manipulating a file in the VI editor, make a copy of it (for example sample_backup.csv) in your directory.
4. Always keep in mind whether the target table uses a truncate load or a progression load.
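The backup step is easy to forget, so it can be worth scripting. Here is a minimal Python sketch that derives the sample_backup.csv style name mentioned above and copies the file before any edit is made. The helper names are hypothetical, and the demo uses a temporary directory.

```python
import os
import shutil
import tempfile


def backup_path(path):
    """Insert '_backup' before the extension: sample.csv -> sample_backup.csv."""
    root, extension = os.path.splitext(path)
    return f"{root}_backup{extension}"


def make_backup(path):
    """Copy the file next to itself under its backup name and return that name."""
    target = backup_path(path)
    shutil.copy2(path, target)  # copy2 also preserves timestamps where possible
    return target


# Demo in a throwaway directory; the file content is illustrative.
folder = tempfile.mkdtemp()
sample_file = os.path.join(folder, "sample.csv")
with open(sample_file, "w") as handle:
    handle.write("101,alice\n")

backup_file = make_backup(sample_file)
print(os.path.basename(backup_file))  # sample_backup.csv
```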

Thank You