PDF AUTOMATION Pro The third eye for all your PDF Automation needs

Slides:



Advertisements
Similar presentations
XP New Perspectives on Microsoft Office Word 2003 Tutorial 7 1 Microsoft Office Word 2003 Tutorial 7 – Collaborating With Others and Creating Web Pages.
Advertisements

Chapter 3 – Web Design Tables & Page Layout
WEB DESIGN TABLES, PAGE LAYOUT AND FORMS. Page Layout Page Layout is an important part of web design Why do you think your page layout is important?
Working with Tables for Page Design – Lesson 41 Working with Tables for Page Design Lesson 4.
BCIS IB, (Test 1) Excel Lessons 1, 2 and 3 Press Space bar to Advance Frame.
Word Lesson 11 Customizing Tables and Creating Charts Microsoft Office 2010 Advanced Cable / Morrison 1.
Copyright 2003 Peter McDevitt 1 Microsoft Excel 2002 Lecture 3 – A Professional Looking Worksheet.
Site Modules > Page Builder Access the Page Builder module through the Site Modules top navigation link. Access Page Builder from the Site Modules navigation.
Intermediate Level Course. Text Format The text styles, bold, italics, underlining, superscript and subscript, can be easily added to selected text. Text.
CMPF 112 : COMPUTING SKILLS FOR ENGINEERS MICROSOFT OFFICE SUITE: WORD.
Pasewark & Pasewark 1 Excel Lesson 2 Changing the Appearance of a Worksheet Microsoft Office 2007: Introductory.
® Microsoft Office 2010 Excel Tutorial 2: Formatting a Workbook.
Tutorial 5: Working with Excel Tables, PivotTables, and PivotCharts
XP New Perspectives on Microsoft Office Excel 2003, Second Edition- Tutorial 3 1 Microsoft Office Excel 2003 Tutorial 3 – Developing a Professional- Looking.
XHTML1 Tables and Lists. XHTML2 Objectives In this chapter, you will: Create basic tables Structure tables Format tables Create lists.
XP Creating Web Pages with HTML Using Tables. XP Objectives Create a text table Create a table using the,, and tags Create table headers and captions.
Cognizant Reusable Automation Framework for Testing C.R.A.F.T.
Quark QuarkXPress 4 Intermediate Level Course. Working with Master Pages The Document Layout Palette allows you to add, delete, and move document and.
HTML Tables and Forms Creating Web Pages with HTML CIS 133 Web Programming Concepts 1.
Cognizant Reusable Automation Framework for Testing C.R.A.F.T.
With Alex Conger – President of Webmajik.com FrontPage 2002 Level I (Intro & Training) FrontPage 2002 Level I (Intro & Training)
CST JavaScript Validating Form Data with JavaScript.
CHAPTER 14 Formatting a Workbook Part 1. Learning Objectives Format text, numbers, dates, and time Format cells and ranges CMPTR Chapter 14: Formatting.
Lesson 5: Using Tables.
Developing Effective Reports
_______________________________________________________________________________________________________________ E-Commerce: Fundamentals and Applications1.
Chapter 5 Java Script And Forms JavaScript, Third Edition.
Chapter 6: Forms JavaScript - Introductory. Previewing the Product Registration Form.
XHTML Introductory1 Forms Chapter 7. XHTML Introductory2 Objectives In this chapter, you will: Study elements Learn about input fields Use the element.
What is Sure BDCs? BDC stands for Batch Data Communication and is also known as Batch Input. It is a technique for mass input of data into SAP by simulating.
Thank You ©2012, Cognizant. Rapido has been created by the Research and Development team from QE&A Technology CoE Rapido is continuously enhanced and.
Microsoft Office Excel 2003 Tutorial 3 – Developing a Professional-Looking Worksheet.
IC 3 BASICS, Internet and Computing Core Certification Key Applications Lesson 10 Creating and Formatting an Excel Worksheet.
Tutorial 7 Creating Forms. Objectives Session 7.1 – Create an HTML form – Insert fields for text – Add labels for form elements – Create radio buttons.
WORKBOOK FORMATTING Nolan Tomboulian Tomboulian.wikispaces.com HOW THINGS LOOK CELL COLORFONT COLOR CELL BORDERSFONT SIZE CELL SIZEFONT.
1 Lesson 18 Organizing and Enhancing Worksheets Computer Literacy BASICS: A Comprehensive Guide to IC 3, 3 rd Edition Morrison / Wells.
© 2008 The McGraw-Hill Companies, Inc. All rights reserved. ACCESS 2007 M I C R O S O F T ® THE PROFESSIONAL APPROACH S E R I E S Lesson 7 – Adding and.
JavaScript, Fourth Edition Chapter 5 Validating Form Data with JavaScript.
McGraw-Hill/Irwin The O’Leary Series © 2002 The McGraw-Hill Companies, Inc. All rights reserved. Microsoft Access 2002 Lab 5 Creating Custom Forms.
In Business Series © Prentice Hall Microsoft Office Word 2007 In Business Core Chapter 3 Word Document Enhancements.
| ©2009, Cognizant Technology SolutionsConfidential All rights reserved. The information contained herein is subject to change without notice. ©2009, Cognizant.
IC 3 BASICS, Internet and Computing Core Certification Key Applications Lesson 11 Organizing the Worksheet.
Excel – Lesson 2 Excel Lesson 2 Changing the Appearance of a Worksheet Microsoft Office 2007: Introductory Pasewark & Pasewark 1.
Computer Literacy BASICS: A Comprehensive Guide to IC 3, 5 th Edition Lesson 19 Organizing and Enhancing Worksheets 1 Morrison / Wells / Ruffolo.
LEARNING HTML PowerPoint #1 Cyrus Saadat, Webmaster.
© 2008 The McGraw-Hill Companies, Inc. All rights reserved. WORD 2007 M I C R O S O F T ® THE PROFESSIONAL APPROACH S E R I E S Lesson 15 Advanced Tables.
1 Lesson 13 Organizing and Enhancing Worksheets Computer Literacy BASICS: A Comprehensive Guide to IC 3, 3 rd Edition Morrison / Wells.
HTML(Hyper Text Markup Language) ByNaveen. Introduction HTML or Hyper Text Markup Language is the standard markup language Its used to create the web.
Lesson 6 Formatting Cells and Ranges. Objectives:  Insert and delete cells  Manually format cell contents  Copy cell formatting with the Format Painter.
Introduction to HTML. _______________________________________________________________________________________________________________ 2 Outline Key issues.
8 Chapter Eight Server-side Scripts. 8 Chapter Objectives Create dynamic Web pages that retrieve and display database data using Active Server Pages Process.
Microsoft® Access Generate forms quickly 1 Modify controls in Layout View 2 Work with form sections 3 Modify controls in Design View 4 Add calculated.
IE 411/511: Visual Programming for Industrial Applications Lecture Notes #2 Introduction to the Visual Basic Express 2010 Integrated Development Environment.
XP New Perspectives on Macromedia Dreamweaver MX 2004 Tutorial 5 1 Adding Shared Site Elements.
Microsoft ® Excel ® 2013 Enhanced Tutorial 5: Working with Excel Tables, PivotTables, and PivotCharts.
1 Word Processing Intermediate Using Microsoft Office 2000.
GCE Software Systems Development
Creating Oracle Business Intelligence Interactive Dashboards
Tutorial 5: Working with Excel Tables, PivotTables, and PivotCharts
Creating Accessible PDFs from Word Docs
Word Processing.
DB Implementation: MS Access Forms
Microsoft Excel All editions of Microsoft office.
Formatting a Workbook Part 1
DREAMWEAVER MX 2004 Chapter 3 Working with Tables
DB Implementation: MS Access Forms
Lesson 19 Organizing and Enhancing Worksheets
Word Processing Software Photo credit: © 2007 JupiterImagesCorporation.
Title of presentation | Presentation by [Enter details in 'Header & Footer' field 18/05/2019.
Presentation transcript:

PDF AUTOMATION Pro The third eye for all your PDF Automation needs SICPA Test Automation Consulting Proposal

Introduction to PDF Automation Pro What? Cognizant’s solution for test automation of PDF documents, consisting of a suite of 3 tools Designed to address most aspects of PDF automation such as comparison of similar documents, extraction of specific data from a document, automating an interactive form, etc. Supports integration with most of the functional testing tools in the market Why? PDF Automation Pro helps to significantly reduce the manual effort required for PDF automation PDF Automation Pro eliminates any manual errors which might creep in, especially for large documents PDF Automation Pro fits perfectly into the existing test infrastructure and enables integration with end-to-end tests Introduction to PDF Automation Pro Who? PDF Automation Pro has been created by the Research and Development team from Cognizant’s Automation Centre of Excellence PDF Automation Pro is continuously enhanced and updated by the R&D team, based on feedback from the end users of the tool PDF Automation Pro has a dedicated helpdesk to assist end users with implementation and troubleshooting

Overview of PDF Automation Pro PDF Probe PDF Assist PDF PerFORM Provides a solution for automation of PDF interactive forms. Supports filling up empty forms as well as extracting data from filled-in forms Provides a solution for comparing PDF documents and reporting the differences, if any Also supports comparison of a PDF document with an MS Word document Provides a solution to extract specific content from within a PDF document based on user defined criteria. The extracted content can subsequently be validated against an expected result for testing purposes. Overview of PDF Automation Pro Core Features Each of these tools comes with a simple and user friendly GUI which can be directly used to automate the PDF documents as required. In addition, all the 3 tools expose APIs which enable easy integration with most of the functional automation tools in the market. A handy code generator is included with all the tools, which automatically generates the API calls required to automate the PDF documents as required. These code generators support multiple languages including VBScript, C#, VB.NET and Java, and generate code which is consistent with Cognizant’s accepted standards and conventions.

Overview of PDF Probe Highlights SRC DOCUMENT COMPARATOR TRG Comparison Features Textual content comparison Font size comparison Font family comparison Font style comparison (Bold and Italics) Font colour comparison Line spacing comparison Whitespace comparison Special Features Ability to compare a specified range of pages Batch comparison of multiple document sets Batch comparison of multiple documents against a specified template Provision to ignore the case (uppercase/lowercase) while comparing Supports comparison of multi-column text, tables, header and footer Supports comparison of password protected documents Comparison Reports Visual report in HTML format Detailed report in Excel format (optional) Both reports contain a high level summary, as well as corresponding performance statistics Highlights

Overview of PDF Assist Highlights Text Extraction features SEARCH VALIDATE APPLICATION PROGRAMMING INTERFACE Text Extraction features Get the occurrence count of a specified word Get the word next to a given search key Get the text in between two specified words Get the hash value for a given key based on a specified delimiter (for key-value pairs separated by a delimiter such as “:”) Get the metadata of a given word, including font name, colour, width, etc. Get the document metadata , including PDF Author,PDF title,PDF producer etc. Special features Enables fine-tuning the content extraction with features such as limiting the search to a specified range of pages, case sensitive searching, etc. Overview of PDF Assist Highlights Supports searching within tables as well as document headers/footers Supports extracting content from password protected documents Image Extraction features Extract the specified image from the document Get the metadata of a specified image, including the position, dimensions, and pixel-by-pixel data UI features Clearly displays the description, input parameters and return values for the API selected Validates the user inputs to ensure that they are within acceptable boundaries

Overview of PDF PerFORM APPLICATION PROGRAMMING INTERFACE FILL VALIDATE EXTRACT Overview of PDF PerFORM Form filling features Get the complete list of form fields from the document loaded Select specific fields to be filled in – this includes all types of fields such as textboxes, checkboxes, radio buttons, etc. Specify appropriate values for the selected fields Fill the specified values and save the filled form into a specified location Form values extraction/validation features Get the complete list of form fields from the document loaded Select specific fields whose values are to be extracted – this includes all types of fields such as textboxes, checkboxes, radio buttons, etc. If required, specify the expected values for the selected fields Extract the values from the fields specified Compare the extracted values with the expected results (if specified), and report any differences found Highlights

PDF Probe Comparison with other tools DiffPDF DiffDoc Adobe Acrobat Pro i-net PDFC PDF Probe Textual content comparison (including headers, footers, tables, multi-column text, etc.) Comparison of metadata such as font color, font family, font size, font style, etc. Comparison of images Partially possible, using the "Compare Appearance" mode Integration with functional automation tools Execution can be triggered using the command line, but the comparison results cannot be retrieved and reported from the automation tool No API or command line execution possible to enable integrations with functional automation tools A Java API is provided, which enables integration with any Java based automation tool; this can be used for continuous integration as well. Apart from this, a command line option is also available, however the comparison results cannot be retrieved and reported from the automation tool in this case. Yes, the API provided enables integration with most of the automation tools. This can be used for continuous integration as well. The API calls are automatically generated by the tool. Support for password protected documents Provision for bulk comparison and template comparison Visual report highlighting the differences Detailed report documenting the differences Compare MS Word with PDF Licensing Open Source Licensed Priced PDF Probe Comparison with other tools

Comparison with other tools (contd.) Document content extraction tools: There are many tools which enable extraction of content from a PDF document However, such tools provide only basic features such as extracting all the text from the document or from a specific page None of the tools provide the range of search criteria as provided by PDF Assist, which helps to really zero in on the exact content required to be extracted from the document To sum up, PDF Assist is probably the most advanced tool in the PDF content extraction space Interactive PDF forms automation tools: There are many APIs available which enable the automation of PDF forms by writing appropriate scripts Adobe has also released its Adobe Test Toolkit to cater to this requirement, however, the tool has not really matured yet The USP of PDF PerFORM in this space is the code generation facility, as well as the ability to directly fill an empty form or extract content from a filled-in form through the GUI provided Comparison with other tools (contd.)

Appendix

Limitations of PDF Automation Pro General: Documents created by non-standard PDF writers may not be processed properly. If a single word or a single line contains multiple font faces, the results may be unexpected. The time taken to load the document for processing is directly proportional to the size of the document. Large documents may take a long time to load. PDF Probe: Images cannot be compared. The recommended approach here is to use PDF Assist to extract the required images and use any available image comparison algorithms. Values in form fields like checkboxes, radio buttons, etc. cannot be compared, and the presence of such fields may affect the accuracy of the comparison. When images are present in the document, the line spacing comparison might be affected. The comparison may be inaccurate if there are significant differences with respect to margin and line spacing between the documents. Split sections within documents are supported; however, the word wrapping must be similar across the source and target documents. Limitations of PDF Automation Pro

Limitations of PDF Automation Pro PDF Probe (contd.): Word documents with tables in headers cannot be compared. If there is a text content deviation together with any other deviation like font size, color, etc., only the text deviation will be highlighted in the tool’s HTML report. The Excel report, however, will capture all the differences. Documents may not be compared properly if the font size of the words in the document is too small Border lines ,underline, table borders may not be displayed in html report The comparison may be inaccurate if the same content of source scattered in different position(page) of target document. PDF Probe does not support Page range for WORD-PDF. WORD- PDF comparison’s performance is slower than PDF-PDF comparison. Based on the coordinates retrieved by third party tool(Used internally for retrieving the PDF content), the html report are generated. Therefore html report accuracy it depends on the quality of PDF. Word Document with Image can give unexpected results Tool will read the content line by line even though it is a table. It won’t read the values cell by cell or column by column. Therefore if you find any text deviation in a line together with any other deviation like font size, color, etc., only the text deviation will be highlighted in the tool’s HTML report and in Excel report Limitations of PDF Automation Pro

Limitations of PDF Automation Pro PDF Assist: General Images accessed using PDF Assist reflect the properties of the original image file, even if some of these properties may have changed while embedding it into the document. For example: The image may have been resized within the document, but PDF Assist will return only the original size of the image. The image may have been rotated by some angle while placing it into the document, but PDF Assist will return the original orientation of the image. In rare cases, words in upper case may be wrongly perceived by PDF Assist to be lower case. API Values in form fields like checkboxes and special characters cannot be extracted. Split sections within documents are supported; however, the following points must be taken into consideration: PDF Assist considers each line of text as one cutting across all the sections. In some documents, the split sections may not be aligned equally on the horizontal plane, causing PDF Assist to read each of the sectioned portions as a separate line. In some cases, images in “.tiff” format may be recognized as “.png” images. Though the API supports Java, it is not possible to use the API in platforms other than Windows. Limitations of PDF Automation Pro

Limitations of PDF Automation Pro UI Certain documents may not load properly in the UI; however, this will not affect the working of the API. For example: If there is any text overlapping on top of an image, it may not be rendered properly. If there is any text which is aligned vertically in the document, it will be rendered horizontally within the UI. For API functions which return an array, the UI generates code only for the first element in the array. This code has to be extended if the user needs to access other elements of the array. PDF PerFORM: The API does not have any provision to obtain the page numbers under which each of the form fields are present (unless the document contains bookmarks) Limitations of PDF Automation Pro

Thank you