Digitizing Transmuter. Extracting relevant information from the electronic media into digitized form and accumulating the information bank for further.

Slides:



Advertisements
Similar presentations
Advanced XSLT. Branching in XSLT XSLT is functional programming –The program evaluates a function –The function transforms one structure into another.
Advertisements

Setting Up Information Portal Irwan Sampurna C-CONTENT 23 May 2006.
What is a Database By: Cristian Dubon.
Module 8 Importing and Exporting Data. Module Overview Transferring Data To/From SQL Server Importing & Exporting Table Data Inserting Data in Bulk.
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
Overview of Search Engines
Database Design IST 7-10 Presented by Miss Egan and Miss Richards.
Collections Management Museums Reporting in KE EMu.
 A data processing system is a combination of machines and people that for a set of inputs produces a defined set of outputs. The inputs and outputs.
Reporting in EMu Crystal != Reporting or Why is reporting so difficult and can we do anything about it? Bernard Marshall KE Software.
Databases & Data Warehouses Chapter 3 Database Processing.
1 LOMGen: A Learning Object Metadata Generator Applied to Computer Science Terminology A. Singh, H. Boley, V.C. Bhavsar National Research Council and University.
Gravity Control™: Is a new generation graphic user interface for searching, sorting and managing large amounts of data from different sources. Makes interaction.
Overview of SQL Server Alka Arora.
Aurora: A Conceptual Model for Web-content Adaptation to Support the Universal Accessibility of Web-based Services Anita W. Huang, Neel Sundaresan Presented.
LATTICE TECHNOLOGY, INC. For Version 10.0 and later XVL Web Master Advanced Tutorial For Version 10.0 and later.
Server-side Scripting Powering the webs favourite services.
4-1 INTERNET DATABASE CONNECTOR Colorado Technical University IT420 Tim Peterson.
Parser-Driven Games Tool programming © Allan C. Milne Abertay University v
OracleAS Reports Services. Problem Statement To simplify the process of managing, creating and execution of Oracle Reports.
Miscellaneous Excel Combining Excel and Access. – Importing, exporting and linking Parsing and manipulating data. 1.
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
Module 7 Reading SQL Server® 2008 R2 Execution Plans.
Chapter 27 The World Wide Web and XML. Copyright © 2004 Pearson Addison-Wesley. All rights reserved.27-2 Topics in this Chapter The Web and the Internet.
Chapter 2 Architecture of a Search Engine. Search Engine Architecture n A software architecture consists of software components, the interfaces provided.
ENCLOUT Bring API to People. API Ecosystem Gap  Business Analysts  Good with spreadsheets  Limiting scripting or SQL skills  API Developers  Knowledge.
Transforming Documents „a how-to of transforming xml documents“ Lecture on Walter Kriha.
XML and Digital Libraries M. Zubair Department of Computer Science Old Dominion University.
© 2001 Business & Information Systems 2/e1 Chapter 8 Personal Productivity and Problem Solving.
Lead Black Slide Powered by DeSiaMore1. 2 Chapter 8 Personal Productivity and Problem Solving.
Microsoft Access Designing and creating tables and populating data.
HARDWARE INPUT DEVICES GETTING DATA INTO THE COMPUTER.
The eXtensible Markup Language (XML). Presentation Outline Part 1: The basics of creating an XML document Part 2: Developing constraints for a well formed.
By: Namrata Lele Mentors: Dave Vieglais Bruce Wilson 1 VDC/TWG Meeting August 09.
ENG College of Engineering Engineering Education Innovation Center 1 Array Accessing and Strings in MATLAB Topics Covered: 1.Array addressing. 2.
G045 Lecture 08 DFD Level 1 Diagrams (Data Flow Diagrams Level 1)
ENG College of Engineering Engineering Education Innovation Center 1 More Script Files in MATLAB Script File I/O : Chapter 4 1.Global Variables.
Access Chapter 5-Table Tricks, Advanced Queries and Custom Forms.
ACIS Introduction to Data Analytics & Business Intelligence Text Mining Data Cleaning.
Using the Open PHACTS API with KNIME Daniela Digles Open PHACTS Community Workshop.
DAY 21: MICROSOFT ACCESS – CHAPTER 5 MICROSOFT ACCESS – CHAPTER 6 MICROSOFT ACCESS – CHAPTER 7 Aliya Farheen October 29,2015.
Chapter 1: Overview of SAS System Basic Concepts of SAS System.
ACG 4401 XSLT Extensible Stylesheet Language for Transformations Presenting XML and XBRL.
Week 9 : Text processing (Reading and writing files)
CSC 2720 Building Web Applications Basic Frameworks for Building Dynamic Web Sites / Web Applications.
Is this going to be difficult? Identify by Radius?
WORLD CONSORTIUM Welcome to. An overview by Phil Elliott Satzconcept Skandinavia a.s.
1 CSE 2337 Chapter 7 Organizing Data. 2 Overview Import unstructured data Concatenation Parse Create Excel Lists.
B Copyright © 2011, Oracle and/or its affiliates. All rights reserved. Working with PDF and eText Templates.
ACG 4401 XSLT Extensible Stylesheet Language for Transformations Presenting XML and XBRL.
Analytics Plus Product Overview. Introduction Analytics Plus is a self-service Business Intelligence and advanced analytics software. On-premise reporting.
Data Exchange Framework
Microsoft Power Query: an Excel Users Dream for Data Extraction and Cleansing Presented by: Belinda Allen Smith & Allen Consulting, Inc.
General Architecture of Retrieval Systems 1Adrienn Skrop.
Pennsylvania Information Management System (PIMS) PIMS Cognos Reporting Instructions December 2007.
Product Overview.
Creates the file on disk and opens it for writing
Data Warehousing/Loading the DW—Topics
WOCAT Mapping methodology
Bulk Loading Documents* into Windchill
ACG 4401 XSLT Extensible Stylesheet Language for Transformations
Data Migration to DOORS DNG Presented By Adam Hammett
Creates the file on disk and opens it for writing
Analytics Plus Product Overview 1.
Practice Activity – Part 1
Analytics Plus Product Overview.
Lecture 13 Teamwork Bryan Burlingame 1 May 2019.
Bug Localization with Combination of Deep Learning and Information Retrieval A. N. Lam et al. International Conference on Program Comprehension 2017.
Data Warehousing/Loading the DW—Topics
Presentation transcript:

Digitizing Transmuter

Extracting relevant information from the electronic media into digitized form and accumulating the information bank for further processing based on the user needs. Problem statement

Solution “Digitizing Transmuter” is the answer to the unanswered question. As opposed to the general transmuters, Digitizing Transmuter controls the input and renders it onto a output which can be inturn used as an input. Electronic Media Digitizing Transmuter Digitized Information Bank

Digitizing Transmuter A common barrier for the present transmuters in market is the PDF format. Pdf provides information based on few parameters viz. Text objects which are considered as one or more glyph shapes representing characters of text. Similarly with the image object. These parameters makes it difficult for other transmuters to use the information for any further processing Digitizing Transmuter makes it happen in seconds. It transmutes all the necessary data to more and more sententious and processable data.

Digitizing Transmuter Input PDF file Read parameters like height, weight Read line & rectangle based on start & end points Get co-ordinate space by Transformation matrix Region or table detection based on connected lines Is parent rectangl e Region contains literals Form rows and tables based on the literals Divide rows into columns Retrieve Headers Does header matches in mapping structure Fetch Data under the header Yes No Yes Grow region till next rectangle Process

Digitizing Transmuter Mapping file decides the data to be fetched from a input file. Input string in mapping file is the header under which the necessary data resides. Other parameters like data type and the names to be appeared in output file are mentioned.

Overview Process Submit a electronic form of data viz. PDF, Excel to DT. Instruct DT of what is needed and what to fetch. DT performs the parsing instructed by the user with the validation on the parsed data. Presents the output in digital form viz. DB,XML. Components Settings Parser Validation Engine Report Generation

How it work Settings This component helps to select the input file, to locate the mapping file and gets the path to produce the output file.

How it work Parser Select the parser viz. PDF,Excel,List etc. Select the file or folder of files to parse. Validation Is performed internally after parsing is completed.

How it work Report Generation Report generation takes place in 3 types. XML form CSV form HTML form

Achievements Commercial use of Digitizing Transmuter RiskSpan Serving the MBS and ABS marketplace, also offers integrated solutions that combine powerful analytics, data and expert advisory services. Is using Digitizing Transmuter for last 12 months

Achievements Digitized Transmuter handles monthly ABX deals for RiskSpan Few hundred users access the ABX analysis data on a monthly basis

Achievements It is estimated that Digitized Transmuter can parse and output data from 7000 deals in 3 working days (24 hrs). Digitized Transmuter has parsed and rendered 3000 deals in a.lst format in under 3 minutes 3000 parsed and rendered 3000 deals 3000 Deals parsed under 3 minutes Deal s