Creating a Virtual Knowledge Base for Financial Risk and Reporting

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Emerging Ontology Work Product Showcase 1 The EDM Council Semantics Repository: Building global consensus for the Financial Services Industry Mike Bennett.
1 Semantics in the Financial Industry: The EDM Council Semantics Repository Progress Report Mike Bennett Hypercube Limited 89 Worship Street, London EC2A.
RDF and RDB 1 Some slides adapted from a presentation by Ivan Herman at the Semantic Technology & Business Conference, 2012.
OMG Finance Domain Task Force (FDTF) Monthly Status/review call Wednesday Dec 4 th 2013.
Financial Industry Semantics and Ontologies The Universal Strategy: Knowledge Driven Finance Financial Times, London 30 October 2014.
Oncor’s EIM Program.
Financial Intermediaries Indirect Finance –An Institution stands between lender and borrower. Direct Finance –Borrowers and lenders deal directly with.
Foundations This chapter lays down the fundamental ideas and choices on which our approach is based. First, it identifies the needs of architects in the.
Business Domain Modelling Principles Theory and Practice HYPERCUBE Ltd 7 CURTAIN RD, LONDON EC2A 3LT Mike Bennett, Hypercube Ltd.
Redefining Perspectives A thought leadership forum for technologists interested in defining a new future June COPYRIGHT ©2015 SAPIENT CORPORATION.
Confidential 111 The Financial Industry Business Ontology Explanatory Material Mike Bennett, EDM Council August
Get More Value from Your Reference Data—Make it Meaningful with TopBraid RDM Bob DuCharme Data Governance and Information Quality Conference June 9.
Rajashree Deka Tetherless World Constellation Rensselaer Polytechnic Institute.
The GPAA RFP to implement Enterprise Data Management 1 GPAA15/2015.
SC32 WG2 Metadata Standards Tutorial Metadata Registries and Big Data WG2 N1945 June 9, 2014 Beijing, China.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Ontology Development Kenneth Baclawski Northeastern University Harvard Medical School.
Ontology Development in the Sciences Some Fundamental Considerations Ontolytics LLC Topics:  Possible uses of ontologies  Ontologies vs. terminologies.
Ultrawrap: SPARQL Execution on Relational Data Juan F. Sequeda, Daniel P. Miranker University of Texas - Austin ISWC 2009 Seoul National University Internet.
Emerging Technologies Work Group Master Data Management (MDM) in the Public Sector Don Hoag Manager.
McGraw-Hill/Irwin Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Financial Instruments, Financial Markets, and Financial.
Relational Databases to RDF (a.k.a RDB2RDF) Juan F. Sequeda Dept of Computer Science University of Texas at Austin.
Master Informatique 1 Semantic Technologies Part 11Direct Mapping Werner Nutt.
On the Semantics of R2RML and its Relationship with the Direct Mapping Juan F. Sequeda Research in Bioinformatics and Semantic Web (RiBS) Lab Department.
Confidential 111 Financial Industry Business Ontology (FIBO) [FIBO– Business Entities] Understanding the Business Conceptual Ontology For FIBO-Business.
2 Copyright © Oracle Corporation, All rights reserved. Defining Data Warehouse Concepts and Terminology.
Oracle Database 11g Semantics Overview Xavier Lopez, Ph.D., Dir. Of Product Mgt., Spatial & Semantic Technologies Souripriya Das, Ph.D., Consultant Member.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Database Systems Lecture 1. In this Lecture Course Information Databases and Database Systems Some History The Relational Model.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
2 Copyright © 2006, Oracle. All rights reserved. Defining Data Warehouse Concepts and Terminology.
OMG Finance Domain Task Force (FDTF) Monthly Status/review call Wednesday January 9 th 2013.
11 Copyright © 2009, Oracle. All rights reserved. Enhancing ETL Performance.
Financial Industry Business Ontology (FIBO) Monthly Status/review call Wednesday January 11 th 2012.
RDB2RDF Working Group Cunxin Jia. Why Mapping RDBs to RDF?
Open Governance Platform
spec.edmcouncil.org/fibo Site Walk-through
Data Platform and Analytics Foundational Training
Agenda Federated Enterprise Architecture Vision
Chapter 2: Database System Concepts and Architecture - Outline
Track B Summary Co-champions: Mike Bennett, Andrea Westerinen
Triple Stores.
RDF and RDB 1 Some slides adapted from a presentation by Ivan Herman at the Semantic Technology & Business Conference, 2012.
Overview of Market Participants and Financial Innovation
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
OMG Finance Domain Task Force (FDTF)
OMG Finance Domain Task Force (FDTF)
Hypercube Ltd.; EDM Council Inc.
Chapter 2 Database Environment Pearson Education © 2009.
Chapter 2 Database Environment.
Stop Data Wrangling, Start Transforming Data to Intelligence
Triple Stores.
Service-enabling in Financial Domain
FIBO-aligned Semantic Triples
Conceptual Ontology Engineering Tutorial Session 5: Putting it to Work
Database Systems Instructor Name: Lecture-3.
Publishing Ordnance Survey Ireland's geospatial data as Linked Data
Data Warehouse.
OMG Finance Domain Task Force (FDTF)
Co-champions: Mike Bennett, Andrea Westerinen
Data Warehousing Concepts
Improving Machine Learning using Background Knowledge
Triple Stores.
Chapter 2 Database Environment Pearson Education © 2009.
OMG Finance Domain Task Force (FDTF)
Chapter 2 Database Environment Pearson Education © 2009.
Co-champions: Mike Bennett, Andrea Westerinen
Financial Industry Business Unified Model (FIBUM)
Presentation transcript:

Creating a Virtual Knowledge Base for Financial Risk and Reporting Juan Sequeda, Capsenta Inc. Mike Bennett, Hypercube Ltd. Ontology Summit 2016 24 March 2016

Risk reporting – New regulatory requirements The Basel Committee for Banking Supervision Set of 14 principles, across 4 broad themes Identifies a number of Systemically Important Banks G-SIB or G-SIFI “Too Big to Fail” Domestic SIFIs to follow… Mandates basic data integration and reuse

The Search for a Conceptual Something Data dictionary? Business Glossary? Business dictionary Terminology? Thesaurus? Or what?... All these words about words…

Creating a Reference Ontology Requires an understanding of cognitive and terminological principles Semiotics Classification theory (taxonomy) Abstraction and encapsulation of concepts Core concepts grounded in the reality of contracts, accounting etc. Use an Upper Ontology Needs to be a common language across the business The Two Ontological Questions: What kind of Thing is this? What distinguishes it from other Things? Needs to be framed in formal logic Unambiguous across business and machines

FIBO: Scope and Content Upper Ontology FIBO Foundations: High level abstractions FIBO Business Entities FIBO Financial Business and Commerce FIBO Indices and Indicators FIBO Contract Ontologies Securities (Common, Equities) Securities (Debt) Derivatives Loans, Mortgage Loans Funds Rights and Warrants FIBO Pricing and Analytics (time-sensitive concepts) Pricing, Yields, Analytics per instrument class FIBO Process Corporate Actions, Securities Issuance and Securitization Future FIBO: Portfolios, Positions etc. Concepts relating to individual institutions, reporting requirements etc.

Semantic Data Virtualization technologies Introducing Capsenta R2RML-compliant “wrappers” Maps conventional databases to RDF ETL (Extract Transform and Load) vs. NoETL (Federated) Example investigation: Wealth management integration issues

Mapping Relational Databases to RDF and OWL Alice 25 Alice Person foaf:name foaf:age foaf:name ID NAME AGE CID 1 Alice 25 100 2 Bob NULL <Person/1> <Person/2> Direct Mapping foaf:based_near City CID NAME 100 Austin 200 Madrid R2RML foaf:name <City/100> Austin foaf:name <City/200> Madrid Standards to map Relational Data to RDF A Direct Mapping of Relational Data to RDF Default automatic mapping of relational data to RDF R2RML: RDB to RDF Mapping Language Customizable language to map relational data to RDF

Ultrawrap: Semantic Data Virtualization Risk, Compliance Search API Dashboard Reports Semantic Queries Reference Ontology R2RML Mappings Internal Data External Data Wealth Management Bond Ordinary Share Equity Preference Share Debt BCBS 329 SQL Server Oracle Golden Source Semi-Structured Unstructured HIVE Impala, etc Postgres Asset Control Legacy Data Sources and Systems

Ultrawrap: Semantic Data Virtualization 1) Create Mappings 2) Use Mappings Create mappings for each Source databases to the Target Ontology Mappings are represented in a declarative language: R2RML Use the mappings to integrate the data ETL transforms the source data into data in terms of the Target Ontology NoETL executes queries written in terms of the Target Ontology over each of the sources

Ultrawrap Mapper Reference Ontology Ultrawrap Mapper R2RML Mapping <TriplesMap1> a rr:TriplesMap; rr:logicalTable [ rr:tableName”Person" ]; rr:subjectMap [ rr:template "http://www.ex.com/Person/{ID}"; rr:class foaf:Person ]; rr:predicateObjectMap [ rr:predicate foaf:based_near ; rr:objectMap [ rr:parentTripelMap <TripleMap2>; rr:joinCondition [ rr:child “CID”; rr:parent “CID”; ] . Ultrawrap Mapper Reference Ontology R2RML Mapping Ultrawrap Mapper

Ultrawrap ETL RDBMS RDF Graph Triplestore Ultrawrap ETL SPARQL SPARQL Results Ultrawrap ETL R2RML Mapping RDBMS RDF Graph Triplestore

Ultrawrap NoETL RDBMS Semantics Ultrawrap NoETL SPARQL SPARQL Results SQL R2RML Mapping SQL Results RDBMS Semantics

NoETL Architecture Reporting SPARQL Federator Ultrawrap NoETL R2RML R2RML R2RML R2RML RDBMS RDBMS RDBMS RDBMS

Hybrid NoETL and ETL Architecture Reporting SPARQL Federator R2RML Ultrawrap NoETL Ultrawrap NoETL RDF Triplestore RDBMS R2RML R2RML Ultrawrap ETL RDBMS RDBMS R2RML RDBMS

Financial Institution with Golden Copy Reporting Query Response Reference Ontology Data Lineage – Metadata management … Golden Copy DB Other DB N Market Data Feed Source DB 1 … Source DB N Market Data Feed 1..n

Data Strategy Considerations Real-time data: Unlikely to want to stand up in triple store Usually apply market data feed (pricing, yield) directly to position data Critical reference data Provenance etc. determined upstream of Golden Copy Any benefit to replicating golden copy data in triple store? Possible re-use of GC data in risk, compliance Data Migration Strategy Implementation of semantics provides an opportunity to review data requirements, Also consider third party data licensing / rights management

Virtualization versus Triple Store When to stand up the Triple Store? Only when that is already in place or a new triple-based application When to query through to source DBs directly? Whenever possible Depends on: Data provenance / DQ measures Temporality Virtual Federation Not have to create the “tower” of a triple store, but you get what looks like one. SPARQL as fast as SQL

Integration: Mapping Considerations Generally not one to one Develop “Mapping Patterns” One data element may represent different ontological things Defined in different contexts How is the context (in this case the presence of other data elements) dealt with? One target ontological Thing or property may be represented by some unique combination of database elements Example: use of “type” enumerations on a class (this is just one design choice in the DB design)

Mapping Considerations (1) Reference Ontology Source DB Autonomous Agent Party In Role hasIdentity All of this (graph) Is a hasAddress Loan Borrower Loan Borrower Name Borrower Address Loan Borrower SSID Maps to… Loan Borrower Address Line 1 Loan hasBorrower … hasAddressElement this. Loan Borrower ZIP Code PostalCode

Mapping Considerations (2) Capsenta has a specific set of transformations that deal with enumerations; Enum entries are transformed to OWL classes Reference Ontology Source DB Negotiable Security Security Security Name: string Issuer: string Identifier: Identifier Security Type: SecurityType Equity Debt … Maps to… This set of values: SecurityType (enum) Ordinary Share Ordinary Share Preference Share Medium Term Note Bond Bond MediumTermNote This class MortgageBackedSecurity …

Semantic Query-based Applications Risk and Compliance Compliance and what-if reports Internal reports Risk reports Generate and update using semantic queries Understood and signed off by business stakeholders

Reporting Reports: Reports are created using semantic queries Timely, accurate, complete Able to generate new kinds of report at short notice Increased frequency during times of crisis Reports are created using semantic queries Tools exist for graphical creation of semantic queries Business analysts can create these queries Starting point: replicate existing reports using semantic queries No additional IT effort required to update reports New reports generated as needed provided the concepts are in the Reference Ontology

Summary Semantic queries are used to recreate reports and dashboards Provide the means to extend and vary these quickly Minimal application development. Non-disruptive: Existing bank systems of record are used No need for centralized data storage (relational or graph database). Existing data quality measures remain unaffected Probabilistic mapping methods to semi-automatically create the mappings

Thank you! Questions? Juan Sequeda, PhD Mike Bennett Founder – SVP Technical Sales and Research, Capsenta Inc. www.Capsenta.com Twitter: @JuanSequeda Mike Bennett Director, Hypercube Ltd. www.hypercube.co.uk Twitter: @MikeHypercube