Confidential. The material in this presentation is the property of Fair Isaac Corporation, is provided for the recipient only, and shall not be used, reproduced,

Slides:



Advertisements
Similar presentations
©2013 Primal Fusion Inc. Data Synthesis The Big Problem with Small Data.
Advertisements

Introducing Formal Methods, Module 1, Version 1.1, Oct., Formal Specification and Analytical Verification L 5.
Advances in the PARCC Mathematics Assessment August
Line Efficiency     Percentage Month Today’s Date
Unit Number Oct 2011 Nov 2011 Dec 2011 Jan 2012 Feb 2012 Mar 2012 Apr 2012 May 2012 Jun 2012 Jul 2012 Aug 2012 Sep (3/4 Unit) 7 8 Units.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
Human Language Technologies. Issue Corporate data stores contain mostly natural language materials. Knowledge Management systems utilize rich semantic.
Semantic data model
Time-Series Forecast Models EXAMPLE Monthly Sales ( in units ) Jan Feb Mar Apr May Jun Jul Aug Sept Oct Nov Dec Data Point or (observation) MGMT E-5070.
1 Press Ctrl-A ©G Dear2009 – Not to be sold/Free to use Comparison of Data Sets Stage 6 - Year 12 General Mathematic (HSC)
1 Press Ctrl-A ©G Dear2009 – Not to be sold/Free to use Displaying Multiple Data Sets Stage 6 - Year 12 General Mathematic (HSC)
BAA - Big Mechanism using SIRA Technology Chuck Rehberg CTO at Trigent Software and Chief Scientist at Semantic Insights™
Artificial Intelligence: Introduction Department of Computer Science & Engineering Indian Institute of Technology Kharagpur.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
The Birthday Paradox July Definition 2 Birthday attacks are a class of brute-force techniques that target the cryptographic hash functions. The.
Knowledge Representation Part I Ontology Jan Pettersen Nytun Knowledge Representation Part I, JPN, UiA1.
Data mining in web applications
Jan 2016 Solar Lunar Data.
IT Strategy Roadmap Template

ITI Portfolio Plan Aug Sep Oct Nov Dec Jan Feb Mar Apr May Jun Jul Aug Sep Oct Current Date Visibility of ITI Projects ITI Projects.
PLANNED MAINTENANCE ACTUAL CASE STUDY AMKOR / ANAM
Analyzing patterns in the phenomena
Q1 Jan Feb Mar ENTER TEXT HERE Notes
Comparative Statistics September 2017

Project timeline # 3 Step # 3 is about x, y and z # 2
Average Monthly Temperature and Rainfall
IT STRATEGY ROADMAP TEMPLATE
MULTIPLE PRODUCT ROADMAP

“How to Make a Lawyerbot that Can Review NDA’s”
Mammoth Caves National Park, Kentucky
2017 Jan Sun Mon Tue Wed Thu Fri Sat

Gantt Chart Enter Year Here Activities Jan Feb Mar Apr May Jun Jul Aug
Q1 Q2 Q3 Q4 PRODUCT ROADMAP TITLE Roadmap Tagline MILESTONE MILESTONE
Free PPT Diagrams : ALLPPT.com

Proposed Strategic Planning Process for FY 2013/14 thru FY 2015/16

Step 3 Step 2 Step 1 Put your text here Put your text here
Calendar Year 2009 Insure Oklahoma Total & Projected Enrollment
MONTH CYCLE BEGINS CYCLE ENDS DUE TO FINANCE JUL /2/2015
Jan Sun Mon Tue Wed Thu Fri Sat
IT STRATEGY ROADMAP TEMPLATE
The Birthday Paradox June 2012.
©G Dear 2008 – Not to be sold/Free to use
GOALS Focus on release of info and data in timely, relevant, ongoing fashion (based on audience) Create communications tools and strategies that reflect.
Electricity Cost and Use – FY 2016 and FY 2017
Unemployment in Today’s Economy
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Q1 Q2 Q3 Q4 PRODUCT ROADMAP TITLE Roadmap Tagline MILESTONE MILESTONE
Free PPT Diagrams : ALLPPT.com

Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Project timeline # 3 Step # 3 is about x, y and z # 2
TIMELINE NAME OF PROJECT Today 2016 Jan Feb Mar Apr May Jun
Chapter 14 February 26, 2004.

Q1 Q2 Q3 Q4 PRODUCT ROADMAP TITLE Roadmap Tagline MILESTONE MILESTONE
Proposed core price & service spec L1 product milestone 31 Dec 2019
Presentation transcript:

Confidential. The material in this presentation is the property of Fair Isaac Corporation, is provided for the recipient only, and shall not be used, reproduced, or disclosed without Fair Isaac Corporation's express consent. © 2008 Fair Isaac Corporation. HNC Data Alignment Research Direction Richard Rohwer Senior Principal Scientist, Advanced Technologies HNC Software / Fair Isaac

2 © 2008 Fair Isaac Corporation. Confidential. Cognition needs Semantics needs Massive Data Massive Data Tacit Knowledge Explicit Knowledge KNOWLEDGE Statistics includes Semantics / Meaning = Association Statistics Information Organization Statistics Reasoning Theorem: Probability distributions are the UNIQUE logically consistent knowledge representation.

3 © 2008 Fair Isaac Corporation. Confidential. From massive data to machine cognition: The technical principles Mathematical ingredients:  Association-Grounded Semantics (AGS) - To capture meaning mathematically.  Semantically-Driven Segmentation (SDS) - To extract the most meaningful patterns.  Distributional Alignment (DA) - To compare meanings abstractly. Semantically Enriched Reasoning Engine  To think in terms of meanings instead of symbols.

4 © 2008 Fair Isaac Corporation. Confidential. Association-Grounded Semantics (AGS): Meaning = Usage fro onto reachin g acrs btwn beyond frm inside alg across via thru ovr around near between within through into over by from at jun sept apr jul nov oct dec aug feb sep jan captain mr gen msgt ltc tsgt cpt sgt ssgt capt maj lt bsb msj tng opv adm atm cpo bdo notal u b Cables

5 © 2008 Fair Isaac Corporation. Confidential. Distributional Alignment (DA) Abstraction ~ Structural Commonality Align semantic spaces by distribution of content.  No need to understand content. Transport meaning between  Languages  Dialects  Cultures Transport metaphorically between topics. transLign algorithm: No language knowledge. No tie words. No aligned corpora.

6 © 2008 Fair Isaac Corporation. Confidential. Alignment: Terminology RP English Cable English Blog Dialects Less Commonly Taught Language Institutional Dialects Terror Cell Obfuscated Slang Professional Dialects Newswire English Foreign Newswire Polysemy (Sense resolution) Good solutions from NIMD:  Entity Disambiguation (5.5% err vs. 13.5% err in KDD)  General terms Information Loss (Unequal expressive power) Automation AGS techniques do not require manually constructed resources… … but can use them when available. “bank” “river bank” “bank note” AGS Semantic Space fluffy snow What ‘cha call it? Naïve Bayes

7 © 2008 Fair Isaac Corporation. Confidential. Alignment: Schemata Column name I n s t a n c e Table name Column name I n s t a n c e Table name Natural Language Corpora Natural Language Corpora Semantic Alignment Instance Statistics (Joined across schema) Semantic Alignment Structural Alignment Schema Graph

8 © 2008 Fair Isaac Corporation. Confidential. Alignment: Ontologies More complex graph structure  Reflecting multiple (transitive) relations - is-a, part-of, reports-to, prerequisite-for, …  Implies more options for defining AGS statistics - More relations, more ways to define co- occurrence. Big Picture issue:  Ontological structure makes general statements about instances of relationships within data.  So does AGS.  How are these related?