Improving Machine Learning using Background Knowledge

Slides:



Advertisements
Similar presentations
Taxonomy & Ontology Impact on Search Infrastructure John R. McGrath Sr. Director, Fast Search & Transfer.
Advertisements

When the bartender asked, "How's it going, Norm
Financial Industry Semantics and Ontologies The Universal Strategy: Knowledge Driven Finance Financial Times, London 30 October 2014.
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
Ontology Learning and Population from Text: Algorithms, Evaluation and Applications Chapters Presented by Sole.
Redefining Perspectives A thought leadership forum for technologists interested in defining a new future June COPYRIGHT ©2015 SAPIENT CORPORATION.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
RESEARCH DESIGN.
The Bayesian Web Adding Reasoning with Uncertainty to the Semantic Web
Semantic Web outlook and trends May The Past 24 Odd Years 1984 Lenat’s Cyc vision 1989 TBL’s Web vision 1991 DARPA Knowledge Sharing Effort 1996.
Mining the Semantic Web: Requirements for Machine Learning Fabio Ciravegna, Sam Chapman Presented by Steve Hookway 10/20/05.
Knowledge representation
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
Using Text Mining and Natural Language Processing for Health Care Claims Processing Cihan ÜNAL
© Copyright 2013 ABBYY NLP PLATFORM FOR EU-LINGUAL DIGITAL SINGLE MARKET Alexander Rylov LTi Summit 2013 Confidential.
Ontology Summit 2015 Track C Report-back Summit Synthesis Session 1, 19 Feb 2015.
Definition of a taxonomy “System for naming and organizing things into groups that share similar characteristics” Taxonomy Architectures Applications.
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
Conversation Analysis Introduction to Conversation Analysis 2e Anthony J. Liddicoat, March 2011.
Dr. Leo Obrst Information Semantics Cognitive Science & Artificial Intelligence Information Technology Technical Center Center for Connected Government.
1 CS 385 Fall 2006 Chapter 1 AI: Early History and Applications.
1 The main topics in AI Artificial intelligence can be considered under a number of headings: –Search (includes Game Playing). –Representing Knowledge.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
13 April 2015Ontology Summit 2015 Symposium1 Ontology Summit 2015 Internet of Things: Toward Smart Networked Systems and Societies Symposium – April 13,
RULES Patty Nordstrom Hien Nguyen. "Cognitive Skills are Realized by Production Rules"
Artificial Intelligence: Research and Collaborative Possibilities a presentation by: Dr. Ernest L. McDuffie, Assistant Professor Department of Computer.
1 Sobah Abbas Petersen Adjunct Associate Professor, NTNU Researcher, Sintef TDT4252 Modelling of Information Systems Advanced Course TDT4252,
Chapter 9 The Communicative Approach.
Financial Industry Business Ontology (FIBO) Monthly Status/review call Wednesday November 2 nd 2011.
Preparing to teach OCR GCSE (9-1) Geography B (Geography for Enquiring Minds) Planning, constructing and introducing your new course.
Financial Industry Business Ontology (FIBO) Monthly Status/review call Wednesday January 11 th 2012.
Ontology Summit 2016: Launch Framing the Conversation: Ontologies within Semantic Interoperability Ecosystems Donna Fritzsche and Michael Grüninger February.
Knowledge Representation Part I Ontology Jan Pettersen Nytun Knowledge Representation Part I, JPN, UiA1.
What about the Assessment System?
Chapter 3 Internal Controls.
Information Organization
WCI - iNED - MAID Modelling, Analytics and Insights from Data
The Semantic Web By: Maulik Parikh.
COMP6215 Semantic Web Technologies
Sentiment analysis algorithms and applications: A survey
Track B Summary Co-champions: Mike Bennett, Andrea Westerinen
Object-Oriented Analysis and Design
Francisco da Silva, Senior Councillor, Huawei
Knowledge Representation and Reasoning into Machine and Deep Learning
An Overview Of Vision 1 Summer 1395.
Component 2 Differentiation in Instruction
Making Connections: guidance on non-exam assessment
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Web Mining Ref:
Ontology Summit 2016 – Track B
Intelligent Agents Chapter 2.
High Schools that Work Conference – 2018 Orlando, Florida
Literacy Content Specialist, CDE
FIBO-aligned Semantic Triples
ece 627 intelligent web: ontology and beyond
Overview of Machine Learning
Ontology Summit Synthesis
Fitting data collection into your Stats lessons
Build ontologies from texts and using them for IR
Starter Following what we covered in the last session, match the terms with their correct definitions. Mode Representation Genre Register Put these terms.
FUTURE BILC THEMES AND TOPICS
Chapter 11 user support.
Francisco da Silva, Senior Councillor, Huawei
Upper Ontologies for Specifying Context
RESEARCH BASICS What is research?.
Co-champions: Mike Bennett, Andrea Westerinen
What is AO4? AO4: relate texts to their social, cultural and historical contexts; explain how texts have been influential and significant to self and to.
Who Are Champions, What Are Their Impacts and How Do We Know
Machine Learning for Space Systems: Are We Ready?
Co-champions: Mike Bennett, Andrea Westerinen
Presentation transcript:

Improving Machine Learning using Background Knowledge Ontology Summit 2017 Track B – 15 March

Track B : Using Background Knowledge to Improve Machine Learning Results Track Co-champions: Mike Bennett Andrea Westerinen Blog Page http://ontologforum.org/index.php/Blog:Improving_Machine_Lear ning_using_Background_Knowledge

Motivations Machine Learning (ML) is based on defining and using mathematical models to perform tasks, predict outcomes, make recommendations, etc. Initial models can be specified by a data scientist, and/or constructed through combinations of supervised and unsupervised learning and pattern analysis Challenges with this: If no background knowledge is employed, the ML results may not be understandable There is a bewildering array of model choices and combinations Background knowledge could improve the quality of ML results by using reasoning techniques to select learning models and prepare the training and examined data (reducing large, noisy data sets to manageable, focused ones)

Summit Graphical Overview

Track B Positioning

Objective The objective of this Ontology Summit 2017 track is to understand: Challenges in using different kinds of background knowledge in machine learning Role of ontologies, vocabularies and other resources to improve machine learning results Design/construction/content/evolution/... requirements for an ontology to support machine learning

Today’s Session We explore the problem space by way of a targeted presentation on learning for decision support We want to start a conversation on the kinds of ontologies needed for ML, and their "ground rules" and requirements We will also explore other aspects of an ontology- driven natural language architecture: The application of semantics to neural learning functionality The possible role of statistical analysis Where does the human fit in the loop and what do they do? These issues will set the scene for the second Track B session on April 12.

Some Use Cases Parsing regulatory texts Parsing informal chatter To identify hot spots for the organization to address To identify new and emerging requirements Parsing informal chatter Intelligence – who is connected to whom? Compliance – investor chatter, malfeasance Drawing inferences from text in company filings What do we know about relations between entities? What do we know about exposures, risks etc.?

Considerations How is natural language analysed What are the steps / pieces getting from there to “Meaning”? The role of ontologies Last week in Track A we heard about generating ontologies from natural language texts In this track we want to understand how or whether one can use ontologies to improve results from unstructured text Kinds of ontology or knowledge resource to use Formal ontology (FOL e.g. OWL) Vocabulary or thesaurus etc. Other formalisms Role of synonyms / near synonyms etc.

Ontologies What does the ontology need to do And how does this determine what kinds of ontology can be used in this way? For instance: given a loose, human use of a generic word like “deal” Do we identify the precise concept of “deal” at one of its most general in the ontology Actually Deal is a synonym for 2 financial concepts: an offering or a trade Do we populate the ontology with near-synonyms that may be used to refer to a concept even when they don’t refer precisely? For this, any ontology used needs to have a good taxonomy Support multiple facets; deep specific to general hierarchy Additional metadata helps Synonyms (giving heteronyms when looked at the other way!) Broader and narrower near-synonyms (SKOS?) Synecdochal usages – where a part is used to describe the whole or vice versa Banking example: use of early, verbose draft material from FIBO meant there were more words in a given context (where a class or property is the context), making it easier to disambiguate different usages How does this influence the style of design of an ontology, or the selection of existing ones? E.g. do formal restrictions play a role in the use of an ontology in machine learning? If not, where does the meaning come from?

Other Pieces of the Puzzle Linguistic Analysis How do we get from the range of words in English or Chinese or Arabic, to the core noun and verb concepts in an ontology? Fuzzy and imprecise human usages Statistical analysis Whether “Bank” is a part of a river or a financial institution ontology/ies as the reference point “This is what Bank means in this context” Additional contextual content including metadata Machine learning What about neural learning functions? If it all happens in a “Black box” can we understand meaning for any of it?

Today’s Session The overall theme for both of today’s presentations is how to achieve situational awareness from unstructured source material Different kinds of situation / time Different kinds of source material Different kinds of knowledge resource / ontology The question: how do we quickly and effectively get actionable information from the data available Split second decision making in real time Business compliance / reaction Forensic investigation Humans, ontologies, AI and the balance of these

Today’s Presentations Simon Davidson (Psonify) The Investigator's Toolkit – Deriving Immediate, Actionable Insights from Unstructured Data What we will learn Extraction of core terms from language Using an ontology for natural language processing Demo of the Investigative Toolkit Getting actionable insights from what is in the text Ken Baclawski (Northeastern University) Combining Ontologies and Machine Learning to Improve Decision Making Decision making – combining machine learning and other techniques Benefits – quality, understandability of decisions, adaptability Ontologies in decision making Human in the loop – integrating humans and computers Application domains

Over to our first speaker… Please put questions in the “Soaphub” chat window use the Bluejeans chat for logistics only These will form the basis for discussion later in the sessions Presenters will also respond to those questions in the chat once they are done speaking This session is being recorded and both the recording and the chat transcript will be made available