Copyright © 2014 Tapestry Solutions, Inc. All rights reserved. Role of Ontology in ‘Big Data’ Jens Pohl, PhD Monday 28 July, 2014 TAPESTRY / MIRO – PROPRIETARY.

Slides:



Advertisements
Similar presentations
Distributed Data Processing
Advertisements

Chapter 1 Business Driven Technology
The 20th International Conference on Software Engineering and Knowledge Engineering (SEKE2008) Department of Electrical and Computer Engineering
Demonstrators: Mudasir Nazir(08-CS-41).  I am highly addicted to this field.  Working with W3C in research program(building CSS for creating web site.
CHAPTER 7 Roderick Dickson Kelli Grubb Tracyann Pryce Shakita White.
Markov Logic Networks Instructor: Pedro Domingos.
Lecture 1 Introduction to the ABAP Workbench
The future of technology, law enforcement, and justice ICJIA Feburary, 2013.
Accessing Organizational Information—Data Warehouse
MIS DATABASE SYSTEMS, DATA WAREHOUSES, AND DATA MARTS MBNA
MD 815 Session #3 Technology Forecasting & Scenario Planning 1. Introduction 2. S-Curves and quantitative forecasting 3. Scenario Planning.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Discovering Computers Fundamentals, 2011 Edition Living in a Digital World.
SESSION 10 MANAGING KNOWLEDGE FOR THE DIGITAL FIRM.
CPSC 695 Future of GIS Marina L. Gavrilova. The future of GIS.
CSE 574: Artificial Intelligence II Statistical Relational Learning Instructor: Pedro Domingos.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 1: Introduction to Windows Server 2003.
Chapter 1: The Database Environment and Development Process
Business Driven Technology Unit 2
Supporting Decision Making Chapter 10 McGraw-Hill/IrwinCopyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved.
OIL: An Ontology Infrastructure for the Semantic Web D. Fensel, F. van Harmelen, I. Horrocks, D. L. McGuinness, P. F. Patel-Schneider Presenter: Cristina.
CHAPTER 08 Accessing Organizational Information – Data Warehouse
RSBM Business School Research in the real world: the users dilemma Dr Gill Green.
DC Chp 1 Jeopardy Review Section 1Section 2Section 3Section 4Section
Module 3: Business Information Systems Chapter 11: Knowledge Management.
Katanosh Morovat.   This concept is a formal approach for identifying the rules that encapsulate the structure, constraint, and control of the operation.
1 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
Web-Enabled Decision Support Systems
Fundamentals of Information Systems, Fifth Edition
Concepts of Engineering and Technology Copyright © Texas Education Agency, All rights reserved.
The Yellow Group Design Informatics (Regli, Stone, Kusiak, Leifer, Gupta, Chung, Fenves, Law, Kopena)
Using SAS® Information Map Studio
Linked-data and the Internet of Things Payam Barnaghi Centre for Communication Systems Research University of Surrey March 2012.
7-1 Management Information Systems for the Information Age Copyright 2004 The McGraw-Hill Companies, Inc. All rights reserved Chapter 7 IT Infrastructures.
Patterns and Reuse. Patterns Reuse of Analysis and Design.
Chapter 3 DECISION SUPPORT SYSTEMS CONCEPTS, METHODOLOGIES, AND TECHNOLOGIES: AN OVERVIEW Study sub-sections: , 3.12(p )
Fundamentals of Information Systems, Seventh Edition 1 Chapter 3 Data Centers, and Business Intelligence.
1.file. 2.database. 3.entity. 4.record. 5.attribute. When working with a database, a group of related fields comprises a(n)…
BAA - Big Mechanism using SIRA Technology Chuck Rehberg CTO at Trigent Software and Chief Scientist at Semantic Insights™
Introduction to Software Engineering. Why SE? Software crisis manifested itself in several ways [1]: ◦ Project running over-time. ◦ Project running over-budget.
Benchmarking ontology-based annotation tools for the Semantic Web Diana Maynard University of Sheffield, UK.
1.1 © 2010 Dr. Tarek Abd El-Hafeez Decision Support Systems Lecture 2 Decision Support Systems.
1 Technology in Action Chapter 11 Behind the Scenes: Databases and Information Systems Copyright © 2010 Pearson Education, Inc. Publishing as Prentice.
Lecture 1 – Operations Research
MODEL-BASED SOFTWARE ARCHITECTURES.  Models of software are used in an increasing number of projects to handle the complexity of application domains.
Computers in Education Past, Present, and Future
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
PowerPoint Presentation by Charlie Cook Copyright © 2004 South-Western. All rights reserved. Chapter 5 Business Intelligence and and Knowledge Management.
Project Management May 30th, Team Members Name Project Role Gint of Communications Sai
Pertemuan 16 Materi : Buku Wajib & Sumber Materi :
Knowledge Modeling and Discovery. About Thetus Thetus develops knowledge modeling and discovery infrastructure software for customers who: Have high-value.
Deployment of Ontology Mediation Of Information Flow Modified from Presentations made in 2002, 2003 and 2004 This material is not specific to any project.
Artificial Intelligence: Research and Collaborative Possibilities a presentation by: Dr. Ernest L. McDuffie, Assistant Professor Department of Computer.
Information Architecture & Design Week 9 Schedule - Web Research Papers Due Now - Questions about Metaphors and Icons with Labels - Design 2- the Web -
1 Chapter 2 Database Environment Pearson Education © 2009.
The overview How the open market works. Players and Bodies  The main players are –The component supplier  Document  Binary –The authorized supplier.
MD 815 Session #3 Technology Forecasting 1. Introduction 2. Quantitative forecasting techniques 3. Predicting Diffusion Case.
Integrated Modeling Environment System Engineering Seminar Johnny Medina / Code 531 Chris Stone / Code 531 / Constellation Software Engineering.
Informatics for Scientific Data Bio-informatics and Medical Informatics Week 9 Lecture notes INF 380E: Perspectives on Information.
Decision Support and Business Intelligence Systems (9 th Ed., Prentice Hall) Chapter 12: Artificial Intelligence and Expert Systems.
Monitoring Windows Server 2012
The Components of Information Systems
Overview of MDM Site Hub
Knowledge Management Systems
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
The Components of Information Systems
MANAGING KNOWLEDGE FOR THE DIGITAL FIRM
Geospatial and Problem Specific Semantics Danielle Forsyth, CEO and Co-Founder Thetus Corporation 20 June, 2006.
About Thetus Thetus develops knowledge discovery and modeling infrastructure software for customers who: Have high value data that does not neatly fit.
Information System Building Blocks
Presentation transcript:

Copyright © 2014 Tapestry Solutions, Inc. All rights reserved. Role of Ontology in ‘Big Data’ Jens Pohl, PhD Monday 28 July, 2014 TAPESTRY / MIRO – PROPRIETARY

Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 2 Origins of Big Data Big Data & Ontologies Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems By-product of the evolution of larger and more complex societal structures. Result of the exponential increase in data due to global connectivity. Big Data is not a completely new phenomenon.

Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 3 Planning is a Critical Tool Big Data & Ontologies Expectation that the plans will be effective. Decisions must be made in a timely manner. Forecasts must be at least reasonably accurate. Organizational complexity generates a need for efficiency through planning. Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems

Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 4 Planning is Predictive Big Data & Ontologies Plans are based on assumptions. Assumptions are predictive in nature. Forecasting future conditions and events based on past experience is problematic. Planning and forecasting are closely related. Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems

Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 5 Forecasts are Mostly Wrong Big Data & Ontologies Western Union Exective – 1876: "The telephone has too many shortcomings to be seriously considered as a means of communication." Lord Kelvin – 1895: "Heavier-than-air flying machines are not possible." Thomas Watson, IBM Chairman – 1943: "I think in the world there is a market for maybe five computers." Ken Olson – 1977: "There is no reason for individuals to have a computer in their home." Bill Gates – 1981: " bytes of memory ought to be enough for anybody." Robert Metcalfe (inventor of the Ethernet): "The Internet will catastrophically collapse in 1996." Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems

Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 6 Early Data Analysis Problems Big Data & Ontologies We rely largely on the analysis of past events to identify future trends. Periodic collection of population census data (every 10 years in the US). Collection of data is time consuming, but the analysis of the data is even more onerous. The 1880 US census took 8 years to process. Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems

Traditional Big Data Analysis Inferential Statistics Collection of very small sample. Representativeness ensured by randomness. Mathematical analysis of sample. Predictions about entire corpus of data Traditional Big Data Analysis Hypothesis based on theory(s). Collection of represen- tative data sample. Correlation analysis of random sample. Testing of hypothesis (and data) Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 7 Big Data & Ontologies Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems

Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY. 8 New Big Data Analysis Approach Big Data & Ontologies Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems Correlation: If A occurs with B then we can predict that A is likely to occur wherever B occurs; - i.e., B is a proxy for A. The analysis is based on a data set that is essentially equivalent to the entire corpus of data. Assumption: Any data domain changes are gradual and not abrupt. Assumption: The corpus of data is continuously extended with new domain data. The correlation is a probabilistic likelihood and not an absolute certainty. Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems

Different Computational Approaches Blind correlation through brute force computation Massive Data Automated extraction of meaning What! Particular Knowledge Why ! Big Data & Ontologies Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 9

Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 10 Big Data & Ontologies Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems From Data to Knowledge LOW VOLUME HIGH VALUE HIGH VOLUME LOW VALUE KNOWLEDGE (INTERPRETATIONS AND RULES) INFORMATION (RICH IN RELATIONSHIPS) PURPOSEFUL DATA (ORGANIZED) LOW LEVEL DATA (UNORGANIZED)

Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 11 Big Data & Ontologies Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems Wasteful Use of Human Resources Human computer user must interpret and manipulate data by adding context.. Context Data without context cannot be automatically interpreted by computers Knowledge Information Organized and Unorganized Data

Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 12 Big Data & Ontologies Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems Fundamental Distinctions rain airport 6 to dense clear for not Scotland in hours field 117 pilot expected 49 Glasgow railcar fog week 82 "…dense fog in Glasgow, Scotland, not expected to clear for 6 hours…" KNOWLEDGE comprises inferences derived from information. Aircraft bound for Glasgow International Airport are likely to be rerouted or delayed. INFORMATION is numbers and words with relationships. DATA are numbers and words without relationships.

Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 13 Big Data & Ontologies Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems Principal Context Components History Time Location Environment Identity Culture Urgency Activity "...any information that characterizes the interaction of entities (i.e., players and objects), within a given situation…"

Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 14 Big Data & Ontologies Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems Context as an Enabler Context is a prerequisite for … Automated interpretation of data. 1 Automated filtering of data. 2 Automated retrieval of useful data. 3 Intelligent collaborative decision tools. 4 Self-healing and secure information networks. 5 Responsive human-computer interfaces. 6

Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 15 Big Data & Ontologies Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems Virtual Model of Real World Context

Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 16 Big Data & Ontologies Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems Ontology Representation Defines the innate nature and operational context within which the actual values of entities can be accurately interpreted. Rich Relationships Logic (Business Rules) Modeling Patterns and Techniques Provides: Semantic context.

Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 17 Big Data & Ontologies Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems Ontology Construction

Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 18 Big Data & Ontologies Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems Ontology of Real World Context

Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 19 Big Data & Ontologies Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems Human-Computer Partnership Ontology provides context for automated reasoning by software agents. Human Context Computer Context Organized Data Information Knowledge Unorganized Data Ontology Data capture in context

Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 20 Load - Planning from a Data Viewpoint Big Data & Ontologies The common parameters of Big Data are Volume, Velocity and Variety (3Vs). Over 30,000 cargo items per ship. From one load-plan in two days to four load-plans in two hours. Over 300 attributes per cargo item, 320 ships, 250 aircraft configurations, and over 15,000 railcars. Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems Volume Velocity Variety

ICODES-GS v6: Overview Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 21 ICODES v6 Portal Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems

Evolution of ICODES: 2011: Fielding of ICODES GS Within a Collaborative Information Workspace (CIW), ICODES GS becomes a set of intelligent reusable services, with user-transparent data exchange capabilities, which are accessed through a single sign-on portal. Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 22 Big Data & Ontologies Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems

ICODES GS: Applications and Services Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 23 Big Data & Ontologies Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems

ICODES-GS v6: SLP Single Load Planner (SLP) Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 24 Single Load Planner (SLP) Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems

Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 25 Big Data & Ontologies Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems ICODES - SLP: Knowledge Domains The core of ICODES GS is its knowledge-base of context and business rules that allow agents to automatically interpret data changes and provide useful assistance to the operator. ICODES knowledge domains include: ICODES user-interfaces include:

ICODES: Ontology-Based Multi-Agent System Hazard Agent Hatches Agent Doors Agent Openings Agent Access Agent Ramps Agent USER MULTI- MEDIA Trim & Stability Agent Layered Ontology CAD Engine Cranes Agent Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 26 Big Data & Ontologies Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems

ICODES : Ship Load-Planning User-Interface Tool Bars Agent Status Bar Graphics Window Status Bar Main Menu Bar Associations Toolbar Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 27 Big Data & Ontologies Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems

ICODES GS: Four Domains – Aircraft, Ship, Rail and Yards AircraftShip RailYards Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 28 Big Data & Ontologies Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems

ICODES-GS v6: DC Data Cleanser (DC) Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 29 Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems

ICODES -GS v6: DC A set of components that serve as the source of reference data for ICODES v6 and provides the operator with the capability to validate, correct, and automatically populate cargo data. Problem: Incorrect or partial user input. Solution: Validate and Auto-Fill using Ref. Data MARVEL AES-based solution ICODES 6 Reference Data Data Cleanser Service ICODES 6 Applications & Services reads Data Cleanser (DC) Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 30 Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems

ICODES-GS v6: DC DC Web-Application Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 31 Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems

ICODES-GS v6: IR Information Repository (IR) Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 32 Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems

ICODES-GS v6: IR Information Repository (IR) ICODES v6 Components Web Service Define and list categories of data Publish data to a category Remove data from a category Browse, search, and retrieve data residing in a category Standalone Application Provides a user interface that enables an end- user to utilize the IR Web Service capabilities. Embeddable Components Provides a unified software library that allows ICODES v6 applications to present standardized dialogs for import and export. Enterprise Users Desktop Users SLP CB IR Standalone Application EIP Environment SLP, CE, BBT, and CB DB IR Service MARVEL AES offers inexact search over metadata. ICODES v6 components that provide operators, services, and applications with a centralized location for sharing data in support of user collaboration. Copyright © 2014 Tapestry Solutions, Inc. / Miro Technologies, Inc. All rights reserved. TAPESTRY / MIRO – PROPRIETARY 33 Global Services & Support | Training Systems and Government Services | Logistic Information Management Systems