© 2006 Tom Beckman and Ronald P. Reck1 Issues in Knowledge Representation and Semantic Interoperability Tom Beckman Principal, Beckman Associates

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Taxonomy & Ontology Impact on Search Infrastructure John R. McGrath Sr. Director, Fast Search & Transfer.
Modelling with expert systems. Expert systems Modelling with expert systems Coaching modelling with expert systems Advantages and limitations of modelling.
C6 Databases.
Copyright Irwin/McGraw-Hill Data Modeling Prepared by Kevin C. Dittman for Systems Analysis & Design Methods 4ed by J. L. Whitten & L. D. Bentley.
So What Does it All Mean? Geospatial Semantics and Ontologies Dr Kristin Stock.
Basics of Knowledge Management ICOM5047 – Design Project in Computer Engineering ECE Department J. Fernando Vega Riveros, Ph.D.
Helping people find content … preparing content to be found Enabling the Semantic Web Joseph Busch.
Managing Data Resources
Data Quality Class 3. Goals Dimensions of Data Quality Enterprise Reference Data Data Parsing.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
Architecture & Data Management of XML-Based Digital Video Library System Jacky C.K. Ma Michael R. Lyu.
Marakas: Decision Support Systems, 2nd Edition © 2003, Prentice-Hall Chapter Chapter 1: Introduction to Decision Support Systems Decision Support.
1 Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang, Assistant Professor Dept. of Computer Science & Information Engineering National Central.
Modified from Sommerville’s originalsSoftware Engineering, 7th edition. Chapter 8 Slide 1 System models.
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
10 December, 2013 Katrin Heinze, Bundesbank CEN/WS XBRL CWA1: DPM Meta model CWA1Page 1.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Chapter 1: The Database Environment and Development Process
Data Management Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition.
Research paper: Web Mining Research: A survey SIGKDD Explorations, June Volume 2, Issue 1 Author: R. Kosala and H. Blockeel.
Knowledge Acquisition. Concepts of Knowledge Engineering Knowledge engineering The engineering discipline in which knowledge is integrated into computer.
Learning Object Metadata Mining Masoud Makrehchi Supervisor: Prof. Mohamed Kamel.
Knowledge representation
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
School of Computing FACULTY OF ENGINEERING Developing a methodology for building small scale domain ontologies: HISO case study Ilaria Corda PhD student.
Nancy Lawler U.S. Department of Defense ISO/IEC Part 2: Classification Schemes Metadata Registries — Part 2: Classification Schemes The revision.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
© 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang 5-1 Chapter 5 Business Intelligence: Data.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
Professor Michael J. Losacco CIS 1110 – Using Computers Database Management Chapter 9.
Definition of a taxonomy “System for naming and organizing things into groups that share similar characteristics” Taxonomy Architectures Applications.
Chapter 7 System models.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Terminology and documentation*  Object of the study of terminology:  analysis and description of the units representing specialized knowledge in specialized.
4 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved. Computer Software Chapter 4.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Jan 9, 2004 Symposium on Best Practice LSA, Boston, MA 1 Comparability of language data and analysis Using an ontology for linguistics Scott Farrar, U.
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
A Context Model based on Ontological Languages: a Proposal for Information Visualization School of Informatics Castilla-La Mancha University Ramón Hervás.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
Managing Data Resources. File Organization Terms and Concepts Bit: Smallest unit of data; binary digit (0,1) Byte: Group of bits that represents a single.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Why Standardize Metadata?. Why Have a Standard? Think for a moment how hard it would be to… … bake a cake without standard units of measurement. … put.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
Information Retrieval
0 / Database Management. 1 / Identify file maintenance techniques Discuss the terms character, field, record, and table Describe characteristics.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
© 2006 Tom Beckman Knowledge Representations for Semantic Interoperability Conceptual Primitives Knowledge Structures Reasoning Mechanisms Tom Beckman.
Enable Semantic Interoperability for Decision Support and Risk Management Presented by Dr. David Li Key Contributors: Dr. Ruixin Yang and Dr. John Qu.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Understanding the Value and Importance of Proper Data Documentation 5-1 At the conclusion of this module the participant will be able to List the seven.
Decision Support and Business Intelligence Systems (9 th Ed., Prentice Hall) Chapter 12: Artificial Intelligence and Expert Systems.
Managing Data Resources File Organization and databases for business information systems.
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
Knowledge Representation Part I Ontology Jan Pettersen Nytun Knowledge Representation Part I, JPN, UiA1.
Research on Knowledge Element Relation and Knowledge Service for Agricultural Literature Resource Xie nengfu; Sun wei and Zhang xuefu 3rd April 2017.
Introduction Multimedia initial focus
ece 627 intelligent web: ontology and beyond
DATA MODELS.
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
MANAGING DATA RESOURCES
The ultimate in data organization
Presentation transcript:

© 2006 Tom Beckman and Ronald P. Reck1 Issues in Knowledge Representation and Semantic Interoperability Tom Beckman Principal, Beckman Associates Ronald Reck Principal, RRecktek

© 2006 Tom Beckman and Ronald P. Reck2 Why is Knowledge Representation Important to Semantic Systems Provide systematic method to capture, define, and describe relevant semantics Provide explicit definition and description of Concepts, Attributes, Values, and Relationships Explicitly represent knowledge, experience, and expertise Determine the best Knowledge Representations to use depending on the task, environment, and user Employ appropriate knowledge structures and inference engines Select the best elicitation techniques to capture knowledge Compare and integrate similar concepts and data elements Determine the best level of granularity to describe data values Ensure proper consolidation of data elements across legacy systems Develop improved definition and description of concepts Provide improved data sharing, semantic interoperability, and search capability

© 2006 Tom Beckman and Ronald P. Reck3 Knowledge Representation Basics Knowledge Representation Characteristics:  Models knowledge and reasoning about knowledge  Describes characteristics and dimensions of knowledge  Formally defines structures and processes for electronic and human reasoning  Exposes Knowledge Structures and hides Inference Engines Symbolic Knowledge Representation Components:  Knowledge Structures:Objects DeclarativeStatic  Reasoning Mechanisms:Process ProceduralDynamic  Content and Context:Symbols SemanticsMeaning Explicit Representation of:  Knowledge  Experience  Expertise  Semantics of Content: Symbols, numbers, and language  Context: Concept relationships and features  Importance  Uncertainty

© 2006 Tom Beckman and Ronald P. Reck4 Knowledge Representation Dimensions Concept:  Symbolic Format:  Concept Types: Object, Entity, and Abstraction  Domain content knowledge Structure:  Declarative representation  Composed of Nodes and Links  Expert System types parallel human cognitive schema Process:  Procedural representation  Reasoning and Inference  Modeling and Simulation

© 2006 Tom Beckman and Ronald P. Reck5 Concept Definition Symbolic Format: Typing: Object, Entity, Abstraction, and Event Features:  Essential attributes that define the concept  Attributes that discriminate boundaries between concepts  Attribute Values also help define concept details Relations:  Typing: causal, taxonomy/hierarchy, associative  Between concepts  Between attributes

© 2006 Tom Beckman and Ronald P. Reck6 Concept Dimensions Meaning: is nothing more than the sum of these concept dimensions (after Bruner) Definition Attributes:  Stereotypical description of characteristics  Format: Relations:  Between concepts  Between attributes Linguistics:  Part of speech  Grammar rules Context:  Based on user experience and purpose  Common understanding Mental Models and Cognitive Schema

© 2006 Tom Beckman and Ronald P. Reck7 Attribute Value Typology Numeric:  Ordinal: Likert Scale  Interval: Range, Continuous Variable  Continuous: Ratio, normalized continuous variable Semantic:  Text Value Types:  Unstructured: Instant Messaging  Semi-Structured: , Memo  Structured: Document, Hypertext  Symbolic Value Types: Binary: Boolean Categorical: Unrelated Nominal Ordinal: Related Nominal Sensory:  Image: Digital spatial array, picture, video  Signal: Time series, audio, sensor

© 2006 Tom Beckman and Ronald P. Reck8 Attribute Value Typing Numeric:  Most precise reasoning  Needs explanation of computing and results  Often hidden – working in the background behind symbols and text Symbolic:  Less precise but most concise  Ease of reasoning and explanation Linguistic:  Best for explanation and understanding  Not as precise, and not concise  Hard to directly reason with – language parsers Image: Described and classified using taxonomies and metadata Signal: Time series are described, interpreted, and classified using taxonomies, metadata, and analysis engines

© 2006 Tom Beckman and Ronald P. Reck9 Conceptual Primitives Knowledge Templates are key conceptual primitives:  Represent assertions – the basic building blocks of structures  Define, describe, and detail symbol features, values, & relations  Can also represent Uncertainty & Importance  Come in several standard templates: Basic: Faceted: Measured: Declarative Conceptual Primitives:  Feature Descriptor: Ex:  Relation: Ex: Procedural Conceptual Primitives:  Action: Ex:  Inference: Ex: These knowledge elements have certain properties:  Naming (Object)  Describing (Attribute and Value)  Organizing (Hierarchy)  Relating (Functional, Causal, & Empirical Links)  Constraining and Negating (Networks and Rules)

© 2006 Tom Beckman and Ronald P. Reck10 Determining Concept Similarity (1) Compare potentially identical concepts across disparate legacy databases, documents, and Web sites Real understanding for potential data sharing comes from detailed examination and matching of concept characteristics Compare Symbol Names:  Identical or synonym  Degree of similarity as relative specificity Same Concept Type: Object, Entity, Abstraction, or Event Compare Concept Descriptions:  Natural language descriptions found in data dictionaries  Compare keywords and context in concept descriptions Compare Concept Relations:  Relation Typing: causal, taxonomy/hierarchy, associative  Relations between concepts and between attributes  Relative position of concept in taxonomy  Metadata Compare Concept Task Usage and Context

© 2006 Tom Beckman and Ronald P. Reck11 Determining Concept Similarity (2) At the data schema or logical level, concepts are described and defined as attributes/features and their allowable values Compare Concept Attributes (Data Elements):  Compare attribute names and determine similarity  Compare attribute descriptions in data schema  Determine relative importance of attributes  Number of essential common attributes between concept schema  Number of dissimilar attributes between schema  Look for compound data elements Compare Data Element Values:  Value descriptions found in data schema and data dictionaries  Compare keywords and context in attribute value descriptions  Compare Value Typing: numeric, symbolic, natural language, image/signal  Compare Value Characteristics: field length, valid numeric range, boundary values, metrics  Compare importance and uncertainty measures Compare Business Rules: Validity and consistency checks

© 2006 Tom Beckman and Ronald P. Reck12 Examples of Data Element Matching Ex 1: Comparing Address Zip Codes  Concept = Address  Attributes =  Attribute Value Options: = o Field Length: 5 digits, 9 digits (full Zip Code), 6 digits (Canadian) o Typing: US: numeric; Canadian/English: alpha and numeric o Valid Zip Code Values o Business Rule: If IRS Philadelphia SC, then Zip Code = Ex 2: Determining Suspect Height  Qualitative Approach: Height = o o Taller or shorter than witness or interviewer  Quantitative Approach: Height = o In inches; feet and inches; range estimate; o Valid range (for adult – context) = 30 to 90 inches

© 2006 Tom Beckman and Ronald P. Reck13 Capturing Accurate Data Values MIT Expert System story  Investment Expert System – Randy Davis suggestion  Qualitative values for economic attributes  rule explosion DOJ Teenage Drug Usage Survey  Monthly survey on frequency of teenage drug usage  During the past month, how often did you smoke weed?  Values =  Should have just captured an estimate of the number of times; later the data could be grouped as needed  As captured, unclear if data trends mean anything unless huge shift Data Accuracy Principle:  Capture numeric values when possible and relatively easy  Even qualitative data can be put on Likert value scale (1-7) Harmonizing values from varied data schema  When possible, estimate and convert qualitative values to numeric  Perhaps place new converted values in enterprise data warehouse

© 2006 Tom Beckman and Ronald P. Reck14 Issues in Semantic Systems No consolidation/harmonization of existing legacy databases:  Terms are semantically under-defined and under-described: must find and validate possible synonyms across disparate legacy DBs, metadata, and text/content.  Terms as related and organized into concept hierarchy, but more general or specific.  Multiple attributes and their values define and describe the concept and context.  Consolidate terms/concepts from disparate DBs: determine synonym, similarity, and context.  Consolidate attributes from disparate DBs: is the same feature being described?  Consolidate value typing and problematic conditions from disparate DBs Need improved description of concepts Need improved attribute typing and values Determine needed level of concept precision/granularity Creation of standard vocabularies Explicit representation of relationships, importance, and uncertainty Weak accuracy of matching and access function How to assert new facts Handling of Synonyms, Antonyms, Typing, and Metadata

© 2006 Tom Beckman and Ronald P. Reck15 Semantic Web Components Domain Content: Knowledge, experience, and expertise Domain Taxonomy and Ontology:  RDF/OWL  Object Methods: Inheritance and Classification Organization and Structure: Web sites and document collections Classification Methods:  Similarity-based  Rule-based  Network-based  Object-based Indexing: Item typing and meta-tagging Linguistics: Natural Language Processing & Text Generation Search Query: Keyword Bayesian Search & Semantic Search Entity Extraction: People, places, and events Analysis Methods: Data Mining, Text Mining, Link Analysis, Machine Learning, & Knowledge Discovery Intelligent Agents: Simulation and Modeling

© 2006 Tom Beckman and Ronald P. Reck16 Issues in Knowledge Representation and Semantic Interoperability Tom Beckman Principal, Beckman Associates Ronald P. Reck Principal, RRecktek Questions??