Streaming Knowledge Bases Onkar Walavalkar, Anupam Joshi Tim Finin and Yelena Yesha University of Maryland, Baltimore County 27 October 2008.

Slides:



Advertisements
Similar presentations
ROWLBAC – Representing Role Based Access Control in OWL
Advertisements

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
AVATAR: Advanced Telematic Search of Audivisual Contents by Semantic Reasoning Yolanda Blanco Fernández Department of Telematic Engineering University.
Data Mining Glen Shih CS157B Section 1 Dr. Sin-Min Lee April 4, 2006.
Active Databases as Information Systems
Ontologies and the Semantic Web by Ian Horrocks presented by Thomas Packer 1.
Novelty Detection and Profile Tracking from Massive Data Jaime Carbonell Eugene Fink Santosh Ananthraman.
Marakas: Decision Support Systems, 2nd Edition © 2003, Prentice-Hall Chapter Chapter 1: Introduction to Decision Support Systems Decision Support.
Introduction to Databases Transparencies
1 Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang, Assistant Professor Dept. of Computer Science & Information Engineering National Central.
Knowledge-Based NLP and the Semantic Web Sergei Nirenburg Institute for Language and Information Technologies University of Maryland Baltimore County Workshop.
Principle of Functional Verification Chapter 1~3 Presenter : Fu-Ching Yang.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Huimin Ye.
©Ian Sommerville 2000 Software Engineering, 6th edition. Chapter 5 Slide 1 Requirements engineering l The process of establishing the services that the.
OIL: An Ontology Infrastructure for the Semantic Web D. Fensel, F. van Harmelen, I. Horrocks, D. L. McGuinness, P. F. Patel-Schneider Presenter: Cristina.
Audumbar Chormale Advisor: Dr. Anupam Joshi M.S. Thesis Defense
DR EBTISSAM AL-MADI Computers in Dental Education.
Chapter 1 Overview of Databases and Transaction Processing.
Introduction to DBMS Purpose of Database Systems View of Data
Improving Data Discovery in Metadata Repositories through Semantic Search Chad Berkley 1, Shawn Bowers 2, Matt Jones 1, Mark Schildhauer 1, Josh Madin.
Amarnath Gupta Univ. of California San Diego. An Abstract Question There is no concrete answer …but …
Semantics for Big Data (,) Security and Privacy Tim Finin and Anupam Joshi University of Maryland, Baltimore County Baltimore MD NSF Workshop on Big Data.
Managing & Integrating Enterprise Data with Semantic Technologies Susie Stephens Principal Product Manager, Oracle
Bina Nusantara 2 C H A P T E R INFORMATION SYSTEM BUILDING BLOCKS.
SOUPA: Standard Ontology for Ubiquitous and Pervasive Applications Harry Chen, Filip Perich, Tim Finin, Anupam Joshi Department of Computer Science & Electrical.
Division of IT Convergence Engineering Towards Unified Management A Common Approach for Telecommunication and Enterprise Usage Sung-Su Kim, Jae Yoon Chung,
Tim Finin University of Maryland, Baltimore County 29 January 2013 Joint work with Anupam Joshi, Laura Zavala and our students SRI Social Media Workshop.
A Metadata Based Approach For Supporting Subsetting Queries Over Parallel HDF5 Datasets Vignesh Santhanagopalan Graduate Student Department Of CSE.
Chapter 1 : Introduction §Purpose of Database Systems §View of Data §Data Models §Data Definition Language §Data Manipulation Language §Transaction Management.
Research support was provided by NSF, award NSF-ITR-IIS , PI Tim Finin, UMBC. SPIRE Semantic Prototypes in Research Ecoinfomatics Approach We are.
1-1 System Development Process System development process – a set of activities, methods, best practices, deliverables, and automated tools that stakeholders.
Ontology Summit 2015 Track C Report-back Summit Synthesis Session 1, 19 Feb 2015.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
©Silberschatz, Korth and Sudarshan1.1Database System Concepts Chapter 1: Introduction Purpose of Database Systems View of Data Data Models Data Definition.
Nigel Koay, Pavandeep Kataria, and Radmilla Juric, Dipl.-Ing. University of Westminster, London, United Kingdom Telemedicine and e-Health.
POLICY ENGINE Research: Design & Language IRT Lab, Columbia University.
LOD for the Rest of Us Tim Finin, Anupam Joshi, Varish Mulwad and Lushan Han University of Maryland, Baltimore County 15 March 2012
UMBC an Honors University in Maryland 1 Information Integration and the Semantic Web Finding knowledge, data and answers Tim Finin University of Maryland,
Temporal Mediators: Integration of Temporal Reasoning and Temporal-Data Maintenance Yuval Shahar MD, PhD Temporal Reasoning and Planning in Medicine.
Supporting Researchers and Institutions in Exploiting Administrative Databases for Statistical Purposes: Istat’s Strategy G. D’Angiolini, P. De Salvo,
Efficient RDF Storage and Retrieval in Jena2 Written by: Kevin Wilkinson, Craig Sayers, Harumi Kuno, Dave Reynolds Presented by: Umer Fareed 파리드.
…optimise your IT investments Warehousing for low latency analytics Philip Howard Research Director – Bloor Research.
DReSS Engineering a Replay Application Based on RDF and OWL Chris Greenhalgh, Andy French, Jan Humble, Paul Tennent School of Computer Science, University.
1. 2 Preface In the time since the 1986 edition of this book, the world of compiler design has changed significantly 3.
2-1 A Federation of Information Systems. 2-2 Information System Applications.
PHS / Department of General Practice Royal College of Surgeons in Ireland Coláiste Ríoga na Máinleá in Éirinn Knowledge representation in TRANSFoRm AMIA.
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
Shridhar Bhalerao CMSC 601 Finding Implicit Relations in the Semantic Web.
CoOL: A Context Ontology Language to Enable Contextual Interoperability Thomas Strang, Claudia Linnhoff-Popien, and Korbinian Frank German Aerospace Centor.
A Semantic Web Approach for the Third Provenance Challenge Tetherless World Rensselaer Polytechnic Institute James Michaelis, Li Ding,
Automatic Discovery and Processing of EEG Cohorts from Clinical Records Mission: Enable comparative research by automatically uncovering clinical knowledge.
Automatic Video Editing Stanislav Sumec. Motivation  Multiple source video data – several cameras in the meeting room, several meeting rooms in teleconference,
Semantic Water Quality Portal Jin Guang Zheng and Ping Wang Tetherless World Constellation.
Helping the Cause of Medical Device Interoperability Through Standards- based Test Tools DoC/NIST John J. Garguilo January 25,
Vermelding onderdeel organisatie 5 maart The future of databases DBDM 07/08 Leiden Bas van den Berg, Patrick van Kouteren, Rosa Meijer, Mathijs.
Author: Akiyoshi Matonoy, Toshiyuki Amagasay, Masatoshi Yoshikawaz, Shunsuke Uemuray.
Versatile Information Systems, Inc International Semantic Web Conference An Application of Semantic Web Technologies to Situation.
Stream Reasoning with Linked Data Open Data Open Day 2013 Sina Samangooei, Nick Gibbins 26 June 2013.
Chapter 1 Overview of Databases and Transaction Processing.
Informatics for Scientific Data Bio-informatics and Medical Informatics Week 9 Lecture notes INF 380E: Perspectives on Information.
Anupam Joshi University of Maryland, Baltimore County Joint work with Tim Finin and several students Computational/Declarative Policies.
Building Trustworthy Semantic Webs
Semantic Event-based Service Oriented Architecture
Access Maintaining and Querying a Database
Data Warehouse.
9. Introduction to signal detection
Pervasive and wearable computing research 13 September 2006
UMBC AN HONORS UNIVERSITY IN MARYLAND
Jena HBase: A Distributed, Scalable, Efficient RDF Triple Store
Jena HBase: A Distributed, Scalable, Efficient RDF Triple Store
Presentation transcript:

Streaming Knowledge Bases Onkar Walavalkar, Anupam Joshi Tim Finin and Yelena Yesha University of Maryland, Baltimore County 27 October 2008

Streaming Knowledge Bases Onkar Walavalkar, Anupam Joshi Tim Finin and Yelena Yesha University of Maryland, Baltimore County 27 October 2008

Streaming Knowledge Bases Onkar Walavalkar, Anupam Joshi Tim Finin and Yelena Yesha University of Maryland, Baltimore County 27 October 2008

Overview Motivation Streaming databases Streaming knowledge bases Experiments and results Conclusions  Motivation  Stream DBs  Stream KBs  Experiments  Conclusions 

Operating Room of the Future ORs will be awash in low-level data, much of it noisy or incomplete Challenges include coping with the noise and interpreting the low- level data to recognize high-level events and activities ORF drugs patient Monitors staff tools RFID AwarePoint RFID Bluetooth WIFI devices  Motivation  Stream DBs  Stream KBs  Experiments  Conclusions 

Initial work in OR training UMD Mastri Center is experimenting with OR technologies and training environments The Human Patient Simulator from METI – Designed to react like a human – Responds to medical treatment Generates continuous streams of data, moderated by – Initial conditions (e.g. blunt trauma multiple injuries scenario) – human interactions  Motivation  Stream DBs  Stream KBs  Experiments  Conclusions 

Efficient Data Stream Management Data is stored/indexed in system Queries applied to stored data as they “stream through” Queries Index Results Data Query Index Results Data Traditional DBMSStream Management System Queries stored/indexed in system Data applied to stored queries as they “stream through” Several efforts: Tapestry, Aurora, TelegraphCQ  Motivation  Stream DBs  Stream KBs  Experiments  Conclusions 

Stream Processor (TelegraphCQ) Continuous Queries Patient Monitor RFID System Medicines Tools Staff Trend Analyzer Physiological Data Low-Level Event Processor Database Patient History Medical Supplies Staff Rule Base Assert facts Medical Encounter Record Video Clipper Assert facts Event Detection - Level 3 Event Detection - Level 2 Event Detection - Level 1 Events  Motivation  Stream DBs  Stream KBs  Experiments  Conclusions 

What’s wrong with this picture? We need to enhance this to support semantic interoperability for medical data & knowledge The medial community has a long history developing & using standard ontologies & metadata Incoming streams of data can be in rdf And reference terms in appropriate ontologies  Motivation  Stream DBs  Stream KBs  Experiments  Conclusions 

What’s wrong with this picture? Streaming Database systems use continuous queries specified over a sliding time window – e.g., [range by ‘30 seconds’ slide by ‘10 seconds’] Issues: – Where do we we do reasoning? – How do we answer queries against a sliding window of data?  Motivation  Stream DBs  Stream KBs  Experiments  Conclusions 

RDF Stream Processing Static Data Store RangeInfo PropertyTree DomainInfo InverseInfo Classtree input stream handler Special domain rules & queries Input Triple Stream Enhanced Stream Query for Class of Concern Detected Instances  Motivation  Stream DBs  Stream KBs  Experiments  Conclusions 

Experiments and results Three simple reasoners – Jena, in core – Pre-computed custom hash tables – Using tables in TelegraphCQ Various scenarios – Ontology size: MB – Number of subclasses: ,000 – Subclass depth: – Data rate: triples per second

Domain Example Monitor data stream looking for observations of invasive species from Bioblitz and eco-blogging data streams Uses our Ethan ontologies for ecoinformatics Tree of life (~340K taxons from ITIS and other sources) Species profiles Invasive species definitions Observation

Reasoning delay comparison for all approaches

VM Usage comparison of all 3 approaches

VM Usage for Jena for different classes

VM usage comparison for Hashtable and TCQ

Conclusions If the incoming triple data rate goes beyond a certain limit, the reasoning speed starts to lag and tends to slow down the incoming stream. The speedup achieved by using TCQ and a hashtable prove the value of pre-processing an ontology, particularly for fast streaming facts.