Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 1 The Evolution of Semantic Technologies-The Value of Merging Smart Data With Big Data Eric Little, PhD VP.

Slides:



Advertisements
Similar presentations
Autodesk Integrations Overview SmartDesk A seamlessly integrated, affordable, out-of-the-box, Windows based drawing and document management tool for.
Advertisements

Taxonomy & Ontology Impact on Search Infrastructure John R. McGrath Sr. Director, Fast Search & Transfer.
BI Web Intelligence 4.0. Business Challenges Incorrect decisions based on inadequate data Lack of Ad hoc reporting and analysis Delayed decisions.
© 2014 Cognizant 4 th March 2015 MBaaS: Mobile Backend as a Service Pablo Gutiérrez / Senior Mobility developer.
Software Modeling SWE5441 Lecture 3 Eng. Mohammed Timraz
Key-word Driven Automation Framework Shiva Kumar Soumya Dalvi May 25, 2007.
Data Model driven applications using CASE Data Models as the nucleus of software development in a Computer Aided Software Engineering environment.
1.Data categorization 2.Information 3.Knowledge 4.Wisdom 5.Social understanding Which of the following requires a firm to expend resources to organize.
Information and Business Work
Building Enterprise Applications Using Visual Studio ®.NET Enterprise Architect.
Business Intelligence Michael Gross Tina Larsell Chad Anderson.
Copyright 2012, Enterprise Elements, Inc. Elements DataTraction Information Management for the Enterprise.
Integration and Insight Aren’t Simple Enough Laura Haas IBM Distinguished Engineer Director, Computer Science Almaden Research Center.
1 Meeting the Reporting Challenges at International Paper.
Irwin/McGraw-Hill Copyright © 2004 The McGraw-Hill Companies. All Rights reserved Whitten Bentley DittmanSYSTEMS ANALYSIS AND DESIGN METHODS6th Edition.
DSS: Decision Support Systems and AI: Artificial Intelligence
Knowledge Portals and Knowledge Management Tools
Copyright © 2014 Pearson Education, Inc. 1 It's what you learn after you know it all that counts. John Wooden Key Terms and Review (Chapter 6) Enhancing.
Framework for Model Creation and Generation of Representations DDI Lifecycle Moving Forward.
Redefining Perspectives A thought leadership forum for technologists interested in defining a new future June COPYRIGHT ©2015 SAPIENT CORPORATION.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization.
Data Mining Techniques
Module 3: Business Information Systems Chapter 11: Knowledge Management.
Bina Nusantara 2 C H A P T E R INFORMATION SYSTEM BUILDING BLOCKS.
Advances in Technology and CRIS Nikos Houssos National Documentation Centre / National Hellenic Research Foundation, Greece euroCRIS Task Group Leader.
Ihr Logo Data Explorer - A data profiling tool. Your Logo Agenda  Introduction  Existing System  Limitations of Existing System  Proposed Solution.
© 2015 Modus Operandi, Inc., All Rights Reserved. 1 SHIFTING THE PARADIGM: THE VALUE OF INNOVATION IN DRIVING ENTREPRENEURSHIP.
Computers Are Your Future Tenth Edition Chapter 12: Databases & Information Systems Copyright © 2009 Pearson Education, Inc. Publishing as Prentice Hall1.
Data Management Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Almost 4 decades of Advanced Analytics & DM expertise.
Charles Tappert Seidenberg School of CSIS, Pace University
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
 DATABASE DATABASE  DATABASE ENVIRONMENT DATABASE ENVIRONMENT  WHY STUDY DATABASE WHY STUDY DATABASE  DBMS & ITS FUNCTIONS DBMS & ITS FUNCTIONS 
2131 Structured System Analysis and Design By Germaine Cheung Hong Kong Computer Institute Lecture 2 (Chapter 2) Information System Building Blocks.
November 2003 Presented to “Commercializing RDF” Semantic Software Solutions for Enterprise Web Management International World Wide Web Conference 2004.
Satish Ramanan April 16, AGENDA Context Why - Integrate Search with BI? How - do we get there? - Tool Strategy What - is in it for me ? - Outcomes.
Chapter © 2006 The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/ Irwin Chapter 7 IT INFRASTRUCTURES Business-Driven Technologies 7.
OBJECT ORIENTED SYSTEM ANALYSIS AND DESIGN. COURSE OUTLINE The world of the Information Systems Analyst Approaches to System Development The Analyst as.
Creating New Business Value with Big Data Attivio Active Intelligence Engine®
Information Explosion. Reality: New Machine-Generated Data Non-relational and relational data outside of the EDW † Source: Analytics Platforms – Beyond.
KMS Products By Justin Saunders. Overview This presentation will discuss the following: –A list of KMS products selected for review –The typical components.
Chapter 3 DECISION SUPPORT SYSTEMS CONCEPTS, METHODOLOGIES, AND TECHNOLOGIES: AN OVERVIEW Study sub-sections: , 3.12(p )
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Project Management May 30th, Team Members Name Project Role Gint of Communications Sai
Big Data Analytics Platforms. Our Team NameApplication Viborov MichaelApache Spark Bordeynik YanivApache Storm Abu Jabal FerasHPCC Oun JosephGoogle BigQuery.
Semantic Data Extraction for B2B Integration Syntactic-to-Semantic Middleware Bruno Silva 1, Jorge Cardoso 2 1 2
8a Certified. About Us  Headquarters in Vienna, VA  Service Disabled Veteran-owned Small Business  SBA 8(a) program participant  Small Disadvantaged.
Smart Grid Big Data: Automating Analysis of Distribution Systems Steve Pascoe Manager Business Development E&O - NISC.
E-commerce Architecture Ayşe Başar Bener. Client Server Architecture E-commerce is based on client/ server architecture –Client processes requesting service.
BI TOOLS FOR THE IBM I …AND BEYOND A quick look at IBM’s DB2 Web Query for the i and WebFocus.
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
Leverage Big Data With Hadoop Analytics Presentation by Ravi Namboori Visit
Building Enterprise Applications Using Visual Studio®
Big Data Enterprise Patterns
Digital Workplace.
Software Engineering and Best Practices
Pentaho 7.1.
MANAGING KNOWLEDGE FOR THE DIGITAL FIRM
DeFacto Planning on the Powerful Microsoft Azure Platform Puts the Power of Intelligent and Timely Planning at Any Business Manager’s Fingertips Partner.
XtremeData on the Microsoft Azure Cloud Platform:
Improve Patient Experience with Saama and Microsoft Azure
Business Intelligence
CRM DMP – a marriage of two acronyms
Customer 360.
Presentation transcript:

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 1 The Evolution of Semantic Technologies-The Value of Merging Smart Data With Big Data Eric Little, PhD VP – Chief Scientist

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 2 Who is Modus Operandi?  Privately held small business headquartered in Melbourne, FL. Satellite locations in Aberdeen, MD and Ft. Huachuca, AZ. 82% of employees possess a security clearance.  U.S. Government is our primary customer Expanding into select commercial markets

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 3 IT’s common challenge  TOO BIG - Too much data or too many variables  SILOED - Data is in legacy silos so nothing is integrated  LOST EXPERTISE - SME info is lost in people’s heads  NO EXCHANGE - No good processes for data exchange  NO VIZ - No good ways to visualize data  NO QUALITATIVE - Cannot use statistical tools to get qualitative answers  DIRTY DATA - Too many errors in the data  NO RULES - No way to capture business rules without big coding effort  NO VOCAB - No good vocabularies exist to capture data elements  MANY MODELS - Too many data models to be controlled effectively

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 4 MODUS APPROACH Rather than build one type of technology we realize the need for an end-to-end platform to provide solutions for our customers

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 5 The “Cognitive Evolution” of Intelligent Software  Semantic technologies are part of an IT evolution from code to data centricity In the Code-Centric years, data was often stored in flat files with no structure, while complex, procedural “edit” programs contained all knowledge about the data The creation of databases, specifically Network and RDBMS, was one of the first steps leading to Data-Centric evolution The last decade has seen standards such as XML, RDF, Web services, and now OWL, that further evolve IT to a Data-Centric environment  Big data and scalability is now helping to shape semantic tech at large scale. Big data science Retrieval at Scale is most important

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 6 New & Expanding Tech Areas  The past few years have seen a significant rise in new tech fields data science, big data analytics, semantic technologies, natural language processing, graph computing, and systemics  These areas provide new paradigms for data analysis and integration  These are driving new innovations in the ways people can access and use their data.

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 7 Innovation is Key in These Types of Tech Spaces  The idea seems straight-forward and easy But it is difficult to find true spots of Blue Ocean  Requires new approaches that are taken from numerous disciplines  Small businesses need to compete by focusing and being disruptive Being disruptive involves the counter intuitive approach of focusing on specific market segments Requires an ability to be nimble and respond quickly to needs (iterative prototyping) Every wave is different – reading the wave is key

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 8 SEMANTIC TECHNOLOGY

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 9 Semantics and Reasoning

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 10 Semantic Approach Improves Data Access 10 Traditional Approach Semantic Approach Database Experts Domain Experts & Scientists Systems Engineers Management & Executives Manual Data Correlation Manual Report Generation (High Potential for Error) Integrated Classifications/Schemas Automated Reasoning Capabilities (Significant Error Reduction) Domain Experts & Scientists Systems Engineers Management & Executives Ontology Engine

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 11 Semantic Approach Simplifies Queries Traditional Approach Database Experts Query Must Contain: 1.Data Requirements 2.All Logi c Required to Relate the Data (Rules, Joins, Decode, Sub-queries, etc.) Complexity: HIGH Reusability: LOW-MED Semantic Approach Reasoning is done on the user side for each query Reasoning is performed by Ontobroker within the system Database Experts Scientists, Systems Engineers Management & Executives Query Must Contain: 1.Data Requirements only Complexity: LOW Reusability: HIGH (Logic embedded in Model)

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | Building Semantic Profiles From Raw Data Key data elements are identified – creating lexicon of important terms Data elements are categorized into appropriate classes – ranges are captured for autoclassification Can be applied to any type of data elements: equipment, reports, products, processes, etc. Advanced logics allow for reasoning over data sets such that new patterns and information can be gained

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 13 Utilizing Semantics to Integrate Disparate Medical Data ACETAMINOPHEN (TYLENOL) LEVEL Acetaminophen (Tylenol) Level Acetaminophen (Tylenol) Level ACETAMINOPHEN (TYLENOL) LEVEL Acetaminophen [ Tylenol] Acetaminophen + Codeine Acetaminophen + Codeine Acetaminophen + Codeine Acetaminophen + Codeine Acetaminophen + Codeine (120mg-12mg/5ml) (NF) Liquid Acetaminophen + Codeine (120mg-12mg/5ml) (NF) Liquid Acetaminophen + Codeine (120mg-12mg/5ml) (NF) Liquid Acetaminophen mg PO q4h PRN Temp > Acetaminophen mg PO q6h PRN Pain 64654ACETAMINOPHEN SUPP 325 MG SUPP ACETAMINOPHEN SUPP 325 MG SUPP Acetaminophen Tab 325MG ACETAMINOPHEN TAB 325 MG TAB ACETAMINOPHEN TAB 325 MG TAB ACETAMINOPHEN TAB 325 MG TAB ACETAMINOPHEN TAB 325 MG TAB ACETAMINOPHEN TAB 325 MG TAB ACETAMINOPHEN TAB 325 MG TAB ACETAMINOPHEN TAB 325 MG TAB Acetaminophen (160mg/5ml) Suspension Hospital 1 Data Hospital 2 Data Disparate data sources can be ingested by the system and automatically classified into their appropriate class, attributes, etc. The models only need to be developed initially with the help of medical SMEs (as opposed to continuous point-to-point mappings with traditional systems). Common Data Model

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 14 Classification Schemas Must Reflect Subject Matter Expertise Orbis Technologies, Inc. Proprietary14 SME’s are often ill equipped to capture their knowledge semantically Knowledge can be captured in ontologies (as attributes, advanced relationships, etc.) – but this requires a separate skills set Multiple ontologies can be integrated to capture enterprise-wide applications for advanced business intelligence

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 15 Federated Ontology Layers Allow for Advanced Data Modeling

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 16 Putting It All Together Into A Platform Unstructured Outcomes Data Structured Data Customizable User Interfaces Ontology Engine

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 17 BIG DATA – NOW THAT YOU HAVE SEMANTICS, HOW TO SCALE…

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 18 The Problem of Big Data is Real (And Closing In)  The past couple of decades have been spent on data gathering and storage  Most Data Stores were not built to get data out  The new push is connecting data  New high-performance systems are required to meet those needs Data solutions must be big, smart, and easy to deploy

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 19 Big Data Analytics Challenge for Intelligent Systems  Data Analysis in the many spaces requires near-real-time decision support tools.  Connecting the dots is paramount to successful and effective analysis  This requires a culmination of new techniques that combine robust data modeling and linkage (e.g., graphs) with high-performance computing capability

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 20 Capturing Complex Data Is Difficult  People are now attempting to utilize their data like never before. Semantics has shown significant promise but has not scaled well in the past. Entities, attributes, locations, temporal signatures, etc. result in data explosions  Breakthroughs in cloud computing and high performance graph stores are providing new ways to innovate data science.  Multiple users can now apply perspectives  Can be driven to an entire enterprise Built on Standards- based Approaches

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 21 Scaling semantics  Semantics has not scaled well in the past  Entities, inferred data, facets, over time, with quality attribution,… = a data explosion  Our newest breakthroughs in cloud computing and high performance graph stores allow semantics- at-scale BIG GRAPHS + +

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 22 Scaling semantics (cont.)  Enterprise-level graph computing requires cutting edge technology components  Data Ingest at Cloud scale – must be able to ingest millions of entities and thousands of documents per second. (Modus Operandi Wave Engine)  Data Storage (Triple Stores and Cloud DBs) 60 billion triples, sub-second queries, thousands of unstructured docs processed per second  Data Traversal (High-performance UI’s) – app stores and BI tools to provide a diverse user experience High- performance triple store Semantic Reasoner Graph- based Appliances

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 23 Avoiding the Hype Cycle

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 24 EASY-TO-USE DATA

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 25 EASY SMART BIG Easy UI’s Leverage Common Models Our user Interfaces are designed around common use models HDFS / Hadoop / MapReduce Accumulo Key Value Store Semantic search Geospatial views Semantic Wiki – collaborate Timelines Explore Visualizations Large-scale Semantic triple stores with reasoning

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 26 Driving the Knowledge to Multiple Users  Combining software tools in innovative ways allows for multiple users to view the same data at once. These technologies are providing new platforms that are driving new ways to utilize advanced analytics like never before  Information can be driven to multiple users in near-real- time for improved decision support End Users

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 27 Visualizing patterns  Correlations, associations and patterns require special purpose visualizations  Our ExtJS/Ozone framework enables fast assembly of point solutions  Patterns recognition leads to prediction  Prediction leads to prevention

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 28 Big Data Results in a Highly Intuitive UIs is Key  Complex data does not require complex UIs Many new tech innovations involve simple, intuitive front ends (apps)  Users must be able to quickly manipulate information  Must be able to quickly derive answers  Different technologies must be integrated into a common look and feel

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 29 Providing an End-to-End Solution  Many companies our size provide a capability or two Modus Operandi provides a complete platform for a multitude of user applications (and growing).  Information can be ingested from nearly any source (structured, semi-structured or unstructured). Common models such as UMLS, Ucore-SL, BFO, etc. Custom models can be created based on project specifics.  Information is stored in a high-performance graph knowledge base (we can integrate numerous ones – currently using Bigdata, Rya and Allegrograph).  Results can be driven to a wide variety of easy-to-use UI’s that can be highly customized to fit user needs.  Smart + Big + Easy provides a new means to successfully apply semantic technologies to large scale graph computing.

Jan. 22, 2013 | © 2013 Modus Operandi, Inc. | 30 THANK YOU QUESTIONS?