Unclassified//For Official Use Only 1 Analysis of Uncertain Data in Text Documents Carnegie Mellon University and DYNAM i X Technologies PI : Jaime G.

Slides:



Advertisements
Similar presentations
Norman Sadeh – Carnegie Mellon University – DAML PI Meeting- Feb. 13, 2002 DAML PI Meeting Status Briefing A Semantic Web Environment for Mobile Context-Aware.
Advertisements

Systems Investigation and Analysis
Information Extraction Lecture 4 – Named Entity Recognition II CIS, LMU München Winter Semester Dr. Alexander Fraser, CIS.
Learning Semantic Information Extraction Rules from News The Dutch-Belgian Database Day 2013 (DBDBD 2013) Frederik Hogenboom Erasmus.
ITAuditing Using GAS & CAATs
Consultation on Senior Cycle Science Anna Walshe Brendan Duane
Requirements Engineering n Elicit requirements from customer  Information and control needs, product function and behavior, overall product performance,
Irwin/McGraw-Hill Copyright © 2000 The McGraw-Hill Companies. All Rights reserved Whitten Bentley DittmanSYSTEMS ANALYSIS AND DESIGN METHODS5th Edition.
Effective Coordination of Multiple Intelligent Agents for Command and Control The Robotics Institute Carnegie Mellon University PI: Katia Sycara
An Agent Capable of Learning to Create and Maintain Websites Anthony Tomasic, Ravi Mosur Alex Rudnicky, Raj Reddy, John Zimmerman Carnegie Mellon University.
A Probabilistic Framework for Information Integration and Retrieval on the Semantic Web by Livia Predoiu, Heiner Stuckenschmidt Institute of Computer Science,
Automating Discovery from Biomedical Texts Marti Hearst & Barbara Rosario UC Berkeley Agyinc Visit August 16, 2000.
1 Agenda 01/13/05 Announcement Database Development Process (Chapter 2)
Experimental Components for the Evaluation of Interactive Information Retrieval Systems Pia Borlund Dawn Filan 3/30/04 610:551.
UCB CS Research Fair Search Text Mining Web Site Usability Marti Hearst SIMS.
The LINDI Project Linking Information for New Discoveries UIs for building and reusing hypothesis seeking strategies. Statistical language analysis techniques.
Overview of Search Engines
United Nations Economic Commission for Europe Statistical Division Applying the GSBPM to Business Register Management Steven Vale UNECE
ArcGIS Workflow Manager An Introduction
Intégration Sémantique de l'Information par des Communautés d'Intelligence en Ligne ISICIL.
Striving for Quality Using continuous improvement strategies to increase program quality, implementation fidelity and durability Steve Goodman Director.
Data Management Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Requirements Analysis
Impact of different relation extraction methods on network analysis results Jana Diesner.
Advanced Topics in Requirement Engineering. Requirements Elicitation Elicit means to gather, acquire, extract, and obtain, etc. Requirements elicitation.
Building Search Portals With SP2013 Search. 2 SharePoint 2013 Search  Introduction  Changes in the Architecture  Result Sources  Query Rules/Result.
Reyyan Yeniterzi Weakly-Supervised Discovery of Named Entities Using Web Search Queries Marius Pasca Google CIKM 2007.
DECISION SUPPORT SYSTEM ARCHITECTURE: The data management component.
Help Desk System How to Deploy them? Author: Stephen Grabowski.
AnswerBus Question Answering System Zhiping Zheng School of Information, University of Michigan HLT 2002.
Natacha Fernandez-Ureña, Database Manager Jose Hernandez, Supervising Case Manager Shannon Skinner, Coordinator for Testing & Linkage to Care Dustin Przybilla,
© 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang 5-1 Chapter 5 Business Intelligence: Data.
OBJECT ORIENTED SYSTEM ANALYSIS AND DESIGN. COURSE OUTLINE The world of the Information Systems Analyst Approaches to System Development The Analyst as.
Carnegie Mellon School of Computer Science Copyright © 2001, Carnegie Mellon. All Rights Reserved. JAVELIN Project Briefing 1 AQUAINT Phase I Kickoff December.
ISM 5316 Week 3 Learning Objectives You should be able to: u Define and list issues and steps in Project Integration u List and describe the components.
Interviewing and Deception Detection Techniques for Rapid Screening and Credibility Assessment Dr. Jay F. Nunamaker, Jr. Dr. Judee K. Burgoon.
Content The system development life cycle
Integrated Risk Management Charles Yoe, PhD Institute for Water Resources 2009.
1 Introduction to Software Engineering Lecture 1.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
………….AUSTRALIAN GREENHOUSE EMISSIONS INFORMATION SYSTEM………… Inventory Management & Archiving Australian Greenhouse Emissions Information System (AGEIS)
Information in the Digital Environment Information Seeking Models Dr. Dania Bilal IS 530 Spring 2005.
PAINT RAPID : Representation and Analysis of Probabilistic Intelligence Data Carnegie Mellon University DYNAM i X Technologies PI: Jaime Carbonell Eugene.
Principles of Information Systems, Sixth Edition Systems Investigation and Analysis Chapter 12.
Faculty Faculty Richard Fikes Edward Feigenbaum (Director) (Emeritus) (Director) (Emeritus) Knowledge Systems Laboratory Stanford University “In the knowledge.
September 25, 2006 NASA Feasibility Study Status Update.
1 RAPID: Representation and Analysis of Probabilistic Intelligence Data Carnegie Mellon University PI : Prof. Jaime G. Carbonell / / (412)
United Nations Oslo City Group on Energy Statistics OG7, Helsinki, Finland October 2012 ESCM Chapter 8: Data Quality and Meta Data 1.
FirewallPK Security tool for centralized Access Control List Management th RoEduNet International Conference - Networking in Education and Research.
1 Overview of Maintenance CPRE 416-Software Evolution and Maintenance-Lecture 3.
You Can’t Afford to be Late!
Relevance Models and Answer Granularity for Question Answering W. Bruce Croft and James Allan CIIR University of Massachusetts, Amherst.
Clinical research data interoperbility Shared names meeting, Boston, Bosse Andersson (AstraZeneca R&D Lund) Kerstin Forsberg (AstraZeneca R&D.
Unclassified//For Official Use Only 1 RAPID: Representation and Analysis of Probabilistic Intelligence Data Carnegie Mellon University PI : Prof. Jaime.
Analysis of Uncertain Data: Tools for Representation and Processing Bin Fu Eugene Fink Jaime G. Carbonell.
Scheduling with Uncertain Resources Eugene Fink, Jaime G. Carbonell, Ulas Bardak, Alex Carpentier, Steven Gardiner, Andrew Faulring, Blaze Iliev, P. Matthew.
David M. Kroenke and David J. Auer Database Processing Fundamentals, Design, and Implementation Appendix B: Getting Started in Systems Analysis and Design.
Reasoning under Uncertainty Eugene Fink LTI Seminar November 16, 2007.
Automated Assistant for Crisis Management (Reflective Agent with Distributed Adaptive Reasoning) RADAR.
Viewpoint Modeling and Model-Based Media Generation for Systems Engineers Automatic View and Document Generation for Scalable Model- Based Engineering.
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
VIEWS b.ppt-1 Managing Intelligent Decision Support Networks in Biosurveillance PHIN 2008, Session G1, August 27, 2008 Mohammad Hashemian, MS, Zaruhi.
Scheduling with uncertain resources Collaboration with the user Eugene Fink, Ulaş Bardak, Brandon Rothrock, Jaime Carbonell Carnegie Mellon University.
SIP Rubric 2004 Kay Poyner Brown West 40 ISC #2 December 6-7, 2004 Modified from presentation developed by ISBE.
Unclassified//For Official Use Only 1 RAPID: Representation and Analysis of Probabilistic Intelligence Data Carnegie Mellon University PI : Prof. Jaime.
You Can’t Afford to be Late!
An Empirical Study of Learning to Rank for Entity Search
CS246: Information Retrieval
Metadata supported full-text search in a web archive
Presentation transcript:

Unclassified//For Official Use Only 1 Analysis of Uncertain Data in Text Documents Carnegie Mellon University and DYNAM i X Technologies PI : Jaime G. Carbonell / / (412) Co-PI: Eugene Fink / / (412) HNC and Fair Isaac Co-PI: Dayne Freitag / / (858) Co-PI: Richard Rohwer / / (858) PAINT

Unclassified//For Official Use Only 2 Proposed functionality We will integrate the text-extraction system developed by HNC / Fair Isaac with the uncertainty-analysis system developed by CMU / DYNAM i X. The integrated system will support the following main capabilities. Extraction of relevant facts, relations, and causal links from natural-language text documents Automated intent inferences and identification of surprising developments based on uncertain data Evaluation of given hypotheses Proactive information gathering Application to the analysis of Iranian nanotechnology plans and capabilities We will also build an external API for future integration with other PAINT systems, and evaluate its effectiveness by implementing an optional “loose” integration with the predictive-analysis system developed by Berkeley / LLC.

Unclassified//For Official Use Only 3 HNC / Fair Isaac: REALISM System Knowledge base entities, relations, implication pool Unstructured text archive by genre Basic IE model learning [background] Information extraction (entities and relations) Abstract IE model learning [background] Data acquisition [Real-time IR] IE models Academic Newswire Blog... Genre detection Background / model-learning data paths Real-time / modeling data paths Extracted relations and causal links (structured rules) Extracted facts and entities (structured tables)

Unclassified//For Official Use Only 4 HNC / Fair Isaac: REALISM System Output: Large structured tables of relevant facts and entities, which include uncertainty Inference-rule representation of relations and causal links, also including uncertainty Input: Requirements and filters for the information extraction Natural-language documents World-wide web

Unclassified//For Official Use Only 5 Critical uncertainties CMU / DYNAM i X : RAPID System Uncertain inference rules Query matches Evaluation of hypotheses Prioritized plans for proactive data collection Inferred facts Learned inference rules Goals, queries, and hypotheses Reality interpretation: Tables of uncertain facts Manual entry, selection, and editing of knowledge

Unclassified//For Official Use Only 6 CMU / DYNAM i X : RAPID System Output: Inferences from uncertain data New learned inference rules Exact and approximate matches for given queries Hypothesis assessment Proactive plans for collecting additional data Input: “Reality interpretation tables,” which represent uncertain facts Uncertain inference rules Queries for specific relevant data Analyst’s hypotheses

Unclassified//For Official Use Only 7 Integrated system Reality interpretation: Tables of uncertain facts Uncertain inference rules Information requests Query matches Evaluation of hypotheses Plans for proactive data collection Manual entry, selection, and editing of knowledge Inferred facts Learned inference rules HNC / Fair Isaac CMU / DYNAM i X Structured relations and causal links Structured facts and entities Goals, queries, and hypotheses Topic filters Testing with Berkeley / LLC System

Unclassified//For Official Use Only 8 Empirical evaluation Data: We will use public data about Iranian nanotechnology in the system evaluation. When the PAINT challenge-problem data about Iran becomes available, we will combine it with the public data. Component evaluation: We will measure the following performance factors: Accuracy and completeness of text extraction Accuracy of hypothesis evaluation Effectiveness of data-collection plans Speed of each system component

Unclassified//For Official Use Only 9 Empirical evaluation Specific tasks: Find data relevant to given hypotheses Evaluate the validity of these hypotheses Identify critical uncertainties and propose a plan for collecting additional relevant data Performance measurements: Number of tasks completed during the experiment Accuracy of hypothesis evaluation Effectiveness of proactive data-collection plans Experimental group: Use of REALISM / RAPID Control group: Use of standard tools Evaluation of the integrated system: We will compare the productivity of subjects using the developed system with that of subjects who perform the same tasks using off-the-shelf tools.

Unclassified//For Official Use Only 10 Empirical evaluation Component utility: We will also evaluate the utility of REALISM and RAPID by comparing the productivity of subjects under the following three conditions: Use of the integrated system Use of REALISM without RAPID Use of RAPID without REALISM

Unclassified//For Official Use Only 11 Schedule Data models and API sNov Initial versions of separate REALISM and RAPID systems; feasibility demo Feb Integrated REALISM / RAPID system; initial integrated demo May 2008 Evaluation of the integrated system; extended demo Aug. 2008