Trio A System for Data, Uncertainty, and Lineage

Slides:



Advertisements
Similar presentations
Uncertainty in Data Integration Ai Jing
Advertisements

CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Query Methods (SQL). What is SQL A programming language for databases. SQL (structured Query Language) It allows you add, edit, delete and run queries.
LIVE A lineage-supported, versioned DBMS  Anish Das Sarma  Martin Theobald  Jennifer Widom.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Relational Algebra Chapter 4, Part A.
Uncertainty Lineage Data Bases Very Large Data Bases
Trio: A System for Data, Uncertainty, and Lineage Search “stanford trio”
Trio: A System for Data, Uncertainty, and Lineage Search “stanford trio”
Trio: A System for Data, Uncertainty, and Lineage Jennifer Widom et al Stanford University.
Trio: A System for Data, Uncertainty, and Lineage Jennifer Widom Stanford University.
1 Introduction to databases concepts CCIS – IS department Level 4.
OpenMDR: Alternative Methods for Generating Semantically Annotated Grid Services Rakesh Dhaval Shannon Hastings.
Chapter 2 CIS Sungchul Hong
ULDBs: Databases with Uncertainty and Lineage O. Benjelloun, A. Das Sarma, A. Halevy, J. Widom.
2. Database System Concepts and Architecture
Component 4: Introduction to Information and Computer Science Unit 6: Databases and SQL Lecture 3 This material was developed by Oregon Health & Science.
Component 4/Unit 6c Topic III Structured Query Language Background information What can SQL do? How is SQL executed? SQL statement characteristics What.
(1) Introduction to Models using the Play Framework Philip Johnson Collaborative Software Development Laboratory Information and Computer Sciences University.
Database Systems DBMS Environment Data Abstraction.
EXAM 1 NEXT TUESDAY…. EXAMPLE QUESTIONS 1.Why is the notion of a “state” important in relational database technology? What does it refer to? 2.What do.
Efficient RDF Storage and Retrieval in Jena2 Written by: Kevin Wilkinson, Craig Sayers, Harumi Kuno, Dave Reynolds Presented by: Umer Fareed 파리드.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
UCL DEPARTMENT OF SPACE AND CLIMATE PHYSICS MULLARD SPACE SCIENCE LABORATORY Taverna Plugin VAMDC and HELIO (part of the ‘taverna-astronomy’ edition) Kevin.
SQL.. AN OVERVIEW lecture3 1. Overview of SQL 2  Query: allow questions to be asked of the data and display only the information required. It can include.
Trio-One: Layering Uncertainty and Lineage on a Conventional DBMS Martin Theobald Jennifer Widom Stanford University.
SQL Injection Anthony Brown March 4, 2008 IntroductionQuestionsBackgroundTechniquesPreventionDemoConclusions.
Jennifer Widom Relational Databases The Relational Model.
Introduction to Core Database Concepts Getting started with Databases and Structure Query Language (SQL)
Chapter 3 The Relational Model. Why Study the Relational Model? Most widely used model. Vendors: IBM, Informix, Microsoft, Oracle, Sybase, etc. “Legacy.
Machine Language Computer languages cannot be directly interpreted by the computer – they are not in binary. All commands need to be translated into binary.
1 Working Models for Uncertain Data Anish Das Sarma, Omar Benjelloun, Alon Halevy, Jennifer Widom Stanford InfoLab.
Understanding Core Database Concepts Lesson 1. Objectives.
CS 325 Spring ‘09 Chapter 1 Goals:
Chapter 2: Database System Concepts and Architecture - Outline
ODBC, OCCI and JDBC overview
Oracle & SQL Introduction
Chapter 1: Introduction
Special Topics in CCIT: Databases
Querying Relational Databases
CS422 Principles of Database Systems Course Overview
1Z0-071 Exam : Oracle Database 12c SQL
Database Management:.
Chapter 1: Introduction
Introduction To Database Systems
DATABASE SQL= Structure Query Language مبادئ قواعد بيانات
TRIO Data Uncertainty Lineage Data Model Query Language System
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
 DATAABSTRACTION  INSTANCES& SCHEMAS  DATA MODELS.
Introduction to Database Systems
Chapter 2 Database Environment.
Querying Relational Databases
Relational Databases The Relational Model.
Relational Databases The Relational Model.
2018, Fall Pusan National University Ki-Joune Li
The Relational Model Textbook /7/2018.
Data Model.
The Trio System for Data, Uncertainty, and Lineage: Overview and Demo
SQL .. An overview lecture3.
Instructor 彭智勇 武汉大学软件工程国家重点实验室 电话:
Introduction of Week 11 Return assignment 9-1 Collect assignment 10-1
Probabilistic Databases
Chapter 1: Introduction
Chapter 1: Introduction
Chapter 1: Introduction
MIS2502: Data Analytics MySQL and MySQL Workbench
Chapter 2 Database Environment Pearson Education © 2009.
Database SQL.
Understanding Core Database Concepts
Chapter 1: Introduction
Database Dr. Roueida Mohammed.
Presentation transcript:

Trio A System for Data, Uncertainty, and Lineage Data Model Uncertainty Lineage ULDBs Uncertainty-Lineage Databases Saw (witness,car) (Mary,Honda) ∥ (Mary,Toyota) ∥ (Mary,Mazda) (Susan,Lexus) ID Drives (person,car) 21 (Bob,Honda) ∥ (Hank,Honda) 22 (Jim,Toyota) ∥ (Jim,Acura) ID Saw (witness,car) 11 (Mary,Honda) ∥ (Mary,Toyota) Understandable (we think ) and complete = can represent any set of possible instances ? Suspects = πperson(Saw ⋈ Drives) Alternatives Maybe annotations Confidences ID Suspects (person) 31 Bob ∥ Hank 32 Jim λ(31,1) = (11,1),(21,1); λ(31,2) = (11,1),(21,2) λ(32,1) = (11,2),(22,1) ? ? Drives (person,car) (Bob,Honda): 0.6 ∥ (Hank,Honda): 0.4 (Jim,Toyota): 0.2 ∥ (Jim,Acura): 0.5 See ULDB talk Friday 11:00 AM, room 310 Lineage... Enables simple & consistent representation of uncertain data Correlates uncertainty in query results with uncertainty in the input data Can make computation over uncertain data more efficient ? Formal semantics based on possible instances Query Language TriQL The Trio Query Language Confidences in Query Results Start with SQL … Modify semantics for ULDBs Default probabilistic interpretation Saw (witness,car) (Mary,Honda): 0.6 ∥ (Mary,Toyota): 0.4 Drives (person,car) (Bob,Honda): 0.3 ∥ (Hank,Honda): 0.7 Relational query Q on ULDB D semantics (logical) implementation Create Table Suspects(person) as Select Drives.person From Saw, Drives Where Saw.car = Drives.car Suspects (person) Bob: 0.18 ∥ Hank: 0.42 ? creates Suspects table with lineage to Saw and Drives But query can override implementation of Q D D + Result Select Drives.person, min(conf(Saw),conf(Drives)) as conf From Saw, Drives … … add constructs for querying uncertainty and lineage possible instances representation of instances Q on each instance Suspects (person) Bob: 0.3 ∥ Hank: 0.6 ? D1, D2, …, Dn Q(D1), Q(D2), …, Q(Dn) Built-in functions conf(), lineage(), lineage*() Confidences computed on-demand based on lineage feature System System Structure Demo Database DDL commands TriQL queries Schema browsing Table browsing Explore lineage On-demand confidence computation TrioExplorer (GUI client) Command-line client Trio API and translator (Python) Standard SQL Standard relational DBMS Encoded Data Tables Trio Metadata Lineage Tables Trio Stored Procedures DATA UNCERTAINTY LINEAGE