Presentation is loading. Please wait.

Presentation is loading. Please wait.

Cooperative Query Answering Based on a talk by Erick Martinez.

Similar presentations


Presentation on theme: "Cooperative Query Answering Based on a talk by Erick Martinez."— Presentation transcript:

1 Cooperative Query Answering Based on a talk by Erick Martinez

2 MOTIVATION: Responses to queries posed by a user of a database do not always contain the information required DB and information systems are often hard to use because they do not explicitly attempt to cooperate with their users. They answer literally the queries posed to them A user might need more information than requested, or might actually need different information An answer with extra or alternative information may be more useful and less misleading to a user

3 Cooperative Answer (CA) A CA should be a correct, non-misleading, and useful answer to a query.

4 Grice's maxims Maxim of Quality: a system should never give an answer which might mislead the user Maxim of Quantity: an answer should not be more informative, or more detailed, than necessary Maxim of Relation: an answer should be always relevant to the user who asked the question Maxim of Manner: an answer should not be ambiguous, leaving the user with choices to make about its meaning Q 0 : “Which students are enrolled? A 0 : “joana, jacob, shakil, …“ A 0 : “  X. student(X)“

5 Database Stonewalling Q 1 : "Who passed COSC6115 in the winter semester of 2001? A 1 : “No one“ Q 2 : "Who failed COSC6115 in the winter semester of 2001? A 2 : “No one“ Q 3 : "Who taught COSC6115 in the winter semester of 2001? A 3 : “No one" DB stonewall - will answer a yes/no question with a yes or no regardless of whether the answer is misleading.

6 QUERY / ANSWER SYSTEMS Natural language interfaces Databases (relational) Logic programming and deductive databases ( * )

7 Deductive Databases (DDB) A deductive database consists of three parts: Facts – the set of all facts constitute the extensional database (EDB) Rules – the set of rules constitute the intensional database (IDB) Integrity constraints (IC) – the set of logical formula that must be true of the database e.g. IC 0 :  enrolled_in(X, Y), not student(X). Distinction between data and knowledge: Data represented in the EDB, and knowledge in the IDB and IC. Knowledge is the semantics of the DB, that which must be true of the DB’s state, and the logical conclusions that must follow from given data

8 TECHNIQUES Evaluation of presuppositions in a query ( * ) Detection and correction of misconceptions in a query ( * ) Relaxation and generalization of queries and responses ( * ) Consideration of specific information about a user's state of mind Formulation of intensional answers

9 Presuppositions: Usually, asking a query not only presupposes the existence of all components of the query, but also presupposes an answer to the query itself. i.e. "Which employees own red cars?“ Q 4 :  emp(X), owns(X,Y), car(Y), red(Y). Two atoms in a query are joined if they share a variable. A query is connected if every two atoms in the query are connected. 2 n - 2 sub-queries for a conjunctive query with n atoms (exp. cost)  Algorithm: Report the smallest sub-queries that fail, considering only connected sub-queries TECHNIQUESTECHNIQUES

10 Lattice of sub-queries: If a sub-query has no answers, the query cannot have any answers either (scalar implicature) Finding presuppositions (failed sub-queries) is independent of domain specific knowledge. TECHNIQUESTECHNIQUES Presuppositions:

11 Misconceptions: Integrity constraints: IC 1 :  professor(X), student(X). IC 2 :  enrolled_in(X, Y), not student(X). Query: "Which professor is enrolled in COSC6115?“ Q 5 :  professor(X), enrolled_in(X, COSC6115). Answer: “No one is both a professor and a student. Anyone who is enrolled in a class is a student. So no one is a professor and enrolled in class.“ TECHNIQUESTECHNIQUES

12 Relaxation: Taxonomy clause: C 6 : travel(From, To)  serves_area(A, From), serves_area(B, To), flight(A,B) *. Reciprocal clause: C 6 T : relax(flight(A,B) )  serves_area(A, From), serves_area(B, To), travel(From, To). Relaxation step: let  be a substitution after unifying atom in goal with key ( * ) in the taxonomy clause 1. Apply  across the taxonomy clause. 2. Replace the query atom with the head atom of the taxonomy clause. 3. Add the non-key literals from the body of the taxonomy clause to the new query as constraints on the variables. TECHNIQUESTECHNIQUES

13 … Relaxation: Original query: Q 6 :  flight(‘Dulles, ‘Orly’). Q 6r :  relax ( flight(‘Dulles, ‘Orly’)). Relaxing via reciprocal clause C 6 T : Q 6r ’ :  serves_area(‘Dulles, From), serves_area(‘Orly’, To), travel(From, To). Resolving with taxonomy clause C 6 : Q 6r ’’ :  serves_area(‘Dulles, From), serves_area(‘Orly’, To), serves_area(A, From), serves_area(B, To), flight(A, B). C 6 : travel(From, To)  serves_area(A, From), serves_area(B, To), flight(A,B) *. C 6 T : relax(flight(A,B) )  serves_area(A, From), serves_area(B, To), travel(From, To). TECHNIQUESTECHNIQUES

14 … Relaxation: Q 6r ’’ :  serves_area(‘Dulles’, From), serves_area(‘Orly’, To), serves_area(A, From), serves_area(B, To), flight(A, B). When A = ‘Dulles’ and B = ‘Orly’, solving flight(‘ Dulles, ‘ Orly’ ) again and will get the same answers When A  ‘Dulles’ and B  ‘Orly’, will get new answers: – From = ‘Washington, D.C.’ and – serves_area(A, ‘Washington, D.C.’) will be satisfied by A = ‘National’, A = ‘BWI’ … TECHNIQUESTECHNIQUES

15 Generalization: Relaxation is strictly a syntactic notion, a rewrite mechanism. Generalization is a semantic counterpart to relaxation. Literal answers to the relaxed query should include answers to the original query, plus some new neighbourhood answers with respect to the original query. C 6 T : relax(flight(A,B) )  serves_area(A, From), serves_area(B, To), travel(From, To). TECHNIQUESTECHNIQUES After applying relaxation a new query is a generalization only if all the non-key atoms are satisfied whenever the key atom is satisfied. (conservative reciprocal clause) When all reciprocal clauses are conservative, resolution over a relaxed query will produce all the answers of the original query.

16 USER GOALS AND MODELS Types of knowledge about a user relevant to CA Interests and preferences Needs – user constraints (UC) Goals and intent

17 KEY POINTS: CA is mostly intended for DDB as a platform. For RDB, a deductive database interface should be implemented on top of any relational system. The system should support natural language input to some extend for some domains (the natural language translator generates a logical query) The system should produce natural language responses CA techniques, in particular relaxation, can useful for applications like Internet queries It is not evident that first order logic can serve as an adequate ontology for CA

18 The End

19 A CA SYSTEM (at U of Maryland) Uniform system: – Defined and implemented through logic – Uniform representation and support for all cooperative methods Portable – General approach for RDB, DDB and logic programs – Domain-independent Natural language interface – Accept natural language queries – Provide cohesive and coherent responses in natural language

20 Deductive Database Structure: EDB: prerequisite(‘MATH-300’, ‘MATH-350’). prerequisite(‘MATH-350’, ‘MATH-400’). teaches(smith, ‘MATH-400’). … IDB : teaches(X, Y)  teaches(X, Z), prerequisite(Y, Z). … IC :  enrolled_in(X, Y), not student(X). …


Download ppt "Cooperative Query Answering Based on a talk by Erick Martinez."

Similar presentations


Ads by Google