A Probabilistic Quantifier Fuzzification Mechanism: The Model and Its Evaluation for Information Retrieval Felix Díaz-Hemida, David E. Losada, Alberto.

Slides:



Advertisements
Similar presentations
A Semantic Model for Vague Quantifiers Combining Fuzzy Theory and Supervaluation Theory Ka Fat CHOW The Hong Kong Polytechnic University The title of.
Advertisements

Fuzzy Sets and Fuzzy Logic
Ch2 Data Preprocessing part3 Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
Representing and Querying Correlated Tuples in Probabilistic Databases
INSTRUCTOR: DR.NICK EVANGELOPOULOS PRESENTED BY: QIUXIA WU CHAPTER 2 Information retrieval DSCI 5240.
Fast Algorithms For Hierarchical Range Histogram Constructions
CLASSICAL LOGIC and FUZZY LOGIC. CLASSICAL LOGIC In classical logic, a simple proposition P is a linguistic, or declarative, statement contained within.
Fuzzy Expert Systems. Lecture Outline What is fuzzy thinking? What is fuzzy thinking? Fuzzy sets Fuzzy sets Linguistic variables and hedges Linguistic.
ผศ. ดร. สุพจน์ นิตย์ สุวัฒน์ ตอนที่ interval, 2. the fundamental concept of fuzzy number, 3. operation of fuzzy numbers. 4. special kind of fuzzy.
Approximate Reasoning 1 Expert Systems Dr. Samy Abu Nasser.
© C. Kemke Approximate Reasoning 1 COMP 4200: Expert Systems Dr. Christel Kemke Department of Computer Science University of Manitoba.
Web Search - Summer Term 2006 II. Information Retrieval (Basics Cont.)
IR Models: Overview, Boolean, and Vector
Incorporating Language Modeling into the Inference Network Retrieval Framework Don Metzler.
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) IR Queries.
Modern Information Retrieval Chapter 2 Modeling. Can keywords be used to represent a document or a query? keywords as query and matching as query processing.
Chapter 2Modeling 資工 4B 陳建勳. Introduction.  Traditional information retrieval systems usually adopt index terms to index and retrieve documents.
Modeling Modern Information Retrieval
© 2002 Franz J. Kurfess Approximate Reasoning 1 CPE/CSC 481: Knowledge-Based Systems Dr. Franz J. Kurfess Computer Science Department Cal Poly.
Vector Space Model CS 652 Information Extraction and Integration.
Theory and Applications
Quantum Computing Joseph Stelmach.
Modern Information Retrieval Chapter 2 Modeling. Can keywords be used to represent a document or a query? keywords as query and matching as query processing.
3-1 Introduction Experiment Random Random experiment.
WELCOME TO THE WORLD OF FUZZY SYSTEMS. DEFINITION Fuzzy logic is a superset of conventional (Boolean) logic that has been extended to handle the concept.
Ming-Feng Yeh General Fuzzy Systems A fuzzy system is a static nonlinear mapping between its inputs and outputs (i.e., it is not a dynamic system).
Fussy Set Theory Definition A fuzzy subset A of a universe of discourse U is characterized by a membership function which associate with each element.
1 First order theories. 2 Satisfiability The classic SAT problem: given a propositional formula , is  satisfiable ? Example:  Let x 1,x 2 be propositional.
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
Modeling (Chap. 2) Modern Information Retrieval Spring 2000.
Fuzzy Sets Introduction/Overview Material for these slides obtained from: Modern Information Retrieval by Ricardo Baeza-Yates and Berthier Ribeiro-Neto.
Fuzzy Logic. Lecture Outline Fuzzy Systems Fuzzy Sets Membership Functions Fuzzy Operators Fuzzy Set Characteristics Fuzziness and Probability.
Ming Fang 6/12/2009. Outlines  Classical logics  Introduction to DL  Syntax of DL  Semantics of DL  KR in DL  Reasoning in DL  Applications.
General Database Statistics Using Maximum Entropy Raghav Kaushik 1, Christopher Ré 2, and Dan Suciu 3 1 Microsoft Research 2 University of Wisconsin--Madison.
A Markov Random Field Model for Term Dependencies Donald Metzler W. Bruce Croft Present by Chia-Hao Lee.
1 A Theoretical Framework for Association Mining based on the Boolean Retrieval Model on the Boolean Retrieval Model Peter Bollmann-Sdorra.
1 A Maximizing Set and Minimizing Set Based Fuzzy MCDM Approach for the Evaluation and Selection of the Distribution Centers Advisor:Prof. Chu, Ta-Chung.
Advanced information retrieval Chapter. 02: Modeling (Set Theoretic Models) – Fuzzy model.
Theory and Applications
Pattern-directed inference systems
Book: Bayesian Networks : A practical guide to applications Paper-authors: Luis M. de Campos, Juan M. Fernandez-Luna, Juan F. Huete, Carlos Martine, Alfonso.
CS344: Introduction to Artificial Intelligence Lecture: Herbrand’s Theorem Proving satisfiability of logic formulae using semantic trees (from Symbolic.
CS621: Artificial Intelligence Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 28– Interpretation; Herbrand Interpertation 30 th Sept, 2010.
Ranking in Information Retrieval Systems Prepared by: Mariam John CSE /23/2006.
A Probabilistic Quantifier Fuzzification Mechanism: The Model and Its Evaluation for Information Retrieval Felix Díaz-Hemida, David E. Losada, Alberto.
Indirect Supervision Protocols for Learning in Natural Language Processing II. Learning by Inventing Binary Labels This work is supported by DARPA funding.
Copyright © Curt Hill Quantifiers. Copyright © Curt Hill Introduction What we have seen is called propositional logic It includes.
Logical Systems and Knowledge Representation Fuzzy Logical Systems 1.
Programming Languages and Design Lecture 3 Semantic Specifications of Programming Languages Instructor: Li Ma Department of Computer Science Texas Southern.
Theory and Applications
“Principles of Soft Computing, 2 nd Edition” by S.N. Sivanandam & SN Deepa Copyright  2011 Wiley India Pvt. Ltd. All rights reserved. CHAPTER 12 FUZZY.
FUZZY LOGIC INFORMATION RETRIEVAL MODEL Ferddie Quiroz Canlas, ME-CoE.
DL Overview Second Pass Ming Fang 06/19/2009. Outlines  Description Languages  Knowledge Representation in DL  Logical Inference in DL.
2004 謝俊瑋 NTU, CSIE, CMLab 1 A Rule-Based Video Annotation System Andres Dorado, Janko Calic, and Ebroul Izquierdo, Senior Member, IEEE.
Set Theoretic Models 1. IR Models Non-Overlapping Lists Proximal Nodes Structured Models Retrieval: Adhoc Filtering Browsing U s e r T a s k Classic Models.
Human and Machine Understanding of normal Language (NL) Character Strings Presented by Peter Tripodes.
ece 627 intelligent web: ontology and beyond
ARTIFICIAL INTELLIGENCE [INTELLIGENT AGENTS PARADIGM] Professor Janis Grundspenkis Riga Technical University Faculty of Computer Science and Information.
1 Lecture 4 The Fuzzy Controller design. 2 By a fuzzy logic controller (FLC) we mean a control law that is described by a knowledge-based system consisting.
LDK R Logics for Data and Knowledge Representation Description Logics: family of languages.
Fuzzy Relations( 關係 ), Fuzzy Graphs( 圖 形 ), and Fuzzy Arithmetic( 運算 ) Chapter 4.
Lecture 9: Query Complexity Tuesday, January 30, 2001.
Chapter 3: Fuzzy Rules & Fuzzy Reasoning Extension Principle & Fuzzy Relations (3.2) Fuzzy if-then Rules(3.3) Fuzzy Reasonning (3.4)
Bayesian Extension to the Language Model for Ad Hoc Information Retrieval Hugo Zaragoza, Djoerd Hiemstra, Michael Tipping Microsoft Research Cambridge,
An Effective Statistical Approach to Blog Post Opinion Retrieval Ben He, Craig Macdonald, Jiyin He, Iadh Ounis (CIKM 2008)
1 Representing and Reasoning on XML Documents: A Description Logic Approach D. Calvanese, G. D. Giacomo, M. Lenzerini Presented by Daisy Yutao Guo University.
The Propositional Calculus
CLASSICAL LOGIC and FUZZY LOGIC
Quantum Computing Joseph Stelmach.
Relational Calculus Chapter 4, Part B
Presentation transcript:

A Probabilistic Quantifier Fuzzification Mechanism: The Model and Its Evaluation for Information Retrieval Felix Díaz-Hemida, David E. Losada, Alberto Bugarín, and Senén Barro Present by Chia-Hao Lee

2 outline Introduction Fuzzy Quantifiers –Probabilistic Quantifier Fuzzification Mechanisms New View in Crisp Representatives –FA Quantifier Fuzzification mechanism –Properties of the Model Applying the FA Quantifier Fuzzificaiton Mechanism for Information Retrieval –Fuzzy Quantifiers and Information Retrieval –Example Information Retrieval Experiments Conclusion

3 Introduction The ability of fuzzy quantifiers to model linguistic statements in a natural way has proved useful in diverse areas such as expert systems, data mining, control systems, database systems, etc. In the information retrieval (IR) field, fuzzy quantification has been proposed for handling expressive queries giving rise to flexible query language.

4 Fuzzy Quantifiers A fuzzy set X its membership function is denoted as : for example: : the powerset of E : the fuzzy powerset of E stands for the α-cut of level α of X

5 Definition 1 (Classic Quantifier) : A classic s-ary quantifier on a base or referential set E is a mapping Q : A typical example of a classic quantifier is the following definition of an all statement which can be used for sentences such as “ ” : Fuzzy Quantifiers

6 For example : Let us consider the evaluation of the sentence “80% or more of students are Spanish” where the properties “students” and “Spanish” are, respectively, defined as X 1 ={1,0,1,0,1,0,1,1}, X 2 ={1,0,1,0,1,0,0,0} and “80% or more” is defined as in (1). Then

7 Definition 2 (Fuzzy Quantifier) : An s-ary fuzzy quantifier Q on a base set is a mapping An example of a fuzzy quantifier is, which can defined as a fuzzy extension of 1 using a typical definition for the fuzzy inclusion operator: Fuzzy Quantifiers

8 For example : Let us consider the evaluation of sentence “all tall people are blond” in the referential set. Let us assume that properties “tall” and “blond” are, respectively, defined as Using expression (2) then: In many cases, it is not easy to achieve consensus on an intuitive and generally applicable expression for implementing a given quantified expression. Fuzzy Quantifiers

9 Definition 3 (Semi-fuzzy Quantifier) : An s-ary semi-fuzzy quantifier Q on a base set is a mapping which assigns a gradual result to each choice of crisp.

10 Fuzzy Quantifiers Examples of semi-fuzzy quantifier are :

11 Fuzzy Quantifiers For example : Let us consider the evaluation of the sentence “about 80% or more of the students are Spanish”. Let us assume that properties “students” and “Spanish” are, respectively, defined as X 1 ={1,0,1,0,1,0,1,1}, X 2 ={1,0,1,0,1,0,0,0} then

12 Fuzzy Quantifiers Although semi-fuzzy quantifiers are much more intuitive and easier to define than fuzzy quantifiers, they cannot be directly applied for handling linguistic statements, since semi-fuzzy quantifiers are defined on crisp sets. Such methods are known as quantifier fuzzification mechanisms (QFM) and formally defined as a mapping with domain in the universe of semi-fuzzy quantifiers and range in the universe of fuzzy quantifiers:

13 Fuzzy Quantifiers Probabilistic Quantifier Fuzzification Mechanisms : In the universe of discourse E is finite and expressions and unary then both expressions collapse into the same discrete expression The value can be interpreted as the probability that is selected as the crisp representative for the fuzzy set X.

14 Fuzzy Quantifiers For example : Let us consider the evaluation of the quantified sentence “almost all students are tall.” Suppose that we model the property tall for a referential set of students through the fuzzy set tall and we support the quantified expression “almost all” by means of the following semi-fuzzy quantifier :

15 Fuzzy Quantifiers given the fuzzy set tall, the values are and the fuzzification process runs as follows:

16 New View on Crisp Representatives Given a fuzzy set, the process that selects a number of elements in E to be includes in a crisp representative of X can be viewed as a random process in which n mutually independent binary decisions are made. Every individual decision involving an element may be viewed as a Bernoulli trial whose probability of success equals.

17 New View on Crisp Representatives Definition 4 ( ) : We define the probability that a crisp set is a crisp representative of X as Definition 5 ( ) : Let be a semi-fuzzy quantifier. For simplicity, fuzzification process :

18 New View on Crisp Representatives We will denote by a referential containing m elements. By we will denote a crisp (fuzzy) set on this referential. Let us consider a unary semi-fuzzy quantitative quantifier : a function with the form

19 New View on Crisp Representatives For this case, the expression becomes And we instead of

20 New View on Crisp Representatives Example of the approach

21 New View on Crisp Representatives It can be proved that all the value can be obtained with a complexity

22 New View on Crisp Representatives We can advance that the model is well-behaved because it fulfills the properties of correct generalization of crisp expressions, induced operations, external negation, internal negation, duality, internal meets, monotonicity in arguments monotonicity in quantifiers and coherece with logic.

23 Applying the FA Quantifier Fuzzificaiton Mechanism for Information Retrieval IR is the science concerned with the effective and efficient retrieval of information for the subsequent use by interested parties. IR models differ in the way in which documents and queries are represented and matched. The proposal designs a general framework based on the NVM method in which quantifiers with different degrees of expressiveness can be handled.

24 Applying the FA Quantifier Fuzzificaiton Mechanism for Information Retrieval Consider a query with the form. Given a document of the document base, every query term produces a score which represents the connection between the document’s semantics and the term. Formally, every document induces a fuzzy set on the set of query terms which is defined applying the popular weighting strategy : the raw frequency of term in the document : the maximum raw frequency computed over all terms mentioned by the document

25 Applying the FA Quantifier Fuzzificaiton Mechanism for Information Retrieval The fuzzy set models the connection between the document and every query component. Quantification can now be applied on for evaluating the quantified symbol all.

26 Applying the FA Quantifier Fuzzificaiton Mechanism for Information Retrieval Example : Let us suppose that we apply the following power function for supporting a given query quantification symbol Q : Imagine a query and consider a document whose fuzzy set induced on the query components is Applying now the fuzzification process explained along this paper, the query-document matching is assigned a score n : the number of query terms

27 Applying the FA Quantifier Fuzzificaiton Mechanism for Information Retrieval Let us now apply the NVM approach to handle the same example. The score assigned is equal to It follows that the final value yielded by the NVM method is:

28 Information Retrieval Experiment We ran experiments against the Wall Street Journal (WSJ) documents, which are about 173,000 news articles (from 1987 to 1992). Natural language documents are preprocessed as follow: –First, common words such as prepositions, articles, etc. are eliminated. –Second, terms are reduced to their syntactical root by applying the popular Porter’s stemmer.

29 Information Retrieval Experiment We tried out different semi-fuzzy quantifiers for relaxing the interpretation of the quantified statement all and, for each semi-fuzzy quantifier, both the fuzzification approach and the NVM approach were applied. We experimented with power functions and exponential functions, both of them normalized in the interval as follows :

30 Information Retrieval Experiment

31 Information Retrieval Experiment

32 Information Retrieval Experiment

33 Information Retrieval Experiment

34 Information Retrieval Experiment

35 Information Retrieval Experiment

36 Conclusion In the paper, we present a new probabilistic quantifier fuzzification mechanism, its efficient implementation and its application for the basic information retrieval task.