BOOLEAN INFORMATION RETRIEVAL 1Adrienn Skrop. Boolean Information Retrieval  The Boolean model of IR (BIR) is a classical IR model and, at the same time,

Slides:



Advertisements
Similar presentations
Basic Structures: Sets, Functions, Sequences, Sums, and Matrices
Advertisements

© by Kenneth H. Rosen, Discrete Mathematics & its Applications, Sixth Edition, Mc Graw-Hill, 2007 Chapter 1: (Part 2): The Foundations: Logic and Proofs.
Instructor: Hayk Melikya
Basic Structures: Sets, Functions, Sequences, Sums, and Matrices
IB Math Studies – Topic 3. IB Course Guide Description.
Logic Chapter 2. Proposition "Proposition" can be defined as a declarative statement having a specific truth-value, true or false. Examples: 2 is a odd.
Denoting the beginning
Analytical Methods in CS (CIS 505)
2/17/2008Sultan Almuhammadi1 ICS Logic & Sets (An Overview) Week 1.
1 Section 1.2 Propositional Equivalences. 2 Equivalent Propositions Have the same truth table Can be used interchangeably For example, exclusive or and.
1 Math 306 Foundations of Mathematics I Math 306 Foundations of Mathematics I Goals of this class Introduction to important mathematical concepts Development.
Discrete Math 6A Max Welling. Recap 1. Proposition: statement that is true or false. 2. Logical operators: NOT, AND, OR, XOR, ,  3. Compound proposition:
Sets.
Lecture 8 Introduction to Logic CSCI – 1900 Mathematics for Computer Science Fall 2014 Bill Pine.
Set theory Sets: Powerful tool in computer science to solve real world problems. A set is a collection of distinct objects called elements. Traditionally,
1.1 Sets and Logic Set – a collection of objects. Set brackets {} are used to enclose the elements of a set. Example: {1, 2, 5, 9} Elements – objects inside.
Course Outline Book: Discrete Mathematics by K. P. Bogart Topics:
Lecture for Week Spring.   Introduction to Propositional Logic  Types of Proposition  Operator and Truth table Agenda.
BY: MISS FARAH ADIBAH ADNAN IMK. CHAPTER OUTLINE: PART III 1.3 ELEMENTARY LOGIC INTRODUCTION PROPOSITION COMPOUND STATEMENTS LOGICAL.
Chapter 3 – Set Theory  .
Set, Combinatorics, Probability & Number Theory Mathematical Structures for Computer Science Chapter 3 Copyright © 2006 W.H. Freeman & Co.MSCS Slides Set,
Barnett/Ziegler/Byleen Finite Mathematics 11e1 Chapter 7 Review Important Terms, Symbols, Concepts 7.1. Logic A proposition is a statement (not a question.
Discrete Structure Sets. 2 Set Theory Set: Collection of objects (“elements”) a  A “a is an element of A” “a is a member of A” a  A “a is not an element.
DISCRETE COMPUTATIONAL STRUCTURES
CS201: Data Structures and Discrete Mathematics I
Sets and Sentences Open Sentences Foundations of Real Analysis.
Chapter 7 Logic, Sets, and Counting
CompSci 102 Discrete Math for Computer Science
Propositional Logic. Propositions Any statement that is either True (T) or False (F) is a proposition Propositional variables: a variable that can assume.
Chapter SETS DEFINITION OF SET METHODS FOR SPECIFYING SET SUBSETS VENN DIAGRAM SET IDENTITIES SET OPERATIONS.
Chapter 7 Logic, Sets, and Counting Section 1 Logic.
6.1 Logic Logic is not only the foundation of mathematics, but also is important in numerous fields including law, medicine, and science. Although the.
Logical Form and Logical Equivalence Lecture 1 Section 1.1 Wed, Jan 12, 2005.
Logical Form and Logical Equivalence M Logical Form Example 1 If the syntax is faulty or execution results in division by zero, then the program.
Sets Goal: Introduce the basic terminology of set theory.
Basic Principles (continuation) 1. A Quantitative Measure of Information As we already have realized, when a statistical experiment has n eqiuprobable.
LOGIC AND QUANTIFIERS. INTRODUCTION Many algorithms and proofs use logical expressions such as: “IF p THEN q” or “If p1 AND p2, THEN q1 OR q2” Therefore.
1 Propositional Logic Introduction. 2 What is propositional logic? Propositional Logic is concerned with propositions and their interrelationships. 
CSNB143 – Discrete Structure Topic 4 – Logic. Learning Outcomes Students should be able to define statement. Students should be able to identify connectives.
Chapter 2 With Question/Answer Animations. Section 2.1.
Introduction to Set theory. Ways of Describing Sets.
(CSC 102) Lecture 13 Discrete Structures. Previous Lectures Summary  Direct Proof  Indirect Proof  Proof by Contradiction  Proof by Contra positive.
TRUTH TABLES. Introduction The truth value of a statement is the classification as true or false which denoted by T or F. A truth table is a listing of.
Discrete Mathematics Set.
Symbolic Logic The Following slide were written using materials from the Book: The Following slide were written using materials from the Book: Discrete.
Set Theory Concepts Set – A collection of “elements” (objects, members) denoted by upper case letters A, B, etc. elements are lower case brackets are used.
Module #3 - Sets 3/2/2016(c) , Michael P. Frank 2. Sets and Set Operations.
Discrete Mathematics CS 2610 August 31, Agenda Set Theory Set Builder Notation Universal Set Power Set and Cardinality Set Operations Set Identities.
Chapter 2 1. Chapter Summary Sets (This Slide) The Language of Sets - Sec 2.1 – Lecture 8 Set Operations and Set Identities - Sec 2.2 – Lecture 9 Functions.
Chapter 1 Logic and proofs
DISCRETE COMPUTATIONAL STRUCTURES CSE 2353 Fall 2010 Most slides modified from Discrete Mathematical Structures: Theory and Applications by D.S. Malik.
VECTOR SPACE INFORMATION RETRIEVAL 1Adrienn Skrop.
Discrete Structures for Computer Science Presented By: Andrew F. Conn Slides adapted from: Adam J. Lee Lecture #1: Introduction, Propositional Logic August.
Sets, Permutations, and Combinations. Lecture 4-1: Sets Sets: Powerful tool in computer science to solve real world problems. A set is a collection of.
Dr. Ameria Eldosoky Discrete mathematics
Chapter 2 Sets and Functions.
Chapter 1: Foundations: Sets, Logic, and Algorithms
CHAPTER 3 SETS, BOOLEAN ALGEBRA & LOGIC CIRCUITS
CSNB 143 Discrete Mathematical Structures
CSE15 Discrete Mathematics 01/23/17
CSNB 143 Discrete Mathematical Structures
Lecture 1 – Formal Logic.
Set, Combinatorics, Probability & Number Theory
Chapter 1 Logic and Proofs Homework 2 Given the statement “A valid password is necessary for you to log on to the campus server.” Express the statement.
Taibah University College of Computer Science & Engineering Course Title: Discrete Mathematics Code: CS 103 Chapter 2 Sets Slides are adopted from “Discrete.
CS100: Discrete structures
CHAPTER 1: LOGICS AND PROOF
Discrete Mathematics CS 2610
TRUTH TABLES.
Logic Logic is a discipline that studies the principles and methods used to construct valid arguments. An argument is a related sequence of statements.
Presentation transcript:

BOOLEAN INFORMATION RETRIEVAL 1Adrienn Skrop

Boolean Information Retrieval  The Boolean model of IR (BIR) is a classical IR model and, at the same time, the first and most adopted one.  It is used by virtually all commercial IR systems today.  The BIR is based on:  Boolean Logic and  classical Sets Theory in that both the documents to be searched and the user's query are conceived as sets of terms.  Retrieval is based on whether or not the documents contain the query terms. 2Adrienn Skrop

Boolean logic 3Adrienn Skrop

Proposition  The proposition is a statement (formulation, assertion) which can be assigned either a value T or a value F (there is no third alternative), where T and F are two different values, i.e., T  F.  For example, T = true, F = false, or T = yes, F = no, or T = white, F = black, or T = 1, F = 0. The values T and F will be referred to as truth values. A proposition cannot be true and false at the same time (principle of noncontradiction). 4Adrienn Skrop

Example  “I am reading this text.” is a  true  proposition.  The sentence “The sun is shining.” is also a proposition because either the value T or F can be assigned to it.  The sentence „The cooks wearing red hats are playing football at the North Pole.“ becomes a proposition if a truth value can be assigned to it.  In general, it is not a necessary quality of an assertion or proposition to be true. For example, the proposition “It is raining.” may be true or false. However, there are propositions that are ‘absolutely’ true (e.g., “The year 2001 is the first year of the 21st century.”) 5Adrienn Skrop

Negation  The negation of a proposition P is a proposition denoted by  P and pronounced “not P”. If P is true, then  P is false, and if P is false, then  P is true.  Hence,  (  P) is always P (law of double negation). Truth table of logical negation  “I am not reading this text.” is a  false  proposition, and is the negation of the proposition “I am reading this text.”. 6 P PP TF FT Adrienn Skrop

Conjunction  Given two propositions: P, Q. The proposition denoted by P Λ Q (pronunciation: “P and Q”) is called conjunction. The conjunction is true if and only if both P and Q are true, and false otherwise.  Thus, P Λ (  P) is always false (law of contradiction). Truth table of logical conjunction 7 PQ P Λ Q TTT TFF FTF FFF Adrienn Skrop

Example  “I am reading this text. Λ It is raining.” is a proposition, and its truth value can be assigned by the Reader.  “I am thinking at myself. Λ A bicycle has two wheels.” is a proposition (the Reader can assign a truth value to it), albeit one would rarely link its two constituent propositions into one sentence in everyday speech. 8Adrienn Skrop

Disjunction  Given two propositions: P, Q. The proposition denoted by P V Q (pronunciation: “P or Q”) is called disjunction. The disjunction is false if and only if both P and Q are false, and true otherwise.  Thus, P V (  P) is always true (law of excluded third).  Truth table of logical disjunction 9 PQ P V Q TTT TFT FTT FFF Adrienn Skrop

Example  “I am reading this text. V It is raining.” is a true proposition (regardless of whether it is actually raining or not). 10Adrienn Skrop

Sets theory 11Adrienn Skrop

Sets  The notion of set is a fundamental one. It does not have a mathematical definition. A set is a collection of distinct objects.  The objects in a set are called elements. If an object x is an element of a set S (equivalent formulation: x belongs to S), this is denoted as x  S. x  S means that x does not belong to S.  It is very important to note that:  An element can occur at most once in a set.  The order of the elements in a set is unimportant. 12Adrienn Skrop

Sets  A set can be given  by enumerating its elements between brackets, e.g., A = {a 1, a 2,...,a n }, or  by giving a property P(x) all elements must share as follows: A = {x | P(x)}.  A set having a fixed number of elements is finite, and infinite otherwise.  The empty set contains no elements and is denoted by . 13Adrienn Skrop

Example  ℕ = {1, 2,…,n,…} denotes the set of natural numbers.  ℤ ={...,  2,  1, 0, 1, 2,...} denotes the set of integer numbers.  ℚ denotes the set of rational numbers,  ℝ denotes the set of real numbers,  ℂ denotes the set of complex numbers,  {thought, ape, quantum, Rembrandt} is a set.  {mammal | water content of mammal’s milk is less than 20%} is a set 14Adrienn Skrop

Union  The union of sets A and B is denoted by the symbol  and defined as follows: A  B = {x | (x  A) V (x  B)}. 15Adrienn Skrop

Union example {thought, ape, quantum, Rembrandt}  {1, 2} = {thought, ape, quantum, Rembrandt, 1, 2}.  Note that the operation of union is a purely formal one (just like the other set operations): it does not require that the elements of the sets be compatible with each other, or have the same nature, in any way. 16Adrienn Skrop

Union Set union satisfies the following properties (as can be easily checked using the definitions of sets equality and union):  Commutativity: A  B = B  A, for any two sets A, B;  Associativity: A  (B  C)= (A  B)  C, for any three sets A, B, C;  Idempotency: A  A = A, for any set A. 17Adrienn Skrop

Intersection  The intersection of sets A and B is denoted by the symbol  and defined as follows: A  B = {x | (x  A) Λ (x  B)}. Visualisation of set intersection A  B 18Adrienn Skrop

Disjoint sets  If A  B = , sets A and B are said to be disjoint sets Visualisation of the disjoint sets A and B: 19Adrienn Skrop

Intersection example {thought, ape, quantum, Rembrandt}  {thought, Rembrandt, 1, 2} = {thought, Rembrandt}. Note that the result of the intersection consists of the elements which are exactly the same. 20Adrienn Skrop

Intersection Set intersection satisfies the following properties (as can be easily checked using the definitions of sets equality and intersection):  Commutativity: A  B = B  A, for any two sets A, B;  Associativity: A  (B  C)= (A  B)  C, for any three sets A, B, C;  Idempotency: A  A = A, for any set A. 21Adrienn Skrop

Difference  The difference of sets A and B (in this order) is denoted by the symbol \, and is defined as follows A \ B = {x | (x  A) Λ (x  B)}. Note: in general, A \ B  B \ A (i.e., set difference does not commute) 22Adrienn Skrop

Powerset  The powerset  (A) of a set A is defined as follows:  (A) = {X | X  A}, i.e., the set of all subsets of A.  The empty set  is a member of the powerset of any set A, i.e.,    (A).  Example   ({thought, ape, quantum}) = { , {thought}, {ape}, {quantum}, {thought, ape}, {thought, quantum}, {ape, quantum}, {thought, ape, quantum}}. 23Adrienn Skrop

Notations Given a finite set T = {t 1, t 2,...,t j,...,t m } of elements called terms (e.g. words or expressions  which may be stemmed  describing or characterising documents such as, for example, keywords given for a journal article) 24Adrienn Skrop

Notation Given a finite set D = {D 1,...,D i,...,D n }, D i  (T) of elements called documents. These documents are formally conceived, for retrieval purposes, as being represented by sets of terms. 25Adrienn Skrop

Notation Given a Boolean expression Q called a query. For example (the terms are A,B,C,D,E): 26Adrienn Skrop

Boolean Information Retrieval Retrieval is defined as follows: 1. The set S i of documents are obtained that contain or not the term under focus: for term A: S i = {D | A  D} for negated term A, i.e., ¬A: S i = {D | A ∉ D} 2. Those documents are retrieved in response to Q, which belong to the set obtained as a result of the corresponding sets operations: intersection  corresponds to logical AND, union  corresponds to logical OR. 27Adrienn Skrop

BIR example 1  Q = A OR (B AND C)  S 1 results set for term A,  S 2 results set for term B,  S 3 results set for term C,  The retrieved set in response to Q: S 1  (S 2  S 3 ) 28Adrienn Skrop

BIR example 2 Let the set of original documents be O = {O 1, O 2, O 3 } where:  O 1 = Bayes' Principle: The principle that, in estimating a parameter, one should initially assume that each possible value has equal probability (a uniform prior distribution).  O 2 = Bayesian Decision Theory: A mathematical theory of decision-making which presumes utility and probability functions, and according to which the act to be chosen is the Bayes act, i.e. the one with highest Subjective Expected Utility. If one had unlimited time and calculating power with which to make every decision, this procedure would be the best way to make any decision. 29Adrienn Skrop

BIR example 2  O 3 = Bayesian Epistemology: A philosophical theory which holds that the epistemic status of a proposition (i.e. how well proven or well established it is) is best measured by a probability and that the proper way to revise this probability is given by Bayesian conditionalisation or similar procedures. A Bayesian epistemologist would use probability to define, and explore the relationship between, concepts such as epistemic status, support or explanatory power. 30Adrienn Skrop

BIR example 2  Let the set T of terms be: T = {t 1 = Bayes' Principle, t 2 = probability, t 3 = decision-making, t 4 = Bayesian Epistemology}.  Then, the set D of documents is as follows: D = {D 1, D 2, D 3 } where  D 1 = {Bayes' Principle, probability},  D 2 = {probability, decision-making},  D 3 = {probability, Bayesian Epistemology}.. 31Adrienn Skrop

BIR example 2  Let the query Q be: Q = probability Λ decision-making  Firstly, the following sets S 1 and S 2 of documents D i are obtained:  S 1 = {D i | probability  D i } = {D 1, D 2, D 3 },  S 2 = {D i | decision-making  D i } = {D 2 }.  Finally, the following documents D i are retrieved in response to Q:  {D i |D i  S 1  S 2 } = {D 1, D 2, D 3 }  {D 2 } = {D 2 }. 32Adrienn Skrop