Download presentation
Presentation is loading. Please wait.
Published byRodger Warner Modified over 9 years ago
1
APPLYING INFORMATION RETRIEVAL TO TEXT MINING 2011.10.26. Data mining Lab 이아람
2
IR ( Information retrieval ) Returning relevant texts for query A measure of similarity is computed between the query and each document The similarity scores The vector space model
3
Counting Letters
6
Counting words
8
Counting Pronouns Occurring
9
heshe himher hisher hishers himselfherself
10
TEXT COUNT AND VECTOR
11
Vectors and Angles 두 Text 를 비교하기 위해 Angle 이용 Vector 를 이용하여 Angle 을 구한다. Angle 값이 0 에 가까울 수록 두 Text 는 유사함
12
Vectors and Angles Inner product Dot product
13
Vectors and Angles Vector length =
14
Computing Angles
17
cosθ = 0.89503 Angle of 0.46230 radians, which about 26.5º
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.