Knowledge based Personalization by Wonjung Kim
Outline Introduction Background – InfoQuilt system Personalization in InfoQuilt Related Work Conclusions and Future Work
Introduction Semantic web - components Semantics of data Semantics of human’s interest Personalization is a part of the second component
Background – the InfoQuilt system Semantics based information processing IScape : Information correlation Knowledge sharing based on multiple ontologies
Background – Overall Architecture server
Background – Architecture of a Peer Personalized Knowledge Base Personalization AgentIScape Execution
Background – Personalized Knowledge Base Shared ontologiesPersonalized ontologies
Personalization – in InfoQuilt system Representation of user profiles Personalization Techniques Personalization Algorithm Examples
Representation of user profiles Set of tuples of type Keyword: the term used to query Ontology: used in IScape Frequency: frequency of query Latest interest: boolean value IScape: the name of the last queried IScape
Personalization Techniques Score can be computed based on a scale of 0..1 Keywords matched Profiles matched Knowledge about latest context Frequency of querying a domain Query relationship Distance from a domain of interest
Personalization Techniques -keywords matched Not in profiles Query : “Bulldog Football ” Total number of keyword n = 2 Number of keywords matched m BulldogFootball P2p UGABaseball UGABasketball UGAFootball ½½1½½1 collegefootball professionalfootball UGAFootball ½½1½½1
Personalization Techniques - profiles matched P 1 P 2 Query: “Bulldog Schedule” P2p UGABaseball UGABasketball UGAFootball collegefootball professionalfootball 0/2 1 0/2 schedule bulldog
Personalization Techniques - knowledge about latest context Advantage: take the current ontology of the current query Example: P 1 P 2 P 3 P 4 It shows UGAFootball is the current ontology of the term bulldog”
Personalization Techniques - frequency of querying a domain P 1 P 2 P 3 Query: “bulldog football” Matched ontologies: UGAFootball and UGABasketball UGAFootball: (10+12)/( ) UGABasketball: 5 / ( )
Personalization Techniques - Query relationships More concrete than e-commerce market association rules Buy Cereal Buy Milk Query Relationship if a bulldog football team has a game scheduled, then the user may be interested in attending the game so he may query for flight ticket and vice versa. Use framework for inter-ontological relationships to define query relationships spatiallyNear(UGAFootball.gameVenue, Flight.arrivalCity) && temporallyNear(UGAFootball.gameDate, Flight.arrivalDate)
Personalization Techniques - Query relationships TeamFlight QueryBasketballFootball DateNov. 16, 2001Nov. 17, 2001Nov. 19, 2001Dec. 1, 2001 LocationAtlanta, GAAthens, GASpringfield, MAAthens, GA Query Relationships: Flight UGAFootball, Flight UGABasketball Query: “bulldog schedule”
Personalization Techniques - Personalization Techniques - Distance from a domain of interest The smaller the distance, the more relevant it is likely to be. Example) there is no query history about the term “gamecock” in a user’s profile. P1 Query: “gamecock schedule” P2P gamecocks, USCFootball USCFootball:1*0.5*0.25 = Gamecocks: 1*0.5*0.5*0.5*0.5*0.25*0.25=
Personalization Algorithm TechniqueCase 1Case 2 Keywords Matched Profiles Matched Knowledge of Latest Context Frequency of Querying a Domain Query Relationships Distance from a Domain of Interest
Personalization Algorithm TechniqueCase 1Case 2 1Keywords Matched Profiles Matched Query Relationships Frequency of Querying a Domain Knowledge of Latest Context 0.1 6Distance from a Domain of Interest 0.05 These weights are configurable
Examples Personalized Knowledge Base
Example 1 – without profile information (first Query)
Example 1 – keyword matching Ontologies123456Total UGAFootball0.5*1.00.0*0.00.1*0.00.0* * * UGABasketball0.5*1.00.0*0.00.1*0.00.0* * * UGAHockey0.5*1.00.0*0.00.1*0.00.0* * * JCBulldogs0.5*0.50.0*0.00.1*0.00.0* * * CollegeSports0.5*0.50.0*0.00.1*0.00.0* * * AnimalBulldogs0.5*0.50.0*0.00.1*0.00.0* * * CollegeNews0.5*0.50.0*0.00.1*0.00.0* * * CollegeBasketball0.5*0.50.0*0.00.1*0.00.0* * * USCNewspaper0.5*0.50.0*0.00.1*0.00.0* * * CollegeFootball0.5*0.50.0*0.00.1*0.00.0* * * USCBasketball0.5*0.50.0*0.00.1*0.00.0* * *
Example 2
Example 2 – use of user profile Ontologies123456Total UGAFootball0.4*1.00.2*1.00.1* * * UGAHockey0.4*1.00.2*0.00.1* * * UGABasketball0.4*1.00.2*0.00.1* * * AnimalBulldogs0.4*1.00.2*0.00.1* * * JCBulldogs0.4*1.00.2*0.00.1* * * P 1 Query: “bulldogs”
Example 3
Example 3 – latest context P 1 P 2 Query: “bulldogs” Ontologies123456Total UGAFootball0.4*1.00.2*0.50.1*0.00.1* * * UGAHockey0.4*1.00.2*0.00.1* * * UGABasketball0.4*1.00.2*0.50.1*1.00.1* * * AnimalBulldogs0.4*1.00.2*0.00.1* * * JCBulldogs0.4*1.00.2*0.00.1* * *
Example 4 - query relationship P 1 P 2 P 3 Query: “bulldogs”
Example 4
Example 4 – query relationship Ontologies123456Total UGAFootball0.4*1.00.2*0.50.1*0.00.1* * * UGAHockey0.4*1.00.2*0.00.1* * * UGABasketball0.4*1.00.2*0.50.1*1.00.1* * * AnimalBulldogs0.4*1.00.2*0.00.1* * * JCBulldogs0.4*1.00.2*0.00.1* * * TeamFlight QueryUGABasketballUGAFootball DateNov. 29, 2001 Nov. 30, 2001Dec. 30, 2001 LocationAtlanta, GASpringfield, MAAthens, GA
Example5 – Query with the new term
Example 5 – new query term Ontologies123456Total USCFootball0.5*1.00.1* * * USCHockey0.5*1.00.1* * * USCBasketball0.5*1.00.1* * * USCNewsPaper0.5*1.00.1* * * P 1 P 2 P 3 Query: “gamecocks”
Related Work Features of Knowledge Based personalization in InfoQuilt not supported by any other personalization systems Keywords and concepts in ontologies are used to locate them Query relationships between domains identify domains that the user’s profile provides no information for
Related Work… OBIWAN ( Alexander P, Susan G) Use a vector space model to classify documents use length, time, and the strength of match to track users’ interest myPlanet (Yannis K, John D, Enrico M, Maria V, Simon S) An ontology-driven personalized news publishing service Use simple relationships in the ontologies to deliver content that may be of interest to the user
Related Work… Scalable online personalization on the web (Anindya D, Kaushik D, Debra V, Krithi R, Shamkant N) Collaborative filtering approach Action rules and market basket rules Dynamic profile
Conclusion Personalization in InfoQuilt Ontologies in the personalized knowledge base reflect the user’s perception of the domain Keywords that are specified by the ontology, are useful for identifying other relevant ontologies A number of techniques combined to help the users find relevant ontologies Query relationships can identify related domains of interest in the current context of user’s query
Future Work For each domain, it is possible to identify a set of terms that indicate the context. These can also be used to locate ontologies. The only type of relationships in the ontologies used for identifying domains that may be of interest to the user is “is-a”. We can explore the user of other types of relationships supported by ontologies Evaluating query relationships requires work equivalent to evaluating one IScape. Instead, the results from the previous IScape can be cached.
Future Work Keyword matching can be further given weights depending on which component of ontology the keyword matched. For example, if a keyword matches the name of a class as opposed to description, it should have higher value. Experimenting with large amount of users and ontologies can help in identifying a reasonable weight assignment for the techniques.
Thank You!