Gradual Adaption Model for Estimation of User Information Access Behavior J. Chen, R.Y. Shtykh and Q. Jin Graduate School of Human Sciences, Waseda University,

Slides:



Advertisements
Similar presentations
ELIBRARY CURRICULUM EDITION The ultimate K-12 curriculum and reference solution.
Advertisements

Database VS. Search Engine
1 Entity Ranking Using Wikipedia as a Pivot (CIKM 10’) Rianne Kaptein, Pavel Serdyukov, Arjen de Vries, Jaap Kamps 2010/12/14 Yu-wen,Hsu.
Web Search - Summer Term 2006 III. Web Search - Introduction (Cont.) (c) Wolfgang Hürst, Albert-Ludwigs-University.
WebMiningResearch ASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007.
Time-dependent Similarity Measure of Queries Using Historical Click- through Data Qiankun Zhao*, Steven C. H. Hoi*, Tie-Yan Liu, et al. Presented by: Tie-Yan.
Searching the Library Catalogue October Objectives To help you to: Locate material on your reading list in the library and online Find information.
LinkSelector: A Web Mining Approach to Hyperlink Selection for Web Portals Xiao Fang University of Arizona 10/18/2002.
LinkSelector: Select Hyperlinks for Web Portals Prof. Olivia Sheng Xiao Fang School of Accounting and Information Systems University of Utah.
Web Mining Research: A Survey
WebMiningResearch ASurvey Web Mining Research: A Survey By Raymond Kosala & Hendrik Blockeel, Katholieke Universitat Leuven, July 2000 Presented 4/18/2002.
Web Mining Research: A Survey
WebMiningResearchASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007 Revised.
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
Overview of Web Data Mining and Applications Part I
SIGIR’09 Boston 1 Entropy-biased Models for Query Representation on the Click Graph Hongbo Deng, Irwin King and Michael R. Lyu Department of Computer Science.
Advisor: Hsin-Hsi Chen Reporter: Chi-Hsin Yu Date:
The Research Process Why Do Research?. Research is a process made up of many small steps. What Next? Steps in the Research Process 1. Define your research.
Research paper: Web Mining Research: A survey SIGKDD Explorations, June Volume 2, Issue 1 Author: R. Kosala and H. Blockeel.
Enterprise & Intranet Search How Enterprise is different from Web search What to think about when evaluating Enterprise Search How Intranet use is different.
English 1113 Frederic Murray Assistant Professor MLIS, University of British Columbia BA, Political Science, University of Iowa Instructional Services.
Web 2.0 Based e-Learning System for CEFR Kohji Shibano Research Institute for Languages and Culture of Asia and Africa Tokyo University of Foreign Studies.
Put it to the Test: Usability Testing of Library Web Sites Nicole Campbell, Washington State University.
User Browsing Graph: Structure, Evolution and Application Yiqun Liu, Yijiang Jin, Min Zhang, Shaoping Ma, Liyun Ru State Key Lab of Intelligent Technology.
LIS510 lecture 3 Thomas Krichel information storage & retrieval this area is now more know as information retrieval when I dealt with it I.
1 Searching through the Internet Dr. Eslam Al Maghayreh Computer Science Department Yarmouk University.
Implicit An Agent-Based Recommendation System for Web Search Presented by Shaun McQuaker Presentation based on paper Implicit:
Dr. Susan Gauch When is a rock not a rock? Conceptual Approaches to Personalized Search and Recommendations Nov. 8, 2011 TResNet.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
PAUL ALEXANDRU CHIRITA STEFANIA COSTACHE SIEGFRIED HANDSCHUH WOLFGANG NEJDL 1* L3S RESEARCH CENTER 2* NATIONAL UNIVERSITY OF IRELAND PROCEEDINGS OF THE.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.
Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/ :09 PM.
DISCUS South Carolina’s Virtual Library A program overview.
Web Usage Mining for Semantic Web Personalization جینی شیره شعاعی زهرا.
Overviews of ITCS 6161/8161: Advanced Topics on Database Systems Dr. Jianping Fan Department of Computer Science UNC-Charlotte
TOPIC CENTRIC QUERY ROUTING Research Methods (CS689) 11/21/00 By Anupam Khanal.
Developing Trust Networks based on User Tagging Information for Recommendation Making Touhid Bhuiyan et al. WISE May 2012 SNU IDB Lab. Hyunwoo Kim.
Web Personalization Based on Static Information and Dynamic User Behavior Center for E-Business Technology Seoul National University Seoul, Korea Nam,
CSM06 Information Retrieval Lecture 6: Visualising the Results Set Dr Andrew Salway
Web Image Retrieval Re-Ranking with Relevance Model Wei-Hao Lin, Rong Jin, Alexander Hauptmann Language Technologies Institute School of Computer Science.
Analysis of Topic Dynamics in Web Search Xuehua Shen (University of Illinois) Susan Dumais (Microsoft Research) Eric Horvitz (Microsoft Research) WWW 2005.
DISCUS South Carolina’s Virtual Library A program overview.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
GrammAds: Keyword and Ad Creative Generator for Online Advertising Campaigns Author : Stamatina Thomaidou, Konstantinos Leymonis, and Michalis Vazirgiannis.
Student Edition: Gale Info Trac Database Lesson Grades 9-12 High School Student Edition: Gale Info Trac Database Lesson Grades 9-12 High School Anita Cellucci.
Examples for Open Access Scholar Electronic Repository by New Bulgarian University IP LibCMASS Sofia 2011 Contract № 2011-ERA-IP-7 Sofia, September,
Thesis Proposal: Prediction of popular social annotations Abon.
Harvesting Social Knowledge from Folksonomies Harris Wu, Mohammad Zubair, Kurt Maly, Harvesting social knowledge from folksonomies, Proceedings of the.
Web-Mining …searching for the knowledge on the Internet… Marko Grobelnik Institut Jožef Stefan.
Judit Tóvári PhD Eszterházy Károly College, Eger (Hungary) Institute of Media Informatics From librarian to information manager.
Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Chien-Shing Chen Author : Juan D.Velasquez Richard Weber Hiroshi Yasuda 國立雲林科技大學 National.
+ User-induced Links in Collaborative Tagging Systems Ching-man Au Yeung, Nicholas Gibbins, Nigel Shadbolt CIKM’09 Speaker: Nonhlanhla Shongwe 18 January.
Supporting Knowledge Discovery: Next Generation of Search Engines Qiaozhu Mei 04/21/2005.
26/01/20161Gianluca Demartini Ranking Categories for Faceted Search Gianluca Demartini L3S Research Seminars Hannover, 09 June 2006.
The Research Process Why Do Research? Tennessee State University Undergraduate Classes.
Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.
Chapter 8: Web Analytics, Web Mining, and Social Analytics
Data mining in web applications
Using computers to search electronic databases
Text & Web Mining 9/22/2018.
Data Analyzing Artificial Intelligence (AI)
Author: Kazunari Sugiyama, etc. (WWW2004)
Web Mining Department of Computer Science and Engg.
Date: 2012/11/15 Author: Jin Young Kim, Kevyn Collins-Thompson,
Web archives as a research subject
Web Mining Research: A Survey
Personalization Personalized System Traditional System 3 2 1
Topic: Semantic Text Mining
Presentation transcript:

Gradual Adaption Model for Estimation of User Information Access Behavior J. Chen, R.Y. Shtykh and Q. Jin Graduate School of Human Sciences, Waseda University, Japan

October 1, 2015Waseda University2 Background Why do we need information –In leisure: search a route or map for tour –In work: search business information –In learning: search academic papers Where do we get information –From traditional media such as books, magazines, etc. –From Internet How do we get information –To search it from a bookstore, library, etc. –To search it from Internet What do we have to face in information search –Too many search results including trashes

October 1, 2015Waseda University3 Information Recommendation Information recommendation –Web mining approaches Usage mining Structure mining Content mining Semantic mining –Web mining data Content data: text and multimedia provided by web sites. Structure data: organization inside a web page, internal and external links, and the web site hierarchy. Usage data: access logs data of web sites. User profile: information data of users. Semantic data: the data describe the structure and definition of semantic web sites.

October 1, 2015Waseda University4 Study Approach Proposing a gradual adaption model for estimation of user information access behavior Analyzing a variety of users' information access data in terms of short, medium, long periods, and by remarkable and exceptional categories, and based on Full Bayesian Estimation Conducting experimental simulation to show the operability and effectiveness of the proposed model

October 1, 2015Waseda University5 Related Works WUM (Web Usage Mining) –Based on implicit users’ feedback A new document representation model (Poblete and Baeza-Yates, WWW2008) –Experimented on a web site with a small number of vocabularies and specific to certain topics. Indentifying relevant web sites from user activities (Bilenko and White, WWW2008) –Needs to spend more time to train the system –Personalize information recommendation Dynamic Link Generation (Yan, et al, WWW1996) –Consists of off-line and on-line modules SUGGEST 3.0 (Baraglia and Silvestri, WT2004) –For large web sites, and only have on-line module. But the size of logs used to evaluated the system is small and limited. LinkSelector (Fang and Sheng, ACM2004) –Hyperlinks-structural and theirs access logs were used.

October 1, 2015Waseda University6 Definitions of Keyword, Link and Concept Keyword –Keywords in web pages Link –Web pages’ link Concept –Consists of a number of keywords and links nature painting Aristotle Andersen cartoon culture Leonardo Concept : Philosophy Concept: Literature Concept: Art Link a Link b

October 1, 2015Waseda University7 Full Bayesian Estimation Ð is a data collection of concept d t is the current number of click times of a concept d f is the current number of click times that a concept not be clicked α t is the history number of click times of a concept α f is the history number of click times that a concept not be clicked

October 1, 2015Waseda University8 Gradual Adaption Model Web Documents Concept KB Access Logs Concept Analyser Probability Estimator Estimation Base Input Gradual Adaption Recommender Short Medium Long Remarkable / Exceptional Matchin g Search Query Search Click Off-line On-line

October 1, 2015Waseda University9 Gradual Adaption Model We divide users’ interests into three terms of short, medium, long periods, and by remarkable, exceptional categories. This model is an adaptive one. –It can adapt to a transition of users’ information access behaviors. In the model, training is not needed, since the model uses Full Bayesian Estimation that has a learning function.

October 1, 2015Waseda University10 Gradual Adaption Model Web Documents Concept KB Access Logs Concept Analyser Probability Estimator Estimation Base Input Gradual Adaption Recommender Short Medium Long Remarkable / Exceptional Matchin g Search Query Search Click Off-line On-line

October 1, 2015Waseda University11 Simulation and Evaluation Environment –Java, Tomcat, MySQL, and Nekohtml Data –Wikipedia on DVD Version 0.5 more than 2000 web pages that belong to more than 180 concepts

October 1, 2015Waseda University12 Simulation and Evaluation Short period (such as 7 days / 1 week) –Test case This case is a user who has two interests, and these interests are affected by some factors easily. The expectation is that there is a possibility that the probability of the relation concept can change hugely in short or medium period, but not in long period. Two concepts of “Art” and “Artists” are assumed to be used, and the number of clicks is dynamically varying. –Test result The movement of the concept’s rate changing frequently. In some days, the probability of concepts in short period is bigger than long period.

October 1, 2015Waseda University13 Simulation and Evaluation Medium period (such as 30 days / 1 month) –Test case This case is a user who has a temporary interest. The user access the concept of temporary interest sometime. The expectation is that this concept ought to keep a low rate in the three periods. One concept “Philosophers” is assumed to be used per three days, –Test result The change is becoming smaller. But the probability of concepts in short period is bigger than medium period in some days.

October 1, 2015Waseda University14 Simulation and Evaluation Long period (such as 90 days / 3 months) –Test case This case is a user who has a long-term interest. The expectation is that the probability of the interested concept ought to keep a high rate in long period. One concept “Philosophical thought movements” is assumed to be used everyday, –Test result The change becomes quite stable. There is no big change in the long period.

October 1, 2015Waseda University15 Conclusion In this study, we have proposed a gradual adaption model (GAM) for estimation of user information access behavior. The three periods of GAM can correctly distinguish long-term and temporary interest of users even if has no system training.

October 1, 2015Waseda University16 Future Works To set more different patterns for short, medium and long periods to find more reasonable ones. To evaluate the proposed model with users' involvement. To compare our proposed approach with other related recommendation models.

October 1, 2015Waseda University17 Thank you for your attention.