Recognizing User Interest and Document Value from Reading and Organizing Activities in Document Triage Rajiv Badi, Soonil Bae, J. Michael Moore, Konstantinos.

Presentation transcript:

Recognizing User Interest and Document Value from Reading and Organizing Activities in Document Triage
Rajiv Badi, Soonil Bae, J. Michael Moore, Konstantinos Meintanis, Anna Zacchi, Haowei Hsieh, Frank Shipman and Cathy Marshall
Center for the Study of Digital Libraries & Department of Computer Science, Texas A&M University
Microsoft Corporation

What is Document Triage? 2/16
● People quickly evaluate a large set of documents, selecting documents to read
● People organize them into a personal information collection
● People re-read the documents, progressively refining the organization
● Knowledge forms incrementally as initial understanding becomes more refined over time
A specific form of information collecting, reading and organizing

Prior Document Triage Study (2004) 3/16
● Task: as a reference librarian, organize the documents to help a teacher prepare a set of lessons on ethnomathematics
● 24 subjects
● 40 documents from NSDL & Google searches
● Organizing tool: Visual Knowledge Builder (VKB)
● Reading tool: Internet Explorer (IE)
● Logged reading & editing events
● Asked subjects to select the five most & five least useful documents

Initial Document List 4/16
[Figure labels: document object; collection; metadata (page title, page URL, summary); NSDL search; Google search; system-generated visualization based on metadata]

Document in a Web Browser 5/16

Final Organization Sample 6/16
[Figure labels: categories (collections); background color; border color; border thickness]

Proactive Support for Document Triage 7/16
1. Recognizing user interest and document value
2. Representing user interests
3. Recognizing documents of interest
4. Visualizing interest information
Motivations

Recognizing User Interest (1) 8/16
● Explicit and implicit interest indicators
● Correlation between reading activity and user interest
  ● Reading time, # of visits, # of scrolls, …
● Correlation between organizing activity and user interest
  ● Resize, move, delete …
● Correlation between document attributes and user interest
  ● # of characters, # of links, # of images …

Recognizing User Interest (2) 9/16
● Prior work has focused on a single application as the source for interest indicators
● Document triage occurs in the context of multiple applications
● Interest profile is the basis for determining, sharing and storing implicit interest
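The idea of one interest profile fed by several applications can be sketched as follows. This is a minimal illustration only, not the Interest Profile Manager itself: the class names, the event schema, the application labels and the indicator names are all hypothetical.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class InterestEvent:
    """One logged observation (hypothetical schema, not the authors')."""
    app: str        # reporting application, e.g. "browser" or "vkb"
    doc_id: str     # document identifier shared across applications
    indicator: str  # e.g. "reading_time", "scrolls", "resizes"
    value: float    # amount to add to the running total


class InterestProfile:
    """Accumulates indicator totals per document, whatever the source app."""

    def __init__(self) -> None:
        self._totals = defaultdict(lambda: defaultdict(float))

    def record(self, event: InterestEvent) -> None:
        # Events from different applications land in the same per-document record.
        self._totals[event.doc_id][event.indicator] += event.value

    def indicators(self, doc_id: str) -> dict:
        # Return a plain copy; unknown documents yield an empty dict.
        return dict(self._totals.get(doc_id, {}))


profile = InterestProfile()
profile.record(InterestEvent("browser", "doc-1", "reading_time", 42.0))
profile.record(InterestEvent("browser", "doc-1", "scrolls", 3))
profile.record(InterestEvent("vkb", "doc-1", "resizes", 1))
profile.record(InterestEvent("browser", "doc-1", "scrolls", 2))
print(profile.indicators("doc-1"))
```

Keying the record on the document rather than on the application is the design choice that lets reading and organizing indicators be combined later.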

Interest Profile Manager 10/16

Data Analysis (1) 11/16
Document Attributes: # of characters, # of links, # of images
Reading Activity: reading time, # of clicks, # of text selections, # of scrolls, # of scrolling direction changes, time spent in scrolling, scroll offset, # of document accesses
Organizing Activity: # of object moves, # of object resizes, # of object deletions, # of content changes, # of background color changes, # of border color changes, # of border width changes

Data Analysis (2) 12/16
● Identified correlations of user activity and document attributes with user interest
● Found meaningful interest indicators in user activity
  ● Reading time, # of scrolls, # of resize events …
● Found meaningful interest indicators in document attributes
  ● # of characters, # of links, # of images …
● No single indicator dominantly identifies user interest
● Significant differences between individual styles
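As a rough sketch of how one indicator might be checked against user ratings, the snippet below computes a Pearson correlation coefficient. The slides do not say which correlation measure was used, so Pearson is an assumption here, and the sample values are made up for illustration.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


# Illustrative, made-up values: one entry per document.
reading_time = [5.0, 40.0, 12.0, 90.0, 8.0]  # seconds spent reading
user_rating = [0.0, 1.5, 1.0, 2.0, 0.5]      # explicit rating on a 0..2 scale

r = pearson(reading_time, user_rating)
print(f"reading time vs. rating: r = {r:.2f}")
```

Running the same check per subject rather than on pooled data would be one way to expose the differences between individual styles noted above.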

Interest Models 13/16
● Models to estimate average interest in documents
Model name: Data
Statistical Model
  Reading activity model: Reading activity
  Organizing activity model: Organizing activity
  Combined Model: Reading & Organizing activity
Qualitative Model: Reading & Organizing activity

Evaluation (1) 14/16
● The same task and topic as in the prior study in 2004
● 16 subjects
● 40 documents from NSDL & Google searches
● Asked subjects to select five most & least useful documents
● Scaled to a continuous value between 0 (least useful) and 2 (most useful)
● Calculated the absolute value of the difference between the explicit user rating and each model's predicted rating
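The error measure in the last bullet can be sketched as below. The slides do not specify how selections were scaled to the 0 to 2 range, so the mapping used here (least useful = 0, unselected = 1, most useful = 2) is an assumption, and the document IDs and predicted values are invented for illustration.

```python
def explicit_rating(doc_id, most_useful, least_useful):
    """Map a subject's selections onto the 0..2 scale (assumed mapping)."""
    if doc_id in most_useful:
        return 2.0
    if doc_id in least_useful:
        return 0.0
    return 1.0  # assumption: documents the subject did not select sit mid-scale


def absolute_errors(explicit, predicted):
    """Per-document |explicit rating - predicted rating|."""
    return {doc: abs(explicit[doc] - predicted[doc]) for doc in explicit}


# Illustrative data only.
docs = ["d1", "d2", "d3"]
most_useful = {"d1"}
least_useful = {"d3"}

explicit = {d: explicit_rating(d, most_useful, least_useful) for d in docs}
predicted = {"d1": 1.5, "d2": 1.0, "d3": 0.5}  # hypothetical model output

errors = absolute_errors(explicit, predicted)
mean_error = sum(errors.values()) / len(errors)
print(errors, round(mean_error, 3))
```

Averaging the per-document errors gives a single figure per model, which makes the models directly comparable.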

Evaluation (2) 15/16
● The combined and qualitative models, which use both reading and organizing activity, perform better than the others

Conclusion 16/16
● Predictive models based on user activity collected from multiple applications have been built
● Utilizing user activity from multiple applications rather than a single application can improve the accuracy of prediction
● A software infrastructure, the Interest Profile Manager, has been developed to support this result