Measuring Learning During Search: Differences in Interactions, Eye-Gaze, and Semantic Similarity to Expert Knowledge. Florian Groß, 28 May 2019.

Presentation transcript:


Outline
- Introduction
- Methodology
- Results
- Discussion

Introduction
- Information seeking: "a process in which humans purposefully engage in order to change their state of knowledge" (Marchionini)
- Prior work ties information search to learning
- Learning is considered an integral part of the information search process
- How to measure learning?
- Learning is measured as the change in verbal knowledge from before to after a search session
Notes: Marchionini: information search is driven by higher-level human needs. Information seeking changes the state of the searcher's knowledge. Measuring learning is not new in IR. The interest in this work is in the learning that takes place at the remembering and factual-knowledge level.

Background
- Learning outcomes from search are a good evaluation measure for IR systems
- Measuring them typically requires collecting explicit responses from users
- Which techniques can measure learning?
- Which techniques work outside the laboratory?
- Eye tracking as a way to measure change in a searcher's learning
Notes: On point 2: assessing learning … On eye tracking: prior work used eye gaze to assess differences in the levels of users' domain knowledge.

Goal
Construct learning measures that …
- require minimal input from users
- do not require users to answer topic-specific comprehension tests
- do not expose users to the search topic in the pre-task assessment
- attempt to assess a user's true knowledge level with minimal scope for guessing

Approach
- 30 participants perform searches on the web
- Two types of learning measure
  - Topic-independent measure
  - Measure based on semantic similarity with expert vocabulary
- Expectation: searchers who invest more effort and consume more result pages learn more

Experimental Design
- Participants were asked to search for health-related information on the internet
- Pre-screened participants
  - Native-level English familiarity
  - Non-expert topic familiarity
  - Uncorrected 20/20 vision (to minimize problems with eye tracking)
- All participants reported
  - Using the internet for over an hour per day
  - Daily usage of Google
Notes: The majority of the participants had been using Google for longer than seven years and considered themselves proficient in searching for information online.

Task
- Each participant performed two search tasks in counterbalanced order
- Tasks were on health-related topics
- Simulated work-task approach -> triggers a realistic information need
Notes: Participants were asked to find useful information for helping a family member and a friend. The tasks were designed to be complex and contained multiple facets.
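The counterbalancing mentioned above can be sketched as follows. This is only an illustration under assumptions: the slides do not say how orders were assigned, so the alternating scheme and the names `task_A` and `task_B` are hypothetical.

```python
from typing import List, Tuple

def counterbalanced_orders(
    n_participants: int,
    tasks: Tuple[str, str] = ("task_A", "task_B"),  # hypothetical task labels
) -> List[Tuple[str, str]]:
    """Alternate the order of two tasks across participants.

    Assumption: simple alternation by participant index, so that each
    order is used equally often when n_participants is even.
    """
    first, second = tasks
    orders = []
    for i in range(n_participants):
        # Even-indexed participants see A then B, odd-indexed see B then A.
        orders.append((first, second) if i % 2 == 0 else (second, first))
    return orders

orders = counterbalanced_orders(30)
```

With 30 participants this gives 15 sessions per order, which balances out any effect of doing one task before the other.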

Interface
- Customized version of Google
  - No advertisements
  - Search engine result page (SERP) showed seven results per page -> eye fixations tracked on each individual result
- Browser with additional sidebar
  - Current search task
  - Bookmarking section: save URLs of relevant pages
  - Notes section: note relevant text from web pages
- All other (content) pages shown in their true form
- Recorded interaction with the computer (eye gaze, mouse clicks, keystrokes)

Procedure (figure)
- Green/yellow patches in (a) are eye-tracking fixation heatmaps
- The circle with a number in (b) is an eye fixation with its duration
Notes: Each experimental session started with the assessment of the participant's working memory capacity and health literacy. Next, the participants performed a training task to familiarize themselves with the custom user interface (bookmarking and note-taking) and the study procedure. Free-recall prompt: "List as many words or phrases as you can on the topic of the search task."

Procedure
- Assessment of working memory capacity and health literacy
- Training task to familiarize with the interface
- Pre-task knowledge assessment
- Searching for information using Google
  - Visiting search result webpages
  - Bookmarking webpages that contain relevant information
  - Taking notes (not available in post-task)
- Assessing knowledge change through a post-task questionnaire
- Measuring workload
Notes: Literacy = education (German: Bildung); health literacy was assessed because the task was on a medical topic. Workload: NASA-TLX (Task Load Index). Working memory capacity: memory span task. Health literacy: eHealth Literacy Scale.

Measures
- All measures calculated for each user-task pair
- Knowledge change
  - Free recall: as many words/phrases on the topic as possible
  - Relative difference in the number of items before and after the task
- Vocabulary of expert words
  - Semantic similarity of pre-/post-task recall with the expert vocabulary
  - Difference and ratio between post-task and pre-task similarity with the expert vocabulary
  - pre_exp_sim = sim(pre_task, expert), post_exp_sim = sim(post_task, expert)
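The knowledge-change measures listed above might be computed roughly as in the sketch below. Several parts are assumptions rather than the study's method: `sim` is implemented here as a plain bag-of-words cosine similarity (the backup slide mentions angular similarity, and the original semantic representation is not described in the slides), and the relative difference is taken as (post - pre) / pre. The example word lists are invented.

```python
import math
from collections import Counter
from typing import List

def bow(items: List[str]) -> Counter:
    """Bag of words: lowercased word counts over all recalled items."""
    return Counter(word for item in items for word in item.lower().split())

def sim(a: List[str], b: List[str]) -> float:
    """Cosine similarity between two bags of words (stand-in for the
    study's semantic similarity). Returns 0.0 if either side is empty."""
    va, vb = bow(a), bow(b)
    dot = sum(va[w] * vb[w] for w in va)
    norm = math.sqrt(sum(c * c for c in va.values())) * \
           math.sqrt(sum(c * c for c in vb.values()))
    return dot / norm if norm else 0.0

def relative_item_change(pre: List[str], post: List[str]) -> float:
    """Assumed definition: (post - pre) / pre, with the denominator
    clamped to 1 so an empty pre-task recall does not divide by zero."""
    return (len(post) - len(pre)) / max(len(pre), 1)

# Invented example data for one user-task pair.
pre_task = ["cough", "fever"]
post_task = ["cough", "fever", "inflammation", "antibiotics"]
expert = ["inflammation", "antibiotics", "fever", "bronchitis"]

pre_exp_sim = sim(pre_task, expert)
post_exp_sim = sim(post_task, expert)
sim_difference = post_exp_sim - pre_exp_sim
sim_ratio = post_exp_sim / pre_exp_sim if pre_exp_sim else float("inf")
item_change = relative_item_change(pre_task, post_task)
```

Neither measure requires a topic-specific comprehension test: the first only counts recalled items, and the second compares free recall against a fixed expert vocabulary.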

Measures
- Eye tracking
  - Reflects the process of reading
  - Calculated on SERPs, content pages, and relevant content pages
  - Eye fixation = reading
  - Eye regression = moving back to fixate on a previous word
  - Calculate total duration, count of fixations, number of eye regressions
- Search interaction
  - Number of visited SERPs and content pages
  - Dwell time per page type, number of queries / query reformulations
- -> Search effort = search interaction + acquiring text from web pages
Notes: Relevant content pages were included because participants were expected to learn most from such pages. Search interaction: visiting SERPs/content pages, entering queries, clicking links. Acquiring text: number and duration of reading fixations, length of reading sequences. Search effort is operationalized as a two-part, multiple-component construct composed of the above two groups of measures, search interaction (SI) and eye tracking (ET).
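As a rough illustration of the eye-tracking aggregates named above (total duration, fixation count, regression count), the sketch below works on a simplified fixation log. The `Fixation` record and its fields are hypothetical; a real eye-tracking pipeline would first have to map raw gaze samples to words, which is not shown here.

```python
from dataclasses import dataclass
from typing import Dict, List

@dataclass
class Fixation:
    word_index: int      # position of the fixated word in reading order (assumed field)
    duration_ms: float   # how long the fixation lasted

def eye_measures(fixations: List[Fixation]) -> Dict[str, float]:
    """Aggregate reading measures for one page.

    A regression is counted whenever a fixation lands on a word that
    comes earlier in the text than the previously fixated word.
    """
    regressions = sum(
        1
        for prev, cur in zip(fixations, fixations[1:])
        if cur.word_index < prev.word_index
    )
    return {
        "fixation_count": len(fixations),
        "total_duration_ms": sum(f.duration_ms for f in fixations),
        "regression_count": regressions,
    }

# Invented example: the reader moves forward, jumps back once, then continues.
fixations = [
    Fixation(0, 200), Fixation(1, 180), Fixation(2, 220),
    Fixation(1, 150), Fixation(3, 210),
]
measures = eye_measures(fixations)
```

The same function could be applied separately to SERPs, content pages, and relevant content pages to obtain the per-page-type values described on the slide.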

Results
- Knowledge-change (KC) measure split into HI and LO groups based on a median split of the score
- Higher knowledge-change score
  - Less frequent / uncommon words in queries
  - Less reading on webpages
  - Reported higher mental workload
- No difference between groups
  - Search interactions
  - Working memory capacity
  - Online health literacy
Notes: The expectation was that searchers who invest more effort and consume more result pages learn more -> the result was higher KC with less reading.
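The median split used to form the HI and LO groups can be sketched as below. The slides do not say how scores exactly equal to the median were handled; assigning them to LO is an assumption here, and the scores are invented.

```python
from statistics import median
from typing import Dict

def median_split(scores: Dict[str, float]) -> Dict[str, str]:
    """Label each user-task pair HI or LO relative to the median score.

    Assumption: scores strictly above the median are HI, all others LO.
    """
    cut = median(scores.values())
    return {key: ("HI" if value > cut else "LO") for key, value in scores.items()}

# Invented knowledge-change scores keyed by a user-task identifier.
kc_scores = {"p01_t1": 0.2, "p02_t1": 1.5, "p03_t1": 0.8, "p04_t1": 1.1}
groups = median_split(kc_scores)
```

Group comparisons (for example on reading time or reported workload) would then be run between the pairs labeled HI and those labeled LO.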

Limitations and future work
- Limitations
  - Using just two tasks of a similar nature
  - Performing data analysis at the task level
  - Uniform group of participants (recruitment?)
  - Short time frame of the experimental session
- Future work
  - Use a wider range of tasks
  - More diverse participant samples
  - Additional individual-difference tests
  - Assessment of verbal skills
  - Multi-session study to measure learning over a longer period of time
Notes: Assessment = evaluation (German: Beurteilung). 16 females; mean age 24.5 years. The authors did not mention how they recruited their participants; they may have had a bigger study in mind but wanted to check results on a smaller study first.

Discussion

Backup
- (crowdsourced and evaluated with a doctor)
- Angular similarity
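For reference, one common definition of angular similarity between two vectors u and v is given below. The slide does not state which variant was used, so this is an assumption; for vectors with only non-negative components, a rescaled form with 2/π in place of 1/π is also in use.

```latex
\cos\theta = \frac{u \cdot v}{\lVert u \rVert \, \lVert v \rVert},
\qquad
\operatorname{sim}_{\mathrm{ang}}(u, v) = 1 - \frac{\arccos(\cos\theta)}{\pi}
```

Unlike raw cosine similarity, this value is linear in the angle between the vectors, so differences between two already-similar texts are not compressed near 1.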