Slide 1: Evaluation and metrics: Measuring the effectiveness of virtual environments
Doug Bowman, Virginia Tech (edited by C. Song)

Slide 2: 11.2.2 Types of evaluation
- Cognitive walkthrough
- Heuristic evaluation
- Formative evaluation
  - Observational user studies
  - Questionnaires, interviews
- Summative evaluation
  - Task-based usability evaluation
  - Formal experimentation
- Sequential evaluation
- Testbed evaluation

Slide 3: 11.5 Classifying evaluation techniques
Evaluation techniques fall into a 2x2 classification:
- Generic: qualitative or quantitative
- Application-specific: qualitative or quantitative

Slide 4: 11.4 How VE evaluation is different
- Physical issues
  - The user cannot see the physical world while wearing an HMD
  - Think-aloud protocols are incompatible with speech input
- Evaluator issues
  - The evaluator can break the user's sense of presence
  - Multiple evaluators are usually needed

Slide 5: 11.4 How VE evaluation is different (cont.)
- User issues
  - Very few expert users exist
  - Evaluations must include rest breaks to avoid possible sickness
- Evaluation type issues
  - Lack of heuristics/guidelines
  - Choosing independent variables is difficult

Slide 6: 11.4 How VE evaluation is different (cont.)
- Miscellaneous issues
  - Evaluations must focus on lower-level entities (interaction techniques, ITs) because of the lack of standards
  - Results are difficult to generalize because of differences between VE systems

Slide 7: 11.6.1 Testbed evaluation framework
- Main independent variables: interaction techniques (ITs)
- Other independent variables to consider:
  - Task (e.g., target known vs. target unknown)
  - Environment (e.g., number of obstacles)
  - System (e.g., use of collision detection)
  - User (e.g., VE experience)
- Performance metrics (dependent variables): speed, accuracy, user comfort, spatial awareness, ...
- Provides a generic evaluation context (see the factorial-design sketch below)
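
To make the framework concrete, the sketch below enumerates every cell of a small factorial design crossing ITs with the other independent variables. All factor names and levels are invented examples, not values prescribed by the slides.

```python
from itertools import product

# Hypothetical factors for a travel-technique testbed; each IT is crossed
# with the task and environment conditions.
factors = {
    "technique":   ["gaze-directed steering", "pointing", "teleport"],  # the ITs
    "task":        ["target known", "target unknown"],
    "environment": ["few obstacles", "many obstacles"],
}

# Every combination of levels is one experimental cell; participants run each
# cell while speed, accuracy, comfort, etc. are recorded as dependent variables.
cells = [dict(zip(factors, levels)) for levels in product(*factors.values())]
print(f"{len(cells)} cells in the full factorial design")
print(cells[0])
```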

Slide 8: Testbed evaluation

Slide 9: Taxonomy
- Establish a taxonomy of interaction techniques for the interaction task being evaluated (see the sketch below)
- Example task: changing an object's color
  - Three subtasks: selecting the object, choosing a color, applying the color
  - Two possible technique components (TCs) for choosing a color:
    - Changing the values of R, G, and B sliders
    - Touching a point within a 3D color space
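
One natural way to represent such a taxonomy in code is as a mapping from subtasks to candidate technique components, where a complete technique picks one component per subtask. The slide supplies only the color-choosing TCs; the selection and application components below are hypothetical placeholders.

```python
# Subtask -> candidate technique components (TCs). The color-choosing TCs come
# from the slide's example; the other components are invented placeholders.
taxonomy = {
    "select object": ["ray-casting", "virtual hand"],
    "choose color":  ["RGB sliders", "3D color-space touch"],
    "apply color":   ["button press", "gesture"],
}

def make_technique(choices):
    """Assemble a complete technique: exactly one known TC per subtask."""
    missing = set(taxonomy) - set(choices)
    if missing:
        raise ValueError(f"no component chosen for: {missing}")
    for subtask, tc in choices.items():
        if tc not in taxonomy.get(subtask, []):
            raise ValueError(f"{tc!r} is not a known TC for {subtask!r}")
    return choices

slider_technique = make_technique({
    "select object": "ray-casting",
    "choose color":  "RGB sliders",
    "apply color":   "button press",
})
```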

Slide 10: Outside factors
- A user's performance on an interaction task may depend on a variety of factors, in four categories:
  - Task: distance to be traveled, size of the object to be manipulated
  - Environment: number of obstacles, level of activity or motion
  - User: spatial awareness, physical attributes (arm length, etc.)
  - System: lighting model, mean frame rate, etc.

Slide 11: Performance metrics
- Quantitative measures of human performance: speed, accuracy
- More subjective performance values: ease of use, ease of learning, user comfort
- Metrics tied to the user's senses and body are user-centric performance measures

Slide 12: Testbed evaluation
- The final stage in the evaluation of interaction techniques for 3D interaction tasks
- Generic, generalizable, and reusable evaluation through the creation of testbeds
- Testbeds are environments and tasks that:
  - Involve all important aspects of a task
  - Evaluate each component of a technique
  - Consider outside influences on performance
  - Include multiple performance measures

Slide 13: Application and generalization of results
- Testbed evaluation produces models that characterize the usability of an interaction technique for the specified task
- Usability is expressed in terms of multiple performance metrics with respect to various levels of the outside factors, yielding a performance database (DB)
- More information is added to the DB each time a new technique is run through the testbed
- To choose interaction techniques for an application appropriately, one must understand the application's interaction requirements
- The performance results from testbed evaluation can then be used to recommend interaction techniques that meet those requirements (see the recommendation sketch below)
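
A minimal sketch of what querying such a performance DB might look like: testbed scores per technique and metric, plus an application profile that weights the metrics it cares about. The techniques, metric names, and all numbers are invented.

```python
# Hypothetical performance DB: (technique, metric) -> normalized testbed score.
perf_db = {
    ("ray-casting",  "speed"): 0.9, ("ray-casting",  "accuracy"): 0.6,
    ("virtual hand", "speed"): 0.7, ("virtual hand", "accuracy"): 0.9,
}

def recommend(requirements):
    """Rank techniques by the application's weighted metric requirements."""
    techniques = {t for t, _ in perf_db}
    score = lambda t: sum(w * perf_db[(t, m)] for m, w in requirements.items())
    return max(techniques, key=score)

# An accuracy-critical application weights accuracy over speed.
print(recommend({"speed": 0.3, "accuracy": 0.7}))  # -> "virtual hand"
```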

Slide 14: 11.6.2 Sequential evaluation
- Traditional usability engineering methods
- Iterative design/evaluation
- Relies on scenarios and guidelines
- Application-centric

Slide 15: 11.3 When is a VE effective?
- Users' goals are realized
- User tasks are done better, more easily, or faster
- Users are not frustrated
- Users are not uncomfortable

Slide 16: 11.3 How can we measure effectiveness?
- System performance
- Interface performance / user preference
- User (task) performance
- All three are interrelated

Slide 17: Effectiveness case studies
- Watson experiment: how system performance affects task performance
- Slater experiments: how presence is affected
- Design education: task effectiveness

Slide 18: 11.3.1 System performance metrics
- Average frame rate (fps)
- Average latency/lag (msec)
- Variability in frame rate/lag
- Network delay
- Distortion
(see the metric-computation sketch below)
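
As a sketch, the first three metrics can be derived from a log of per-frame timestamps. The timestamps below are invented, and true end-to-end latency would need external instrumentation rather than this frame-time proxy.

```python
import statistics

# Hypothetical per-frame render timestamps, in seconds.
frame_times = [0.000, 0.017, 0.034, 0.052, 0.090, 0.107]

# Inter-frame intervals drive all three metrics.
intervals = [b - a for a, b in zip(frame_times, frame_times[1:])]
avg_frame_ms   = statistics.mean(intervals) * 1000
variability_ms = statistics.stdev(intervals) * 1000
avg_fps        = 1000.0 / avg_frame_ms

# Note: lag is really motion-to-photon delay (tracker input to displayed
# image); frame time only gives a lower bound on it.
print(f"avg frame rate: {avg_fps:.1f} fps")
print(f"avg frame time: {avg_frame_ms:.1f} ms, variability: {variability_ms:.1f} ms")
```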

Slide 19: System performance
- Only important for its effects on user performance and preference:
  - Frame rate affects presence
  - Network delay affects collaboration
- Necessary, but not sufficient

Slide 20: Case studies: Watson
- How does system performance affect task performance?
- Vary average frame rate and variability in frame rate
- Measure performance on closed-loop and open-loop tasks
- See B. Watson et al., "Effects of variation in system responsiveness on user performance in virtual environments," Human Factors, 40(3), 403-414.

Slide 21: 11.3.3 User preference metrics
- Ease of use / ease of learning
- Presence
- User comfort
- Usually subjective (measured via questionnaires and interviews)

Slide 22: User preference in the interface
- UI goals: ease of use, ease of learning, affordances, unobtrusiveness, etc.
- Achieving these goals leads to usability
- Crucial for effective applications

Slide 23: Case studies: Slater
- Measures presence with questionnaires
- Assumes that presence is required for some applications
- Studies the effects of collision detection, physical walking, a virtual body, shadows, and movement
- See M. Slater et al., "Taking Steps: The influence of a walking metaphor on presence in virtual reality," ACM TOCHI, 2(3), 201-219.

Slide 24: User comfort
- Simulator sickness
- Aftereffects of VE exposure
- Arm/hand strain
- Eye strain

Slide 25: Measuring user comfort
- Rating scales and questionnaires (e.g., Kennedy's Simulator Sickness Questionnaire, SSQ; a scoring sketch follows)
- Objective measures (e.g., Stanney's work on measuring aftereffects)
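
For a sense of how SSQ-style scoring works: symptom items rated 0-3 are summed into three raw subscale scores, which standard weights convert into the reported values. The weights below follow Kennedy et al.'s published scoring as best I recall it, and the sample raw sums are invented; consult the original paper before using this with real data.

```python
# Sketch of Kennedy SSQ scoring. Inputs are raw subscale sums (each symptom
# rated 0-3, summed per subscale); weights are from Kennedy et al. (1993).
def ssq_scores(nausea_raw, oculomotor_raw, disorientation_raw):
    return {
        "nausea":         nausea_raw * 9.54,
        "oculomotor":     oculomotor_raw * 7.58,
        "disorientation": disorientation_raw * 13.92,
        "total": (nausea_raw + oculomotor_raw + disorientation_raw) * 3.74,
    }

# Invented raw sums for one participant after VE exposure.
print(ssq_scores(nausea_raw=3, oculomotor_raw=4, disorientation_raw=2))
```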

Slide 26: 11.3.2 Task performance metrics
- Speed / efficiency
- Accuracy
- Domain-specific metrics:
  - Education: learning
  - Training: spatial awareness
  - Design: expressiveness

Slide 27: Speed-accuracy tradeoff
- Subjects implicitly decide where on the speed-accuracy curve to operate
- Experimenters must explicitly look at particular points on the curve and manage the tradeoff (see the sketch below)
- (figure: tradeoff curve with speed and accuracy as its axes)
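
One common way to compare techniques at a single point on the curve is the inverse efficiency score (mean completion time divided by proportion correct). This is a general convention from HCI and psychophysics, not something the slides prescribe, and the trial data is invented.

```python
# Inverse efficiency score (IES) = mean time / proportion correct.
# Lower is better; it penalizes techniques that buy speed with errors.
trials = [
    {"time_s": 2.1, "correct": True},
    {"time_s": 1.8, "correct": True},
    {"time_s": 1.5, "correct": False},
    {"time_s": 2.4, "correct": True},
]

mean_time = sum(t["time_s"] for t in trials) / len(trials)
accuracy  = sum(t["correct"] for t in trials) / len(trials)
print(f"speed: {mean_time:.2f} s, accuracy: {accuracy:.0%}, IES: {mean_time / accuracy:.2f}")
```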

Slide 28: Case studies: learning
- Measure effectiveness by comparing learning against a control group
- Metric: a standard test
- Issue: time on task is not the same for all groups
- See D. Bowman et al., "The educational value of an information-rich virtual environment," Presence: Teleoperators and Virtual Environments, 8(3), June 1999, 317-331.

Slide 29: Aspects of performance
- System performance, interface performance, and task performance together determine effectiveness

Slide 30: 11.7 Guidelines for 3D UI evaluation
- Begin with informal evaluation
- Acknowledge and plan for the differences between traditional UI and 3D UI evaluation
- Choose an evaluation approach that meets your requirements
- Use a wide range of metrics, not just speed of task completion

Slide 31: Guidelines for formal experiments
- Design experiments with general applicability:
  - Generic tasks
  - Generic performance metrics
  - Easy mappings to applications
- Use pilot studies to determine which variables should be tested in the main experiment
- Look for interactions between variables; rarely will a single technique be the best in all situations (a sketch of an interaction test follows)
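
A sketch of testing for such an interaction with a two-way ANOVA, using statsmodels' formula API; the trial data, factor levels, and column names are all invented.

```python
import pandas as pd
import statsmodels.api as sm
from statsmodels.formula.api import ols

# Invented completion times for a 2x2 (technique x environment) design.
df = pd.DataFrame({
    "technique":   ["pointing"] * 4 + ["teleport"] * 4,
    "environment": ["sparse", "sparse", "dense", "dense"] * 2,
    "time_s":      [2.0, 2.2, 4.5, 4.8, 2.5, 2.4, 2.9, 3.1],
})

# The C(technique):C(environment) row of the ANOVA table tests whether the
# best technique depends on the environment, i.e., an interaction.
model = ols("time_s ~ C(technique) * C(environment)", data=df).fit()
print(sm.stats.anova_lm(model, typ=2))
```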

Slide 32: Acknowledgments
- Deborah Hix
- Joseph Gabbard

