Component-specific usability testing Dr Willem-Paul Brinkman Lecturer Department of Information Systems and Computing Brunel University
Topics Introduction Whether and how the usability of components can be tested empirically. -Testing different versions of component -Testing different components Whether and how the usability of components can be affected by other components. -Consistency -Memory load
Introduction Component-Based Software Engineering Empirical Usability Testing
Layered communication
Layered Protocol Theory (Taylor, 1988) = 15+23= Add ProcessorEditor Control results Control equation UserCalculator
Usability Testing Aim to evaluate the usability of a component based on the message exchange between a user and a specific component
Two paradigms Multiple versions testing paradigm Single version testing paradigm Manage Support Re-use Create
Test Procedure Normal procedures of a usability test User task which requires interaction with components under investigation Users must complete the task successfully
Component-specific component measures Number of messages received The effort users put into the interaction Objective performance Perceived ease- of-use Perceived satisfaction Component Control process Control loop
Component-specific component measures Increasing the statistical power Objective performance Perceived ease- of-use Perceived satisfaction y y 1 = x k + k y 2 = x m + m k = k component + k rest m = m component + m rest Assumption k rest m rest messages keys
Component-specific component measures Objective performance Perceived ease-of-use Perceived satisfaction Component-specific questionnaire increase the statistical power because they help help the users to remember their control experience with a particular interaction component
Component-specific component measures Objective performance Perceived ease-of-use Perceived satisfaction Perceived Usefulness and Ease-of-use questionnaire (David, 1989), 6 questions, e.g. Learning to operate [name] would be easy for me. I would find it easy to get [name] to do what I want it to do. UnlikelyLikely
Component-specific component measures Objective performance Perceived ease- of-use Perceived satisfaction Post-Study System Usability Questionnaire (Lewis, 1995) The interface of [name] was pleasant. I like using the interface of [name].Strongly disagreeagree
Experimental validation 80 users 8 mobile telephones 3 components were manipulated according to Cognitive Complexity Theory (Kieras & Polson, 1985) 1.Function Selector 2.Keypad 3.Short Text Messages
Architecture Mobile telephone Send Text Message Send Text Message Function Selector Function Selector Keypad
Experimental validation Functions Selector Broad/shallow Narrow/deep
Experimental validation Keypad Repeated-Key Method “L” Modified-Model-Position method “J”
Experimental validation Send Text Message Simple Complex
Results Average probability that a measure finds a significant (α = 0.05) effect for the usability difference between the two versions of FS, STM, or the Keypad components
Wilcoxon Matched-Pairs Signed-Ranks Tests between the number of correct classification made by discriminant analyses on overall and component-specific measures Results
Topics Introduction Whether and how the usability of components can be tested empirically. -Testing different versions of component -Testing different components Whether and how the usability of components can be affected by other components. -Consistency -Memory load
Two paradigms Multiple versions testing paradigm Single version testing paradigm Manage Create Support Re-use
Testing Different Components Component specific objective performance measure: 1.Messages received + Weight factor A common currency 2.Compare with ideal user A common point of reference Usability of individual components in a single device can be compared with each other and prioritized on potential improvements
Click {1} Click {1} Call <>{2} Set <Fill colour red, no border> {7} Right Mouse Button Menu Properties Assigning weight factors to represent the user’s effort in the case of ideal user
Total effort value Total effort = MR i.W MR i.W : Message received. Weight factor Click {1} Click {1} Call <>{2} Right Mouse Button Menu Properties 5 2 = 7 + 2
Assigning weight factors in case of real user Correction for inefficiency of higher and lower components Visual Drawing Objects Properties Right Mouse Button Menu
Assigning weight factors in case of real user Assign weight factors as if lower components operate optimal Visual Drawing Objects Properties Right Mouse Button Menu Inefficiency of lower level components: need more messages to pass on a message upwards than ideally required
Assigning weight factors in case of real user Visual Drawing Objects Properties Right Mouse Button Menu Inefficiency of higher level components: more messages are requested than ideally required UE: User effort MR i.W : Message received. Weight factor #MSU real :Number of messages sent upward by real user #MSU ideal :Number of messages sent upward by ideal user MR i.W #MSU real #MSU ideal UE =
Ideal User versus Real User Extra User Effort = User Effort - Total effort The total effort an ideal user would make The total effort a real user made The extra effort a real user made Calculate for each component: Prioritize
Experimental validation 40 users 40 mobile telephones 2 components were manipulated (Keypad only Repeated-Key Method) 1.Function Selector 2.Short Text Messages
Results Mobile phones Extra User Effort
Results MeasureFunction Selector Send Text Message Objective Extra keystrokes0.64**0.44** Task duration0.63**0.39** Perceived Overall ease-of-use-0.43**-0.26* Overall satisfaction-0.25*-0.22 Component-specific ease-of-use-0.55**-0.34** Component-specific satisfaction-0.41**-0.37** Partial correlation between extra user effort regarding the two components and other usability measures *p. <.05. **p. <.01.
Comparison with other evaluation methods Overall measures Sequential Data analysis GOMS Thinking-aloud, Cognitive Walkthrough and heuristic evaluation Example: Keystrokes, task duration, overall perceived usability Relatively easy to obtain Unsuitable to evaluate components
Overall measures Sequential Data analysis GOMS Thinking-aloud, Cognitive Walkthrough and heuristic evaluation Based only on lower-level events Pre-processing: selection, abstraction, and re-coding Relation between higher- level component and compound message less direct Components’ status not recorded Comparison with other evaluation methods
Help to understand the problem Only looking at error-free task execution Considers the system only at the lowest-level layer Overall measures Sequential Data analysis GOMS Thinking-aloud, Cognitive Walkthrough and heuristic evaluation Comparison with other evaluation methods
Quicker Evaluator effect (reliability) Overall measures Sequential Data analysis GOMS Thinking-aloud, Cognitive Walkthrough and heuristic evaluation Comparison with other evaluation methods
Topics Introduction Whether and how the usability of components can be tested empirically. -Testing different versions of component -Testing different components Whether and how the usability of components can be affected by other components. -Consistency -Memory load
Consistency problems
Consistency Activation of the wrong mental model
Consistency experiments 48 Users Used 3 applications: 1.4 Room Thermostats 2.4 (2 Web-Enabled TV sets 2 Web Page Layouts) 3.4 Applications (2 Timers 2 Application domains)
Within one layer
Within one layer – Experimental Design Day time Temperature Night time Temperature Moving Pointer Moving Scale Moving Pointer Moving Scale
Within on layer - Results
Between layers Web-enable TV set Browser versus Web pages
Between layers - Page Layout List layout Matrix layout
Between layers - Browser
Between layers – Experimental Design Web Page Version Browser List Matrix Linear Plane
Between layers - Results
Application domain
Between Application domain – Experimental Design Application Timer Alarm radio Microwave Mechanical alarm Hot dish
Application domain - Results
Topics Introduction Whether and how the usability of components can be tested empirically. -Testing different versions of component -Testing different components Whether and how the usability of components can be affected by other components. -Consistency -Memory load
Mental effort problems
Mental Effort - Calculator ProcessorEditor Control results Control equation UserCalculator
Memory load – Experimental Design Equation Editor Easy Difficult Large display Small display
Mental Effort - Heart-rate variability
Mental Effort - Control of higher- level layer
Conclusions Whether and how the usability of components can be tested empirically. -Testing different versions of component : more powerful -Testing different components : prioritized on potential improvements Whether and how the usability of components can be affected by other components. -Consistency : components on the same or on higher- level layers can activate wrong mental models -Memory load : lower-level interaction affects higher- level interaction strategy
Questions Thank you for your attention