Presentation is loading. Please wait.

Presentation is loading. Please wait.

Evaluation Howell Istance. Why Evaluate? n testing whether criteria defining success have been met n discovering user problems n testing whether a usability-related.

Similar presentations


Presentation on theme: "Evaluation Howell Istance. Why Evaluate? n testing whether criteria defining success have been met n discovering user problems n testing whether a usability-related."— Presentation transcript:

1 Evaluation Howell Istance

2 Why Evaluate? n testing whether criteria defining success have been met n discovering user problems n testing whether a usability-related specification has been met n comparing designs

3 Overview of approaches to evaluation n observation and monitoring usage n structured expert reviewing n usability engineering n contextual enquiry n experimental techniques

4 Things to test n Correct for target audience’s ability? n Meets the design concept? n Human-computer dialogue and graphics design principles adhered to? n Material / information correct? n Install correctly, and run, on target machines? Links work, etc? n How easy is it for target audience to use? n Does it meet the client’s expectations? n User feedback positive?

5 Approaches to Evaluation in Human-Computer Interaction n User-centred evaluation of software in general usually focuses on how well users can complete tasks n The idea of a ‘task’ is often less well-defined in multimedia applications

6 Context for evaluation n Free use - users asked to use the application without specific instructions u difficult to ensure users visit all parts of the application u may be unclear to the user what they should u best suited to small information presentations n structured use - users guided in their use of the application u ensures that critical parts of application tested in limited time

7 Observation and monitoring usage n direct and indirect observation n verbal protocolls n user opinions n software logging

8 Structured Direct Observation n give subjects a series of standard tasks to complete using a prototype n observe subject completing tasks under standardised conditions n data collection aimed at ensuring that qualitative descriptions of problems during task completion are captured n what problems are likely in data recording?

9 Standard tasks in structured direct observation n structure tasks into incremental difficulty (easy ones first) n have a clear policy on subject becoming stuck and providing help n have a reason for including each task (avoid unnecessary duplication) n ensure (all) functional areas of interface usage are covered n ensure tasks of sufficient complexity are included

10 Indirect observation - video n enables post-session debriefing 'talk-through' (post-event protocolls) n enables quantitative data to be extracted - e.g. part task timings n serves as a diary and visual record of problems n usually very time consuming to analyse n usability laboratories

11 Verbal protocols n means of enhancing direct observations n user articulates what they are thinking during task completion (think-aloud protocols) n but… u doing this can alter normal behaviour u subject likely to stop when undertaking complex cognitive activities u user may rationalise behaviour in post-event protocols n get subjects working in pairs - co- discovery can overcome some of these problems.

12 Collecting users opinions n interview and questionnaire n suited to both qualitative data and quantitative data collection n interviews u structured interviews (fixed sequence of questions) u semi structured (allows disgressions, but all questions covered) u flexible (exploration of topic governed by users views) n what are the advantages and disadvantages of these?

13 Questionnaires n contain closed questions (attitude scales) and open questions n pre- and post questionnaires obtain ratings on an issue before and after an design change n can be used to standardise attitude measurement of single subjects following direct observation n can be used to survey large user groups

14 Can you use the following edit commands? yes no don't know duplicate paste A simple checklist Types of rating scales

15 Multipoint checklist Rate the usefulness of the duplicate command on the following scale? very of no useful use

16 Likert Scale n statement of opinion to which the subject expresses their level of agreement Computers can simplify complex problems very much agree slightly neutral slightly disagree strongly agree agree disagree disagree

17 Caution! The help facility in system A is much better than the help facility in system B very much agree slightly neutral slightly disagree strongly agree agree disagree disagree what does 'strongly disagree' mean?

18 Semantic differential Scale uses a series of bi-polar adjectives and obtains ratings which respect to each Rate the Beauxarts drawing package on the following dimensions extremely quite slightly neutral slightly quite extremely easy difficult clear confusing fun dreary

19 Rank Order Place the following commands in order of usefulness (rank the most useful as 1, the least useful as 4) paste duplicate group clear

20 Do and Don'ts with Questionnaire evaluation n have a clear idea of what specifically you want information about and ensure there are questions that directly address these issues n don't risk subjects being demotivated u not interested in the questionnaire u questionnaire is too long n don't be lazy u focus questions to the specific interface u avoid 'not applicable' responses n provide specific task reference for questions

21 Possible procedures (free usage)... n Recruit participants representative of intended user group n prepare an introduction which you read n ask person to use/view the presentation until they have finished or up to a time limit

22 Possible procedures... n Observe u sequence of pages visited u behaviour - browse or read u pages which appear to interest or not interest person n Note problems person appears to have n give person a brief questionnaire n discuss what person liked/didn’t like about the presentation

23 Structured Expert Reviews n uses ‘experts’ in HCI and task domain to review design rather than subject-based testing n methods vary according to how the review is structured n two popular methods u heuristic evaluation u cognitive walkthrough

24 Heuristics in Heuristic Evaluation n use simple and natural language n speak the users language n minimise users memory load n be consistent n provide feedback n provide clearly marked n exits n provide shortcuts n provide good error n messages n prevent errors

25 S: Enter 1 for account information, 3 for transfers between accounts.. U: 3# (interrupts) S: Enter account to transfer from U: 1234567890# S: Enter account to transfer to U: # (default) S: Enter amount in cents U: 100000# S: From account number 1234567890 to account number primary account, transfer of 1000 dollars is to be made. Press 1 to confirm, 0 to cancel U: 1# S: You do not have access to use this function

26 S: Enter 1 for account information, 3 for transfers between accounts.. U: 3# (interrupts) S: Enter account to transfer from U: 1234567890# S: Enter account to transfer to U: # (default) S: Enter amount in cents U: 100000# S: From account number 1234567890 to account number primary account, transfer of 1000 dollars is to be made. Press 1 to confirm, 0 to cancel U: 1# S: You do not have access to use this function read number before menu item description (minimise users memory load)

27 S: Enter 1 for account information, 3 for transfers between accounts.. U: 3# (interrupts) S: Enter account to transfer from U: 1234567890# S: Enter account to transfer to U: # (default) S: Enter amount in cents U: 100000# S: From account number 1234567890 to account number primary account, transfer of 1000 dollars is to be made. Press 1 to confirm, 0 to cancel U: 1# S: You do not have access to use this function Avoid gap in menu numbers between 1 and 3 (Simple and Natural Language)

28 S: Enter 1 for account information, 3 for transfers between accounts.. U: 3# (interrupts) S: Enter account to transfer from U: 1234567890# S: Enter account to transfer to U: # (default) S: Enter amount in cents U: 100000# S: From account number 1234567890 to account number primary account, tansfer of 1000 dollars is to be made. Press 1 to confirm, 0 to cancel U: 1# S: You do not have access to use this function Error message appears too late (simple and Natural Dialogue

29 S: Enter 1 for account information, 3 for transfers between accounts.. U: 3# (interrupts) S: Enter account to transfer from U: 1234567890# S: Enter account to transfer to U: # (default) S: Enter amount in cents U: 100000# S: From account number 1234567890 to account number primary account, tansfer of 1000 dollars is to be made. Press 1 to confirm, 0 to cancel U: 1# S: You do not have access to use this function Error message is imprecise (precise and constructive error messages)

30 S: Enter 1 for account information, 3 for transfers between accounts.. U: 3# (interrupts) S: Enter account to transfer from U: 1234567890# S: Enter account to transfer to U: # (default) S: Enter amount in cents U: 100000# S: From account number 1234567890 to account number primary account, tansfer of 1000 dollars is to be made. Press 1 to confirm, 0 to cancel U: 1# S: You do not have access to use this function Do not require dollar amounts to entered in cents (speak the users language)

31 S: Enter 1 for account information, 3 for transfers between accounts.. U: 3# (interrupts) S: Enter account to transfer from U: 1234567890# S: Enter account to transfer to U: # (default) S: Enter amount in cents U: 100000# S: From account number 1234567890 to account number primary account, tansfer of 1000 dollars is to be made. Press 1 to confirm, 0 to cancel U: 1# S: You do not have access to use this function Use alternative and proper English phrase (speak users language)

32 Cognitive Walkthrough n developed on the basis of Cognitive Theory of Initial Learning n intended for systems where the user ‘guesses their way’ through an interaction sequence n the task is decomposed into paths to successful task completion consisting of individual actions n method provides u set of guide lines to support development u procedures and checklist for evaluation

33 Cognitive Walkthrough Checklist n Problems forming correct goals u failure to add goals u failure to drop goals u addition of spurious goals u premature loss of goals n problems identifying action u correct action does not match goal u incorrect actions match goals n problems performing action u physical difficulties u timeouts

34 Example of correct goal structure n Program video for timed recording u Press timed recording button u Set Stream F type stream number F press ‘timed recording’ button u Set start time F type start time (24 hour clock) F press ‘timed recording’ button u …..


Download ppt "Evaluation Howell Istance. Why Evaluate? n testing whether criteria defining success have been met n discovering user problems n testing whether a usability-related."

Similar presentations


Ads by Google