Usability Testing Lecture #8 - March 5th, 2009 213: User Interface Design and Development.

Presentation transcript:

Usability Testing Lecture #8 - March 5th, 2009 213: User Interface Design and Development

Today’s Outline 1) Planning a Usability Test 2) Think Aloud 3) Think Aloud Example 4) Performance Measurement

Usability Testing Test interfaces with real users! Basic process: –Set a goal - what do you want to learn? –Design some representative tasks –Identify a set of likely users –Observe the users performing the tasks –Analyze the resulting data

Conducting a Pilot Test Before unleashing your system and your testing scheme on unwitting users, it helps to pilot test your study. Iron out any kinks - either in your software or your testing setup. A pilot test can be conducted with design team members and other readily available people (at least one of them should be a potential user).

Selecting Test Users “Should be as representative as possible of the intended users” If testing with a small number of users, avoid outlier groups. If testing with a larger number of users, aim for coverage of all “personas”. Include novices, and probably experts too. It helps if users are already familiar with the underlying hardware (if it’s not part of your design).

Sources of Test Users Early adopters Students Retirees Paid volunteers … Be creative!

Human Subjects In many universities and research organizations, UI testing is treated with similar care as medical testing Requires filling out and submitting a Human Subjects approval form to the appropriate agency Important considerations include maintaining the anonymity of test users, and obtaining informed consent

STATEMENT OF INFORMED CONSENT If you volunteer to participate in this study, you will be asked to perform some tasks related to XXX, and to answer some questions. Your interactions with the computer may also be digitally recorded on video, audio and/or with photographs. This research poses no risks to you other than those normally encountered in daily life. All of the information from your session will be kept anonymous. We will not name you if and when we discuss your behavior in our assignments or any potential research publications. After the research is completed, we may save the anonymous notes for future use by ourselves or others. Your participation in this research is voluntary, and you are free to refuse to participate or quit the experiment at any time. Whether or not you choose to participate will have no bearing on your standing in any department of UC Berkeley. If you have questions about the research, you may contact X at Y, or by electronic mail at Z. You may keep a copy of this form for reference. If you accept these terms, please write your initials and the date here: INITIALS___________________ DATE___________________

How to Treat Users Train them if you will assume some basic skills (ex. using a mouse) Do not blame or laugh at the user Make it clear that the system is being tested, not the user Make the first task easy Inform users that they can quit anytime After the test, thank the user

Helping Users Decide in advance how much help you will provide (depending on whether you plan to measure performance). For the most part you should allow users to figure things out on their own, so tell them in advance that you will not be able to help during the test. If the user gets stuck and you aren’t measuring, give a few hints to get them going again. Terminate the test if the user is unhappy and not able to do anything. The user can always voluntarily end the test.

Designers as Evaluators Usually system designers are not the best evaluators Potential for helping users too much, or explaining away usability problems Evaluator should be trained in the evaluation method, and also be an expert in the system being tested Can be a team of a designer and an evaluator, who handles user relations

Designing Test Tasks Should be representative of real use cases Small enough to be completed in finite time, but not so small that they are trivial Should be given to the user in writing, to ensure consistency and a ready reference (Don’t explain how to do it though!) Provide tasks one at a time to avoid intimidating the user Relate the tasks to some kind of overall scenario for continuity

Example Task Description Motivating Scenario: “You are using a mobile phone for accessing and editing contact information.” Tasks: 1. Find the contacts list in the phone. 2. View the contact information for John Smith. 3. Change John Smith’s number to end in a “6”. 4. … Adapted from Jake Wobbrock

Stages of a Usability Test Preparation Introduction Observation Debriefing

Preparation Choose a location that is quiet, interruption-free, and has all the equipment that you need Print out task descriptions, instructions, test materials and/or questionnaires Install the software, and make sure it is in the “start” position for the test Make sure everything is ready before the user shows up

Introduction Explain the purpose of the test Ask the user to fill out the Informed Consent form, and any pre-test surveys (including demographics) Assure the user that their results will be kept confidential, and that they can stop at any time Introduce the test procedure and provide written instructions for the first task Ask the user if they have any questions

Conducting the Test Assign one person as the primary experimenter, who provides instructions and communicates with the user Experimenter should avoid helping the user too much, while maintaining a positive attitude No help can be given when performance is being measured Make sure to take notes and collect data!

Debriefing Administer subjective satisfaction questionnaires, often using a Likert scale –Rate your response to this statement on a scale of 1-5, where 1 means you disagree completely and 5 means you agree completely: “I really liked this user interface!” Ask the user for any comments or clarification about interesting episodes Answer any remaining user questions Disclose any deception used in the test Label data and write up your observations
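As a minimal sketch of how such ratings might be summarized afterwards (the statements and ratings below are hypothetical, not from the slides): the median is often the safer summary for ordinal Likert data, with the mean shown alongside for reference.

```python
# Minimal sketch (hypothetical statements and ratings): summarizing 1-5 Likert
# responses collected at debriefing.
from statistics import mean, median

responses = {
    "I really liked this user interface!": [4, 5, 3, 4, 2, 5, 4],
    "I could complete the tasks without help.": [3, 4, 4, 2, 3, 5, 4],
}

for statement, ratings in responses.items():
    print(statement)
    print(f"  n={len(ratings)}  median={median(ratings)}  mean={mean(ratings):.2f}")
```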

Adapted from Marti Hearst

Thinking Aloud

Formative vs. Summative Evaluation Formative evaluation - Discover usability problems as part of an iterative design process. Goal is to uncover as many problems as possible. Summative evaluation - Assess the usability of a prototype, or compare alternatives. Goal is a reliable, statistically valid comparison.

Thinking Aloud “Having a test subject use the system while continuously thinking aloud” Most useful for formative evaluation Understand how users view the system by externalizing their thought process Generates a lot of qualitative data from a relatively small number of users Focus on what the user is concretely doing and saying, as opposed to their own theories and advice

Getting Users to Open Up Thinking aloud can be unnatural Requires prompting by the experimenter to ensure that the user continues to externalize their thought process May slow them down and affect performance

Example Prompts “Please keep talking.” “Tell me what you are thinking.” “Tell me what you are trying to do.” “Are you looking for something? What?” “What did you expect to happen just now?” “What do you mean by that?” Adapted from Jake Wobbrock

Points to Remember Do not make value judgments User: “This is really confusing here.” Tester: “Yeah, you’re right. It is.” (BAD) Tester: “Okay, I’ll make a note of that.” (GOOD) Video or audio record (with user’s permission), and/or take good notes Screen captures can also be useful When the user is thinking hard, don’t disturb them with a prompt - wait! Adapted from Jake Wobbrock

Think Aloud Variants Co-Discovery: Two users work together –Can spur more conversation –Needs 2x more users Retrospective: Think aloud after the fact, while reviewing a video recording –Doesn’t disturb the user during the task –User may forget some thoughts, reactions Coaching: Expert coach guides the user by answering their questions –Identify training, help and documentation needs

Thinking Aloud Example

Think Aloud Example Choose a partner - one of you will start as the user, and the other will start as the experimenter Experimenter should write down 2-3 tasks to be completed by the user using a mobile phone or laptop (or some other device you have handy) Introduce the task to the user, and ask them to complete it while thinking aloud Experimenter should be taking notes about the user’s breakdowns, workarounds and overall success / failure Remember to keep prompting! After you are done, switch roles! Adapted from Jake Wobbrock

Example Prompts “Please keep talking.” “Tell me what you are thinking.” “Tell me what you are trying to do.” “Are you looking for something? What?” “What did you expect to happen just now?” “What do you mean by that?” Adapted from Jake Wobbrock

Performance Measurement

Implies testing a user interface to obtain statistics about performance Most useful for summative evaluation Can be done to either: –Compare variants or alternatives –Decide whether an interface meets pre-specified performance requirements

Experiment Design Independent variables (Attributes) - the factors that you want to study Dependent variables (Measurements) - the outcomes that you want to measure Levels - the values that each independent variable can take Replication - How often you repeat the measurement, in how many conditions, with how many users, etc. Adapted from Marti Hearst
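To make these terms concrete, here is a minimal sketch (hypothetical factor names and sizes) that writes a design down as data and counts the conditions and trials it implies.

```python
# Minimal sketch (hypothetical names): an experiment design described as data.
design = {
    "independent_variables": {
        "interface": ["A", "B"],                   # levels of each factor
        "task": ["find_contact", "edit_contact"],
    },
    "dependent_variables": ["task_time_sec", "error_count"],
    "replications_per_condition": 3,
    "participants": 12,
}

n_conditions = 1
for levels in design["independent_variables"].values():
    n_conditions *= len(levels)

print(f"{n_conditions} conditions "
      f"x {design['replications_per_condition']} replications "
      f"x {design['participants']} participants")
```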

Performance Metrics Time to complete the task Number of tasks completed Number of errors Number of commands / features used Number of commands / features not used Frequency of accessing help Frequency of help being useful Number of positive user comments Number of negative user comments Proportion of users preferring this system etc…
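Most of these metrics can be computed mechanically from a session log. The sketch below assumes a simple, hypothetical log of timestamped events and derives task time, error count, and help accesses from it.

```python
# Minimal sketch (hypothetical log format): deriving performance metrics from
# timestamped events recorded during one task of a test session.
events = [
    # (seconds_from_start, event_type)
    (0.0, "task_start"),
    (12.4, "error"),
    (20.1, "help_opened"),
    (41.8, "task_complete"),
]

start = next(t for t, e in events if e == "task_start")
end = next(t for t, e in events if e == "task_complete")

metrics = {
    "time_to_complete_sec": end - start,
    "error_count": sum(1 for _, e in events if e == "error"),
    "help_accesses": sum(1 for _, e in events if e == "help_opened"),
}
print(metrics)
```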

Reliability Reliability of results can be impacted by variation amongst users –Include more users –Use standard statistical methods to estimate variance and significance Confidence intervals are used for studies of one system Student’s t-test is used for comparing the difference between two systems
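As a sketch of what these two calculations look like in practice (made-up task times in seconds; NumPy and SciPy assumed available), the snippet below computes a 95% confidence interval for one system and runs Student's t-test to compare two systems.

```python
# Minimal sketch with made-up task times (seconds).
import numpy as np
from scipy import stats

times_a = np.array([41.8, 55.2, 38.9, 60.3, 47.1, 52.6])  # system A
times_b = np.array([33.0, 45.7, 30.2, 49.8, 38.4, 41.1])  # system B

# 95% confidence interval for the mean task time on one system.
mean_a = times_a.mean()
sem_a = stats.sem(times_a)
ci_low, ci_high = stats.t.interval(0.95, len(times_a) - 1, loc=mean_a, scale=sem_a)
print(f"System A mean: {mean_a:.1f}s, 95% CI [{ci_low:.1f}, {ci_high:.1f}]")

# Student's t-test comparing the two systems (independent samples, i.e.
# between-subjects; a within-subjects comparison would use stats.ttest_rel
# on paired measurements instead).
t_stat, p_value = stats.ttest_ind(times_a, times_b)
print(f"t = {t_stat:.2f}, p = {p_value:.3f}")
```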

Validity Validity can be impacted by setting up the wrong experiment –Wrong users –Wrong tasks –Wrong setting –Wrong measurements –Confounding effects Take care in your experimental design about what you are testing, with whom, and where

Between vs. Within Subjects When comparing two interfaces Between-Subjects: Distinct user groups use each variation –Need a large number of users to avoid bias in one sample vs. the other –Random vs. matched assignment Within-Subjects: Same users use both variations –Can lead to learning effects –Solution is to counterbalance the study - each group uses a different interface first
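A minimal sketch of counterbalancing (hypothetical participant IDs and interface names): participants are randomly split across the two presentation orders, so each interface is seen first equally often.

```python
# Minimal sketch: counterbalancing interface order for a within-subjects test.
import random

participants = [f"P{i}" for i in range(1, 9)]
random.shuffle(participants)  # randomize who ends up in which ordering group

orders = {}
for i, p in enumerate(participants):
    # Alternate which interface is seen first, so each ordering is used equally often.
    orders[p] = ["A", "B"] if i % 2 == 0 else ["B", "A"]

for p, order in sorted(orders.items()):
    print(p, "->", " then ".join(order))
```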

Experiment Design Varying one attribute (ex. color) is simple - consider each alternative for that attribute separately Varying several attributes (ex. color and icon shape) can be more challenging: –Interaction between attributes –Blowup in the number of conditions
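The blowup is easy to see by enumerating the cross product of the attributes. The sketch below uses hypothetical attributes and levels; two attributes with two levels plus one with three already give 12 conditions.

```python
# Minimal sketch (hypothetical attributes and levels): every combination of
# attribute levels is a separate condition, so conditions multiply quickly.
from itertools import product

attributes = {
    "color": ["blue", "green"],
    "icon_shape": ["round", "square"],
    "font_size": ["small", "medium", "large"],
}

conditions = list(product(*attributes.values()))
print(f"{len(conditions)} conditions")  # 2 x 2 x 3 = 12
for condition in conditions[:3]:        # first few, just to show the shape
    print(dict(zip(attributes.keys(), condition)))
```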

Adapted from Marti Hearst

A and B do not interact:
        A1   A2
  B1     3    5
  B2     6    8

A and B may interact:
        A1   A2
  B1     3    5
  B2     6   12

(The accompanying plots show the cell means for A1 and A2 at each level of B.)
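To make the idea of an interaction concrete, the sketch below takes the cell values from the tables above and computes the difference of differences: the effect of A at B2 minus the effect of A at B1. Zero means the effects are additive; a non-zero value means A's effect depends on B.

```python
# Minimal sketch: checking for an interaction in the 2x2 tables above by
# computing the difference of differences.
no_interaction = {"B1": {"A1": 3, "A2": 5}, "B2": {"A1": 6, "A2": 8}}
may_interact = {"B1": {"A1": 3, "A2": 5}, "B2": {"A1": 6, "A2": 12}}

def interaction(table):
    effect_at_b1 = table["B1"]["A2"] - table["B1"]["A1"]
    effect_at_b2 = table["B2"]["A2"] - table["B2"]["A1"]
    return effect_at_b2 - effect_at_b1

print(interaction(no_interaction))  # 0 -> A's effect is the same at both levels of B
print(interaction(may_interact))    # 4 -> A's effect depends on the level of B
```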

Dealing with Multiple Attributes Conduct pilot tests to understand which really impact performance Take the remaining attributes, and organize them in a Latin square – addressing ordering and making sure all variations are tested Note: each user may only see a subset of the variations, and only some orderings may be considered

Adapted from Marti Hearst (Example Latin square: the four conditions G, G+, A, A+ are rotated across trials T1-T4 so that each condition appears in each trial position for some group.)
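One simple way to build such an ordering is a cyclic Latin square, sketched below for the four conditions in the example; each condition appears in each ordinal position exactly once across the groups. (A balanced Latin square would additionally control which condition follows which, which this cyclic construction does not.)

```python
# Minimal sketch: a cyclic Latin square of condition orderings for the four
# conditions in the example above.
conditions = ["G", "G+", "A", "A+"]

def latin_square(items):
    n = len(items)
    return [[items[(row + col) % n] for col in range(n)] for row in range(n)]

for group, order in enumerate(latin_square(conditions), start=1):
    print(f"Group {group}: {' -> '.join(order)}")
```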

Concerns with Users People get tired! People get bored! People can get frustrated! People can get distracted! People learn how to do things! All of these can be exacerbated in a Within-Subjects test

Adapted from Jake Wobbrock Example Usability Lab

For Next Time Start working on Assignment 2! –Any questions? Readings about Graphic Design

Show & Tell