The art and science of measuring people l Reliability l Validity l Operationalizing.

Slides:



Advertisements
Similar presentations
Copyright © Allyn & Bacon (2007) Single-Variable, Independent-Groups Designs Graziano and Raulin Research Methods: Chapter 10 This multimedia product and.
Advertisements

StatisticalDesign&ModelsValidation. Introduction.
Learning Objectives Copyright © 2004 John Wiley & Sons, Inc. Data Processing, Fundamental Data Analysis, and Statistical Testing of Differences CHAPTER.
MSS 905 Methods of Missiological Research
The art and science of measuring people l Reliability l Validity l Operationalizing.
The art and science of measuring people l Reliability l Validity l Operationalizing.
Chapter 14: Usability testing and field studies. Usability Testing Emphasizes the property of being usable Key Components –User Pre-Test –User Test –User.
Beginning the Research Design
Analysis of Variance. Experimental Design u Investigator controls one or more independent variables –Called treatment variables or factors –Contain two.
Experimental Methods What is an experiment? Manipulation of one or more variables by the experimenter to determine the effect of this manipulation on another.
Causal Comparative Research: Purpose
An evaluation framework
Company LOGO B2C E-commerce Web Site Quality: an Empirical Examination (Cao, et al) Article overview presented by: Karen Bray Emilie Martin Trung (John)
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE © 2012 The McGraw-Hill Companies, Inc.
From Controlled to Natural Settings
Copyright 2005, Prentice Hall, Sarafino
Chapter 14 Inferential Data Analysis
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Experimental Design The Gold Standard?.
Chapter 14: Usability testing and field studies
Chapter 4 Hypothesis Testing, Power, and Control: A Review of the Basics.
Conducting a User Study Human-Computer Interaction.
Final Study Guide Research Design. Experimental Research.
Chapter 8 Experimental Design: Dependent Groups and Mixed Groups Designs.
Which Test Do I Use? Statistics for Two Group Experiments The Chi Square Test The t Test Analyzing Multiple Groups and Factorial Experiments Analysis of.
Education 793 Class Notes Welcome! 3 September 2003.
COMM 250 Agenda - Week 12 Housekeeping RP2 Due Wed. RAT 5 – Wed. (FBK 12, 13) Lecture Experiments Descriptive and Inferential Statistics.
Evaluating a Research Report
User Study Evaluation Human-Computer Interaction.
Today: Our process Assignment 3 Q&A Concept of Control Reading: Framework for Hybrid Experiments Sampling If time, get a start on True Experiments: Single-Factor.
HOW TO WRITE RESEARCH PROPOSAL BY DR. NIK MAHERAN NIK MUHAMMAD.
 Descriptive Methods ◦ Observation ◦ Survey Research  Experimental Methods ◦ Independent Groups Designs ◦ Repeated Measures Designs ◦ Complex Designs.
SURVEY RESEARCH.  Purposes and general principles Survey research as a general approach for collecting descriptive data Surveys as data collection methods.
CHAPTER 12 Descriptive, Program Evaluation, and Advanced Methods.
Learning Objective Chapter 9 The Concept of Measurement and Attitude Scales Copyright © 2000 South-Western College Publishing Co. CHAPTER nine The Concept.
Copyright  2003 by Dr. Gallimore, Wright State University Department of Biomedical, Industrial Engineering & Human Factors Engineering Human Factors Research.
Chapter Twelve Copyright © 2006 John Wiley & Sons, Inc. Data Processing, Fundamental Data Analysis, and Statistical Testing of Differences.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
AMSc Research Methods Research approach IV: Experimental [1] Jane Reid
Academic Research Academic Research Dr Kishor Bhanushali M
 Descriptive Methods ◦ Observation ◦ Survey Research  Experimental Methods ◦ Independent Groups Designs ◦ Repeated Measures Designs ◦ Complex Designs.
©2010 John Wiley and Sons Chapter 2 Research Methods in Human-Computer Interaction Chapter 2- Experimental Research.
Three Broad Purposes of Quantitative Research 1. Description 2. Theory Testing 3. Theory Generation.
Chapter 13 Repeated-Measures and Two-Factor Analysis of Variance
SOCW 671: #6 Research Designs Review for 1 st Quiz.
Chapter 10 Copyright © Allyn & Bacon 2008 This multimedia product and its contents are protected under copyright law. The following are prohibited by law:
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
1 UNIT 13: DATA ANALYSIS. 2 A. Editing, Coding and Computer Entry Editing in field i.e after completion of each interview/questionnaire. Editing again.
RESEARCH METHODS IN INDUSTRIAL PSYCHOLOGY & ORGANIZATION Pertemuan Matakuliah: D Sosiologi dan Psikologi Industri Tahun: Sep-2009.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Lesson 3 Measurement and Scaling. Case: “What is performance?” brandesign.co.za.
PS Research Methods I with Kimberly Maring Unit 9 – Experimental Research Chapter 6 of our text: Zechmeister, J. S., Zechmeister, E. B., & Shaughnessy,
Some Terminology experiment vs. correlational study IV vs. DV descriptive vs. inferential statistics sample vs. population statistic vs. parameter H 0.
Evaluation through user participation
Instructor’s manual Mass Media Research: An Introduction, 7th Edition
Chapter 4: Studying Behavior
Understanding and conducting
Understanding Results
Chapter Eight: Quantitative Methods
2 independent Groups Graziano & Raulin (1997).
Multidimensional Scaling
Evaluation techniques
The MR process.
Multidimensional Scaling
Inferential Statistics
15.1 The Role of Statistics in the Research Process
HCI Evaluation Techniques
Understanding Statistical Inferences
Presentation transcript:

The art and science of measuring people l Reliability l Validity l Operationalizing

Overview of design and analysis Posing a question l Conceptualizing the question l Operationalizing the related concepts l Identifying Independent, Dependent, & Controlled Variables l Developing the Hypothesis

Choosing the testing method What method is appropriate for the current situation? (experiment, observation, surveys etc.) >> choice of method as a trade off between control realism l Experimental, Quasi-Experimental and Non- Experimental Methods.

Collecting data The art of finding and recruiting participants l A practical view of randomization: Randomization and Pseudo Randomization l Random Selection and Random Assignment. l Practical issues about sample size and statistical power.

Analyzing the data: Basic Statistics Levels of measurement: nominal, ordinal, interval, and ratio l Mean, median, standard deviation l Testing mean differences l Significance levels and what they mean

Analysis of experimental designs: Single Factor Experiments Statistical Hypothesis Testing l Estimates of Experimental Error l Estimates of Treatment Effects l Evaluation of the Null Hypothesis l Various ANOVA models

Multi Factor Experiments Advantages of the factorial design l Interaction Effects l The power of within-subjects designs (reduction of variance) l The two factorial experiment l Higher Order Factorial Designs

Analysis of Non-Experimental Studies Statistical methods for analyzing correlational data l Correlations, Scatter Plots, Partial Corrs l Multiple Regression l Introduction to Factor Analysis, Cluster Analysis and Multidimensional Scaling

Surveys and Questionnaires The design of surveys and questionnaires l How to frame questions l Kinds of scales: Likert, Semantic Differential etc. l Analyzing survey data: which items are useful, Item Response Theory Forming a scale to measure an attribute, e.g., satisfaction. Reliability, validity of scale

Measuring Individual Differences How to test for individual differences within users l Kinds of individual differences variables: -demographic: such a age, gender etc. -situational: motivation, interest, fatigue -cognitive: memory, cognitive style etc., -personality: internal/external locus of control l How to analyze existing data to identify individual differences, and how to design studies to test for individual differences?

Frenzied Shopping: Obstacles to purchase, and the perception of download times - A study on ecommerce conducted by Jared Spool A critical analysis and Illustration of alternative methods of examining this question

Frenzied shopping Create a realistic scenario: in present situation, get person motivated Counted obstacles to purchase –Advantages of measure: »concrete: people agree about measure »valid: good measure of actual ecommerce experience –Disadvantages of measure: »not reliable: since situation is not structured »data analysis problems

Results Found more than 200 obstacles to purchase The more the no of users, the greater the no of problems What’s wrong with each test discovering hundreds of problems? –Client has limited resources, need to focus on solving important (most common / most catastrophic problems)

More results:Perception of download times How long will users wait for pages to download? - Should web developers waste their time in making pages faster. Method: Users were asked to rate the perceived speed of pages after they had completed task. Ave. SpeedRated Amazon.com30 secFastest About.com8 secSlowest

So what do download times relate to? Only correlated with success or failure of shopping. –(Amazon.com judged to be slower than About.com even though About.com was much faster) Result is foregone conclusion given the task. Problems with method: –Memory issues: Users asked for ratings at the end of their experience with all the sites. Retrospective memory problems. –Ask someone waiting for a page to download if it is taking too long!

Timeline Issues Rated speed no longer reflects the browsing, searching part of the experience. Cannot infer that download speeds are not important, can only infer that perception of download speeds can be influenced by other aspects of site

Survey: Are people bothered by long download times? Sample Question: – How often do you leave a site without waiting for the first page to download? »0-5% of times »5-10% of times »10 % and higher –In your opinion, how important are the below web site characteristics. Rate their relative importance. »Download speed »site content »site interactivity Possibilities: Task based surveys Low on control, High on Realism, Difficult to make causal inferences, Sampling Issues

Observation: Is user’s site usage affected by site speed? Method: Choose a few sites which are similar in their general purpose (e.g., shopping sites). Ask subjects to browse / complete a task on all sites. Measurement: Watch participants for signs of frustration or satisfaction with speed of site. Low on control, High on Realism, Difficult to make causal inferences

Experiment: Relationship of perceived download times to actual download times. Method: Create versions of a site with different download speeds. Ask users to complete tasks on the sites (within subjects or between subjects). Make some tasks interesting and some boring. Give users less than enough time to complete the task. Measurement: How many interesting and how many boring tasks did users choose to complete. Relate that to download speed of site. High on control, Low on on Realism, Easier to make causal inferences

Server Logs: Do people leave sites while waiting for slow pages to download? Method: Find similar sites with different speeds. Analyze the server logs for the sites. Measurement: Analyze the logs to find the % of people who do not wait for a page to download. Low on control, High on Realism, Difficult to make causal inferences

Perception of download speed and all the ways to study it...

The state of the art What usability methods are currently prevalent and accepted in the field CUE 2

Comparative Usability Evaluation(CUE) 2 Molich et al., 1999 Purpose: Too much emphasis on one-way mirrors and scan converters Little knowledge of REAL usability testing procedures ”Who checks the checker?” Method: Nine teams tested the usability of a web site Seven professional teams Two student teams Four European, five US teams Test web-site:

Problems found in Comparative Usability Evaluation

Problem Found by Seven Teams During the registration process Hotmail users are asked to provide a password hint question. The corresponding text box must be filled. Most users did not understand the meaning of the password hint question. Some entered their Hotmail password in the Hint Question text box.

Characteristics of the tests

Problems by teams

What factors predict no of problems & no of common (non-exclusive) problems?

Inferences from CUE study Much disagreement about methods of usability testing How to test? Who should test? What methods to use? How many testers to have? How many users to have?