Designing and Developing Useful Language Tests Lyle F. Bachman Department of Applied Linguistics & TESL University of California, Los Angeles.

Slides:



Advertisements
Similar presentations
National Accessible Reading Assessment Projects Defining Reading Proficiency for Accessible Large Scale Assessments Principles and Issues Paper American.
Advertisements

Cross Cultural Research
Updated 11/16/06©1996 & forthcoming, Bachman & Palmer & OUPPage 1 The Place of Intended Impact in Assessment Use Arguments * Lyle F. Bachman Department.
Developing Language Assessments and Justifying their Use
Chapter 1 An Introduction to Assurance and Financial Statement Auditing McGraw-Hill/Irwin Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights.
Organization Change and Development
Professional Performance Process OPD 863 – Using the Rubric Our Journey Continues! 21 st Century Supervision and Evaluation.
An Introduction to Assurance and Financial Statement Auditing
CISB444 - Strategic Information Systems Planning
Language Testing Introduction. Aims of the Course The primary purpose of this course is to enable students to become competent in the design, development,
Testing 09. Problems and Directions Past experience with language help us come to the conclusion that we should keep a balance between linguistic and.
Concept of Reliability and Validity. Learning Objectives  Discuss the fundamentals of measurement  Understand the relationship between Reliability and.
Item Response Theory. Shortcomings of Classical True Score Model Sample dependence Limitation to the specific test situation. Dependence on the parallel.
Issues in Assessing Language Proficiency and Academic Achievement Lyle F. Bachman Department of Applied Linguistics & TESL University of California, Los.
Uses of Language Tests.
Auditing A Risk-Based Approach To Conducting A Quality Audit
Validity of Selection. Objectives Define Validity Relation between Reliability and Validity Types of Validity Strategies.
Classroom Assessment A Practical Guide for Educators by Craig A
Performance Management Upul Abeyrathne, Dept. of Economics, University of Ruhuna, Matara.
WEEK 1 – TOPIC 1 OVERVIEW OF ASSESSMENT: CONTEXT, ISSUES AND TRENDS 1.
Foundations of Leadership Studies
AUDIT PROCEDURES. Commonly used Audit Procedures Analytical Procedures Analytical Procedures Basic Audit Approaches - Basic Audit Approaches - System.
LANGUAGE TESTING AND ASSESSMENT ASSESSMENT LITERACY Prepared by Marina Gvozdeva, Natalya Milyavskaya, Tatiana Sadovskaya, Violetta Yurkevich Based on material.
Talent Management Training Methods.
Principles of Language Assessment Ratnawati Graduate Program University State of Semarang.
The 6 Principles of Second language learning (DEECD,2000) Beliefs and Understandings Assessment Principle Responsibility Principle Immersion Principle.
Communicative Language Teaching
Bank of Performance Assessment Tasks in English
HSA 171 CAR. 1436/ 7/4  The results of activities of an organization or investment over a given period of time.  Organizational Performance: ◦ A measure.
Descriptive and Causal Research Designs
What is a Business Analyst? A Business Analyst is someone who works as a liaison among stakeholders in order to elicit, analyze, communicate and validate.
1-1 System Development Process System development process – a set of activities, methods, best practices, deliverables, and automated tools that stakeholders.
Contingency Leadership Theories
USEFULNESS IN ASSESSMENT Prepared by Vera Novikova and Tatyana Shkuratova.
CHAPTER 1 Understanding RESEARCH
Measuring and reporting outcomes for BTOP grants: the UW iSchool approach Samantha Becker Research Project Manager U.S. IMPACT Study 1UW iSchool evaluation.
LEARNING DOMAINS & OBJECTIVES Southern Illinois University Carbondale Instructor Workshop.
Lesson One Introduction: Teaching and Testing/Assessment.
Validity Validity is an overall evaluation that supports the intended interpretations, use, in consequences of the obtained scores. (McMillan 17)
Validity and Item Analysis Chapter 4.  Concerns what instrument measures and how well it does so  Not something instrument “has” or “does not have”
1 TESL Evaluating CALL Packages:Curriculum/Pedagogical/Lingui stics Dr. Henry Tao GUO Office: B 418.
Chapter 6 - Standardized Measurement and Assessment
Professional Ethics and Responsibilities
Job Analysis. The process of collecting and organizing information about jobs performed in the organization and the principle elements involved in performing.
An Overview of Management
June 7, 2016Action Research and Seminar1 Action Research l The task is NOT finished when the project ends l Participants continue to review, evaluate and.
Foundations of Leadership Studies.  Trait theories  Behavioral theories  Situational/contingency theories  Fiedler's Contingency Model  Path-Goal.
HIGH IMPACT ON-THE-JOB LEARNING ACTIVITY MAP
AUDIT STAFF TRAINING WORKSHOP 13 TH – 14 TH NOVEMBER 2014, HILTON HOTEL NAIROBI AUDIT PLANNING 1.
 Planning an audit of cost statements, records and other related documents is considered necessary to ensure achievement of audit objectives with available.
Lecture 14. Technology Center Assessment. QUESTIONS: Assessment- Center- as technology assessment, development and certification of personnel. Model of.
Chapter 1 Market-Oriented Perspectives Underlie Successful Corporate, Business, and Marketing Strategies.
7 Training Employees What Do I Need to Know?
An Introduction to Assurance and Financial Statement Auditing
BILC Seminar, Budapest, October 2016
Software Quality Control and Quality Assurance: Introduction
Principles of Language Assessment
Designing Rubrics with the Three Categories of Knowledge
VALIDITY by Barli Tambunan/
Oleh: Beni Setiawan, Wahyu Budi Sabtiawan
DSS & Warehousing Systems
Implementing Strategy in Companies That Compete in a Single Industry
Communicative Language Teaching
After completing this chapter, you will become more aware of:
Gestalt Theory.
RESEARCH BASICS What is research?.
BASIC PRINCIPLES OF ASSESSMENT
Presentation transcript:

Designing and Developing Useful Language Tests Lyle F. Bachman Department of Applied Linguistics & TESL University of California, Los Angeles

Uses of Language Tests F to make inferences about individuals’ language ability F to make predictions about individuals’ ability to use language in contexts outside the test itself F to make decisions about individuals

Correspondences between Language Test Performance and Language Use

Language Test PerformanceLanguageUse Characteristics of the Language Use Task Characteristics of the Language Test Task Characteristics of the Language User Characteristics of the Test Taker

Language Test PerformanceLanguageUse Characteristics of the Language Use Task Characteristics of the Language Test Task Language Ability of the Language User Language Ability of the Test Taker

Characteristics of the Language Use Task Language Test PerformanceLanguageUse Language Ability of the Test Taker Language Ability of the Language User Characteristics of the Language Test Task

A Framework of Task Characteristics Language Use Task an activity that involves individuals in using language for the purpose of achieving a particular goal in a particular situation (Bachman & Palmer, 1996, p. 44)

A Framework of Task Characteristics Target Language Use (TLU) Domain a set of specific language use tasks that the test taker is likely to encounter outside of the test itself, and to which we want our inferences about language ability to extrapolate (Bachman & Palmer, 1996, p. 44)

A Framework of Task Characteristics Task Characteristics  Setting: the physical circumstances under which either language use or language testing takes place. Includes the physical characteristics of the setting, the participants involved, and the time of the task.

A Framework of Task Characteristics Task Characteristics  Rubric: the context for the task; includes those characteristics that provide the structure for the task, and constrain how language users or test takers are expected to respond to these tasks

A Framework of Task Characteristics Task Characteristics  Input: material contained in the task, which test takers need to process in some way, and to which they are expected to respond.

A Framework of Task Characteristics Task Characteristics  Expected response: the language use that is expected, given the way in which the rubric, or context, for the task is configured, and the particular input that is provided

A Framework of Task Characteristics Task Characteristics  Relationship between input and response: Reactivity: the degree of reciprocity, or interaction involved,Reactivity: the degree of reciprocity, or interaction involved, Scope: the amount and range of input that needs to be processed in order to respond, andScope: the amount and range of input that needs to be processed in order to respond, and Directness: the extent to which the response can be made by using information in the input by itself, or whether the language user or test taker must also rely on information in the context or in his or her own real world knowledge.Directness: the extent to which the response can be made by using information in the input by itself, or whether the language user or test taker must also rely on information in the context or in his or her own real world knowledge.

Describing Target Language Use Tasks 1.Identify the TLU tasks in the relevant domain. 2.Select TLU tasks as a basis for designing test tasks. 3.Describe these tasks in terms of their task characteristics.

Describing Target Language Use Tasks 1.Identify the TLU tasks in the relevant domain. 2.Select TLU tasks as a basis for designing test tasks. 3.Describe TLU tasks in terms of their task characteristics.

TLU Domain PR Department of a Phone Company Task 2 phone discussions with clients, co-workers Task 1 writing promotional material Task 3 responding to customer complaints on the phone Task 4 responding to customercomplaints in writing Task 5 giving complaint information to co-workers by phone Task 6 giving complaint information to co-workers in writing Task Characteristics Framework TLU Task Type A (letter) TLU Task Type B (memo)

Developing Test Tasks F Modify TLU task types for use as test tasks F Create original test tasks

TLU Task Type A (letter) Modify TLU Task Type to Develop Test Task Specifications Test Task Specifications A (write formal letter to customer) TLU Task Type B (memo) Test Task Specifications B (write memo to co-worker)

Developing Test Tasks F Modify TLU task types for use as test tasks F Create original test tasks

Distinctive Characteristics of TLU Task Type Create Specifications for Original Test Tasks Test Task Specifications

Example Test Development Project Project setting: telephone company needs to hire new employees

Describing Target Language Use Tasks 1.Identify the TLU tasks in the relevant domain. 2.Select TLU tasks as a basis for designing test tasks. 3.Describe these tasks in terms of their task characteristics.

Example TLU Domain PR Department of a Phone Company Task 1 writing promotional material Task 2 phone discussions with clients, co-workers Task 3 responding to customer complaints on the phone Task 4 responding to customercomplaints in writing Task 5 giving complaint information to co-workers by phone Task 6 giving complaint information to co-workers in writing

Selecting TLU Tasks for Use in Designing Test Tasks F Some TLU tasks may be essential to performing the job

Example TLU Domain PR Department of a Phone Company Task 4 responding to customercomplaints in writing Task 2 phone discussions with clients, co-workers Task 6 giving complaint information to co-workers in writing Task 1 writing promotional material Task 3 responding to customer complaints on the phone Task 5 giving complaint information to co-workers by phone

Selecting TLU Tasks for Use in Designing Test Tasks F Some TLU tasks may not require the areas of language ability we want to measure. F Some TLU tasks may not be appropriate for all test takers.

Describing Target Language Use Tasks 1.Identify the TLU tasks in the relevant domain. 2.Select TLU tasks as a basis for designing test tasks. 3.Describe these tasks in terms of their task characteristics.

TLU Domain PR Department of a Phone Company Task 2 phone discussions with clients, co-workers Task 1 writing promotional material Task 3 responding to customer complaints on the phone Task 4 responding to customercomplaints in writing Task 5 giving complaint information to co-workers by phone Task 6 giving complaint information to co-workers in writing Task Characteristics Framework TLU Task Type A (letter) TLU Task Type B (memo)

Developing Test Tasks F Modify TLU task types for use as test tasks. F Create original test tasks.

Developing Test Tasks F Modify TLU task types for use as test tasks F Create original test tasks

TLU Task Type A (letter) Modify TLU Task Type to Develop Test Task Specifications Test Task Specifications A (write formal letter to customer) TLU Task Type B (memo) Test Task Specifications B (write memo to co-worker)

Developing Test Tasks F Modify TLU task types for use as test tasks F Create original test tasks

Distinctive Characteristics of TLU Task Type Create Specifications for Original Test Tasks Test Task Specifications

Conclusion Bachman-Palmer Approach to Language Test Development FProvides test developers with a theoretically grounded and principled basis for developing and using language tests.

Conclusion Bachman-Palmer Approach to Language Test Development FProvides test developers with an understanding that will enable them to make their own decisions about either selecting or developing a language test that will be useful for their particular situation.

Conclusion Bachman-Palmer Approach to Language Test Development FRequires test developers to consider the fundamental correspondence between language test performance and non-test language use.

TLU Domain Task Task Task Task Task Task

Using Needs Analysis to Identify TLU Tasks 1.Identify stakeholders. 2.Develop procedures for gathering information about tasks. 3.Gather information about the domain and tasks. 4.Analyze tasks in terms of their task characteristics. 5.Make an initial grouping of tasks into categories.

Test Usefulness F Principle 1: Overall test usefulness, rather than individual qualities is to be maximized F Principle 2: Individual qualities cannot be evaluated independently, but most evaluated in terms of their combined effect on overall usefulness

Test Usefulness F Principle 3: Test usefulness and the appropriate balance among the different qualities can only be determined for a given specific testing situation.

Qualities of Usefulness F Reliability F Construct Validity F Authenticity F Interactiveness F Impact F Practicality

Describing Target Language Use Tasks 1.Identify the TLU tasks in the relevant domain. 2.Select TLU tasks as a basis for designing test tasks. 3.Describe these tasks in terms of their task characteristics.