Aligning Program Goals, Instructional Practices, and Outcomes Assessment Dr. Ray T. Clifford BILC Conference, Budapest 29 May 2006.

Slides:



Advertisements
Similar presentations
Conversation Skills will be tested both as part of Formative & Summative Assessment.
Advertisements

Bringing it all together!
Spiros Papageorgiou University of Michigan
A Tale of Two Tests STANAG and CEFR Comparing the Results of side-by-side testing of reading proficiency BILC Conference May 2010 Istanbul, Turkey Dr.
Assessing Student Learning: Using the standards, progression points and assessment maps Workshop 1: An overview FS1 Student Learning.
Mapping our language programmes Vicky Wright Centre for Language Study
Common Core State Standards (CCSS) Nevada Joint Union High School District Nevada Union High School September 23, 2013 Louise Johnson, Ed.D. Superintendent.
Assessment in the Middle Years Programme. How are students assessed? The MYP offers a criterion-related model of assessment. This means that students'
Consistency of Assessment
Seminar /workshop on cognitive attainment ppt Dr Charles C. Chan 28 Sept 2001 Dr Charles C. Chan 28 Sept 2001 Assessing APSS Students Learning.
National Curriculum Key Stage 2
Curriculum Framework for Romani Seminar for decision makers and practitioners Council of Europe, 31 May and 1 June 2007 An introduction to the Curriculum.
Raili Hildén University of Helsinki Relating the Finnish School Scale to the CEFR.
ESL Phases & ESL Scale Curriculum Corporation 1994.
1 DEVELOPING ASSESSMENT TOOLS FOR ESL Liz Davidson & Nadia Casarotto CMM General Studies and Further Education.
ACADEMIC DIRECTION / TESTING OFFICE. Language proficiency scale 0 No practical proficiency 1 Elementary 2 Fair Limited working 3 Good Minimum professional.
Becoming a Teacher Ninth Edition
The BILC BAT: A Research and Development Success Story Ray T. Clifford BILC Professional Seminar Vienna, Austria 11 October.
Education office, Evaz district, autumn 1393 Presenter: Rahmanpour CEF (Common European Framework): The basis of the new course book development in Iran.
Lesson Planning. Teachers Need Lesson Plans So that they know that they are teaching the curriculum standards required by the county and state So that.
(2) Using age-appropriate activities, students expand their ability to perform novice tasks and develop their ability to perform the tasks of the intermediate.
Study Group 5 STANAG for Non-Specialists. Task Simplify the STANAG document for administrative purposes Outline salient aspects in non-technical.
Classroom Assessments Checklists, Rating Scales, and Rubrics
Ways for Improvement of Validity of Qualifications PHARE TVET RO2006/ Training and Advice for Further Development of the TVET.
1 Chapter 7 Models for Teaching: Direct Melinda Bauer and Shannyn Bourdon.
Creating Rubrics. Information taken from Formative Assessment and Standards-Based Grading Robert Marzano 2010.
Workshop: assessing writing Prepared by Olga Simonova, Maria Verbitskaya, Elena Solovova, Inna Chmykh Based on material by Anthony Green.
Language Implications of NATO’s Expanding Roles Dr. Ray T. Clifford BILC Conference, San Antonio 21 May 2007.
Developing a Teaching Portfolio for the Job Search Graduate Student Center University of Pennsylvania April 19, 2007 Kathryn K. McMahon Department of Romance.
Military Language Testing at the National Defence University and the Common European Framework BILC CONFERENCE BUDAPEST.
Selected Teaching-Learning Terms: Working Definitions...
FCE First Certificate in English. What is it ? FCE is for learners who have an upper- intermediate level of English, at Level B2 of the Common European.
DLIFLC 7-9 FEB 01 Diagnostic Assessment Thomas S. Parry Directorate of Continuing Education Defense Language Institute BILC Professional Seminar 2005 Sofia,
Assessment and Testing
Paraprofessionals and Language Proficiency Requirement Bilingual Paraprofessional Conference March 23, 2005 Hamline University
Anchor Standards ELA Standards marked with this symbol represent Kansas’s 15%
New Writing Expectations Require a New Approach: An Introduction to Ready ® Writing Grades 3-5 Adam Berkin Vice President, Product Development
Benchmark Advisory Test (BAT) Update BILC Conference Athens, Greece Dr. Ray Clifford and Dr. Martha Herzog June 2008.
INSTRUCTIONAL OBJECTIVES
The CEFR and the MFL classroom PDST seminar Maynooth University 7 Nov 2015
What are competencies?  Emphasize life skills and evaluate mastery of those skills according to actual leaner performance.  Competencies consist of.
GCSE English Language 8700 GCSE English Literature 8702 A two year course focused on the development of skills in reading, writing and speaking and listening.
USING ILR/STANAG LEVEL DESCRIPTORS AND TEXT TYPOLOGY AND PASSAGE RATING IN CLASSROOM TEACHING TOWARDS PROFICIENCY James Dirgin Director Proficiency Standards.
Designing a curriculum is a long and complicated process. In designing a curriculum, there are many important elements the designer must consider. Some.
1 Instructing the English Language Learner (ELL) in the Regular Classroom.
Workshop 2014 Cam Xuyen, October 14, 2014 Testing/ assessment/ evaluation BLOOM’S TAXONOMY.
REGISTRATION CODE: EET699
ACCET 2014 Presented by: Brenda Nazari-Robati The Language Company Lynore M. Carnuccio The Language Company.
To my presentation about:  IELTS, meaning and it’s band scores.  The tests of the IELTS  Listening test.  Listening common challenges.  Reading.
“To begin with the end in mind means to start with a clear understanding of your destination. It means to know where you’re going so that you better understand.
Common Core.  Find your group assignment.  As a group, read over the descriptors for mastery of this standard. (The writing standards apply to more.
BILC Conference Athens, Greece 22 – 26 June 2008 Ray T. Clifford
Classroom Assessments Checklists, Rating Scales, and Rubrics
REGISTRATION CODE: EET699
Learning Model for English 2-8 grades
STANAG for Non-Specialists
Teaching and Learning with Technology
Classroom Assessments Checklists, Rating Scales, and Rubrics
EL (English Language) Students and WIDA Standards
Kuwait National Curriculum
SPEAKING ASSESSMENT Joko Nurkamto UNS Solo 11/8/2018.
Reading Objectives: Close Reading Analyze visuals. RI.4.7
Reading Objectives: Close Reading
REGISTRATION CODE: EET699
COMPETENCIES & STANDARDS
SPEAKING ASSESSMENT Joko Nurkamto UNS Solo 12/3/2018.
Best Practices in STANAG 6001 Testing
Teaching, Learning, and Testing: Finding Congruence
REGISTRATION CODE: EET699
Presentation transcript:

Aligning Program Goals, Instructional Practices, and Outcomes Assessment Dr. Ray T. Clifford BILC Conference, Budapest 29 May 2006

What connects the instructional components that are included in this year’s conference theme? Instructional Practices Outcomes Assessment Program Goals

What connects the instructional components that are included in this year’s conference theme? Standards

BILC Standards-Based Projects The BILC-developed interpretation of STANAG 6001 approved as an official part of that STANAG. A BILC Working Group has prepared descriptors for optional plus levels. A survey was conducted on the desirability of a producing a STANAG 6001 BILC- sponsored, “benchmark” test with Advisory ratings.

Participation in the Survey 16 countries responded to the survey: Austria Bulgaria Canada Denmark EstoniaFinland Germany Hungary Italy Latvia Lithuania Poland Romania Spain Sweden Turkey

Survey Results 1.Would your country use a Benchmark Test if one were available? Definitely yes: 8 Probably yes: 5 Perhaps: 2 Most likely not: 0 Definitely not: 1

Survey Results 2.Does your country use “plus levels” when assigning STANAG ratings? Definitely yes: 3 Probably yes: 0 Perhaps: 1 Most likely not: 1 Definitely not:11

Survey Results 3.Would you like to have plus levels incorporated into a Benchmark Test? Definitely yes: 5 Probably yes: 5 Perhaps: 2 Most likely not: 2 Definitely not: 2

Summary A “benchmark” test would be welcomed by most countries. The scores should be advisory in nature. Providing “plus” level ratings would allow those ratings to be used or ignored. BILC should proceed with plans to: –Develop a benchmark STANAG test of reading comprehension. –Explore internet delivery options.

ACT will Assist with Funding ACT has approved funding to support the development of the BILC Advisory Test (Reading): –Part-time project coordinator. –Computer programming and server support. –Travel expenses for the next meeting of the Test Working Group. Work is underway. –Test specifications have been completed. –Texts and items are being reviewed.

A Comparison of Testing Standards STANAG 6001 The Common European Framework of Reference for Languages: Learning, teaching, assessment

Every Performance Standard has Three Essential Components Task A statement of what is to be done or accomplished. Conditions A description of the conditions under which (or context in which) the task is to be performed. For language this includes the topics to be addressed. Accuracy A definition of how well the task must be performed under the conditions stated.

5 LEVELTASKSCONTEXT/TOPICSACCURACY All expected of an educated NS All subjects Accepted as an educated NS Tailor language, counsel, motivate, persuade, negotiate Wide range of professional needs Extensive, precise, and appropriate Support opinions, hypothesize, explain, deal with unfamiliar topics Practical, abstract, special interests Narrate, describe, give directions Concrete, real- world, factual Intelligible even if not used to dealing with non-NS Errors never interfere with communication & rarely disturb Q & A, create with the languageEveryday survival Intelligible with effort or practice Use memorized phrasesRandom Unintelligible STANAG Speaking (Summarized) as a Standard

STANAG 6001 Scale Validation Exercise Conducted at Sofia, Bulgaria 13 October 2005

Instructions On the top of a blank piece of paper, write the following information: 1.Your current work assignment: Teacher, Tester, Administrator, Other______ 2.Your first (or dominate) language: _________ 3.You do not need to write your name!

Instructions Next, write the numbers: down the left side of the paper.

Instructions You will now be shown 6 descriptions of language speaking proficiency. Each description will be labeled with a color.

Instructions Rank the descriptions according to their level of difficulty by writing their color designation next the appropriate number: 0 (easiest) = Color ? 1 (next easiest) = Color ? 2 (next easiest) = Color ? 3 (next easiest) = Color ? 4 (next easiest) = Color ? 5 (most difficult) = Color ?

Ready? The descriptions will now be presented… –One at a time, –In a random sequence, –For 15 seconds each. You will see each of the descriptors 4 times. Thank you for participating in this experiment.

STANAG 6001 Scale Validation: A Timed Exercise Without Training 74 people turned in their rankings. They marked their current work assignments as: –Administrator 49 –Teacher26 –Tester19 –Other 1

Results of the STANAG Scale Validation ( n = 74 )

The CEF can also be presented as a standard by dividing each of the descriptions into the three components of… Task(s) Conditions/Topics Accuracy expectations

CEF: OVERALL ORAL PRODUCTION (CEF, p. 58) LevelTaskContext/TopicAccuracy A1 Produce simple phrases About people and places Mainly isolated phrases A2 Give a simple description or presentation Of people, living or working conditions, daily routines, likes/dislikes, etc. A short series of simple phrases and sentences linked into a list

CEF: OVERALL ORAL PRODUCTION (CEF, p. 58) LevelTaskContext/TopicAccuracy B1 Sustain a straightforward description One of a variety of subjects within his/her field of interest Reasonably fluent, linear sequence of points

CEF: OVERALL ORAL PRODUCTION (CEF, p. 58) LevelTaskContext/TopicAccuracy B2.1 Give descriptions and presentations, expand and support ideas with subsidiary points and examples Wide range of subjects related to his/her field of interest Clear, detailed, and relevant

CEF: OVERALL ORAL PRODUCTION (CEF, p. 58) LevelTaskContext/TopicAccuracy B2.2 Give descriptions and presentations, with highlighting of significant points, and supporting detail Clear, systematically developed, appropriate, and relevant

CEF: OVERALL ORAL PRODUCTION (CEF, p. 58) LevelTaskContext/TopicAccuracy C1 Give descriptions and presentations on complex subjects, integrate sub-themes, develop particular points, round off with a conclusion Clear, detailed, appropriate

CEF: OVERALL ORAL PRODUCTION (CEF, p. 58) LevelTaskContext/TopicAccuracy C2 Produce …speech with an effective logical structure Clear, smoothly flowing well- structured which helps the recipient to notice and remember significant facts

Why are topics not specified at the higher ability levels? The CEF manual gives the answers…

Ambiguity of Expectations Three types of “proficiency” are recognized in CEF “communicative testing” (Pages 180 and 184): –“Emerging competence” in relevant situations. –Competence on tasks in a “relevant syllabus”. –“The generalisable competencies” evidenced by a candidate’s overall performance. For STANAG 6001, only the last type of generalisable competence is considered “proficiency”.

Ambiguity of Expectations CEF acknowledges a third “blended” category, between achievement and real- world proficiency, but does not label it. (p. 184.) STANAG tester training documents label this “in-between” category as “rehearsed performance” or “pro-chievement” ability to distinguish it from unrehearsed, general ability.

Some Other Examples “Table 2. Common Reference Levels: Self Assessment” (CEF p. 24) –Contains almost no accuracy statements. “Table 3. Common Reference Levels: qualitative aspects of spoken language use” (CEF pp. 28 and 29) –Contains accuracy statements not only under the column labeled “ACCURACY”, but also interwoven in the descriptions found under the columns labeled “RANGE”, “FLUENCY”, “INTERACTION”, and “COHERENCE”.

Why not combine two CEF scales to match the “standard” format? This evidently creates too many rating options for the CEF developers. However, every testing system should decide how to deal with the complexity of the interactions between two factors: –The difficulty of the Communication Tasks tested. –The varying levels of competency demonstrated by the test candidates.

Example # 1 Consider for instance, the combination of the CEF “Overall Oral Production” scale and the “General Linguistic Range” scale. (pp. 58 and 110) –The “Overall Oral Production” scale has 7 defined levels. –The “General Linguistic Range” has 9 defined levels. –The combination could yield 63 different rating combinations.

Options for Reducing Complexity Select a progressive subset of the possible combinations as major progress milestones. Conclude as the CEF does that… –It is not “practical” to “use all the scales at all levels”. (p. 192) –The test rating criteria should be linked to the learner’s textbook and defined by criteria that are appropriate to the “requirements of the assessment task concerned”. (p. 193)

The CEF Approach to Handling Language Complexity Therefore, the CEF suggests… –“Features need to be combined, renamed and reduced into a smaller set of assessment criteria appropriate to the needs of learners”. (p. 193) [Emphasis added] –Test rating criteria should be restricted to those criteria that are appropriate to the “style of the pedagogic culture concerned”. (p. 193) [Emphasis added]

Example # 2 Compare this approach with how STANAG 6001 deals with rating complexity. –6 task levels. –6 content levels. –6 accuracy levels. –The combination could yield 216 different rating combinations.

STANAG 6001 Approach to Handling Language Complexity Therefore, STANAG 6001… –Combined, renamed and reduced features into a smaller set of assessment criteria appropriate to the needs of employers. –Reduced rating complexity by aligning each task level with an appropriate level of expanding content areas, and an increasing level of accuracy that correspond to the type of tasks being tested. –Stipulated that (as with other performance standards) all of the task, condition, and accuracy statements for a given level must be satisfied before that level proficiency can be awarded.

5 LEVELTASKSCONTEXT/TOPICSACCURACY All expected of an educated NS All subjects Accepted as an educated NS Tailor language, counsel, motivate, persuade, negotiate Wide range of professional needs Extensive, precise, and appropriate Support opinions, hypothesize, explain, deal with unfamiliar topics Practical, abstract, special interests Narrate, describe, give directions Concrete, real- world, factual Intelligible even if not used to dealing with non-NS Errors never interfere with communication & rarely disturb Q & A, create with the languageEveryday survival Intelligible with effort or practice Use memorized phrasesRandom Unintelligible STANAG Speaking (Summarized) as a Standard

Technically, STANAG 6001 also adheres to the recommendations of the CEF, because it… –Is a “metasystem”. (CEF, pp ) –Has combined features to create a reduced set of assessment criteria… That match the tasks being assessed. (CEF, p. 193) With between 4 and 7 rating levels. (CEF, p. 193) –Meets the needs of “employers” by testing for generalisable, real-world proficiency. (CEF, p. 183)

STANAG 6001 Diverges from the recommendations of the CEF, because it… –Assigns ratings based on employment needs without considering “the needs of the learners concerned” or “the style of the pedagogic culture concerned.” (CEF, p. 193) –Uses criterion-referenced grading of a “task, topics, and accuracy” hierarchy – rather than a norm-referenced scalar analysis. (CEF, p. 185) –Rates ability based on one’s unrehearsed, real- world proficiency in the language being tested.

A Summary of the Major Contrasts STANAG 6001 The primary purpose is to test individuals’ general proficiency across a wide range of topics regardless of their course of study. The primary users of the information are employers and administrators. By design, STANAG 6001 is under-specified for measuring step-by-step progress within a specific curriculum. CE Framework of Reference The primary purpose is to check learners’ progress in developing communicative competence within a specific course of study. The primary users of the information are the teachers and students. By design, the CE Framework of Reference is under- specified for testing of general, real-world proficiency.

These contrasts are not a problem! No single test or testing framework can meet both the formative needs of learners and the summative needs of employers, so… –Use the CE Framework of Reference for designing curriculum-appropriate achievement and performance tests. –Use the STANAG 6001 assessment as a culminating, independent measure of graduates’ general, real-world ability.

STANAG 6001 Proficiency Scale “Emerging competency” “Competence in a relevant syllabus” STANAG 6001 focus CEF focus “General, unrehearsed, real-world proficiency”

What happens when you compare rehearsed performance ratings with unrehearsed proficiency ratings? Those who can pass an unrehearsed, general proficiency test can also pass a curriculum-based performance test. Those who can pass a rehearsed performance test may or may not be able to pass a general, unrehearsed proficiency test.

Conclusion “The solutions to our problems should be as simple as possible, but no simpler.” Albert Einstein Language tests should match the purpose for which the results will be used. –Use achievement tests for testing mastery of lessons in a textbook. –Use performance tests for checking rehearsed abilities. –Use proficiency tests for determining general, unrehearsed ability in real-world situations.