How Accurate Are Self-Assessments of Second Language Proficiency?


How Accurate Are Self-Assessments of Second Language Proficiency? BILC Conference Vienna, Austria 14 – 19 May 2017 Ray Clifford, BILC Senior Advisor

Justifications for Using Self-Assessments
Self-assessments (such as “Can Do” statements):
- Cost less to produce than a test.
- Can be administered in less time than a test.
- Eliminate problems with cheating.
- Don’t require test security procedures.
- Increase learner motivation.
- Help develop self-regulated, autonomous learners.
“No one knows better what you know than you do.” (As opposed to exams, which test only a small sample of the learner’s language.)

Disadvantages of Self-assessment People may know what they know, but they don’t know what they don’t know. People are notoriously bad at self-assessing.

Ehrlinger, J., Johnson, K., Banner, M., Dunning, D., & Kruger, J. (2008). Why the Unskilled Are Unaware: Further Explorations of (Absent) Self-Insight Among the Incompetent. Organizational Behavior and Human Decision Processes, 105(1), 98–121. http://doi.org/10.1016/j.obhdp.2007.05.002

The Dunning-Kruger Effect Hits Language Testing Officers from Two Directions
- Test takers may have unrealistic impressions of their abilities. “But I am the best speaker of English in my unit!”
- Organizations are unaware of the impact a low level of language proficiency can have on job performance. “But s/he has to have a ‘3’ proficiency profile to be sent to this assignment, and no one else is available.”

Being pounded from both sides can make you feel like quitting!

3 Ways to Help Language Learners Rate Themselves More Accurately
1. Remind learners that STANAG 6001 measures real-world proficiency, not classroom abilities.
2. Create real-world “Can Do” statements.
3. For each “Can Do” statement, replace the traditional Yes/No response option with a range of ability options.

All of these strategies were applied in a research project completed this year by Maria M. Summers, a graduate student at Brigham Young University. Her study involved 93 learners of English (41 males; 52 females).

- New students starting their first semester at the Brigham Young University English Language Center.
- Languages spoken as first languages: Spanish (57), Chinese (13), Korean (6), Portuguese (7), Russian (2), French (3), Kazakh (2), Chechen (1), Mongolian (1).
- 40 males and 53 females with various backgrounds and at all levels of proficiency in English participated in the study.

A Student’s View of STANAG 6001, Level 3
1. High academic goals are set and learner outcomes are defined.
2. Developers include examples of the most important goals in a textbook.
3. Teachers present as much of the textbook as time allows.
4. Students are assessed on some of the content that was taught.
[Diagram labels: STANAG 6001, Level 3 (communication tasks, contexts, topics, grammar, text types, culture, pronunciation, etc.); Textbook; Teaching; Test]

How Students Should View STANAG 6001, Level 3
1. Set instructional goals and define expected learner outcomes.
2a. Course developers sample from the Level 3 real-world domain areas to create a textbook.
2b. Assessment specialists use an independent sample of the real-world Level 3 domain areas to create proficiency assessments that are not in the textbook.
3. Teachers adapt text materials to learners’ abilities, diagnose learning difficulties, adjust activities, and add supplemental materials to help students apply new knowledge and skills in achievement settings, constrained performance areas, and real-world competency scenarios.
4. Students practice, expand, and then demonstrate their unrehearsed, extemporaneous proficiency across a broad range of real-world settings that go beyond the textbook.
[Diagram labels: STANAG 6001, Level 3 (communication tasks, contexts, topics, grammar, text types, culture, pronunciation, etc.); Textbook; Teacher; Students; Test]

As a reminder, every “Can Do” statement in the research project included a preface similar to this: “How confident are you that you could do the following tasks about work [or some other real-world topic] without time to prepare or reference tools such as a dictionary?”

The project used real-world “Can Do” statements.
- [Level 0+, Work] I can list the activities I do at work every day.
- [Level 1, Work] I can talk about what my daily routine is at work.
- [Level 2, Work] I can talk about work-related challenges such as accomplishing a difficult task.
- [Level 3, Work] I can evaluate and hypothesize about potential consequences of a change in policy in the workplace.

The study had 5 possible levels of response.
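
As a rough illustration of how one such item could be assembled, the Python sketch below combines the preface wording, one of the project's work statements, and a five-option response scale. The response labels, the `CanDoItem` class, and the `render` function are hypothetical: the slides state only that there were 5 response levels and do not list them.

```python
from dataclasses import dataclass

# Preface wording quoted from the research project's "Can Do" statements.
PREFACE = (
    "How confident are you that you could do the following tasks about {topic} "
    "without time to prepare or reference tools such as a dictionary?"
)

# HYPOTHETICAL labels: the study had 5 response levels, but the slides do not list them.
RESPONSE_OPTIONS = [
    "Not at all confident",
    "Slightly confident",
    "Somewhat confident",
    "Mostly confident",
    "Completely confident",
]


@dataclass
class CanDoItem:
    statement: str
    level: str  # targeted STANAG 6001 level, e.g. "0+", "1", "2", "3"
    topic: str


def render(item: CanDoItem) -> str:
    """Build the text a learner would see for one item."""
    options = "\n".join(
        f"  {number}. {label}"
        for number, label in enumerate(RESPONSE_OPTIONS, start=1)
    )
    return f"{PREFACE.format(topic=item.topic)}\n{item.statement}\n{options}"


item = CanDoItem("I can talk about what my daily routine is at work.", "1", "work")
print(render(item))
```

The targeted level is stored with the item but left out of the rendered text, so the learner rates confidence without seeing which level the statement belongs to. That is a design assumption of this sketch, not something the slides specify.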

Results of the study
The person reliability on the self assessments was: r = 0.91, r² = 0.83
The correlation between actual ability and the self ratings was: r = 0.44, r² = 0.19
The process was 83% reliable but only 19% accurate?
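
The two percentages come from squaring each reported coefficient. A minimal Python sketch of that arithmetic, following the slide's own interpretation and using only the two r values reported above (the function name is illustrative):

```python
def shared_variance(r: float) -> float:
    """Return r squared, read on the slide as the proportion of shared variance."""
    return r * r


person_reliability_r = 0.91  # reported person reliability of the self assessments
ability_vs_self_r = 0.44     # reported correlation: actual ability vs. self ratings

print(f"reliability r^2 = {shared_variance(person_reliability_r):.2f}")
print(f"accuracy    r^2 = {shared_variance(ability_vs_self_r):.2f}")
```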

Speaking: Self Ratings versus Item Difficulty Levels
Person Reliability = .91
Learners’ self ratings were statistically reliable, but they were not accurate! Notice that the average beginning learner thought that s/he could do all of the STANAG 6001 types of speaking tasks, including all of the Level 3 tasks.
[Figure: map of learners and items. Labels: Learners’ ratings; Items with targeted levels from 0 through 3; Average self rating]
Notes: The map suggests that the items may not be functioning the way we want them to. People are ordered with the lowest ability at the bottom and the highest ability at the top; items are on the right. The highest item is 6ALF, but the students self-assessed a lot higher (over-assessed). Flipped on its side, the persons’ self-assessments show a normal distribution. Probability: a person on the same line as an item has a 50% probability of getting it right. For lower items the probability of getting it right is higher; for higher items it is lower.
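
The probability rule in these notes (50% when a person and an item sit on the same line, higher for easier items, lower for harder ones) matches the dichotomous Rasch model. That model is an assumption here, since the slides do not name the one used in the study. A minimal Python sketch with hypothetical ability and difficulty values in logits:

```python
import math


def rasch_probability(ability: float, difficulty: float) -> float:
    """Probability of success under the dichotomous Rasch model (assumed model)."""
    return 1.0 / (1.0 + math.exp(-(ability - difficulty)))


learner = 0.0  # hypothetical learner ability, in logits
for label, difficulty in [
    ("easier item", -1.0),
    ("item on the same line", 0.0),
    ("harder item", 1.0),
]:
    print(f"{label}: {rasch_probability(learner, difficulty):.2f}")
```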

Again, Dunning and Kruger were right!

Conclusion
- More than half of the learners rated themselves as Level 3 speakers.
- Self ratings are not accurate enough to use for placement or other high-stakes purposes.

Might there be a more accurate way to do self ratings? Lately I’ve been offering a different type of self assessment to students who are wondering what their level of proficiency might be. It tries to improve the accuracy of self ratings by providing more detail. Note: This method has not yet been validated, so you can help me validate it.

Copies are now being distributed. Would you like to try this new approach?

This detailed rating sheet uses the format displayed below.

Rate Yourself: What is your speaking proficiency level?
For STANAG 6001 Proficiency Levels 0 to 3

Characteristics                              | 0+ | 1 | 2 | 3
Complexity of my communication exchanges     |    |   |   |
My automaticity and skill transfer level     |    |   |   |
Accuracy of the language I produce           |    |   |   |
My self-monitoring ability                   |    |   |   |
Topics and cultural content I have mastered  |    |   |   |
My supporting knowledge                      |    |   |   |
My active vocabulary                         |    |   |   |

Read the directions and rate yourself. (Or if you’d prefer, rate a colleague.)

Ratings of a Typical “School Learner”
[Figure: completed rating sheet, with the same seven characteristics marked against STANAG 6001 Levels 0+, 1, 2, and 3]

Ratings of a Typical “Street Learner”
[Figure: completed rating sheet, with the same seven characteristics marked against STANAG 6001 Levels 0+, 1, 2, and 3]

Was your overall rating accurate?
- Did the more detailed rating approach make it easier to assign yourself a global proficiency rating?
- Did the “non-compensatory” interpretation of the detailed ratings generate a more realistic global rating?
- Might the rating process be even more accurate if the final STANAG 6001 rating were only revealed after the component ratings had been completed?
- Should the rating process be automated?
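
If the rating process were automated, one way to read “non-compensatory” is that a strength in one characteristic cannot offset a weakness in another, so the global rating is the lowest component rating. That reading is an assumption, because the slides do not spell out the rule. The Python sketch below applies it to hypothetical component ratings that are not taken from the slides.

```python
# STANAG 6001 columns on the rating sheet, ordered from lowest to highest.
LEVELS = ["0+", "1", "2", "3"]


def global_rating(component_ratings: dict[str, str]) -> str:
    """Return the lowest component level (assumed non-compensatory rule)."""
    return min(component_ratings.values(), key=LEVELS.index)


# Hypothetical self ratings for the seven characteristics on the sheet.
example = {
    "Complexity of my communication exchanges": "3",
    "My automaticity and skill transfer level": "2",
    "Accuracy of the language I produce": "1",
    "My self-monitoring ability": "2",
    "Topics and cultural content I have mastered": "2",
    "My supporting knowledge": "3",
    "My active vocabulary": "2",
}

print(global_rating(example))
```

Because the function only needs the completed component ratings, the global rating could be withheld until all seven rows are filled in, as one of the questions above suggests.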

If you would like to experiment with this alternate approach to self assessment, you are invited to do so. Thank you for your interest in the topic of self-assessment – and beware of The Dunning-Kruger Effect!