Francesco Gratton 2013 Testing in the time of crisis BILC PROFESSIONAL SEMINAR Stockholm, October 14 - 17, 2013 INNOVATIVE TEST DESIGNS AND FORMATS Lt.Col.

Presentation transcript:

Testing in the time of crisis. BILC PROFESSIONAL SEMINAR, Stockholm, October 14-17, 2013: INNOVATIVE TEST DESIGNS AND FORMATS. Lt.Col. F. Gratton

Summary: past situation (up to Sept 2013); new courses of action (COAs) adopted; a do-it-all software; proposals.

RECEPTIVE SKILLS (Listening & Reading): Stanag Proficiency Test 1.0
1: Multilevel test (levels 1 to 4)
2: Multiple-choice questions (60 for L & R)
3: No penalties for wrong answers
4: Duration: R 105' / L 90'
5: Separate sections (& levels)
6: Number of correct answers multiplied by a coefficient
7: Potential use of "F" factor

RECEPTIVE SKILLS: how STANAG levels were awarded (Stanag Proficiency Test 1.0): number of correct answers multiplied by a fixed coefficient (1.66).
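As a rough illustration of the 1.0 scoring rule above, the sketch below multiplies the number of correct answers by the fixed coefficient of 1.66. The function name and the default of 60 questions per test are assumptions made for the example, and the slide does not show how the resulting score was mapped to a STANAG level, so that step is left out.

```python
COEFFICIENT = 1.66  # fixed coefficient from the slide (written there as 1,66)


def scaled_score(correct_answers: int, total_questions: int = 60) -> float:
    """Return the scaled score: correct answers multiplied by the coefficient.

    total_questions defaults to 60, an assumption based on the
    "60 for L & R" figure on the test description slide.
    """
    if not 0 <= correct_answers <= total_questions:
        raise ValueError("correct_answers must be between 0 and total_questions")
    return round(correct_answers * COEFFICIENT, 2)


print(scaled_score(60))  # 99.6
print(scaled_score(30))  # 49.8
```

Under this rule the score depends only on the total number of correct answers, not on which sections or levels they come from.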

PRODUCTIVE SKILLS: Proficiency Test 1.0. SPEAKING (holistically assessed)
1: Functional language assessed in a global manner
2: Structured interview
3: Tailored to the candidate
4: Checks & probes
5: 1 to 2 role plays

PRODUCTIVE SKILLS: JFLT 1.0. WRITING (holistically assessed): three tasks (one for each level).

Summary: past situation; new COAs adopted; a do-it-all software; proposals.

New COAs adopted: specifications; cut-off score; joint database.

PURPOSE; ADMINISTRATION PROCEDURES; VALIDATION PROCEDURES; TEST FORMAT; LEVELS OF LINGUISTIC KNOWLEDGE; TEST CONTENT

RECEPTIVE SKILLS (Listening & Reading): Stanag Proficiency Test 1.0
1: Multilevel test (levels 1 to 4)
2: Multiple-choice questions (60 for L & R)
3: No penalties for wrong answers
4: Duration: R 105' / L 90'
5: Separate sections (& levels)
6: Number of correct answers multiplied by a coefficient
7: Potential use of "F" factor
'TWAS HIGH TIME!!!

Stanag Proficiency Test 2.0, new key factors: each section is a mini-test (L & R); plus levels; percentages.

New COAs adopted: specifications; cut-off score; joint database.

JFLT 2.0: RDS (Listening & Reading). Each section has its own table of "No. of correct answers" against "Level awarded":
Section 1: Stanag level 1 (questions 1 to 15)
Section 2: Stanag level 2 (questions 16 to 30)
Section 3: Stanag level 3 (questions 31 to 45)
Section 4: Stanag level 4 (questions 46 to 60)

EXAMPLE: Level 1 (15 questions), Level 2 (15 questions), Level 3 (15 questions); max score: 45. All candidates answer 30 questions correctly. With the old test they would all get the same score.

Table columns: CANDIDATE; CORRECT ANSWERS for LEVEL 1 (15 questions), LEVEL 2 (15 questions) and LEVEL 3 (15 questions); FINAL LEVEL. Rows: BIANCHI, ROSSI, VERDI, GIALLI, ARANCIONI.
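The point of the example can be sketched in a few lines of code: three candidates with the same total of 30 correct answers end up at different levels once each 15-question section is scored as its own mini-test. Everything beyond the section size is an assumption made for illustration: the 70% cut-off, the rule that a level counts only if every lower section is also passed, and the candidate figures themselves (they are not the values from the slide's table). Plus levels are ignored here.

```python
SECTION_SIZE = 15  # questions per section, as on the slides
CUT_OFF = 0.70     # ASSUMPTION: the slides do not state the cut-off percentage


def final_level(correct_per_section: list[int]) -> int:
    """Return the highest level whose section, and every section below it, is passed.

    ASSUMPTION: levels must be sustained from the bottom up; the slides do
    not spell out how section results combine into a final level.
    """
    level = 0
    for n_correct in correct_per_section:
        if n_correct / SECTION_SIZE < CUT_OFF:
            break  # this section is failed, so higher sections do not count
        level += 1
    return level


# Hypothetical candidates: correct answers in sections 1, 2 and 3 (30 in total each)
candidates = {
    "Candidate A": [15, 15, 0],
    "Candidate B": [15, 10, 5],
    "Candidate C": [10, 10, 10],
}

for name, sections in candidates.items():
    print(name, "total:", sum(sections), "final level:", final_level(sections))
```

With these made-up figures the totals are identical, yet the final levels come out as 2, 1 and 0, which is the distinction a single multiplied total cannot make.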

New COAs adopted: specifications; cut-off score; joint database.

JDB (JOINT DATABASE): NEW LISTENING & READING ITEMS. JOINT EFFORT: HOW NOT TO MAKE JOINT EFFORTS.

Test-writers involved: accustomed to military environment; Language Testing Seminar; qualified; norming sessions.

THE JDB FLOW: CARABINIERI, ARMY, AIR FORCE, NAVY

TIMINGS (timeline chart by working group, WG/GdL; months SEP, OCT, NOV, DEC, JAN, FEB, MAR, APR, MAY, JUN, JUL, AUG)
1st Group: Phase 1: preparation of first batch of items, sent to other group. Phase 2: testers meet for 1st revision. Phase 3: pre-testing, revision, trialling # 50 items (-30%). Phase 4: final modification, approval.
2nd Group: Phase 1: preparation of first batch of items, sent to other group. Phase 2: testers meet for 1st revision. Phase 3: pre-testing, revision, trialling # 50 items (-30%). Phase 4: final modification, approval.
3rd Group: Phase 1: preparation of first batch of items, sent to other group. Phase 2: testers meet for 1st revision. Phase 3: pre-testing, revision, trialling # 50 items (-30%).
4th Group: Phase 1: sourcing, preparation (# 120 "TTIVAZIONI"), sent to other group.
New items into JDB: February, May, August, November.

Summary: past situation; COA (specs, JDB, cut-off score); a do-it-all software; proposals.

WHAT'S THE DIFFERENCE?

PC-assessment-related terminology (term: definition; stakes)
Assessment: any systematic method of obtaining evidence (through questions) for a purpose.
Quiz: … measures for the purpose of providing feedback to the student. Stakes: low.
Survey: … to determine needs required to fulfill a defined purpose. Stakes: low.
Test: … measures knowledge for the purpose of informing the student on their current level. Stakes: medium.
Exam: … measures knowledge for the purpose of documenting the current level of knowledge. Stakes: high.

The software is used for: needs analysis (surveys); placement tests; any training activity; assessment (first-level survey, post-course, pre-certification, certification).

Create questions and organize them into tests using a Windows-based PC.

Questions and assessment definitions (on Windows PC); assessment …… via browser. The assessment definition lets you choose: time limits; feedback to test-taker; styles (template); jumps; question shuffling; instructions to test-takers.

Questions and assessment definitions (on Windows PC, via browser). Assessments … can also be created with the authoring manager by selecting previously created questions. Any question can be chosen from the database.

Create questions and organize them into tests using a Windows-based PC. Set security parameters, schedule assessments and link to other systems (Learning Management Systems). Assessments are published using any browser, secure browsers, or a PC/Mac. Result reports, CIA, graphs, gimmicks, you name it …..

Types of questions: Multiple Choice

Likert Scale (for questionnaires)

Essay Question
1. Candidate can write free text in the space provided
2. Testers will evaluate later

Summary: past situation; COA (specs, JDB, cut-off score); a do-it-all software; proposals.

Testing in the time of crisis: wide project; sharing experience & capabilities; optimization of resources; no alternative to B.A.T.

COMBINED DATABASE (CDB): a bilateral-based CDB or … a multilateral-based CDB

FLOW; TIME SCHEDULE; SPECS (1, 2, 3)

Thank you