TSL 3112 – LANGUAGE ASSESSMENT BASIC TESTING TERMINOLOGY

Slides:

Advertisements

Similar presentations

Ed-D 420 Inclusion of Exceptional Learners. CAT time Learner-Centered - Learner-centered techniques focus on strategies and approaches to improve learning.

Advertisements

Assessing Student Performance

The Journey – Improving Writing Through Formative Assessment Presented By: Sarah McManus, Section Chief, Testing Policy & Operations Phyllis Blue, Middle.

Assessment Adapted from text Effective Teaching Methods Research-Based Practices by Gary D. Borich and How to Differentiate Instruction in Mixed Ability.

Test Development.

ASSESSMENT: FORMATIVE & SUMMATIVE

Types of Tests. Why do we need tests? Why do we need tests?

Chapter Fifteen Understanding and Using Standardized Tests.

Grading. Why do we grade? To communicate To tell students how they are doing To tell parents how students are doing To make students uneasy To wield power.

Assessment and Evaluation September 28, What Principles have we explored so far? Effective Teachers:  Appreciate and Understand Adolescents  Understand.

By: Michele Leslie B. David MAE-IM WIDE USAGE To identify students who may be eligible to receive special services To monitor student performance from.

Uses of Language Tests.

Principles of High Quality Assessment

Importance of Testing In Educational situations To determine the progress of students To ascertain achievement of educational objectives To make sound.

Dr. Pratibha Gupta Associate professor Deptt. of Community Medicine ELMC & H, Lucknow.

Grade 12 Subject Specific Ministry Training Sessions

Standardized Testing and California Schools’ API Scores What’s the Connection?

Assessing and Evaluating Learning

Standardized Test Scores Common Representations for Parents and Students.

Formative and Summative Assessments Secondary Language Arts PLC: Andrea Bergreen, Jo Lane Jennifer Doerner, RHS Susanne Cuatt, RHS.

Formative and Summative Assessment

Chapter 1 Assessment in Elementary and Secondary Classrooms

Chapter 14 Understanding and Using Standardized Tests Viewing recommendations for Windows: Use the Arial TrueType font and set your screen area to at least.

Chap. 3 Designing Classroom Language Tests

Bank of Performance Assessment Tasks in English

Topic 4: Formal assessment

Classroom Assessment and Grading

Classroom Assessments Checklists, Rating Scales, and Rubrics

Classroom Assessment A Practical Guide for Educators by Craig A

August 2007FFP Testing and Evaluation Techniques Chapter 7 Florida State Fire College Ocala, Florida.

Integrating Differentiated Instruction & Understanding by Design: Connecting Content and Kids by Carol Ann Tomlinson and Jay McTighe.

EDU 385 Education Assessment in the Classroom

Assessment in Education Patricia O’Sullivan Office of Educational Development UAMS.

Teaching Today: An Introduction to Education 8th edition

Assessing Student Learning

Classroom Evaluation & Grading Chapter 15. Intelligence and Achievement Intelligence and achievement are not the same Intelligence and achievement are.

Record Keeping and Using Data to Determine Report Card Markings.

TEST,MEASUREMENT AND EVALUATION

Formative and Summative Evaluation. Formative Evaluation The goal of formative assessment is to Monitor student learning Provide ongoing feedback Improve.

Common Formative Assessments for Science Monica Burgio Daigler, Erie 1 BOCES.

Classroom Assessment (1) EDU 330: Educational Psychology Daniel Moos.

Assessment and Testing

Assessment Information from multiple sources that describes a student’s level of achievement Used to make educational decisions about students Gives feedback.

Grand Island K-8 SCIENCE Common Formative Assessments for Science Monica Burgio Daigler, Erie 1 BOCES.

Georgia will lead the nation in improving student achievement. 1 Georgia Performance Standards Day 3: Assessment FOR Learning.

The Value in Formative Assessment Prepared By: Jen Ramos.

ASSESSMENT: FORMATIVE & SUMMATIVE Practices for the Classroom.

ASSESSMENT: FORMATIVE & SUMMATIVE Practices for the Classroom.

ASSESSMENT CRITERIA Jessie Johncock Mod. 2 SPE 536 October 7, 2012.

ASSESSMENTS: FORMATIVE & SUMMATIVE Practices for the Classroom.

Monitoring and Assessment Presented by: Wedad Al –Blwi Supervised by: Prof. Antar Abdellah.

AEMP Grade Level Collaborative Module 8 Office of Curriculum, Instruction, and School Support Language Acquisition Branch Academic English Mastery Program.

Evaluation and Assessment Evaluation is a broad term which involves the systematic way of gathering reliable and relevant information for the purpose.

PGCE Evaluation of Assessment Methods. Why do we assess? Diagnosis: establish entry behaviour, diagnose learning needs/difficulties. Diagnosis: establish.

Assessment Formative & Summative Dr. Anupam Banerjee Dr. Shilpa Jain.

Chapter 1 Assessment in Elementary and Secondary Classrooms

Classroom Assessments Checklists, Rating Scales, and Rubrics

ASSESSMENT: FORMATIVE & SUMMATIVE

Classroom Assessments Checklists, Rating Scales, and Rubrics

Learning About Language Assessment. Albany: Heinle & Heinle

Prepared by: Toni Joy Thurs Atayoc, RMT

ASSESSMENT: FORMATIVE & SUMMATIVE

Standards and Assessment Alternatives

Understanding and Using Standardized Tests

Exploring Assessment Options NC Teaching Standard 4

ASSESSMENT: FORMATIVE & SUMMATIVE

Unit 7: Instructional Communication and Technology

TESTING AND EVALUATION IN EDUCATION GA 3113 lecture 1

ASSESSMENT: FORMATIVE & SUMMATIVE

EDUC 2130 Quiz #10 W. Huitt.

Presentation transcript:

TSL 3112 – LANGUAGE ASSESSMENT BASIC TESTING TERMINOLOGY Norm-referenced & criterion referenced tests Formative and summative tests Objective & subjective tests

LECTURE’S OBJECTIVES Identify the different formats of tests found Distinguish different types of tests: norm-referenced and criterion-referenced tests Compare and contrast formative and summative tests Differentiate between objective and subjective tests (Main reference - Brown, H. Douglas, 2004. Language Assessment: Principles and classroom practices. )

NORM-REFERENCED & CRITERION REFERENCED TESTS

NORM-REFERENCED TESTS To rank each student with respect to the achievement of others in broad areas of knowledge. Normed using large groups of test takers. Compares one taker to another. Measure achievement, predicts future performance. Each individual is compared with other examinees and assigned a score--usually expressed as a percentile, a grade or equivalent score. Student achievement is reported for broad skill areas, although some norm-referenced tests do report student achievement in specific sub-areas.

NORM-REFERENCED TEST Measures broad skill areas sampled from a variety of textbooks, syllabi, and the judgments of curriculum experts. Each skill is, usually, tested by less than four items. Items vary in difficulty. Items are selected that discriminate between high and low achievers. If too many people get a question correct, or too many score well, then test questions are “thrown out” until they achieve a normal curve again.

CRITERION-REFERENCED TEST Criterion-referenced tests, also called mastery tests, compare a person's performance to a set of objectives. Anyone who meets the criterion can get a high score. Everyone knows what the benchmarks / objectives are and can attain mastery to meet them. It is possible for ALL the test takers to achieve 100% mastery. Measure a student against a specific set of knowledge (criterion).

CRITERION-REFERENCED TEST To determine whether each student has achieved specific skills or concepts. To find out how much students know before instruction begins and after it has finished. Measures specific skills which make up a designated curriculum. These skills are identified by teachers and curriculum experts. Each skill is expressed as an instructional objective. Each individual is compared with a preset standard for acceptable achievement. The performance of other examinees is irrelevant. Each skill is tested by at least four items in order to obtain an adequate sample of student performance and to minimize the effect of guessing. The items which test any given skill are parallel in difficulty.

NORM & CRITERION REFERENCED TESTS The following is adapted from: Popham, J. W. (1975). Educational evaluation. Englewood Cliffs, New Jersey: Prentice-Hall, Inc. Dimension Criterion-Referenced Tests Norm-Referenced Tests Purpose To determine whether each student has achieved specific skills or concepts. To find out how much students know before instruction begins and after it has finished. To rank each student with respect to the achievement of others in broad areas of knowledge. To discriminate between high and low achievers. Content Measures specific skills which make up a designated curriculum. These skills are identified by teachers and curriculum experts. Each skill is expressed as an instructional objective. Measures broad skill areas sampled from a variety of textbooks, syllabi, and the judgments of curriculum experts.

NORM & CRITERION REFERENCED TESTS Dimension Criterion-Referenced Tests Norm-Referenced Tests Item Characteris- tics Each skill is tested by at least four items in order to obtain an adequate sample of student performance and to minimize the effect of guessing. The items which test any given skill are parallel in difficulty. Each skill is usually tested by less than four items. Items vary in difficulty. Items are selected that discriminate between high and low achievers.

NORM & CRITERION REFERENCED TESTS Dimension Criterion-Referenced Tests Norm-Referenced Tests Score Interpre-tation Each individual is compared with a preset standard for acceptable achievement. The performance of other examinees is irrelevant. A student's score is usually expressed as a percentage. Student achievement is reported for individual skills. Each individual is compared with other examinees and assigned a score--usually expressed as a percentile, a grade equivalent score, or a stanine. Student achievement is reported for broad skill areas, although some norm-referenced tests do report student achievement for individual skills.

NORM & CRITERION REFERENCED TESTS Uses of Test Results for Teachers Two main ways that test results can be used by teachers: For revising instruction for entire class. For developing intervention strategies for individual students. Standardized test results have not typically been used to aid teachers in making instructional decisions. Data-driven decision making takes some practice and experience for classroom teachers.

CRITERION-REFERENCED TESTS COMPARING NORM & CRITERION-REFERENCED TESTS Criterion-referenced Mastery Basic skills Prerequisites Affective Psychomotor Grouping for instruction Norm-referenced General ability Range of ability Large groups Compares people to people-comparison groups Selecting top candidates

COMMON CHARACTERISTICS OF NRT & CRT Require a relevant and representative sample of test items Require specification of the achievement domain to be measured Use the same type of test items Use the same rules for item writing Judged by the same qualities (validity and reliability) Useful in educational measurement

ADVANTAGES AND DISADVANTAGES OF NRT They easy for instructors to use They work well in situations requiring rigid differentiation among students They are generally appropriate in large courses Disadvantages： An individual's grade is determined not only by his/her achievements, but also by the achievements of others. no indication of prerequisite knowledge for more advanced material has been mastered less appropriate for measuring affective and psychomotor objectives encourages competition and comparison scores

ADVANTAGES AND DISADVANTAGES OF CRT Students are not competing with each other Students are thus more likely to actively help each other learn. A student's grade is not influenced by the caliber of the class. Disadvantages： It is difficult to set a reasonable standard for students Most experienced faculty set criteria based on their knowledge of how students usually perform Criterion-referenced systems often become fairly similar to norm-referenced systems. absolute standards difficult to set in some areas standards tend to be arbitrary not appropriate comparison when others are valuable

FORMATIVE & SUMMATIVE TESTS

THE GARDEN ANALOGY If we think of our children as plants … Summative assessment of the plants is the process of simply measuring them. It might be interesting to compare and analyze measurements but, in themselves, these do not affect the growth of the plants. Formative assessment, on the other hand, is the equivalent of feeding and watering the plants appropriate to their needs - directly affecting their growth. Formative and summative assessment are interconnected. They seldom stand alone in construction or effect.

FORMATIVE ASSESSMENT Assessment for learning Taken at varying intervals throughout a course to provide information and feedback that will help improve the quality of student learning the quality of the course itself The purpose is: To promote further improvement of student learning during the learning process To involve students in the ongoing assessment of their own achievement Provides information on what an individual student needs To practice To have re-taught To learn next

KEY ELEMENTS OF FORMATIVE ASSESSMENT The identification by teachers & learners of learning goals, intentions or outcomes and criteria for achieving these. Rich conversations between teachers & students that continually build and go deeper. The provision of effective, timely feedback to enable students to advance their learning. The active involvement of students in their own learning. Teachers responding to identified learning needs and strengths by modifying their teaching approach(es). Black & Wiliam, 1998

BENEFITS OF FORMATIVE ASSESSMENT FOR TEACHERS (Boston, 2002) Teachers are able to determine what standards students already know and to what degree. Teachers can decide what minor modifications or major changes in instruction they need to makes so that all students can succeed in upcoming instruction and on subsequent assessments. Teachers can create appropriate lessons and activities for groups of learners or individual students. Teaching can inform students about their current progress in order to help them set goals for improvement.

BENEFITS OF FORMATIVE ASSESSMENTS FOR STUDENTS Students are more motivated to learn. Students take responsibility for their own learning. Students become users of assessment. Students learn valuable lifelong skills such as self-evaluation, self-assessment, and goal setting. Student achievement can improve from 21-41 percentile points. (marzano 2003; stiggens et. al, 2006)

SUMMATIVE ASSESSMENT Assessment of learning Generally taken by students at the end of a unit or semester to demonstrate the "sum" of what they have or have not learned. Summative assessment methods are the most traditional way of evaluating student work. "Good summative assessments--tests and other graded evaluations--must be demonstrably reliable, valid, and free of bias" (Angelo and Cross, 1993).

POSSIBLE ASSESSMENT METHODS Formative Assessment includes Questions Classroom Discussions Learning Activities Feedback Conferences Interviews Student Self-Assessment Summative Assessment Selected Response Multiple Choice True/False Matching Fill-in Extended Written Response Performance Assessment

BALANCED CLASSROOM ASSESSMENT SYSTEM FORMATIVE ASSESSMENT SUMMATIVE ASSESSMENT A tool used after instruction to measure student achievement which provides evidence of student competence or program effectiveness. students are evaluated upon completion of the work and the focus is on the final product. A process used by teachers and students during instruction that provides feedback to adjust ongoing teaching and learning to help students improve their achievement of intended instructional outcomes.

COMPARISON OF ASSESSMENTS FORMATIVE SUMMATIVE Occurs During Instruction Not Graded Process Descriptive Feedback Continuous Occurs at the end Graded Product Evaluative Feedback Periodic Sort students in rank order

COMPARISON OF ASSESSMENTS A Fine Sieve Formative assessment informs both teachers and students about student understanding at a point when timely adjustments can be made. These adjustments help to ensure students achieve targeted standards-based learning goals within a set time frame. A course sieve Summative assessments happen too far down the learning path to provide information at the classroom level and to make instructional adjustments and interventions during the learning process

OBJECTIVES & SUBJECTIVES TESTS

OBJECTIVE AND SUBJECTIVE ASSESSMENT Objective assessment is a form of questioning which has a single correct answer. Subjective assessment is a form of questioning which may have more than one current answer (or more than one way of expressing the correct answer).

OBJECTIVE TEST Objective tests include multiple choice, true-false, matching, and fill-in questions. They tend to focus more on specific facts than on general ideas and concepts Questions on a tests that only have one correct answer Objective tests require far more careful preparation than subjective tests Objective examination can be part of formative (diagnostic) and summative (final assessment) exams. Most popular objective exam is Multiple Choice Questions (MCQ). (the method of scoring is the only factor that distinguishes an objective test from a subjective test)

MCQ Advantages of multiple choice question: The ability to create a test item bank Quick grading – can be easily computer scored If written well, high reliability - only one possible answer Objective grading Wide coverage of content Can be used for mass testing Precision in providing information regarding specific skills and abilities. Students are familiar with the item type – directions are easy to understand.

MCQ Weaknesses of multiple choice question: Difficult and time consuming to construct Low validity Mainly tests recognition knowledge and recall of facts. Guessing may have considerable effect Cheating may be facilitated Sometimes skills and areas are tested because they are testable than important Places a high degree of dependence on student’s reading ability and teacher’s writing ability. It may limit beneficial washback. This technique strictly limits what can be tested.

SUBJECTIVE TEST Subjective tests include essay, short-answer, vocabulary, and take-home tests Questions on a test that have more than one correct answer. Each examiner uses his own judgment in evaluating performance and awarding marks.

SUBJECTIVE TEST Strengths: Easy to set High validity Can assess affective and interpretive aspects of language skills allow a candidate to express originality of thought allow the examiner to assess the candidate's quality of written expression. Weaknesses: Marking is time consuming Reliability is low Inter-rater as well as intra-rater variability are probable. Dependence on presentation.- good hand writing vs bad handwriting Question evasion - possible for the candidate to avoid questions in areas of the curriculum in which they are weak.

OBJECTIVE VS. SUBJECTIVE TEST short answer closed response mostly recognition, limited production difficult to write well quick and easy to grade reliable workload “up front” Subjective long answer open response emphasis on production relatively easy to write difficult to grade time-consuming inter-rater reliability not as reliable workload post test

REFERENCES Classroom Assessment: Basic Concepts. Formative vs.Summative Assessments. Retrieved October 20, 2008 from http://fcit.usf.edu/assessment/basic/basica.html Formative vs. Summative Evaluation. Retrieved October 20, 2008 from http.jan.ucc.nau.edu/edtech/etc/667/proposal/evaluation/ summative_vs_formative.htm Formative and Summative Assessment. Retrieved October 20, 2008 from http://www.krauseinnovationcenter.org/ewyl/modules/module6-3.html. Classroom Assessment: Basic Concepts. Formative vs.Summative Assessments. Retrieved October 24, 2008 from http://fcit.usf.edu/assessment/basic/basica.html Pawlas, G., Oliva, P. (2008) Supervision for Today’s Schools, Sixth Edition. New York: John Wiley and Sons

Arter, Judith, and Jay McTighe. Scoring Rubrics in the. Classroom Arter, Judith, and Jay McTighe. Scoring Rubrics in the Classroom. Thousand Oaks, CA: Corwin Press, INC., 2001. Marzano, Robert J., Debra Pickering, and Jay McTighe. Assessing Student Outcomes. Alexandria, VA: Association for Supervision and Curriculum Development, 1993. Schoenbach, Ruth, et al. Reading for Understanding, A Guide to Improving Reading in Middle and High School Classrooms. San Francisco, CA: Jossey-Bass, Inc., 1999.