Evaluation of Teaching Excellence, a Guide for Administrators Heather McGovern, PhD Director of the Institute for Faculty Development Associate Professor.


Evaluation of Teaching Excellence, a Guide for Administrators Heather McGovern, PhD Director of the Institute for Faculty Development Associate Professor of Writing and BASK November 2011

Candidates should provide multiple ways for teaching to be evaluated. Stockton policy states that “evidence of teaching performance should be demonstrated by a teaching portfolio, as outlined below, which should contain the following:  A self-evaluation of teaching  Student evaluations of teaching and preceptorial teaching  Peer evaluations of teaching  Other evidence of effectiveness in teaching”

Student ratings should be less than half of the evaluation of teaching. The IDEA Center “strongly recommends that additional sources of evidence be used when teaching is evaluated and that student ratings constitute only 30% to 50% of the overall evaluation of teaching.” Primary reasons:  “some components of effective teaching are best judged by peers and not students”  “it is always useful to triangulate information...”  no instrument is fully valid  no instrument is fully reliable

How student ratings align to Stockton’s definition of “excellence in teaching”:
“A thorough and current command of the subject matter, teaching techniques and methodologies of the discipline one teaches.
Sound course design and delivery in all teaching assignments…as evident in clear learning goals and expectations, content reflecting the best available scholarship or artistic practices, and teaching techniques aimed at student learning.
The ability to organize course material and to communicate this information effectively.
The development of a comprehensive syllabus for each course taught, including expectations, grading and attendance policies, and the timely provision of copies to students.
…respect for students as members of the Stockton academic community, the effective response to student questions, and the timely evaluation of and feedback to students.”
“Where appropriate, additional measures of teaching excellence are
Ability to use technology in teaching
The capacity to relate the subject matter to other fields of knowledge
Seeking opportunities outside the classroom to enhance student learning of the subject matter”
Orange: student ratings may be a valid measure. Red: student ratings should be a valid measure, and those used at Stockton elicit information. Underlined: student ratings may be one of the best sources.

Reliability and representativeness: # of classes needed for evaluation The IDEA Center “recommends using six to eight classes, not necessarily all from the same academic year, that are representative of all of an instructor’s teaching responsibilities.” In a person’s first few years at Stockton, evaluators will not be able to do what is ideal. This makes using teaching observations and other evidence of good teaching in a file even more important.

The # of student responders affects interrater reliability (consistency of student responses). IDEA reports the following median rates:
10 raters: .69 reliability
15 raters: .83 reliability
20 raters: .83 reliability
30 raters: .88 reliability
40 raters: .91 reliability
Reliability ratings below .70 are highly suspect. Starting in Fall 2010, to respond to this issue, Stockton began using a small class instrument (which gathers qualitative rather than quantitative data) for classes of fewer than 15 students, usually determined following the last day to withdraw. Which instrument is used is not a faculty option; it is determined by class size. Many faculty will continue to have unreliable data from earlier years, and even from last year, in their files.
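IDEA does not publish the formula behind these medians, but the pattern resembles what the standard Spearman-Brown projection predicts. A minimal sketch, assuming a hypothetical single-rater reliability of roughly 0.18 (that figure is my illustration, not an IDEA number):

```python
def projected_reliability(single_rater_r: float, n_raters: int) -> float:
    """Spearman-Brown projection: reliability of the mean of n ratings,
    given an assumed reliability for a single rater."""
    return (n_raters * single_rater_r) / (1 + (n_raters - 1) * single_rater_r)

# Roughly reproduces the shape of the medians above (e.g., ~.69 at 10 raters).
for n in (10, 15, 20, 30, 40):
    print(n, round(projected_reliability(0.18, n), 2))
```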

The percentage of student responders affects representativeness. Higher response rates = more representative data; lower response rates = less representative data. Online classes using the online IDEA form currently have, as a group, the lowest response rate at Stockton. A low response rate can cause a course with a high enough enrollment to use the regular IDEA to produce unreliable data.

It matters whether faculty say something is “important” or “essential.” In the IDEA Progress toward Relevant Objectives scores on page one of the report, items of minor importance do not count at all, and items that are “essential” carry twice the weight of items that are “important.” Faculty choice also reflects their philosophy of teaching for the class.
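As a rough illustration of that weighting (a sketch of the idea only, not IDEA’s published algorithm; the objectives and ratings below are hypothetical):

```python
# Weights as described above: essential counts double, important counts
# once, and items of minor importance are excluded.
weights = {"essential": 2, "important": 1, "minor": 0}

# Hypothetical average progress ratings (1-5 scale) for selected objectives.
objectives = [
    ("Gaining factual knowledge", "essential", 4.1),
    ("Learning to apply course material", "important", 3.8),
    ("Developing creative capacities", "minor", 3.2),  # does not count
]

weighted_sum = sum(weights[level] * rating for _, level, rating in objectives)
total_weight = sum(weights[level] for _, level, _ in objectives)
progress_on_relevant_objectives = weighted_sum / total_weight
print(round(progress_on_relevant_objectives, 2))  # (2*4.1 + 1*3.8) / 3 = 4.0
```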

The objectives faculty choose affect some of the summary report data. Item A on page one, and column one in the graph report, show Progress on Relevant Objectives. The “Summary Evaluation” provided on page one of the IDEA report weights Progress toward Relevant Objectives at 50% and Excellent Teacher and Excellent Course at 25% each. Data on page two report student ratings on only the items faculty selected. Data on page four report all ratings. On the small class form, students list the objectives they feel they progressed on.
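A minimal arithmetic sketch of that Summary Evaluation weighting, using hypothetical component averages:

```python
# Hypothetical five-point averages for the three components on page one.
progress_on_relevant_objectives = 4.0
excellent_teacher = 4.3
excellent_course = 3.9

# Weights as described above: 50% / 25% / 25%.
summary_evaluation = (0.50 * progress_on_relevant_objectives
                      + 0.25 * excellent_teacher
                      + 0.25 * excellent_course)
print(round(summary_evaluation, 2))  # 4.05
```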

How many objectives should faculty select? Usually, 3-5. This is a rule of thumb; faculty may have good reason to select as few as one or to select more than 5. Objectives logically related to the course are usually good choices. IDEA (and so Sonia and I) advise faculty that  it is harder for students to make progress if the class has many objectives.  Research indicates that student ratings tend to decrease when larger numbers of objectives are selected.

Myths about objectives Faculty have to choose a certain number. No. If they choose none as important or essential, then by default all will be important, which usually lowers the mean in composite scores. Faculty have to have at least one essential or at least one important. No. Any combination is ok, but if all selections are essential or important, they’ll be equally weighted.

Disciplinary codes—see row in small table on first page and columns on other pages Ideally, a faculty member’s code is as good a match to their class as possible. In most cases, matches were selected for faculty and they can check and ask for a change. Many comparisons are too general to be of much help. If “NA” appears it is because IDEA’s database is insufficient and is no fault of the faculty member.

STUDENT RATINGS, BASIC INFORMATION

Remember what the results report: student ratings reflect student perception, which is not the same as student learning and may differ from reality.

Outliers affect mean scores. Evaluators can check the statistical detail on page 4. Standard deviations of .7 are typical. Standard deviations over 1.2 indicate “unusual diversity.” If the distribution is bimodal, then the class may have contained “two types of students who are so distinctive that what ‘works’ for one group will not for the other. For example, one group may have an appropriate background for the course while the other may be under-prepared….” (IDEA) In these cases, IDEA recommends detailed item examination; there may be issues beyond instructor control.
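For evaluators who want to check this quickly, here is an illustrative sketch (the ratings are hypothetical) applying the rule of thumb above:

```python
from statistics import mean, stdev

# Hypothetical responses (1-5) to one item from a single class.
ratings = [5, 5, 5, 4, 5, 2, 1, 2, 5, 4, 1, 2]

avg, sd = mean(ratings), stdev(ratings)
print(f"mean = {avg:.2f}, standard deviation = {sd:.2f}")

# An SD near .7 is typical; above 1.2 suggests unusual diversity and is
# worth a closer look at the item-level distribution on page 4.
if sd > 1.2:
    print("Unusual diversity: check whether the distribution is bimodal.")
```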

Scores and comments can be affected by the halo effect: “the tendency of raters to form a general opinion of the person being rated and then let that opinion color all specific ratings. If the general impression is favorable, the ‘halo effect’ is positive and the individual receives higher ratings on many items than a more objective evaluation would justify. The ‘halo effect’ can also be negative; an unfavorable general impression will lead to low marks ‘across the board’, even in areas where performance is strong.”

How can you know? Look at the pattern of student responses on page 4 or on the student forms. If a form gives someone a 5 all the way down, regardless of whether a class covered a particular learning objective—halo effect! In most cases, also true with a 1 or any other number all the way down…
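A hedged sketch of that check, assuming a list of one student’s item responses (purely illustrative):

```python
# One student's hypothetical responses across all rated items.
responses = [5, 5, 5, 5, 5, 5, 5, 5, 5, 5]

# The heuristic above: an identical rating on every item, regardless of
# whether the class addressed a given objective, suggests a halo effect.
if len(set(responses)) == 1:
    print(f"Possible halo effect: every item rated {responses[0]}.")
```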

The Error of Central Tendency can affect scores “Most people have a tendency to avoid the extremes (very high and very low) in making ratings. As a result, ratings tend to pile up more toward the middle of the rating scale than might be justified. In many cases, ratings which are "somewhat below average" or "somewhat above average" may represent subdued estimates of an individual's status because of the "Error of Central Tendency.”

Things evaluators should check  The teacher selected objectives. If not, by default, all will be considered “important.” PRO scores on the first page of the report are worthless.  The objectives the teacher chose seem reasonable for the course.  The teacher discusses problematic objective choices or irregularities in the class.

IDEA compares class results to three groups (page one and two) 1) Three years of IDEA student ratings at multiple institutions 2) Classes at your institution in the most recent five years 3) Classes in the same discipline in the most recent five years where at least 400 classes with the same disciplinary code were rated

The validity of comparisons varies The validity of comparisons depends on a number of factors, including how “typical” the class is compared with classes at Stockton or with all classes in the IDEA database, and how well the class aligns with other classes that share its IDEA disciplinary code.

External factors can affect comparisons and ratings Students in required courses tend to report lower. Students in lower-level classes tend to report lower. Arts and humanities > social science > math (this pattern may reflect differences in teaching quality, the quantitative nature of some courses, both, or other factors). Race/gender/age/culture/height/physical attractiveness and more may be factors, as they are in many other areas of life.

Some external factors don’t usually affect ratings Time of day of the course Time in the term in which ratings are given (after midterm) Age of student Level of student Student GPA

We should use converted scores when making comparisons IDEA states that “Institutions that want to make judgments about teaching effectiveness on a comparative basis should use converted scores.” Converted scores are reported in the graph and lower table on page one and on page two.

Why we should use converted scores The 5-point averages of progress ratings on “Essential” or “Important” objectives vary across objectives. For instance, the average for “gaining factual knowledge” is 4.00, while that for “gaining a broader understanding and appreciation for intellectual/cultural activity” is noticeably lower. Unconverted averages therefore disadvantage “broad liberal education” objectives. Using converted averages “ensures that instructors choosing objectives where average progress ratings are relatively low will not be penalized for choosing objectives that are particularly challenging or that address complex cognitive skills.”
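A minimal sketch of why conversion helps, assuming converted scores behave like T-scores (mean 50, standard deviation 10) relative to the comparison group; IDEA’s exact norming procedure may differ, and all numbers here are hypothetical:

```python
def to_converted(raw: float, group_mean: float, group_sd: float) -> float:
    # T-score style conversion relative to the comparison group.
    return 50 + 10 * (raw - group_mean) / group_sd

# Two classes with the same raw progress rating, but objectives whose
# comparison-group averages differ: the converted scores diverge, so the
# instructor who chose the "harder" objective is not penalized.
print(round(to_converted(4.0, group_mean=4.00, group_sd=0.45), 1))  # 50.0
print(round(to_converted(4.0, group_mean=3.70, group_sd=0.45), 1))  # 56.7
```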

Norming sorts people into broad categories Scores are normed. Therefore, it is unrealistic to expect most people to score above the similar range. Statistically, 40% of people ALWAYS score in the similar range and 30% above and 30% below that range. Many teachers teach well. Therefore, the comparative standard is relatively high. Because the instrument is not perfectly valid or reliable, trying to compare scores within the five major categories IDEA provides is not recommended.

Why we should use adjusted averages in most cases Adjusted scores adjust for “student motivation, student work habits, class size, course difficulty, and student effort. Therefore, in most circumstances, the IDEA Center recommends using adjusted scores.”

How are they adjusted? “Work Habits (mean of Item 43, As a rule, I put forth more effort than other students on academic work) is generally the most potent predictor…Unless ratings are adjusted, the instructors of such classes would have an unfair advantage over colleagues with less dedicated students.”

How are they adjusted, part II “Course Motivation (mean of Item 39, I really wanted to take this course regardless of who taught it) is the second most potent predictor. …unless ratings are adjusted, the instructors of such classes would have an unfair advantage over colleagues with less motivated students.”

How are they adjusted, part III “Size of Class…is not always statistically significant; but when it was, it was always negative – the larger the class, the lower the expected rating.”

How are they adjusted, part IV “Course Difficulty, as indicated by student ratings of item 35, Difficulty of subject matter” is complicated because the instructor influences students’ perception of difficulty. Therefore, “A statistical technique was used to remove the instructor’s influence on “Difficulty” ratings in order to achieve a measure of a class’s (and often a discipline’s) inherent difficulty. Generally, if the class is perceived as difficult (after taking into account the impact of the instructor on perceived difficulty), an attenuated outcome can be expected.” Notable examples: in “Creative capacities” and “Communication skills” “high difficulty is strongly associated with low progress ratings.” In two cases, high difficulty leads to high ratings on progress toward objectives: “Factual knowledge” and “Principles and theories.”

How are they adjusted, part V “Student Effort is measured with responses to item 37, I worked harder on this course than on most courses I have taken.” Here, because the response reflects both the students’ general habits and how well the teacher motivated students, the latter is statistically removed from the ratings, leaving the fifth extraneous factor, “student effort not attributable to the instructor.” Usually, student effort is negatively related to ratings. A special case is “Classes containing an unusually large number of students who worked harder than the instructor’s approach required,” which get low progress ratings, maybe because students were unprepared for the class or lack self-confidence and so underachieve “or under-estimate their progress in a self-abasing manner.”
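The slides above describe the adjustment qualitatively. The sketch below shows the general shape of a regression-style adjustment; the factor names follow the slides, but the coefficients, baselines, and mechanics are my illustration, not IDEA’s published model:

```python
# Entirely hypothetical coefficients: positive values mean the factor tends
# to inflate unadjusted ratings, so its effect is subtracted back out.
coefficients = {
    "work_habits": 0.35,               # item 43
    "course_motivation": 0.25,         # item 39
    "class_size": -0.002,
    "difficulty": -0.15,               # item 35, instructor influence removed
    "effort_not_attributable": -0.10,  # derived from item 37
}
baseline = {"work_habits": 3.6, "course_motivation": 3.4, "class_size": 25,
            "difficulty": 3.3, "effort_not_attributable": 3.5}

def adjusted_score(raw: float, klass: dict) -> float:
    # Remove the predicted advantage (or disadvantage) relative to a
    # baseline class on the five extraneous factors.
    offset = sum(coefficients[k] * (klass[k] - baseline[k]) for k in coefficients)
    return raw - offset

# A class with unusually motivated, hard-working students sees its raw
# rating adjusted downward.
motivated_class = dict(baseline, work_habits=4.2, course_motivation=4.0)
print(round(adjusted_score(4.1, motivated_class), 2))  # 3.74
```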

A critical exception to using adjusted scores “We recommend using the unadjusted score if the average progress rating is high (for example, 4.2 or higher).” In these cases, students are so motivated and hard-working that the teacher has little opportunity to influence their progress, but “instructors should not be penalized for having success with a class of highly motivated students with good work habits.”

Bottom Line For evaluation purposes, use the higher of the two scores (adjusted or raw).
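Read together with the exception on the previous slide, the rule can be sketched as follows (a reading of these two slides, not an official IDEA formula):

```python
def score_for_evaluation(raw: float, adjusted: float) -> float:
    # Use the unadjusted score when average progress is already high
    # (e.g., 4.2 or above); otherwise use the higher of the two.
    if raw >= 4.2:
        return raw
    return max(raw, adjusted)

print(score_for_evaluation(raw=4.3, adjusted=3.9))  # 4.3
print(score_for_evaluation(raw=3.8, adjusted=4.1))  # 4.1
```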

Myths about IDEA, page 3: Effective teaching = students make progress on all 12 learning objectives. Effective teachers = teachers who employ all 20 teaching methods.

Attend to other evidence Student ratings should not be the most important element in evaluating teaching excellence, despite my focus today. Philosophy of teaching, reflection, teaching observations, and other evidence (syllabi, assignments, direct evidence of student learning, etc.) should compose more of the evidence for or against a candidate as an excellent teacher.

References
Cashin, William. “Student Ratings of Teaching: The Research Revisited.” IDEA Paper.
Cashin, William. “Student Ratings of Teaching: A Summary of the Research.” IDEA Paper.
Colman, Andrew, Norris, Claire, and Preston, Carolyn. “Comparing Rating Scales of Different Lengths: Equivalence of Scores from 5-Point and 7-Point Scales.” Psychological Reports 80.
Hoyt, Donald, and Pallett, William. “Appraising Teaching Effectiveness: Beyond Student Ratings.” IDEA Paper.
“Interpreting Adjusted Ratings of Outcomes.” 2002, updated.
Pallett, Bill. “IDEA Student Ratings of Instruction.” Stockton College, May.
“Using IDEA Results for Administrative Decision-making.”