검색 상세

What Should Scoring Scale Descriptors Look Like?

What Should Scoring Scale Descriptors Look Like?

초록/요약

The present study is aimed at investigating whether the specification of scoring scale descriptors affects the reliability of Korean EFL raters, and to what degree this occurs. A comparison of two scoring scales with differing levels of specification was undertaken. 32 Korean EFL secondary school teachers rated four writing samples using the two scoring scales and responded to a questionnaire asking about their perceptions of the two scales. The data were analyzed using a multi-faceted Rasch analysis. It was found that more specific scales improved inter-rater reliability. More specific scales also contributed to preventing biased rating against criteria. However, they failed to narrow the gap between raters in terms of degree of severity and to increase individual raters’ consistency in scoring different writing samples. An analysis of the raters’ responses to the survey questionnaire revealed that the more specific scoring scales were preferred as it was more useful in deciding on a score. Conversely, they were criticized for being too complex and time-consuming to apply.

more